key: cord- - t o kn authors: yu, xiaobo; song, lusheng; petritis, brianne; bian, xiaofang; wang, haoyu; viloria, jennifer; park, jin; bui, hoang; li, han; wang, jie; liu, lei; yang, liuhui; duan, hu; mcmurray, david n.; achkar, jacqueline m.; magee, mitch; qiu, ji; labaer, joshua title: multiplexed nucleic acid programmable protein arrays date: - - journal: theranostics doi: . /thno. sha: doc_id: cord_uid: t o kn rationale: cell-free protein microarrays display naturally-folded proteins based on just-in-time in situ synthesis, and have made important contributions to basic and translational research. however, the risk of spot-to-spot cross-talk from protein diffusion during expression has limited the feature density of these arrays. methods: in this work, we developed the multiplexed nucleic acid programmable protein array (m-nappa), which significantly increases the number of displayed proteins by multiplexing as many as five different gene plasmids within a printed spot. results: even when proteins of different sizes were displayed within the same feature, they were readily detected using protein-specific antibodies. protein-protein interactions and serological antibody assays using human viral proteome microarrays demonstrated that comparable hits were detected by m-nappa and non-multiplexed nappa arrays. an ultra-high density proteome microarray displaying > k proteins on a single microscope slide was produced by combining m-nappa with a photolithography-based silicon nano-well platform. finally, four new tuberculosis-related antigens in guinea pigs vaccinated with bacillus calmette-guerin (bcg) were identified with m-nappa and validated with elisa. conclusion: all data demonstrate that multiplexing features on a protein microarray offer a cost-effective fabrication approach and have the potential to facilitate high throughput translational research. protein microarrays display individual proteins at high density on a chemically-modified slide that can be tested simultaneously with high sensitivity, high specificity, and low reagent consumption. they have been widely applied in basic and translational research, such as protein interaction studies, immune profiling, vaccine development, biomarker discovery and clinical diagnostics, etc. [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] . for example, zhang et al. used a human protein microarray to better understand how arsenic, which is used in chemotherapy, disrupts cancer signaling pathways and, further, to identify potential targets of novel therapeutic treatments. of the , proteins that were screened, arsenic binding proteins were identified, which may be novel targets for cancer treatment [ ] . anderson et al. used protein ivyspring international publisher microarrays to discover a -autoantibody biomarker signature of early stage breast cancer with a sensitivity and specificity of . % and . %, respectively [ ] . by combining those autoantibodies with several protein biomarkers, provista diagnostics developed the first protein-based blood test for early breast cancer detection called videssa® breast [ ] . ayoglu et al. screened sera from multiple sclerosis (ms) patients using protein microarrays containing , purified protein fragments and then validated those results using bead-based arrays [ ] . the arrays indicated that anoctamin autoantibodies and the ms-associated hla complex drb * allele were strongly associated. additional experiments showed that anoctamin aggregates near and inside lesions within human ms brain tissue [ ] . protein microarrays can be classified into two different types, purified or cell-free, based on whether the proteins are produced in vivo or in vitro, respectively [ ] . purifying proteins is labor-intensive, requires method optimization and multiple manipulations, exhibits highly variable yields of different proteins, and may not result in naturally-folded or functional mammalian products due to expression in non-mammalian systems (e.g., e. coli, yeast). cell-free protein microarrays overcome these challenges by depositing rna or dna on the slide surface and rapidly expressing them just before an experiment (~ h) through the use of various cell-free expression systems (e.g., lysate from wheat germ, insect cells, rabbit reticulocyte and human cells). compared to purified protein microarrays, cell-free protein microarrays are more likely to produce naturally-folded mammalian proteins due to the decreased sample manipulation and use of enhanced cell extracts with native chaperone proteins. moreover, the use of nucleic acids vastly simplifies the production of custom arrays since any protein can be produced as long as the gene-of-interest is synthesized; for example, arrays can be produced that represent a specific proteome or signaling pathway [ ] [ ] [ ] [ ] . a primary disadvantage of planar-based cell-free protein microarrays is the diffusion of mrna or expressed proteins during in vitro transcription and translation (ivtt), which can then be captured by neighboring features (i.e., cross-talk). thus, the closer the features are to each other, the higher the cross-talk [ ] [ ] [ ] [ ] . planar-based cell-free protein microarrays include the protein in situ array (pisa) [ ] , dna array to protein array (dapa) [ ] , nucleic acid programmable protein array (nappa) [ , [ ] [ ] [ ] , and in situ puromycin-capture array [ ] . dapa, nappa and puromycin-capture arrays employ a probe (e.g., ni-nta or anti-tag antibody) on a microarray surface that captures the expressed recombinant proteins in situ during ivtt. of the cell-free approaches, nappa has achieved the highest densities with ~ , plasmids per slide where the distance between neighboring spots is µm) and the cross-talk is less than %. however, cross-talk is increased when the feature spacing is reduced to µm [ ] . with ~ , plasmids per slide, five nappa slides are needed to screen a proteome-scale array with over , genes [ , ] . therefore, an increase in spot density would reduce the amount of labor, time, reagents, and cost needed for large-scale proteome analyses like target discovery and validation experiments. to address this issue, angenendt et al. printed cdna and expressed the proteins in nanowells using piezoelectric dispensers [ ] . takulapalli et al. demonstrated the fabrication of high-density cell-free protein arrays by combining photolithographicallyetched silicon nanowells (n= , /slide), nappa, and a piezo-inkjet printer [ ] . here we utilized a different strategy to produce high density arrays that does not require any specialized equipment or substrates. we developed the multiplexed nucleic acid programmable protein array (m-nappa) method by combining as many as five different dna plasmids within one spot, which increases the number of displayed proteins per microarray by five-fold. we first demonstrate that multiplexed proteins are displayed on m-nappa using protein-specific antibodies. second, we compare the ability of m-nappa with non-multiplexed nappa to detect different protein-protein interactions and the serological antibody reactivity against viral proteins. next, we show the feasibility of m-nappa in performing high throughput screening for immune-dominant tuberculosis (tb) antigens through the use of an ultra-high density m-nappa tb proteome array containing four subarrays with , tb open reading frames (orfs) on one slide. using m-nappa tb protein microarrays, four new immune-dominant antigens in the sera of bcg-vaccinated guinea pigs were identified, which were then validated using elisa. finally, we propose a high throughput target discovery and verification pipeline based on the m-nappa approach. all sera samples were collected with written informed consent with the approval of institutional review boards (irb) at university of florida (gainsville, fl), arizona state university (tempe, az) and albert einstein college of medicine (bronx, ny). detailed sample information was provided in our previous work [ , ] . the sera from guinea pig tb models vaccinated with bcg were kindly provided by dr. david n. mcmurray from texas a&m college of medicine [ ] . all experiments using clinical sera samples were executed according to the declaration of helsinki. a mathematical model was built based on a two-step analysis process. the first round of screening would use multiplexed plasmids with the primary objective of identifying potential protein "hits." the second round would be non-multiplexed, in which each multiplexed "hit" from the first round would be printed separately, with the primary goals of validating and identifying specific individual hits. the total number of printed spots (n) needed for the combined two-round screening of k proteins was determined by the number of plasmids printed per spot and the anticipated hit rates (i.e., percentage of displayed proteins that will be identified as significant in the study). the probability p of an individual protein being a true hit can be estimated from previous studies of a similar nature (e.g., antibody biomarkers). the following equation assumes that p follows a bernoulli distribution and that its corresponding plasmid is randomly multiplexed where the number of different plasmids per spot is k: the number of spots needed in the first round is , the probability that a multiplexed spot would be a hit (i.e., containing at least one immune-dominant the optimal level of multiplexing of k different plasmids per spot results in the smallest n. all human and viral orf plasmids were obtained from dnasu (https://dnasu.org/), and transferred into a t -based mammalian expression vector, pant -cgst, as previously described [ , , ] . purified dna plasmids were prepared by our automated dna factory robot as previously described [ , , ] , and were normalized to , ng/μl, such that multiplexed plasmids contributed equally to the final concentration. in other words, a plasmid in a five-multiplexed spot would represent ng/μl. five ( ) different plasmids containing a different gene-of-interest were mixed with a master printing mixture containing bsa (sigma), bs cross-linker (thermo fisher scientific, il) and polyclonal α-gst antibody (thermo fisher scientific, il) [ ] , and subsequently incubated at o c for h. m-nappa and nappa were printed by the nappa protein array core (http://nappaproteinarray.org/) according to published protocols [ , , ] . the quality of printed plasmid dna on m-nappa and nappa was determined using picogreen dna staining [ ] . each m-nappa microarray was blocked with superblock solution (pierce, rockford, il) for h at °c, briefly washed with water, centrifuged at rpm for min to dry, and covered with a hybridization chamber (grace biolabs, or). the array was then incubated with μl of human in vitro transcription & translation (ivtt) solution containing human hela cell lysate, accessory proteins, reaction mixture, and nuclease-free water (thermo fisher scientific, il) for . h at °c and . h at °c to express the gst-tagged proteins-of-interest. the gst-tagged proteins were displayed on the slide surface via the polyclonal α-gst antibody that was included in the printing mixture. then, the resulting protein microarray was incubated with % (w/v) milk in xpbs with . % (v/v) tween- (pbst) for h at °c, followed by three brief washes with pbst. the protein specific antibodies were diluted with % milk-pbst at : or : , respectively, and incubated with the protein microarray for h at °c followed by a h incubation at °c with an alexa fluor labeled secondary antibody (jackson immunoresearch laboratories, pa). after washing three times with pbst, the m-nappa slides were briefly rinsed with water and dried by centrifugation ( , rpm, min). the arrays were scanned by a tecan scanner (männedorf, switzerland). after proteins were expressed on m-nappa, the resulting protein arrays were blocked with blocking buffer ( ×pbs, %tween and % bsa, ph . ) for h at o c. in parallel, the query proteins (e.g., rb , jun, fos, lida) fused to a halotag were produced by incubating ng of dna in µl human cell-free expression system (thermo fisher scientific, il) for h at o c. to screen protein-protein interactions, the protein array was incubated with unpurified rb -halo protein in human hela lysate for h at o c, and then washed with cold washing buffer (pbs, mm mgcl , . % tween , % bsa and . % dtt, ph . at o c) three times to remove unbound molecules. the arrays were consecutively incubated with a chicken anti-halo tag antibody (genetel, wi) and alexa fluor goat anti-chicken secondary antibody (jackson immunoresearch laboratories, pa) for h at o c. arrays were washed and dried with brief centrifugation at , rpm for min, and scanned as described above. the protein binding signal was quantified using array-pro analyzer (media cybernetics) software as previously reported [ , ] . for a given experiment, a tab-separated file with the interaction information was generated and loaded into the cytoscape software [ ] with an attribute file that contained signal intensities of features on m-nappa and nappa. in figures and , proteins within a multiplexed m-nappa feature and its five corresponding non-multiplexed proteins on nappa were displayed as connecting large and small nodes, respectively, with color gradients depicting signal intensities. after proteins were expressed on m-nappa, the arrays were blocked with % milk-pbst for h and then incubated with sera at : dilution in % milk-pbst for h at °c. after washing three times with pbst, the resulting arrays were incubated with alex fluor labeled anti-human igg antibody (jackson immunoresearch laboratories, pa) h at °c. the slides were washed with pbst, briefly rinsed with water, and dried by centrifugation ( , rpm, min). the fluorescent scanning was performed using a tecan scanner (männedorf, switzerland). the antibody binding event was quantified by fluorescence signal intensity using array-pro analyzer (media cybernetics) software as previously reported [ , ] . protein microarrays have been used in functional protein and antibody biomarker studies to screen for target(s)-of-interest, which are generally rare in the tested protein population ( figure b) . for example, the median hit rates (± standard deviation, sd) of studies employing protein microarrays in the past five years for screening protein function and autoantibody biomarkers (table s ) were . % ± . % and . % ± . %, respectively ( figure b) . since false positives are not uncommon during initial screens, all initial candidates require an independent verification step performed using different samples [ , , , , [ ] [ ] [ ] . considering that a two-step approach for target discovery and verification often uses hundreds to thousands of samples, the cost of such studies using full-scale arrays can be inhibitory. to decrease the cost of high throughput screening experiments, we hypothesized that the plasmid cdna encoding for different proteins could be multiplexed (by combining m different plasmids) within each feature to create a high-density array, m-nappa ( figure a) . this multiplexed array could be implemented during the initial functional screen, testing entire proteomes (p proteins) using only a fraction of the features (p/m). multiplexed hits identified during the screening step could then be de-convoluted in the subsequent verification step using the standard, non-multiplexed nappa array where each feature displays only one protein (i.e., m= ). the objectives of the second step would be to identify which proteins were responsible for the positive multiplexed signal and to verify whether the hits were real. this approach exploits the high flexibility of cell-free microarrays, in which arrays can be customized by simply re-arraying individual plasmids encoding for the multiplexed features-of-interest. the schematic illustration of how m-nappa arrays are processed is shown in figure a . using a standard pin-based arrayer, each spot on m-nappa contains plasmids encoding for different proteins-of-interest with the same fusion tag. the genes are then transcribed and translated into recombinant proteins in two hours using a cell-free expression system, and captured to the slide surface in situ via a fusion tag antibody. the optimal number of proteins to multiplex depends upon several factors, including hit frequency, cost, array space, and number of proteins. as the frequency of hits in the screen increases, more proteins will need to be tested as individual features during the verification step. taken to the extreme, if one protein per multiplexed feature were a hit (hit rate = /m), all multiplexed features would require deconvolution, making the multiplexing approach impractical. however, such a high hit rate is not reflected by data collected by numerous studies; for example, the hit rate was < % in most of our previous nappa-based screening studies with k human genes ( figure b) . we generated a mathematical model (materials and methods) to find the optimal m that would take into consideration array space and the cost of screening and verifying hits using our k protein human collection at different hit rates. in figure c , the x-axis represents the number of genes per spot (m) while the y-axis represents the number of spots or proteins that are needed for the two-step screening process. notably, when the hit rate is < % for k proteins, a relatively small number of spots would be needed for the entire study (screening + verification) with proteins multiplexed per feature in the initial screen, thus representing a good compromise between the number of initial features screened and the subsequent number of features that would be needed for deconvolution and verification. to assess the difference in transcriptional/translational efficiency as well as display competition between large and small proteins within one feature, we multiplexed proteins of varying molecular weights (mw; - kda) covering % of the size range in our human protein collection ( figure s ) on m-nappa. as indicated in figure , we prepared nappa and m-nappa slides in parallel where nappa had only one plasmid per spot and m-nappa multiplexed five plasmids per spot. after ivtt, the protein arrays were probed with eight antibodies that bound targets ranging in size from to kda (methods and figure s ). these antibodies were specific to ia- ( kda), gad ( kda), clusterin ( kda), p ( kda), fos ( kda), pp a ( kda), sfn ( kda) and bcl l ( kda). we compared the protein display between the two array types using groups of five proteins with either similar (figure a and figure b ) or varied molecular weights (figure a and figure c) , and then calculated the signal ratio of m-nappa to nappa. in both cases, all of the antibodies readily detected their corresponding antigens. for the spots with similarly-sized proteins ( kda to kda), the signal ratio of m-nappa to nappa was . ± . . for the spots containing five proteins covering a wide range of molecular weight, from kda to kda (figure c) , the binding signal ratio of m-nappa to nappa was . ± . ( figure s ) . thus, multiplexing proteins of similar size did not confer any advantage over random multiplexing. to further demonstrate that there were no biases in the expression of different proteins produced from mixed plasmids, five-plasmid mixtures containing various combinations of seven different genes (abl , ia- , gad , jun, rhou, bcl l and mt ) were co-expressed in ivtt solution and analyzed via western blot. despite a wide range of protein sizes, all proteins were expressed at similar amounts in their relevant combinations ( figure s ) . these data indicate that, although each plasmid in m-nappa is present at one-fifth the amount present in standard nappa, there was no significant difference of protein display levels between the arrays (p-value = . , paired sample t-test). in addition, background signals that resulted from non-specific antibody binding were comparable between the platforms, demonstrating that multiplexing does not result in an accumulation of background signal that could contribute to the identification of false positives ( figure s ) . therefore, we randomly mixed different gene plasmids in the following m-nappa studies. to demonstrate that the signal intensity for m-nappa was reproducible, we also tested the spot-to-spot, zone-to-zone and slide-to-slide variations by printing gene plasmids on different locations across the m-nappa and nappa slides. protein display was then examined with anti-gst antibody staining (materials and methods). the coefficient of variations (cvs) for spot-to-spot, zone-to-zone and slide-to-slide were . ± . %, . ± . % and . ± . % for m-nappa, respectively, and . ± . %, . ± . %% and . ± . %% for nappa, respectively (table s ) . we purified viral orf plasmids from ~ viruses, normalized their concentrations to , ng/µl, and printed viral nappa and m-nappa arrays in duplicate [ ] . analyses of the deposited dna and displayed protein levels indicate that most viral dna plasmids were successfully printed, expressed, and captured onto the microarrays in a reproducible manner (figure a) . for example, plasmid dna deposition across technical replicates of nappa and m-nappa had correlations (r) of . and . , respectively. the protein display correlation (r) across technical replicates of nappa and m-nappa were . and . , respectively ( figure b, figure s ). "non-spots" containing printing buffer alone without plasmid dna was used as a negative control. % and % of the spots on nappa and m-nappa viral arrays, respectively, produced signal that was at least two sds above the average signal intensity of these "non-spots" (figure s ). together with figure , the results indicate that the majority of viral proteins can be displayed on m-nappa arrays. in addition, we compared the s/b (signal to background) ratios between direct fluorescence and tyramide signal amplification (tsa) using a fluorophore-linked or hrp-conjugated anti-p antibody, respectively. using the signal from "non-spots" as background, we found that the s/b ratio of fluorescence detection using an antibody with a directly-conjugated fluorophore, the dylight rabbit anti-mouse igg, was higher (s/b ratio = ± ) than the tsa method with the hrp-labeled goat anti-mouse igg (s/b ratio = ± ). thus, directly-conjugated fluorescent secondary antibodies were used for the following assays ( figure s ). to determine whether nappa and m-nappa detect similar protein-protein interactions, both arrays were programmed to display proteins that are known to interact with the tumor suppressor protein rb . the arrays were then probed with a rb query protein fused to halotag, and interactions were detected using an anti-halotag antibody. the rb -halotag query protein bound to several targets (red arrow) on nappa and m-nappa arrays; the query also bound to diffused targets outside of each spot, which appear as a "ring" around each feature (figure a) . in figure b , we used a flower pattern diagram to depict the multiplexed proteins on m-nappa (large central circle) and the deconvoluted individual proteins on nappa (five small connecting circles) (materials and methods). the blue gradient within the spot indicates target binding to the rb query protein, whereas reactivity to the "ring" [ , ] is indicated by a red circle around the spot. using custom defined criteria, where the target-to-"non-spot" signal ratio is > and the ring score is > , we found that and hits were identified on nappa and m-nappa, respectively, out of the possible candidate target proteins ( table s , table s ). five of the hits on m-nappa were e a, hpv -e , hpv -e , hpv -e and hpv -e , which agrees with previous studies [ ] [ ] [ ] . the sixth hit on m-nappa was not detected with nappa, thus suggesting that the hit may be a false positive. to further examine the utility of m-nappa to test protein-protein interactions, additional interactions were analyzed with displayed proteins on nappa and m-nappa using halotagged-jun, -fos, and -lida queries. jun, fos and lida bound to their expected interaction partners (i.e., fos, jun and three rab family proteins, respectively) on both m-nappa and nappa arrays (figure s ) . regarding the protein interactions that were identified, the spot-to-spot and zone-to-to zone cvs were . ± . % and . ± . % for nappa, respectively, and . ± . % and . ± . % for m-nappa, respectively (table s ) . these results indicate that m-nappa can be used for preliminary high throughput (ht) screening of novel protein-protein interactions. the screen can then be followed by a verification step using deconvoluted spots via nappa to identify the specific proteins that are involved. to test whether m-nappa can be used to detect proteomic serological response, we screened ten serum samples from patients with type diabetes that had been previously characterized using nappa arrays [ ] . a dozen hits were observed with m-nappa and nappa (figure ) . forty-nine of the antigens ( . %) identified by nappa were also detected by m-nappa. four antigens, however, were detected with only one platform (i.e., two with nappa, two with m-nappa). these uncommon discrepancies may be due to variations in surface chemistry, plasmid concentration, printing or array processing. since the multiplex concept to increase feature density was successful in detecting protein-protein interactions and serological antibody responses on planar microarrays, we wanted to determine whether m-nappa could also be applied to a nano-well microarray platform. we previously increased feature number by printing plasmids into photolithography-etched silicon nano-wells to create a high-density nappa (hd-nappa) platform [ ] . hd-nappa can have as many as k features per slide, and has successfully detected antiviral antibodies in autoimmune diseases with different proteins displayed on the array in quadruplicate. these tiny wells hold only pl and use only . ng of plasmid dna. we applied the multiplex concept to hd-nappa using a mixture of plasmids encoding for ia- , gad and p proteins. we then detected their expression and display using specific antibodies; all of these proteins were readily detectable when printed as a three-plexed mixture ( figure s ) . we then multiplexed , tuberculosis (tb) orfs [ ] onto hd-nappa microarrays as four separate subarrays using three gene plasmids per well (m= ), resulting in an m-hd-nappa microarray displaying > k proteins on a single slide. this lower multiplicity was based on the mathematical model ( figure c ) that took into account that the high number of conserved proteins in endemic, non-pathogenic mycobacterial species results in a higher hit rate (~ % [ ] ). over % of the spots generated a signal that was at least sds above the background, which indicates that the vast majority of proteins were well-expressed and displayed (figure a) , with a correlation of r = . across technical replicates (figure a-c) . antibody reactivity from tb patient sera was observed with m-hd-nappa (figure d) . the technical reproducibility of these immune-dominant antigens across different m-nappa arrays using the same sera was very high, with a correlation of r = . (figure d-f) . all immune-dominant antigens identified with m-hd-nappa screening were then deconvoluted in the verification step using single protein nappa ( figure g ) and validated with rapid-elisa as previously described [ ] (figure h ). we screened the sera from guinea pigs immunized with bacillus calmette-guerin (bcg), a tb vaccine, using m-hd-nappa tb proteome microarrays. the aim of this experiment was to identify potential protective antibodies induced with bcg. the representative fluorescence images are shown in figure a . compared to the control mock sera pool using pbs buffer (n= ), four features on m-nappa arrays showed increased signals with the bcg samples (n= ) (figure b) . to deconvolute and validate those targets, we repeated the serological assay for those candidate proteins, along with two non-responsive control proteins (rv a and rv c), using rapid-elisa and the individual sera from the guinea pigs. the antibody levels of four antigens (rv c, rv , rv and rv ) in bcg-vaccinated guinea pigs were significantly higher than that of the pbs control with a p-value < . ( figure c, figure s ). according to the tuberculist database (http://tuberculist.epfl.ch/), these proteins are involved in regulation, cell wall and cell processes, and are considered to be in the proline-glutamic acid / proline-protein-glutamic acid (pe/ppe) protein families ( figure d ). a primary advantage of cell-free protein microarrays is that the arrays have a long shelf life. we compared the protein expression of m-nappa tb arrays immediately after printing and then again after months of storage at room temperature in a nitrogen atmosphere. a gst-tagged protein, detected with an anti-gst antibody, was considered to be displayed if it had a signal that was two sds above the signal of the "non-spots." over % of the proteins were displayed on new m-nappa arrays; this number, as well as the anti-gst signal intensity, did not change even after months of storage ( figure s ). nappa has been widely applied in protein-protein interactions, post-translational modifications (ptms), antibody epitope mapping and discovery of (auto) antibody biomarkers for a variety of human diseases, including markers that are currently being used in the clinic for the detection of breast cancer [ , , , , , , [ ] [ ] [ ] . due to mrna and protein diffusion during ivtt, the number of features per planar microscope slide has been limited to ~ , to minimize cross-talk to neighboring spots. the feature density limit has thus required that multiple slides be used to study large proteomes. here, we developed a new strategy, m-nappa, that significantly increases the number proteins that can be tested per slide multiple-fold. by combining five different plasmids within one feature, > k proteins can be printed on one microscope slide for ht, low cost analyses when compared to studies using one-plasmid-per-feature arrays. the multiplexed hits that are identified with m-nappa can then be deconvoluted during the subsequent verification step (figure ) . first, we constructed a mathematical model to determine the optimal level of multiplexing, which considers the number of proteins, cost, array size, and hit rate to predict the number of arrays that would be needed for a two-step screening and verification study. a survey of ht unbiased target screening studies that used protein microarrays, both in the literature and our own results using nappa, revealed that hits are rare (typically < %) (figure b) . for k proteins and a hit rate of %, the mathematical model indicated that multiplexing proteins per spot (figure c) would provide a good balance of maximizing the number of features, minimizing the number of arrays, and yielding the minimum overall workload when compared to using non-multiplexed arrays for both the screening and verification steps. second, we demonstrated that there was nothing inherent to printing plasmids as a five-plasmid mixture that prevented their expressed proteins from routine detection regardless of protein size. however, display levels of large proteins (> kd) were decreased by ± . % for gad and ± % for ia- ( figure c ). this may be because the plasmids were mixed equally together based on their masses, a requirement imposed by the printing chemistry; this would result in a lower molarity of large plasmids (i.e., large proteins) relative to small plasmids in the printing mixture. another possible reason is that larger proteins are produced more slowly than smaller proteins due to their longer mrna sequences. third, we showed that m-nappa can be used in protein-protein interaction and serological screening studies. the results from m-nappa agreed strongly with those observed with non-multiplexed nappa (figure - , table s and s , figure s ). these data indicate that m-nappa presents a labor-and cost-effective strategy to initially screen for hits. fourth, we further increased the feature density by applying this method to our previously-published nano-well platform [ ] . with m-hd-nappa, the entire tb proteome containing , genes was successfully printed on a nano-well array in quadruplicate [ ] (figure ). this generates the highest density nano-well protein microarray to date and increases the previously demonstrated content by more than five-fold [ ] . our data indicate that the multiplexing strategy has great potential value for use with different microarray platforms (figure and figure ) [ ] . finally, we evaluated the reproducibility of m-nappa arrays for protein array preparation and protein-protein interactions. we found m-nappa can be reproducibly fabricated with spot-to-spot, zone-to-zone and slide-to-slide cvs that are similar to those obtained with nappa ( table s ) . the spot-to-spot and zone-to-to zone cvs for protein-protein interactions were also similar between the two array platforms (table s ) . while the correlations within and between different m-nappa slides were good (i.e., r = . for both) (figure and figure s ), with some size adaptation, the reproducibility could eventually be further improved with the use of automation equipment like the hs pro hybridization station (tecan trading ag; männedorf, switzerland). in some ways, m-nappa resembles "natural protein" microarrays that print unpurified or partially fractionated proteins from lysates of human cells, tissues or body fluids, but in a much more controlled manner. each feature of a natural protein microarray typically represents a mixture of unknown proteins. thus, responsive hits on natural protein arrays require a challenging and time-consuming process to determine the identity of the protein responsible for the response. this may require further purification, identification by mass spectrometry and additional response testing of recombinant proteins [ , ] . in the case of m-nappa, the identities of the proteins in each mix are known in advance and the plasmids encoding for each protein are available for secondary testing. m-nappa would be useful in unbiased ht screening studies, such as protein-protein interactions, protein-dna interactions, discovery of drug binding target as well as (auto)antibody biomarkers for a variety of human diseases. however, it should be noted that there are situations in which using a non-multiplexed array format would be more appropriate. for example, nappa should be used when investigating protein functions or when the number of proteins to be screened is low. additional attributes of m-nappa should be considered as well. large, multiplexed proteins (> kda) on m-nappa are displayed at a lower level ( - %) than their non-multiplexed counterparts ( figure c ). this issue could be resolved by increasing plasmid dna concentration before printing or reducing multiplicity per spot. alternatively, since large proteins represent a small fraction of the proteome, a hybrid array containing multiplexed spots with plasmids encoding for proteins with low to moderate mws and non-multiplexed spots for large proteins (> kda) could be employed. in addition, ptms that occur during cell-free protein expression may affect the protein display or activity on m-nappa arrays [ ] . we have observed that the human expression system contains the ability to phosphorylate some proteins (data not shown); other types of ptms (e.g., glycosylation, acetylation) by the expression system are not well known or reported. in our studies, ptms did not appear to affect protein expression, protein-protein interactions, or the identification of serological antigens on m-nappa when compared to nappa (figure , figures - and figure s ). we developed a method that multiplexes five different proteins within the same feature, called m-nappa, which significantly increases array density while decreasing experimental time and cost. although we used this approach with nappa and hd-nappa, the same concept could be applied toward other microarray technologies or platforms. our results show that m-nappa identified hits in protein interaction and serum screening studies, thus highlighting its potential to be employed in high throughput proteomics studies. supplementary table s -s and supplementary figures s -s . http://www.thno.org/v p s .pdf protein analysis on a proteomic scale protein microarray technology severe acute respiratory syndrome diagnostics using a coronavirus protein microarray a burkholderia pseudomallei protein microarray reveals serodiagnostic and cross-reactive antigens protein acetylation microarray reveals that nua controls key metabolic target regulating gluconeogenesis protein microarrays for personalized medicine systematic identification of arsenic-binding proteins reveals that hexokinase- is inhibited by arsenic identification of serum biomarkers for gastric cancer diagnosis using a human proteome microarray aagatlas . : a human autoantigen database advancing translational research with next-generation protein microarrays nappa as a real new method for protein microarray generation emerging protein array technologies for proteomics protein microarray signature of autoantibody biomarkers for the early detection of breast cancer integration of serum protein biomarker and tumor associated autoantibody expression data increases the ability of a blood-based proteomic assay to identify breast cancer anoctamin identified as an autoimmune target in multiple sclerosis advancing translational research with next generation protein microarrays identification of antibody targets for tuberculosis serology using high-density nucleic acid programmable protein arrays high-throughput identification of proteins with ampylation using self-assembled human protein (nappa) microarrays plasma autoantibodies associated with basal-like breast cancers immunoproteomic profiling of anti-viral antibodies in new-onset type diabetes using protein arrays high density diffusion-free nanowell arrays single step generation of protein arrays from dna by cell-free expression and in situ immobilisation (pisa method) optimised 'on demand' protein arraying from dna by cell free expression with the 'dna to protein array' (dapa) technology cell free expression put on the spot: advances in repeatable protein arraying from dna (dapa) printing protein arrays from dna arrays next-generation high-density self-assembling functional protein arrays self-assembling protein microarrays mapping transcription factor interactome networks using halotag protein arrays protein chip fabrication by capture of nascent polypeptides copper-catalyzed azide-alkyne cycloaddition (click chemistry)-based detection of global pathogen-host ampylation on self-assembled human protein microarrays cell-free protein expression and functional assay in nanowell chip format mycobacterial membrane vesicles administered systemically in mice induce a protective immune response to surface compartments of mycobacterium tuberculosis host-pathogen interaction profiling using self-assembling human protein arrays cytoscape: a software environment for integrated models of biomolecular interaction networks mapping transcription factor interactome networks using halotag protein arrays autoantibody signature for the serologic detection of ovarian cancer dynamic antibody responses to the mycobacterium tuberculosis proteome systematic identification of interactions between host cell proteins and e oncoproteins from diverse human papillomaviruses functional studies of e proteins from different hpv types adenovirus e a, simian virus tumor antigen, and human papillomavirus e protein share the capacity to disrupt the interaction between transcription factor e f and the retinoblastoma gene product antiviral antibody profiling by high-density protein arrays a contra capture protein array platform for studying post-translationally modified auto-antigenomes exploration of panviral proteome: high-throughput cloning and functional implications in virus-host interactions serological autoantibody profiling of type diabetes by protein arrays mycobacterium tuberculosis proteome microarray for global studies of protein function and immunogenicity the identification of phosphoglycerate kinase- and histone h autoantibodies in pancreatic cancer patient serum using a natural protein microarray development of natural protein microarrays for diagnosing cancer based on an antibody response to tumor antigens we thank for dr. mark atkinson (university of florida) for providing sera samples. this work was supported by the national natural science the authors have declared that no competing interest exists. key: cord- -ebl yc authors: hoppe, sebastian; bier, frank f.; von nickisch-rosenegk, markus title: identification of antigenic proteins of the nosocomial pathogen klebsiella pneumoniae date: - - journal: plos one doi: . /journal.pone. sha: doc_id: cord_uid: ebl yc the continuous expansion of nosocomial infections around the globe has become a precarious situation. key challenges include mounting dissemination of multiple resistances to antibiotics, the easy transmission and the growing mortality rates of hospital-acquired bacterial diseases. thus, new ways to rapidly detect these infections are vital. consequently, researchers around the globe pursue innovative approaches for point-of-care devices. in many cases the specific interaction of an antigen and a corresponding antibody is pivotal. however, the knowledge about suitable antigens is lacking. the aim of this study was to identify novel antigens as specific diagnostic markers. additionally, these proteins might be aptly used for the generation of vaccines to improve current treatment options. hence, a cdna-based expression library was constructed and screened via microarrays to detect novel antigens of klebsiella pneumoniae, a prominent agent of nosocomial infections well-known for its extensive antibiotics resistance, especially by extended-spectrum beta-lactamases (esbl). after screening clones, previously unknown immunogenic proteins were identified. subsequently, each protein was expressed in full-length and its immunodominant character examined by elisa and microarray analyses. consequently, six proteins were selected for epitope mapping and three thereof possessed linear epitopes. after specificity analysis, homology survey and d structural modelling, one epitope sequence gavvalsttfa of kpn_ , an ion channel protein, was identified harboring specificity for k. pneumoniae. the remaining epitopes showed ambiguous results regarding the specificity for k. pneumoniae. the approach adopted herein has been successfully utilized to discover novel antigens of campylobacter jejuni and salmonella enterica antigens before. now, we have transferred this knowledge to the key nosocomial agent, k. pneumoniae. by identifying several novel antigens and their linear epitope sites, we have paved the way for crucial future research and applications including the design of point-of-care devices, vaccine development and serological screenings for a highly relevant nosocomial pathogen. klebsiella pneumoniae is a gram-negative, facultative anaerobic rod-shaped bacterium belonging to the family of enterobacteriaceae. it is a non-motile, lactose fermenting organism, which has been known to cause severe lung damage if aspirated. other clinical symptoms common with klebsiella pneumoniae infections encompass urinary-tract-infections (uti) and wound infection potentially causing bacteremia and septicemia [ ] . in recent years it has become one of the most persistent nosocomial agents, especially due to the increasing distribution of multiple resistances to antibiotics. the most prominent group of k. pneumoniae harboring a broad resistance spectrum incorporates the extendedspectrum beta-lactamase (esbl) expressing strains. due to their outstanding clinical relevance and occurrence as agents of nosocomial infections, it is highly desirable to rapidly detect the presence of these organisms and to find suitable measures to effectively counter any infection in the early stages [ ] . while numerous dna-based typing methods exist [ ] , these are often laborious and time-consuming. in contrast, user-friendly point-of-care devices applying antigen-antibody interactions would allow for a quick and reliable detection [ ] . nevertheless, the knowledge about suitable antigens to be incorporated into such a device is scarce. thus, we have utilized a method to quickly assess novel immunogenic proteins of k. pneumoniae, which might serve as potential targets for a diagnostic tool. recently, we have successfully employed this approach to unveil immunogenic proteins for both campylobacter jejuni [ ] and salmonella enterica [ ] . concisely, prokaryotic cdna libraries are created, fusion proteins expressed and these constructs covalently attached to microarray surfaces via the use of a halotag (promega). subsequently, the microarrays are screened using polyclonal antibodies reactive to the donor species of the cdna. therefore, this approach enables a broad and reliable screening, while reducing cross-reactivity and background to a minimum [ ] due to the highly selective and covalent binding of halotag to its specific ligand [ ] . moreover, the high specificity of said interaction renders excessive protein purification steps normally encountered in microarray-based screening applications [ ] obsolete. thus, it is a faster and more direct approach as spotting combines both the deposition of samples and enables the immediate purification by a simple washing step. in connection with the above screening approach, cdna derived expression libraries were generated to express a vast number of proteins from k. pneumoniae. these libraries offer the advantage of a smaller sample size as compared to genomic libraries. this is mainly due to genomic libraries encompassing highly truncated dna fragments as well as dna representing regions that do not encode proteins in the original organism. contrastingly, cdna libraries generated via the in-fusion smarter directional cdna library construction kit (clontech) have been known to lead to longer fragments and possess a high abundance of full-length clones [ ] . thus, the overall number of clones required for screening is substantially reduced. still, prokaryotic cdna libraries display one main disadvantage. as bacterial mrna rarely contains a poly(a)-tail [ , ] , isolation of the mrna from other rna species is tedious. while some methods exist to isolate the mrna prior to reverse transcription [ , ] , we rather chose to normalize the cdna afterwards. consequently, the entire rna of k. pneumoniae was used for reverse transcription. next, normalization was performed using a duplex-specific nuclease [ ] . this treatment has been shown to effectively reduce the highly abundant rrna derived cdna portions without implementing a bias, thus altering the overall composition of the cdna in favour of the mrna derived molecules [ ] . in addition to this, ligation-independent cloning and electroporation were employed to enhance cloning efficiency [ ] . in this work, we have screened clones to detect the presence of previously unknown immunogenic proteins. in summary, we identified proteins that have not been described as immunogenic before. after further analyses and epitope mapping of several promising candidates, three proteins -a channel receptor, a putative transport protein and a hypothetical protein -revealed linear antigenic sites with varying specificity. our results offer the potential to be used for a wide array of applications including the generation of monoclonal antibodies that might be used in diagnostic examination. furthermore, several of the identified antigens or parts thereof might be suitable for vaccine development, either used in passive or active immunization. additionally, many virulence-associated factors harbor some immunogenic potential. thus, identification of novel immunogenic proteins might elucidate proteins involved in the pathogenicity and virulence of k. pneumoniae. consequently, this advances the understanding of this pathogen and illuminates new approaches to counter infections. the mean rin value of rna used for cdna generation and library construction was calculated to . . (n = ). after successful normalization, the cdna was cloned to create an expression library. of this library, different clones were screened using halolink slides. the known immunogenic proteins [ ] , outer membrane protein a ompf (uniprot/swiss-prot: a t ) and outer membrane protein a ompa (a t ), were used as positive references, whereas both dihyroorotase pyrc (a t d ) and glyceraldehye- -phosphate-dehydrogenase gapa (a t l ) served as negative references. after screening, the signal intensities of each sample were compared to the references and grouped accordingly. generally, three distinct groups were established. group i represents samples exceeding the intensities of both positive reference proteins, group ii encompasses samples ranging in between the different intensities of ompa and ompf, whereas group iii entails those samples that albeit showing higher intensities than the negative controls, are below both ompa and ompf. here, approximately % of samples belong to group i, while ii and iii contain % and % respectively. the remaining samples, %, fall into the same range as the negative protein references. consequently, clones or . % of the entire screening approach were selected for sequencing. these clones were all taken from group i. after sequencing, artificial fragments and known antigens were discarded to reveal potentially novel immunogenic proteins, see table s for a list of the initial identification via sequencing. some of the inserts were too heavily truncated to reliably identify the corresponding gene. moreover, in some cases subcloning the initially identified genes in full-length failed even after numerous attempts. therefore, those genes were removed from further characterization as the translated peptide fragments were too short to be significant. despite these limitations, potentially novel immunogenic proteins were expressed in full length and used for further characterization. the antigen candidates are summarized in table including their locus tag, protein name, length and size in kda. the difference in immunodominant behaviour was assessed by ten independent microarray and elisa analyses employing two different antibodies. in summary, table reveals the resulting mean q values and corresponding errors (n = ). a q value above one represents higher intensity signals than ompa, the used positive reference. the highest mean q value was obtained for kpn_ , a coa-linked acetaldehyde dehydrogenase and irondependent alcohol dehydrogenase; pyruvate-formate-lyase deactivase with . . . however, as this protein is highly conserved within all bacteria, it was not considered for further analyses regarding specific antigenic sites. the same holds true for kpn_ , the s ribosomal protein l methyltransferase. although a q value of . . was attained, the highly conserved nature of this protein renders it unsuitable within a diagnostic approach. contrastingly, kpn_ and kpn_ , two hypothetical proteins, were selected for future investigations as little is known about the function of these proteins. their mean q values were . . and . . respectively. moreover, kpn_ shows some homology to a known superantigen of yersinia pseudotuberculosis according to the ncbi protein cluster database [ ] . thus, the potential utilization of these proteins within a diagnostic tool seems plausible. in addition, kpn_ , a histidine triad protein, kpn_ , glucose- -phosphate uridylyltransferase, kpn_ , a nucleoside channel and receptor of phage t and colicin k and kpn_ , a putative transport protein, were selected for further investigations via epitope mapping. while kpn_ and kpn_ revealed mean q values above one with . . and . . , respectively, kpn_ and kpn_ failed to reach this level. rather, the q values attained were . . and . . . while the q values of the latter two proteins are lower compared to some of the other proteins identified, their functional descriptions and membraneassociation render them highly attractive within a diagnostic question. hence, they might be more easily accessible in a whole cell detection approach than cytoplasmic proteins. epitope mapping revealed the potential presence of linear epitopes within three of the six proteins investigated, namely kpn_ , kpn_ and kpn_ . for kpn_ seven distinct regions were identified with intensities above a.u., see figure . these comprised the peptides and , - , - , - , - , - and - . the highest mean value for these peptides was obtained for peptide with more than a.u. the positive reference, i.e. rabbit igg, reached a mean value of more than a.u., whereas the negative control mbp showed intensities of less than a.u. as the adjacent peptides are identical in all but amino acids in sequence, a consensus can easily be derived from two or more neighboring peptides. as figure reveals, only the first two peptides, llaagavvalsttfa and gavvalsttfaagaa showed some specificity during specificity control assays. here, the arrays were incubated with additional antibodies reactive to different bacterial species, namely campylobacter jejuni, staphylococcus aureus, e. coli and s. enterica. all remaining peptides display similar intensities when these antibodies were used as compared to the original k. pneumoniae antibodies. however, for the first two peptides a significant difference is observed. the mean value for peptide was approximately a.u. with k. pneumoniae antibodies and dropped to less than a.u. with the other antibodies. a similar trend is discernible for peptide , where a drop from a.u. to less than zero is visible. thus, the consensus sequence gavvalsttfa is likely a suitable linear epitope featuring specificity for k. pneumoniae. for the putative transport protein, kpn_ , three regions scored intensities above a.u. including peptides and , and as well as - , see figure . the positive reference again reached a mean value above a.u., while the negative control levelled out at less than a.u. after specificity assays, the peptides and revealed predominantly specific binding by the k. pneumoniae antibody as the mean values dropped from approximately and a.u. respectively, to less than zero for all other antibodies tested, see figure . thus, a consensus for this epitope can be derived to giafgavelfd. contrastingly, the remaining peptides revealed equal intensities independent of the antibody used. finally, for the third protein, kpn_ , one pair of adjacent peptides, namely and , achieved mean values of approximately a.u. within close proximity to the positive reference, as depicted in figure . additionally, peptide displayed the highest overall mean value with more than a.u., however neither of the two neighboring peptides ( and ) attained values of any significance. thus, the presence of a linear epitope in that region is rather unlikely. besides, as specificity assays illustrated, none of the three peptides showed specific interaction to the k. pneumoniae antibodies alone, see figure . rather, signal intensities with antibodies reactive to c. jejuni fell into the same scope. therefore, nonspecific binding to these peptides is probable. the remaining three proteins under investigation failed to disclose any linear peptide region with significant signal intensities to assume linear epitopes to exist. for a better understanding of the suitability of the identified linear epitopes, structural modelling was employed using the swiss model automated mode. for kpn_ a model was constructed based on the crystal structure of the bacterial nucleoside transporter tsx of e. coli. however, the derived model only spans residues to and as such does not contain the derived consensus sequence of the linear epitope gav-valsttfa. nevertheless, the model displays a beta barrel structure typical of outer membrane-spanning transport proteins, see figure for details. the first residues of the derived model, starting at position , are marked in orange and present a coiled region outside of the beta barrel. therefore, the likelihood of the gavvalsttfa region to reside within the barrel is slim. rather, an extension of the truncated coil seems plausible. consequently, the potential accessibility of the identified linear epitope appears high. furthermore, the predicted d structure of the first part of kpn_ is displayed in figure . contrary to kpn_ , two models were devised for kpn_ . however, only one encompasses the identified linear epitope giafgavelfd, which is highlighted in orange and is situated as part of an alpha helix and an adjacent loop. as the sequence is not enclosed by other residues or structures, good accessibility ought to be provided. in order to predict the potential specificity of the epitopes, homology analyses were performed. whereas giafgavelfd of kpn_ exhibits a broad homology throughout with all residues identical in closely related species, see figure s : homology of kpn_ , gavvalsttfa of kpn_ features four residues likely specific for k. pneumoniae within this particular sequence. the variance of the latter epitope's sequence considering closely related species to k. pneumoniae is summarized in figure . specifically, the following residues have been replaced: valine by leucine (position four), both threonine residues by serine residues (position and ) as well as alanine by threonine (position ). the influence of sequence variations on the binding capacity of the identified epitope gavvalsttfa was subsequently examined by performing an alanine scan. its results are summarized in figure . the original consensus epitope sequence shows a mean intensity of approximately a.u. alterations of the first, second, fifth, eighth and ninth residue lead to a significant drop of signal intensities. consequently, signals of less than a.u., in close proximity to the negative control mbp with a.u. are obtained. contrastingly, by altering residues three or six of the consensus sequence gavvalsttfa, i.e. replacing the first valine or leucine by alanine, the resulting mean signal intensities are significantly increased to more than a.u. for replacing valine and surpassing a.u. after changing leucine to alanine. additionally, specificity assays showed no significant signal intensity for antibodies reactive to closely related species, i.e. e. coli and s. enterica. incubation with antibodies reactive to either of those two bacterial species resulted in signal intensities in the neighbourhood of the negative control independent of sequence alterations, see figure . this was also true for gavlalsssft and saalaltssft. the screening of a cdna based expression library of k. pneumoniae has successfully identified a number of novel, previously unknown immunogenic proteins. this is well in accordance to previous achievements of this method, which we have employed for the identification of both c. jejuni [ ] and s. enterica [ ] antigens. consequently, in this current study we aimed to detect suitable proteins for the specific identification of k. pneumoniae. furthermore, identification of specific antigens might improve treatment opportunities including the development of suitable vaccines [ ] . finally, detecting proteins with yet unknown functional and structural information to be antigenic might be an indication of their potential involvement in the pathogenic nature of an organism. thus, these proteins might be suitable access points for future investigations to improve the understanding of the underlying pathogenicity and virulence of k. pneumoniae. within the proteins only a subset was selected for further analysis using epitope mapping. the rationale for these selections was based on three distinct features: the resulting q value, homology as well as functional and structural properties. while the q value mirrors a normalized intensity compared to a positive reference, it does not fully account for differences in expression levels, misfolding and other factors which might have had an influence on the binding reaction and thus on the overall intensity. still, it facilitates to rank the proteins and increases the likelihood of a given protein to be immunogenic if its q value is significantly high. however, proteins with outstanding q values were not chosen for epitope mapping by default. rather, the known homology and corresponding distribution of each protein was carefully evaluated. thus, proteins like kpn_ and kpn_ albeit scoring high q values were exempt from epitope mapping as they display a broad spectrum of homologous proteins across bacterial species. finally, the intrinsic properties of each protein, if available, were closely scrutinized. therefore, proteins with hypothetical character were predominantly chosen, as information on them is confined. in addition, all types of membrane-associated proteins deserved a better look, as these type of proteins offer a more direct route and accessibility, which might be of utmost importance in a future rapid point-ofcare device detecting whole organisms. after epitope mapping, three of the six proteins under investigation revealed sites with potential linear epitopes. still, the remaining proteins displayed no such sequences. this is, however, in accordance to the general prevalence of structural epitopes in comparison with linear epitopes in nature. in fact, approximately % of all epitopes are conformational rather than sequential [ , ] . despite these potential shortcomings of the used linear epitope mapping method, a number of intriguing linear epitopes have been identified. notably, three proteins, kpn_ , kpn_ and kpn_ , harbored sites with potential linear epitopes. further careful examination, however, excluded kpn_ from future applications as the identified epitope sequence were nonspecific for k. pneumoniae, which might be due to the conserved character of the protein within the family enterobacteriaceae. still, kpn_ is a membrane protein and upcoming investigations might help to elucidate applications using this protein either for prevention of k. pneumoniae infections or detection thereof. in contrast, the other two proteins displaying linear epitopes, kpn_ and kpn_ , indicated some specificity with the antibodies tested and two linear consensus sequences could be derived, gavvalsttfa and giafgavelfd, respectively. while the former sequence is present in kpn_ , an ion channel protein tsx, which is conserved among enterobacteriaceae, the latter sequence is part of kpn_ , a hypothetical protein with high similarity to a cation:proton antiport protein and conserved among proteobacteria. consequently, the homology of giafgavelfd (kpn_ ) is high throughout different bacterial species, which renders the sequence inapt for specific k. pneumoniae detection and other applications. in contrast, gavvalsttfa (kpn_ ) displays alterations within this sequence for bacterial species other than k. pneumoniae. as small as these alterations appear, they appear to have a crucial effect on the binding of antibodies to the target sequence and thus benefit specificity. this is well in line with the experimental results that indicate no binding by antibodies reactive to other bacteria to this sequence or any of its alterations. moreover, alanine scanning revealed a number of residues to be paramount for antibody binding. consequently, replacing the glycine (position ), alanine (position ), alanine (position ) or either threonine (positions and ) leads to a significant loss of antibody binding observable by a dramatic reduction in signal intensity. on the contrary, removing either the first valine or leucine and inserting an alanine residue as a replacement results in a significant increase in signal intensity, potentially hinting at an improved antibody binding. the reason for these effects remains nebulous; however, it is not caused by a simple change in secondary structure. this is apparent as both aavvalsttfa and gavvaasttfa, two sequences at opposite sides regarding signal intensity, are predicted to consist solely of an alpha-helix as compared to the original sequence, which is predicted to contain a beta-strand (position to ) and a helical part (position to ) by emboss garnier. furthermore, as the original sequence is fully constructed of uncharged amino acids, alterations via alanine or glycine do not change the overall charge of the peptide. consequently, the differences in signal intensity cannot be deduced to implementation or removal of charged residues. finally, changes in hydrophobicity albeit present are minimal at best and may not suffice to explain the observed characteristics. still, one has to bear in mind that the accuracy of prediction based tools is often lacking, especially for emboss garnier with an accuracy in the range of %. consequently, some of the predicted secondary structures might differ. despite some uncertainty as to the mechanistic cause of the altered peptides to behave as they did, one fact remains obvious. none of the peptides showed any significant binding to antibodies not raised against k. pneumoniae. therefore, after alanine scanning, gavvalsttfa remains a suitable candidate for specific k. pneumoniae detection or treatment. still, accessibility of the epitope is paramount to a quick diagnostic tool. thus, modelling of the d structure of the protein was performed. unfortunately, the d modelling only succeeded for the part of the protein, which does not contain said linear epitope sequence. nevertheless, the pronounced beta barrel structure of the channel protein is visible and the linear epitope sequence, albeit absent, likely to be an extension of the freely accessible n-terminal region outside the barrel structure. thus, accessibility of the epitope by an antibody might be pronounced without the need to enter the cell or channel. membrane proteins harboring a beta barrel like structure have been shown in bacteria to be exclusively found in the outer membrane [ ] . moreover, the mobile coils on the extracellular sides of these membrane proteins are pivotal for their function or interaction with other molecules. this renders the identified sequence an intriguing target for antibody-based detection. additionally, channel proteins harboring beta barrel like structures have been shown to be immunodominant in other bacterial species such as salmonella, haemophilus influenzae, e. coli, neisseria meningitides, shigella dysenteriae and chlamydia trachomatis [ ] [ ] [ ] [ ] [ ] [ ] . on the contrary, the model for kpn_ encompasses the identified consensus sequence of the linear epitope giafga-velfd, which is part of a loop between two alpha helices. the abundance of alpha helices suggests the protein to span the inner membrane [ ] . in this topological design, helices are mostly located within the membrane, notably as transmembrane domains, whereas loops are located either on the cytoplasmic or periplasmic side of the cell. when considering prediction based methods, such as s_tmhmm for topological domains [ , ] or emboss antigenic [ , ] , part of giafgavelfd is assumed to be extracellular. combined with the high flexibility and degrees of freedom of random coil structures, the likelihood for good accessibility is high rendering the sequence a potentially attractive target for whole cell detection despite its lack of specificity. furthermore, these findings support the d model and underline the accuracy of the specified structure. another key aspect in determining the accuracy of the d model prediction is a so-called z-score [ ] . the z-score for the model of kpn_ is . and . for kpn_ respectively. although the values are significantly below zero that does not inevitably indicate models of poor accuracy. in fact, low z-scores are often obtained if the protein under investigation is membraneassociated. this is mainly due to the inverse physicochemical properties of membrane proteins in comparison to soluble ones. hence, the low z-scores are more likely induced by this effect than caused by an insufficient accuracy of the models. in conclusion, we have successfully identified several novel antigens of k. pneumoniae and identified three proteins potentially harboring linear epitopes. subsequently, we achieved to identify two sequences displaying specificity during experimental investigations; however, one of these is doubtful as homology analysis has revealed it to be highly conserved among a broad spectrum of bacterial species. still, gavvalsttfa of kpn_ was identified to be specific both experimentally and has shown four residues within the eleven amino acid sequence to occur predominantly in k. pneumoniae only. thus, the likelihood for this linear epitope to be specific is high. this assumption was confirmed by alanine scanning revealing a number of pivotal residues for antibody binding. moreover, it was unearthed that neither e. coli nor s. enterica antibodies were able to bind to any of the sequences, original and modified. subsequent investigations might help to further nurture the insight into the suitability of this peptide for diagnostic and therapeutic applications. thus, monoclonal antibodies ought to be devised to be used for affinity investigations via biacore and to determine kinetics. furthermore, monoclonal antibodies could be used within a potential diagnostic tool and after validation ought to be tested with whole bacteria. if these antibodies are able to specifically detect intact k. pneumoniae cells, the resulting antibody might well be suited for integration into a point-of-care device. in a different approach, the identified epitope sequence could easily be produced in large quantity. this peptide might serve some role in serological screenings, especially if it proves to be immunodominant. consequently, antibodies against this epitope might be present in a plethora of patient sera. finally, all proteins identified here might be suitable candidates for vaccine development independent of the existence of a linear epitope, as structural epitopes might well be present and antigenicity ensured. nevertheless, additional in-depth analysis is required to determine a number of the key aspects of vaccine development prior to use. as a donor of rna the fully-sequenced strain k. pneumoniae mgh was grown on solid trypticase-soy-agar (tsa) for h at uc under aerobic conditions. for rna isolation a liquid culture was prepared by inoculating ml of brain-heart-infusion broth (bhi) with a single colony and incubated overnight at uc, rpm. this overnight culture was used to inoculate a flask containing ml fresh bhi medium. the cells were harvested h after inoculation. for initial screening a rabbit polyclonal igg antibody to k. pneumoniae (acris ap pu-n) was used. for further micro-array analyses of a subset of candidate proteins, elisa and epitope mapping this antibody was used as well as rabbit polyclonal igg antibody to k. pneumoniae (abcam ab ). the antibodies were generated with k. pneumoniae atcc serving as an immunogen. specificity assays were performed employing rabbit polyclonal igg antibody to c. jejuni (acris ap pu-n), s. aureus (fitzgerald c-cr rp), e. coli (abcam ab ) and s. enterica (abcam ab ). detection was achieved by usage of secondary antibodies. goat polyclonal antibody to rabbit igg conjugated with chromeo- (abcam ab ) for fluorescent and antibody conjugated with horseradish peroxidase (abcam ab ) for a colorimetric readout were applied where appropriate. the cells were harvested by centrifugation ( g, min) and the resulting supernatant discarded. the pellets were resuspended in fresh bhi medium. for stabilisation of the rna, ml of rnaprotect bacteria reagent (qiagen) was added to . ml of bacterial suspension and processed according to the manufacturer's instructions. lysis was performed with ml of lysis buffer ( mm tris-cl, mm edta, mg/ml lysozyme, . mau proteinase k) by pipetting and vortexing for s. after incubation, ml buffer rlt and ml % ethanol were added and the lysate applied to rneasy bacteria mini kit spin columns (qiagen) for rna isolation following the manufacturer's instructions. during this procedure an on-column dnase digest was performed using rnase-free dnase i solution (qiagen) according to the manufacturer's instructions. the isolated total rna was eluted in ml of rnase-free water and its concentration and purity analyzed by nanodrop (peqlab) measurements. the quality of isolated rna was assessed using the rna pico kit and bioanalyzer (agilent). the total rna was diluted to a working concentration of - pg/ml. the analysis was performed following manufacturer's instructions and the rna integrity number (rin) calculated by the expert software (agilent). the rin is defined to fall into a range of to , with a higher score indicating a more intact rna, whereas lower numbers are associated with degraded rna molecules [ ] . in order to use bacterial mrna as a substrate in cdna synthesis, polyadenylation was mandatory. the tailing was achieved using the poly(a) polymerase tailing kit (epicentre) following the alternate protocol offered by the manufacturer. briefly, up to mg of total rna were combined with ml poly(a) polymerase reaction buffer, ml mm atp, . ml riboguard rnase inhibitor and ml poly(a) polymerase ( u) in a total reaction volume of ml. the reaction was incubated for min at uc, terminated by the addition of ml . m edta and purified by rneasy mini kit (qiagen) following manufacturer's instructions. yield and purity were determined by nanodrop measurements. for cdna synthesis the in-fusion smarter directional cdna library construction kit (clontech) was used according to manufacturer's instructions with slight modifications. . ml total, polyadenylated rna were mixed with ml of in-fusion smarter cds primer, heated first for min at uc and then incubated for additional min at uc. after addition of . ml mastermix ( ml x first strand buffer, . ml mm dtt, ml mm dntps, ml mm smarter v oligonucleotide, . ml rnase inhibitor and ml smartscribe reverse transcriptase) the tubes were incubated for min at uc. the reaction was terminated at uc for min. for second strand cdna synthesis two ml aliquots of first strand reaction were used in long distance pcr using phusion polymerase (finnzymes). each pcr reaction was comprised as follows: ml first-strand reaction, ml rnase-free water, ml x phusion hf buffer and ml each of dntp mix ( mm), figure . specificity binding analysis of epitope peptides of kpn_ . bar chart representing the mean relative fluorescence intensities (n = ) of each peptide potentially harboring a linear epitope site after incubation with polyclonal antibodies reactive to k. pneumoniae (green) and c. jejuni (orange). none of the peptides shows a peculiar specific interaction; rather signal intensities are in the same vicinity for each peptide independent of the antibody used. this indicates mainly non-specific binding to occur. doi: . /journal.pone. .g pcr primer ii a ( mm), in-fusion smarter pcr primer ( mm) and phusion polymerase with a total reaction volume of ml. the pcr reactions were subjected to the cycling program with uc for min as initial denaturation followed by cycles of s denaturation at uc, s of primer annealing at uc and min extension at uc. for improved pcr results optimization was performed as follows; ml of the cycle experimental tube were transferred to a separate pcr tube, cycling commenced and aliquots of ml each were collected after , , , and cycles total. the different cycles were compared by gel electrophoresis and the experimental tubes subjected to additional cycles if necessary. finally, pcr reactions were purified using the qiaquick pcr purification kit (qiagen). the purity and yield of each reaction were analyzed by nanodrop measurements. normalization of double-stranded cdna was achieved with the trimmer- cdna normalization kit (evrogen) to reduce the number of cdna molecules derived from rrnas. briefly, ml of cdna (approx. ng/ml) were mixed with ml of x hybridization buffer. for the trimming reaction ml of this mixture were distributed to four different pcr tubes and overlaid with a drop of pcr-grade mineral oil. after centrifugation ( g, min), the tubes were incubated for min at uc followed by h at uc. next, pre-heated ( uc) duplex-specific nuclease (dsn) master buffer was added to each tube and incubation prolonged for min. dsn was added to the first three tubes in decreasing concentrations - u/ml, . u/ml and . u/ml -with the fourth tube receiving dsn storage buffer and no enzyme as a control reaction. the incubation prolonged for min at uc. after addition of ml dsn stop solution and subsequent incubation for min at uc, the tubes were placed on ice. the chilled reaction was diluted by addition of ml sterile, rnase-free water. for amplification of normalized cdna, ml of each reaction was used as template in pcr. each pcr reaction contained ml of template from the normalization reaction, ml nuclease-free water, ml x phusion hf buffer, ml mm dntp mix (neb), ml of each primer pcr primer ii a ( mm), in-fusion smarter pcr primer ( mm) and ml phusion polymerase. the pcr was performed with initial denaturation at uc for min and seven cycles of denaturation at uc for s, primer annealing at uc for s and extension at uc for min, respectively. for optimization, the control tube was subjected to , , , and cycles with ml aliquots taken every two cycles. the optimization samples were analyzed by gel electrophoresis ( % agarose, tae, v) and the optimal cycle number determined. the remaining three tubes were subjected to + x cycles with x being the differential of the optimized cycles to the originally performed seven cycles. after the second pcr, the experimental reactions were compared to the optimal control reaction using gel electrophoresis as above. reactions showing a successful normalization were combined and used in a third pcr reaction. after the final pcr, the reactions were purified by qiaquick pcr purification kit. a major feature of the given model is the prominent beta barrel structure that originates from the abundance of beta strands. this is a typical feature of transport and channel proteins spanning the outer bacterial membrane. contrastingly, the identified linear epitope gavvalsttfa is located at the very beginning of the protein and thus not included in the given model. however, it is likely an extension of the truncated n-terminal region marked in orange. doi: . /journal.pone. .g figure . d model of predicted structure of kpn_ . the model was predicted using the automated mode of the swiss model application by expasy (university basel). as a template the crystal structure of a na(+)/h(+) antiporter nhaa of e. coli was used. the resulting model spans residues to of the full-length protein and was subsequently dyed using the chimera . software. coils are depicted in light green, beta strands in purple and alpha helices in blue. the potential linear epitope giafgavelfd is highlighted in orange. it comprises part of an alpha helix, a connective coil and the start of the next alpha helix. doi: . /journal.pone. .g in-fusion cloning and cloning vector for cloning pfn a (promega) was used as a vector, as it features a n-terminal encoded halotag fusion protein, which allows for specific and covalent binding to a unique ligand, thus reducing background and minimizing cross-reactivity in immunoassays with halolink microarrays harboring the ligand on its surface. first, the vector needed to be linearized to be used with the in-fusion cloning technology. this was achieved by reverse pcr using ifs a for ( ttgataccactgcttttc-catggcgatcgcgttatc ) and ifs a rev ( tctcatcgtaccccgtgtttaaacgaattcgggctcg ). each reaction contained ml each of : diluted pfn a ( ng/ml) and the two primers, ml x phusion hf buffer, ml mm dntps, . ml phusion polymerase and . ml nuclease-free water to reach a total reaction volume of ml. the pcr was run using a cycle two-step program with uc denaturation for s and min extension at uc. after completion, ml of dpni ( u/ml) were added to the reaction and incubated at uc for h. the presence of a single band was checked by gel electrophoresis and the remaining reaction purified by qiaquick pcr purification kit. cloning of normalized cdna and linearized pfn a vector was performed following the manufacturer's instructions within the in-fusion smarter directional cdna library construction kit (clontech). electroporation ml of the cloning reaction were mixed with ml of electrocompetent acella e.coli cells (mobitec), a bl derivative, figure . homology of linear epitope sequence gavvalstffa of kpn_ . the sequence derived from k. pneumoniae mgh was used as a reference. identical residues are marked by dots, gaps by a horizontal dash and differences by the single letter amino acid code. seven of nine k. pneumoniae strains show identical epitope sequences, while two strains display changes in two residues. threonine replaces alanine at position , a change observed not only in these two strains but in almost all other bacteria within the list. additionally, threonine at position is substituted by either serine or phenylalanine. bacteria other than k. pneumoniae show an additional number of amino acid substitutions, most prominently leucine for valine at position and serine for threonine at position . in s.enterica the changes become more pronounced. glycine at position is replaced by serine, valine at position replaced by alanine and threonine inserted for serine at position . in some rare cases, other residues have also been substituted, e.g. valine replaces alanine at position in shigella dysenteriae. doi: . /journal.pone. .g figure . alanine scan of gavvalsttfa of kpn_ . box-whisker plot (n = ) of gavvalsttfa after alanine/glycine scanning. the box comprises % of the data, while the whiskers enclose %. the median is represented by a small horizontal line and the mean by a small rectangle. rabbit igg served as a positive reference, whereas mbp was used as a negative control. if alanine was present in the original present it was replaced by glycine, otherwise each amino acid was stepwise replaced by alanine. additionally, gavlalsssft and saalaltssft were included as they resemble sequences present in e. coli and s. enterica. switching glycine (position ), alanine (positions or ), or threonine (positions and ) to alanine or glycine, results in a significant drop in signal intensities to levels below or at the negative control. in contrast, substituting valine (position ) or leucine (position ) by alanine, leads to an increase in signal intensities to a.u. and more than a.u., respectively. note the different axis scales prior and after axis break at a.u. doi: . /journal.pone. .g and electroporated in mm cuvettes using the easyject plus electroporator (peqlab). conditions for electroporation were as follows: voltage = v, capacity = mf, resistance = v and a pulse duration of ms. the electroporated cells were added to ml of super optimal broth with catabolite expression (soc) and incubated at uc for h with shaking at rpm. afterwards, ml of the transformation reaction were plated on lysogeny broth (lb) agar containing ampicillin. for each reaction at least two plates were prepared and incubated at uc for h. a total number of clones were selected and transferred to . ml u deepwell plates (nunc) containing . ml lb-amp. the plates were incubated overnight at uc, rpm. on the next day, the deepwell plates were centrifuged, the supernatant discarded and the pellets resuspended in ml of lb-amp. a new set of u deepwell plates was prepared with ml of fresh lbamp and inoculated with ml each from the resuspended overnight cultures. the remaining ml of resuspended overnight culture were mixed with ml of sterile-filtered dmso and stored at uc. the newly inoculated plates were incubated for h at uc, rpm. afterwards, the temperature was reduced to uc, incubation continued for h and protein expression induced by addition of ml of . m b-d- -thiogalactopyranoside (iptg). incubation persisted overnight at uc, rpm. the cells were harvested by centrifugation ( g, min), the supernatant discarded and the pellets frozen at uc. after min the pellets were resuspended in ml of easylyse bacterial protein extraction solution (epicentre) and incubated for min at room temperature. dnase i was mixed with dnase reaction buffer ( mm tris-hcl, . mm mgcl , mm cacl ), added to the reaction and incubation was carried on for min at uc. the plates were centrifuged to collect cell debris for min at g. for each sample ml of lysate were transferred to microtiterplates (genetixx), which were used as reservoirs for the spotting procedure. the samples were spotted onto halolink slides (promega) using the qarray microarray spotter (molecular devices). different samples were spotted per slide with three replicate slides per screening. in total samples were screened on slides (n = ). each sample was spotted as quadruplicates with controls in two identical sets of eighteen subarrays each (total number of spots per slide ). the controls used included ht-ompa and ht-ompf as positive reference proteins as these have been described as immunodominant before. as specificity controls ht-argc and ht-pyrc were used, representing proteins without known immunodominant behaviour, thus binding of the polyclonal antibodies is not expected. in addition two different e.coli strains -acella electrocompetent cells and krx single-step figure . specificity assay of gavvalsttfa and derivatives. the bar chart represents the mean signal intensities (n = ) of gavvalsttfa and several modified peptides with single amino acid replacements incubated with antibodies reactive to k. pneumoniae (green), e. coli (orange) or s. enterica (purple). the sequences on the left represent the original epitope and modified versions displaying an increase in signal intensity for k. pneumoniae antibodies. in contrast, sequences on the right harbor modifications causing a significant drop in intensity for k. pneumoniae. rabbit igg is used as a positive reference and mbp serves as a negative control. none of the sequences tested displayed any significant signal intensity above the negative control when incubated with either e. coli or s. enterica antibodies. doi: . /journal.pone. .g competent cells (promega) -were spotted as further controls. as those two lack proteins expressed with a halotag, they are used as negative controls. after spotting of the samples, the slides were incubated for h at room temperature in a humidity chamber. next, slides were washed with pbs+ . % igepal ca- (pbsi, sigma aldrich) and dried by nitrogen flow. the well proplate module (grace biolabs) was attached to each slide. the top chamber was filled with . ml of rabbit-polyclonal antibody to k. pneumoniae (acris, mg/ml) in pbs. the bottom chamber was incubated with pbs only. after h of incubation at room temperature with gentle rocking, both chambers of each slide were washed three times with ml of pbsi. secondary antibody (goat-polyclonal to rabbit igg conjugated with chromeo- , abcam, mg/ml) was subjected to each chamber in pbs and the slides were incubated at room temperature for h in the dark under gentle rocking. finally, slides were washed three times with pbsi, the proplate modules removed and the slides dried by nitrogen flow. the slides were scanned on an axon genepix a laser scanner (molecular devices) with the following settings: nm laser, pmt gain , % laser power, lines to average , mm resolution and standard green emission filter at nm. the raw data sets of all the microarray analyses in this publication have been deposited in ncbi's gene expression omnibus [ ] and are accessible through geo series accession numbers gse , gse , gse , and gse . the median fluorescence intensity of each spot corrected by the local background (median f -b ) was used. further, relative fluorescence intensity (rfi) was calculated by subtracting the signals of the bottom chamber from the raw data signals of the top chamber to account for non-specific binding of secondary antibodies. for screening of expression libraries we used the contrast method with either argc or pyrc as specificity control to determine the contrast via the formula: with rf f i control the median of all rfis of the control used. clones harboring strong signals in microarray screening were selected to be sequenced. sequencing was performed externally by lgc genomics using ht f ( acatcggcccgggtct-gaatc ) and flxr ( cttcctttcgggctttgttag ) primers. after sequencing and identification of potentially immunodominant proteins, primers were designed to generate full-length clones for each identified gene, see table s for a list of the primers used. cloning was performed as mentioned above with slight modifications. the pfn a vector was linearized using the following primer set; a if linear for ( gtttaaac-gaattcgggctc ) and a if linear rev ( ggcgatcgcgttatcgctctg ) with pcr conditions as mentioned before. protein expression, lysis, and spotting of fulllength proteins were performed as described above. the slides were incubated for h at room temperature in a humidity chamber. for incubation with antibodies well or well proplate modules (grace biolabs) were attached to the halolink slides. processing of the slides was done similar to the original screening, however several different antibodies were used, see section antibodies. for testing of immunodominant characteristics with elisa, the crude lysate was first purified using halolink magnetic beads (promega) following the manufacturer's instructions. the proteins of interest were subsequently cleaved off by digestion with protev protease (promega) and concentration was determined by nanodrop measurements. the samples were diluted to a total protein content of mg/ml in pbs and ml of each sample was added to maxisorb plates (nunc). each sample was analyzed at least in triplicate. the elisa plate was covered with a lid and incubated overnight at uc in a humidity chamber. after five washing steps each with pbs+ . % tween- (pbst), the plates were blocked using ml % non fat dried milk in pbs per well for h. afterwards, plates were washed three times with pbst. ml of primary antibody solution (c = mg/ml) in pbs containing % non fat dried milk were applied to each well using the respective desired antibody or pbs for controls. the plates were incubated for h at room temperature and washed four times with pbst. next, ml of conjugated secondary antibody (goat polyclonal to rabbit igg conjugated with horseradish peroxidase, abcam ab , c = ng/ml) were added to each well and incubation carried on for h. finally, plates were washed once again four times with pbst and ml , , , -tetramethylbenzidine (tmb, sigma-aldrich) was added to each well for detection. after min of incubation at room temperature in the dark, the reaction was stopped by applying ml of m h so to each well. the optical density of each well was measured using the omega fluostar (bmg labtech) at a wavelength of nm. primers were designed using primer [ ] within geneious pro . . [ ] . the sequenced inserts were identified by blast [ ] . peptide sequence secondary structures were predicted using the emboss garnier [ ] algorithm and the transmembrane regions predicted by tmhmm . [ , ] . antigenic sites were predicted by emboss antigenic [ , ] . data evaluation was performed by originpro g (originlab) and microsoft excel. -dimensional structure predictions were performed using the swiss model automated mode [ ] [ ] [ ] [ ] [ ] and pdb files were visualized and analyzed by the ucsf chimera package [ ] .chimera is developed by the resource for biocomputing, visualization, and informatics at the university of california, san francisco (supported by nigms p -gm ). analysis of full-length proteins was achieved by combining the results from elisa and microarray data. hence, the rfi of each sample was calculated. next, a normalized rfi was generated by dividing the rfi of each sample by the median rfi of all the samples within an area of interest, i.e. incubation compartment. from these normalized rfis a median and standard deviation was calculated. if the median normalized rfi of the positive control was below the median normalized rfi of any of the negative references whilst taking the standard deviations into account, the test was rendered invalid. if the test passed the above criterion, the q values were calculated as follows: with rf f i sample the median of normalized rfis of the sample and rf f i pos:control the median of the normalized rfis of the positive control ompa respectively ompf. the resulting error was calculated by error propagation according to gauss. finally, incorporating all valid tests, the mean q value was determined along with its resulting error following error propagation by gauss, see table . several proteins were chosen for epitope mapping. these were the proteins encoded by kpn_ , kpn_ , kpn_ , kpn_ , kpn_ , and kpn_ . the proteins were divided into -mer oligopeptides with an overlap of amino acids in silico. the synthesis and coupling to microarray slides was performed externally by jpt peptide technologies gmbh. each peptide sequence was applied times to one slide. the slides were used with proplate -well chamber system (grace) allowing for incubation with different antibodies. first, the slides were blocked with superblock blocking buffer (thermo fischer) for h, washed five times with pbs+ . % tween- , primary antibodies applied, incubated overnight at uc with mild rocking, washed again, secondary antibodies applied for h in the dark and after a final washing procedure, dried and scanned as above. two different primary antibodies to k.pneumoniae were tested. the bottom chamber was always used as a control chamber, incubated only with secondary antibody. the peptide gavvalsttfa and modifications thereof created by substituting one amino acid by alanine/glycine were synthesized by jpt peptide technologies gmbh. these peptides in combination with two peptides showing closely related sequences, gavlalsssft and saalaltssfte, were applied times to slides. incubation procedure was performed as described above for epitope mapping the expression of the desired halotag fusion proteins was checked by sds-page. after lysis of cells, ml of each protein extract was mixed with ml of mm halotag alexa ligand. after addition of ml x tbs ( mm tris, mm nacl, ph . ) the reaction was incubated at room temperature for minutes. ml of each reaction were removed, mixed with ml of x loading buffer (fermentas) and ml dtt and heated for min at uc. the separation was performed on a mini-protean tgx gel (biorad, any kd, wells) in a protean ii xi cell chamber (biorad) for min at v. as a size reference benchmark fluorescent protein standard (life technologies) was used. fluorescence was measured in a fla- (fujifilm) with excitation at nm. figure s homology of giafgavelfd of kpn_ . the sequence derived from k. pneumoniae mgh was used as a reference. the best matches after blast analysis are shown in the figure with dots indicating identical residues. for differentiation of the sequences the ncbi accession number of the parent protein is given followed by the strain designation, if available. only three e. coli strains in lines , and feature a valine residue at the second position instead of the consensus isoleucine. consequently, this sequence is highly conserved within the enterobacteriaceae including e. coli, klebsiella, salmonella, and enterobacter among others. (pdf) table s list of sequenced clones after screening the clones were sequenced by lgc genomics using ht f and flx primers. clones that were not successfully sequenced are indicated by ''-'', clones carrying inserts too short to be reliably mapped to a gene are marked as truncated. additionally, a few inserts were detected deriving from primer concatamers. these are displayed as ''artificial''. the remaining clones are indicated by the corresponding locus tag and protein name. several clones apparently carry identical inserts, especially obvious for kpn_ or kpn_ . these were discarded from further analysis as the mapped inserts are very short and might have an artificial origin. inserts that were highly unlikely to garner new immunogenic proteins, antigens described previously, e.g. ompa, other molecules like trna and rrna were abolished from further analysis. table s primers used in this study. each primer is given with a name, its sequence in to direction and the target gene or vector. for each target f represents forward and r the reverse primer. the primers were used for cloning in the in-fusion smarter directional cdna library construction kit. (xls) manual of clinical microbiology clinical epidemiology of the global expansion of klebsiella pneumoniae carbapenemases extended-spectrum beta-lactamase producing klebsiella spp. in chicken meat and humans: a comparison of typing methods modern clinical microbiology: new challenges and solutions rapid identification of novel immunodominant proteins and characterization of a specific linear epitope of campylobacter jejuni application of a microarray-based immunoscreening for rapid identification of novel antigens of salmonella enteritidis microarray-based method for screening of immunogenic proteins from bacteria halotagbased purification of functional human kinases from mammalian cells severe acute respiratory syndrome diagnostics using a coronavirus protein microarray reverse transcriptase template switching: a smart approach for full-length cdna library construction polyadenylic acid sequences in e. coli messenger rna identification of the gene for an escherichia coli poly (a) polymerase a simple method to enrich mrna from total prokaryotic rna magnetic capturehybridization method for purification and probing of mrna for neutral protease of bacillus cereus dsn depletion is a simple method to remove selected transcripts from cdna populations duplex-specific nuclease efficiently removes rrna for prokaryotic rna-seq ligation-independent cloning of pcr products (licpcr) identification of vaccine candidate antigens of an esbl producing klebsiella pneumoniae clinical strain by immunoproteome analysis the national center for biotechnology information's protein clusters database protective efficacy of dna vaccines encoding outer membrane protein a and ompk of klebsiella pneumoniae in mice b-cell epitopes: fact and fiction x-ray crystallography of antibodies the structure of bacterial outer membrane proteins identification and characterization of ompl as a potential vaccine candidate for immune-protection against salmonellosis in mice the unique structure of haemophilus influenzae protein e reveals multiple binding sites for host factors directed evaluation of enterotoxigenic escherichia coli autotransporter proteins as putative vaccine candidates structure of the c-terminal domain of neisseria heparin binding antigen (nhba), one of the main antigens of a novel vaccine against neisseria meningitidis in vivo versus in vitro protein abundance analysis of shigella dysenteriae type reveals changes in the expression of proteins involved in virulence, stress and energy metabolism surface expression, singlechannel analysis and membrane topology of recombinant chlamydia trachomatis major outer membrane protein predicting transmembrane protein topology with a hidden markov model: application to complete genomes a hidden markov model for predicting transmembrane helices in protein sequences a semi-empirical method for prediction of antigenic determinants on protein antigens new hydrophilicity scale derived from high-performance liquid chromatography peptide retention data: correlation of predicted surface residues with antigenicity and x-ray-derived accessible sites toward the estimation of the absolute quality of individual protein structure models the rin: an rna integrity number for assigning integrity values to rna measurements gene expression omnibus: ncbi gene expression and hybridization array data repository primer on the www for general users and for biologist programmers basic local alignment search tool analysis of the accuracy and implications of simple methods for predicting the secondary structure of globular proteins the swiss-model workspace: a web-based environment for protein structure homology modelling the swiss-model repository and associated resources swiss-model: an automated protein homology-modeling server swiss-model and the swiss-pdb viewer: an environment for comparative protein modeling protein modeling by email ucsf chimera-a visualization system for exploratory research and analysis the k. pneumoniae strain mgh was a kind gift of the group of s. bereswill (department of microbiology and hygiene, charité -university medicine berlin, berlin, germany). sh is greatly indebted to martina obry for her assistance during expression library construction and clone isolation. the authors would like to thank simone aubele for technical assistance. we also gratefully acknowledge michaela schellhase for microarray printing. conceived and designed the experiments: sh mvnr. performed the experiments: sh. analyzed the data: sh ffb mvnr. contributed reagents/materials/analysis tools: sh. wrote the paper: sh ffb mvnr. key: cord- -ikni acz authors: li, zengbin; zou, zixiao; jiang, zeju; huang, xiaotian; liu, qiong title: biological function and application of picornaviral b protein: a new target for antiviral drug development date: - - journal: viruses doi: . /v sha: doc_id: cord_uid: ikni acz picornaviruses are associated with acute and chronic diseases. the clinical manifestations of infections are often mild, but infections may also lead to respiratory symptoms, gastroenteritis, myocarditis, meningitis, hepatitis, and poliomyelitis, with serious impacts on human health and economic losses in animal husbandry. thus far, research on picornaviruses has mainly focused on structural proteins such as vp , whereas the non-structural protein b, which plays vital roles in the life cycle of the viruses and exhibits a viroporin or viroporin-like activity, has been overlooked. viroporins are viral proteins containing at least one amphipathic α-helical structure, which oligomerizes to form transmembrane hydrophilic pores. in this review, we mainly summarize recent research data on the viroporin or viroporin-like activity of b proteins, which affects the biological function of the membrane, regulates cell death, and affects the host immune response. considering these mechanisms, the potential application of the b protein as a candidate target for antiviral drug development is discussed, along with research challenges and prospects toward realizing a novel treatment strategy for picornavirus infections. the picornaviridae family consists of genera and species, mainly including enterovirus, hepatovirus, cardiovirus, aphthovirus, and rhinovirus [ ] . to date, research on picornaviruses has mainly focused on enterovirus (ev) , coxsackievirus (cv), poliovirus (pv), encephalomyocarditis virus (emcv), foot-and-mouth disease virus (fmdv), human rhinovirus (hrv), and hepatitis a virus (hav). picornavirus infections can cause enormous damage in humans and animals. the ev , cva , and cva cause hand, foot, and mouth disease in millions of children in asia-pacific region each year and can cause more serious clinical symptoms such as aseptic meningitis, acute flaccid paralysis, and neurological respiratory syndrome [ ] [ ] [ ] . picornaviruses are non-enveloped spherical viruses with an icosahedral-structured viral capsid. the picornaviral genome consists of a single-stranded positive-sense rna, which is approximately . - . kilobases in length, with a highly conserved structure, including a -noncoding region , an open reading frame, a -ncr, and a -end polya tail [ ] . the -ncr contains multiple rna secondary structural elements, including the internal ribosome entry site. the open reading frame of the viral genome consists of three regions: p , p , and p . the p region is translated and processed to form the structural proteins vp , vp , vp , and vp , which compose the capsid structure of a picornavirus. the p and p regions are separately translated to the non-structural proteins a, b, and c and a, b, c, and d, respectively. the majority of related research has focused on structural proteins of picornaviruses, such as vp , whereas the importance of the non-structural protein b has been relatively overlooked. viroporins are proteins found in a variety of viruses and are generally comprised of to amino acids. each viroporin contains a highly hydrophobic domain capable of forming at least one amphipathic α-helical structure, which oligomerizes to form transmembrane hydrophilic pores [ , ] . the b protein is a crucial component of picornaviruses that exhibits viroporin or viroporin-like activity, plays a key role in the picornavirus life cycle by inducing a series of cytotoxic reactions to promote picornaviral replication and release [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] . the b protein has a highly conserved sequence, which can be exploited for viral detection [ ] [ ] [ ] , vaccine development [ ] [ ] [ ] , and rna interference [ ] [ ] [ ] [ ] [ ] . in addition, the b protein exhibits a viroporin or viroporin-like activity, and thus, targeted drugs against viroporin could potentially target b protein as a novel strategy to treat or prevent picornavirus infections. however, the detailed mechanism of action of the b protein has not been elucidated to date. therefore, here, we review the recent research data on the role of the b protein in the picornaviral life cycle and discuss its possible application in antiviral therapy. the picornaviral b protein is a relatively short molecule, containing a maximum of two predicted putative transmembrane hydrophobic helices, along with n-and c-terminal domains, which are connected by a short stretch of amino acid residues. the α-helix-turn-α-helix sequence of the b protein is the basis for forming a transmembrane pore through homo-multimerization and the major determinant of the b protein function [ , , [ ] [ ] [ ] [ ] [ ] . a computational approach has demonstrated that the ev b protein is a tetramer, and b proteins with different orientations have different activities [ , ] . the b protein belongs to the type ii family of viroporins, which can be further divided into different types according to the number and orientation of the membrane-spanning domains ( figure ). in type iia viroporins, the n-and c-termini stretch to the organelle lumen, such as in the b protein of cvb [ , , , ] , whereas the n-and c-termini of type iib viroporins face the cytoplasmic matrix, such as in pv [ , , ] and fmdv [ , , ] . in addition, the c-terminus of the hav b protein has a viroporin-like activity [ ] . picornaviral b proteins target the membrane and form pores mainly through their transmembrane regions. the protein molecules are first inserted into the membrane individually and then self-interact and homo-oligomerize to form higher-order structures, which are important for the pore-forming activity, determined by the specific sequence and structure [ , , ] . the majority of b proteins are localized to organelles, with predominant co-localization with the golgi apparatus and the endoplasmic reticulum (er) in cvb , pv, and hrv ( figure ) [ ] . the hrv and fmdv b proteins are mainly localized to the er, whereas the emcv b protein is not localized to either the golgi complex or the er [ , , ] . seggewiss et al. [ ] found that the hav b protein was not localized to the er either but was involved in the amendment of the er-golgi apparatus intermediate compartment. since the protein structure determines its ultimate function and b proteins belonging to different viroporin species show both similarities and differences in their functions, much insight can be gained from research on the same viroporin b protein and on different viroporins. picornavirus infects the host cell, and then viral gene encodes b proteins. the b protein belongs to the type ii family of viroporins, including two transmembrane hydrophobic helices which are the basis for forming a transmembrane pore that results in the changes in cell membrane permeability. in type iia viroporins, the n-and c-termini stretch to the organelle lumen, whereas the n-and c-termini of type ii b viroporins face the cytoplasmic matrix. the majority of b proteins are localized to organelles, with predominant co-localization with the golgi apparatus and the endoplasmic reticulum (er), resulting in an obvious decrease in ca + of the er and golgi complex, along with a decrease in calcium uptake by the mitochondrion, and causing an influx of extracellular ca + . the b protein can induce many cellular reactions, such as changing membrane permeability, regulating apoptosis and autophagy, and affecting host immune responses. these functions are all related to changes in ion concentrations, especially of calcium ions (ca + ). therefore, here, we mostly focus on the role of ca + in these b protein activities. a common feature of infection by animal viruses is the damage to the ion balance in host cells. the picornaviral b protein may change the membrane permeability of target cells, disturbing the ion balance, especially that of ca + , in organelles, such as the er and the golgi apparatus ( figure ) [ , , ] . the changes in membrane permeability, caused by the b protein, have also been suggested to be regulated by the content of specific membrane phospholipids [ , ] . the ca + are involved in the activation of enzymes in cells and play a crucial role in viral replication and other viral biological processes [ ] [ ] [ ] . however, the role of the b protein in ca + homeostasis remains unclear. initial studies only indicated that host cells had elevated the ca + levels, owing to the expression of the b protein [ , , , , [ ] [ ] [ ] , but the mechanism was not clarified. since then, some researchers have proposed that the decrease in the concentration of ca + stored in organelles triggers the opening of specific calcium ion channels on the plasma membrane of cells, causing an influx of extracellular ca + [ ] . this idea was supported by the findings that expression of the cvb and pv b proteins resulted in an obvious decrease in the ca + concentration in the er and golgi complex, along with a decrease in calcium uptake by the mitochondria. meanwhile, the increased ca + level in the cytoplasm was suggested to be mainly due to the influx of extracellular ca + [ , , ] . similarly, the expression of the hrv b protein was shown to decrease the ca + concentrations in the er and golgi apparatus, whereas the emcv b protein only significantly reduced the ca + concentration in the er [ ] . in contrast, other studies showed that expression of the hav and fmdv b proteins elevated the cytoplasmic ca + level but did not alter the level of stored ca + in organelles, such as the er and golgi complex [ , ] . taken together, these studies suggest that there are different mechanisms by which b proteins affect the ca + concentrations, depending on the virus type. furthermore, it is unknown whether ca + directly pass through the channel formed by the b protein. pham et al. [ ] demonstrated, using a planar lipid bilayer and liposome patch-clamp electrophysiological technique, that the rotavirus non-structural protein (nsp ) viroporin region acts as a ca + conduction channel. although there is currently no direct evidence that the b protein can directly induce the observed changes in the ca + concentration in host cells upon infection, the above-reviewed studies suggest an association, and the mechanism requires further investigation. given the importance of ca + signaling for numerous cellular processes, further studies on picornaviral b protein function should include determination of the ca + concentration, which may provide more insight into the detailed function of the b protein. in particular, the b protein may change the ca + concentration to regulate autophagy and apoptosiswhich are distinct cell death mechanisms controlled by the virus to effectively evade the host immunity, thereby promoting viral replication and release [ ] [ ] [ ] [ ] [ ] . picornaviruses can form new cytoplasmic vesicles by inducing membrane remodeling, thereby promoting their own proliferation [ , , ] . the b protein is capable of binding to the membrane and inducing target membrane remodeling to form a unique membrane structure that can serve as a viral replication site. this site, known as the viroplasm, is generated from the er to accumulate all of the cellular components required for viral replication ( figure ) [ , , [ ] [ ] [ ] . the viroplasm is also the main membrane source of autophagy [ , ] . the cvb b protein is dependent on its transmembrane hydrophobic region to induce autophagy [ ] , which may be related to alterations in membrane permeability, especially with regard to the ca + concentration. moreover, at an early stage of fmdv cell infection, the virus specifically recognizes and binds to the cell surface receptors, and the b protein rapidly upregulates the autophagy pathway, leading to punctate aggregation of a large number of autophagy marker proteins, such as the microtubule-associated protein light chain (map -lc ) [ , ] . in addition, rotavirus encodes the nsp viroporin, which releases the er-stored ca + into the cytoplasm, thereby activating the ca + /calmodulin-dependent kinase kinase-β (camkk-β) signaling pathway, leading to autophagy ( figure ) [ ] . further, cvb induces autophagy in a calpain-dependent manner, causing an accumulation of lc lipids and autophagosomes [ ] . considering the ability of the b protein to alter cellular calcium homeostasis, along with its viroporin-like activity, it is feasible that the b protein may regulate autophagy mainly by changing the ca + concentration. the b protein has also been shown to regulate apoptosis through the endogenous pathway, which can be divided into er stress and the mitochondrial pathway, providing another potential mechanism of bypassing the host immune response to facilitate infection [ , , , , ] . the ca + plays a pivotal role in er stress-dependent apoptosis by regulating the flow between the er and the mitochondria [ , ] . excessive mitochondrial uptake of ca + exerts a cytotoxic effect because a high ca + concentration can open numerous mitochondrial transition pores, increase mitochondrial permeability, and destroy the mitochondrial outer membrane; consequently, cytochrome c and other proapoptotic factors are released, leading to apoptosis ( figure ) [ , , , , ] . the cvb b protein was shown to inhibit caspase activation and cell death induced by actinomycin d and cycloheximide by regulating the intracellular ca + concentration [ , ] . additionally, the b protein of hrv induced an er stress response, accompanied by an increased expression of cleaved caspase- and ccaat-enhancer-binding protein homologous protein (chop), which might have also involved a change in the ca + level [ ] . collectively, these results suggest that the b protein may regulate apoptosis by altering calcium homeostasis. furthermore, the b protein can regulate apoptosis through the mitochondrial pathway. madan et al. [ ] showed that the pv b protein interacted with the mitochondria and altered the mitochondrial morphology, in addition to the release of cytochrome c after the pv b gene expression. cong et al. [ ] reported that the ev b protein was localized to the mitochondria and induced apoptosis by directly activating the proapoptotic b-cell lymphoma -associated x (bax) protein, without a significant uptake of ca + by the mitochondria. therefore, the activation of the mitochondrial apoptotic pathway and subsequent apoptosis, induced by the ev b protein, may not involve ca + signaling. collectively, picornaviral b proteins can induce cell death in a variety of ways, with ca + playing an important role in most of these mechanisms. figure . b protein regulates autophagy and apoptosis. the b protein induces target membrane remodeling to form the viroplasm, which is generated from the endoplasmic reticulum (er). the isolation membrane is produced by the viroplasm. activation of the ca + /calmodulin-dependent kinase kinase-β (camkk-β) signal pathway is due to an increased intracellular calcium concentration. furthermore, mitochondrion takes up ca + from the er, thereby cytochrome cis released, leading to apoptosis. the host immune system is an important line of defense against pathogens, and pathogens can affect the immune system in a variety of ways. the b protein mainly affects the host immune response through inflammasome activation and by direct antagonism of the host immune response. recognition of pathogens by the immune system is mainly mediated by pathogen-associated molecular pattern receptors, known as pattern recognition receptors, including nucleotide-binding oligomerization domain (nod)-like receptors (nlrs), retinoic acid-inducible gene-i (rig-i)-like helicases, and pyrin domain-containing (nlrp ) [ ] [ ] [ ] . activation of nlrp inflammasome occurs during a period of changes in ion concentrations [ , ] . the nlrp belongs to the nlr family of inflammasomes and causes interleukin (il)- β and il- secretion via caspase- activation [ ] . the emcv, pv, ev , and hrv b proteins all activate the nlrp inflammasome but use distinct mechanisms [ , ] . the hrv and emcv b proteins can stimulate the nlrp inflammasome pathway to activate caspase- , which catalyzes the proteolysis of pro-il- β to il- β, leading to its secretion from across the plasma membrane by inducing a ca + efflux from intracellular storage ( figure ) [ , ] . wang et al. [ ] found that cvb -infected cells induced the nlrp activation in association with a k + efflux. the influenza virus m protein, which is also a viroporin, is capable of transporting na + and k + , resulting in activation of the nlrp inflammasome [ , ] . since the cvb b protein acts as a viroporin and can disrupt the intracellular ion balance [ , ] , it has been speculated that the induction of nlrp activation in cvb -infected cells may be related to the b protein. in addition to activating the inflammasome, the b protein also antagonizes the host immune response. both in vitro and in vivo studies have suggested that inhibition of protein trafficking would effectively allow viral evasion of the host immune response (figure ) [ ] [ ] [ ] . moreover, inhibition of protein transport may be related to changes in the ca + concentration [ ] . similar to the cvb b protein, the b proteins of pv, hrv and hrv , were shown to significantly inhibit the protein transport through the golgi complex, whereas the hav, fmdv, and emcv b proteins did not inhibit the protein transport [ , , ] . the fmdv b and c proteins did not block protein secretion, whereas the transport of proteins from the er to the golgi complex were blocked by the fmdv bc protein, and this effect was reproduced upon co-expression of the c and b genes [ , ] . collectively, these findings suggest that the b protein may participate in a viral evasion of the host immune response, mainly by inhibiting protein transport. figure . b protein affects the host immune response. the b protein can stimulate the nlrp inflammasome pathway to activate caspase- , which catalyzes the proteolysis of pro-il- β to il- β, which leads to their secretion across the plasma membrane by inducing a ca + efflux from intracellular storage. moreover, the b protein inhibits protein transport through the golgi protein which may be effective to evade the host immune response. the b protein has also been suggested to facilitate the viral evasion of the host immune response through other means. thus, the b protein antagonizes rig-i-mediated antiviral responses by inhibiting the expression of rig-i as an fmdv-specific reaction [ ] . the rna helicase lgp (also known as dexh-box helicase , dhx ) is a crucial factor involved in the host antiviral immune response [ ] . the fmdv leader protein (lpro), c protein, and the b protein have the ability to induce a decrease in lgp protein expression [ ] . in addition, pv b variants were shown to inhibit the antiviral interferon (ifn) system [ ] , whereas the hav b protein inhibited the synthesis of ifn-β by affecting the mitochondrial antiviral signaling protein activity, thereby antagonizing the host immune response [ ] . collectively, these evidences indicate that picornaviral b proteins can affect the host immune response, thereby promoting viral amplification or the release of viral particles. as discussed above, picornaviral b proteins have a viroporin or viroporin-like activity and play an important role in the picornaviral life cycle. therefore, many common applications targeting viroporins may be translatable to those targeting b proteins. in addition, the b protein may serve as a new target for the development of antiviral drugs. thus, further studies on the structure and function of the b protein might open up new avenues for the prevention and control of picornaviruses. owing to its highly conserved sequence, the use of the b gene as a marker could effectively improve the accuracy of virus detection. li et al. [ ] designed primers and taqman probes, based on the b and d regions, which were successfully used in real-time polymerase chain reaction to accurately detect and quantify fmdv during infection and replication. in addition, wang et al. [ ] developed a lateral-flow detection system, which could rapidly and easily detect fmdv using the b gene. in addition to gene-based detection, the virus could be detected using a b antibody. biswal et al. [ ] used an indirect enzyme-linked immunoassay based on a recombinant b protein to detect antibodies specific for fmdv. this method can be applied not only to fmdv but also to other picornaviruses, including cvb and ev . given the significant threat that picornaviruses pose to humans and animals, resulting in enormous economic damage to the livestock industry, development of picornavirus vaccines is of great significance. although inactivated virus vaccines can offer effective prevention, there are associated residual risk issues, including incomplete virus inactivation and escape during the vaccine production process [ ] [ ] [ ] . therefore, genetically engineered vaccines are considered more suitable options to overcome these shortcomings of inactivated viral vaccines. the ev vp protein is located outside the viral membrane and is thus exposed to the greatest amount of immune stress. accordingly, vp shows an extreme serological variability, thus providing the most reliable molecular epidemiological information. consequently, the vp region of ev has become a focus of vaccine research for picornavirus infections [ ] . however, dna constructs containing the vp gene of ev showed low levels of antigenicity. therefore, there is still a need to develop an effective adjuvant strategy to increase the antigenicity. one possibility in this regard is the use of recombinant vaccines incorporating the b gene to enhance the efficacy of vaccines. at present, applications of the b gene in recombinant vaccines have mainly concentrated on fmdv. the addition of a b fragment to a vaccine designed with vp as the core has been shown to effectively enhance the vaccine efficacy [ ] [ ] [ ] and reduce the dose and side effects [ ] . these effects may be similar to those leading to a greater efficacy of the adenoviral vector vaccine fused to the fmdv b protein against serotype o, which is associated with the induction of specific cd + and cd + protective t cell responses [ ] . therefore, future designs of other picornaviral genetically engineered vaccines would benefit from the addition of the b gene to increase the vaccine efficacy, including the addition of the b gene to a genetically engineered ev vaccine with the vp gene as the core. since viroporin plays an important role in all life stages of the virus, it is an attractive antiviral therapeutic target, and there have been great breakthroughs in this regard. by contrast, research and development of drugs targeting the b protein are relatively delayed. since b proteins have a viroporin or viroporin-like activity, screening for anti-picornavirus drugs among existing viroporin-targeting drugs may be a viable approach. there are four main types of inhibitors of viroporin activity, including adamantane, amiloride, alkyl iminosugar, and spirane amine [ ] . adamantane (amantadine and rimantadine) inhibits the m channel of influenza a virus by destroying the transmembrane network of hydrogen-bonded water molecules, thereby inhibiting the viral amplification [ ] . in bhk- cells infected with fmdv, the virus titer gradually decreased with an increase in amantadine concentration, which may have been due to abrogation of the pore-forming activity of the b protein and ultimate inhibition of fmdv replication [ ] . however, clinical trials showed that amantadine was not only selective for specific resistance mutations in hepatitis c virus (hcv) p [ ] , but also caused a rapid emergence of amantadine-resistant variants of influenza a virus during monotherapy for influenza [ ] . amiloride is a composite of two drugs, -(n,n-hexamethylene) amiloride and a novel inhibitor, bit , targeting hcv p and hiv- vpu, which can together block the viroporin ion channel activity or prevent ion channel formation, resulting in a potent antiviral effect [ ] [ ] [ ] [ ] . the alkyl iminosugar inhibits the formation of ion channels by targeting the hcv p viroporin [ ] . finally, spirane amines, such as bl- , also inhibit the influenza a virus m protein, with an antiviral mechanism similar to that of amantadine [ ] . there are also other drugs that act as viroporin inhibitors, including , -dibenzyl- ( h- , , , tetraazol- -yl) hexahydropyrimidine (cd), n-( -phenylethyl)- -[ -(phenylsulfonyl)- -piperazinyl]- quinazolinamine (lds ), and -methyl- , , -trihydroxyanth-raquinone (emodin), among others [ , , ] . the mechanism of action of these viroporin inhibitors is based on the inhibition of the viroporin channel activity. therefore, these drugs may have the potential to be applied for the treatment of picornavirus infections by targeting the b gene. however, this application will require further detailed investigations and drug screening. nevertheless, the b protein has the potential to widen the range of antiviral treatment strategies. furthermore, specific degradation of complementary mrna can be triggered by small interfering rnas (sirnas) or folded short hairpin rnas (shrnas) [ ] , which can be explored as an rna interference strategy, a relatively novel technology that has already been applied to treat many important pathogens, including hiv- , hepatitis b virus, and herpes simplex virus [ ] [ ] [ ] . currently, shrnas targeting the highly conserved b gene sequence are widely used in picornavirus research, including fmdv [ , ] , emcv [ ] , and cvb [ ] , and significant experimental viral suppression has been achieved. basically, rna interference against b gene affects the stability and integrity of the whole viral genome. the high nucleotide sequence conservation makes the b gene an attractive target for rna interference, which may potentially be effective against multiple picornavirus types, and open the door for additional sirna drugs. to date, there have been few studies specifically focusing on inhibitors of the b protein. xie et al. [ ] found that , -diisothiocyano- , -stilbenedisulfonic acid (dids) blocked a chloridedependent current, mediated by the ev b protein, and suppressed viral amplification. however, further research is needed to uncover the underlying mechanism. despite the many challenges in drug development, new technologies such as fourier-transform infrared spectroscopy and design of molecular dynamics analogs, as well as cryo-electron microscopy and spectroscopy, are expected to greatly contribute to the development of antiviral drugs. recent studies have gradually clarified the function and the potential of the b protein, along with increasingly recognizing its importance in the viral life cycle. however, there are still some challenges to overcome in investigations of the picornaviral b protein. in particular, its strong hydrophobicity makes it difficult to achieve soluble expression. ao et al. [ ] conjugated the small ubiquitin-like modifier (sumo) protein to the n-terminus of the fmdv b protein and successfully achieved soluble expression. therefore, this method can be tested for other picornaviral b proteins. moreover, the detailed molecular mechanism of the action of the b protein requires further study, along with the identification of interactions of b protein with host proteins, to better understand the role of the b protein in the pathogenesis of picornaviruses. in murine cells, the b protein was suggested to react with host proteins to promote rhinovirus proliferation [ ] . using a yeast two-hybrid system, the fmdv b protein was found to interact with the host elongation factor γ (eef g), and mislocalization of eef g demonstrated that the eef g deletion affected the synthesis of membrane proteins [ , ] . although a yeast two-hybrid system is a common laboratory protein-screening technique, it has a low success rate and is time-consuming. alternatively, affinity purification-mass spectrometry can be used to overcome these shortcomings, which has already been widely used in studies on dengue, zika, and ebola viruses [ , ] . at present, the development of antiviral drugs against viroporins is focused on three aspects, including viroporin and membrane fusion inhibitors, ion channel inhibitors, and targeted viroporin antibodies [ ] . with respect to the biological function of the b protein, antiviral drugs targeting the b protein could be designed based on the following three approaches: broad-spectrum screening for anti-picornavirus drugs among existing viroporin inhibitors, screening for b protein and membrane fusion inhibitors, and screening for b protein pore activity inhibitors. as discussed herein, the most important basis for the function of the b protein is that it can be polymerized into pores, thereby changing the permeability of the membrane. therefore, the design of drugs targeting b protein should be based on inhibiting polymerization of the b protein into pores, thereby reducing its effects on cellular ion homeostasis. however, these designs first require detailed determination of the refined atomic structure of the b protein, along with the expansion of screening techniques and applications of meticulous medicinal chemistry. furthermore, to develop better antiviral drugs, it will be necessary to elucidate the exact role of the b protein channel in the viral life cycle. thus, the main points of focus for research on the structure and function of the b protein toward ultimate drug development are: ( ) mechanism of increasing membrane permeability to disturb the ion balance, ( ) regulation of autophagy and apoptosis, ( ) inhibition of the host immune response, and ( ) promotion of viral replication and release. taken together, as research aimed at further elucidation of the role of the b protein progresses, along with the adoption of new technologies, it is expected that more strategies will come to light for antiviral drug development and disease control. author contributions: z.l., z.z. and z.j. wrote the manuscript; q.l. and x.h. revised for its integrity and accuracy; q.l. approved the final version of this manuscript and takes responsibility for its contents. the authors declare no conflicts of interest. human parechoviruses-biology and clinical significance examples of the two-stage membrane protein folding model. viruses viroporins: structure and biological functions membrane-active peptides derived from picornavirus b viroporin molecular characterization of the viroporin function of foot-and-mouth disease virus nonstructural protein b protein b of coxsackievirus b induces autophagy relying on its transmembrane hydrophobic sequences. viruses topology and biological function of enterovirus non-structural protein b as a member of the viroporin family single point mutation in the rhinovirus b protein reduces the requirement for phosphatidylinositol -kinase class iii beta in viral replication the c-terminal region of the non-structural protein b from hepatitis a virus demonstrates lipid-specific viroporin-like activity encephalomyocarditis virus viroporin b activates nlrp inflammasome structural basis for host membrane remodeling induced by protein b of hepatitis a virus foot-and-mouth disease virus viroporin b antagonizes rig-i-mediated antiviral effects by inhibition of its protein expression development and validation of a duplex quantitative real-time rt-pcr assay for simultaneous detection and quantitation of foot-and-mouth disease viral positive-stranded rnas and negative-stranded rnas rapid detection of foot-and-mouth disease virus using reverse transcription recombinase polymerase amplification combined with a lateral flow dipstick detection of antibodies specific for foot-and-mouth disease virus infection using indirect elisa based on recombinant nonstructural protein b delivery of a foot-and-mouth disease virus empty capsid subunit antigen with nonstructural protein b improves protection of swine versatility of the adenovirus-vectored foot-and-mouth disease vaccine platform across multiple foot-and-mouth disease virus serotypes and topotypes using a vaccine dose representative of the adta conditionally licensed vaccine poly iclc increases the potency of a replication-defective human adenovirus vectored foot-and-mouth disease vaccine transgenically mediated shrnas targeting conserved regions of foot-and-mouth disease virus provide heritable resistance in porcine cell lines and suckling mice cross-inhibition to heterologous foot-and-mouth disease virus infection induced by rna interference targeting the conserved regions of viral genome specific small interfering rnas-mediated inhibition of replication of porcine encephalomyocarditis virus in bhk- cells short hairpin rna targeting b gene of coxsackievirus b exhibits potential antiviral effects both in vitro and in vivo adenovirus-vectored shrnas targeted to the highly conserved regions of vp and b in tandem inhibits replication of foot-and-mouth disease virus both in vitro and in vivo determinants for membrane association and permeabilization of the coxsackievirus b protein and the identification of the golgi complex as the target organelle viroporin-mediated membrane permeabilization. pore formation by nonstructural poliovirus b protein viroporin activity of the foot-and-mouth disease virus non-structural b protein coxsackievirus protein b modifies endoplasmic reticulum membrane and plasma membrane permeability and facilitates virus release membrane permeability induced by hepatitis a virus proteins b and bc and proteolytic processing of hav bc model generation of viral channel forming b protein bundles from polio and coxsackie viruses a peptide based on the pore-forming domain of pro-apoptotic poliovirus b viroporin targets mitochondria coxsackie b virus protein b contains cationic amphipathic helix that is required for viral rna replication mechanisms of membrane permeabilization by picornavirus b viroporin mutational analysis of different regions in the coxsackievirus b protein: requirements for homo-multimerization, membrane permeabilization, subcellular localization, and virus replication functional analysis of picornavirus b proteins: effects on calcium homeostasis and intracellular protein trafficking non-structural protein b of human rhinovirus activates both perk and atf rather than ire to trigger er stress cellular localization and effects of ectopically expressed hepatitis a virus proteins b, c, a and their intermediates bc, ab and abc membrane integration of poliovirus b viroporin poliovirus b insertion into lipid monolayers and pore formation in vesicles modulated by anionic phospholipids release of intracellular calcium stores facilitates coxsackievirus entry into polarized endothelial cells calcium signals and calpain-dependent necrosis are essential for release of coxsackievirus b from polarized intestinal epithelial cells a dual role for ca( +) in autophagy regulation the coxsackievirus b protein suppresses apoptotic host cell responses by manipulating intracellular ca + homeostasis enterovirus protein b po(u)res out the calcium: a viral strategy to survive? the coxsackievirus b protein increases efflux of ions from the endoplasmic reticulum and golgi, thereby inhibiting protein trafficking through the golgi the rotavirus nsp viroporin domain is a calcium-conducting ion channel picornaviruses and apoptosis: subversion of cell death regulation of apoptosis during flavivirus infection virus infection and death receptor-mediated apoptosis autophagy promotes the replication of encephalomyocarditis virus in host cells coxsackievirus b infection induces autophagic flux, and autophagosomes are critical for efficient viral replication viral reorganization of the secretory pathway generates distinct organelles for rna replication remodeling the endoplasmic reticulum by poliovirus infection and by individual viral proteins: an autophagy-like origin for virus-induced vesicles a cytopathic and a cell culture adapted hepatitis a virus strain differ in cell killing but not in intracellular membrane rearrangements induction of intracellular membrane rearrangements by hav proteins c and bc subversion of the cellular autophagy pathway by viruses foot-and-mouth disease virus induces autophagosomes during cell entry via a class iii phosphatidylinositol -kinase-independent pathway autophagy hijacked through viroporin-activated calcium/calmodulin-dependent kinase kinase-beta signaling is required for rotavirus replication coxsackievirus b uses autophagy for replication after calpain activation in rat primary neurons cytochrome c binds to inositol ( , , ) trisphosphate receptors, amplifying calcium-dependent apoptosis viral calciomics: interplays between ca + and virus enterovirus b induces cell apoptosis by directly inducing the conformational activation of the proapoptotic protein bax nod proteins: regulators of inflammation in health and disease rigorous detection: exposing virus through rna sensing regulation of adaptive immunity by the innate immune system nlrp inflammasome activation: the convergence of multiple signalling pathways on ros production? inflammasomes: current understanding and open questions rhinovirus-induced calcium flux triggers nlrp and nlrc activation in bronchial cells involvement of nlrp inflammasome in cvb -induced viral myocarditis influenza virus activates inflammasomes via its intracellular m ion channel response of host inflammasomes to viral infection inhibition of protein trafficking by coxsackievirus b : multiple viral proteins target a single organelle the different tactics of foot-and-mouth disease virus to evade innate immunity foot-and-mouth disease virus c protease induces fragmentation of the golgi compartment and blocks intra-golgi transport human rhinovirus causes golgi apparatus fragmentation without blocking protein secretion inhibition of the secretory pathway by foot-and-mouth disease virus bc protein is reproduced by coexpression of b with c, and the site of inhibition is determined by the subcellular location of c protective role of lgp in influenza virus pathogenesis foot-and-mouth disease virus infection inhibits lgp protein expression to exaggerate inflammatory response and promote viral replication poliovirus intrahost evolution is required to overcome tissue-specific innate immune responses hepatitis a virus protein b suppresses beta interferon (ifn) gene transcription by interfering with ifn regulatory factor activation new insights into physiopathology of immunodeficiency-associated vaccine-derived poliovirus infection; systematic review of over decades of data causes of impaired oral vaccine efficacy in developing countries recent progress towards novel ev anti-therapeutics and vaccines. viruses increased efficacy of an adenovirus-vectored foot-and-mouth disease capsid subunit vaccine expressing nonstructural protein b is associated with a specific t cell response viroporins: structure, function and potential as antiviral targets inhibitors of the m proton channel engage and disrupt transmembrane networks of hydrogen-bonded waters resistance mutations define specific antiviral effects for inhibitors of the hepatitis c virus p ion channel combination chemotherapy for influenza a phase b/ a study of the safety, pharmacokinetics and antiviral activity of bit in patients with hiv- infection antiviral efficacy of the novel compound bit against hiv- release from human macrophages understanding the inhibitory mechanism of bit drug against p viroporin using computational study a novel hepatitis c virus p ion channel inhibitor, bit , inhibits bovine viral diarrhea virus in vitro and shows synergism with recombinant interferon-alpha- b and nucleoside analogues the hepatitis c virus p protein forms an ion channel that is inhibited by long-alkyl-chain iminosugar derivatives growth impairment resulting from expression of influenza virus m protein in saccharomyces cerevisiae: identification of a novel inhibitor of influenza virus =structure-guided design affirms inhibitors of hepatitis c virus p as a viable class of antivirals targeting virion release emodin inhibits current through sars-associated coronavirus a protein specific inhibition of gene expression by small double-stranded rnas in invertebrate and vertebrate systems rna interference approaches for treatment of hiv- infection fatality in mice due to oversaturation of cellular microrna/short hairpin rna pathways an sirna-based microbicide protects mice from lethal herpes simplex virus infection dids blocks a chloride-dependent current that is mediated by the b protein of enterovirus amino acid changes in proteins b and a mediate rhinovirus type growth in mouse cells kinectin-dependent assembly of translation elongation factor- complex on endoplasmic reticulum regulates protein synthesis eef g interaction with foot-and-mouth disease virus nonstructural protein b: identification by yeast two-hybrid system protein interaction mapping identifies rbbp as a negative regulator of ebola virus replication comparative flavivirus-host protein interaction mapping reveals mechanisms of dengue and zika virus pathogenesis this article is an open access article distributed under the terms and conditions of the creative commons attribution (cc by) license key: cord- - stcx dd authors: mushegian, a. r.; koonin, e. v. title: cell-to-cell movement of plant viruses: insights from amino acid sequence comparisons of movement proteins and from analogies with cellular transport systems date: journal: arch virol doi: . /bf sha: doc_id: cord_uid: stcx dd cell-to-cell movement is a crucial step in plant virus infection. in many viruses, the movement function is secured by specific virus-encoded proteins. amino acid sequence comparisons of these proteins revealed a vast superfamily containing a conserved sequence motif that may comprise a hydrophobic interaction domain. this superfamily combines proteins of viruses belonging to all principal groups of positive-strand rna viruses, as well as single-stranded dna containing geminiviruses, double-stranded dna-containing pararetroviruses (caulimoviruses and badnaviruses), and tospoviruses that have negative-strand rna genomes with two ambisense segments. in several groups of positive-strand rna viruses, the movement function is provided by the proteins encoded by the so-called triple gene block including two putative small membrane-associated proteins and a putative rna helicase. a distinct type of movement proteins with very high content of proline is found in tymoviruses. it is concluded that classification of movement proteins based on comparison of their amino acid sequences does not correlate with the type of genome nucleic acid or with grouping of viruses based on phylogenetic analysis of replicative proteins or with the virus host range. recombination between unrelated or distantly related viruses could have played a major role in the evolution of the movement function. limited sequence similarities were observed between i) movement proteins of dianthoviruses and the mip family of cellular integral membrane proteins, and ii) between movement proteins of bromoviruses and cucumoviruses and m protein of influenza viruses which is involved in nuclear export of viral ribonucleoproteins. it is hypothesized that all movement proteins of plant viruses may mediate hydrophobic interactions between viral and cellular macromolecules. it is generally accepted that plant viruses exploit plasmodesmata to move from initially infected cells, which are usually negligible in amount, into neighbouring healthy cells. without cell-to-cell movement productive virus infection is not established. also it is known that specific movement proteins encoded by virus genomes are essential for cell-to-cell spread. frequently, a plant virus protein is referred to as movement protein if i) it is not a capsid protein and ii) disruption of the coding sequence of this protein abolishes infection in whole plants but has no effect on virus replication in protoplasts. putative movement proteins have been identified in this manner in about different plant virus groups (see [ , , ] for recent reviews). in some cases, additional virus proteins are also required for movement, e.g. coat protein [ , , ] or a specific segment in a replication protein [ ] , but these phenomena are not universal. on the other hand, a specific movement protein or at least movement domain is thought to be present in all viruses that are able to spread from cell to cell [ ] . apart from this genetic evidence for the movement function, the current knowledge of mechanisms which facilitate intercellular spread of plant virus genomes is insufficient. recently certain properties of some movement proteins, notably of the kda movement protein of tobacco mosaic virus (tmv), were characterized in vitro and in vivo and models for movement of plant viruses have been prompted by these studies [ , ] . additionally, movement proteins were subjects of comparative sequence analysis, which allowed to delineate common motifs in movement proteins from otherwise unrelated groups implying existence of common features among many movement proteins [ , ] . in this work, we sought to critically analyze currently available biochemical and ultrastructural data on movement proteins. we then investigated whether sequences of movement proteins could provide information on their function based on similarities with other viral or cellular proteins. fluorescent probes injected into a plant cell are either transported into neighbouring cells via plasmodesmata or retained in the initially injected cell. the crucial determinant for the exclusion of a particle from plasmodesmatal movement is its hydrodynamic (stokes) radius [ ] . studies of transgenic tobacco plants constitutively expressing tmv kda movement protein (mp + plants) revealed that certain molecules, which were too large to be transported in wild type plants, gained such an ability in mp + tobaccos. specifically, fitc-labelled dextran (mr . kda) moved from injected to neighbouring cells only in plants transformed with and expressing tmv kda protein but not in control mp-plant virus movement proteins plants [ ] . interestingly, plasmodesmata in mp + and mp-plants appeared to be structurally indistinguishable [ , ] . these data indicate that movement protein of tmv functionally modifies plasmodesmata. it is not understood yet whether these modifications and the changes required for moving virus genome through plasmodesmata are the same or different. virus nucleic acid would not be expected to exist in naked form in the cell; rather, it seems reasonable that, like all nucleic acids in eukaryotic cells, it is associated with cellular and/or virus proteins throughout its lifetime. while stokes radius of a kda dextran molecule is estimated to be about . nm [ , ] , ribonucleoproteins (rnps) are clearly larger, many of them having sizes within the range of - nm [ ] , which would exclude them from movement even in mp + plants described in [ ] . recently, plasmodesmatal permeability in nicotiana clevelandii leaves infected with tobacco rattle virus (trv) has been investigated [ ] . movement protein of trv, the kda protein, shares high sequence similarity with tmv movement protein, and both proteins are considered to be functionally very similar. simultaneous injection of virus and fluorescent tracers into cells of leaf trichomes allowed to observe movement of the labels under the conditions when the virus itself was known to move in the same cells. it has been shown that fitc-labelled dextran (mr . kda) moved from cell to cell only in trvinfected cells and synchronously with the virus. however, lucifer yellow-labelled dextran (mr kda) was excluded from plasmodesmatal movement even in infected cells. thus, certain molecules with the dimensions resembling the smallest of the known rnps did not move under conditions when virus rnps readily moved. apparently, either an extremely thin virus-specific rnp should be postulated (see below), or it might be concluded that slight increase in plasmodesmatal permeability induced by movement proteins hints at some important circumstances but is not sufficient to explain celt-to-cell movement of the virus genome. binding of movement proteins to single-stranded nucleic acids tmv kd movement protein has been overexpressed in e. coli, and purified protein has been shown to bind single-stranded (ss) dnas and rnas nonspecifically and cooperatively [ ] . a model has been proposed, in which movement protein binds genomic rna stoichiometrically to unfold and shape it into a thin complex [ , , ] . electron microscopy of such complexes formed in vitro revealed thin structures at the limit of microscope resolution; it was concluded that, as it was barely seen under em, the complex was thin enough to fit in plasmodesmata with size exclusion limit of nm [ ] . additionally, ss nucleic acid-binding properties have been reported for movement proteins of cauliflower mosaic caulimovirus (camv) [ ] , red clover necrotic mosaic dianthovirus (rcnmv) ( [ ] , d. cookmeyer, pers. comm.) and alfalfa mosaic virus (aimv) [- ] . interestingly, p , the movement protein of cauliflower mosaic virus, has been shown to preferentially bind rna rather than dna a.r. mushegian and e. v. koonin [ ] ; it was speculated that s transcript, a genome length virus replication intermediate, might be the form which is transported from cell to cell. these in vitro observations offer new insights into a mechanism of virus cell-to-cell movement. however, is should not be forgotten that virus rnas in vivo are associated with proteins, as revealed for tmv-specific rnas [ ] ; it is thought that such an association is maintained throughout the whole life of the rnps in the cell; a mechanism of stripping virus rna from these proteins has not yet been proposed. also, as binding of movement proteins to rna is thought not to be sequence-specific [ , ] , it could be anticipated that the majority of movement protein will bind to cellular rnas, which in the case of tmv infection are in excess over virus rnas at the time when kd protein is transiently expressed [ ] . the "ss nucleic acid-binding" model, therefore, has to be modified to deal with these difficulties. biochemical fractionation of infected cells revealed that in many virus-plant interactions movement proteins are found in cell wall-enriched fraction. immunogold labelling has confirmed these observations and demonstrated localization of tmv, a mv, camv, and rcmv movement proteins in plasmodesmatal channels or in cell wall near plasmodesmata [ , ] . movement protein of cowpea mosaic comovirus (cpmv), the kd/ kda protein, has been found to form tubular structures perpendicular to and intruding into plasma membrane and cell wall. these tubules have been found both in whole leaves and in protoplasts [- ] . virions have been observed inside the tubules [ ] . it has been speculated that the tubular inclusions are the structures required for movement. cucumber mosaic virus (cmv) movement requires the kda movement protein [ ] . this protein has not been found to associate with the cell wall or from tubules; instead, it was located in nucleoli of infected cells [- ] . observations on movement proteins of tmv and cpmv have given rise to the idea of two different types of virus movement, "tobamo-type" and "comotype". it has been proposed that the tobamo-type is characterized by movement protein localization in the plasmodesmatal area of the cell wall and by the virus infectious entity capable of cell-to-cell movement in the absence of the viral capsid protein. como-type of movement was thought to require both movement protein and coat protein and to involve formation of tubular structures protruding from cell wall into cytoplasm and formed by the movement protein [ , ] . complementation data argued, however, that there is certain compatibility among both types of movement as tobamoviruses were able to complement comovirus movement [ ] . when all available ultrastructural data are considered, the picture becomes not so clear-cut. it should be expected that rcmv, a comovirus closely related to cpmv, moves according to the como-type; instead, movement protein of rcmv was found exclusively in plasmodesmata of the infected plants [- ] which had been proposed to be a tobamo-type feature. camv movement protein is found in cell wall fraction and, more specifically, in plasmodesmata [ , ] ; this seemingly supports the attribution of caulimovirus movement to the tobamo-type. however, tubular structures of unknown composition have been observed in the cytoplasm of camv-infected cells [ ] and recently it has been shown that p , the movement protein of camv, forms tubular structures on the surface of inoculated protoplasts [ ] . these findings seem to be more consistent with the como-type action of the caulimovirus movement protein. the above discussion shows that it may be premature to draw too sharp a distinction between different types of movement based on ultrastructural observations. it also should be remembered that we see movement proteins at the sites where they are most easily observed and/or are retained for longer periods, but not necessarily at the sites of their essential activity. with full-length dna copies of numerous virus genomes available, attempts have been made to genetically dissect movement protein coding sequences. in most cases, lengthy deletions have been introduced, and effect of these deletions on the overall infectivity or individual properties of the movement proteins has been investigated. it has been shown that even small changes near the n-terminus of tmv kd movement protein abolish virus infectivity [ ] ; at the c-terminus, up to of the amino acid residues of this protein could be deleted without apparent effect on infectivity or cell wall localization of the movement protein. when c-terminal amino acids were deleted, virus moved slowly though movement protein was still found in plasmodesmata [- ] . deletion of cterminal residues ( to ) was lethal, and, at the same time, movement protein was no longer found in plasmodesmata. it has been noted that essential residues from to could comprise a domain with a specific function, or their deletion might simply cause misfolding of the protein i- ]. in another study, c-terminal and internal deletions have been introduced into tmv movement protein to determine location of ss nucleic acid-binding site(s) [- ] . initially, it was thought that residues to constitute rnabinding domain [ ] . later, this domain has been shown to rather decrease protein solubility [ ] . up to amino acids from the c-terminus of tmv movement protein could be eliminated without effect on ss nucleic acid binding capacity of the protein [- ]. in the remaining part, deletion of amino acids - abolished ss nucleic acid binding, but when the gene segment encoding these amino acids was fused to an unrelated sequence, the resulting himeric protein did not bind to ss nucleic acids [ ] . movement protein of a mv associates with cell wall upon infection and in transgenic tobacco plants; however, this did not happen when n-proximal amino acids ( - ) have been deleted [- ] . strains of tmv have been characterized that are temperature sensitive in a.r. mushegian and e. v. koonin: plant virus movement proteins cell-to-cell movement or break the host resistance which is based on plant genes restricting virus movement. involvement of mutations in kd movement protein cistron could be implied, and, indeed, in many cases nucleotide changes responsible tbr the altered phenotype were shown to be confined to the movement protein gene [ ] . many of these mutations were scattered around the middle one-third of kd movement protein and resulted in a change in polarity of the encoded amino acid [ ] . as no known function could be mapped to the area, these data are difficult to interpret. with representative sequences of plant virus genomes from many groups available, efforts were made to identify regions of similarity between known movement proteins and to predict movement functions for uncharacterized orfs. early observations made by this approach have been reviewed previously (e.g. [ ] ) and included strong similarity between tobamovirus and tobravirus movement proteins [ ] as well as less pronounced but significant local similarity between segments of tobamovirus and caulimovirus movement proteins [ ] . later, similarities between movement proteins of tricornaviruses and dianthoviruses were observed [ ] . in fact, the movement function for the respective proteins of tobraviruses, caulimoviruses and dianthoviruses has been implied by these observations and only subsequently confirmed by site-directed mutagenesis [ , , ] . more detailed analyses performed by melcher [ ] and by ourselves [ ] revealed longer (ca. amino acids) segments of sequence similarity between movement proteins from different virus groups. interestingly, one family of movement proteins included proteins encoded by viruses with positive-strand rna genomes, single-strand dna geminiviruses and by pararetroviruses with virion dna, namely caulimoviruses [ ] . in the time past after the publication of these analyses, further significant increase in the number of sequenced virus genomes was achieved and it has been argued that the current collection of such sequences may represent the majority of the existing virus groups [ , ] . with this in mind, we undertook a systematic comparison of the sequences of the movement proteins with each other and with the current sequence databases. using the blast program [ ] , statistically highly significant similarities were observed only between proteins of viruses belonging to a single group or two closely related groups. further comparisons were done using programs ma-caw [ a] and optal [ ] to generate multiple alignments according to the progressive alignment strategy [ ] , i.e. starting with pairs of most closely related sequences and proceeding to incorporate more and more distant ones. it was demonstrated that movement proteins of viruses belonging to groups contain a weak but confidently identified conserved motif consisting of approximately amino acid residues (fig. ) . this motif includes a nearly invariant aspartic acid residue (replaced by asparagine in dianthoviruses and ilarviruses) preceded by a region enriched in hydrophobic and nonpolar residues. in some of the movement proteins this region was predicted to form a transmembrane helix using the algorithm of rao and argos ( [ ] , data not shown; see also discussion below). remarkably, this conserved motif unites viruses with single-stranded dna genomes (geminiviruses of group ii), pararetroviruses with double-stranded dna genomes (caulimoviruses and badnaviruses), negative-strand rna viruses with ambisense genome segments (tospoviruses), and numerous groups of positive-strand rna viruses. among the latter, all the three major phylogenetically defined divisions [ , ] are represented, namely run-like viruses (tobamoviruses, tobraviruses, cucumoviruses, bromoviruses, ilarviruses, capilloviruses, furoviruses, raspberry bushy dwarf virus), picornalike viruses (comoviruses, parsnip yellow fleck virus, pea enation mosaic virus), and flavi-like viruses (dianthoviruses and tombusviruses). several groups of movement proteins showed significant similarity on longer sequence stretches. in fig. we show the amino acid sequence alignments for four of these groups, delineation of which included identification of new putative movement proteins. such new identifications are: i) kda protein of apple stem grooving virus, which is related to the putative movement proteins of the other capilloviruses, namely apple chlorotic leaf spot virus and potato virus t (fig. a) ; ii) kda protein of soil-borne wheat mosaic furovirus related to the movement proteins ofdianthoviruses (fig. b) ; iii) kda protein of pea enation mosaic virus and kda protein of tombusviruses related to the movement proteins of bromoviruses and cucumoviruses (fig. c) ; and iv) nsm protein of fig. . amino acid sequence alignments for groups of proteins including newly identified putative movement proteins. for each group of boundaries of the conserved segments were determined using the macaw program and the alignment itself was constructed hierarchically using the optal program. in the consensus u indicates a bulky aliphatic residue (i, l, v, or m), a indicates an aromatic residue (f, i , or w) & indicates a bulky hydrophobic residue (aliphatic or aromatic) and x indicates any residue. the conserved motif of the " k superfamily" is highlighted by bold typing, a putative movement proteins of capilloviruses. aclsv apple chlorotic leaf spot virus; pvt potato virus t; asgv apple stem grooving virus; b movement protein of red clover necrotic mosaic dianthovirus (rcnmv), putative movement proteins of raspberry bushy dwarf virus (rbd v) and soil-borne wheat mosaic furovirus (sb wmv); e movement proteins of cucumber mosaic cucumovirus (cmv) and brome mosaic bromovirus (bmv), putative movement proteins of pea enation mosaic virus (pemv) and cucumber necrosis tombusvirus (cnv); the similarity between kda proteins of tombusviruses and movement proteins of bromoviruses and cucumoviruses has been independently reported by u. melcher (pers. comm.); d putative movement proteins of caulimoviruses, badnaviruses, parsnip yellow fleck virus, and tomato spotted wilt tospovirus; asterisks show identical residues and colons show similar residues between the sequences of pyfv and fmv, and coymv and tswv tospoviruses and the n-terminal domain of the parsnip yellow fleck virus polyprotein related to the putative movement proteins of caulimoviruses and badnaviruses (fig. d; the similarity between caulimovirus movement proteins and n-terminal domains of badnavirus polyproteins was noted by u. melcher and cited in [ ] ). even among these smaller groups, the latter three brought together proteins of very different viruses. group iv) is particularly notable in that it compiles proteins of a positive-strand rna virus, negative-strand (ambisense) rna viruses, and dna-containing pararetroviruses. curiously, the array of domains, namely n-movement domain-capsid protein(s)-replicative domains-c in the polyproteins of such seemingly unrelated viruses as parsnip yellow fleck virus and the badnaviruses is very similar. a relevant observation from our previous work is the relationship between movement proteins of two component geminiviruses (bl /bc ) with those of tobamoviruses and tobraviruses [ ] . all the proteins containing the conserved motifs shown in fig. should be considered a single vast, and highly diverged superfamily of plant virus movement proteins. we would like to coin the nickname " k superfamily", after the most thoroughly studied movement protein of tobamoviruses, and also because most of the proteins of this superfamily have the size around this value (fig. a) . several groups of positive-strand rna viruses, namely potexviruses, carlaviruses, hordeiviruses, and a group including beet necrotic yellow vein virus, nicotiana velutina mosaic virus, and peanut clump virus, encompass the triple gene block that consists of two small membrane proteins and a putative rna helicase [ , ] . it has been demonstrated that disruption of any of the triple block genes abolished the cell-to-cell movement of hordeiviruses, potexviruses and bnyvv [ a, a, a]. the putative helicases in the triple block comprise a distinct group within superfamily i of dna and rna helicases and contain several unusual amino acid substitutions in the conserved helicase motifs [ , ] . unlike the genes coding for the proteins of the " k superfamily", the triple block is found only in positive-strand rna viruses. carmoviruses and necroviruses encode a pair of small proteins, one of which is predicted to be an integral membrane protein. it has been shown that disruption of either of these genes precluded cell-to-cell movement of turnip crinkle carmovirus [ ] . these two genes perhaps may be considered a "truncated triple block" although it was hard to demonstrate this at the level of sequence comparison due to the small size of the hydrophobic proteins. a third type of movement protein has been revealed in tymoviruses. a kda proline-rich protein that is encoded in the ' portion of the genomic rna and overlaps with the replicative polyprotein gene is essential for the movement of tymv in infected plants [ ] . this protein may have specific secondary and tertiary structure similar to that of other proline-rich filamentous proteins. interestingly, upon screening of the amino acid sequence databases with the sequence of the tymovirus movement protein the highest (albeit moderate) similarity was observed with the plant cell wall-associated protein extensin (data not shown). recently it has been shown that cell-to-cell movement of geminivirus msv is dependent on the small protein v [ ] . this protein and the related proteins of other one component geminiviruses did not show appreciable sequence similarity to movement proteins of other viruses. interestingly, however, alignment of their amino acid sequences revealed the conservation of a highly hydrophobic central domain terminating at an invariant aspartic acid residue and thus distantly resembling the conserved motif of the k superfamily (fig. ) . this domain was flanked by two proline-rich domains showing some similarity to extensins (fig. ) . thus, in a sense the movement proteins of one component geminiviruses combined the k superfamily and the tymovirus structural themes. for several virus groups, in which at least one member has been completely sequenced, the movement function has not been genetically mapped and identification of a putative movement proteins by amino acid sequence comparison was very uncertain if possible at all. among these, uncharacterized proteins of nepoviruses and closteroviruses showed a very limited sequence similarities to some members of the k superfamily, in particular to caulimovirus movement proteins [ ] . the observations discussed in the previous sections show that classification of movement proteins based on comparison of their amino acid sequences does not correlate with the type of genome nucleic acid or with grouping of viruses based on phylogenetic analysis of replicative proteins - , , ] . also, we were unable to notice any correlation between grouping of movement proteins and biological properties of the viruses encoding these proteins (e.g. host ranges). recombination between unrelated or distantly related viruses could have played a major role in the evolution of the movement function. this is clearly the preferential explanation for situations when related movement proteins are found in viruses with different type of genome nucleic acid (e.g. the striking similarity between the putative movement proteins of caulimo/badnaviruses fig. d) . in some cases recombinational transfer of movement protein genes appears very likely also between remote groups of positive-strand rna viruses (e.g. dianthoviruses and soil borne wheat mosaic furovirus; fig. c) . another equally important trend in the evolution of movement protein genes is the apparent high rate of mutational change leading to the very limited level of sequence conservation as featured above. even within compact groups of viruses that share common genome organization and functional properties (e.g. tobarnoviruses), the divergence between the movement protein sequences is much higher than that between the principal repticative domains although comparable to that between capsid proteins. it has been proposed that plasmodesmata are plant analogues for certain animal cell-cell contacts, namely gap junctions, based mainly on studies in permeability of both to fluorescent dye-labelled molecules [ , ] . details of protein organization of plasmodesmata are poorly understood, although a protein serologically related to a major gap junctions protein, connexin, was reportedly found in plasmodesmata and gene encoding this plant protein was cloned [ ] . in animal kingdom, gap junctions are important for electrical coupling of cells in various tissues [ ] . interestingly, permeability of both plasmodesmata and gap junctions for some low molecular weight tracer dyes is down-regulated by activation of protein kinase c [ ] . however, movement of macromolecules, in particular nucleoproteins, through gap junctions, has not been described. a case of movement of a nucleoprotein through a channel in a membrane is represented by nucleocytoplasmic trafficking of rnps (reviewed in [ , ] ). among the features of the nucleocytoplasmic export of cellular rnas revealed thus far, several might be relevant to the mechanisms of movement of plant virus genomes. in particular, it has been shown that at all stages of nucleocytoplasmic export, mrna is attached to specific protein frameworks, being transferred from nucleoskeleton to nuclear pore complex to cytoplasm [ ] ; specific proteins in mrnp are thought to chaperone association-dissociation of the mrna to and from these frameworks [ , ] . components of mrnp activate enzymes within the nuclear pore complex which are essential for energydependent translocation of rnp through nuclear pore; two of these enzymatic activities are atpase and rna helicase [ ] . it is tempting to speculate that mechanistic analogies may exist between the nucleocytoplasmic transport of cellular rnps and intracellular and/or intercellular movement of viral rnps. more specifically, the parallel between the rna helicase in the nuclear pore and the putative viral rna helicase in the triple block that is involved in virus movement (see above) certainly is provocative. it should be noted that plant viruses, which undergo some stages in the nucleus of the infected cells (i.e. pararetroviruses and geminiviruses), should be able to perform both nucleocytoplasmic movement and cell-to-cell movement of their genomes; one may wonder to what extent these two processes are similar and whether movement proteins are involved in both of them. comparison of the amino acid sequences of dianthovirus movement proteins with amino acid sequence databases revealed a marginally significant but provocative similarity with membrane proteins of the mip family [ ] . with the blast score of and probability of random matching of . , the similarity of the movement protein of rcnmv with tonoplast membrane protein tip from arabidopsis was the highest in the entire database except for the similarities with the homologous proteins of the other dianthoviruses. analysis of the multiple alignment of the dianthovirus movement proteins with a representative set of the mip proteins (fig. ) showed that the region of similarity included the conserved motif of the " k superfamily" described above and the characteristic conserved motif of the mip family [ ] . each of these motifs consisted of a putative transmembrane helix succeeded by a loop containing conserved amino acid residues (fig. ) . the drastic difference between the two types of proteins is that mip proteins contain six transmembrane helices whereas in the virus movement proteins the region shown in figs. and is the only such predicted helix. nevertheless, we believe that the observed sequence similarity may be functionally relevant indicating that the conserved motif of the " k superfamily" may mediate membrane interaction. for many proteins of this superfamily, membrane interaction is compatible with the observed localization in the cell wall [ ] . however, caution is due in the interpretation of these observations as the hydrophobicity of the conserved region in different movement proteins differed significantly and not for all of them a membrane-spanning helix was confidently predicted ( fig. b and fig. . a conserved sequence motif in the movement proteins of dianthoviruses and the mip family of membrane proteins. asterisks show identical residues and colons show similar residues in the sequences of the movement protein of rcnmv and tip . the consensus (for symbols see legend to fig. ) includes the conserved residues, with one possible exception in the mip-related proteins. the hydrophobic regions predicted to form transmembrane helices are underlined. the motif was extracted from an alignment generated using the macaw program in some of these proteins the conserved motif may be involved in hydrophobic interactions other than membrane spanning. as mentioned above, membrane localization also has been proposed for the triple block proteins and strongly predicted for the movement proteins of monopartite geminiviruses. the observations on the involvement of influenza virus m protein in nuclear export of virus ribonucleoproteins [- ] prompted us to directly compare the m sequence to those of plant virus movement proteins. moderate similarity was revealed between m and the movement proteins of bromoviruses and cucumoviruses (fig. ) . the region of similarity included the conserved motif of the " k superfamily", with the invariant aspartic acid residue and the preceding hydrophobic stretch. m protein chaperones newly formed influenza virus rnps from the nucleus to the cytoplasm; upon formation of virions, this protein is thought to provide the link between the rnp and the virion membrane (reviewed in [ ] ). both of these functions are apparently based on the ability of m to interact both with rnp and the membrane providing a provocative analogy with the plant virus movement proteins. very recently, a mutation in m protein gene resulting in impaired nucleocytoplasmic movement was mapped within the region of similarity to bromovirus and cucumovirus movement proteins [ ] . finally, our previous observations indicated a weak sequence similarity between a domain of cellular kda heat shock proteins with some of the movement proteins, specifically those of caulimoviruses [ ] . a common denominator for all these observations may be that the function of plant virus movement proteins is determined by domains mediating hydrophobic interactions. taking into account the limited sequence conservation and the absence of strictly invariant amino acid residues, it is very unlikely that these proteins (with the exception for the putative helicase in the triple block) have any enzymatic activity. among the known groups of plant viruses [ ] , movement proteins so far have been identified genetically or by sequence comparison for about percent. the variety of these proteins could be reduced to only three types, namely the " k superfamily", with the distantly similar proteins of monopartite geminiviruses; the triple block; and the proline-rich proteins of tymoviruses. obviously, future studies may lead to characterization of movement proteins unrelated to any of these groups. different types of movement proteins may affect different aspects in viruscell interaction leading to virus spread in planta. however, it appears likely that in many cases these proteins mediate hydrophobic interactions between viral and cellular components, in a general analogy with molecular chaperone action [ , ] . as discussed in the first part of this review, a large fraction of experimental data on plant virus movement proteins is very hard to be interpreted unequivocally. conceivably, this reflects the highly complex nature of virus-plant interaction. in this situation, it appears reasonable to focus further experiments on probing the specific functions fo conserved domains revealed by amino acid sequence comparison. hopefully, this approach will highlight new important aspects of plant virus lifestyle. recently, it has been shown that tmv movement protein expressed by transgenic tobacco behaves as an integral membrane protein [moore pj, fenczik ca, deom cm, beachy rn ( ) developmental changes in plasmodesmata in transgenic tobacco expressing the movement protein of tobacco mosaic virus. protoplasma : - ]. between nucleus and cytoplasm cauliflower mosaic virus gene i product detected in cell wall-enriched fraction basic local alignment search tool expression of a plant virus-coded transport function by different viral genomes dynamic continuity of cytoplasmic and membrane compartments between plant cells triple gene block proteins of white clover mosaic potexviurs are required for transport the tmv movement protein: role of the c-terminal amino acids in subcellular localization and function mutational analysis of cis-acting sequences and gene function in rna of cucumber mosaic virus an analysis of the complete nucleotide sequence of a sugarcane bacilliform virus genome infectious to banana and rice replication of maize streak virus mutants in maize protoplasts: evidence for a movement protein expression of orf- of turnip yellow mosaic virus is necessary for virus spread in plants nucleotide sequence analysis of the movement genes of resistance breaking strains of tomato mosaic virus a three-dimensional view of precursor messenger rna metabolism within the mammalian nucleus potato virus x as a vector for gene expression in plants the p movement protein of tobacco mosaic virus is a single-stranded nucleic acid binding protein gene i, a potential cell-to-cell movement locus of cauliflower mosaic virus, encodes an rna-binding protein how do plant virus nucleic acids move through intercellular connections visualization and characterization of tobacco mosaic virus movement protein binding to single-stranded nucleic acids relationship of tobacco mosaic virus gene expression to movement within plant plant virus movement proteins increase in plasmodesmatal permeability during cell-to-cell spread of tobacco rattle virus from individually inoculated cells of urfs and orfs. a primer on how to analyze derived amino acid sequences the plant virus movement proteins informosome-like virus-specific ribonucleoprotein (vrnp) may be involved in the transport of tobacco mosaic virus infection an influenza virus temperature-sensitive mutant defective in the nuclear-cytoplasmic transport of the negative-sense viral rnas an n-proximal sequence of alfalfa mosaic virus movement protein is necessary for association with cell walls in transgenic plants classification and nomenclature of viruses. fifth report of the international committee on taxonomy of viruses efficient cell-tocell movement of beet necrotic yellow vein virus requires ' proximal genes located on rna a novel superfamily of nucleoside triphosphate-binding motif-containing proteins which are probably involved in duplex unwinding in dna and rna replication and recombination an atp-binding motif is the most conserved sequence in a highly diverged group of proteins involved in positive strand rna viral replication turnip crinkle virus genes required for rna replication and virus movement the complete nucleotide sequence of tobacco rattle virus rna- the sequence of carnation etched ring virus dna: comparison with cauliflower mosaic virus and retroviruses the phylogeny of rna-dependent rna polymerases of positivestrand rna viruses virus evolution: time for sturm und drang diverse groups of plant dna and rna viruses share related movement proteins that may possess chaperonelike activity evolution and taxonomy of positive-strand rna viruses: implications of comparative analysis of amino acid sequences genes and proteins of the influenza viruses the subcellular localization of gene i product of cauliflower mosaic virus is consistent with a function associated with virus spread ultrastructural location of non-structural protein a of cucumber mosaic virus in infected tissue using monoclonal antibodies to a cloned chimeric fusion protein nuclear mrna export nuclear transport of influenza virus ribonucleoproteins: the viral matrix protein (m ) promotes export and inhibits import virus movement in infected plants translocation of a specific premessenger ribonucleoprotein particle through the nuclear pore studied with electron microscope tomography gap junction protein homologue in arabidopsis thaliana: evidence for connexins in plants similarities between putative transport proteins of plant viruses probable reassortment of genomic elements among elongated rna-containing plant viruses nuclear import-export: in search of signals and mechanisms detection of the movement protein of red clover necrotic mosaic virus in a cell wall fraction from infected nicotiana clevetandii plants cooperative binding ofthe red clover necrotic mosaic movement protein to single stranded nucleic acids evolution of the mip family of integral membrane transport proteins cauliflower mosaic virus gene i product (p ) forms tubular structures which extend from the surface of infected protoplasts identification of barley stripe mosaic virus genes involved in viral rna replication and systemic spread a conformation preference parameter to predict helices in integral membrane proteins spray dc (eds) ( ) parallels in cell-to-cell junctions in plants and animals effects of deletions in the n-terminal basic arm of brome mosaic virus coat protein on rna packaging and systemic infection evidence for involvement of a nuclear envelope-associated rna helicase activity in nucleocytoplasmic rna transport a workbench for multiple alignments construction and analysis a mutation in cauliflower mosaic virus gene i interferes with virus movement but not virus replication deletion analysis of brome mosaic virus a protein: effects on rna replication and virus spread tubular structures involved in movement of cowpea mosaic virus are also formed in infected cowpea protoplasts plasmodesmatal function is probed using transgenic tobacco plants that express a virus movement protein cell-to-cell transport of cowpea mosaic virus requires both the / k proteins and the capsid proteins the roles of the red clover necrotic mosaic virus capsid and cell-to-cell movement proteins in systemic infection tobacco rattle virus rna- k gene product potentiates viral movement and also affects symptom production in tobacco evolution of rna viruses we appreciate helpful discussions with dr. v. v. dolja and dr. u. melcher. we would like to thank drs. d. cookmeyer, s. a. demler, r. kormelink, u. k. melcher, n. olszewski and yu. shirako for communicating data prior to publication. a. m. is grateful to dr. r. j. shepherd for constant support and encouragement. received june , key: cord- -nqkw v w authors: qu, zehui; gao, fei; li, liwei; zhang, yujiao; jiang, yifeng; yu, lingxue; zhou, yanjun; zheng, hao; tong, wu; li, guoxin; tong, guangzhi title: label‐free quantitative proteomic analysis of differentially expressed membrane proteins of pulmonary alveolar macrophages infected with highly pathogenic porcine reproductive and respiratory syndrome virus and its attenuated strain date: - - journal: proteomics doi: . /pmic. sha: doc_id: cord_uid: nqkw v w significant differences exist between the highly pathogenic (hp) porcine reproductive and respiratory syndrome virus (prrsv) and its attenuated pathogenic (ap) strain in the ability to infect host cells. the mechanisms by which different virulent strains invade host cells remain relatively unknown. in this study, pulmonary alveolar macrophages (pams) are infected with hp‐prrsv (hun ) and ap‐prrsv (hun ‐f ) for h, then harvested and subjected to label‐free quantitative ms. a total of proteins are identified, including that are differentially expressed. among them, proteins are located on the membrane. the most differentially expressed proteins are involved in response to stimulus, metabolic process, and immune system process, which mainly have the function of binding and catalytic activity. cluster of differentiation cd , vimentin (vim), and nmii as well as detected proteins are assessed together by string analysis, which elucidated a potentially different infection mechanism. according to the function annotations, prrsv with different virulence may mainly differ in immunology, inflammation, immune evasion as well as cell apoptosis. this is the first attempt to explore the differential characteristics between hp‐prrsv and its attenuated prrsv infected pams focusing on membrane proteins which will be of great help to further understand the different infective mechanisms of hp‐prrsv and ap‐prrsv. porcine reproductive and respiratory syndrome virus (prrsv) has been a leading economically significant viral pathogen of swine worldwide for almost years. [ ] [ ] [ ] prrsv, equine arteritis virus, simian hemorrhagic fever virus, and lactate dehydrogenaseelevating virus are members of the family arteriviridae. [ , ] prrsv is a positive-sense, single-stranded, rna virus with a full-length genome of kb that has a cap and a poly (a) tail. [ , [ ] [ ] [ ] prrsv was first reported in the united states in the late s. [ ] in , several large-scale, severe outbreaks of highly pathogenic (hp) atypical prrsv (hp-prrsv) were reported in china and neighboring asian countries. [ , , ] reported hp-prrsv morbidity and mortality rates were much higher than previous pandemic prrsv strains and associated with more severe clinical presentations and higher rectal temperature (> °c). [ ] the emergence of hp-prrsv has caused great economic loss to the swine industry in china and made preventing and controlling prrsv outbreaks difficult. therefore, elucidating the causes of the greater virulence of prrsv and the differences between the hp and attenuated pathogenic (ap) strains has become even more important. to this end, several trials have been conducted to identify virulence factors; these studies have resulted in some successes. [ , ] however, changes in virulence and pathogenic mechanisms are difficult to discern. other than virulence in vivo, many distinctions in the biological aspects of hp-prrsv and ap-prrsv have been noted, such as in viral binding and entry into pulmonary alveolar macrophages (pams). prrsv exhibits highly restricted cell tropism both in vivo and in vitro. [ ] the virus can be detected only in well-differentiated macrophages of lungs, lymph nodes, peyer's patches, spleen, tonsils, and thymus. pams are the main target cells of prrsv. [ ] prrsv can also replicate in vitro in the african www.advancedsciencenews.com www.proteomics-journal.com membrane proteins (mps) of pams infected by highly proteomic (hp)-and attenuated proteomic (ap)-prrsv have been elucidated by lc-ms/ms for label-free quantitative proteomics. ninety-five differentially expressed proteins were identified and characterized. the most significant difference in the biological process between pams infected with hp-and ap-prrsv is the metabolic process. most different molecular functions were classified as binding and catalytic activities. cellular component categories showed that differentially expressed proteins were confirmed as mps based on the annotation of uniprot database, such as rap a, vcl (vinculin), ifitm function in cell-cell junctions, erk signaling pathway, g protein signaling pathways, biotic stimulus, and so on. among them, vcl is a kind of f-actin-binding protein which is involved in cell-matrix adhesion and cell-cell adhesion in humans. it was demonstrated that over expression of vcl could inhibit the replication of both hp-prrsv and the attenuated prrsv in the mrna level. there were obvious differences in the inhibiting ability for hp-prrsv and its attenuated strain. this is the first attempt to explore the differential characteristics between hp-prrsv and its attenuated prrsv-infected pams focusing on mps which will be of great help to further understand the different infective mechanisms of hp-prrsv and ap-prrsv. green monkey kidney cell line ma- and its derivatives, marc- and cl- , which are considered permissive cell lines for prrsv. [ , ] reportedly, prrsv targets cellular membrane proteins (mps) and enters target cells through receptormediated endocytosis during viral infection. [ ] studies have investigated possible mechanisms employed by prrsv to infect pams. [ ] elucidating the characteristics of viruses and interactions between viruses and host cells is increasingly important. however, exploring individual proteins within the many proteins in cells is difficult. nonetheless, gradual advancements have come through use of proteomic techniques. of these, ms-based quantitative proteomic techniques offer the advantage of better accuracy and sensitivity, and have been widely used to analyze host cell responses to viral infection. among them, liquid chromatographytandem ms (lc-ms/ms) for label-free quantitative proteomics (lfqp) is an important mass spectrometric tool to detect and quantify large amounts of proteins. [ ] compared with quantitative proteomics using stable isotope labeling such as stable isotope labeling by amino acids in cell culture and isobaric tags for relative and absolute quantitation, lfqp detects greater amounts of proteins and important signaling pathways and networks. the aim of this study was to determine potentially different infection mechanisms used by the hp strain vhun [ ] and its derivative attenuated strain vhun -f [ , ] using lfqp. the animal study protocols were approved by the animal care and use committee of shanghai veterinary research institute, chinese academy of agricultural sciences. hp-prrsv strain vhun [ , ] at a titer of % tissue culture infective dose (tcid ) ml − and cell-passaged attenuated virus strain vhun -f (ap-prrsv) [ , ] at a titer of tcid ml − were stored as viral stocks. porcine circovirus , classical swine fever virus, prrsv antibody, and antigen-free -day-old piglets were used. animals were sacrificed in accordance with the ethics statement. lungs were dissected and lavaged with pbs (pbs; life technologies, inc., gibco/brl division, grand island, ny, usa) supplemented with % penicillinstreptomycin (gibco/brl), then centrifuged at × g for min, resuspended in pbs, centrifuged, and resuspended in pbs. pams were collected in roswell park memorial institute (rpmi) medium (gibco/brl) containing % fetal bovine serum (gibco/brl) [ ] and incubated in cm dishes (corning, inc., corning, ny, usa) for h at °c in a % co atmosphere. after pams were washed with pbs three times, dead and nonadherent cells were removed when confluency exceeded %. three dishes were inoculated with vhun and another three with vhun -f at multiplicity of infection . an additional three dishes were inoculated with dmem (gibco-brl) as a blank control. all dishes were incubated at °c in an atmosphere of % co , as described previously. [ ] after incubation for h, inocula were discarded and pams were washed with pbs three times. cell monolayers in all dishes were overlaid with rpmi- medium containing % fetal bovine serum and incubated at °c in a % co atmosphere for h. pams were digested with ml . % trypsinethylenediaminetetraacetic acid solution (gibco/brl), collected by gently pipetting, centrifuged at × g for min and lysed using the proteoextract transmembrane protein extraction kits (novagen, emd biosciences, inc., madison, wi, usa), [ , ] according to the manufacturer's instructions. cells were resuspended in extraction buffer and protease inhibitor cocktail, incubated for min at °c with gentle agitation, and centrifuged at × g for min at °c. after removing supernatants, pellets were resuspended in . ml extraction buffer a and protease inhibition cocktail, incubated for min at room temperature with gentle agitation, and centrifuged at www.advancedsciencenews.com www.proteomics-journal.com × g for min at °c. supernatants were precipitated with ml acetone and centrifuged at × g for min. after evaporating to dryness, μl sdt buffer ( % sodium dodecyl sulfate, mm tris/hcl at ph . , . m dithiothreitol) was added and mixtures were heated in boiling water for min. after centrifugation, supernatants were collected and quantified with a bca protein assay kit (bio-rad, usa). fig. digestion of protein ( μg for each sample) was performed according to the fasp (filter-aided sample preparation) procedure. briefly, the detergent, dtt and other low-molecular-weight components were removed using μl ua buffer ( m urea, mm tris-hcl ph . ) by repeated ultrafiltration (microcon units, kd) facilitated by centrifugation. then μl . m iodoacetamide in ua buffer was added to block reduced cysteine residues and the samples were incubated for min in darkness. the filter was washed with μl ua buffer three times and then μl mm nh hco twice. finally, the protein suspension was digested with μg trypsin (promega) in μl mm nh hco overnight at °c, and the resulting peptides were collected as a filtrate. the peptide content was estimated by uv light spectral density at nm using an extinctions coefficient of . of . % (g l − ) solution that was calculated on the basis of the frequency of tryptophan and tyrosine in vertebrate proteins. [ ] the peptide of each sample was desalted on c cartridges (empore spe cartridges c (standard density), bed id mm, volume ml, sigma), then concentrated by vacuum centrifugation and reconstituted in μl of . % (v/v) trifluoroacetic acid. ms experiments were performed on a q exactive mass spectrometer that was coupled to easy nlc (proxeon biosystems, now thermo fisher scientific). five microgram peptide was loaded onto a c -reversed phase column (thermo scientific easy column, cm long, μm inner diameter, μm resin) in buffer a ( % acetonitrile and . % formic acid) and separated with a linear gradient of buffer b ( % acetonitrile and . % formic acid) at a flow rate of nl min − controlled by intelliflow technology over min. ms data was acquired using a datadependent top ten method dynamically choosing the most abundant precursor ions from the survey scan ( - m/z) for hcd fragmentation. determination of the target value is based on predictive automatic gain control. dynamic exclusion duration was s. survey scans were acquired at a resolution of at m/z and resolution for hcd spectra was set to at m/z . normalized collision energy was ev and the underfill ratio, which specifies the minimum percentage of the target value likely to be reached at maximum fill time, was defined as . %. the instrument was run with peptide recognition mode enabled. ms experiments were performed triply for each sample. [ ] the ms data were analyzed using maxquant software version . . . . ms data were searched against the uniprot sus scrofa sequence database (including sequences downloaded on / / ). an initial search was set at a precursor mass window of ppm. the search followed an enzymatic cleavage rule of trypsin/p and allowed maximal two missed cleavage sites and a mass tolerance of ppm for fragment ions. carbamidomethylation of cysteines was defined as fixed modification, while protein n-terminal acetylation and methionine oxidation were defined as variable modifications for database searching. the cutoff of global false discovery rate for peptide and protein identification was set to . . label-free quantification was carried out in maxquant as previously described. protein abundance was calculated on the basis of the normalized spectral protein intensity (lfq intensity). all statistical analyses were performed using unpaired t-tests. a p-value < . and ratio > or < . were considered to indicate significant differences. gene ontology (go) annotation and functional classification of identified proteins was with blast go ver. v . . with the current public database b g_aug (www.blast go.com). identified proteins were classified www.advancedsciencenews.com www.proteomics-journal.com using blast go steps under default parameters: blast, mapping, and annotation. protein-protein interaction networks were analyzed using (string software string-db.org/) . confidence view was assigned a score of . , indicating medium confidence. samples of prrsv-infected and dmem-inoculated pams were lysed at h post infection and protein concentrations were determined. samples ( μg) were separated by % sds-page and transferred to . μm nitrocellulose membranes (bio-rad laboratories, hercules, ca, usa). membranes were blocked with % skim milk in tris-buffered saline containing . % tween- and incubated overnight at °c with monoclonal antibodies against heat shock protein (hsp ; ab ; abcam plc, cambridge, uk) or kdel receptor (ab ; abcam plc). after washing three times, membranes were incubated at °c for min with horseradish peroxidase-conjugated anti-mouse igg or antirabbit igg (abcam plc). detection used chemiluminescence luminal reagents (pierce biotechnology, waltham, ma, usa). the porcine vinculin (vcl) were amplified from the cdna obtained from pam cells. restriction enzyme sites were incorporated into the primer sequences to facilitate molecular cloning. pcr products were cloned into the pcaggs vector to produce the porcine vcl expression vector. for transfection, cells were seeded in -well plates (corning) and transfected at - % confluency with respective constructed plasmids dna by using lipofectamine (life technologies), according to the manufacturer's instructions. the hp-hun or hun -f was infected in marc- cells at h post transfection. empty vector transfection samples served as controls in the experiment. at h post infection, total rnas of prrsv-infected cells were extracted using the rneasy mini kit (qiagen), the viral rnas in supernatants were isolated using qiaamp viral rna mini kit (qiagen), according to the instruction manual. all the isolated rnas were used as the template for synthesis of firststrand cdna by rt-pcr using rt primed by oligo (dt) primer using the primescript rt master mix (perfect real time, takara), according to the manufacturer's instructions. then the cdna templates were quantified using prrsv-specific real-time rt-qpcr. [ ] . results a total of proteins were detected by lfqp and are displayed in a heatmap (figure ) . statistical significance was determined using unpaired t-tests. for all tests, a p-value of < . and ratio of > or < . was considered to indicate a significant difference. glyceraldehyde -phosphate dehydrogenase was the internal normalization control and the ratio of glyceraldehyde -phosphate dehydrogenase between hp and ap-prrsv was . ± . . the "only one exists" group indicated that proteins were detected only in one group but not another group due to low expression level ( table ) . correlation analysis indicated good repeatability of the technology (figure ) . a total of differentially expressed proteins were identified ( table ) . a control group was used to exclude false-positive interference. data from the control group was also used to elucidate functions of target proteins. differentially expressed proteins between control and hp-prrsv-infected cells (con/hp) and the www.advancedsciencenews.com www.proteomics-journal.com p < . and ratio > or < . indicated quantitative difference between two groups. the "only one exits" group indicated that proteins were detected three times in one, but not in the other group. hp-prrsv group means proteins identified in the hp-prrsv infected pams, while ap-prrsv was proteins detected in the attenuated pathogenic prrsv infected pams. control was displayed as the mock. control and ap-prrsv-infected cells (con/ap) are in supporting information. to extend the molecular characterization of quantitative differences and only-one-exists groups, uniprot and go databases were used to characterize information about biological processes (bps), molecular functions (mf), and cellular components (cc). bps of h (vh/va > , including proteins) and a (vh/va < . , including proteins) groups are in figures a and b bp. in group h, go annotations were primarily distributed in response to stimulus ( . %), metabolic process ( . %), and immune system process ( . %). ratios were . % response to stimulus, . % metabolic process, and . % immune system process in group a. the most significant difference in bp between pams infected with hp-prrsv and ap-prrsv was seen for metabolic process, which may be the major reason for the large differences among animals challenged with different virulence of prrsv. molecular function categories of h group and a group were shown in figure a ,b mf. most different molecular functions were classified as binding ( . and . %) and catalytic activity ( . and . %). binding of h group included enzyme binding ( . %), nucleic acid binding ( . %) and protein complex binding ( . %), while group a mainly involved nucleic acid binding ( . %), nucleoside phosphate binding ( . %) and nucleotide binding ( . %) from the analysis of go distribution by level . enzyme code distribution suggested there were five transferases ( . %), one hydrolase ( . %), one lyase ( . %), and two ligases ( . %) detected in group h. while four oxidoreductases ( . %), five transferases ( . %), five hydrolases ( . %), three lyases ( . %), and three isomerases ( . %) in group a. cc categories were illustrated in figure a b cc. ninety-five detected differentially expressed proteins were annotated and categorized to ccs of macromolecular complex ( . %), membrane ( . %), membrane-enclosed lumen ( . %), and extracellular region ( . %). because of technological problems, we were unable to conclude that all the detected proteins were indeed mps. however, were confirmed based on the annotation of uniprot database and there also existed proteins partially anchored on the membrane or binding with the mps. to verify the differentially expressed proteins via lc-ms/ms for lfqp, western blots were conducted for two proteins partially located on membranes. expression of hsp and the kdel receptor from cell lysates of vhun -infected and vhun -f -infected pams, and dmem-inoculated pams were tested with antibodies to the proteins. lfqp showed that the ratios between vhun infected and vhun -f -infected pams reached . and . , respectively. western blots confirmed lfqp results ( figure ). to examine whether the differentially expressed proteins detected affects virus infection, the vcl transient overexpression vector was transfected into marc- cells, followed by the hp hun or its attenuated strain infection. the prrsv-specific rt-qpcr results showed that vcl protein could inhibit both viruses, especially for hp-hun strain. compared with the empty vector in this study, lfqp of mps of hp-prrsv-, ap-prrsv-infected pams and the control was performed. important information about target proteins related to virus infection was obtained. the hp-prrsv strain vhun and its derivative, the serially cellpassaged attenuated strain vhun -f , [ ] revealed different infection mechanisms in pams. a total of proteins were identified among the control, ap-prrsv-infected, and hp-prrsv-infected pams. of these, were detected in all three groups (figure ) . we focused on differentially expressed proteins of ap-prrsv-and hp-prrsv-infected pams; among these, were detected in only one group. prrsv has been a threat to the global pig economy for several years because of its persistent infection, immune escape, and high mortality from inflammation and high fever. [ ] the attenuated prrsv vhun -f vaccine strain attenuated from hp-prrsv vhun by serial passages, which is now used in china. [ ] we analyzed mps to identify factors associated with immunological effects and determine differences between hp-prrsv and ap-prrsv. mps classified based on go analysis are in table . a heatmap based on the uniprot database was constructed to comprehend the functions and bps of differently expressed proteins ( figure b) , which revealed clustering and abundance of the detected proteins that existed confidentially on the membrane based on uniprot database annotation. we first focused on proteins associated with immunology and inflammation with higher abundance in ap-prrsv-infected pams. ptpn (vh/va = . ), a member of the protein tyrosine phosphatase family, functions as signaling molecule that regulates cellular processes related to the jak-stat, il- , il- , and granulocyte-macrophage colony-stimulating factor (gm-csf) signaling pathways; dephosphorylation of nonreceptor kinases including jak , jak , stat , and stat ; and negative regulation of il- -, il- , il- , and ifn-mediated signaling. ptpn also functions in the response to inflammation via nf-κb. [ ] [ ] [ ] sipa (h-/a+) is a mitogen-induced gt-pase activating protein for ras-related regulatory proteins. it was related to g-protein signaling and blood-brain barrier and immune cell transmigration: vcam /cd (cluster of differentiation) signaling pathways. [ , ] c ar (h-/a+) is the receptor for the chemotactic and inflammatory peptide anaphylatoxin c a, which stimulates chemotaxis, granule enzyme release, intracellular calcium release, and superoxide anion production, participates in the innate and adaptive immune responses to www.advancedsciencenews.com www.proteomics-journal.com table . statistics analysis of proteins that existed definitely on the membrane. the lectin-induced complement pathway. [ ] [ ] [ ] [ ] [ ] [ ] polyribonucleotide -hydroxyl-kinase clp (clp ) (vh/va = . ) mainly acts as a kinase that binds atp hosting -hydroxyl-kinase activity, and functions in mrna cleavage and in sirna loading onto the rnainduced silencing complex involved in rna interference and its destruction. the kinase hclp phosphorylates and licenses synthetic sirnas to assemble into an rna-induced silencing complex for cleavage of target rna. [ ] expression of these molecules was higher in the ap-prrsv-infected group and had a similar level to the control group. therefore, we concluded that they might be related to the immune escape of hp-prrsv and that the attenuated vaccine strain would not have side effects on these immune factors expression in the host cell, and these proteins above would contribute to the resistance of prrsv. some proteins related to immunology and inflammation had higher expression in hp-prrsv-infected cells. raftlin (vh/va = . ) protein is pivotal for maintenance of lipid rafts and may be involved in regulation of b-cell antigen receptor-mediated signaling. raftlin promotes binding of double-stranded rna, activations of b cell receptors and toll-like receptor signaling pathways, and is involved in il- production to release proinflammatory cytokines. hence, the higher abundance of raftlin in the hp-prrsv group compared to the ap-prrsv and control groups may explain the more severe inflammation triggered by hp-prrsv. [ , ] iars (h+/a-), isoleucyl-trna synthetase is a target of autoantibodies in autoimmune diseases. [ , ] vcl (h+/a-), vinculin is a cytoskeletal protein associated with cellcell and cell-matrix junctions, and is related to the il- , il- , and gm-csf signaling pathways. [ ] rap a (h+/a-) is a member of the ras oncogene family, small gtp-binding protein, of which active form interacts with several effectors [ ] [ ] [ ] related to the erk and g protein signaling pathways. [ , ] it was reported that prrsv could induces prostaglandin e production through cyclooxygenase and is related to erk signaling. [ ] the differential expression of rap a may provide information for future research on the interaction between prrsv and the erk signaling pathway. the abundance of rap a was lower in ap-prrsv-infected pams than in hp-prrsv-infected pams and control cells, which had similar levels. thus, we hypothesized that rap a was associated with immunization with the attenuated vaccine. ifn-induced transmembrane protein , ifitm (vh/va = . ), responds to biotic stimulus and had the highest expression levels among these proteins in the hp-prrsv group, as compared to the proteins identified in the ap-prrsv group but it was not detected in the control group. in humans, this protein negatively regulates the entry of multiple viruses, including influenza a virus, sars coronavirus, marburg virus (marv), ebola virus, dengue virus, west nile virus (wnv), human immunodeficiency virus (hiv) type , and vesicular stomatitis virus (vsv), into host cells. [ , ] it could inhibit hemagglutinin protein -mediated entry of influenza virus, gp , two-mediated viral entry of marburg virus and ebola virus, s protein-mediated viral entry of sars coronavirus, and g protein-mediated viral entry of vsv, thereby playing a critical role in the structural stability and function of vacuolar atpase (v-atpase). establishing physical contact with the v-atpase of endosomes is critical for proper clathrin localization and is required for v-atpase to lower the ph in phagocytic endosomes, thus establishing an antiviral state. [ ] high expression of both hp-prrsv and ap-prrsv suggests that ifitm may help inhibit prrsv entry and promote resistance to hp-prrsv infection. peptidyl-prolyl cis-trans isomerase b (ppib, vh/va = . ) has peptidyl-prolyl cis-trans isomerase activity and is involved in protein folding and protein peptidyl-prolyl isomerization, which can accelerate protein folding. [ ] ppib is positively regulated by viral genome replication, is involved with the viral processes of hepatitis c virus (hcv), and interacts with and stimulates the rna-binding activity of hcv ns b. ppib is critical for efficient replication of the hcv genome. [ ] compared with the control group, ppib was downregulated in the hp-prrsv and ap-prrsv groups. the abundance of ppib was lower in the hp-prrsv group than the ap-prrsv group, suggesting an association with prrsv replication. string network analysis was used to elucidate interactions among differentially expressed proteins. the essential factors and receptors involved in the entry of prrsv are reportedly cd (q vl ), nmhc ii-a (myh , f skj ), sialoadhesin (sn, a lcj ), cd (f ryz ), and vimentin (vim) (p ). [ ] [ ] [ ] [ ] [ ] [ ] we assessed the involvement of these mps together with proteins detected by string analysis on viral entry. receptor proteins involved in prrsv entry are in figure to better understand the characteristics of pathway-like regions, the genomrnai database was analyzed to annotate detected proteins. [ ] regions containing proteins with higher abundance in hp-prrsv-infected pams are green and in ap-prrsv-infected pams are red ( figure a) . a search of the genomrnai database determined that linked proteins ik (ik cytokine), wd repeat domain , dyskerin pseudouridine synthase , and adenylosuccinate lyase (adsl) in the green region of figure a shared similar annotations and functioned to decrease expression of nf-κb or il- , [ ] which are related to inflammation. [ , ] hence, further analysis of these linked proteins may help explain differential mechanisms between hp-prrsv and ap-prrsv infection. other proteins in the green-colored region of a are reported to influence virus infection. nucleolar and coiled-body phosphoprotein increases sindbis virus infection [ ] and adsl increases human papilloma virus -gfp infection. [ ] the lower abundance of nucleolar and coiled-body phosphoprotein and adsl in the hp-prrsv-infected group may be related to the host immune response repression of prrsv and may be used by hp-prrsv during infection to self-upregulate. the red region of figure a contains some linked proteins with some figure . protein-protein interaction network determined by string software showing interactions among differentially expressed proteins between two virulent groups with cd , vim, and myh added. red, protein abundance higher in hp-prrsv than ap-prrsv; green, protein abundance in hp-prrsv lower than in ap-prrsv. line colors represent type of evidence for association: green, neighborhood; red, fusion; purple, experimental; light blue, database; black, expression; blue, co-occurrence; yellow, text mining. gene abbreviations are shown. vh, protein abundance in hp-prrsv; va protein abundance in ap-prrsv. vh/va = , detected only in ap-prrsv; vh/va = null, detected only in hp-prrsv. information. uridine monophosphate synthetase decreases hiv- infection, [ ] while programmed cell death decreases hcv replication, [ ] both downregulate nf-κb expression. [ ] the lower abundance of these proteins in the hp-prrsv group suggested that hp-prrsv escaped inhibition through an unknown mechanism. myosin- (myh ) appears to function in cytokinesis, cell shape, and specialized functions such as secretion and capping. [ ] myh is an important factor for prrsv infection and interacts with gp of prrsv, [ ] although the underlying mechanisms remain unclear because of limited research. as shown by our results, myh was related to vcl in ap-prrsv and actl in hp-prrsv. vcl is an actin filament (f-actin)binding protein involved in cell-matrix adhesion and cell-cell adhesion in humans, regulation of cell-surface e-cadherin expression, and potentiation of mechanosensing, and may be important in cell morphology and locomotion by promoting binding with actin, alpha-catenin, cadherin, dystroglycan, and ubiquitin protein ligase. [ ] in hiv research, transient overexpression of vcl reduced the susceptibility of human cells to infection with hiv- and negatively affected paxillin phosphorylation and limited retroviral infection. [ ] just like hiv- , in our study, it was demonstrated that over expression of vcl could inhibit the replication of both hp-prrsv and the attenuated prrsv in the mrna level. there were obvious differences in the inhibiting ability for hp-prrsv and its attenuated strain. actl a, which is an actin-like protein a, is involved in transcriptional www.advancedsciencenews.com www.proteomics-journal.com activation and repression of select genes by chromatin remodeling (alteration of dna-nucleosome topology) and mainly functions in chromatin binding and transcription coactivator activities. [ ] we found that myh may regulate ap-prrsv and hp-prrsv infection via different pathways. through super pathways annotation (www.genecards.org), vcl and actl a were identified as participants in the il- , il- , gm-csf, and tnf-α/nf-κb signaling pathways, where they may help with differential mechanisms of prrsv infection with different virulence. using the genomrnai database, some linked proteins were found to be related to viruses or inflammation ( figure b ). in the green region, gnas complex locus (gnas) increased human papilloma virus -gfp, [ ] and serine/arginine repetitive matrix decreased hcv infection, influenza a replication and viral numbers, and il- expression, [ , ] small nuclear ribonucleoprotein u subunit decreased influenza a replication and viral numbers. [ ] irf- decreased infection by hcv, west nile virus, and dengue virus. [ , ] in the red region, hsf decreased hcv replication. [ ] protein kinase camp-activated catalytic subunit alpha decreased vsv infection. [ ] signal transducer and activator of transcription decreased il- expression. rap a, mink , and gnas are regulatory proteins in the ras pathway. rap a and mink were detected only in the hp-prrsv group, whereas gnas was detected only in the ap-prrsv group, in accordance with a previous report that stimulation of ras increases the replication ability of hcv by reducing ifn-jak-stat pathway activity. [ ] we proposed that prrsv was also related to the ras pathway, and differed between hp-prrsv and ap-prrsv. creb-binding protein (crebbp) is composed of doublestranded rna-activated transcription factor with irf- , and double-stranded rna-activated transcription factor is activated in many virus-infected cells to promote apoptosis. [ , ] crebbp may interact with human herpes virus virf- , which could inhibit the binding of crebbp to irf- . [ ] our results showed that irf- abundance was tenfold lower in the hp-prrsv than the ap-prrsv group, which we proposed was a protective mechanism of prrsv to escape from host immunity and ensure survival after viral infection. the ability of hp-prrsv hun to induce cell apoptosis is stronger than classical prrsv (ch- a) in immune organs and lungs of piglets. [ ] we concluded that hp-prrsv might interact with crebbp and decrease irf- expression to escape from host immunity and cause severe damage to the cell, highlighting a difference from ap-prrsv. notable differences exist during the infection of ap-prrsv and hp-prrsv. hp-prrsv could inhibit host immune function and evade the immune response via unknown mechanism. [ , ] label-free ms was performed using ap-prrsv-infected and hp-prrsv-infected pams. this is the first attempt to explore the differential characteristics between hp-prrsv and its attenuated prrsv infected pams focusing on membrane proteins. by analyzing detected proteins in hp-infected, ap-prrsv-infected, and control group, proteins related to the immune response or virus replication were identified that may elucidate unique pathways used by different virulent prrsvs for cell entry, virus replication, and immune escape mechanisms. researches on these detected proteins will help with the elucidation of the identity, the expression abundance, and significance of them, future study will be focused on functions of these key membrane proteins to deepen our understanding of differential mechanisms between hp-prrsv and ap-prrsv infection. proc. natl. acad. sci proc. natl. acad. sci supporting information is available from the wiley online library or from the author. the authors declare that they have no conflicts of interest associated with this report. keywords attenuated, highly pathogenic, infection, label-free quantitative proteomics, membrane proteins key: cord- -ni iyzdn authors: he, zhisong; zhang, jian; shi, xiao-he; hu, le-le; kong, xiangyin; cai, yu-dong; chou, kuo-chen title: predicting drug-target interaction networks based on functional groups and biological features date: - - journal: plos one doi: . /journal.pone. sha: doc_id: cord_uid: ni iyzdn background: study of drug-target interaction networks is an important topic for drug development. it is both time-consuming and costly to determine compound-protein interactions or potential drug-target interactions by experiments alone. as a complement, the in silico prediction methods can provide us with very useful information in a timely manner. methods/principal findings: to realize this, drug compounds are encoded with functional groups and proteins encoded by biological features including biochemical and physicochemical properties. the optimal feature selection procedures are adopted by means of the mrmr (maximum relevance minimum redundancy) method. instead of classifying the proteins as a whole family, target proteins are divided into four groups: enzymes, ion channels, g-protein- coupled receptors and nuclear receptors. thus, four independent predictors are established using the nearest neighbor algorithm as their operation engine, with each to predict the interactions between drugs and one of the four protein groups. as a result, the overall success rates by the jackknife cross-validation tests achieved with the four predictors are . %, . %, . %, and . %, respectively. conclusion/significance: our results indicate that the network prediction system thus established is quite promising and encouraging. identification of drug-target interaction networks is an essential step in the drug discovery pipeline [ ] . the emergence of molecular medicine and the completion of the human genome project provide more opportunity to discover unknown target proteins of drugs. many efforts have been made to discover new drugs in the past few years. however, the number of new drug approvals remains quite low (around only per year). this is partially because many compounds or drug candidates have to be withdrawn owing to unacceptable toxicity. such failures have wasted a lot of money. it would be beneficial to develop computational methods for predicting the sensitivity and toxicity before a drug candidate was synthesized [ , , ] . however, a number of problems need to be overcome in order to find out the exact effects of a drug. firstly, drugs could have numerous effects including positive and negative effects, and it is hard to find out and elucidate the possible effects; secondly, different people would have completely different responses to a drug even though the same gene products are only slightly different [ , , , ] ; thirdly, it is very hard to trace the drug effects since the biological interaction pathways are extremely complicated in human beings. therefore, it would be very helpful for drug development if the interactions between drugs and target proteins could be predicted more accurately and the underlying mechanisms could be better understood. several computational approaches have been developed for analyzing and predicting drug-protein interactions. the most commonly used are docking simulations [ , , , ] , literature text mining [ ] , and combining chemical structure, genomic sequence, and d structure information [ ] , among others (see, e.g., [ , , ] ). machine learning and data mining methods have been widely used in the computational biology and bioinformatics area. many researchers have made lots of efforts to develop useful algorithms and softwares to investigate various drug-related biological problems, such as hiv protease cleavage site prediction [ , ] , identification of gpcr (g protein-coupled receptors) type [ , ] , protein signal peptide prediction [ ] , protein subcellular location prediction [ , , ] , analysis of specificity of galnac-transferase protein [ ] , identification of protease type [ , ] , membrane protein type prediction [ , , , ] , and a series of relevant webserver predictors as summarized in a recent review [ ] . here we propose a predictor for drug-target interactions based on the nearest neighbor algorithm [ ] . since biochemical and physicochemical features [ ] are important for characterizing proteins, in this study they are used to represent proteins as done by many previous investigators (see, e.g., [ , , ] . to improve the predictor's performance, minimum redundancy maximum relevance (mrmr) algorithm [ ] is used to rank the features. meanwhile, the incremental feature selection and forward feature selection are applied for feature selection. the protein targets for drugs are divided into enzymes, ion channels [ , , , ] , gpcrs [ , ] , and nuclear receptors [ ] in this study. finally, four predictors for predicting the interactions of drugs with each of the four protein families are developed in hopes that they can help provide useful information for drug design. in addition to the dataset used by yamanishi et al. [ ] , information about drug compounds and genes can be obtained from kegg [ , ] by the ftp operations: ftp://ftp.genome.jp/ pub/kegg/ligand/drug/drug for the drugs, and ftp://ftp.genome. jp/pub/kegg/genes/fasta/gene.pep for the genes. after excluding the drug-target pairs that lack experimental information, we finally obtained a total of , drug-target pairs, of which , for enzymes, , for ion channels, for gpcrs, and for nuclear receptors. all these datasets were used as the positive datasets in the current study. the corresponding negative datasets were derived from the above positive datasets via the following steps: ( ) separate the pairs in the above positive dataset into single drugs and proteins; ( ) re-couple these singles into pairs in a way that none of them occurs in the corresponding positive dataset; ( ) randomly picked the negative pairs thus formed until they reached the number two times as many as the positive pairs. the drug-target benchmark datasets thus obtained for enzymes, ion-channels, gpcrs, and nuclear receptors are given in online supporting information s , s , s , and s , respectively. representing drugs with chemical functional groups composition. the number of drugs is extremely large. however, most of them are small organic molecules and are composed of some fixed small structures, called functional groups. since functional groups usually represent the characteristics of a compound as well as its reaction mechanism with other molecules, features derived from its functional groups could be very effective in characterizing a drug. moreover, the number of common functional groups is quite small, and hence it is possible to use the functional group composition to uniquely represent a drug [ ] . a number of functional groups are available in nature, and we selected the following common groups for the current study: ( ) alcohol, ( ) aldehyde, ( ) amide, ( ) amine, ( ) hydroxamic acid, ( ) phosphorus, ( ) carboxylate, ( ) methyl, ( ) ester, ( ) ether, ( ) imine, ( ) ketone, ( ) nitro, ( ) halogen, ( ) thiol, ( ) sulfonic acid, ( ) sulfone, ( ) sulfonamide, ( ) sulfoxide, ( ) sulfide, ( ) a_ c_ring, ( ) ar_ c_ring, ( ) non_ar_ c_ring, ( ) non_ar_ c_ring, ( ) hetero ar_ _ring, ( ) hetero non_ar_ _ring, ( ) hetero non_ar_ _ring, and ( ) hetero ar_ _ring. thus, following the same treatment as in [ ] , a drug compound can now be formulated as a -d (dimensional) vector given below: where g i (i~ , , Á Á Á , ) is the occurrence frequency of the i-th functional group in the drug d, and t the matrix transpose operator. representing target proteins with pseudo amino acid composition by incorporating biochemical and physicochemical features. now the problem is how to effectively represent a target protein. two kinds of representations are generally used in this regard: the sequential representation and the non-sequential representation. the most typical sequential representation for a protein sample is its entire amino acid sequence, which can contain the most complete information of a protein. to deal with this model, the sequence-similarity-searchbased tools, such as blast [ ] , are usually used to find the desired results. unfortunately, this kind of approach failed to work when the query protein did not have significant homology to the proteins in the training dataset. thus, various non-sequential representations or discrete models were proposed. the simplest discrete model was based on the amino acid composition (aac) (see, e.g., [ ] ). however, if using the aac model to represent a protein, all its sequence-order information will be lost. to avoid completely losing the sequence-order information, the pseudo amino acid composition (pse-aac) was proposed [ ] to represent the sample of a protein. the pseaac can be used to represent a protein sequence with a discrete model yet without completely losing its sequence-order information. for further information about pseaac, see the web-page by clicking the link http://en. wikipedia.org/wiki/pseudo_amino_acid_composition. ever since the concept of pseaac was introduced, it has been widely used to study various problems in proteins and protein-related systems (see, e.g., [ , , , , , , , , , , , , , , , , ] ). meanwhile, many different forms of discrete models were also proposed (see, e.g., [ , , , , , , , , , , , , , , , , , , , ] ). however, regardless of how much different these models are, they just belong to different forms of pseaac, as elucidated in a recent comprehensive review [ ] . here, we are to propose a different pseaac to represent drug-targeted proteins in terms of their biochemical and physicochemical features [ ] . six different types of features were considered: ( ) hydrophobicity, ( ) polarizability, ( ) polarity, ( ) secondary structure, ( ) normalized van der waals volume, and ( ) solvent accessibility. each amino acid residue in a protein sequence can be represented by a set of different states according to its features. for instance, its hydrophobicity feature can be marked by one of the following three states: ''polar'', ''neutral'', or ''hydrophobic'' [ ] ; its solvent accessibility feature by one of the two: ''buried'' or ''exposed to solvent'', as predicted by predacc [ ] ; its secondary structure feature by one of the three: ''helix'', ''sheet'', or ''coil'', as predicted by the method in [ ] ; and so forth. thus, a protein sequence can be translated to a series of codes according to the biochemical and physicochemical properties of its constituent amino acid residues. for example, if using ''p'', ''n'' and ''h'' to represent the three states of hydrophobicity: ''polar'', ''neutral'', and ''hydrophobic'', the protein sequence ''dmaeimsdkp-qagml'' can be translated to ''phnphhnppnpnnhh'' according to the codes of the hydrophobic property feature. the encoded sequences thus obtained would have different length for proteins of different sizes, which will make the prediction engine difficult to handle. to make the feature-encoded sequence to be a vector with a fixed number of dimensions, three properties of a sequence was used: composition (c), transition (t), and distribution (d). c represents the global composition of each letter in the sequence; t, the frequency of a code letter changing from one to another; d, the distribution pattern of the code letters along the sequence, measuring the percentage of the sequence length within which the first, %, %, %, and % of the amino acids of each code letter is located. take the above hydrophobic property sequence as an example: its c feature is / = . % for all of p, h, and n, while the t feature is / = %, / = % and / = % for the changes between h and p, n and h, n and p, respectively. the measurement of feature d is a little more complicated. for the letter h, the first, %, %, % and % of hs in the sequence is located at the position of , , , , and . thus its d feature is ( , with a total of components. likewise, for the sequences encoded by the other four biochemical properties, each is also corresponding to components. but for the sequence encoded by the solvent accessibility with only two states (''buried'' or ''exposed to solvent''), the encoded sequence is corresponding to only components. finally, by adding the components of aac [ ] into the vector concerned, the total number of components thus obtained for a given protein is | z z ~ ; i.e., the protein can be formulated as a -d vector given by where p i (i~ , , Á Á Á , ) is the i-th component of the protein p. of the components, are derived according to the codes of the above six biochemical and physicochemical features, and are the aac components of p. with all samples represented by a feature vector, now it is possible for us to construct our predictor using the machine learning approach. the nn (nearest neighbor) algorithm is quite popular in pattern recognition community owing to its good performance and simple-to-use feature. according to the nn rule [ ] , the query sample should be assigned to the subset represented by its nearest neighbor. in this study, if the drug-target pair with the shortest distance is a positive sample, meaning that they can interact with each other, the sample for test is seen as a positive drug-target pair. otherwise, the test sample is seen as a negative one. there are many different definitions to measure the ''nearness'' for the nn algorithm, such as euclidean distance, hamming distance [ ] , and mahalanobis distance [ , , ] . in the current study, the following equation was adopted to measure the nearness between samples v x and v y where v x : v y is the dot product of the two vectors, and v x k k and v y their modulus, respectively. when v x :v y we have d(v x ,v y )~ , indicating the ''distance'' between these two sample vectors is zero and hence they have perfect or % similarity. after constructing the drug-target interaction predictor, we have to evaluate its performance. in statistical prediction, the following three cross-validation methods are often used to examine a predictor for its effectiveness in practical application: independent dataset test, subsampling (k-fold cross-validation) test, and jackknife test [ ] . however, as elucidated by [ ] and demonstrated by eq. in [ ] , among the three cross-validation methods, the jackknife test is deemed the most objective that can always yield a unique result for a given benchmark dataset, and hence has been increasingly used and widely recognized by investigators to examine the accuracy of various predictors (see, e.g. [ , , , , , , , , , , , , ] ).'' accordingly, in this study the jackknife cross-validation was adopted to calculate the success prediction rates as well. although we've constructed the drug-target predictor based on the original feature set described above, it is possible to improve its performance with a better feature set. apparently, not every feature in the feature set is equally relevant to the drug-target interaction. what's more, features may not be independent with each other. the ''bad'' will have negative impact on the accuracy and efficiency of the predictor, so it is possible to do the feature selection process to construct a more compact and effective feature set. the first step is using maximum relevance minimum redundancy (mrmr) [ ] to do feature evaluation. maximum relevance minimum redundancy (mrmr) [ ] was firstly developed for analysis of microarray data. it ranks each feature according to its relevance to the target and redundancy to other features. the better a feature is deemed to be, the higher the rank it will be assigned to. mutual information (mi), denoted by i to indicate the dependence of two features used to quantify the relevance and redundancy. mi is defined as following: based on mi, we can quantify relevance (d) and redundancy (r) as: where f candidate is the feature to be calculated, and c is the target variable. by combining the above two equations to maximize relevance and minimize redundancy, the following mrmr function is constructed: where v s and v t are the already-selected feature set and to-beselected feature set, respectively, and m and n are the sizes of these two feature sets, respectively. the earlier a feature is selected, the better it would be though of. finally, we can get an ordered feature list with a rank for every feature to indicate its importance in the feature set. in our study, the mrmr program is obtained from: http://research. janelia.org/peng/proj/mrmr/index.htm. to calculate mi, the joint probabilistic density and the marginal probabilistic densities of the two vectors were used. a parameter t is introduced here to deal with these variables. suppose mean to be the average value of one feature in all samples, and std to be the standard deviation, the feature of each sample would be classified into one of the three groups according to the boundaries: mean+(t : std). in our study, t was assigned to be . as mentioned above, the importance of each feature is rated according to its rank in the mrmr analysis. the next step is to determine which features should be selected as the optimal feature set for our drug-target predictor. here the ifs (incremental feature selection) procedure is used to solve the problem. each feature in the mrmr feature list was added one by one, and n different feature sets are obtained if the total feature number is n, while the i-th feature set is: based on each of the n feature sets, an nn algorithm predictor was constructed and tested with the jackknife cross-validation test. with all the n overall accurate rates calculated, we could draw an ifs curve with the index i to be the x-axis and the corresponding overall accurate rate to be the y-axis. thus, s opt~f f , f , :::, f n g is regarded as the optimal feature set if the curve reach its peak where the value of its x-axis is nƒn. because four independent predictors are needed for the four different classes of drug-target pairs, the ifs analysis procedure will be processed four times with each for a specific predictor. to refine feature selection, the ffs (forward feature selection) procedure based on the result of ifs was used. ffs is a feature selection method based on ifs results which tries every feature in the candidate feature set and adds the feature that achieves the highest prediction accuracy into the already-selected feature set in each goes. suppose the ifs curve reaches its peak with apex as its x-axis, the initial ffs-selected feature set was constructed as: more features in ffs-to-be-selected feature set would be added into the ffs-selected feature set one by one. the ffs-to-beselected feature set with m features covers the features with mrmr ranks between k+ and k+ +m, where m is a user-defined positive integer smaller than n{k with n to be the size of the original feature set. in each round of ffs, each feature in ffs-tobe-selected feature set would be taken out and added to the ffsselected feature set. each predictor based on each new ffsselected feature set would be tested, and the feature set obtained the highest overall accurate rate would be used as the new ffsselected feature set. this process would be run for m times, until the ffs-to-be-selected feature set becomes a null set. an ffs curve similar to the ifs curve could be drawn with x-axis as the index and y-axis as the overall accurate rate. in this study, ffs was run for each of the four benchmark datasets based on the corresponding ifs result. m for all these processes was set to , while k for each ffs was set to be the index of the point with the first maximum value (i.e. the maximum point with the smallest index) in the corresponding ifs curve. to improve performance of the predictor of drug-target interaction, feature selection process was carried out. the first step of feature selection is feature evaluation. in this study, mrmr was used to evaluate every feature in original feature set. listed in online supporting information s are two kinds of outputs: the first one is the maxrel list which shows ranks of features for their relevance to the target; the second is mrmr list showing the mrmr ranks according to the feature order satisfying eq. . in this study, only the mrmr list was used as the results of feature evaluation. since there are four groups of samples, mrmr was run four times with each for one of them. with the four mrmr lists, ifs was processed for each of the four sample groups, generating four ifs curves. based on these results, we set k in ffs to be , , and for the data of enzymes, ion channels, gpcrs and nuclear receptors, respectively. each of these figures is the index of the point of the first maximum value in the corresponding ifs curve. shown in fig. are the four ifs curves with their corresponding ffs curves. the peaks of the four ffs curves finally reach the overall success rates of . % with features, . % with features, . % with features, and . % with features for enzyme group, ion channel group, gpcr group and nuclear receptor groups, respectively. features selected by mrmr+ffs for the four different groups are quite different from each other, showing the intrinsic differences between them. although there are more features for target than those for drug in the original feature set, more drug features were selected, showing the important role of drugs. many of the selected target features are for protein secondary structure, especially for enzyme group (half of selected target features are for this). all types of features are selected in at least one group, showing that all biochemical and physicochemical features have their irreplaceable positions in drug-target interaction process. for the details of the optimal feature-set outputs by ffs for the four benchmark datasets, see the online supporting information s . for the specificity and promiscuity, we divided the drug-protein interactions into four groups according to the targets of drugs: enzymes, ion channels, gpcrs, and nuclear receptors. we used all the known drugs and target proteins in the gold standard data as training data to predict the potential interactions between all human proteins annotated as members of the four classes in kegg genes and all compounds in kegg ligands. enzyme recognition is the primary event involved in the interaction of proteins with other proteins and with small molecules such as metabolites and therapeutics. predicting drugenzyme interactions has direct application for completing genome annotations, finding enzymes for synthetic chemistry, and predicting drug specificity, promiscuity and pharmacology. it is suggested that the secondary structure information plays the major role in determining the drug-enzyme interactions activity. for example, cytochrome p (cyp) induction-mediated interaction is one of the major concerns in clinical practice and for the pharmaceutical industry [ ] . induction of cyp a enzymes with a specific structure-stable state may activate some xenobiotics to their reactive metabolites, leading to toxicity [ , ] . amino acid composition and hydrophobicity also contribute considerably to these interactions. an insertion/deletion (i/d) polymorphism of the angiotensin i-converting enzyme (ace) have an influence on the antihypertensive response, particularly when using ace inhibitors (acei) [ ] , mirroring that the amino acid composition did contribute to the interactions. hydrophobicity plays a role in determining the coefficients of drug-enzyme interaction energy with the application to drug screening as well as in silico target protein screening [ , ] . the g-protein coupled receptor (gpcr) superfamily, which is comprised of estimated - , members, is another largest known class of molecular targets with varieties of physiological activities and proven therapeutic value [ ] . they are integral membrane proteins sharing a common global topology that consists of seven transmembrane alpha helices, intracellular cterminal, an extracellular n-terminal, three intracellular loops and three extracellular loops [ , ] . it is suggested that secondary structure and polarity would play a major role in determining the drug-gpcrs interactions activity. small secondary structures such as helices and loops are identified as entities potentially involved in stabilizing interactions with ligands [ ] . these motifs were situated mainly in the apical region of transmembrane segments and included a few extracellular residues [ ] . crystal structures of engineered human beta -adrenergic receptors (ars) in complex with an inverse agonist ligand, carazolol, provide threedimensional snapshots of an important g protein-coupled receptor (gpcr) with a beta-sheet structure and forms part of the chromophore-binding site [ ] . glida provides interaction data between gpcrs and their ligands, along with chemical information on the ligands, as well as biological information regarding gpcrs [ ] . some of the features reflect physical interactions that are responsible for the structural stability of the transmembrane, the formation of extensive networks of interhelical h-bonds and sulfur-aromatic clusters that are spatially organized as ''polarity'', the close packing of side-chains throughout the transmembrane domain. when more experimental d structures become available for gpcrs in the future, this will help building reliable models for a wider range of gpcrs that would be suitable for docking studies. joint use of ligand-based chemogenomic and docking would certainly improve the prediction. ion channels are a large superfamily of membrane proteins that pass ions across membranes and are critical to diverse physiological functions in both excitable and nonexcitable cells and underlie many diseases. as a result, they are an important target class which is proven to be highly ''druggable''. according to our analysis, secondary structure and polarity play the major role in determining the drug-ion channels interactions activity. secondary structure controls the membrane potential and interrogates ion channels in different conformational states. the drug-ion channels interaction needs gated state where they can switch conformation between a closed and an open state [ , ] . simulations on model nanopores reveal that a narrow hydrophobic region can form a functionally closed gate in the channel and can be opened by either a small increase in pore radius or an increase in polarity [ , ] . nowadays, intense research is being conducted to develop new drugs acting selectively on ion channel subtypes and aimed at the understanding of the intimate drug-channel interaction [ ] . nuclear receptors (nr) are ligand-activated transcription factors that regulate the activation of a variety of important target genes, which are the most important drug targets in terms of potential therapeutic application. according to our results, secondary structure and polarizability play the major role in determining the drug-nrs interactions. the conservative motif of the nr is typically described as three stacked alpha-helical sheets. the helices that make up the ''front'' and ''back'' sheets are aligned parallel to one another. the helices in the middle sheet run across the two outer sheets and only occupy the space in the upper portion of the domain. the space in the lower part of the domain is relatively void of protein, and for most nrs, this creates an internal cavity for small-molecule ligands [ ] . hydrogen bonds with polarizability activity play a crucial role in protein-drug interactions (see, e.g., [ ] ). our approaches and the results thus obtained could be used to demonstrate how nuclear hormone receptors form a network of direct interactions. and this network increases in complexity to describe the interactions with target genes as well as small molecules known to bind a receptor, enzyme, or transporter. a comprehensive drug-target interaction network system has been established that contains four classifiers for predicting the drugable interaction of compounds with enzymes, ion-channels, gpcrs, and nuclear receptors, respectively. it is anticipated that the network predictor system may become a very useful tool for drug development. particularly it may help us find new or potential drug-target interactions. online supporting information s the benchmark dataset for the drug-target enzyme interaction system. it contains , genedrug pair samples, of which , are positive and , negative. the st column of the table indicates the nature of samples with for positive and for negative; the nd column shows the code of target gene; and the rd column shows the code of drug. all the detailed information for the genes and drugs listed here can be found in a guide to drug discovery: target selection in drug discovery predicting human safety: screening and computational approaches assessment of chemical libraries for their druggability review: progress in computational approach to drug development against sars d structure modeling of cytochrome p c and its implication for personalized drug design molecular modeling of two cyp c snps and its implications for personalized drug design review: pharmacogenomics and personalized use of drugs review: structure of cytochrome p s and personalized drug structure-based maximal affinity model predicts small-molecule druggability a fast flexible docking method using an incremental construction algorithm review: structural bioinformatics and its impact to biomedical science binding mechanism of coronavirus main proteinase with ligands and its implication to drug design against sars a probabilistic model for mining implicit 'chemical compound-gene' relations from literature prediction of drug-target interaction networks from the integration of chemical and genomic spaces statistical prediction of protein chemical interactions based on chemical structure and mass spectrometry data integrating statistical predictions and experimental verifications for enhancing protein-chemical interaction predictions in virtual screening alignment-free prediction of a drug-target complex network based on parameters of drug connectivity and protein sequence of receptors a vectorized sequence-coupling model for predicting hiv protease cleavage sites in proteins review: prediction of hiv protease cleavage sites in proteins gpcr-ca: a cellular automaton image approach for predicting g-protein-coupled receptor functional classes gpcr-gia: a web-server for identifying gprotein coupled receptors and their families with grey incidence analysis signal-cf: a subsite-coupled and window-fusing approach for predicting signal peptides using functional domain composition and support vector machines for prediction of protein subcellular location cell-ploc: a package of web-servers for predicting subcellular localization of proteins in various organisms euk-mploc: a fusion classifier for large-scale eukaryotic protein subcellular location prediction by incorporating multiple sites a sequence-coupled vector-projection model for predicting the specificity of galnac-transferase protident: a web server for identifying proteases and their types by fusing functional domain and sequential evolution information prediction of protease types in a hybridization space prediction of membrane protein types and subcellular locations low-frequency fourier spectrum for predicting membrane protein types memtype- l: a web server for predicting membrane proteins and their types by incorporating evolution information through pse-pssm support vector machines for predicting membrane protein types by using functional domain composition review: recent advances in developing web-servers for predicting protein attributes a k-nearest neighbor classification rule based on dempster-shafer theory predacc: prediction of solvent accessibility prediction of protein cellular attributes using pseudo amino acid composition using amphiphilic pseudo amino acid composition to predict enzyme subfamily classes digital coding of amino acids based on hydrophobic index feature selection based on mutual information: criteria of max-dependency, max-relevance, and min-redundancy insights from modelling three-dimensional structures of the human potassium and sodium channels the structure of phospholamban pentamer reveals a channel-like architecture in membranes mechanism of drug inhibition and drug resistance of influenza a m channel structure and mechanism of the m proton channel of influenza a virus prediction of g-protein-coupled receptor classes coupling interaction between thromboxane a receptor and alpha- subunit of guanine nucleotide-binding protein ligand: chemical database for enzyme reactions from genomics to chemical genomics: new developments in kegg predicting networking couples for metabolic pathways of evaluating the statistical significance of multiple distinct local alignments a novel approach to predicting protein structural classes in a ( - )-d amino acid composition space prediction of protein secondary structure content by using the concept of chou's pseudo amino acid composition and support vector machine use of fuzzy clustering technique and matrices to classify amino acids and its impact to chou's pseudo amino acid composition using the concept of chou's pseudo amino acid composition to predict apoptosis proteins subcellular location: an approach by approximate entropy predicting protein subcellular location using chou's pseudo amino acid composition and improved hybrid approach the modified mahalanobis discriminant for predicting outer membrane proteins by using chou's pseudo amino acid composition predicting subcellular localization of mycobacterial proteins by using chou's pseudo amino acid composition prediction of subcellular localization of apoptosis protein using chou's pseudo amino acid composition prediction of g-protein-coupled receptor classes based on the concept of chou's pseudo amino acid composition: an approach from discrete wavelet transform using the augmented chou's pseudo amino acid composition for predicting protein submitochondria locations based on auto covariance approach predicting the cofactors of oxidoreductases based on amino acid composition distribution and chou's amphiphilic pseudo amino acid composition predicting lipase types by improved chou's pseudo-amino acid composition using chou's amphiphilic pseudoamino acid composition and support vector machine for prediction of enzyme subfamily classes using chou's pseudo amino acid composition to predict subcellular localization of apoptosis proteins: an approach with immune genetic algorithm-based ensemble classifier prediction of cell wall lytic enzymes using chou's amphiphilic pseudo amino acid composition medicinal chemistry and bioinformatics -current trends in drugs discovery with networks topological indices proteomics, networks, and connectivity indices application of pseudo amino acid composition for predicting protein subcellular location: stochastic signal processing approach weighted-support vector machines for predicting membrane protein types based on pseudo amino acid composition slle for predicting membrane protein types using pseudo amino acid composition to predict protein structural classes: approached with complexity measure factor using pseudo amino acid composition to predict protein subcellular location: approached with lyapunov index, bessel function, and chebyshev filter using cellular automata to generate image representation for biological sequences using cellular automata images and pseudo amino acid composition to predict protein subcellular location using pseudo amino acid composition to predict transmembrane regions in protein: cellular automata and lempel-ziv complexity using pseudo amino acid composition to predict protein structural class: approached by incorporating dipeptide components predicting protein structural classes with pseudo amino acid composition: an approach using geometric moments of cellular automaton image using grey dynamic modeling and pseudo amino acid composition to predict protein structural classes predicting membrane protein type by functional domain composition and pseudo amino acid composition hum-ploc: a novel ensemble classifier for predicting human protein subcellular localization large-scale plant protein subcellular location prediction predicting membrane protein types by the llda algorithm predicting eukaryotic protein subcellular location by fusing optimized evidence-theoretic k-nearest neighbor classifiers pseudo amino acid composition and its applications in bioinformatics, proteomics and system biology recognition of a protein fold in the context of the structural classification of proteins (scop) classification the classification and origins of protein folding patterns seventy-five percent accuracy in protein secondary structure prediction predicting protein folding types by distance functions that make allowances for amino acid interactions a fuzzy k-nearest neighbours algorithm discriminant analysis; chapter multivariate analysis of variance on the generalized distance in statistics this reference also presents a brief biography of mahalanobis who was a man of great originality and who made considerable contributions to statistics review: prediction of protein structural classes review: recent progresses in protein subcellular location prediction an intriguing controversy over protein structural class prediction some insights into protein structural class prediction subcellular location prediction of apoptosis proteins cyp induction-mediated drug interactions: in vitro assessment and clinical implications cyp a : friend or foe? inhibition and induction of human cytochrome p enzymes: current status angiotensin i-converting enzyme gene polymorphism and drug response genome scale enzymemetabolite and drug-target interaction predictions using the signature molecular descriptor svm-prot: web-based support vector machine software for functional classification of a protein from its primary sequence molecular tinkering of g protein-coupled receptors: an evolutionary success critical role for the second extracellular loop in the binding of both orthosteric and allosteric g protein-coupled receptor ligands structural basis for ligand binding and specificity in adrenergic receptors: implications for gpcr-targeted drug discovery glida: gpcr-ligand database for chemical genomics drug discovery-database and tools update investigation into adamantane-based m inhibitors with fb-qsar an in-depth analysis of the biological functional studies based on the nmr m channel structure of influenza a virus ion channel pharmacology the nuclear receptor superfamily and drug discovery key: cord- -qdld hdc authors: fan, yue-nong; xiao, xuan; min, jian-liang; chou, kuo-chen title: inr-drug: predicting the interaction of drugs with nuclear receptors in cellular networking date: - - journal: int j mol sci doi: . /ijms sha: doc_id: cord_uid: qdld hdc nuclear receptors (nrs) are closely associated with various major diseases such as cancer, diabetes, inflammatory disease, and osteoporosis. therefore, nrs have become a frequent target for drug development. during the process of developing drugs against these diseases by targeting nrs, we are often facing a problem: given a nr and chemical compound, can we identify whether they are really in interaction with each other in a cell? to address this problem, a predictor called “inr-drug” was developed. in the predictor, the drug compound concerned was formulated by a -d (dimensional) vector derived from its molecular fingerprint, and the nr by a -d vector formed by incorporating its sequential evolution information and physicochemical features into the general form of pseudo amino acid composition, and the prediction engine was operated by the svm (support vector machine) algorithm. compared with the existing prediction methods in this area, inr-drug not only can yield a higher success rate, but is also featured by a user-friendly web-server established at http://www.jci-bioinfo.cn/inr-drug/, which is particularly useful for most experimental scientists to obtain their desired data in a timely manner. it is anticipated that the inr-drug server may become a useful high throughput tool for both basic research and drug development, and that the current approach may be easily extended to study the interactions of drug with other targets as well. with the ability to directly bind to dna ( figure ) and regulate the expression of adjacent genes, nuclear receptors (nrs) are a class of ligand-inducible transcription factors. they regulate various biological processes, such as homeostasis, differentiation, embryonic development, and organ physiology [ ] [ ] [ ] . the nr superfamily has been classified into seven families: nr (knirps or dax like) [ , ] ; nr (thyroid hormone like), nr (hnf -like), nr (estrogen like), nr (nerve growth factor ib-like), nr (fushi tarazu-f like), and nr (germ cell nuclear factor like). since they are involved in almost all aspects of human physiology and are implicated in many major diseases such as cancer, diabetes and osteoporosis, nuclear receptors have become major drug targets [ , ] , along with g protein-coupled receptors (gpcrs) [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] , ion channels [ ] [ ] [ ] , and kinase proteins [ ] [ ] [ ] [ ] . identification of drug-target interactions is one of the most important steps for the new medicine development [ , ] . the method usually adopted in this step is molecular docking simulation [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] . however, to make molecular docking study feasible, a reliable d (three dimensional) structure of the target protein is the prerequisite condition. although x-ray crystallography is a powerful tool in determining protein d structures, it is time-consuming and expensive. particularly, not all proteins can be successfully crystallized. for example, membrane proteins are very difficult to crystallize and most of them will not dissolve in normal solvents. therefore, so far very few membrane protein d structures have been determined. although nmr (nuclear magnetic resonance) is indeed a very powerful tool in determining the d structures of membrane proteins as indicated by a series of recent publications (see, e.g., [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] and a review article [ ] ), it is also time-consuming and costly. to acquire the d structural information in a timely manner, one has to resort to various structural bioinformatics tools (see, e.g., [ ] ), particularly the homologous modeling approach as utilized for a series of protein receptors urgently needed during the process of drug development [ , [ ] [ ] [ ] [ ] [ ] [ ] . unfortunately, the number of dependable templates for developing high quality d structures by means of homology modeling is very limited [ ] . to overcome the aforementioned problems, it would be of help to develop a computational method for predicting the interactions of drugs with nuclear receptors in cellular networking based on the sequences information of the latter. the results thus obtained can be used to pre-exclude the compounds identified not in interaction with the nuclear receptors, so as to timely stop wasting time and money on those unpromising compounds [ ] . actually, based on the functional groups and biological features, a powerful method was developed recently [ ] for this purpose. however, further development in this regard is definitely needed due to the following reasons. (a) he et al. [ ] did not provide a publicly accessible web-server for their method, and hence its practical application value is quite limited, particularly for the broad experimental scientists; (b) the prediction quality can be further enhanced by incorporating some key features into the formulation of nr-drug (nuclear receptor and drug) samples via the general form of pseudo amino acid composition [ ] . the present study was initiated with an attempt to develop a new method for predicting the interaction of drugs with nuclear receptors by addressing the two points. as demonstrated by a series of recent publications [ , , [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] and summarized in a comprehensive review [ ] , to establish a really effective statistical predictor for a biomedical system, we need to consider the following steps: (a) select or construct a valid benchmark dataset to train and test the predictor; (b) represent the statistical samples with an effective formulation that can truly reflect their intrinsic correlation with the object to be predicted; (c) introduce or develop a powerful algorithm or engine to operate the prediction; (d) properly perform cross-validation tests to objectively evaluate the anticipated accuracy of the predictor; (e) establish a user-friendly web-server for the predictor that is accessible to the public. below, let us elaborate how to deal with these steps. the data used in the current study were collected from kegg (kyoto encyclopedia of genes and genomes) [ ] at http://www.kegg.jp/kegg/. kegg is a database resource for understanding high-level functions and utilities of the biological system, such as the cell, the organism and the ecosystem, from molecular-level information, especially large-scale molecular datasets generated by genome sequencing and other high-throughput experimental technologies. here, the benchmark dataset can be formulated as where is the positive subset that consists of the interactive drug-nr pairs only, while the negative subset that contains of the non-interactive drug-nr pairs only, and the symbol represents the union in the set theory. the so-called "interactive" pair here means the pair whose two counterparts are interacting with each other in the drug-target networks as defined in the kegg database [ ] ; while the "non-interactive" pair means that its two counterparts are not interacting with each other in the drug-target networks. the positive dataset contains drug-nr pairs, which were taken from he et al. [ ] . the negative dataset contains non-interactive drug-nr pairs, which were derived according to the following procedures: (a) separating each of the pairs in into single drug and nr; (b) re-coupling each of the single drugs with each of the single nrs into pairs in a way that none of them occurred in ; (c) randomly picking the pairs thus formed until reaching the number two times as many as the pairs in . the interactive drug-nr pairs and non-interactive drug-nr pairs are given in supplementary information s , from which we can see that the + = pairs in the current benchmark dataset are actually formed by different nrs and different compounds. since each of the samples in the current network system contains a drug (compound) and a nr (protein), the following procedures were taken to represent the drug-nr pair sample. first, for the drug part in the current benchmark dataset, we can use a -d vector to formulate it as given by where d represents the vector for a drug compound, and d i its i-th (i = , , , ) component that can be derived by following the " d molecular fingerprint procedure" as elaborated in [ ] . the molecular fingerprint vectors thus obtained for the drugs in are, respectively, given in supplementary information s . the protein sequences of the different nrs in are listed in supplementary information s . suppose the sequence of a nuclear receptor protein p with l residues is generally expressed by where r represents the st residue of the protein sequence p , r the nd residue, and so forth. now the problem is how to effectively represent the sequence of equation ( ) with a non-sequential or discrete model [ ] . this is because all the existing operation engines, such as covariance discriminant (cd) [ , , [ ] [ ] [ ] [ ] [ ] [ ] [ ] , neural network [ ] [ ] [ ] , support vector machine (svm) [ ] [ ] [ ] ] , random forest [ , ] , conditional random field [ ] , nearest neighbor (nn) [ , ] ; k-nearest neighbor (knn) [ ] [ ] [ ] , oet-knn [ ] [ ] [ ] [ ] , and fuzzy k-nearest neighbor [ , , , , ] , can only handle vector but not sequence samples. however, a vector defined in a discrete model may completely lose all the sequence-order information and hence limit the quality of prediction. facing such a dilemma, can we find an approach to partially incorporate the sequence-order effects? actually, one of the most challenging problems in computational biology is how to formulate a biological sequence with a discrete model or a vector, yet still keep considerable sequence order information. to avoid completely losing the sequence-order information for proteins, the pseudo amino acid composition [ , ] or chou's pseaac [ ] was proposed. ever since the concept of pseaac was proposed in [ ] , it has penetrated into almost all the areas of computational proteomics, such as predicting anticancer peptides [ ] , predicting protein subcellular location [ ] [ ] [ ] [ ] [ ] [ ] [ ] , predicting membrane protein types [ , ] , predicting protein submitochondria locations [ ] [ ] [ ] [ ] , predicting gaba(a) receptor proteins [ ] , predicting enzyme subfamily classes [ ] , predicting antibacterial peptides [ ] , predicting supersecondary structure [ ] , predicting bacterial virulent proteins [ ] , predicting protein structural class [ ] , predicting the cofactors of oxidoreductases [ ] , predicting metalloproteinase family [ ] , identifying cysteine s-nitrosylation sites in proteins [ ] , identifying bacterial secreted proteins [ ] , identifying antibacterial peptides [ ] , identifying allergenic proteins [ ] , identifying protein quaternary structural attributes [ , ] , identifying risk type of human papillomaviruses [ ] , identifying cyclin proteins [ ] , identifying gpcrs and their types [ , ] , discriminating outer membrane proteins [ ] , classifying amino acids [ ] , detecting remote homologous proteins [ ] , among many others (see a long list of papers cited in the references section of [ ] ). moreover, the concept of pseaac was further extended to represent the feature vectors of nucleotides [ ] , as well as other biological samples (see, e.g., [ ] [ ] [ ] ). because it has been widely and increasingly used, recently two powerful soft-wares, called "pseaac-builder" [ ] and "propy" [ ] , were established for generating various special chou's pseudo-amino acid compositions, in addition to the web-server "pseaac" [ ] built in . according to a comprehensive review [ ] , the general form of pseaac for a protein sequence p is formulated by where the subscript  is an integer, and its value as well as the components ( , , , ) u u   will depend on how to extract the desired information from the amino acid sequence of p (cf. equation ( )). below, let us describe how to extract useful information to define the components of pseaac for the nr samples concerned. first, many earlier studies (see, e.g., [ ] [ ] [ ] [ ] [ ] [ ] ) have indicated that the amino acid composition (aac) of a protein plays an important role in determining its attributes. the aac contains components with each representing the occurrence frequency of one of the native amino acids in the protein concerned. thus, such aac components were used here to define the first elements in equation ( ); i.e., ( ) ( , , , ) ii fi   ( ) where f i ( ) is the normalized occurrence frequency of the i-th type native amino acid in the nuclear receptor concerned. since aac did not contain any sequence order information, the following steps were taken to make up this shortcoming. to avoid completely losing the local or short-range sequence order information, we considered the approach of dipeptide composition. it contained × = components [ ] . such components were used to define the next elements in equation ( ); i.e., ( ) ( , , , ) jj fj where ( ) j f is the normalized occurrence frequency of the j-th dipeptides in the nuclear receptor concerned. to incorporate the global or long-range sequence order information, let us consider the following approach. according to molecular evolution, all biological sequences have developed starting out from a very limited number of ancestral samples. driven by various evolutionary forces such as mutation, recombination, gene conversion, genetic drift, and selection, they have undergone many changes including changes of single residues, insertions and deletions of several residues [ ] , gene doubling, and gene fusion. with the accumulation of these changes over a long period of time, many original similarities between initial and resultant amino acid sequences are gradually faded out, but the corresponding proteins may still share many common attributes [ ] , such as having basically the same biological function and residing at a same subcellular location [ , ] . to extract the sequential evolution information and use it to define the components of equation ( ), the pssm (position specific scoring matrix) was used as described below. according to schaffer [ ] , the sequence evolution information of a nuclear receptor protein p with l amino acid residues can be expressed by a l matrix, as given by where ( ) were generated by using psi-blast [ ] to search the uniprotkb/swiss-prot database (the universal protein resource (uniprot); http://www.uniprot.org/) through three iterations with . as the e-value cutoff for multiple sequence alignment against the sequence of the nuclear receptor concerned. in order to make every element in equation ( ) be scaled from their original score ranges into the region of [ , ], we performed a conversion through the standard sigmoid function to make it become now we extract the useful information from equation ( ) moreover, we used the grey system model approach as elaborated in [ ] to further define the next components of equation ( ) ( , , , ) in the above equation, w , w , and w are weight factors, which were all set to in the current study; f j ( ) has the same meaning as in equation ( ) where   and combining equations ( ), ( ), ( ) and ( ), we found that the total number of the components obtained via the current approach for the pseaac of equation ( ) and each of the components is given by ( ) ( since the elements in equations ( ) and ( ) are well defined, we can now formulate the drug-nr pair by combining the two equations as given by   ( ) where g represents the drug-nr pair, Å the orthogonal sum, and the + = components are defined by equations ( ) and ( ) . for the sake of convenience, let us use x i (i = , , , ) to represent the components in equation ( ); i.e., ( ) to optimize the prediction quality with a time-saving approach, similar to the treatment [ ] [ ] [ ] , let us convert equation ( ) to where the symbol means taking the average of the quantity therein, and sd means the corresponding standard derivation. in this study, the svm (support vector machine) was used as the operation engine. svm has been widely used in the realm of bioinformatics (see, e.g., [ ] [ ] [ ] [ ] [ ] [ ] [ ] ). the basic idea of svm is to transform the data into a high dimensional feature space, and then determine the optimal separating hyperplane using a kernel function. for a brief formulation of svm and how it works, see the papers [ , ] ; for more details about svm, see a monograph [ ] . in this study, the libsvm package [ ] was used as an implementation of svm, which can be downloaded from http://www.csie.ntu.edu.tw/~cjlin/libsvm/, the popular radial basis function (rbf) was taken as the kernel function. for the current svm classifier, there were two uncertain parameters: penalty parameter c and kernel parameter  . the method of how to determine the two parameters will be given later. the predictor obtained via the aforementioned procedure is called inr-drug, where "i" means identify, and "nr-drug" means the interaction between nuclear receptor and drug compound. to provide an intuitive overall picture, a flowchart is provided in figure to show the process of how the predictor works in identifying the interactions between nuclear receptors and drug compounds. to provide a more intuitive and easier-to-understand method to measure the prediction quality, the following set of metrics based on the formulation used by chou [ ] [ ] [ ] in predicting signal peptides was adopted. according to chou's formulation, the sensitivity, specificity, overall accuracy, and matthew's correlation coefficient can be respectively expressed as [ , [ ] [ ] [ ] sn where n  is the total number of the interactive nr-drug pairs investigated while n   the number of the interactive nr-drug pairs incorrectly predicted as the non-interactive nr-drug pairs; n  the total number of the non-interactive nr-drug pairs investigated while n   the number of the non-interactive nr-drug pairs incorrectly predicted as the interactive nr-drug pairs. according to equation ( ) we can easily see the following. when n    meaning none of the interactive nr-drug pairs was mispredicted to be a non-interactive nr-drug pair, we have the sensitivity sn = ; while nn    meaning that all the interactive nr-drug pairs were mispredicted to be the non-interactive nr-drug pairs, we have the sensitivity sn = . likewise, when n    meaning none of the non-interactive nr-drug pairs was mispredicted, we have the specificity sp we have mcc = meaning total disagreement between prediction and observation. as we can see from the above discussion, it is much more intuitive and easier to understand when using equation ( ) to examine a predictor for its four metrics, particularly for its mathew's correlation coefficient. it is instructive to point out that the metrics as defined in equation ( ) are valid for single label systems; for multi-label systems, a set of more complicated metrics should be used as given in [ ] . how to properly test a predictor for its anticipated success rates is very important for its development as well as its potential application value. generally speaking, the following three cross-validation methods are often used to examine the quality of a predictor and its effectiveness in practical application: independent dataset test, subsampling or k-fold (such as five-fold, seven-fold, or -fold) crossover test and jackknife test [ ] . however, as elaborated by a penetrating analysis in [ ] , considerable arbitrariness exists in the independent dataset test. also, as demonstrated in [ ] , the subsampling (or k-fold crossover validation) test cannot avoid arbitrariness either. only the jackknife test is the least arbitrary that can always yield a unique result for a given benchmark dataset [ , , , [ ] [ ] [ ] . therefore, the jackknife test has been widely recognized and increasingly utilized by investigators to examine the quality of various predictors (see, e.g., [ , , , , , , , , ] ). accordingly, in this study the jackknife test was also adopted to evaluate the accuracy of the current predictor. as mentioned above, the svm operation engine contains two uncertain parameters c and  . to find their optimal values, a -d grid search was conducted by the jackknife test on the benchmark dataset . the results thus obtained are shown in figure , from which it can be seen that the inr-drug predictor reaches its optimal status when c = and    . the corresponding rates for the four metrics (cf. equation ( )) are given in table , where for facilitating comparison, the overall accuracy acc reported by he et al. [ ] on the same benchmark dataset is also given although no results were reported by them for sn, sp and mcc. it can be observed from the table that the overall accuracy obtained by inr-drug is remarkably higher that of he et al. [ ] , and that the rates achieved by inr-drug for the other three metrics are also quite higher. these facts indicate that the current predictor not only can yield higher overall prediction accuracy but also is quite stable with low false prediction rates. as mentioned above (section . ), the jackknife test is the most objective method for examining the quality of a predictor. however, as a demonstration to show how to practically use the current predictor, we took nr-drug pairs from the study by yamanishi et al. [ ] that had been confirmed by experiments as interactive pairs. for such an independent dataset, were correctly identified by inr-drug as interactive pairs, i.e., sn = / = . %, which is quite consistent with the rate of . % achieved by the predictor on the benchmark dataset via the jackknife test as reported in table . it is anticipated that the inr-drug predictor developed in this paper may become a useful high throughput tool for both basic research and drug development, and that the current approach may be easily extended to study the interactions of drug with other targets as well. since user-friendly and publicly accessible web-servers represent the future direction for developing practically more useful predictors [ , ] , a publicly accessible web-server for inr-drug was established. for the convenience of the vast majority of biologists and pharmaceutical scientists, here let us provide a step-by-step guide to show how the users can easily get the desired result by using inr-drug web-server without the need to follow the complicated mathematical equations presented in this paper for the process of developing the predictor and its integrity. step . open the web server at the site http://www.jci-bioinfo.cn/inr-drug/ and you will see the top page of the predictor on your computer screen, as shown in figure . click on the read me button to see a brief introduction about inr-drug predictor and the caveat when using it. step . either type or copy/paste the query nr-drug pairs into the input box at the center of figure . each query pair consists of two parts: one is for the nuclear receptor sequence, and the other for the drug. the nr sequence should be in fasta format, while the drug in the kegg code beginning with the symbol #. examples for the query pairs input and the corresponding output can be seen by clicking on the example button right above the input box. step . click on the submit button to see the predicted result. for example, if you use the three query pairs in the example window as the input, after clicking the submit button, you will see on your screen that the "hsa: " nr and the "d " drug are an interactive pair, and that the "hsa: " nr and the "d " drug are also an interactive pair, but that the "hsa: " nr and the "d " drug are not an interactive pair. all these results are fully consistent with the experimental observations. it takes about minutes before each of these results is shown on the screen; of course, the more query pairs there is, the more time that is usually needed. step . click on the citation button to find the relevant paper that documents the detailed development and algorithm of inr-durg. step . click on the data button to download the benchmark dataset used to train and test the inr-durg predictor. step . the program code is also available by clicking the button download on the lower panel of figure . nuclear receptors in cell life and death nuclear receptors nuclear receptors: current concepts and future challenges the nuclear receptor superfamily non-steroid nuclear receptors: what are genetic studies telling us their role in renal life? nuclear receptor drug discovery nuclear receptors and drug disposition gene regulation a web-server for identifying g-protein coupled receptors and their families with grey incidence analysis bioinformatical analysis of g-protein-coupled receptors igpcr-drug: a web server for predicting interaction between gpcrs and drugs in cellular networking a cellular automaton image approach for predicting g-protein-coupled receptor functional classes gpcr- l: predicting g protein-coupled receptors and their types by hybridizing two different modes of pseudo amino acid compositions prediction of g-protein-coupled receptor classes in low homology using chou's pseudo amino acid composition with approximate entropy and hydrophobicity patterns prediction of g-protein-coupled receptor classes based on the concept of chou's pseudo amino acid composition: an approach from discrete wavelet transform using ensemble svm to identify human gpcrs n-linked glycosylation sites based on the general form of chou's pseaac identifying gpcrs and their types with chou's pseudo amino acid composition: an approach from multi-scale energy representation and position specific scoring matrix prediction of g-protein-coupled receptor classes identify the channel-drug interaction in cellular networking with pseaac and molecular fingerprints insights from modelling three-dimensional structures of the human potassium and sodium channels influenza m proton channels a model of the complex between cyclin-dependent kinase (cdk ) and the activation domain of neuronal cdk activator rapid and accurate structure determination of coiled-coil domains using nmr dipolar couplings: application to cgmp-dependent protein kinase ialpha the three-dimensional structure of the cgmp-dependent protein kinase i-α leucine zipper domain and its interaction with the myosin binding subunit determination of the packing mode of the coiled-coil domain of cgmp-dependent protein kinase ialpha in solution using charge-predicted dipolar couplings a guide to drug discovery: target selection in drug discovery target discovery a fast flexible docking method using an incremental construction algorithm binding mechanism of coronavirus main proteinase with ligands and its implication to drug design against sars nmr studies on how the binding complex of polyisoprenol recognition sequence peptides and polyisoprenols can modulate membrane structure review: progress in computational approach to drug development against sars molecular modelling and chemical modification for finding peptide inhibitor against sars cov mpro an in-depth analysis of the biological functional studies based on the nmr m channel structure of influenza a virus energetic analysis of the two controversial drug binding sites of the m proton channel in influenza a virus investigation into adamantane-based m inhibitors with fb-qsar designing inhibitors of m proton channel against h n swine influenza virus insights from investigating the interaction of oseltamivir (tamiflu) with neuraminidase of the h n swine flu virus review: structural bioinformatics and its impact to biomedical science identification of proteins interacting with human sp during the process of viral infections docking and molecular dynamics study on the inhibitory activity of novel inhibitors on epidermal growth factor receptor (egfr) novel inhibitor design for hemagglutinin against h n influenza virus by core hopping method design novel dual agonists for treating type- diabetes by targeting peroxisome proliferator-activated receptors with core hopping approach insights from modeling the d structure of new delhi metallo-betalactamase and its binding interactions with antibiotic drugs insights into the mutation-induced hhh syndrome from modeling human mitochondrial ornithine transporter- mitochondrial uncoupling protein structure determined by nmr molecular fragment searching structure and mechanism of the m proton channel of influenza a virus unusual architecture of the p channel from hepatitis c virus the structure of phospholamban pentamer reveals a channel-like architecture in membranes the structural basis for intramembrane assembly of an activating immunoreceptor complex solution nmr structure of the v a drug resistant mutant of influenza a m channel mechanism of drug inhibition and drug resistance of influenza a m channel solution structure and functional analysis of the influenza b proton channel prediction of the tertiary structure and substrate binding site of caspase- prediction of the tertiary structure of a caspase- /inhibitor complex prediction of the tertiary structure of the beta-secretase zymogen coupling interaction between thromboxane a receptor and alpha- subunit of guanine nucleotide-binding protein insights from modeling the d structure of dna-cbf b complex modeling the tertiary structure of human cathepsin-e assessment of chemical libraries for their druggability predicting drug-target interaction networks based on functional groups and biological features some remarks on protein attribute prediction and pseudo amino acid composition ( th anniversary year review) identify recombination spots with trinucleotide composition and pseudo amino acid components identify recombination spots with pseudo dinucleotide composition identifying the heat shock protein families using pseudo reduced amino acid alphabet composition combining evolutionary information extracted from frequency profiles with sequence-based kernels for protein remote homology detection inuc-physchem: a sequence-based predictor for identifying nucleosomes via physicochemical properties predict cysteine s-nitrosylation sites in proteins by incorporating position specific amino acid propensity into pseudo amino acid composition incorporating amino acid pairwise coupling into pseaac for predicting cysteine s-nitrosylation sites in proteins iezy-drug: a web server for identifying the interaction between enzymes and drugs in cellular networking iamp- l: a two-level multi-label classifier for identifying antimicrobial peptides and their functional types a sequence-based predictor for predicting nucleosome positioning in genomes with pseudo k-tuple nucleotide composition the kegg databases and tools facilitating omics analysis: latest developments involving human diseases and pharmaceuticals review: recent progresses in protein subcellular location prediction an intriguing controversy over protein structural class prediction some insights into protein structural class prediction prediction of enzyme family classes slle for predicting membrane protein types predicting protein structural classes with pseudo amino acid composition: an approach using geometric moments of cellular automaton image a novel approach to predicting protein structural classes in a ( - )-d amino acid composition space subcellular location prediction of apoptosis proteins boosting classifier for predicting protein domain structural class artificial neural network for predicting alpha-turn types neural network prediction of the hiv- protease cleavage sites a sequence-based predictor for identifying nuclear receptors and their subfamilies via physical-chemical property matrix idna-prot: identification of dna binding proteins using random forest with grey model afp-pred: a random forest approach for predicting antifreeze proteins from sequence-derived properties predicting subcellular localization of proteins in a hybridization space prediction of protease types in a hybridization space predicting eukaryotic protein subcellular location by fusing optimized evidence-theoretic k-nearest neighbor classifiers hum-ploc: a novel ensemble classifier for predicting human protein subcellular localization large-scale predictions of gram-negative bacterial protein subcellular locations euk-mploc: a fusion classifier for large-scale eukaryotic protein subcellular location prediction by incorporating multiple sites signal-cf: a subsite-coupled and window-fusing approach for predicting signal peptides using optimized evidence-theoretic k-nearest neighbor classifier and pseudo amino acid composition to predict membrane protein types a top-down approach to enhance the power of predicting human protein subcellular localization: hum-mploc . fuzzy knn for predicting membrane protein types from pseudo amino acid composition prediction of protein cellular attributes using pseudo amino acid composition using amphiphilic pseudo amino acid composition to predict enzyme subfamily classes theoretical and experimental biology in one predicting anticancer peptides with chou's pseudo amino acid composition and investigating their mutagenicity via ames test predicting plant protein subcellular multi-localization by chou's pseaac formulation based multi-label homolog knowledge transfer learning euloc: a web-server for accurately predict protein subcellular localization in eukaryotes by incorporating various features of sequence segments into the general form of chou's pseaac predict mycobacterial proteins subcellular locations by incorporating pseudo-average chemical shift into the general form of chou's pseudo amino acid composition using radial basis function on the general form of chou's pseudo amino acid composition and pssm to predict subcellular locations of proteins with both single and multiple sites prediction of subcellular localization of apoptosis protein using chou's pseudo amino acid composition goasvm: a subcellular location predictor by incorporating term-frequency gene ontology into the general form of chou's pseudo-amino acid composition predicting protein subchloroplast locations with both single and multiple sites via three different modes of chou's pseudo amino acid compositions predicting membrane protein types by incorporating protein topology, domains, signal peptides, and physicochemical properties into the general form of chou's pseudo amino acid composition a multilabel model based on chou's pseudo-amino acid composition for identifying membrane proteins with both single and multiple functional types genetic programming for creating chou's pseudo amino acid based features for submitochondria localization predicting protein submitochondria locations by combining different descriptors into the general form of chou's pseudo amino acid composition multi-kernel transfer learning based on chou's pseaac formulation for protein submitochondria localization using the augmented chou's pseudo amino acid composition for predicting protein submitochondria locations based on auto covariance approach prediction of gaba(a) receptor proteins using the concept of chou's pseudo-amino acid composition and support vector machine using chou's amphiphilic pseudo-amino acid composition and support vector machine for prediction of enzyme subfamily classes predicting antibacterial peptides by the concept of chou;s pseudo-amino acid composition and machine learning methods supersecondary structure prediction using chou's pseudo amino acid composition identifying bacterial virulent proteins by fusing a set of classifiers based on variants of chou's pseudo amino acid composition and on evolutionary information a novel feature representation method based on chou's pseudo amino acid composition for protein structural class prediction predicting the cofactors of oxidoreductases based on amino acid composition distribution and chou's amphiphilic pseudo amino acid composition prediction of metalloproteinase family based on the concept of chou's pseudo amino acid composition using a machine learning approach secretp: identifying bacterial secreted proteins by fusing new features into chou's pseudo-amino acid composition prediction of allergenic proteins by means of the concept of chou's pseudo amino acid composition and a machine learning approach using chou's pseudo amino acid composition to predict protein quaternary structure: a sequence-segmented pseaac approach identifying protein quaternary structural attributes by incorporating physicochemical properties into the general form of chou's pseaac via discrete wavelet transform using the concept of chou's pseudo amino acid composition for risk type prediction of human papillomaviruses prediction of cyclin proteins using chou's pseudo amino acid composition discriminating outer membrane proteins with fuzzy k-nearest neighbor algorithms based on the general form of chou's pseaac use of fuzzy clustering technique and matrices to classify amino acids and its impact to chou's pseudo amino acid composition protein remote homology detection by combining chou's pseudo amino acid composition and profile-based protein representation identification of colorectal cancer related genes with mrmr and shortest path in protein-protein interaction network hepatitis c virus network based classification of hepatocellular cirrhosis and carcinoma signal propagation in protein interaction network during colorectal cancer progression pseaac-builder: a cross-platform stand-alone program for generating various special chou's pseudo-amino acid compositions propy: a tool to generate various modes of chou's pseaac pseaac: a flexible web-server for generating various kinds of protein pseudo amino acid composition the folding type of a protein is relevant to the amino acid composition an optimization approach to predicting protein structural class from amino acid composition monte carlo simulation studies on the prediction of protein folding types from amino acid composition predicting protein folding types by distance functions that make allowances for amino acid interactions monte carlo simulation studies on the prediction of protein folding types from amino acid composition. ii. correlative effect does the folding type of a protein depend on its amino acid composition? protein secondary structural content prediction the convergence-divergence duality in lectin domains of the selectin family and its implications a multi-label classifier for predicting the subcellular localization of singleplex and multiplex eukaryotic proteins using accumulation-label scale to predict subcellular locations of human proteins with both single and multiple sites improving the accuracy of psi-blast protein database searches with composition-based statistics and other refinements gapped blast and psi-blast: a new generation of protein database search programs a comparison of normalization methods for high density oligonucleotide array data based on variance and bias feature extraction and normalization algorithms for high-density oligonucleotide gene expression array data prediction of protein subcellular localization by support vector machines using multi-scale energy and pseudo amino acid composition low-frequency fourier spectrum for predicting membrane protein types using stacked generalization to predict membrane protein types based on pseudo amino acid composition prediction of linear b-cell epitopes using amino acid pair antigenicity scale predicting secretory proteins of malaria parasite by incorporating sequence evolution information into pseudo amino acid composition via grey system model using functional domain composition and support vector machines for prediction of protein subcellular location support vector machines for predicting membrane protein types by using functional domain composition an introduction of support vector machines and other kernel-based learning methodds libsvm: a library for support vector machines prediction of protein signal sequences and their cleavage sites using subsite coupling to predict signal peptides prediction of signal peptides using scaled window some remarks on predicting multi-label attributes in molecular biosystems review: prediction of protein structural classes cell-ploc: a package of web servers for predicting subcellular localization of proteins in various organisms cell-ploc . : an improved package of web-servers for predicting subcellular localization of proteins in various organisms predicting enzyme family classes by hybridizing gene product composition and pseudo-amino acid composition identify catalytic triads of serine hydrolases by support vector machines using pseudo amino acid composition to predict protein subcellular location: approached with amino acid composition distribution discriminating bioluminescent proteins by incorporating average chemical shift and evolutionary information into the general form of chou's pseudo amino acid composition a multi-layer classifier for predicting the subcellular localization of singleplex and multiplex gram-positive bacterial proteins drug-target interaction prediction from chemical, genomic and pharmacological data in an integrated framework review: recent advances in developing web-servers for predicting protein attributes the authors would like to express their gratitude to the three anonymous reviewers, whose constructive comments are very helpful for strengthening the presentation of the paper. the authors declare no conflict of interest. key: cord- -d iwkatn authors: henry, kevin a.; arbabi-ghahroudi, mehdi; scott, jamie k. title: beyond phage display: non-traditional applications of the filamentous bacteriophage as a vaccine carrier, therapeutic biologic, and bioconjugation scaffold date: - - journal: front microbiol doi: . /fmicb. . sha: doc_id: cord_uid: d iwkatn for the past years, phage display technology has been an invaluable tool for studies of protein–protein interactions. however, the inherent biological, biochemical, and biophysical properties of filamentous bacteriophage, as well as the ease of its genetic manipulation, also make it an attractive platform outside the traditional phage display canon. this review will focus on the unique properties of the filamentous bacteriophage and highlight its diverse applications in current research. particular emphases are placed on: (i) the advantages of the phage as a vaccine carrier, including its high immunogenicity, relative antigenic simplicity and ability to activate a range of immune responses, (ii) the phage’s potential as a prophylactic and therapeutic agent for infectious and chronic diseases, (iii) the regularity of the virion major coat protein lattice, which enables a variety of bioconjugation and surface chemistry applications, particularly in nanomaterials, and (iv) the phage’s large population sizes and fast generation times, which make it an excellent model system for directed protein evolution. despite their ubiquity in the biosphere, metagenomics work is just beginning to explore the ecology of filamentous and non-filamentous phage, and their role in the evolution of bacterial populations. thus, the filamentous phage represents a robust, inexpensive, and versatile microorganism whose bioengineering applications continue to expand in new directions, although its limitations in some spheres impose obstacles to its widespread adoption and use. the filamentous bacteriophage (genera inovirus and plectrovirus) are non-enveloped, rod-shaped viruses of escherichia coli whose long helical capsids encapsulate a single-stranded circular dna genome. subsequent to the independent discovery of bacteriophage by twort ( ) and d 'hérelle ( ) , the first filamentous phage, f , was isolated in loeb ( ) and later characterized as a member of a larger group of phage (ff, including f , m , and fd phage) specific for the e. coli conjugative f pilus (hofschneider and mueller-jensen, ; marvin and hoffmann-berling, ; zinder et al., ; salivar et al., ) . soon thereafter, filamentous phage were discovered that do not use f-pili for entry (if and ike; meynell and lawn, ; khatoon et al., ) , and over time the list of known filamentous phage has expanded to over members (fauquet et al., ) , including temperate and gram-positivetropic species. work by multiple groups over the past years has contributed to a relatively sophisticated understanding of filamentous phage structure, biology and life cycle (reviewed in marvin, ; rakonjac et al., ; rakonjac, ) . in the mid- s, the principle of modifying the filamentous phage genome to display polypeptides as fusions to coat proteins on the virion surface was invented by smith and colleagues (smith, ; parmley and smith, ) . based on the ideas described in parmley and smith ( ) , groups in california, germany, and the uk developed phage-display platforms to create and screen libraries of peptide and folded-protein variants (bass et al., ; devlin et al., ; mccafferty et al., ; scott and smith, ; breitling et al., ; kang et al., ) . this technology allowed, for the first time, the ability to seamlessly connect genetic information with protein function for a large number of protein variants simultaneously, and has been widely and productively exploited in studies of proteinprotein interactions. many excellent reviews are available on phage-display libraries and their applications (kehoe and kay, ; bratkovic, ; pande et al., ) . however, the phage also has a number of unique structural and biological properties that make it highly useful in areas of research that have received far less attention. thus, the purpose of this review is to highlight recent and current work using filamentous phage in novel and nontraditional applications. specifically, we refer to projects that rely on the filamentous phage as a key element, but whose primary purpose is not the generation or screening of phagedisplayed libraries to obtain binding polypeptide ligands. these tend to fall into four major categories of use: (i) filamentous phage as a vaccine carrier; (ii) engineered filamentous phage as a therapeutic biologic agent in infectious and chronic diseases; (iii) filamentous phage as a scaffold for bioconjugation and surface chemistry; and (iv) filamentous phage as an engine for evolving variants of displayed proteins with novel functions. a final section is dedicated to recent developments in filamentous phage ecology and phage-host interactions. common themes shared amongst all these applications include the unique biological, immunological, and physicochemical properties of the phage, its ability to display a variety of biomolecules in modular fashion, and its relative simplicity and ease of manipulation. nearly all applications of the filamentous phage depend on its ability to display polypeptides on the virion's surface as fusions to phage coat proteins ( table ) . the display mode determines the maximum tolerated size of the fused polypeptide, its copy number on the phage, and potentially, the structure of the displayed polypeptide. display may be achieved by fusing dna encoding a polypeptide of interest directly to the gene encoding a coat protein within the phage genome (type display on pviii, type display on piii, etc.), resulting in fully recombinant phage. much more commonly, however, only one copy of the coat protein is modified in the presence of a second, wild-type copy (e.g., type display if both recombinant and wild-type pviii genes are on the phage genome, type + display if the parmley and smith ( ), mcconnell et al. ( ) , rondot et al. ( ) hybrid (type and + systems) type + system < smith and scott ( ) , smith and petrenko ( ) pvi hybrid (type + system) yes < > kda hufton et al. ( ) pvii fully recombinant (type system) no ∼ > kda kwasnikowski et al. ( ) hybrid (type + system) yes < gao et al. ( ) pviii fully recombinant (landscape phage; type system) no ∼ - residues kishchenko et al. ( ) , petrenko et al. ( ) hybrid (type and + systems) type + system ∼ - > kda scott and smith ( ) , greenwood et al. ( ) , smith and fernandez ( ) pix fully recombinant (type + * system) yes ∼ > kda gao et al. ( ) hybrid (type + system) no < gao et al. ( ) , shi et al. ( ) , tornetta et al. ( ) asterisks indicate non-functional copies of the coat protein are present in the genome of the helper phage used to rescue a phagemid whose coat protein has been fused to a recombinant polypeptide. the copy number depends on polypeptide size; typically < copy per phage particle but for pviii peptide display can be up to ∼ % of pviii molecules in hybrid virions. the total number of pviii molecules depends on the phage genome size; one pviii molecule is added for every . nucleotides in the viral genome. recombinant gene is on a plasmid with a phage origin of replication) resulting in a hybrid virion bearing two different types of a given coat protein. multivalent display on some coat proteins can also be enforced using helper phage bearing nonfunctional copies of the relevant coat protein gene (e.g., type * + display). by far the most commonly used coat proteins for display are the major coat protein, pviii, and the minor coat protein, piii, with the major advantage of the former being higher copy number display (up to ∼ % of recombinant pviii molecules in a hybrid virion, at least for short peptide fusions), and of the latter being the ability to display some folded proteins at an appreciable copy number ( - per phage particle). while pviii display of folded proteins on hybrid phage is possible, it typically results in a copy number of much less than per virion (sidhu et al., ) . for the purposes of this review, we use the term "phage display" to refer to a recombinant filamentous phage displaying a single polypeptide sequence on its surface (or more rarely, bispecific display achieved via fusion of polypeptides to two different capsid proteins), and the term "phage-displayed library" to refer to a diverse pool of recombinant filamentous phage displaying an array of polypeptide variants (e.g., antibody fragments; peptides). such libraries are typically screened by iterative cycles of panning against an immobilized protein of interest (e.g., antigen for phage-displayed antibody libraries; antibody for phage-displayed peptide libraries) followed by amplification of the bound phage in e. coli cells. early work with anti-phage antisera generated for species classification purposes demonstrated that the filamentous phage virion is highly immunogenic in the absence of adjuvants (meynell and lawn, ) and that only the major coat protein, pviii, and the minor coat protein, piii, are targeted by antibodies (pratt et al., ; woolford et al., ) . thus, the idea of using the phage as carrier to elicit antibodies against poorly immunogenic haptens or polypeptide was a natural extension of the ability to display recombinant exogenous sequences on its surface, which was first demonstrated by de la cruz et al. ( ) . the phage particle's low cost of production, high stability and potential for high valency display of foreign antigen (via pviii display) also made it attractive as a vaccine carrier, especially during the early stages of development of recombinant protein technology. building upon existing peptide-carrier technology, the first filamentous phage-based vaccine immunogens displayed short amino acid sequences derived directly from proteins of interest as recombinant fusions to pviii or piii (de la cruz et al., ) . as library technology was developed and refined, phage-based antigens displaying peptide ligands of monoclonal antibodies (selected from random peptide libraries using the antibody, thus simulating with varying degrees of success the antibody's folded epitope on its cognate antigen; geysen et al., ; knittelfelder et al., ) were also generated for immunization purposes, with the goal of eliciting anti-peptide antibodies that also recognize the native protein. some of the pioneering work in this area used peptides derived from infectious disease antigens (or peptide ligands of antibodies against these antigens; table ) , including malaria and human immunodeficiency virus type (hiv- ). when displayed on phage, peptides encoding the repeat regions of the malarial circumsporozoite protein and merozoite surface protein were immunogenic in mice and rabbits (de la cruz et al., ; greenwood et al., ; willis et al., ; demangel et al., ) , and antibodies raised against the latter cross-reacted with the full-length protein. various peptide determinants (or mimics thereof) of hiv- gp , gp , gag, and reverse transcriptase were immunogenic when displayed on or conjugated to phage coat proteins (minenkova et al., ; di marzo veronese et al., ; de berardinis et al., ; scala et al., ; chen et al., ; van houten et al., van houten et al., , , and in some cases elicited antibodies that were able to weakly neutralize lab-adapted viruses (di marzo veronese et al., ; scala et al., ) . the list of animal and human infections for which phage-displayed peptide immunogens have been developed as vaccine leads continues to expand and includes bacterial, fungal, viral, and parasitic pathogens ( table ) . while in some cases the results of these studies have been promising, antibody epitope-based peptide vaccines are no longer an area of active research for several reasons: (i) in many cases, peptides incompletely or inadequately mimic epitopes on folded proteins (irving et al., ; see below); (ii) antibodies against a single epitope may be of limited utility, especially for highly variable pathogens (van regenmortel, ); and (iii) for pathogens for which protective immune responses are generated efficiently during natural infection, peptide vaccines offer few advantages over recombinant subunit and live vector vaccines, which have become easier to produce over time. more recently, peptide-displaying phage have been used in attempts to generate therapeutic antibody responses for chronic diseases, cancer, immunotherapy, and immunocontraception. immunization with phage displaying alzheimer's disease β-amyloid fibril peptides elicited anti-aggregating antibodies in mice and guinea pigs (frenkel et al., (frenkel et al., , esposito et al., ; tanaka et al., ) , possibly reduced amyloid plaque formation in mice (frenkel et al., ; solomon, ; esposito et al., ) , and may have helped maintain cognitive abilities in a transgenic mouse model of alzheimer's disease (lavie et al., ) ; however, it remains unclear how such antibodies are proposed to cross the blood-brain barrier. yip et al. ( ) found that antibodies raised in mice against an erbb /her peptide could inhibit breast-cancer cell proliferation. phage displaying peptide ligands of an anti-ige antibody elicited antibodies that bound purified ige molecules (rudolf et al., ) , which may be useful in allergy immunotherapy. several strategies for phage-based contraceptive vaccines have been proposed for control of animal populations. for example, immunization with phage displaying follicle-stimulating hormone peptides on pviii elicited antibodies that impaired the fertility of mice and ewes (abdennebi et al., ) . phage displaying or chemically rubinchik and chow ( ) conjugated to sperm antigen peptides or peptide mimics (samoylova et al., a,b) and gonadotropin-releasing hormone (samoylov et al., ) are also in development. for the most part, peptides displayed on phage elicit antibodies in experimental animals ( table ) , although this depends on characteristics of the peptide and the method of its display: piii fusions tend toward lower immunogenicity than pviii fusions (greenwood et al., ) possibly due to copy number differences (piii: - copies vs. pviii: estimated at several hundred copies; malik et al., ) . in fact, the phage is at least as immunogenic as traditional carrier proteins such as bovine serum albumin (bsa) and keyhole limpet hemocyanin (klh; melzer et al., ; su et al., ) , and has comparatively few endogenous b-cell epitopes to divert the antibody response from its intended target (henry et al., ) . excepting small epitopes that can be accurately represented by a contiguous short amino acid sequence, however, it has been extremely difficult to elicit antibody responses that cross-react with native protein epitopes using peptides. the overall picture is considerably bleaker than that painted by table , since in several studies either: (i) peptide ligands selected from phage-displayed libraries were classified by the authors as mimics of discontinuous epitopes if they bore no obvious sequence homology to the native protein, which is weak evidence of non-linearity, or (ii) the evidence for cross-reactivity of antibodies elicited by immunization with phage-displayed peptides with native protein was uncompelling. irving et al. ( ) describe at least one reason for this lack of success: it seems that peptide antigens elicit a set of topologically restricted antibodies that are largely unable to recognize discontinuous or complex epitopes on larger biomolecules. while the peptide may mimic the chemistry of a given epitope on a folded protein (allowing it to crossreact with a targeted antibody), being a smaller molecule, it cannot mimic the topology of that antibody's full epitope. despite this, the filamentous phage remains highly useful as a carrier for peptides with relatively simple secondary structures, which may be stablilized via anchoring to the coat proteins (henry et al., ) . this may be especially true of peptides with poor inherent immunogenicity, which may be increased by high-valency display and phage-associated adjuvanticity (see immunological mechanisms of vaccination with filamentous phage below). the filamentous phage has been used to a lesser extent as a carrier for t-cell peptide epitopes, primarily as fusion proteins with pviii ( table ) . early work, showing that immunization with phage elicited t-cell help (kölsch et al., ; willis et al., ) , was confirmed by several subsequent studies (de berardinis et al., ; ulivieri et al., ) . from the perspective of vaccination against infectious disease, de berardinis et al. ( ) showed that a cytotoxic t-cell (ctl) epitope from hiv- reverse transcriptase could elicit antigen-specific ctls in vitro and in vivo without addition of exogenous helper t-cell epitopes, presumably since these are already present in the phage coat proteins (mascolo et al., ) . similarly, efficient priming of ctls was observed against phage-displayed t-cell epitopes from hepatitis b virus (wan et al., ) and candida albicans (yang et al., a; wang et al., wang et al., , d , which, together with other types of immune responses, protected mice against systemic candidiasis. vaccination with a combination of phagedisplayed peptides elicited antigen-specific ctls that proved effective in reducing porcine cysticercosis in a randomized controlled trial (manoutcharian et al., ; morales et al., ) . while the correlates of vaccine-induced immune protection for infectious diseases, where they are known, are almost exclusively serum or mucosal antibodies (plotkin, ) , in certain vaccine applications, the filamentous phage has been used as a carrier for larger molecules that would be immunogenic even in isolation. initially, the major advantages to phage display of such antigens were speed, ease of purification and low cost of production (gram et al., ) . e. coli f a-g adhesin (van gerven et al., ) , hepatitis b core antigen (bahadir et al., ) , and hepatitis b surface antigen (balcioglu et al., ) all elicited antibody responses when displayed on piii, although none of these studies compared the immunogenicity of the phage-displayed proteins with that of the purified protein alone. phage displaying schistosoma mansoni glutathione s-transferase on piii elicited an antibody response that was both higher in titer and of different isotypes compared to immunization with the protein alone (rao et al., ) . two studies of antiidiotypic vaccines have used the phage as a carrier for antibody fragments bearing immunogenic idiotypes. immunization with phage displaying the e idiotype scfv (mimicking a vibrio anguillarum surface epitope) elicited antibodies that protected flounder fish from vibrio anguillarum challenge (xia et al., ) . a chemically linked phage-bcl tumor-specific idiotype vaccine was weakly immunogenic in mice but extended survival time in a b-cell lymphoma model (roehnisch et al., ) , and was welltolerated and immunogenic in patients with multiple myeloma (roehnisch et al., ) . one study of dna vaccination with an anti-laminarin scfv found that dna encoding a piii-scfv fusion protein elicited stronger humoral and cell-mediated immune responses than dna encoding the scfv alone (cuesta et al., ) , suggesting that under some circumstances, endogenous phage t-cell epitopes can enhance the immunogenicity of associated proteins. taken together, the results of these studies show that as a particulate virus-like particle, the filamentous phage likely triggers different types of immune responses than recombinant protein antigens, and provide additional t-cell help to displayed or conjugated proteins. however, the low copy number of piii-displayed proteins, as well as potentially unwanted phage-associated adjuvanticity, can make display of recombinant proteins by phage a suboptimal vaccine choice. although our understanding of the immune response against the filamentous phage pales in comparison to classical model antigens such as ovalbumin, recent work has begun to shed light on the immune mechanisms activated in response to phage vaccination (figure ) . the phage particle is immunogenic without adjuvant in all species tested to date, including mice (willis et al., ) , rats (dente et al., ) , rabbits (de la cruz et al., ) , guinea pigs (frenkel et al., ; kim et al., ) , fish (coull et al., ; xia et al., ) , non-human primates (chen et al., ) , and humans (roehnisch et al., ) . various routes of immunization have been employed, including oral administration (delmastro et al., ) as well as subcutaneous (grabowska et al., ) , intraperitoneal (van houten et al., ) , intramuscular (samoylova et al., a) , intravenous (vaks and benhar, ) , and intradermal injection (roehnisch et al., ) ; no published study has directly compared the effect of administration route on filamentous phage immunogenicity. antibodies are generated against only three major sites on the virion: (i) the surface-exposed n-terminal ∼ residues of the pviii monomer lattice (terry et al., ; kneissel et al., ) ; (ii) the n-terminal n and n domains of piii (van houten et al., ) ; and (iii) bacterial lipopolysaccharide (lps) embedded in the phage coat (henry et al., ) . in mice, serum antibody titers against the phage typically reach : - : after - immunizations, and are maintained for at least year postimmunization (frenkel et al., ) . primary antibody responses against the phage appear to be composed of a mixture of igm and igg b isotypes in c bl/ mice, while secondary antibody responses are composed primarily of igg and igg b isotypes, with a lesser contribution of igg c and igg isotypes (hashiguchi et al., ) . deletion of the surface-exposed n and n domains of piii produces a truncated form of this protein that does not elicit antibodies, but also results in a non-infective phage particle with lower overall immunogenicity (van houten et al., ) . figure | types of immune responses elicited in response to immunization with filamentous bacteriophage. as a virus-like particle, the filamentous phage engages multiple arms of the immune system, beginning with cellular effectors of innate immunity (macrophages, neutrophils, and possibly natural killer cells), which are recruited to tumor sites by phage displaying tumor-targeting moieties. the phage likely activates t-cell independent antibody responses, either via phage-associated tlr ligands or cross-linking by the pviii lattice. after processing by antigen-presenting cells, phage-derived peptides are presented on mhc class ii and cross-presented on mhc class i, resulting in activation of short-lived ctls and an array of helper t-cell types, which help prime memory ctl and high-affinity b-cell responses. frontiers in microbiology | www.frontiersin.org although serum anti-phage antibody titers appear to be at least partially t-cell dependent (kölsch et al., ; willis et al., ; de berardinis et al., ; van houten et al., ) , many circulating pviii-specific b cells in the blood are devoid of somatic mutation even after repeated biweekly immunizations, suggesting that under these conditions, the phage activates t-cell-independent b-cell responses in addition to highaffinity t-cell-dependent responses (murira, ) . filamentous phage particles can be processed by antigen-presenting cells and presented on mhc class ii molecules (gaubin et al., ; ulivieri et al., ) and can activate t h , t h , and t h helper t cells (yang et al., a; wang et al., d) . anti-phage t h responses were enhanced through display of ctla- peptides fused to piii (kajihara et al., ) . phage proteins can also be cross-presented on mhc class i molecules (wan et al., ) and can prime two waves of ctl responses, consisting first of short-lived ctls and later of long-lived memory ctls that require cd + t-cell help (del pozzo et al., ) . the latter ctls mediate a delayed-type hypersensitivity reaction (fang et al., ; del pozzo et al., ) . the phage particle is self-adjuvanting through multiple mechanisms. host cell wall-derived lps enhances the virion's immunogenicity, and its removal by polymyxin b chromatography reduces antibody titers against phage coat proteins (grabowska et al., ) . the phage's singlestranded dna genome contains cpg motifs and may also have an adjuvant effect. the antibody response against the phage is entirely dependent on myd signaling and is modulated by stimulation of several toll-like receptors (hashiguchi et al., ) , indicating that innate immunity plays an important but largely uncharacterized role in the activation of anti-phage adaptive immune responses. biodistribution studies of the phage after intravenous injection show that it is cleared from the blood within hours through the reticuloendothelial system (molenaar et al., ) , particularly of the liver and spleen, where it is retained for days (zou et al., ) , potentially activating marginal-zone b-cell responses. thus, the filamentous phage is not only a highly immunogenic carrier, but by virtue of activating a range of innate and adaptive immune responses, serves as an excellent model virus-like particle antigen. long before the identification of filamentous phage, other types of bacteriophage were already being used for antibacterial therapy in the former soviet union and eastern europe (reviewed in sulakvelidze et al., ) . the filamentous phage, with its nonlytic life cycle, has less obvious clinical uses, despite the fact that the host specificity of inovirus and plectrovirus includes many pathogens of medical importance, including salmonella, e. coli, shigella, pseudomonas, clostridium, and mycoplasma species. in an effort to enhance their bactericidal activity, genetically modified filamentous phage have been used as a "trojan horse" to introduce various antibacterial agents into cells. m and pf phage engineered to express either bglii restriction endonuclease (hagens and blasi, ; hagens et al., ) , lambda phage s holin (hagens and blasi, ) or a lethal catabolite gene activator protein (moradpour et al., ) effectively killed e. coli and pseudomonas aeruginosa cells, respectively, with no concomitant release of lps (hagens and blasi, ; hagens et al., ) . unfortunately, the rapid emergence of resistant bacteria with modified f pili represents a major and possibly insurmountable obstacle to this approach. however, there are some indications that filamentous phage can exert useful but more subtle effects upon their bacterial hosts that may not result in the development of resistance to infection. several studies have reported increased antibiotic sensitivity in bacterial populations simultaneously infected with either wild type filamentous phage (hagens et al., ) or phage engineered to repress the cellular sos response (lu and collins, ) . filamentous phage f infection inhibited early stage, but not mature, biofilm formation in e. coli (may et al., ) . thus, unmodified filamentous phage may be of future interest as elements of combination therapeutics against certain drug-resistant infections. more advanced therapeutic applications of the filamentous phage emerge when it is modified to express a targeting moiety specific for pathogenic cells and/or proteins for the treatment of infectious diseases, cancer and autoimmunity (figure ) . the first work in this area showed as proof-of-concept that phage encoding a gfp expression cassette and displaying a her specific scfv on all copies of piii were internalized into breast tumor cells, resulting in gfp expression (poul and marks, ) . m or fd phage displaying either a targeting peptide or antibody fragment and tethered to chloramphenicol by a labile crosslinker were more potent inhibitors of staphylococcus aureus growth than high-concentration free chloramphenicol (yacoby et al., ; vaks and benhar, ) . m phage loaded with doxorubicin and displaying a targeting peptide on piii specifically killed prostate cancer cells in vitro (ghosh et al., a) . tumorspecific peptide:pviii fusion proteins selected from "landscape" phage (romanov et al., ; abbineni et al., ; fagbohun et al., fagbohun et al., , lang et al., ; wang et al., a) were able to target and deliver sirna-, paclitaxel-, and doxorubicincontaining liposomes to tumor cells (jayanna et al., a; wang et al., a wang et al., ,b,c, b bedi et al., bedi et al., , bedi et al., , ; they were non-toxic and increased tumor remission rates in mouse models (jayanna et al., b; wang et al., b,c) . using the b -ova tumor model, eriksson et al. ( ) showed that phage displaying peptides and/or fabs specific for tumor antigens delayed tumor growth and improved survival, owing in large part to activation of tumor-associated macrophages and recruitment of neutrophils to the tumor site (eriksson et al., ) . phage displaying an scfv against β-amyloid fibrils showed promise as a diagnostic (frenkel and solomon, ) and therapeutic (solomon, ) reagent for alzheimer's disease and parkinson's disease due to the unanticipated ability of the phage to penetrate into brain tissue (ksendzovsky et al., ) . similarly, phage displaying an immunodominant peptide epitope derived from myelin oligodendrocyte glycoprotein depleted pathogenic demyelinating antibodies in brain tissue in the murine experimental autoimmune encephalomyelitis model of multiple sclerosis (rakover et al., ) . the advantages of the filamentous phage in this context over traditional antibody-drug or protein-peptide conjugates are (i) its ability to carry very high amounts of drug or peptide, and (ii) its ability to access anatomical compartments that cannot generally be reached by systemic administration of a protein. unlike most therapeutic biologics, the filamentous phage's production in bacteria complicates its use in humans in several ways. first and foremost, crude preparations of filamentous phage typically contain very high levels of contaminating lps, in the range of ∼ - endotoxin units (eu)/ml (boratynski et al., ; branston et al., ) , which have the potential to cause severe adverse reactions. lps is not completely removed by polyethylene glycol precipitation or cesium chloride density gradient centrifugation (smith and gingrich, ; branston et al., ) , but its levels can be reduced dramatically using additional purification steps such as size exclusion chromatography (boratynski et al., ; zakharova et al., ) , polymyxin b chromatography (grabowska et al., ) , and treatment with detergents such as triton x- or triton x- (roehnisch et al., ; branston et al., ) . these strategies routinely achieve endotoxin levels of < eu/ml as measured by the limulus amebocyte lysate (lal) assay, well below the fda limit for parenteral administration of eu/kg body weight/dose, although concerns remain regarding the presence of residual virion-associated lps which may be undetectable. a second and perhaps unavoidable consequence of the filamentous phage's bacterial production is inherent heterogeneity of particle size and the spectrum of host cellderived virion-associated and soluble contaminants, which may be cause for safety concerns and restrict its use to high-risk groups. many types of bacteriophage and engineered phage variants, including filamentous phage, have been proposed for prophylactic use ex vivo in food safety, either in the production pipeline (reviewed in dalmasso et al., ) or for detection of foodborne pathogens post-production (reviewed in schmelcher and loessner, ) . filamentous phage displaying a tetracysteine tag on piii were used to detect e. coli cells through staining with biarsenical dye . m phage functionalized with metallic silver were highly bactericidal against e. coli and staphylococcus epidermidis . biosensors based on surface plasmon resonance (nanduri et al., ) , piezoelectric transducers (olsen et al., ) , linear dichroism (pacheco-gomez et al., ) , and magnetoelastic sensor technology (lakshmanan et al., ; huang et al., ) were devised using filamentous phage displaying scfv or conjugated to whole igg against e. coli, listeria monocytogenes, salmonella typhimurium, and bacillus anthracis with limits of detection on the order of - bacterial cells/ml. proof of concept has been demonstrated for use of such phage-based biosensors to detect bacterial contamination of live produce (li et al., b) and eggs (chai et al., ) . the filamentous phage particle is enclosed by a rod-like protein capsid, ∼ nm long and nm wide, made up almost entirely of overlapping pviii monomers, each of which lies ∼ angstroms from its nearest neighbor and exposes two amine groups as well as at least three carboxyl groups (henry et al., ) . the regularity of the phage pviii lattice and its diversity of chemically addressable groups make it an ideal scaffold for bioconjugation (figure ) . the most commonly used approach is functionalization of amine groups with nhs esters (van houten et al., (van houten et al., , yacoby et al., ) , although this can result in unwanted acylation of piii and any displayed biomolecules. carboxyl groups and tyrosine residues can also be functionalized using carbodiimide coupling and diazonium coupling, respectively (li et al., a) . carrico et al. ( ) developed methods to specifically label pviii n-termini without modification of exposed lysine residues through a two-step transamination-oxime formation reaction. specific modification of phage coat proteins is even more easily accomplished using genetically modified phage displaying peptides (ng et al., ) or enzymes (chen et al., ; hess et al., ) , but this can be cumbersome and is less general in application. for more than a decade, interest in the filamentous phage as a building block for nanomaterials has been growing because of its unique physicochemical properties, with emerging applications in magnetics, optics, and electronics. it has long been known that above a certain concentration threshold, phage can form ordered crystalline suspensions (welsh et al., ) . lee et al. ( ) engineered m phage to display a zns-binding peptide on piii and showed that, in the presence of zns nanoparticles, they selfassemble into highly ordered film biomaterials that can be aligned using magnetic fields. taking advantage of the ability to display substrate-specific peptides at known locations on the phage filament hess et al., ) , this pioneering figure | chemically addressable groups of the filamentous bacteriophage major coat protein lattice. the filamentous phage virion is made up of ∼ , - , overlapping copies of the -residue major coat protein, pviii, arranged in a shingle-type lattice. each monomer has an array of chemically addressable groups available for bioorthogonal conjugation, including two primary amine groups (shown in red), three carboxyl groups (show in blue) and two hydroxyl groups (show in green). the n-terminal residues generally exposed to the immune system for antibody binding are in bold underline. figure adapted from structural data of marvin, , freely available in pdb and scope databases. work became the basis for construction of two-and threedimensional nanomaterials with more advanced architectures, including semiconducting nanowires (mao et al., (mao et al., , , nanoparticles , and nanocomposites (oh et al., ; chen et al., ) . using hybrid m phage displaying co o -and gold-binding peptides on pviii as a scaffold to assemble nanowires on polyelectrolyte multilayers, nam et al. ( ) produced a thin, flexible lithium ion battery, which could be stamped onto platinum microband current collectors (nam et al., ) . the electrochemical properties of such batteries were further improved through piii-display of single-walled carbon nanotube-binding peptides (lee et al., ) , offering an approach for sustainable production of nanostructured electrodes from poorly conductive starting materials. phagebased nanomaterials have found applications in cancer imaging (ghosh et al., b; yi et al., ) , photocatalytic water splitting (nam et al., a; neltner et al., ) , light harvesting (nam et al., b; chen et al., ) , photoresponsive technologies (murugesan et al., ) , neural electrodes (kim et al., ) , and piezoelectric energy generation (murugesan et al., ) . thus, the unique physicochemical properties of the phage, in combination with modular display of peptides and proteins with known binding specificity, have spawned wholly novel materials with diverse applications. it is worth noting that the unusual biophysical properties of the filamentous phage can also be exploited in the study of structures of other macromolecules. magnetic alignment of high-concentration filamentous phage in solution can partially order dna, rna, proteins, and other biomolecules for measurement of dipolar coupling interactions (hansen et al., (hansen et al., , dahlke ojennus et al., ) in nmr spectroscopy. because of their large population sizes, short generation times, small genome sizes and ease of manipulation, various filamentous and non-filamentous bacteriophages have been used as models of experimental evolution (reviewed in husimi, ; wichman and brown, ; kawecki et al., ; hall et al., ) . the filamentous phage has additional practical uses in protein engineering and directed protein evolution, due to its unique tolerance of genetic modifications that allow biomolecules to be displayed on the virion surface. first and foremost among these applications is in vitro affinity maturation of antibody fragments displayed on piii. libraries of variant fabs and single chain antibodies can be generated via random or sitedirected mutagenesis and selected on the basis of improved or altered binding, roughly mimicking the somatic evolution strategy of the immune system (marks et al., ; bradbury et al., ) . however, other in vitro display systems, such as yeast display, have important advantages over the filamentous phage for affinity maturation (although each display technology has complementary strengths; koide and koide, ) , and regardless of the display method, selection of "improved" variants can be slow and cumbersome. iterative methods have been developed to combine computationally designed mutations (lippow et al., ) and circumvent the screening of combinatorial libraries, but these have had limited success to date. recently, esvelt et al. ( ) developed a novel strategy for directed evolution of filamentous phage-displayed proteins, called phage-assisted continuous evolution (pace), which allows multiple rounds of evolution per day with little experimental intervention. the authors engineered m phage to encode an exogenous protein (the subject for directed evolution), whose functional activity triggers gene iii expression from an accessory plasmid; variants of the exogenous protein arise by random mutagenesis during phage replication, the rate of which can be increased by inducible expression of error-prone dna polymerases. by supplying limiting amounts of receptive e. coli cells to the engineered phage variants, esvelt et al. ( ) elegantly linked phage infectivity and production of offspring with the presence of a desired protein phenotype. carlson et al. ( ) later showed that pace selection stringency could be modulated by providing small amounts of piii independently of protein phenotype, and undesirable protein functions negatively selected by linking them to expression of a truncated piii variant that impairs infectivity in a dominant negative fashion. pace is currently limited to protein functions that can be linked in some way to the expression of a gene iii reporter, such as protein-protein interaction, recombination, dna or rna binding, and enzymatic catalysis (meyer and ellington, ) . this approach represents a promising avenue for both basic research in molecular evolution (dickinson et al., ) and synthetic biology, including antibody engineering. filamentous bacteriophage have been recovered from diverse environmental sources, including soil (murugaiyan et al., ) , coastal fresh water (xue et al., ) , alpine lakes (hofer and sommaruga, ) and deep sea bacteria (jian et al., ) , but not, perhaps surprisingly, the human gut (kim et al., ) . the environmental "phageome" in soil and water represent the largest source of replicating dna on the planet, and is estimated to contain upward of viral particles (ashelford et al., ; chibani-chennoufi et al., ; suttle, ) . the few studies attempting to investigate filamentous phage environmental ecology using classical environmental microbiology techniques (typically direct observation by electron microscopy) found that filamentous phage made up anywhere from to % of all viral particles (demuth et al., ; pina et al., ; hofer and sommaruga, ) . there was some evidence of seasonal fluctuation of filamentous phage populations in tandem with the relative abundance of free-living heterotrophic bacteria (hofer and sommaruga, ) . environmental metagenomics efforts are just beginning to unravel the composition of viral ecosystems. the existing data suggest that filamentous phage comprise minor constituents of viral communities in freshwater (roux et al., ) and reclaimed and potable water (rosario et al., ) but have much higher frequencies in wastewater and sewage (cantalupo et al., ; alhamlan et al., ) , with the caveat that biases inherent to the methodologies for ascertaining these data (purification of viral particles, sequencing biases) have not been not well validated. there are no data describing the population dynamics of filamentous phage and their host species in the natural environment. at the individual virus-bacterium level, it is clear that filamentous phage can modulate host phenotype, including the virulence of important human and crop pathogens. this can occur either through direct effects of phage replication on cell growth and physiology, or, more typically, by horizontal transfer of genetic material contained within episomes and/or chromosomally integrated prophage. temperate filamentous phage may also play a role in genome evolution (reviewed in canchaya et al., ) . perhaps the best-studied example of virulence modulation by filamentous phage is that of vibrio cholerae, whose full virulence requires lysogenic conversion by the cholera toxin-encoding ctxφ phage (waldor and mekalanos, ) . integration of ctxφ phage occurs at specific sites in the genome; these sequences are introduced through the combined action of another filamentous phage, fs φ, and a satellite filamentous phage, tlc-knφ (hassan et al., ) . thus, filamentous phage species interact and coevolve with each other in addition to their hosts. infection by filamentous phage has been implicated in the virulence of yersinia pestis (derbise et al., ) , neisseria meningitidis (bille et al., (bille et al., , , vibrio parahaemolyticus (iida et al., ) , e. coli :k :h (gonzalez et al., ) , xanthomonas campestris (kamiunten and wakimoto, ) , and p. aeruginosa (webb et al., ) , although in most of these cases, the specific mechanisms modulating virulence are unclear. phage infection can both enhance or repress virulence depending on the characteristics of the phage, the host bacterium, and the environmental milieu, as is the case for the bacterial wilt pathogen ralstonia solanacearum (yamada, ) . since infection results in downregulation of the pili used for viral entry, filamentous phage treatment has been proposed as a hypothetical means of inhibiting bacterial conjugation and horizontal gene transfer, so as to prevent the spread of antibiotic resistance genes (lin et al., ) . finally, the filamentous phage may also play a future role in the preservation of biodiversity of other organisms in at-risk ecosystems. engineered phage have been proposed for use in bioremediation, either displaying antibody fragments of desired specificity for filtration of toxins and environmental contaminants (petrenko and makowski, ) , or as biodegradable polymers displaying peptides selected for their ability to aggregate pollutants, such as oil sands tailings (curtis et al., (curtis et al., , . engineered phage displaying peptides that specifically bind inorganic materials have also been proposed for use in more advanced and less intrusive mineral separation technologies (curtis et al., ). the filamentous phage represents a highly versatile organism whose uses extend far beyond traditional phage display and affinity selection of antibodies and polypeptides of desired specificity. its high immunogenicity and ability to display a variety of surface antigens make the phage an excellent particulate vaccine carrier, although its bacterial production and preparation heterogeneity likely limits its applications in human vaccines at present, despite being apparently safe and well-tolerated in animals and people. unanticipated characteristics of the phage particle, such as crossing of the blood-brain barrier and formation of highly ordered liquid crystalline phases, have opened up entirely new avenues of research in therapeutics for chronic disease and the design of nanomaterials. our comparatively detailed understanding of the interactions of model filamentous phage with their bacterial hosts has allowed researchers to harness the phage life cycle to direct protein evolution in the lab. hopefully, deeper knowledge of phage-host interactions at an ecological level may produce novel strategies to control bacterial pathogenesis. while novel applications of the filamentous phage continue to be developed, the phage is likely to retain its position as a workhorse for therapeutic antibody discovery for many years to come, even with the advent of competing technologies. kh and js conceived and wrote the manuscript. ma-g read the manuscript and commented on the text. evolutionary selection of new breast cancer cell-targeting peptides and phages with the cell-targeting peptides fully displayed on the major coat and their effects on actin dynamics during cell internalization generating fsh antagonists and agonists through immunization against fsh receptor n-terminal decapeptides metagenomics-based analysis of viral communities in dairy lagoon wastewater elevated abundance of bacteriophage infecting bacteria in soil phage displayed hbv core antigen with immunogenic activity cost effective filamentous phage based immunization nanoparticles displaying a full-length hepatitis b virus surface antigen hormone phage: an enrichment method for variant proteins with altered binding properties protective immune responses induced by the immunization of mice with a recombinant bacteriophage displaying an epitope of the human respiratory syncytial virus selection of pancreatic cancer cell-binding landscape phages and their use in development of anticancer nanomedicines targeted delivery of sirna into breast cancer cells via phage fusion proteins delivery of sirna into breast cancer cells via phage fusion protein-targeted liposomes association of a bacteriophage with meningococcal disease in young adults a chromosomally integrated bacteriophage in invasive meningococci preparation of endotoxin-free bacteriophages beyond natural antibodies: the power of in vitro display technologies a nonchromatographic method for the removal of endotoxins from bacteriophages progress in phage display: evolution of the technique and its application a surface expression vector for antibody screening raw sewage harbors diverse viral populations negative selection and stringency modulation in phage-assisted continuous evolution n-terminal labeling of filamentous phage to create cancer marker imaging agents rapid and sensitive detection of salmonella typhimurium on eggshells by using wireless biosensors phage display evolution of a peptide substrate for yeast biotin ligase and application to two-color quantum dot labeling of cell surface proteins versatile three-dimensional virus-based template for dye-sensitized solar cells with improved electron transport and light harvesting assembly of viral hydrogels for three-dimensional conducting nanocomposites protection of rhesus macaques against disease progression from pathogenic shiv- . pd by vaccination with phage-displayed hiv- epitopes phage-host interaction: an ecological perspective evaluation of filamentous bacteriophage as immunogens in atlantic salmon enhancement of dna vaccine potency through linkage of antigen to filamentous bacteriophage coat protein iii domain i bacteriophageinduced aggregation of oil sands tailings biomining with bacteriophage: selectivity of displayed peptides for naturally occurring sphalerite and chalcopyrite effects of bacteriophage on the surface properties of chalcopyrite (cufes( )), and phage-induced flocculation of chalcopyrite, glacial till, and oil sands tailings induced alignment and measurement of dipolar couplings of an sh domain through direct binding with filamentous phage exploiting gut bacteriophages for human health recognition of hiv-derived b and t cell epitopes displayed on filamentous phages phage display of peptide epitopes from hiv- elicits strong cytolytic responses immunogenicity and epitope mapping of foreign sequences via genetically engineered filamentous phage triggering dth and ctl activity by fd filamentous bacteriophages: role of cd + t cells in memory responses immunogenicity of filamentous phage displaying peptide mimotopes after oral administration reproducing the immune response against the plasmodium vivax merozoite surface protein with mimotopes selected from a phage-displayed peptide library direct electron microscopy study on the morphological diversity of bacteriophage populations in lake pluβsee monoclonal antibodies that recognise filamentous phage: tools for phage display technology a horizontally acquired filamentous phage contributes to the pathogenicity of the plague bacillus random peptide libraries: a source of specific protein binding molecules experimental interrogation of the path dependence and stochasticity of protein evolution using phage-assisted continuous evolution structural mimicry and enhanced immunogenicity of peptide epitopes displayed on filamentous bacteriophage. the v loop of hiv- gp sur un microbe invisible antagoniste des bacilles dysentérique tumor specific phage particles promote tumor regression in a mouse melanoma model tumor-specific bacteriophages induce tumor destruction through activation of tumor-associated macrophages immunogenicity and therapeutic efficacy of phage-displayed beta-amyloid epitopes a system for the continuous directed evolution of biomolecules landscape phages and their fusion proteins targeted to breast cancer cells metastatic prostate cancer cell-specific phage-like particles as a targeted genedelivery system the potential of phage display virions expressing malignant tumor specific antigen mage-a epitope in murine model virus taxonomy. th report of the international committee on the taxonomy of viruses mimicking of discontinuous epitopes by phage-displayed peptides, ii. selection of clones recognized by a protective monoclonal antibody against the bordetella pertussis toxin from phage peptide libraries a general strategy to identify mimotopes of pathological antigens using only random peptide libraries and human sera reduction of β-amyloid plaques in brain of transgenic mouse model of alzheimer's disease by efrh-phage immunization immunization against alzheimer's β-amyloid plaques via efrh phage administration filamentous phage as vector-mediated antibody delivery to the brain a method for the generation of combinatorial antibody libraries using pix phage display making artificial antibodies: a format for phage display of combinatorial heterodimeric arrays processing of filamentous bacteriophage virions in antigenpresenting cells targets both hla class i and class ii peptide loading compartments a priori delineation of a peptide which mimics a discontinuous antigenic determinant refactored m bacteriophage as a platform for tumor cell imaging and drug delivery m -templated magnetic nanoparticles for targeted in vivo imaging of prostate cancer conserved filamentous prophage in escherichia coli o :k :h and yersinia pestis biovar orientalis immunisation with phage displaying peptides representing single epitopes of the glycoprotein g can give rise to partial protective immunity to hsv- phage display as a rapid gene expression system: production of bioactive cytokinephage and generation of neutralizing monoclonal antibodies multiple display of foreign peptides on a filamentous bacteriophage. peptides from plasmodium falciparum circumsporozoite protein as antigens genetically modified filamentous phage as bactericidal agents: a pilot study augmentation of the antimicrobial efficacy of antibiotics by filamentous phage therapy of experimental pseudomonas infections with a nonreplicating genetically modified phage viral host-adaptation: insights from evolution experiments with phages pf filamentous phage as an alignment tool for generating local and global structural information in nucleic acids tunable alignment of macromolecules by filamentous phage yields dipolar coupling interactions immunological basis of m phage vaccine: regulation under myd and tlr signaling satellite phage tlcφ enables toxigenic conversion by ctx phage through dif site alteration developing strategies to enhance and focus humoral immune responses using filamentous phage as a model antigen m bacteriophage display framework that allows sortasemediated modification of surface-accessible phage proteins seasonal dynamics of viruses in an alpine lake: importance of filamentous forms on infectious substructures from e. coli bacteriophages. . demonstration and properties of "ht " particles peptide mimotopes of rabies virus glycoprotein with immunogenic activity sequential detection of salmonella typhimurium and bacillus anthracis spores using magnetoelastic biosensors programmable assembly of nanoarchitectures using genetically engineered viruses phage display of cdna repertoires: the pvi display system and its applications for the selection of immunogenic ligands selection and evolution of bacteriophages in cellstat filamentous phage associated with recent pandemic strains of vibrio parahaemolyticus exploring peptide mimics for the production of antibodies against discontinuous protein epitopes landscape phage ligands for pc prostate carcinoma cells landscape phage fusion protein-mediated targeting of nanomedicines enhances their prostate tumor cell association and cytotoxic efficiency dynamic modulation of dna replication and gene transcription in deep-sea filamentous phage sw in response to changes of host growth and temperature th -type immune response induced by a phage clone displaying a ctla -binding domain mimicmotif effect of infection with filamentous phage xf on the growth, ultrastructure and virulence of xanthomonas campestris pv. oryzae n linkage of recognition and replication functions by assembling combinatorial antibody fab libraries along phage surfaces experimental evolution filamentous phage display in the new millennium a new filamentous bacteriophage with sex-factor specificity diversity and abundance of single-stranded dna viruses in human feces genetically engineered bacteriophage delivers a tumor necrosis factor alpha antagonist coating on neural electrodes expression of a foot-and-mouth disease virus immunodominant epitope by a filamentous bacteriophage vector structure of a foreign peptide displayed on the surface of bacteriophage m epitope structures recognised by antibodies against the major coat protein (g p) of filamentous bacteriophage fd (inoviridae) mimotope vaccination-from allergy to cancer affinity maturation of single-domain antibodies by yeast surface display genetics of the immune response. i. the immune response to the phage fd in high and low responding inbred strains of mice convection-enhanced delivery of m bacteriophage to the brain multivalent display system on filamentous bacteriophage pvii minor coat protein phage immobilized magnetoelastic sensor for the detection of salmonella typhimurium specific probe selection from landscape phage display library and its application in enzyme-linked immunosorbent assay of free prostate-specific antigen efrh-phage immunization of alzheimer's disease animal model improves behavioral performance in morris water maze trials ordering of quantum dots using genetically engineered viruses fabricating genetically engineered high-power lithium-ion batteries using multiple virus genes chemical modification of m bacteriophage and its application in cancer cell imaging direct detection of salmonella typhimurium on fresh produce using phagebased magnetoelastic biosensors mimotopes selected with a neutralizing antibody against urease b from helicobacter pylori induce enzyme inhibitory antibodies in mice upon vaccination phage display for site-specific immunization and characterization of high-risk human papillomavirus specific e monoclonal antibodies inhibition of bacterial conjugation by phage m and its protein g p: quantitative analysis and model computational design of antibody-affinity improvement beyond in vivo maturation isolation of a bacteriophage specific for the f plus and hfr mating types of escherichia coli k- engineered bacteriophage targeting gene networks as adjuvants for antibiotic therapy role of capsid structure and membrane protein processing in determining the size and copy number of peptides displayed on the major coat protein of filamentous bacteriophage recombinant bacteriophage-based multiepitope vaccine against taenia solium pig cysticercosis viral assembly of oriented quantum dot nanowires virus-based toolkit for the directed synthesis of magnetic and semiconducting nanowires genetically engineered phage fibers and coatings for antibacterial applications molecular evolution of proteins on filamentous phage. mimicking the strategy of the immune system model-building studies of inovirus: genetic variations on a geometric theme filamentous phage structure, infection and assembly a fibrous dna phage (fd) and a spherical rna phage (fr) specific for male strains of e. coli. ii. physical characteristics phage display of a ctl epitope elicits a long-term in vivo cytotoxic response exposure of conjugative plasmid carrying escherichia coli biofilms to male-specific bacteriophages phage antibodies: filamentous phage displaying antibody variable domains constrained peptide libraries as a tool for finding mimotopes humoral immune response against proteophosphoglycan surface antigens of entamoeba histolytica elicited by immunization with synthetic mimotope peptides antigenicity and immunogenicity of phage library-selected peptide mimics of the major surface proteophosphoglycan antigens of entamoeba histolytica immunisation with phage-displayed variable region from meningococcal pora outer membrane protein induces bactericidal antibodies against neisseria meningitidis derivation of vaccines from mimotopes. immunologic properties of human hepatitis b virus surface antigen mimotopes displayed on filamentous phage molecular evolution picks up the pace filamentous phages specific for the i sex factor design of specific immunogens using filamentous phage as the carrier uptake and processing of modified bacteriophage m in mice: implications for phage display genetically engineered phage harbouring the lethal catabolite gene activator protein gene with an inducer-independent promoter for biocontrol of escherichia coli inexpensive anti-cysticercosis vaccine: s pvac expressed in heat inactivated m filamentous phage proves effective against naturally acquired taenia solium porcine cysticercosis recognition by human sera and immunogenicity of hbsag mimotopes selected from an m phage display library characterization of molecular correlates of the chronic humoral immune response: clues towards eliciting broadly neutralizing antibodies against hiv characterization of filamentous bacteriophage pe infecting ralstonia solanacearum strains virusbased photo-responsive nanowires formed by linking site-directed mutagenesis and chemical reaction virus-enabled synthesis and assembly of nanowires for lithium ion battery electrodes stamped microbattery electrodes based on self-assembled m viruses biologically templated photocatalytic nanostructures for sustained light-driven water oxidation virus-templated assembly of porphyrins into light-harvesting nanoantennae spr biosensor for the detection of l. monocytogenes using phage-displayed antibody production of hydrogen using nanocrystalline protein-templated catalysts on m phage quantitative synthesis of genetically encoded glycopeptide libraries displayed on m phage graphene sheets stabilized on genetically engineered m viral templates as conducting frameworks for hybrid energy-storage materials affinity-selected filamentous bacteriophage as a probe for acoustic wave biodetectors of salmonella typhimurium detection of pathogenic bacteria using a homogeneous immunoassay based on shear alignment of virus particles and linear dichroism phage display: concept, innovations, applications and future antibody-selectable filamentous fd phage vectors: affinity purification of target genes potential applications of phage display to bioremediation a library of organic landscapes on filamentous phage abundance, morphology and distribution of planktonic viruslike particles in two high-mountain lakes correlates of protection induced by vaccination targeted gene delivery to mammalian cells by filamentous bacteriophage conditional lethal mutants of the small filamentous coliphage m . ii. two genes for coat proteins selection of antigenic and immunogenic mimics of hepatitis c virus using sera from patients towards a solution for hepatitis c virus hypervariability: mimotopes of the hypervariable region can induce antibodies crossreacting with a large number of viral variants filamentous bacteriophages: biology and applications, " in els (the encyclopaedia of life sciences) filamentous bacteriophage: biology, phage display and nanotechnology applications antigen-specific therapy of eae via intranasal delivery of filamentous phage displaying a myelin immunodominant epitope expression of a -kilodalton glutathione s-transferase antigen of schistosoma mansoni on the surface of filamentous phages and evaluation of its vaccine potential structural requirements for the activity of the mirb ferrisiderophore transporter of aspergillus fumigatus chemically linked phage idiotype vaccination in the murine b cell lymphoma model phage idiotype vaccination: first phase i/ii clinical trial in patients with multiple myeloma phage display selection of peptides that affect prostate carcinoma cells attachment and invasion a helper phage to improve single-chain antibody presentation in phage display metagenomic analysis of viruses in reclaimed water assessing the diversity and specificity of two freshwater viral communities through metagenomics recombinant expression and neutralizing activity of an mhc class ii binding epitope of toxic shock syndrome toxin- epitope-specific antibody response to ige by mimotope immunization some physical-chemical and biological properties of the rod-shaped coliphage m generation and characterization of phage-gnrh chemical conjugates for potential use in cat and dog immunocontraception phage display allows identification of zona pellucida-binding peptides with species-specific properties: novel approach for development of contraceptive vaccines for wildlife infective and inactivated filamentous phage as carriers for immunogenic peptides vaccination with filamentous bacteriophages targeting dec- induces dc maturation and potent anti-tumor t-cell responses in the absence of adjuvants the use of filamentous bacteriophage fd to deliver hla-a -restricted peptides and to induce strong antitumor ctl responses selection of hiv-specific immunogenic epitopes by screening random peptide libraries with hiv- -positive sera application of bacteriophages for detection of foodborne pathogens searching for peptide ligands with an epitope library de novo selection of high-affinity antibodies from synthetic fab libraries displayed on phage as pix fusion proteins high copy display of large proteins on phage for functional selections filamentous fusion phage: novel expression vectors that display cloned antigens on the virion surface effect of dna copy number on genetic stability of phage-displayed peptides hydroxyapatite chromatography of phage-display virions phage display libraries of peptides and proteins displayed on filamentous phage generation of anti-β-amyloid antibodies via phage display technology towards alzheimer's disease vaccination filamentous bacteriophage as a novel therapeutic tool for alzheimer's disease treatment comparison of phage pviii and klh as vector in inducing the production of cytokines in c bl/ j mice bacteriophage therapy viruses in the sea a mimotope peptide of aβ fibril-specific antibodies with aβ fibrillation inhibitory activity induces anti-aβ conformer antibody response by a displayed form on an m phage in mice accessibility of peptides displayed on filamentous bacteriophage virions: susceptibility to proteinases antibody fab display and selection through fusion to the pix coat protein of filamentous phage an investigation on the nature of ultra-microscopic viruses antigenic properties of hcmv peptides displayed by filamentous bacteriophages vs. synthetic peptides in vivo characteristics of targeted drug-carrying filamentous bacteriophage nanomedicines presentation of the functional receptor-binding domain of the bacterial adhesin f a-g on bacteriophage m engineering filamentous phage carriers to improve focusing of antibody responses against peptides filamentous phage as an immunogenic carrier to elicit focused antibody responses against a synthetic peptide basic research in hiv vaccinology is hampered by reductionist thinking lysogenic conversion by a filamentous phage encoding cholera toxin induction of hepatitis b virus-specific cytotoxic t lymphocytes response in vivo by filamentous phage display vaccine crosspresentation of phage particle antigen in mhc class ii and endoplasmic reticulum marker-positive compartments bio-mimetic nanostructure self-assembled from au@ag heterogeneous nanorods and phage fusion proteins for targeted tumor optical detection and photothermal therapy enhanced tumor delivery and antitumor activity in vivo of liposomal doxorubicin modified with mcf- -specific phage fusion protein paclitaxel-loaded peg-pe-based micellar nanopreparations targeted with tumor specific landscape phage fusion protein enhance apoptosis and efficiently reduce tumors hybrid phage displaying slaqvkytsassi induces protection against candida albicans challenge in balb/c mice protective immune responses against systemic candidiasis mediated by phage-displayed specific epitope of candida albicans heat shock protein in c bl/ j mice enhanced binding and killing of target tumor cells by drug-loaded liposomes modified with tumor-specific phage fusion coat protein paclitaxel-loaded polymeric micelles modified with mcf- cell-specific phage protein: enhanced binding to target cancer cells and increased cytotoxicity cytoplasmic delivery of liposomes into mcf- breast cancer cells mediated by cell-specific phage fusion coat protein bacteriophage and phenotypic variation in pseudomonas aeruginosa biofilm development evidence for tilted smectic liquid crystalline packing of fd inovirus from x-ray fiber diffraction experimental evolution of viruses: microviridae as a model system immunological properties of foreign peptides in multiple display on a filamentous bacteriophage adsorption protein of bacteriophage fl: solubilization in deoxycholate and localization in the fl virion sensitive and selective bacterial detection using tetracysteine-tagged phages in conjunction with biarsenical dye phage display particles expressing tumor-specific antigens induce preventive and therapeutic anti-tumor immunity in murine p model development of a phage displayed disulfide-stabilized fv fragment vaccine against vibrio anguillarum high frequency of a novel filamentous phage, vcyφ, within an environmental vibrio cholerae population targeting antibacterial agents by using drug-carrying filamentous bacteriophages filamentous phages of ralstonia solanacearum: doubleedged swords for pathogenic bacteria prophylactic vaccination with phage-displayed epitope of c. albicans elicits protective immune responses against systemic candidiasis in c bl/ mice epitope mapping of mycoplasma hyopneumoniae using phage displayed peptide libraries and the immune responses of the selected phagotopes m phage-functionalized single-walled carbon nanotubes as nanoprobes for second near-infrared window fluorescence imaging of targeted tumors comparison of phage piii, pviii and gst as carrier proteins for peptide immunisation in balb/c mice spontaneous assembly of viruses on multilayered polymer surfaces characterization of murine coronavirus neutralization epitopes with phage-displayed peptides purification of filamentous bacteriophage for phage display using size-exclusion chromatography conformational mimicry of a chlamydial neutralization epitope on filamentous phage f , a rodshaped male-specific bacteriophage that contains dna biodistribution of filamentous phage peptide libraries in mice this work was supported by funding from the national research council of canada (kh, ma-g) and the canada research chair program (js). we thank jyothi kumaran and roger mackenzie for critical appraisal of the manuscript, and jasna rakonjac for inviting us to contribute it. this is national research council canada publication number . key: cord- -o duxhs authors: chandramouli, kondethimmanahalli; qian, pei-yuan title: proteomics: challenges, techniques and possibilities to overcome biological sample complexity date: - - journal: hum genomics proteomics doi: . / / sha: doc_id: cord_uid: o duxhs proteomics is the large-scale study of the structure and function of proteins in complex biological sample. such an approach has the potential value to understand the complex nature of the organism. current proteomic tools allow large-scale, high-throughput analyses for the detection, identification, and functional investigation of proteome. advances in protein fractionation and labeling techniques have improved protein identification to include the least abundant proteins. in addition, proteomics has been complemented by the analysis of posttranslational modifications and techniques for the quantitative comparison of different proteomes. however, the major limitation of proteomic investigations remains the complexity of biological structures and physiological processes, rendering the path of exploration paved with various difficulties and pitfalls. the quantity of data that is acquired with new techniques places new challenges on data processing and analysis. this article provides a brief overview of currently available proteomic techniques and their applications, followed by detailed description of advantages and technical challenges. some solutions to circumvent technical difficulties are proposed. the term proteomics describes the study and characterization of complete set of proteins present in a cell, organ, or organism at a given time [ ] . in general, proteomic approaches can be used (a) for proteome profiling, (b) for comparative expression analysis of two or more protein samples, (c) for the localization and identification of posttranslational modifications, and (d) for the study of protein-protein interactions. the human genome harbors - protein encoding genes [ ] ; whereas the total number of human protein products, including splice variants and essential posttranslational modifications (ptms), has been estimated to be close to one million [ , ] . it is evident that most of the functional information on the genes resides in the proteome, which is the sum of multiple dynamic processes that include protein phosphorylation, protein trafficking, localization, and protein-protein interactions [ ] . moreover, the proteomes of mammalian cells, tissues, and body fluids are complex and display a wide dynamic range of proteins concentration one cell can contain between one and more than copies of a single protein [ ] . in spite of new technologies, analysis of complex biological mixtures, ability to quantify separated protein species, sufficient sensitivity for proteins of low abundance, quantification over a wide dynamic range, ability to analyze protein complexes, and high throughput applications is not yet fulfilled [ ] . biomarker discovery remains a very challenging task due to the complexity of the samples (e.g., serum, other bodily fluids, or tissues) and the wide dynamic range of protein concentrations [ ] . most of the serum biomarker studies performed to date seem to have converged on a set of proteins that are repeatedly identified in many studies and that represent only a small fraction of the entire blood proteome [ ] . processing and analysis of proteomics data is indeed a very complex multistep process [ , ] . the consistent and transparent analysis of lc/ms and lc-ms/ms data requires multiple stages [ ] , and this process remains the main bottleneck for many larger proteomics studies. to overcome these issues, effective sample preparation (to reduce complexity and to enrich for lower abundance components while depleting the most abundant ones), state-of-the-art mass spectrometry instrumentation, and extensive data processing and data analysis are required. a wide range of proteomic approaches are available such as gel-based applications include one-dimensional and twodimensional polyacrylamide gel electrophoresis [ , ] , and gel-free high throughput screening technologies are equally available, including multidimensional protein identification technology [ ] , isotope-coded affinity tag icat [ ] ; silac [ ] ; isobaric tagging for relative and absolute quantitation (itraq) [ ] . shotgun proteomics [ ] and de dige [ ] as well as protein microarrays [ , ] are applied to obtain overviews of protein expression in tissues, cells, and organelles. large-scale western blot assays [ ] , multiple reaction monitoring assay (mrm) [ ] , and label-free quantification of high mass resolution lc-ms data [ ] are being explored for high throughput analysis. many different bioinformatics tools have been developed to aid research in this field such as optimizing the storage and accessibility of proteomic data or statistically ascertaining the significance of protein identifications made from a single peptide match [ ] . in this review we attempt to provide a overview of the major developments in the field of proteomics, some success stories as well as challenges that are currently being faced. about - % of all genes in an organism encode integral membrane proteins, which are involved in numerous cellular processes [ ] . membrane proteins constitute % of the typical proteome, yet their propensity to aggregate and precipitate in solution confounds their analysis [ ] . the target residues for tryptic cleavage (i.e., lysine and arginine) are mainly absent in transmembrane helices and preferentially found in the hydrophilic part of these lipid bilayer-incorporated proteins. because of the protein aggregation step of ief, de is unsuitable for the separation of integral membrane proteins and is limited to detection of membraneassociated proteins and membrane proteins with a low hydrophobicity [ ] . membrane solubilization methods have been deployed to analyze enriched membrane fractions and address the solubility issue by using detergents [ ] , organic solvents [ ] , and organic acids [ ] compatible with subsequent proteolytic digestion/chemical cleavage, separation and analysis by lc/ms. in this approach, ( ) an enriched yeast membrane fraction is solubilized with % formic acid in the presence of cyanogens bromide. the concentrated organic acid provides the solubilization agent, and cyanogen bromide, functional under acidic conditions, allows many embedded membrane proteins to be cleaved, ( ) a membrane-enriched microsomal fraction is solubilized by boiling in . % sds and, following isotope-coded affinity tag (icat) labeling, is diluted to reduce the concentration of sds, and ( ) by using an enriched membrane sample, the proteins are thermally denatured and sonicated in % organic solvent (methanol) in the presence of trypsin. the resultant peptide mixture is then analyzed by lc/ms. all three of these methods are effective and optimize the identifications of membrane proteins. another method using high ph and protenase k is optimized specifically for the global analysis of both membrane and soluble proteins [ ] . high ph favors the formation of membrane sheets, while proteinase k cleaves exposed hydrophilic domains of membrane proteins. commercially available nonionic detergents, dodecyl maltoside, and decaethylene glycol mono hexadecyl are proved most efficient membrane protein solubilizers [ ] . another more successful approach to isolate membrane proteins relies on cell surface labeling in combination with high resolution two-dimensional ( d) lc-ms/ms [ ] . in addition, improved analytical tools should be developed, that is, multidimensional liquid chromatography of peptide mixtures generated from membrane proteins, nanoflow chromatographic techniques for hydrophobic transmembrane peptides, and native electrophoresis of membrane protein complexes, which, in combination with mass spectrometry, should lead to the identification of the majority of proteins in the membrane proteome of simple microorganisms. it is important to quantify not only the identified membrane proteins but also to determine the levels of interacting partners. subcellular fractionation techniques that employ a combination of centrifugation steps are a common choice for preparing plasma membrane-(pm-) enriched fractions including detergent-resistant membrane fractions, commonly known as lipid rafts. these methods can offer a significant improvement in specificity for pm proteins over approaches that do not perform any subcellular fractionation, but rather use whole-cell or tissue preparations [ ] . chemical-tagging methods [ ] have been a more applied technique used to enrich for pm proteins and are often used in conjunction with physical separation strategies. this method allows for a specific class of protein or modification of interest to be physically separated from other nontagged proteins. importantly, when chemical tags are attached to the extracellular domain of pm proteins on intact cells, they offer an unrivaled specificity for pm proteins, because they offer a manner to distinguish true pm proteins from intracellular contaminants. cell-surface biotinylation, the covalent attachment of a biotin tag to the extracellular domain of pm proteins, is also a popular choice [ ] [ ] [ ] . serum is a complex body fluid, containing a large diversity of proteins. more than different proteins are present in the human serum and many of them are secreted or shed by cells during different physiology or pathology processes [ ] . serum is expected to be an excellent source of protein biomarkers because it circulates through, or comes in contact all tissues. consequently, serum proteomics has raised great expectations for the discovery of biomarkers to improve diagnosis or classification of a wide range of diseases, including cancers [ ] . however, serum has been termed as the most complex human proteome [ ] with considerable differences in the concentrations of individual proteins, ranging from several milligrams to less than one pictogram per milliliter [ ] . the analytical challenge for biomarker discovery arises from the high variability in the concentration and state of modification of some human plasma proteins between different individuals [ ] . albumin is a protein of very high abundance in serum ( - mg/ml) that would be a prime candidate for complete selective removal prior to performing a proteomic analysis of lower abundance proteins. thus, removal of albumin from serum may also result in the specific removal of low abundance cytokines, peptide hormones, and lipoproteins of interest. immunoglobulins, and antibodies are also abundant proteins in serum that function by recognizing "foreign" antigens in blood and initiating their destruction [ ] . the presence of higher abundance proteins interferes with the identification and quantification of lower abundance proteins (lower than ng/ml in serum). complexity and dynamic range of protein concentrations can be addressed with a combination of prefractionation techniques that deplete highly abundant proteins and fractionate. heparin chromatography coupled with protein g appears to be an efficient and economical strategy to pretreat serum for serum proteomics [ ] . protein prefractionation by immunodepletion and reversed-phase separation of the depleted plasma on mrp-c column provide methods compatible with lc-ms-based analysis. a polyclonal antibody-based system to rapidly deplete multiple high abundant proteins in serum, plasma, csf, and other biological fluids. individual antibody materials are mixed in selected percentages and packed into a column format. albumin can be removed by immunoaffinity columns [ ] , isoelectric trapping [ ] , dye-ligand chromatography [ ] , and peptide affinity chromatography [ ] . another approach involves the removal of igg by affinity chromatography using immobilized protein a or protein g [ ] . a recently developed depletion method that mixes high-specificity polyclonal antibodies (mars) to remove the top proteins in a single purification step is commercially available [ ] . human- multiple affinity removal column depletes the top abundant proteins from human serum, plasma, csf, and other biological fluids. to address d limitations several types of mass spectrometry, in conjunction with various separation and analysis methods, are increasingly being adopted for proteomic measurements [ ] . in contrast, d-page analysis, seldi-tof ms is a rather new method which is especially valuable for the identification of serum-derived biomarkers [ ] . this method is based on proteinchip arrays which carry various chromatographic properties, such as anion exchange, cation exchange, and hydrophilic or hydrophobic surfaces [ ] . for the analysis of serum, only - μl of serum sample is applied to these surfaces; after washing off unbound material, the protein fingerprint can be determined and visualized by time-of-flight mass spectrometry. the advantages of this method are the low amount of sample necessary for analysis, its speed, and high throughput capability. many different groups have used this method and related methods based on prefractionation of serum proteins by beads and subsequent maldi analysis for the identification of biomarkers in serum, urine, pancreatic juice, and other biological fluids [ ] . the necessity of this removal or separation is also illustrated that many proteins found useful as biomarkers [ ] . different fractionation steps (such as electrophoresis, seldi, and liquid chromatography) have been developed to reduce the complexity of serum proteome and to allow the detection and the identification of single proteins [ ] . de and maldi ms had applied to identify candidate biomarkers at early and late stages of lung cancer disease. this method identified proteins in tumor bearing mice this included disease regulated expression of orosomucoid- , a- -macroglobulin, apolipoprotein-a , apolipoprotein-c , glutathione peroxidase- , plasma retinol-binding protein, and transthyretin [ ] . recently proteins were identified by stable isotope labeled proteome (silap) standard coupled with extensive multidimensional separation with tandem mass spectrometry of which proteins were present at . -fold or greater concentrations in the sera of patients with pancreatic cancer [ ] . specimen collection (blood, serum, plasma samples) is an integral component of clinical research. access to high-quality specimens, collected and handled in standardized ways that minimize potential bias or confounding factors, is key to the "bench to bedside" aim of translational research [ ] . variables that may impact analytic outcomes include ( ) the type of additive in the blood collection tubes; ( ) sample processing times or temperatures; ( ) hemolysis of the sample; ( ) sample storage parameters; ( ) the number of freeze-thaw cycles [ , ] . the key variable in any analysis is that the case and control samples are handled in the exact same manner throughout the entire analytical process from study design and collection of samples to data analysis [ , ] . these types of differences between samples could have a significant impact on the stability of proteins or other molecules of interest in the specimens. small differences in the processing or handling of a specimen can have dramatic effects in analytical reliability and reproducibility, especially when multiplex methods are used. a representative working group, standard operating procedures internal working group, comprised of members from across early detection research network should be formed to develop standard operating procedures (sops) for various types of specimens collected and managed for biomarker discovery and validation work. limitations of two-dimensional electrophoresis. figure gives the general work flow in proteomics and table addresses their strengths and limitations. two-dimensional electrophoresis ( de) was developed two decades before the term proteomics was coined [ , ] . the de entails the separation of complex protein mixtures by molecular charge in the first dimension and by mass in the second dimension. de analysis provides several types of information about the hundreds of proteins investigated simultaneously, including molecular weight, pi and quantity, as well as possible posttranslational modifications. de is extensively used but mostly for qualitative experiments and this method falls short in its reproducibility, inability to detect low abundant and hydrophobic proteins, low sensitivity in identifying proteins with ph values too low (ph < ) or too high (ph > ) and molecular masses too small (mr < kd) or too large (mr > kd) [ ] [ ] [ ] [ ] . poor separations of basic proteins due to "streaking" of spots and membrane proteins resolution [ ] are limiting factors in de. however, de is the only technique that can be routinely applied for parallel quantitative expression profiling of complex protein mixtures such as whole cell and tissue lysates [ ] and most widely used method for efficiently separating proteins, their variants and modifications (up to proteins). there are two ways to study posttranslational modifications by means of de. first, posttranslational modifications that alter the molecular weight and or pi of a protein are reflected in a shift in location of the corresponding protein spot on the proteomic pattern. second, in combination with western blotting, antibodies specific for posttranslational modifications can reveal spots on de patterns containing proteins with these modifications [ ] . protein extraction and solubilization are key steps for proteomic analysis using de, highly hydrophobic proteins tend to precipitate during isoelectro focusing (ief), low copy number and the insolubility of transmembrane proteins renders quantitative analysis of these peptides and polypeptides are very challenging [ ] . in order to enhance protein extraction and solubilization, different treatments and conditions are necessary to efficiently solubilise different types of protein extracts [ , ] . the major challenge for protein visualization in de is the compatibility of sensitive protein staining methods with mass spectrometric analysis. therefore, several fluorescent staining methods have been developed for the visualization of de patterns, including sypro stainings and cy-dyes [ ] . although sypro ruby [ ] and silver staining [ , ] have a comparable sensitivity, sypro ruby staining allows much higher reproducibility, a significantly wider dynamic range and less false-positive staining. in addition, sypro ruby allows for the detection of lipoproteins, glycoproteins, metalloproteins, calcium-binding proteins, fibrillar proteins, and low molecular weight proteins that are less "stainable" using other methods. finally, a large number of protein spots on de patterns contain several proteins with a similar pi. a ph gradient with a narrow range allows zooming into different proteins with the same molecular weight. increased separation distance × cm gels using ca-ief [ ] could increase the proteome coverage up to proteins. use of overlapping narrow range ipgs "zoom" gels and increase in separation area could yield better membrane protein separation [ ] . this technology, however, is biased against certain classes of proteins including low abundance and hydrophobic proteins. proteins can also be fluorescently labelled with cy , cy , or cy prior to de [ ] . cydyes are cyanine dyes carrying an n-hydroxysuccinimidyl ester reactive group that covalently binds the e-amino group of lysine residues in proteins. during dige [ ] , proteins in each of up to samples can be labelled with one of these fluorescent dyes, and the differentially labelled samples can be mixed and loaded together on one single gel, allowing the quantitative comparative analysis of three samples using a single gel ( figure ). the dige technique has exhibited higher sensitivity as well as linearity, eliminated postelectrophoretic processing (fixing and destaining) steps and enhanced reproducibility by directly comparing samples under similar electrophoretic conditions [ , ] . the resulting images are then analyzed by software such as de-cyder which are specifically designed for d-dige analysis [ ] . the major advantages of d-dige are the high sensitivity and linearity of its dyes, its straightforward protocol, as well as its significant reduction of intergel variability, increasing the possibility to unambiguously identify biological variability, and reducing bias from experimental variation. moreover, the use of a pooled internal standard, loaded together with the control and experimental samples, increases quantification accuracy and statistical confidence [ ] . the dige technique has dramatically improved the reproducibility, sensitivity, and accuracy of quantitation; however, its labeling chemistry has some limitations; proteins without lysine cannot be labeled, and they require special equipment for visualization, and fluorophores are very expensive [ , ] . tag (icat) . gel-free, or ms based, proteomics techniques are emerging as the methods of choice for quantitatively comparing proteins levels among biological proteomes, since they are more sensitive and reproducible than two-dimensional gel-based methods. icat is one of the most employed chemical isotope labeling methods and the first quantitative proteomic method to be based solely on using ms [ , ] . each icat reagent consists of three essential groups: a thiol-reactive group, an isotope-coded light or heavy linker, and a biotin segment to facilitate peptide enrichment. in an icat experiment, protein samples are first labeled with either light or heavy icat reagents on cysteine thiols. the mixtures of labeled proteins are then digested by trypsin and separated through a multistep chromatographic separation procedure. peptides are identified with tandem ms, and the relative quantifications of peptides are inferred from the integrated lc peak areas of the heavy and light versions of the icat-labeled peptides [ ] . the icat concept has been widely used after its introduction [ ] [ ] [ ] . different software programs were developed to analyze icat labeled ms data (e.g., proicat from applied biosystems, spectrum mill from agilent technologies, and sashimi from the institute of system biology [ ] ). icat is extremely helpful to detect peptides with low expression levels, which is one of the bottleneck issues in analytic protein techniques [ , ] . however, major limitations of this technique include selective detection of proteins with high cysteine content and difficulties in the detection of acidic proteins [ , ] . the methods for direct comparison of dige and icat for the identification and quantification of proteins in complex biological mixtures are also being considered [ ] . while the icat reagent only interacts with the free sulfhydryl of homocysteine and % protein is noncysteine, the silac has emerged as a valuable proteomic technique [ ] which becomes more common for cell types and have been applied in many fields [ ] [ ] [ ] . the silac technique can be effectively expanded to compare the differential expression levels of tissue proteome at different pathological states, which allows to identify new candidate biomarkers [ ] . compared with the icat, a popular in vitro labeling, silac as an example of in vivo coding requires no chemical manipulation, and there is very little chemical difference between the isotopically labeled amino acid and its naturally occurring counterpart [ ] . in addition, the amount of labeled proteins requires for analysis using silac technique is far less than that with icat. therefore, the silac-based method has broadly applied in many areas of cell biology and proteomics. except that the silac-based quantitative method is powerful in comparative/differential proteomics, it has been widely used in analyzing protein posttranslational modification, such as protein phosphorylation, detection of protein-protein or peptide-protein interactions and investigating signal transduction pathways [ , ] .though there are numerous advantages for using silac-based methods compared to chemical labeling, a major drawback of silac is that it cannot be applied to tissue protein analysis directly. to overcome this shortcoming, silac has been successfully applied to tissue proteome based on n isotope labeling [ ] . microorganisms such as malaria parasite can be labeled with isoleucine [ ] . latterly the culturederived isotope tags (cdits) method was developed as an alternative quantitative approach for studying the proteome of mammalian tissues based on the application of silac [ ] . o stable isotope labeling. differential o/ o coding relies on the o exchange that takes place at the cterminal carboxyl group of proteolytic fragments, where two o atoms are typically replaced by two o atoms by enzyme-catalyzed oxygen exchange in the presence of h o [ ] . the resulting mass shift between differentially labeled peptide ions permits identification, characterization, and quantitation of proteins from which the peptides are proteolytically generated. in contrast to icat, o labeling does not favor peptides containing certain amino acids (e.g., cysteine), nor does it require an additional affinity step to enrich for these peptides [ ] . unlike itraq, o/ o labeling does not require a specific ms platform nor does it depend on fragmentation spectra (ms ) for quantitative peptide measurements. it is amenable to the labeling of human specimens (e.g., plasma, serum, tissues), which represents a limitation of metabolic labeling approaches (e.g., silac). taken together, recent advancements in the homogeneity of o incorporation, improvements made on algorithms employed for calculating o/ o ratios and the inherent simplicity of this technique should result in increased use of o labeling [ ] . in general, o labeling suffers from two potential drawbacks, inhomogeneous o incorporation and inability to compare multiple samples within a single experiment. a dual o labeling using a non-gel-based platform has been developed to overcome the major problems of existing proteolytic o labeling methods [ ] . (itraq). the itraq reagent is well known for relative and absolute quantitation of proteins. the itraq technology offers several advantages, which include the ability to multiplex several samples, quantification, simplified analysis and increased analytical precision and accuracy [ ] [ ] [ ] . the interest of this multiplexing reagent is that or analysis samples [ ] can be quantified simultaneously. in this technique, the introduction of stable isotopes using itraq reagents occurs on the level of proteolytic peptides ( figure ). this technology uses an nhs ester derivative to modify primary amino groups by linking a mass balance group (carbonyl group) and a reporter group (based on n-methylpiperazine) to proteolytic peptides via the formation of an amide bond [ ] . due to the isobaric mass design of the itraq reagents, differentially labelled peptides appear as a single peak in ms scans, reducing the probability of peak overlapping. when itraq-tagged peptides are subjected to ms/ms analysis, the mass balancing carbonyl moiety is released as a neutral fragment, liberating the isotope-encoded reporter ions which provides relative quantitative information on proteins. an inherent drawback of the reported itraq technology is due to the enzymatic digestion of proteins prior to labelling, which artificially increases sample complexity and this approach needs a powerful multidimensional fractionation method of peptides before ms identification. prefractionation of proteins based on electrokinetic methodologies in free solution essentially relaying on the isoeletric focusing (ief) has gained wide acceptance. many commercial devices are now constructed to take the advantage of this principle ( table ). reproducible fractionation steps will break down the sample complexicity while concentrating low abundant species, resulting in more confident protein identifications and quantification by d gels, mass spectrometry, and protein arrays. a good example of a innovation is liquid-phase isoelectric focusing (ief) as a prefractionation tool before the first dimension of d gel electrophoresis [ , ] . for more consistent pi separation, the zoom ief fractionator [ ] and multicompartment electrolyser (mce) [ ] are being used to prefractionate the proteins. the fractionated samples can be directly applied on standard narrow range ipg strips for d electrophoresis. this allows at least to separate proteins to be analyzed, including proteins of very low abundance. ief, a highresolution electrophoresis technique, has been widely used in shotgun proteomic experiments [ ] . ief runs in a buffer-free solution containing carrier ampholytes or in immobilized ph gradient (ipg) gels. the use of ipg-ief for the separation of complex peptide mixtures has been applied to the analysis of plasma and amniotic fluid [ , ] as well as to bacterial material [ ] . the ipg gel strip is divided into small sections for extraction and cleaning up of the peptides. this technique recovers the sample from the liquid phase and was demonstrated to be of great interest in shotgun proteomics [ ] . ief is not only a high resolution and high capacity separation method for peptides, it also provides additional physicochemical information like their isoelectric point [ , ] . the pi value provided is used as an independent validating and filtering tool during database search for ms/ms peptide sequence identification [ ] . the recent introduction of commercially available offgel fractionator system by agilent technologies provides an efficient and reproducible separation technique [ ] . this separation is based on immobilized ph gradient (ipg) strips and permits to separate peptides and proteins according to their isoelectric point (pi) but is realized in solution [ ] . its micropreparative scale provides fraction volumes large enough to perform subsequent analyses as reverse phase (rp)-liquid chromatography (lc)-maldi ms/ms. the combined use of itraq labeling and offgel fractionation methods for the proteomic study of complex sample is also being considered [ , ] . in this procedure, a large well is used to separate the sample by page and lanes are created on the membrane containing immobilized protein with the use of a manifold [ ] . compatible combinations of primary antibodies are predetermined, with the criterion of being able to identify proteins that do not comigrate. different combinations of primary antibodies are added to each well, with appropriate dilutions of each primary antibody so that expressed proteins are detected in a single condition. the scalability of the system depends on defining suitable combinations of primary antibodies, with up to antibodies in lanes being used in the largest screens. detection software is used to identify proteins based on their expected and observed gel mobility. unlike d page and hplc-ms/ms, large-scale western blotting only identifies proteins for which antibodies are already available. while this is not an appropriate screen for identifying uncharacterized proteins, it greatly simplifies the verification and functional analyses of proteins that are detected. in addition, this approach is highly flexible, and can be focused to particular sets of proteins or protein function, such as cell signaling molecules. importantly, the foundation of this approach is the large amount of data on individual antibodies, which are already available and characterized in the literature [ ] . another approach to analyse proteomes without gels is "shotgun" analysis using mudpit [ ] . in the mudpit approach, protein samples are subject to sequencespecific enzymatic digestion, usually with trypsin and endoproteinase lysc, and the resultant peptide mixtures are separated by strong cation exchange (scx) and reversed phase (rp) high performance liquid chromatography (hplc) [ , ] . peptides from the rp column enter the mass spectrometer and ms data is used to search the protein databases [ ] . the mudpit technique generates an exhaustive list of proteins present in a particular protein sample, it is fast and sensitive with good reproducibility however, it lacks the ability to provide quantitative information [ ] [ ] [ ] . a combination of hplc, liquid phase isoelectric focusing, and capillary electrophoresis provides other multimodular options for the separation of complex protein mixtures [ ] . high throughput production of human proteins using different methods is being developed to make protein array approach more practical. recently simple and efficient production of human proteins using the versatile gateway vector system has been developed [ ] . in this approach, protein expression system is applied to the in vitro expression of human proteins and assessed their biological activity in two functional categories and developed "human protein factory" infrastructure which includes the resources and expression technology for in vitro proteome research. in another approach, dna array to protein array (dapa) is utilized, which allows the "printing" of replicate protein arrays directly from a dna array template using cellfree protein synthesis [ ] . based on the nucleic acid programmable protein array (nappa) concept, high-density self-assembling protein microarray is developed to display thousands of proteins that are produced and captured in situ from immobilized cdna templates [ ] . this method will enable various experimental approaches to study protein function in high throughput. the adventage of protein-based microarrays allows the global observation of biochemical activities on an unprecedented scale, where hundreds or thousands of proteins can be simultaneously screened for protein-protein, proteinnucleic acid, and protein-small molecule interactions, as well as posttranslational modifications [ , ] . the microarray format provides a robust and convenient platform for the simultaneous analysis of thousands of individual protein samples, facilitating the design of sophisticated and reproducible biochemical experiments under highly specific conditions [ ] . the principal challenges in protein array development are -fold: ( ) creation of a comprehensive expression clone library; ( ) high-throughput protein production, including expression, isolation, and purification; ( ) adaptation of dna microarray technology to accommodate protein substrates [ ] . functional protein microarrays differ from analytical arrays in that functional protein arrays are composed of arrays containing fulllength functional proteins or protein domains (figure ) . these protein chips are used to study the biochemical activities of an entire proteome in a single experiment. they are used to study numerous protein interactions, such as protein-protein, protein-dna, protein-rna, proteinphospholipid, and protein-small molecule interactions [ , ] . companies have introduced protein arrays aimed not only at proteomic analysis but also functional analyses of proteins (e.g., biacore ab, ciphergen biosystems inc., phylos inc.). affinity proteomics aim to produce antibodies to every protein expressed by the human genome and these will be characterized against purified antigens and tested on tissue arrays to collect information about their specificity for tissue antigens [ ] . companies are focused to produce various binding partners, for example, affibodies, monoclonal antibodies, and their fragments [ ] . protein chips will likely be the next major manifestation of the revolution in proteomics and offer another solution to analyze low abundant proteins and have the potential for high throughput applications to identify biomarkers [ ] . protein chips differ from previously described methods; whereas screening by de or lc ms/ms can potentially detect any protein, and protein chips can only provide data on set of proteins selected by the investigator [ ] . the development and application of high throughput, multiplex immunoassays that measure hundreds of known proteins in complex biological matrices, is becoming a significant tool for quantitative proteomics studies, diagnostic discovery, and biomarker-assisted drug development. two broad categories of antibody microarray experimental formats have been developed [ ] , direct labelling, single antibody experiments [ ] , dual antibody, sandwich immunoassays are described [ , ] . in the direct labelling method, all proteins in a complex mixture are tagged, providing a means for detecting bound proteins following incubation on an antibody microarray. in the sandwich immunoassay format, proteins captured on an antibody microarray are detected by a cocktail of detection antibodies, each antibody matched to one of the spotted antibodies. in addition, a variety of microarray substrates have been described, including nylon membranes, plastic microwells, planar glass slides, gel-based arrays and beads in suspension arrays. much effort has been expended in optimizing antibody attachment to the microarray substrate. finally, various signal generation and signal enhancement strategies have been employed in antibody arrays, including colorimetry, radioactivity, fluorescence, chemiluminescence, quantum dots and other nanoparticles, enzyme-linked assays, resonance light scattering, tyramide signal amplification, and rolling circle amplification. each of these formats and procedures has distinct advantages and disadvantages, relating broadly to sensitivity, specificity, dynamic range, multiplexing capability, precision, throughput, and ease of use. in general, multiplexed microarray immunoassays are ambient analyte assays [ ] . given the heterogeneity of antibody array formats and procedures currently in use in proteomics studies, and the absence of a "gold standard," there exists an urgent need for development and adoption of standards that permit platform comparisons and benchmarking. regardless of the choice of a given proteomic separation technique, gel-based or gel-free, a mass spectrometer is always the primary tool for protein identification. during the last decade, significant improvements have been made in the application of ms for the determination of protein sequences [ ] . mass spectrometers consist of an ion source, the mass analyzer, and an ion detection system. analysis of proteins by ms occurs in three major steps (a) protein ionization and generation of gas-phase ions, (b) separation of ions according to their mass to charge ratio, and (c) detection of ions [ ] . in gel-free approaches such as icat and mudpit, samples are directly analyzed by ms whereas, in gel-based proteomics ( de and d-dige), the protein spots are first excised from the gel and then digested with trypsin. the resulting peptides are then separated by lc or directly analyzed by ms. the experimentally derived peptide masses are correlated with the peptide fingerprints of known proteins in the databases using search engines (e.g., mascot, sequest). there are two main ionization sources which include matrix assisted laser desorption/ionization (maldi) and electrospray ionization (esi) and four major mass analyzers, which are time-of-flight (tof), ion trap, quadrupole, and fourier transform ion cyclotron (ftic) which are currently in use for protein identification and characterization [ ] . a combination of different mass analyzers in tandem such as quadrupole-tof and quadrupole-ion trap has combined the individual strengths of different types of mass analyzers and greatly improved their capabilities for proteome analysis [ ] . simple mass spectrometers such as maldi-tof are used for only measurement of mass, whereas tandem mass spectrometers are used for amino acid sequence determination [ ] . in maldi the sample of interest is crystallized with the matrix on a metal surface and a laser ion source causes excitation of matrix along with the analyte ions, which are then released into the gas phase. maldi measures the mass of peptides derived from a trypsinized parent protein and generates a list of experimental peptide masses, often referred to as "mass fingerprints" [ , ] . in esi, the analyte is ionized from a solution and transferred into the gas phase by generating a fine spray from a high voltage needle which results in multiple charging of the analyte and generation of multiple consecutive ions. tandem mass spectrometry or ms/ms is performed by combining two different ms separation principles. in tandem ms, individual trypsin-digested peptides are fragmented after a liquid phase separation. tandem ms instruments such as triple quadrupole, quadrupole ion trap, fourier transform ion-cyclotron resonance, or quadrupole time-of-flight are used in lc-ms/ms or nanospray experiments with electrospray ionization (esi) to generate peptide fragment ion spectra [ ] . ion mobility spectrometry (ims) has been utilized as a rapid gas-phase separations strategy for biomolecular ions [ , ] . the strategy provides high sensitivity because the gas-phase dispersion of peptide ions separates features corresponding to low abundance species from interfering chemical noise [ ] . reduced spectral congestion also allows for the use of shorter experimental run times (lc separations) without sacrificing throughput; short analysis time scales are key to measuring the large numbers of samples required to determine normal protein variability prior to realizing individual plasma profiling. additionally, mobility-dispersed ions can be fragmented and mobility linked to fragment ions without ion loss from precursor mass selection [ ] . these advantages have been demonstrated in head-to-head comparisons with conventional lc-ms/ms technology using rapid ( minutes) lc gradients [ ] . accurate mass and time (amt) tag approach [ ] addresses an analogous situation in lc-ms-based proteomics studies. in this approach, initial lc-ms/ms analyses are performed on prefractionated peptide samples in order to provide peptide sequence identifications. these experiments are relatively low throughput because the peptide prefractionation can be quite extensive and require separate lc-ms/ms analyses for each fraction. the high-throughput accurate mass and time (amt) tag proteomic approach was utilized to characterize the proteomes for cytoplasm, cytoplasmic membrane, periplasm, and outer membrane fractions from aerobic and photosynthetic cultures of the gram-negative bacterium rhodobacter sphaeroides . . . there has been a recent trend in proteomics toward the development and application of technologies for the targeted analysis of proteins within complex mixtures [ ] . selected reaction monitoring (srm) is a powerful tandem mass spectrometry method that can be used to monitor target peptides within a complex protein digest [ , ] . the specificity and sensitivity of the approach, as well as its capability to multiplex the measurement of many analytes in parallel, has made it a technology of particular promise for hypothesis driven proteomics. the use of tandem mass spectrometry data acquired on an ltq ion trap mass spectrometer can accurately predict which fragment ions will produce the greatest signal in an srm assay using a triple quadrupole mass spectrometer [ ] . one of the biggest benefits of a targeted assay on a triple quadrupole mass spectrometer is high throughput. using the selectivity of multiple stages of mass selection of a tandem mass spectrometer, these targeted srm assays are the mass spectrometry equivalent of a western blot [ ] . an advantage of using targeted mass spectrometry-based assay over a traditional western blot is that it does not rely on the creation of any immunoaffinity reagent. while its application is novel in the proteomics community, srm has been utilized for several decades in the toxicology and pharmacokinetics disciplines [ ] . peptidebased immunofractionation methods show potential for proteome wide screening approaches but are limited by the availability of antibodies [ , ] . the stable isotope standards with capture by antipeptide antibodies (siscapa) approach is based on the addition of stable isotope labeled standard peptides to the digested clinical sample followed by immunoaffinity enrichment of standard and analyte peptide by highly specific antipeptide antibodies [ , ] . this approach enables the absolute quantification of selected diagnostic peptides from digested clinical samples down to physiologically relevant analyte concentrations (ng/ml) at high precision ( % cv) and accuracy [ , ] . further improvement of mrm-based biomarker quantification should be possible if whole sets of analyte peptides can be enriched by immunofractionation. since this method relies on one specific antibody per target protein/peptide the generation of more than antibodies is necessary for proteome wide screening approaches. novel peptide affinity enrichment strategies enabling proteome wide analyses of signature peptides may provide an important addition to future proteome workflows. undoubtedly, the accuracy, high throughput, and robustness of ms technologies have made the characterization of entire proteomes a realistic goal [ , ] . the major bottlenecks in proteomics research today are related to data analysis to create an environment where computer scientists and biologists and the people who collect data can work closely together, so they can develop the necessary analytical tools that will help interpret the data [ ] [ ] [ ] . processing and analysis of proteomics data is indeed a very complex multistep process ( figure ). the meaningful comparison, sharing, and exchange of data or analysis results obtained on different platforms or by different laboratories remain cumbersome mainly due to the lack of standards for data formats, data processing parameters, and data quality assessment. accurate, consistent, and transparent data processing and analysis are integral and critical parts of proteomics workflows [ ] . we can now generate huge amounts of data, and currently there is an enormous challenge to figure out how to actually analyze this data and generate real biological insights. the necessity of an integrated pipeline for processing and analysis of complex proteomics data sets has therefore become critical. validation. this step consists of the assignment of ms/ms spectra to a database search using one of several engines available (e.g., sequest, mascot, comet, x!tandem, etc.). one of the difficulties related to the use of sequest for peptide identifications is the lack of methods to globally evaluate the quality of data and the lack of methods to access global changes created by filtering schemes and/or database changes [ ] . most approaches are matching and scoring large sets of experimental spectra with predicted masses of fragment ions of peptide sequences derived from a protein database. results are scored according to a scheme specific to each search engine that also depends on the database used for the search. usually tools are linked to one specific platform or were optimized for one instrument type. the various search engines do not yield identical results as they are based on different algorithms and scoring functions, making comparison and integration of results from different studies or experiments tedious [ , ] . peptide identification via database searches is very computationally intensive and time-demanding. high quality data allow more effective searches due to tighter constrains, that is, tolerance on precursor ion mass and charge state assignment, which will drastically reduce the search time in case of an indexed database. in addition, accurate mass measurements of fragment ions further simplify the database searches and add confidence to the results. the association of identified peptides with their precursor proteins is a very critical and difficult step in shotgun proteomics strategies as many peptides are common to several proteins, thus leading to ambiguous protein assignments. therefore it becomes critical to have an appropriate tool that is able to assess the validity of the protein inference and associate a probability to it. protein prophet database tool combines probabilities assigned to peptides identified by ms/ms to compute accurate probabilities for the proteins present [ ] . : - , . impossible. the lack of common standards and protocols has led to this situation and often resulted in duplication of efforts. results were usually reported as a set of identified proteins (i.e., list of peptides identified and associated proteins) with minimal supporting data. obviously the large volume of such data sets has made publication of detailed results using classical mechanisms very challenging. sharing and exchange of data and results requires the definition of standard formats for the data at all levels (including raw mass spectrometric data, processed data, and search results) as well as a better definition (and/or standardization) of the parameters used for the data processing or the database searches. organellar proteomics aims to describe the full complement of proteins of subcellular structures and organelles. identification of the proteins contained in subcellular organelles has become a popular proteomics endeavor [ ] . when compared with whole-cell or whole-tissue proteomes, the more focused results from subcellular proteomic studies have yielded relatively simpler datasets from which biologically relevant information can be more easily extracted [ ] . subcellular fractionation consists of two major steps, disruption of the cellular organization (homogenization) and fractionation of the homogenate to separate the different populations of organelles. such a homogenate can then be resolved by differential centrifugation into several fractions containing mainly ( ) nuclei, heavy mitochondria, cytoskeletal networks, and plasma membrane; ( ) light mitochondria, lysosomes, and peroxisomes; ( ) golgi apparatus, endosomes and microsomes, and endoplasmic reticulum; ( ) cytosol. each population of organelles is characterized by size, density, charge, and other properties on which the separation relies [ ] . analyzing subcellular fractions and organelles allows tracking proteins that shuttle between different compartments, for example, between the cytoplasm and nucleus. a high dynamic range of proteins can be partially achieved by fractionation of the proteome into subproteomes by applying affinity purification may allow proteomic analysis of low copy number proteins [ ] . the nuclear, chloroplast, amyloplast, plasma membrane, peroxisome, endoplasmic reticulum, cell wall, and mitochondrial proteomes were successfully characterized in arabidopsis [ ] . several groups have taken advantage of this approach to recover a higher percentage of membrane proteins from subcellular extracts using various nonionic and zwitterionic detergents or phase-partitioning methods. these efforts resulted in the successful determination of the protein complement of the thylakoid and envelope membrane systems of the chloroplast [ ] . by enriching for the protein class of interest based on a particular chemical/physical characteristic(s), offer the advantage of reducing sample complexity and access to lower abundance proteins in a discoverydriven experimental approach [ ] . free flow electrophoresis (ffe) utilizes differences in electrophoretic mobility rather than density to separate cells or subcellular organelles [ ] . ffe has previously been used in separating endosomes from hamster ovary cells [ ] , plasma membrane from human platelets [ ] , and insulin transporting vesicles in liver cells. the separation is based on the electrophoretic motility of cells or cell organelles suspended in a vertical free flowing buffer film on which an electric field is applied at a right angle to the flow direction. ffe has been a most valuable tool in the investigation of the composition of secretory vesicles and in addition, it has clarified how the membrane of plasma membrane vesicles is oriented after nitrogen disruption of human neutrophils [ ] . importantly, subcellular fractionation is a flexible and adjustable approach that may be efficiently combined not only with d gel electrophoresis but also with gelindependent techniques. however, they do have limitations of considerable cross-contamination with other subcellular organelles. ptms of proteins are considered to be one of the major determinants regarding organisms complexity [ ] . to date, at least more than different types of ptms have been identified of which only a few are reversible and important for the regulation of biological processes. specific functions are usually mediated through ptms, such as phosphorylations, acetylations, or glycosylations, which places additional demands on the sensitivity and precision of the method [ ] . one of the most studied ptms is protein phosphorylation, because it is vital for a large number of protein functions that are important to cellular processes spanning from signal transduction, cell differentiation, and development to cell cycle control and metabolism. enzymes and receptors can be switched "on" and "off " by phosphorylation and dephosphorylation. it was estimated that - % of proteins are phosphorylated. phosphorylation often occurs on serine, threonine, and tyrosine residues in eukaryotic proteins [ ] . analysis of the entire cellular phosphoproteome has been an attractive study subject since the discovery of phosphorylation as a key regulatory mechanism of cell life. unfortunately, phosphoproteins analysis is not straightforward for five main reasons. first, the stoichiometry of phosphorylation is generally relatively low, because only a small fraction of the available intracellular pool of a protein is phosphorylated at any given time as a result of a stimulus. second, the phosphorylatation sites on proteins might vary, implying that any given phosphoprotein is heterogeneous (i.e., it exists in several different phosphorylated forms). third, many of the signaling molecules, which are major targets of phosphorylation events [ ] , are present at low abundance within cells and, in these cases; enrichment is a prerequisite before analysis. fourth, most analytical techniques used for studying protein phosphorylation have a limited dynamic range, which means that although major phosphorylation sites might be located easily, and minor sites might be difficult to identify. finally, phosphatases could dephosphorylate residues unless precautions are taken to inhibit their activity during preparation and purification steps of cell lysates. in addition, various methods for protein phosphorylation site determination have been developed, yet this task remains a technical challenge [ ] . western blot has been widely used to determine the presence of ptms. however, this technique relies on the prior knowledge of the type and position of specific modifications and the availability of antibodies. it has low throughput and not ideal for studying highly complicated samples. specific chemical or affinity enrichment steps are usually incorporated into the sample preparation or fractionation stages of the general scheme of proteomic studies [ , ] . well established methods involving the analysis of p-labeled phosphoproteins by edman degradation and two-dimensional phosphopeptide mapping have proven to be powerful but not without limitations. consequently, mass spectrometry (ms) has emerged as a reliable and sensitive method for the characterization of protein phosphorylation sites [ ] and may therefore represent a method of choice for the analysis of protein phosphorylation [ ] . immobilized metal affinity chromatography (imac), metal oxide affinity chromatography (moac), and covalent methods are all capable of selectively enriching phosphopeptides [ ] . moac based on adsorption to tio is especially attractive, but as with all techniques, loading, rinsing, and elution solutions must be carefully selected to minimize nonspecific adsorption and to maximize the detection of both monophosphorylated and multiphosphorylated species. imac might not provide the selectivity available with tio enrichment, but with appropriate reagents, imac can be selective and sensitive for monophosphorylated and tetraphosphorylated peptides. however, some buffers and reagents such as edta are not compatible with imac, so hplc purification may be needed prior to this technique [ ] . when trying to isolate and identify as many phosphoproteins as possible in a cell lysate, chromatographic column-based methods are required. multiple elutions from imac or moac columns or even gradient elutions can help to simplify fractions of proteins and reveal more peptides [ , ] . a combination of techniques can reveal large numbers of phosphopeptides in complex samples, but comprehensive phosphoproteomics is still not possible. for the highest protein coverage, future phosphoproteomic techniques will likely employ multiple enrichment techniques along with two-dimensional separations, but such studies are time consuming. combinations of affinity-based enrichment and extraction methods, multidimensional separation technologies, and mass spectrometry are particularly attractive for systematic investigation of posttranslationally modified proteins in proteomics [ ] . organisms. the application of proteomics and related technologies for the analysis of proteome is severely hampered by the lack of publicly available sequence information for most of the unsequenced organisms [ ] . despite the precision of the mass information yielded by the seldi technique, a significant number of proteins were found to have no similarity to known peptides, an aforementioned weakness of proteomics studies in nonmodel organisms [ ] . in order to circumvent this limitation, different strategies and tools were developed to make unsequenced organisms amenable to high-throughput proteomics [ ] (figure ). however, an evaluation of their performance in an integrated proteomics strategy using high-throughput shotgun ms data is currently missing. in principle, two different approaches can lead to an increase in protein identifications from unsequenced organisms. in the first approach, ms/ms data are searched against a protein database of an evolutionarily closely related organism. however, as a matter of principle of database-dependent searches, only proteins can be identified that contain at least one peptide with exactly the same sequence as the peptide from a protein in the database. with increasing evolutionary distance this will be an increasingly severe restriction [ ] . in the second approach, the amino acid sequence of a peptide is extracted from the ms/ms spectrum for de novo sequencing, that is, in a fully databaseindependent manner using exclusively the information contained in the ms/ms spectrum. several software tools for peptide de novo sequencing are now available and some of them provide sufficiently good results when applied to high-quality spectra [ ] . a basic limitation of ms de novo sequencing methods is the necessity for backbone cleavage between each pair of adjacent amino acids; a mass value representing a terminal fragment containing only one of the two residues is a first requirement for ordering of a specific pair [ , ] and this limitation urged the need for bioinformatics approaches that can help interpret the proteomics data [ ] . in the past several years there have been very important extremely useful advances in proteomics methods based on bottom-up display and bottom-up identification using peptides [ ] . these methods offer more sensitivity, greater rapidity and greater proteome coverage are often made with the explicit or implicit assertion that these methods are bound to replace more traditional methods based on topdown analysis, especially using d gels [ , ] . the combination of bottom-up display and bottom-up identification has achieved very important successes in detecting the presence of large numbers of different proteins in cells or subcellular organelles [ , ] . the use of specific fractionation schemes and prudent adoption of methods to increase the number of proteins able to be identified and quantified is enabling significant biological advances to be made. further technological developments that enable a larger proportion of the proteome to be visualized will further enhance our ability to characterize biological systems. as such, these advances in proteomics will impact not only academic pursuits but also pharmaceutical, biotechnology and diagnostic research and development [ ] . in the future gel-free techniques mudpit, itraq and o stable isotope labeling could be expected to gain more importance as they become more established. sample prefractionation system provides a highly valuable tool to fractionate proteins and peptides from complex eukaryotic samples like plasma. this approach has a positive influence on the number of proteins identified compared to scx method [ ] . itraq is a very powerful tool, recognised form its ability to relatively quantify proteins. itraq reagent improves maldi ionisation, especially for peptides containing lysine. although silac labelling is easy for any laboratory that uses cell culture, the ms technology that is required is still beyond the capabilities of most groups. one of the factors that contributed to the rapid acceptance of the silac technology was the availability of an open-source program, msquant, for interpreting results. protein microarrays offer the ability to simultaneously survey multiple protein markers in an effort to develop expression profile changes across multiple protein analytes for potential use in diagnosis, prognosis, and measurement of therapeutic efficacy [ ] . this technology is an excellent high-throughput method used to probe an entire collection of proteins for a specific function or biochemistry. it is an exceptional new way to discover previously unknown multifunctional proteins, and to discover new functionalities for well-studied proteins [ ] . a systematic and efficient analysis of vast genomic and proteomic data sets is a major challenge for researchers today. to overcome limitations of current proteomics strategies in regard to the dynamic range of peptides detected and alternative mass spectrometrybased approaches are being explored. targeted strategies exemplified by multiple reaction monitoring detect, quantify, and possibly collect a product ion spectrum to confirm the identity of a peptide with much greater sensitivity because the precursor ion is not detected in the full mass spectrum [ ] . a systematic and efficient evaluation of large-scale experimental results requires ( ) automatic retrieval of user defined information to construct a customized, queryable database; ( ) an intuitive graphical and query platform to display and analyze experimental data in the context of the customized database; ( ) efficient utilization of webbased bioinformatics software tools for data interpretation, prediction of function, and modeling; ( ) scalability and reconstruction of the database in response to changing user needs and an ever-expanding base of knowledge and bioinformatics tools [ ] . creating a software tool to encompass the four crucial features outlined above is a challenging and ongoing task, particularly with respect to the ever-expanding publicly available base of knowledge and bioinformatics tools. the data processing and analysis bottleneck can be overcome through integration of the entire suite of tools into one linear pipeline. the good news is that all of the various proteomics strategies are in phases of very rapid technological development and that important advances in sensitivity, throughput, and proteome coverage can be expected in the near future for all of them. post translational modifications de: two-dimensional gel electrophoresis dige: fluorescence d difference gel electrophoresis esi: electrospray ionization ftic: fourier transform ion cyclotron hplc: high performance liquid chromatography icat: isotope-coded affinity tag itraq: isobaric tags for relative and absolute quantitation ipg: immobilized ph gradient lc: liquid chromatography maldi: matrix-assisted laser desorption/ionization ms: mass spectrometry mudpit: multidimensional protein identification technology tof: time of flight page: polyacrylamide gel electrophoresis scx: strong cation exchange mrm: multiple reaction monitoring assay seldi: surface-enhanced laser desorption/ionization imac: immobilized metal affinity capture. progress with proteome projects: why all proteins expressed by a genome should be identified and how to do it our genome unveiled highthroughput mass spectrometric discovery of protein posttranslational modifications perspectives for mass spectrumetry and functional proteomics proteomic technology for biomarker profiling in cancer: an update d protein electrophoresis: can it be perfected? proteomics-challenges and possibilities in finland. national technology agency advances in clinical cancer proteomics: seldi-tof-mass spectrometry and biomarker discovery challenges and opportunities in proteomics data analysis bioinformatics meets proteomics-bridging the gap between mass spectrometry data analysis and cell biology statistical and computational methods for comparative proteomic profiling using liquid chromatography-tandem mass spectrometry protein identification by mass spectrometry: issues to be considered proteomic approaches in brain research and neuropharmacology recent advances in d electrophoresis: an array of possibilities proteomic analysis by multidimensional protein identification technology quantitative analysis of complex protein mixtures using isotope-coded affinity tags stable isotope labeling by amino acids in cell culture, silac, as a simple and accurate approach to expression proteomics multiplexed protein quantitation in saccharomyces cerevisiae using amine-reactive isobaric tagging reagents an automated multidimensional protein identification technology for shotgun proteomics genetic analysis of the mouse brain proteome protein arrays: the current state-of-the-art proteomics in multiplex a large-scale proteomic analysis of human embryonic stem cells high sensitivity detection of plasma proteins by multiple reaction monitoring of n-glycosites superhirn-a novel tool for high resolution lc-ms-based peptide/protein profiling proteomics in / : developments, applications and challenges generation and initial analysis of more than , fulllength human and mouse cdna sequences molecular diversity of l-type calcium channels. evidence for alternative splicing of the transcripts of three non-allelic genes a method for global analysis of complex proteomes using sample prefractionation by solution isoelectrofocusing prior to two-dimensional electrophoresis quantitative profiling of differentiation-induced microsomal proteins using isotope-coded affinity tags and mass spectrometry enrichment of integral membrane proteins for proteomic analysis using liquid chromatography-tandem mass spectrometry large-scale analysis of the yeast proteomeby multidimensional protein identification technology a method for the comprehensive proteomic analysis of membrane proteins evaluation of nonionic and zwitterionic detergents as membrane protein solubilizers in two-dimensional electrophoresis proteomic analysis of integral plasma membrane proteins proteomic analysis of the hydrophobic fraction of mesenchymal stem cells derived from human umbilical cord blood chemical probes and tandem mass spectrometry: a strategy for the quantitative analysis of proteomes and subproteomes evaluation of biotinylated cells as a source of antigens for characterization of their molecular profile a chemical proteomics approach for the identification of accessible antigens expressed in human kidney cancer identification and relative quantification of membrane proteins by surface biotinylation and two-dimensional peptide mapping the human plasma proteome: history, character, and diagnostic prospects the case for early detection the human plasma proteome: a nonredundant list developed by combination of four separate sources recent advances in bloodrelated proteomics investigating diversity in human plasma proteins the human plasma proteome: history, character, and diagnostic prospects heparin chromatography to deplete high-abundance proteins for serum proteomics sample preparation of human serum for the analysis of tumor markers: comparison of different approaches for albumin and gamma-globulin depletion depletion of the highly abundant protein albumin from human plasma using the gradiflow isolation of albumin from whole human plasma and fractionation of albumin depleted plasma development of mammalian serum albumin affinity purification media by peptide phage display purification and some properties of streptococcal protein g, a novel igg-binding reagent multi-component immunoaffinity subtraction chromatography: an innovative step towards a comprehensive survey of the human plasma proteome quantitative analysis of bacterial and mammalian proteomes using a combination of cysteine affinity tags and n-metabolic labeling proteomic applications for the early detection of cancer the seldi-tof ms approach to proteomics: protein profiling and biomarker identification development of a novel proteomic approach for the detection of transitional cell carcinoma of the bladder in urine tietz fundamentals of clinical chemistry fractionation techniques improve the proteomic analysis of human serum a -de maldi-tof study to identify disease regulated serum proteins in lung cancer of c-myc transgenic mice stable isotope dilution multidimensional liquid chromatography-tandem mass spectrometry for pancreatic cancer serum biomarker discovery standard operating procedures for serum and plasma collection: early detection research network consensus statement standard operating procedure integration working group the need for the review and understanding of seldi/maldi mass spectroscopy data prior to analysis analytical validation of serum proteomic profiling for diagnosis of prostate cancer: sources of sample bias preanalytic influence of sample handling on seldi-tof serum protein profiles protein mapping by combined isoelectric focusing and electrophoresis of mouse tissues. a novel approach to testing for induced point mutations in mammals high resolution two dimensional electrophoresis of proteins strategies for the enrichment and identification of basic proteins in proteome projects current twodimensional electrophoresis technology for proteomics proteomic identification of proteins specifically oxidized in caenorhabditis elegans expressing human abeta ( - ): implications for alzheimer's disease membrane proteins and proteomics: un amour impossible membrane proteomics: use of additive main effects with multiplicative interaction model to classify plasma membrane proteins according to their solubility and electrophoretic properties effect of strong detergents and chaotropes on the detection of proteins in twodimensional gels applications and current challenges of proteomic approaches, focusing on two-dimensional electrophoresis background-free, high sensitivity staining of proteins in one-and two-dimensional sodium dodecyl sulfatepolyacrylamide gels using a luminescent ruthenium complex protein stains for proteomic applications: which, when, why protein detection methods in proteomics research fractionated extraction of total tissue proteins from mouse and human for -d electrophoresis sample complexity reduction for twodimensional electrophoresis using solution isoelectric focusing prefractionation difference gel electrophoresis: a single gel method for detecting changes in protein extracts recent advances in d electrophoresis: an array of possibilities d differential ingel electrophoresis for the identification of esophageal scans cell cancer-specific protein markers the development of the dige system: d fluorescence difference gel analysis technology a novel experimental design for comparative two-dimensional gel analysis: two-dimensional difference gel electrophoresis incorporating a pooled internal standard fluorescent twodimensional difference gel electrophoresis unveils the potential of gel-based proteomics quantitative analysis of complex protein mixtures using isotope-coded affinity tags quantitative protein profiling by mass spectrometry using isotopecoded affinity tags isotopecoded affinity tag (icat) approach to redox proteomics: identification and quantitation of oxidant-sensitive cysteine thiols in complex protein mixtures current trends in differential expression proteomics: isotopically coded tags phosphoprotein isotope-coded solidphase tag approach for enrichment and quantitative analysis of phosphopeptides from complex mixtures design and synthesis of visible isotope-coded affinity tags for the absolute quantification of specific proteins in complex mixtures automated statistical analysis of protein abundance ratios from data generated by stable-isotope dilution and tandem mass spectrometry quantitative profiling of differentiation-induced microsomal proteins using isotope-coded affinity tags and mass spectrometry complementary analysis of the mycobacterium tuberculosis proteome by twodimensional electrophoresis and isotope-coded affinity tag technology mass spectrometry and proteomics quantitative proteome analysis by solid-phase isotope tagging and mass spectrometry quantitative analysis of redox-sensitive proteome with dige and icat stable isotope labeling by amino acids in cell culture, silac, as a simple and accurate approach to expression proteomics mass spectrometricbased approaches in quantitative proteomics a proteomics strategy to elucidate functional protein-protein interactions applied to egf signaling unbiased quantitative proteomics of lipid rafts reveals high specificity for signaling factors stable isotope labeling of arabidopsis thaliana cells and quantitative proteomics by mass spectrometry stable isotope labeling with amino acids in cell culture (silac) for studying dynamics of protein abundance and posttranslational modifications functional and quantitative proteomics using silac identifying dynamic interactors of protein complexes by quantitative mass spectrometry metabolic labeling of mammalian organisms with stable isotopes for quantitative proteomic analysis quantitative proteomics of the human malaria parasite plasmodium falciparum and its application to studies of development and inhibition quantitative mouse brain proteomics using culture-derived isotope tags as internal standards o stable isotope labeling in ms-based proteomics proteolytic o labeling for comparative proteomics: model studies with two serotypes of adenovirus differential protein expression in the cytosol fraction of an mcf- breast cancer cell line selected for resistance toward melphalan non-gel-based dual o labeling quantitative proteomics strategy shotgun proteomics using the itraq isobaric tags itraq is a useful method to screen for membrane-bound proteins differentially expressed in human natural killer cell types a perspective on the use of itraq reagent technology for protein complex and profiling studies eight-channel itraq enables comparison of the activity of leukaemogenic tyrosine kinases multiplexed protein quantitation in saccharomyces cerevisiae using amine-reactive isobaric tagging reagents protein fractionation in a multicompartment device using off-gel tm isoelectric focusing efficient fractionation and improved protein identification by peptide offgel electrophoresis combining microscale solution-phase isoelectric focusing with multiplexed proteomics dye staining to analyze protein posttranslational modifications solution phase isoelectric fractionation in the multi-compartment electrolyser: a divide and conquer strategy for the analysis of complex proteomes a comparison of immobilized ph gradient isoelectric focusing and strong-cation-exchange chromatography as a first dimension in shotgun proteomics two-stage off-gel tm isoelectric focusing: protein followed by peptide fractionation and application to proteome analysis of human plasma proteome analysis of human plasma and amniotic fluid by off-gel tm isoelectric focusing followed by nano-lc-ms/ms exploring glycopeptide-resistance in staphylococcus aureus: a combined proteomics and transcriptomics approach for the identification of resistance-related markers identification of brain cell death associated proteins in human post-mortem cerebrospinal fluid the focusing positions of polypeptides in immobilized ph gradients can be predicted from their amino acid sequences ingel isoelectric focusing of peptides as a tool for improved protein identification gel based isoelectric focusing of peptides and the utility of isoelectric point in protein identification isobaric tags for relative and absolute quantitation (itraq) reproducibility: implication of multiple injections technical, experimental, and biological variations in isobaric tags for relative and absolute quantitation (itraq) neuronal differentiation and long-term culture of the human neuroblastoma line sh-sy y the proteome of the human neuroblastoma cell line sh-sy y: an enlarged proteome from transcriptome to proteome: differentially expressed proteins identified in synovial tissue of patients suffering from rheumatoid arthritis and osteoarthritis by an initial screen with a panel of antibodies a large-scale proteomic analysis of human embryonic stem cells mudpit analysis: application to human heart tissue multidimensional separation of peptides for effective proteomic analysis large-scale analysis of the yeast proteome by multidimensional protein identification technology multidimensional protein identification technology: current status and future prospects multiplexed protein quantitation in saccharomyces cerevisiae using amine-reactive isobaric tagging reagents reproducibility of quantitative proteomic analyses of complex biological mixtures by multidimensional protein identification technology proteomics: from gel based to gel free human protein factory for converting the transcriptome into an in vitro-expressed proteome printing protein arrays from dna arrays nextgeneration high-density self-assembling functional protein arrays application of physicochemically modified silicon substrates as reverse-phase protein microarrays rppaml/rims: a metadata format and an information management system for reverse phase protein arrays multiplexed whole bacterial antigen microarray, a new format for the automation of serodiagnosis: the culture-negative endocarditis paradigm protein analysis on a proteomic scale regulation of gene expression by a metabolic enzyme global analysis of protein activities using proteome chips biochemical and genetic analysis of the yeast proteome with a movable orf collection analyzing antibody specificity with whole proteome microarrays severe acute respiratory syndrome diagnostics using a coronavirus protein microarray a quantitative protein interaction network for the erbb receptors using protein microarrays multiplexed protein profiling on antibody-based microarrays by rolling circle amplification measuring proteins on microarrays methods and applications of antibody microarrays in cancer research multiplexed sandwich assays in microarray format evaluating sandwich immunoassays in microarray format in terms of the ambient analyte regime mass spectrometry and protein analysis analysis of proteins and proteomes by mass spectrometry use of mass spectrometry-derived data to annotate nucleotide and protein sequence databases current initiatives in proteomics research: the plant perspective laser desorption ionization of proteins with molecular masses exceeding , daltons the characteristics of peptide collision-induced dissociation using a high-perfformance maldi-tof/tof tandem mass spectrometer mass spectrometry and proteomics gas-phase separations of protease digests development of high throughput dispersive lcion mobility-tofms techniques for analysing the human plasma proteome formation of peptide aggregates during esi: size, charge, composition, and contributions to noise mobility labeling for parallel cid of ion mixtures advances in proteomics data analysis and display using an accurate mass and time tag approach selective detection of membrane proteins without antibodies: a mass spectrometric version of the western blot analysis of protein phosphorylation by hypothesis-driven multiple-stage mass spectrometry expediting the development of targeted srm assays: using data from shotgun proteomics to automate method development quantitative, multiplexed assays for low abundance proteins in plasma by targeted mass spectrometry and stable isotope dilution current affairs in quantitative targeted proteomics: multiple reaction monitoringmass spectrometry peptidomics: a new approach to affinity protein microarrays antibodybased enrichment of peptides on magnetic beads for massspectrometry-based quantification of serum biomarkers mass spectrometric quantitation of peptides and proteins using stable isotope standards and capture by anti-peptide antibodies (sis-capa) mass spectrometry-based proteomics bioinformatics meets proteomics-bridging the gap between mass spectrometry data analysis and cell biology statistical and computational methods for comparative proteomic profiling using liquid chromatography-tandem mass spectrometry protein identification by mass spectrometry: issues to be considered challenges and opportunities in proteomics data analysis toward a human blood serum proteome: analysis by multidimensional separation coupled with mass spectrometry the need for guidelines in publication of peptide and protein identification data: working group on publication guidelines for peptide and protein identification data reporting protein identification data: the next generation of guidelines a statistical model for identifying proteins by tandem mass spectrometry proteome analysis at the level of subcellular structures complementary methods to assist subcellular fractionation in organellar proteomics subcellular fractionation, electromigration analysis and mapping of organelles affinity purification-mass spectrometry: powerful tools for the characterization of protein complexes the swiss-prot protein knowledgebase and expasy: providing the plant community with high quality proteomic data and tools proteomic study of the arabidopsis thaliana chloroplastic envelope membrane utilizing alternatives to traditional twodimensional electrophoresis subcellular fractionation and proteomics of nuclear envelopes improved subcellular fractionation of the heavy mitochondria pellet using free flow electrophoresis system rapid analytical and preparative isolation of functional endosomes by free flow electrophoresis characterization of human platelet surface and intracellular membranes isolated by free flow electrophoresis chemoattractant-regulated mobilization of a novel intracellular compartment in human neutrophils large-scale analysis of the yeast proteome by multidimensional protein identification technology phosphoproteomics for the discovery of kinases as cancer biomarkers and drug targets mass spectrometry-based proteomics and peptidomics for biomarker discovery in neurodegenerative diseases the extracellular signal-regulated kinase: multiple substrates regulate diverse cellular functions abrf-prg : phosphorylation site determination interpreting the protein language using proteomics proteomic analysis of posttranslational modifications phosphoamino acid analysis signaling- and beyond techniques for phosphopeptide enrichment prior to analysis by mass spectrometry evaluation of the impact of some experimental procedures on different phosphopeptide enrichment techniques improved enrichment strategies for phosphorylated peptides on titanium dioxide using methyl esterification and ph gradient elution simac (sequential elution from imac), a phosphoproteomics strategy for the rapid separation of monophosphorylated from multiply phosphorylated peptides modification-specific proteomics: characterization of post-translational modifications by mass spectrometry a workflow to increase the detection rate of proteins from unsequenced organisms in high-throughput proteomics experiments application of genomics and proteomics for study of the integrated response to zinc exposure in a non-model fish species, the rainbow trout sequence similarity-based proteomics in insects: characterization of the larvae venom of the brazilian moth cerodirphia speciosa a workflow to increase the detection rate of proteins from unsequenced organisms in high-throughput proteomics experiments improving gene annotation using peptide mass spectrometry proteomics studies confirm the presence of alternative protein isoforms on a large scale plips, an automatically collected database of protein lists reported by proteomics studies proteome analysis by mass spectroscopy detection technologies in proteome analysis two-dimensional gel electrophoresis in proteomics: old, old fashioned, but it still climbs up the mountains is proteomics heading in the wrong direction? proteomic mapping of brain plasma membrane proteins how much of the proteome do we see with discovery-based proteomics methods and how much do we need to see? peptides offgel electrophoresis: a suitable pre-analytical step for complex eukaryotic samples fractionation compatible with quantitative itraq labeling development and standardization of multiplexed antibody microarrays for use in quantitative proteomics protein microarray technology integrating biological databases new paradigms in cellular function and the need for topdown proteomics analysis key: cord- - kddituy authors: shirbaghaee, zeinab; bolhassani, azam title: different applications of virus‐like particles in biology and medicine: vaccination and delivery systems date: - - journal: biopolymers doi: . /bip. sha: doc_id: cord_uid: kddituy virus‐like particles (vlps) mimic the whole construct of virus particles devoid of viral genome as used in subunit vaccine design. vlps can elicit efficient protective immunity as direct immunogens compared to soluble antigens co‐administered with adjuvants in several booster injections. up to now, several prokaryotic and eukaryotic systems such as insect, yeast, plant, and e. coli were used to express recombinant proteins, especially for vlp production. recent studies are also generating vlps in plants using different transient expression vectors for edible vaccines. vlps and viral particles have been applied for different functions such as gene therapy, vaccination, nanotechnology, and diagnostics. herein, we describe vlp production in different systems as well as its applications in biology and medicine. © wiley periodicals, inc. biopolymers : – , . v irus-like particles (vlps) known as viral "empty shells" maintain the same structural properties of virions, without genome. these constructs are considered very efficient as vaccine platforms and therapeutic delivery systems. many antigens can readily be displayed on the surface of vlps. these antigens can be genetically or chemically fused to the vlp. regarding to the reports, the immune stimulation by vlps contains: (a) stimulation of innate immunity through tlrs and pattern recognition receptors (prrs) due to the expression of multivalent structures; (b) induction of strong humoral response and also igm in t-cell independent way; and (c) enhancement of the uptake, processing and presentation by apcs through mhc i and mhc ii cross-presentation pathway due to the particulate nature of vlps. vlps can be subcutaneously or intramuscularly injected. their small diameter facilitates entry into lymphatic vessels and direct drainage into local lymph nodes. once in the lymph node, vlps are taken up by lymph node resident dendritic cells (dcs). this uptake is enhanced by the size and form of vlps. vlps stimulate cd t cells via the mhc ii pathway, as well as highly efficient cross-presentation on the mhc class i pathway. generally, viral-like particles, are considered as vaccine candidates because their natural properties such as multimeric antigens and their specific structures are suitable for the stimulation of efficient humoral and cellular immunity. currently, the development of recombinant subunit vaccines (suvs) has been significantly increased using heterologous expression systems. antigens derived from many bacterial, viral, fungal and parasitic pathogens were used for safe and effective vaccination. five vlp-based vaccines have been already approved including three for hbv and two for hpv, while in the veterinary field; a vlp-based vaccine against porcine circovirus type (pcv ) has been approved. some vlp-based vaccines targeting human and animal diseases are recently in late stages of clinical trials. vlps have a positive value as academic, industrial, and commercial systems especially in gene therapy and design of nanomaterials. however, the study of the vlp-based applications (vaccination, gene and drug delivery, and imaging) must be followed to show the reliability, and cost efficiency of this technology. furthermore, the expression systems would be improved to achieve the best strategy for vlp production from different viral genes. this review will focus on vlp characteristics and its applications especially as vaccines or delivery systems for dna, sirna and drugs. it should be noted that in the vaccination field whenever a viral-like particle carries genetic material is called "vectored vaccines " and in gene therapy, they are called viral vectors. however, for simplicity in this review, we called all particles entitled as viral-like particles (vlps). viral-like particles (vlps) have been generated for over thirty various infectious viruses in animals and humans. vlps are composed of one or more structural (/capsid) proteins possessing natural properties for self-assembly, and are morphologically similar to authentic viruses. , comparing to live viruses, vlps are non-replicating and non-infective due to the lack of infectious genetic material. virus-like particles have the potential to be used as safe vaccine candidates without the need for any adjuvant. , different viruses present different structures for generation of viral-like particles such as: a. simple viral capsids with one or two major proteins (e.g., parvoviruses, papillomaviruses, circovirses, calciviruses, hepatitis e virus (hev) and polyomaviruses). b. complex viral capsids with various protein layers, encoded by many distinct mrnas, or generated from a single polyprotein (e.g., picornaviruses). c. viral capsids with lipid envelopes including a lipid bilayer obtained from the host cell, as well as viral glycoprotein spikes (e.g., influenza, hiv and hepatitis c). , figure shows the general model of vlp along with its applications. the selection of expression vector is one of the major factors in vlp generation. the reports showed the successful production of vlps indicating that bacterial systems, yeast and insect systems are used in %, %, and % of the cases. in addition, mammalian cells ( %) and plants ( %) were usually applied to produce vlps with special properties. bacterial systems are often included the commercial e.coli strains and expression vectors, to produce non-enveloped vlps in high levels compared to other systems (table i) . in addition, bacterial cells have been applied for generation of vlps which need several types of structural proteins, such as the avibirnavirus ibdv vp , vp , and vp -polyproteins. the reports indicated that the expression of the hepatitis b virus (hbv) capsid protein in e. coli leads to the formation of structures similar to the hbv core (hbc) particle. bacterial figure general model of vlp along with its applications: the picture shows the recombinant hpv l pentamers assembled in vitro into capsid-like structures. self-assembly of recombinant viral coat proteins into empty capsids is a promising strategy for production of virus-like particles (vlps) in vaccine design. the resulting vlps can induce a protective immune response by mimicking the authentic epitopes of virions. systems are not always a desired plan for vlp production due to several factors, such as (a) lack of ability to generate recombinant proteins with mammalian-like post-translational modifications (ptm), (b) failure to produce the correct disulfide bonds, (c) drawbacks of protein solubility, and d) the existence of lipopolysaccharides (lps)/or endotoxins in production of recombinant proteins (rp). viral coat proteins (cps) can be efficiently produced as insoluble inclusion bodies, purified under denaturing conditions, refolded, and selfassembled, as indicated in the parvovirus b and the ccmv and cmv plant viruses. a simple change in the cultivation conditions such as low-temperature can solve the problem of inclusion bodies and induce the formation of soluble vlps, as performed for two viral systems, the densovirus ihhnv, and the potyvirus pvy. some factors including the resistance markers of the expression plasmids and the composition of the cultivation medium can also change the vlp assembly (e.g., bacteriophage qb). another strategy applied to increase expression levels and solubility involves the use of different fusion protein systems, e.g., glutathione-s-transferase (gst) fusion proteins such as the papillomavirus l , the polyomavirus mupyv, and the picornavirus fmdv. [ ] [ ] [ ] [ ] other prokaryotic hosts have been recently used to generate vlp, e.g., lactobacillus. the intracellular assembly of hpv l vlp was reported in lactobacillus casei, a lactose-inducible expression strain. furthermore, the production of l vlps using lactobacillus developed new live mucosal prophylactic vaccines (table i) . , a pseudomonas fluorescens (p. fluorescens) expression system is an efficient choice against e. coli, because of simple manipulation, high yields of active and soluble proteins, and largescale cultivation. some differences between p. fluorescens and e. coli including the various sizes of genome, and diverse metabolic approaches can influence the generation of recombinant proteins. the capsid protein of a plant bromovirus, the cowpea chlorotic mottle virus (ccmv), has been recently expressed as a soluble form in p. fluorescens, and assembled into vlps in vivo. this construct was structurally similar to the natural viral particles provided from plants. eukaryotic expression systems are a striking alternative to bacteria, especially for solving the problem of bacterial endotoxins in vaccine development. some structural genes of mammalian viruses expressed in yeast are able to form the vlp. this expression host has been efficiently applied to generate the first licensed hbv vaccine. hbsag is one of the antigens commonly utilized for production of vlp-based hbv vaccine. hbsag has been expressed in pichia pastoris (p. pastoris), sac-charomyces cerevisiae (s. cerevisiae) and hansenula polymorpha (h. polymorpha) (table i) . , it is critical to consider that the viral-like particles are not always formed during the cultivation procedure of the yeast cells. these studies showed that the selfassembly of the vlps in pichia system should be completed during the protein purification. , [ ] [ ] [ ] the expression and selfassembly of recombinant bacteriophage q coat protein (q-cp) was indicated in saccharomyces cerevisiae and pichia pastoris. the yeast-derived q-vlps were greatly immunogenic in mouse similar to that in e.coli-derived q-vlps. ms vlps produced in saccharomyces cerevisiae could package functional heterologous mrnas. for example, the linkage of the ms packaging sequence to the human growth hormone mrna allowed the packaging of the mrna in ms vlps. indeed, the high stability of ms vlps suggests them as an efficient delivery system for rna-based vaccines. the p. pastoris system was also utilized as a potent alternative for expression of ccmv coat protein vlps due to easy manipulation and high expression levels. in addition, this system has been utilized to express efficiently the premembrane and envelope glycoproteins of dengue virus type (denv- ), hbsag, , hccag resulting in the generation of vlps. the major advantage of yeast systems is the ptm including phosphorylation or glycosylation, as indicated in hbv vlps. the studies indicated that hbc phosphorylation plays a major role in viral replication and capsid formation. such yeast-derived hbc vlps are valuable for vaccination and diagnostics. furthermore, the potent multigene expression systems have been constructed in yeasts. for example, the expression of three rotavirus structural genes from a single plasmid vector led to the generation of triple layered vlps in saccharomyces cerevisiae. , however, the multimerization of protein into vlps is not supported for the enveloped viruses (e.g., gag vlps of hiv- ), suggesting that yeast does not have the essential factors of host. thus, the generation of enveloped hiv- pr gag vlps has been performed using s. cerevisiae spheroplasts, morphologically similar to immature viral particles. , in general, the construction of yeast expression systems, especially hansenula and pichia strains, are more difficult than bacterial vectors. in addition, the yield of vlp production is less than that in e.coli. other limitation of yeast system is its dissimilarity with mammalian cells in the ptm of proteins, especially glycosylation. , therefore, this system is more suitable for the generation of non-enveloped viral-like particles. another attractive system utilized broadly for production of vlp is the baculovirus-insect cell expression system, due to shirbaghaee and bolhassani some advantages, such as the rapid growth ratios, the culture preparation in large-scale, and the ptm of the target proteins similar to mammalian cells. [ ] [ ] [ ] the results showed that both yeast and insect cells were previously used for the vp expression of several polyomaviruses, and its assembly into viral-like particle. in addition, insect cells were used to provide vlpbased vaccines, e.g., the approved hpv vaccine, cervarix. indeed, insect cells are able to generate both vlp types (i.e., enveloped and non-enveloped). there are enveloped vlps in clinical trials. the main limitation of insect cell system is protein contamination with the enveloped baculovirus particles, suggesting the development of efficient plans for purification of vlps. recently, co-expression of four genes of human influenza h n virus (i.e., ha, na, m , and m ) in insect cells led to generate influenza vlps which protected mice against h n virus challenge. these data suggested that viral-like particles are a hopeful vaccine candidate for h n influenza and probably other subtypes of virulent avian influenza viruses. the non-infectious viral-like particles of the alphavirus sav was also generated using the recombinant baculoviruses expressing sav capsid protein and two major immunodominant viral glycoproteins (e and e ) in insect cells. moreover, baculovirus expression system was utilized to generate vlps from cowpea mosaic virus (cpmv), tomato bushy stunt virus, and entorovirus (ev ). , , recently, non-replicative baculovirus have been developed to cope with the problem of baculovirus contamination. stable systems using insect cells have been also tested. moreover, silkworm expression systems were efficiently applied to generate vlps and the surface of vlps could be changed by some strategies, irrespective if their constructs are enveloped or not. silkworms show a high capability for production of recombinant proteins, in comparison with insect cells, and also easy and inexpensive protein preparation similar to e.coli expression system. for over two decades, different mammalian cell lines have been developed as a source of commercial therapeutic proteins for clinical applications, because of their ability for proper protein folding, assembly and ptm (e.g., the correct glycosylation pattern). , however, high costs of production and potential safety concerns remained a challenge for these systems. the mammalian cells were progressively utilized to produce vlpbased vaccines , , e.g., for influenza viruses. for instance, the generation of a stable mammalian cell line (e.g., vero cells) expressing four influenza structural proteins (ha, na, m , and matrix (m )) led to form hybrid vlps containing matrix proteins, and surface glycoproteins of h n and h n influenza types, respectively. , another examples are the produc- and hiv- vlp in cos- /vero cells, and hbv vlp in cho cells. [ ] [ ] [ ] plant systems plants were successfully used to express specific gene products. the feasibility of recombinant plants for generation of vaccine antigens were shown in tobacco plants, potato tubers, and others. this approach develops vaccine strategies which can stimulate mucosal as well as systemic immune responses. in addition, it can be delivered orally as part of a normal biologic function in human. the antigen expressed in plant systems shows extensive disulphide crosslinking and oligomerization for formation of virus-like particles. for example, the hepatitis b major surface antigen has been expressed in several plant systems. plants are able to express and assemble both types of vlps (i.e., enveloped and non-enveloped) as multimeric and chimeric proteins. the high expression of vlps in plant is easy and rapid (e.g., - weeks) using a tobacco mosaic virus (tmv) rna replicon system and/or a bean yellow dwarf virus (beydv) dna replicon system. another advantage of plants is the use of plant virus particles as a delivery system to present foreign epitopes. furthermore, the problem of plant-specific glycans has been partially solved using the development of transgenic plants with "humanized" glycosylation pathways. plantderived vlps can be used for oral delivery of vaccines. virallike particles are more resistant to digestive enzymes than soluble proteins in body, because of their highly ordered and packed structures. for example, the gastrointestinal virusesderived vlps including noroviruses and rotaviruses were utilized orally as potent candidates for mucosal immunization. plant-derived vlps showed the same structures with vlps generated in other expression systems accompanied by a comparable or higher immunogenicity. some plant-derived vlps could induce protective humoral and cellular immunity and also safety in clinics. the studies showed that the level of protein expressed in the recombinant plants is variable and often low. therefore, further increase in expression will be necessary for practical and efficient products. recent progress in the glyco-engineering of plants allows human-like glycol-modification and optimization of desired glycan structures for increasing safety and functionality of recombinant pharmaceutical glycoproteins. some plantbased systems can stabilize antigen and thus reduce storage and distribution costs. different applications of virus-like particles toxoplasma gondii (t. gondii) is an obligate intracellular parasite infecting the nucleated cells of warm-blood vertebrates. this parasite is able to stimulate strong humoral, cellular and mucosal immunity, and thus it can be used as an efficient delivery system for heterologous antigens. t. gondii was applied as a vector for live vaccination against infectious pathogens. [ ] [ ] [ ] [ ] [ ] [ ] recently, a non-pathogenic kinetoplastida, leishmania tarentolae, was utilized to express heterologous proteins. the studies showed that expression of mammalian glycoproteins in this parasite leads to their modification with mammalian-like oligosaccharides. [ ] [ ] [ ] [ ] recently, our group has focused on its use as a live vector or killed vaccine, [ ] [ ] [ ] and also generation of viral coat proteins and their assembly as vlp in this system. virus-like particles show an efficient strategy to deliver antigens to the immune system, inducing both arms of the adaptive immunity. indeed, vlps present antigenic epitopes in the proper conformation, leading to induce humoral responses. for example, preclinical trials with influenza vlps indicated their capacity to induce both humoral and cellular immune responses at low antigen doses. several authors have reported antibody response to parenterally or orally administered plant-derived antigens. , as exogenous antigens, vlps are taken up by professional antigen presenting cells (apcs), especially dcs, followed by antigen processing and presentation via mhc class ii molecules, dc activation and maturation through up-regulation of co-stimulatory molecules and cytokine production, and stimulation of cd t helper cells. all these events can efficiently induce both humoral and cellular immunity. in addition, the exogenous vlps can enter the cytosol of dcs, be processed and presented by mhc class i molecules to cytotoxic t lymphocytes (ctls) using cross-presentation. , , furthermore, the b-cell activation using vlps is robust enough to induce t cell-independent igm antibodies. , dcs loaded with yeast-derived hiv vlps can alter gag-specific memory cd t cells into effector cells through cross-presentation in chronically hiv-infected individuals, although some gag-specific t cells in these patients did not show any response. the reports showed that the expression system used for generation of vlp might significantly affect direction, type and outcome of immune responses. for example, potent and specific immunomodulatory effects were assigned to yeast-derived hiv vlps in comparison with other expression systems. on the other hand, plant-or insectderived vlps, consisting of the l capsid protein of hpv, were both immuno genic to an equal degree. half of mice fed trans-genic potatoes expressing hpv vlps developed l -specific antibodies. the studies indicated that the vlps are taken up by clathrin-dependent macropinocytosis and phagocytosis before being degraded in acidic lysosomal compartments. vlp-derived peptides are loaded onto mhc i that have been recycled from the cell surface. a study showed that uptake and activation of dc by vlp involves proteoglycan receptors, tlr and nf-kb, and can be inhibited by heparin. several data suggest different routes of vlp uptake by dc and langerhans cells (lc). for example, lcs and dcs internalize similar amounts of hpv-vlps in vaccine design, albeit through different uptake mechanisms. , vlp uptake by dcs results in activation and cross-presentation of mhc i-restricted peptides with co-stimulation to t-cells. on the other hand, vlp uptake by lc leads to cross-presentation in the absence of costimulation. efficient vlp cross-presentation by lcs with costimulation can be achieved by addition of cd ligand. the lack of a protective immune response after viral contact with lcs may explain why some women fail to induce an immune response against the virus. lcs endocytose hpv vlps via a non-clathrin, non-caveolae, actin-independent pathway, whereas dcs take up hpv vlps both by a clathrin-mediated mechanism and via macropinocytosis in an actin-dependent manner. this difference in endocytosis resulted in processing and presenting hpv vlp peptides by lcs similar to that by dcs on their surface, but in the absence of co-stimulation. with the addition of cd l, lcs incubated with hpv vlps generated the efficient amounts of the pro-inflammatory cytokine (il- ) and could stimulate a hpv-specific immune response after incubation with t cells. despite these differences, vlps taken up by dc and lc were able to prime naive cd t cells and induce cytolytic effector t cells in vitro. furthermore, hiv- pr gag virus-like particles could stimulate strong humoral and cellular immune responses. vlp expressed by recombinant baculoviruses activated human pbmc to release pro-inflammatory (il- , tnf-a), antiinflammatory (il- ) and th -polarizing (ifn-c) cytokines as well as gm-csf and mip- a in a dose-and time-dependent manner. furthermore, vlp-induced monocyte activation was shown by up-regulation of molecules involved in antigen presentation (mhc ii, cd , and cd ) and cell adhesion (cd ). exposure of vlp to serum inactivated its capacity to stimulate cytokine production. the linking of vlps to adjuvant molecules was also shown to improve the immunogenicity of the nano-bioparticles. adjuvanted vlps [e.g., cpg odn or poly (i: c) adjuvants] elicited a higher titer of total specific igg compared to vlps alone. furthermore, while vlps alone induced a balanced th pattern, vlps formulated with adjuvant elicited a th -biased igg subclasses (igg a and igg ), with poly (i: c) more potent than cpg odn in shirbaghaee and bolhassani animal model. in addition, mice immunization with chimeric simian immunodeficiency virus (siv) vlps containing gm-csf significantly induced siv env-specific antibodies as well as neutralizing activity at higher levels than those elicited by standard siv vlps, siv vlps containing cd l, or standard vlps mixed with soluble gm-csf. on the other hand, the incorporation of immunostimulatory molecules showed significantly increased cd and cd t-cell responses to siv env, compared to standard siv vlps. formulation of vlps with rough lps (r-lps) adjuvant as well as dna primed-vlp boosted regimen were led to increase specific immune responses as compared to vlps alone, but among them the vlp/r-lps highly enhanced immune response. recent studies demonstrated the potential of the hbc vlps as an oral immunogen. intraperitoneal immunization with the hbc vlp induced a strong, mixed th /th response. in contrast, oral administration of the hbc vlp generated a high humoral response with mainly igg a antibodies, directing toward a th response which is essential in the control of intracellular pathogens. in addition, the intranasal monovalent adjuvanted norwalk vlp vaccine was well tolerated and highly immunogenic. the studies showed that chimeric hpv-vlps are able to elicit potent ctl responses in mice against hpv transformed tumors; however, the mechanism of t cell priming has remained obscure. hpv vlp could bind to human mhc class ii-positive apcs through interaction with fccriii, and immature dcs were activated after incubation with hpv vlp. it was shown that binding and uptake of vlp by dc from fccrii, fccriii, and fccrii/iii deficient mice are reduced by up to % compared with wild-type mice. in addition, maturation of murine dc from fccrii/iii-deficient mice by vlp is also reduced, indicating that dc maturation, and thus ag presentation, is diminished in the absence of expression of fccr. poor immunogenicity of mucosally administered proteins has been a major barrier for development of efficient oral vaccines. one way to overcome this obstacle is the use of appropriate adjuvants. also, delivery of antigen to mucosal surfaces as vlp provides an efficient way of inducing mucosal immunity. after oral or intranasal immunization with norwalk vlp, or rotavirus vlp without adjuvant, intestinal iga was detected in immunized mice, which were protected from virus challenge. in addition, the plasma cell precursors that migrate to the genital tract are derived primarily from mucosal lymphoid tissues and often secrete iga. the studies indicated that immune responses generated by mucosal administration of vlp were generally weaker than systemic administration. vlp specific iga was higher in intestine washes following intrarectally (i.r.) than intravaginally (i.va.) immunization, and higher in vaginal washes following intramuscularly (i.m.) than i.r. or i.va immunization. some studies suggested that the immunogenicity of virus particles at mucosal surfaces is probably a property of particulate antigens assembled as multimers of subunits. indeed, vlp might be actively taken up by mucosal apc through the integrin receptors. lipoparticles are stable, highly purified, homogeneous, and specialized vlps containing high concentrations of an integral membrane protein. integral membrane proteins are involved in different biological functions and are targeted by % of existing therapeutic drugs. however, because of their hydrophobic domains, membrane proteins are difficult to manipulate outside of living cells. lipoparticles can incorporate a wide variety of the membrane proteins, including g proteincoupled receptors, ion channels, and viral envelopes. lipoparticles provide a platform for different applications such as antibody screening, production of immunogens, and ligand binding assays. [ ] [ ] [ ] during the assembly of enveloped viruses, lipid ordered domains of the host cell plasma membrane, known as lipid rafts, frequently function as a natural target for viral proteins. the role of lipid rafts in the organization of complex combinations of immune receptors during antigen presentation and t cell signaling is extensively recognized. on the other hand, in order to improve the immunogenicity of hiv- envelope glycoproteins, the fusion of gp was performed to a carrier protein, hepatitis b surface antigen (hbsag) which is capable of spontaneous assembly into viruslike particles. the hbsag-gp hybrid proteins assembled efficiently into - nm particles. the particles resembled native hbsag particles in size and density, consistent with a lipid composition of about % and a gp content of about per particle. particulate gp folded in its native conformation and was biologically active, as shown by high affinity binding of cd . because the particles are lipoprotein micelles, an array of gp on their surface closely mimics gp on the surface of hiv- virions. these gp -rich particles can enhance the quality, and also quantity of antibodies elicited by a gp vaccine. virus-like particles show an expanding spectrum of applications such as gene therapy, nanotechnology, vaccination, and diagnostics. , recently, the studies showed a pattern of direct conjugation of some ligands, including nucleic acids and proteins attached to vlp surface. , in addition, because of the superior accessibility of cysteine and lysine residues on vlps, bio-conjugation has been performed by commercial homo-or hetero-bifunctional linkers. [ ] [ ] [ ] [ ] for example, three foreign proteins were chemically conjugated to the vlp surface of cpmv by proper bifunctional cross-linkers. on the other hand, the researchers could produce an alphavirus vlp surrounding a functional gold nanoparticle. vlps have been also used to stimulate immune responses and generate antitumor responses, e.g., alphavirus-based virus-like replicon particles (vrp) expressing various melanoma antigens. [ ] [ ] [ ] it is interesting that the first viral-associated cancer vaccines were founded on hbv vlp and hpv vlp to prevent hbvassociated hepatocellular carcinoma (hcc) and hpvassociated cervical carcinoma, respectively. , it should be noted that these vlp formulations are viral vaccines that prevent a viral infection that may progress to carcinoma after a long time. we indicate some applications of vlps against viral diseases as following: in several studies, specific vaccine antigens were generated by various expression systems to induce protective immune responses and apply in licensed recombinant viral vaccines. , some examples of preventive vlp-based vaccines are recently commercialized worldwide including glaxos-mithkline's engerixv r (hbv) and cervarixv r (hpv), and merck and co., inc.'s recombivax hbv r (hbv) and gardasilv r (hpv). other vlp-based vaccines undergo preclinical evaluation or clinical trials, including parvovirus-, influenza-, norwalk-derived vlps and also different chimeric vlps. for generation of immunogenic vlps, eukary otic expression hosts including yeast (s. cerevisiae, p. pastoris and h. polymorpha) and mammalian cells (chinese hamster ovary cell line [cho]) were used. the studies indicated that the recombinant hbsag generated in cho and h. polymorpha have significant differences in size, molecular weight (mw), and monomer number. furthermore, the cho-derived viral-like particles include a combination of glycosylated and non-glycosylated hbsag proteins, similar to those in patients' sera, while yeastderived antigens were reported to be non-glycosylated. chobased vaccines were provided by pasteur-m erieux aventis in france (genhevac bv r ) and scigen in israel (sci-b-vac tm ). both vaccines contained not only the hbsag s pro tein but also the m protein (genhevac b) or the m and l protein (sci-b-vac). on the other hand, gardasil approved by the fda in is a quadri valent hpv types / / / l vlp vaccine produced in s. cerevisiae. cervarix is the other licensed hpv vaccine approved by the fda in . cervarix is a bivalent hpv types / l vlp vaccine expressed via a recombinant baculovirus vector. , different experiments have been concentrated on hpv vlp vaccination in mouse and human models including: (a) activation of immature human dcs by chimeric hpv vlps, (b) determination of systemic cytokine pattern elicited by hpv l vlps, (c) identification of gene expression signatures in hpv l vlp-induced human pbmcs, (d) generation of potent and prolonged neutralizing l antibodies using a single intramuscular (im) mice injection with recombinant adenoassociated virus encoding hpv l protein (raav- l ), (e) augmentation of immunogenicity of hpv l dna vaccines using genetic linkage to a chemokine and secretory signal peptide sequences, (f) potent stimulation of systemic and mucosal immune responses to vlp vaccines using the encapsulation of a genetic cytokine adjuvant (e.g., il- ), (g) improvement of hpv vlp immunogenicity by linkage to the modified adjuvant, and m) nasal immunization of mice with hpv vlps. hpv l -e chimeric virus-like particles (cvlp) could induce e -and l -specific ctls. the therapeutic potential of the cvlp also indicated a considerable safety in high grade cervical intraepithelial neoplasia patients (cin / ). several improvements in vaccine design by vlp are still in preclinical trials. some main examples are referred as following: a. co-injection of the hpv l vlp with e. coli heatlabile enterotoxin (lt) as an adjuvant significantly increased the levels of serum igg and vaginal iga after nasal or bronchial mice immunization. antigens (hiv- p /p : ty vlp) was also immunogenic and well-tolerated in phase i clinical trials. , g. several groups have focused on improving bacteriophagebased vlp vaccines, e.g., rna bacteriophage qb. these chimeric vlp vaccines were targeted against noninfectious diorders including hypertension, allergy, neurodegenerative and autoimmune diseases (e.g., diabetes mellitus type ii and alzheimer), cancer (e.g., melanoma). the vaccine candidate against alzheimer (cad- ) was constructed to display chemically coupled amyloid beta (ab - ) peptide derived from the n-terminal b cell epitope of ab protein, on the surface of qb-cp vlps. this vaccine could stimulate ab-specific igg and decrease amyloid accumulation in animal models expressing ab precursor protein, without eliciting t-cells or inflammatory reactions in brain tissue. , in addition, the angiotensin ii vaccine was synthesized by covalently conjugation of a peptide derived from angiotensin ii to the rna bacteriophage qb vlp capsid. this modified vlp could decrease blood pressure in spontaneously hypertensive rats. table i shows preclinical and clinical studies of vlps in vaccine development. generally, a major application of vlps is the stimulation of immunity against foreign protein epitopes by genetically fusing or chemically conjugating them to vlps entitled as chimeric vlp (cvlps). antigens can be fused to vlps through either covalent or non-covalent bonds. the most common covalent bond is generated by the heterobifunctional chemical cross-linkers with amine and sulfhydryl-reactive arms. for instance, cysteine-containing antigens can be conjugated to lysine residues of vlps surface at a high density (e.g., three peptides per coat protein molecules). the non-covalent conjugation strategy contains the use of streptavidin as linkers to attach biotinylated antigens and vlps through their efficient and specific interactions. sv vlps can also encapsulate various materials such as dna ( kb) and proteins as antigens. insertion of a special exogenous peptide into the surface loops of vp produced sv vlps with the ability of cell targeting. moreover, sv vlps stimulated innate immunity as a natural adjuvant. indeed, sv vlps may be a promising vaccine candidate to deliver heterologous antigens followed by the induction of ctls without synthetic adjuvants. several chimeric vlp vaccines have entered clinical trials, such as the anti-influenza a m -hbcag vlp vaccine (hbcag vlps displaying m epitope of influenza a), the anti-hiv p /p : ty vlp, two anti-malaria vaccines (hbcag vlps displaying malaria epitopes), the nicotine-qb vlp and the anti-ang ii qb vlp. genetic linkage contains a stable bond between vlp and antigen. the studies showed that only peptides shorter than amino acids (small peptides) can be presented without interfering with the correct assembly of vlps. other limitations contain the improper folding of displayed antigens and the formation of cvlps with heterogeneous size. to prevent these issues (e.g., assembly problems), structural studies have identified domains for different vlps such as hbcag, hbsag and hiv gag that were not necessary for vlp assembly as well as allowed insertion of foreign antigens. the simplest way for generation of single component cvlps, is the insertion of peptides at the n-or c-terminal regions of chimeric vlps. multiple fusion positions should be identified to produce multicomponent cvlps inducing broad immune responses. the direction and intensity of the immune responses are significantly influenced by the vlp type, the foreign antigen density, and its accessibility on or within the vlp. furthermore, preexisting immunity against the epitopes of the vlp as a delivery system may importantly change the response against the heterogenous antigen. for example, hbcag was also utilized to display a neutralizing epitope of hpv l protein. the nasal delivery of hbcag-hpv l epitope cvlps expressed in tobacco induced antigen-specific antibody responses in mouse model. on the other hand, an hpv l -based chimeric vlp was generated in transgenic tomato to present several t-cell epitopes from hpv e and e proteins. the hpv l -e /e vlps elicited a neutralizing antibody response similar to that from an equal amount of the commercial vaccine (gardasil) in preclinical study. moreover, the chimeric vlps induced ctl responses against the e and e epitopes. chimeric hpv l vlps were also designed using genetic fusion to display epitopes of influenza m protein. to overcome the problems associated with genetic fusion including the antigen size, conformation and vlp assembly, different applications of virus-like particles chemical conjugation approaches were applied to construct cvlps. in this strategy, target antigens and native vlps were generated individually and coupled together by attachment of the antigen to the surface of the pre-assembled vlps. two main advantages of this strategy include: (a) various sizes and types of antigens can be exposed, and (b) the antigen-vlp binding site can be manipulated for further presentation of the conjugated antigen. for example, vlps were used to display full-length and correctly folded proteins, such as interleukin- (il- ). generally, vlps were used for delivery of protein/peptide, dna, sirna and drugs as a brief description in following: viral-like particles were used as a peptide/protein carrier, in vitro and in vivo. there are several examples for delivery of protein/peptide using vlps as following: a. chimeric vlp vaccines have been improved based on rna bacteriophage ap , presenting peptides of selfantigens or pathogens fused to either the n-or cterminal regions of ap coat protein. ap -derived vlps were highly immunogenic in mice. furthermore, influenza m vlps stimulated an efficient m -specific antibody response and full protection against lethal influenza virus challenge. b. vlps containing flt ligand (fl-vlps), a dc growth factor, could effectively increase immunogenicity in mice. dcs exposed to vlps also produced high levels of il- . c. a plant vlp-based approach was used to develop respiratory syncytial virus (rsv) vaccine. a target peptide displaying amino acids - of the rsv g protein was delivered on the surface of recombinant alfalfa mosaic virus (almv) particles. this construct induced high pathogen-specific immune responses in immunized animals. , d. in a recent study, a peptide from an external loop of mouse ccr protein was inserted into a neutralizing epitope of hpv l . the particles generated by this chimeric l could elicit high levels of ccr antibodies that specifically recognized the surface of ccr transfected cells and blocked in vitro infection of an mtropic hiv strain in mice. in addition, chimeric vlps containing the full length hpv e oncoprotein linked to l , or the n-terminal region of e fused to l , could induce antigen-specific protection of mice from lethal challenge with e -expressing tumor cells. - e. a pre-s epitope of hbv was also inserted into the ef loop of hpv vlp recognized by hbv-specific antibody. chimeric vlps produced in e.coli carried a virus-neutralizing hbv pre-s epitope in the major immunodominant region (mir) and a highly conserved n-terminal hcv core epitope (aa to ) at the c-terminal region of the truncated hbv core vlps (hbc). the presence of two different foreign epitopes within the hbc molecule did not interfere with its vlp-forming potential, with the hbv pre-s epitope exposed on the surface and the hcv core epitope buried within the vlps. mice vaccination showed a specific t cell activation by both foreign epitopes and a highlevel antibody response against the pre-s epitope, whereas an antibody response against the hbc carrier was inhibited. f. the researchers have shown that the nanosized hbc-vlps bearing mycobacterial antigen cfp- (hbc-vlp: cfp- fusion protein) induced an increased immune response in balb/c mice compared to mixtures of native antigen. g. chimeric papillomavirus vlps based on the bovine papillomavirus type (bpv- ) l protein were designed by replacing the -carboxyl-terminal amino acids of the bpv major protein l with a synthetic "polytope" minigene, containing known ctl epitopes of human pv e protein, hiv iiib gp p , nef, and reverse transcriptase (rt) proteins, and an hpv e linear b epitope. the chimeric l protein assembled into vlps in insect cells. polytope vlps could deliver multiple b and t epitopes as immunogens to the mhc class i and class ii pathways. this study has demonstrated that hybrid vlps can be used as an efficient antigen delivery system to transfer more than one ctl epitope through mhc class i pathways. h. the chimeric hpv vlps were generated in which hpv l neutralization epitopes (l residues - or - ) are inserted within an immunodominant surface loop (between residues and ) of the l major capsid protein of bpv . immunization of rabbits with assembled particles elicited high l -specific serum antibody responses. l. the studies showed that the c-terminal region of gag fused by t cell epitopes from human cytomegalovirus pp led to the formation of hybrid vlps activating antigen-specific cd memory t cells ex vivo. regarding to previous studies, the gag polyprotein is the only retroviral protein required for vlp formation. [ ] [ ] [ ] vlps, derived from an avian retrovirus, were applied to deliver proteins to cells, either as part of gag fusion proteins (intracellular delivery) or on the surface of vlps. the construct is an effective system because the vlps are completely made of the gag fusion protein, and a single vlp will deliver - copies of gag fusion protein into a transduced cell. delivery of foreign genes to the digestive tract mucosa by oral administration of non-replicating gene transfer vectors would be a very useful method for vaccination and gene therapy. the studies indicated that plasmid dna could be packaged in vitro into a vlp composed of open reading frame (orf ) of hev, which is an orally transmissible virus. these vlps could deliver this foreign dna to the intestinal mucosa in vivo, eliciting high mucosal and systemic immunity in mice, without the use of adjuvants. an orally administered hiv dna vaccine encapsulated in hev-vlps could induce mucosal and systemic cellular and humoral immune responses. moreover, the ability of hpv vlps was examined to mediate delivery and expression of dna plasmids in vitro and in vivo. hpv pseudoviruses were provided by disrupting hpv-vlp, mixing them with dna plasmids and reassembling them into the pseudoviruses (vlps with plasmids inside). the pseudovirus induced more potent immune responses than dna vaccines. the pseudovirus could be used in gene therapy by transferring the therapeutic genes into lymphoid tissues in human. in addition, the recombinant hpv l vlps, produced in insect cells, could efficiently encapsulate a plasmid harboring either a gene for the gfp or b-galactosidase during in vitro disassembly-reassembly of vlps. vlp-mediated delivery of a gfp reporter construct in vitro showed to be highly dependent on the presence of full-length l protein within the vlps. similarly, expression of gfp and luciferase reporter plasmids in vivo was efficiently enhanced by co-administration of l /l vlps. in addition, co-administration of vlps with a hpv e -expressing plasmid increased significantly e -specific cellular immune responses. the reports indicated that the recombinant major structural protein of the bk polyomavirus (bkv vp ) was shown to self-assemble into vlps with a diameter of - nm. the potential of bkv vp vlps was investigated to transfer gene into cos- cells using three methods for the formation of pseudovirions: disassembly/reassembly, osmotic shock and direct interaction between vlps and plasmid dna. the most efficient method is the direct interaction between vlps and linearized plasmid dna. the findings generally demonstrated that bkv vlps have exogenous dnabinding activity, as a promising vehicle for gene transfer studies. sirna delivery there is a major challenge to identify novel approaches for specific and effective delivery of new types of drugs like sirnas and peptides. systemic delivery of small interfering rna (sirna) was restricted by its poor stability and low cellpenetrating properties. to overcome these limitations, an efficient sirna delivery system was designed using polyethyleneimine (pei)-coated vlps derived from adeno-associated virus type (pei-aav -vlps). generally, one of the strategies to integrate sirna into nanoparticles was to coat these particles with positively charged polymers, including pei, poly b-amino different applications of virus-like particles ester, or poly l-lysine. electrostatic coating could increase the efficiency of systemic sirna delivery due to its protective effects and improved cellular uptake. an insect/baculovirus expression system was used to generate aav -vlps. pei-aav -vlps could condense sirna, protect it from enzymatic degradation, transfer it with high efficiency and induce cell death in mcf- breast cancer cells, for breast cancer therapy. furthermore, micrornas (mirnas) play an essential role in immunoregulation and may be involved in the pathogen esis of systemic lupus erythematosus (sle). among these sle-related mirnas, mir- a, acts as a significant inhibitor of autoimmunity, myeloproliferation, and cancer. a novel mirna-delivery approach was described via bacteriophage ms vlps for evaluation of the therapeutic effects of mir- a, in bxsb lupus-prone mice. treatment with ms -mir- a vlp increased the level of mature mir- a, leading to a significant reduction in the expression of autoantibodies and total igg. furthermore, the levels of inflammatory cytokines, including ifn-a, il- b and il- were decreased in mice. the stimulation of dysregulated mirnas by an ms vlp-based delivery system may be considered as a novel therapy. [ ] [ ] [ ] the use of ms vlps was reported for selective delivery of nanoparticles, chemotherapeutic drugs, sirna cocktails, and protein toxins to human hcc. in addition, the researchers used jc virus (jcv) vlps as a vector for delivering rnai in silencing the il- cytokine gene. jcv vlps were non-toxic, and showed the therapeutic use as a gene therapy approach for autoimmune diseases (aid) including sle. , drug delivery a major challenge in pharmacology is to find methods that drugs (especially anti-cancer drugs) can be delivered specifically to target tissues. a potential strategy would be to package or encapsulate the drug molecules inside a particle which is bound to the cancerous tissue. such encapsulation would protect the drug from degradation in blood. for this purpose, it will be necessary to develop particles which can be modified on their outer surface to carry drug molecules into the target cells. novel nanocarriers such as dendrimers, liposomes, polymersomes, micelles, and vlps indicated high potency in improving drug delivery, and targeting strategies. all of these delivery systems make drugs more biocompatible, watersoluble, or colloidal, indicating low toxicity and high uptake in cells. different virus-based materials were studied for drug delivery such as: the ccmv, the cpmv, the red clover necrotic mosaic virus (rcnmv), ms rna-containing bacteriophage, the bacteriophage qb, m bacteriophage, the tmv. drug cargo can be loaded through covalent attachment of drugs or their analogs to particular reactive residues on the capsid pro-teins. several cancer cell targeting ligands were attached to different types of vlps, including small molecules, antibodies, peptides and proteins, as well as dna aptamers. folic acid (fa) was broadly used in drug delivery targeted to cancer cells. uptake of fa into cells is mediated by the folate receptor (fr). recently, lactobionic acid (la) was applied for the specific targeting of a rotavirus capsid vp to hepatocytes or hepatoma cells bearing asialoglycoprotein receptors (asgprs). human holo-transferrin (tfn) is essential for iron homeostasis. tfn is especially recognized by the tfn receptor (tfnr), which is over-expressed on the surface of various tumor cells and efficiently taken up by cells in the clathrinmediated endocytosis. , tfn has been conjugated to cpmv and bacteriophage qb. the cellular uptake of the qb-tfn particles was relative to the tfn density; while the internalization was prevented by comparable concentrations of free tfn. antibodies contain another group of targeting proteins that could be chemically linked to vlps. for instance, a single-chain (scfv) antibody that recognizes the carcinoembryonic antigen (cea) over-expressed in a variety of tumor cells, has been attached to cpmv. an important strategy to improve cellular uptake of therapeutic molecules is the use of cell-penetrating peptides (cpps). the hiv- tat peptide is one of the cpps that were extensively used in the delivery of vlps. , in general, virus-like particles represents an attractive system for drug delivery in vitro. the efficient delivery of hydrophobic drugs into target cells without the use of organic solvents or chemical linkage to delivery carriers is a critical issue in the biological field. recently, the intracellular delivery of hydrophobic dyes or drugs encapsulated in vlps through cyclodextrins (cds) showed high efficiency. as a model anticancer drug, paclitaxel (ptx)-cd complexes encapsulated inside vlps exhibited a dose-dependent cytotoxic effect with a -fold smaller ic than that of free ptx dissolved in dmso. cell targeting is aimed to effective uptake of therapeutic and/ or diagnostic reagent in a special location such as a tumor. targeting can also be achieved using proteins (mainly antibodies), peptides, nucleic acids (aptamers), small molecules, vitamins and carbohydrates. by attachment of targeting ligands, specificity for cell targeting was obtained by receptor-mediated endocytosis. for instance, bacteriophage ms vlps, were chemically conjugated to a targeting peptide (sp ) for the selective delivery of nanoparticles, chemotherapeutic drugs, sirna cocktails and protein toxins to human hcc. [ ] [ ] [ ] recently, the chemical conjugation of human epidermal growth factor (egf) to simian virus vlps allowed for cell shirbaghaee and bolhassani selective targeting. simian viruses vlps have attracted a great attention in gene delivery due to their high stability and low toxicity in blood. in design of polymeric nanoassemblies, chemical modification is necessary to conjugate the dye or probe for in vitro and in vivo imaging. however, in the case of nanobioassemblies, chemical or genetic modification can be applied for bioconjugation of fluorescent dyes or other probes. another advantage of nanobioassemblies such as vlps for bioimaging is their biological compatibility. quantum dots (qds) and gfp were used broadly for in vitro and in vivo imaging as alternatives to labelling. for example, fluorescent chimeric vlps of canine parvovirus were expressed in insect cells. to create the fluorescent chimeric vlps of canine parvovirus, gfp was genetically engineered onto the n-terminal region of the viral protein vp , as a visualization tool to understand mechanisms of viral infections. gfp was also used to design chimeric hiv vlps allowing protein to be followed during assembly and transmission using live-cell imaging. , advantages of vlps include: (a) no need to propagate pathogenic organisms, (b) repetitive and ordered surface structures, (c) multivalent as well as particulate in nature, (d) safer than other vaccines because of non-infectious and non-replicating properties: the studies showed that there is no risk of disease progress in vaccinated groups with vlp-based vaccines as compared to attenuated viral vaccines, because they lack the genomic material needed for the replication and the spread of the viruses, (e) stable in extreme environmental conditions, depending on vlp structure (i.e., envelope or non-envelope), and (f) as carrier to express foreign antigen. the potential of vlps to target dcs is a main advantage of vlp vaccines, for activating the innate and adaptive immune responses. they have a special benefit against other delivery systems in size, stability, and capacity to transfer biological molecules across cell barriers. particles in the - nm range can stimulate cd , cd cells and especially generate th responses. in addition, despite a limited number of vlp vaccines approved for human use, they represent a promising platform for the development of novel mucosal vaccine strategies. indeed, vlps are sufficiently small, and the composition of their surface chemistry can be designed to minimize hydrophobic and electrostatic adhesive interactions with mucus. they can also be engineered for recombinant expression of multiple antigenic epitopes and for incorporation of co-stimulatory and immuno-regulatory proteins. however, vlp technology can be limited by difficulties of scale-up and the need for purification from the expression systems. other limitation in chimeric vlp vaccine is to determine the compatibility of peptide with assembly of vlp and its immunogenicity property. under the host immune defence, pathogens undergo mutation which render the vlp vaccine ineffective and will be effective for only highly conserved b or t cell epitopes. the major challenge is to develop novel production platforms that can deliver vlp vaccines while significantly reducing production times and costs. viral-like particles (vlps) have shown high ability for the improvement of vaccines against infectious and non-infectious diseases. several recombinant expression systems were successfully applied for vlp production, with different efficiency. the use of vlps in vaccine development showed that they are considered safe. in addition, nano-sized vlps, can act as an adjuvant as well as antigen delivery system through increasing the antigen uptake by apcs. thus, it is not necessary for the use of adjuvants along with vlps to stimulate potent immune responses. vlps have shown a natural affinity to target host cells, and this property has been used for cell-targeting applications. regarding the advantages of vlps, it is necessary for further studies in various aspects especially easy and low-cost purification of vlps as well as their application as a delivery system in vivo. different applications of virus-like particles hum vaccine hepatitis b virus vaccine ip recombinant (genetically engineered): enivac hb hepavax-genev r . summary of product characteristics revac-b tm. available at prescribing information. merck prescribing information. glaxosmithkline medicago to present additional positive clinical data at the eswi influenza conference medicago inc. news release exp rev vaccine hum vaccine immunother different applications of virus-like particles antimicrob agents chemother intranasal norwalk vaccine hum alves, p. m. exp rev vaccine exp rev vaccines different applications of virus-like particles curr top microbiol immunol the authors are grateful to elnaz agi and negar zohrei (dept. of hepatitis and aids, pasteur institute of iran) for technical assistance. key: cord- -l ost q authors: oli, angus nnamdi; obialor, wilson okechukwu; ifeanyichukwu, martins ositadimma; odimegwu, damian chukwu; okoyeh, jude nnaemeka; emechebe, george ogonna; adejumo, samson adedeji; ibeanu, gordon c title: immunoinformatics and vaccine development: an overview date: - - journal: immunotargets ther doi: . /itt.s sha: doc_id: cord_uid: l ost q the use of vaccines have resulted in a remarkable improvement in global health. it has saved several lives, reduced treatment costs and raised the quality of animal and human lives. current traditional vaccines came empirically with either vague or completely no knowledge of how they modulate our immune system. even at the face of potential vaccine design advance, immune-related concerns (as seen with specific vulnerable populations, cases of emerging/re-emerging infectious disease, pathogens with complex lifecycle and antigenic variability, need for personalized vaccinations, and concerns for vaccines' immunological safety -specifically vaccine likelihood to trigger non-antigen-specific responses that may cause autoimmunity and vaccine allergy) are being raised. and these concerns have driven immunologists toward research for a better approach to vaccine design that will consider these challenges. currently, immunoinformatics has paved the way for a better understanding of some infectious disease pathogenesis, diagnosis, immune system response and computational vaccinology. the importance of this immunoinformatics in the study of infectious diseases is diverse in terms of computational approaches used, but is united by common qualities related to host–pathogen relationship. bioinformatics methods are also used to assign functions to uncharacterized genes which can be targeted as a candidate in vaccine design and can be a better approach toward the inclusion of women that are pregnant into vaccine trials and programs. the essence of this review is to give insight into the need to focus on novel computational, experimental and computation-driven experimental approaches for studying of host–pathogen interactions and thus making a case for its use in vaccine development. vaccination has been undeniably very helpful in promoting a healthy global population. it has severally saved lives, reduced healthcare costs and raised man's quality of life. it greatly reduces disease burden, disability and death. however, newly emerging and reemerging infectious diseases (erid), infectious agents with complex lifecycle and antigenic variability and the need for personalized vaccination present additional challenges in vaccine development. , for many pathogens (especially the emerging and those with antigenic variability), their genomes are known but their immune correlates of protection remain unclear. , some of these reasons are why vaccine development for erid and multi-lifecycle pathogenic diseases is a tall order. serendipitous discoveries in immunology coupled with knowledge of bioinformatics tools for epitope predictions have resulted in the emergence of new pattern of vaccine design. , the art and science of efficient and comprehensive information extraction and analysis of data deposited in relevant databases is now increasingly essential in researches related to immunology. even with this capacity (efficient information extraction), some challenges in the application of bioinformatics in immunology include structure and/or function analysis and immune process analyses as concern the immune interaction specificity. fortunately, although researches in immunology are experimentally costly and very intensive, colossal amounts of data are usually generated. such data can only be analyzed with high precision and speed using bioinformatics tools. for instance, genome sequencing as well as in vitro t-cell confirmation is done in few months as opposed to years using the conventional vaccine design. also, computational immunological methods drastically reduce both time and labor needs in epitopes screening. , with computational immunology techniques, it is possible to discover vaccine candidate epitopes simply by scanning the protein sequences in a pathogen of interest. many of these proteins are yet to be isolated or at least cloned. being pathogens specific and unique, they present ready candidates in vaccine construct. this review describes the need to use immunoinformatics-based techniques to unveil vital determinants of immunity made available in the genome sequence database and design vaccines. also, this review gives insight into the need to focus on novel computational, experimental and computation-driven experimental approaches for studying of host-pathogen interactions and thus making a case for its use in vaccine development. this review will further show the need for new approaches for effective drugs or vaccine design so as to combat the antigenic variability of some pathogens. the process of generating vaccine-induced immunity is somewhat challenging in immunology. current conventional vaccines came empirically when there were vague or no knowledge of vaccine immune system activation. a lot of research [ ] [ ] [ ] has been geared toward the understanding of this challenge, but the complexity of it requires a different dimension of approach. an approach that must accommodate many factors affecting vaccine development like pathogen antigenic variability, the emergence of infectious disease, human genetic variation is the goal of immunoinformatics [ figure ]. activation of the immune system involves, among many processes, induction of the immune memory. the strength of this induction determines the efficacy of a vaccine. hence, vaccine efficacy in the long run is influenced by the determinants of immunological memory stimulation, persisting antibodies and kind and type of immune memory cells induced. the primary vaccine-mediated immunological effectors (table ) are mainly the antibodies (from b lymphocytes/ cell) , and sometimes cd + and cd + t cells. these antibodies bind specifically to a particular kind of toxin or pathogen. vaccines and most antigens evoke humoral as well as cell-mediated immune responses. vaccines that mediate immune responses through these systems (b and t cell responses) are said to be more effective. although b cells are regarded as the primary vaccine immune effectors, t cells induce immune memory cells and high-affinity antibodies. studies in reverse vaccinology and immunomics had also proved t cells as prime immune effectors following the discovery of novel vaccine targets with epimatrix. [ ] [ ] [ ] this change of immune target has led to successful advances in vaccine design. even at the face of potential vaccine design advances, immune-related questions are now focused on specific vulnerable populations such as the young, elderly and immunocompromised. , these concerns have propelled a better understanding of the efficacy of current vaccines on this vulnerable population and have also paved way for the application of new approaches that can put into consideration the differences of population and better targets that can generate optimum immune induction [ ] [ ] [ ] with the exception of type ii t-cell-independent (ti- ) antigens (i.e., polysaccharide antigens). antigens that could provoke the b lymphocytes as well as the t lymphocyte responses stimulate the germinal centers causing antigen-specific highly efficient b-cell multiplication and eventual differentiation into antibodyforming plasma cells and memory b cells. all existing protein and dna antigens induce immunological memory b cells unlike type ii t-cell-independent (ti- ) antigens (i.e., polysaccharide antigens). these polysaccharide antigens do not generate memory b cells but can induce longlasting humoral immunity even when recall responses are lacking. vaccine efficacy may be short term if only the b cells are activated. the traditional approach for developing vaccines for infectious disease threats has shifted to include other vaccine design techniques like cloning and expressing major surface antigens although this frequently resulted in the formulation of vaccines with poor immunogenicity, requiring strong adjuvantation. this approach is particularly likely to be less specific for pathogens with complex lifecycles (e.g., parasites) or very high mutability (e.g., rna viruses). these pathogens do not depend on one route for their virulence of pathogenesis in human and thus to alter this process, increasing the specificity of the vaccine should be the aim and not just the effectiveness as seen in the current conventional vaccines. , vaccines for several neglected tropical diseases are in various stages of development, thanks to mega drug companies that have continued to demonstrate a willingness to invest money in the research and development as regards to diseases plaguing the developing nations. , [ ] [ ] [ ] it is very pertinent to invest in researches that have an interest in vaccine specificity on the pathogen antigens than totally computational vaccinology may now be applied to screen these genomes for possible vaccine target. with these tools, many proteins of virulence interest can be sequenced and the most essential gene of interest modeled for a potential vaccine candidate specific for that pathogen. immunoinformatics is the way forward in the identification of vaccine candidates for these tropical erid, for pathogens with varying antigens and for individualized therapy. immunology studies produce data in colossal quantities. also, with proteomics and genomics projects, extensive screening of pathogens and/or pathogen-host interaction, it has become increasingly necessary to store, manage and analyze these data, hence the birth of immunoinformatics. immunoinformatics deals with computational techniques and resources used to study the immune functions. statistical, computational, mathematical and biological knowledge and tools are applied in immunoinformatics in order to accurately and specifically store, and analyze data concerning the immune system and its functions. to handle evidence diversity, immunoinformatics uses tools that cut across several aspects of bioinformatics such as creation and management of databases, , use and definition of both structural and functional signatures and the formation and application of predictive tools. [ ] [ ] [ ] these strategies can synergize toward a better understanding of the immune system of both man and animals and fight against some less predictable pathogenesis. the complex nature of vertebrates' immune system, the variable nature of pathogens and environmental antigens coupled with the multi-regulatory pathways show that colossal quantities of data will be needed to unveil how the human immune systems work. conventionally, much cannot be achieved based on the complexity of the immune system and the virulent antigen but with the application of computational vaccinology, researches on vaccine design have been made easier, accurate and specific. applying immunoinformatics in disease study (table ) requires the knowledge of disease pathogenesis, the immune system dynamics, and computational vaccinology, painstaking searches of the database, sequence comparison, structural modeling as well as motif analysis. , these methods can go a long way in analyzing the pathogenesis of a disease and identification of vaccine candidates. in order to help understand complex pathogenrelated processes, computational models were developed for viral , bacterial, parasitic and fungal pathogens. the bioinformatics tools (table ) are used to identify possible epitopes for vaccine formulation. each tool can screen protein sequences and identify aggregates of mhc binding and supertype motifs for possible use in epitopebased vaccine development and for use among human populations with genetic variability. there are several databases ( table ) that can provide a wide range of information for all forms of immunological studies. generated data from the studies are further organized and stored in the databases (table ) to provide a means for the development and advance in immunotargets and therapy : immunological research. a tour on these databases will actually stimulate some interest in the vaccinology of emerging and re-surging disorders attributable to pathogen including cancer. emerging infections (eis) include infections that are entirely new in a population or that may have existed before in the population but are now gaining rapid and continued spread and/or wide geographical range. re-emerging or re-surging infections represent the infections that were previously of historical relevance but are now quickly becoming relevant because of either increasing incidence or increasing geographical and/or human host range while emerging infections represent the infections that were not originally observed in man. several factors such as human behavioral changes, environmental changes, and host/intermediate factors, animal-human switching and microbial genetic changes all affect infectious disease emergence and spread. these factors interact to promote the evolution of pathogens into new ecosystems, infect, spread and thrive in their new hosts. the overall consequences of these are continued epimatrix this is an in-silico product of epivax developed for predicting and identifying the immunogenicity of therapeutic proteins and epitopes. it is also used to re-design proteins and in designing t-cell vaccine has been applied in comparing strings from different strains of same pathogens and for pathogens identification. configuration of conservatrix allows for amino acid replacement at unusual positions. highly conserved t-cell epitopes in variable genomes such as some viruses are amenable to the algorithm , clustimer potential t-cell epitopes usually aggregate in specific immunogenic consensus sequence (ics) regions as clusters of - amino acids with - binding motifs instead of randomly distribute throughout protein sequences. in combination with epimatrix, the clustimer algorithm may be used to identify those peptides with epimatrix immunogenicity cluster scores ≥ + . such peptides are usually immunogenic and tend to make a promising vaccine candidate. blastimer using blastimer program, one may also choose to automatically blast "putative epitopes against the human sequence database at genbank". blasting screens off those epitopes with possible autoimmunity and cross reactivity questions and locates the epitopes that can safely be used in developing human or animal vaccine. blastimer can also blasts sequences against pdb, swissprot, pir, prf and non-redundant genbank cds translations. vaccine cad this algorithm evaluates junctional epitopes for possible immunogenicity and inserts "spacers and breakers into the design of any string-of-beads construct". infectious disease emergence and re-emergence, epidemics and public health challenges. emerging infections and multi-antibiotic-resistant strains of pathogenic bacteria usually surge from one geographic location from where it spreads to other places due to immigration of people. , most emerging infections originate from a specific population and can spread to a new population or become selectively advantaged that it can lead to the emergence of new strains of the pathogen. , , also, there could be microbial traffic, in which case, an infectious agent transfers from animals to humans or spreads from isolated groups to new populations. , , several factors, including ecology, are known to be associated with infectious disease outbreaks. these factors bring man into close contact with a natural disease reservoir/host. with an increasing world population and poor infection control, the emergence of infection and increased microbial populations are sure. the human growth population will only increase the spread of the infection across populations. the information provided in table is the list of remerging infections and current emerging diseases put forward during the who annual review. the review noted that these infections, if not well controlled, can cause disease outbreaks, bioterrorism and similar occurrences requiring urgent public health attention and that with the dearth of efficacious medicines or vaccines, there is a compelling demand for continuous as well as accelerated research and development in those areas. advances in genomics, proteomics, immunomics, vaccinomics and nanotechnology are being continually exploited in diagnostic, therapeutic and in rational drug and vaccine development. these advances have also served in the control of the afore-mentioned emergences. , the knowledge of the emerging pathogen's genome, protein make-up, pathogen-immune system interactions and researching the possible therapeutics will go a long way in directing the optimum path to containing the infection spread and controlling potential re-emergence or emergence in a different population. approaches in direct and computer-based structural determinations, protein-protein interactions predicting, and bioinformatics tools now exist and are used in modern-day development of drug and biologics. vaccine development has been sped up through the advance in the knowledge of the immune system of man. researches in the traditional targets of vaccines (adaptive immune response) and the less specific and fast-acting innate immune responses have been clear evidences for this advance. [ ] [ ] [ ] as our understanding of the intercourse between innate and adaptive immunity increases, reasons and opportunities for more effective vaccine adjuvants will open up. this can be a step forward in solving a critical world's health challenge per population. following the conventional approach of vaccine design, much cannot be achieved but when the knowledge of immunoinformatics is applied, population safety and disease control can be achieved through pathogen's genome sequencing leading to optimum new vaccine design or development of a novel vaccine for the infection. antigenic variability is an important mechanism pathogens use to evade their host immunity. the surface proteins of pathogens are normally variable. this assists them to escape recognition by the immune system. a successful infectious agent presents to the host immune system information that differs from that of its virulence. pathogenic organisms have organized systems of escaping destruction by the immune system of their hosts. for instance, toxoplasma invades and appropriates the host cells thereby circumventing phagocytosis and then spread within their host to establish infection. vertebrates on their own are endowed with immune system robust enough to efficiently and effectively surmount the non-self-attacks. yet the more the host's immune system elaborates, the better the organisms in their evasion of immune effector cells. antigenic variation refers to a pathogen's ability to modify its surface proteins such that it can circumvent the host's immunological attacks. it involves several mechanisms including the varying of surface protein's phase, shifting and drifting of surface protein antigens and/or any other form of alteration of antigenic protein. antigenic variation plays significant roles in the pathogenicity of microorganisms by evasion of the host immune responses and establishment of re-infection. when a pathogen alters its surface antigens, it can evade the host's adaptive immunity and so reestablishes infection. the immune system may battle to generate new immunoglobulins against the new antigen. certain bacteria like neisseria gonorrhea, neisseria meningitides, mycoplasma and species of the genus streptococcus show antigenic diversity. in eukaryotic pathogens, antigenic variation is shown by trypanosoma brucei and plasmodium falciparum. , another vital cause of antigenic variation in bacteria is horizontal gene transfer (more important than point mutation) through plasmid acquisition and transduction via bacteriophages. virulence genes are normally acquired by non-virulent organisms via these routes. once this occurs, the new bacteria may quickly get established and cause fresh epidemic outbreak. species of the genus neisseria are champions in the rapid change of surface antigens amongst bacterial pathogens. pathogenic forms exhibit an amount of phenotypic variability not found in the commensal species. the pathogenic forms are implicated in std and meningitis. they employ amazing varieties of antigenic variability mechanisms. • they can recombine their pilin genes in a similar manner that eukaryotes recombine their own genes, such that they can express variable surface protein. • some cell-surface proteins and enzymes synthesizing bacterial cell-surface carbohydrates are expressed in a variety of ways. this is as a result of replication slipping or slippage errors and repairs of simple tandem nucleotide repeats involving either the di-, or tri-or tetra-nucleotides. • neisseria is able to take up and incorporate environmental dna into its genomes. these are why an effective vaccine against neisseria infections is not yet developed. neisseria may be considered as an extreme example. however, many other bacterial pathogens like streptococcus and mycoplasma in promoting their antigenic variation tend to utilize one or more of these techniques. additionally, there are reports that dna-related defects have a much greater association with bacterial pathogen from symptomatic patients than samples of the same bacterial species isolated from environmental sources. [ ] [ ] [ ] pneumococcus streptococcus pneumoniae, gram-positive cocci bacteria that cause otitis media, bacteremia and pneumonia, are a public health concern, causing morbidity and mortality in adults and children. two forms of vaccines (polysaccharide and conjugate vaccines) are currently marketed for the prevention of pneumococcal infections. while the polysaccharide vaccines are for vaccination in the adult population, conjugate vaccines have an added immunogenic non-pneumococcal protein conjugated to the pneumococcal polysaccharides for enhanced immunogenicity in children. it is not yet known that these vaccines can evoke complete immunity against the infection. a polysaccharide capsule is a major virulence factor in the bacteria. several of these capsule types have been identified, and these form the basis of pathogen's antigenic serotyping. , current pneumococcal vaccines are combinations of various capsular (polysaccharide) antigens from the serotypes most common in a particular population. currently, over different serotypes are known but are not all covered in the available vaccines. the discovery of a common antigen(s) will produce an effective vaccine. knowledge of the genome of the organism and the different strains has led to a possible advance in driving the pneumococcal potential vaccine search through a different approach. and this consideration will help solve a lot of concerns about the current vaccines. with this knowledge, many methods are been tried to determine whether they can be a source of effective vaccine design that can accommodate all the serotypes of the organism. search for antigen that is common to all the serotypes can be achieved with the knowledge of genomics and immunoinformatics. the introduction of genomic and computational technologies has given new directions in the study of bacterial pathogenesis and vaccine design. , plasmodium plasmodium falciparum undergoes two life cycles: one in humans and the other in mosquitoes. the human host's erythrocytes and hepatocytes usually display modified parasite proteins called plasmodium falciparum erythrocyte membrane protein (pfemp ) and plasmodium falciparum hepatocyte membrane protein (pfhmp ), respectively. these proteins function to assist the parasite to evade destruction by the host immune systems. , the pfemp proteins were identified as the prime ligands responsible for cytoadherence and resetting. they cause the infected rbcs of host tissues to sequester thus helping the parasite to circumvent clearance by the host's spleen. the membrane proteins also shield infected host cells from destruction by the spleen by adhering to the endothelium. luckily, if the pfemp protein is expressed for a long while; it comes under attack by the naturally acquired immunity. , in defence to this, the parasite has expanded the var genes coding for pfemp such that the genes can exist as a polymorphic family of as much as over members in every genomic haploid. antigenic switches work well here in that members of the polymorphic family (also called antigenic-variant-protein family) can be interchanged and cannot express their proteins at the same time. in this way, only one particular protein at the surface of the infected rbc is expressed at any given time. , when studying antigenic-variant-protein families, it is pertinent to understand if grouping them into single-family results in any meaningful antigenic activity. studies have tried to understand the "languages" of the antigenic variant of pfemp proteins. , , they sought to know the pfemp proteins binding properties or search to understand the correlation between motifs and infection severity. the vardb database is a repository for protein sequences involved in antigenic variation and their associated functionalities. antigenic variant data obtained from several pathogens may be regrouped into a unified database. this will enable researches from several multicopy gene families to be accessed and compared swiftly in a single moment. updated vardb database contains close to , dna sequences, several protein translations, tens of infectious diseases and pathogens with their gene families. with a novel sequencing-based approach, pacbio, the different pfemp proteins can be sequenced and the related sequences used as potential vaccine targets. , trypanosoma for many pathogens, antigenic variability occurs during the infection pathogenesis and is to enable them to escape destruction by the host antibodies. for instance, some eukaryotic parasites take to genetic assortment and rearrangement thereby changing their surface antigens. a ready example is seen in trypanosoma brucei, the causative organism for sleeping sickness. trypanosoma brucei replicates in the bloodstream (outside the cells) of their host, but at maturity, it crosses the blood-brain barrier to cause several fatal complications. during replication in the bloodstream, the parasites are subjected to humoral as well as cellular immune responses. it evades the host defenses by encasing itself in homogeneous coat of glycoprotein called the variant surface glycoprotein (vsg). , though at initial invasion, the protein coat tends to protect the microbe from the immune system but on constant exposure, the coat will be identified as a foreign matter, and at this point the immune effectors can launch an attack against it. in a particular trypanosoma brucei, there are diversities of the vsg protein being coded by more than a thousand genes in the parasite's genome. unfortunately for the host, the expression of these genes is mutually exclusive. expressed vsg gene is normally due to genetic reassortments causing new alleles to be copied into the sites of expression. some trypanosomes with these abnormal vsg genes evade humoral immunity and multiply thereby causing re-infection and chronic recurring infections. influenza is a viral infectious disease due to infection by any of the three types of rna viruses, namely influenza types a, b and c. current vaccines contain double type a and single type b strains and induce strong antibody responses to neuraminidase and the surface glycoprotein hemagglutinin. these vaccines, however, cannot effectively protect against newly emerging viruses with antigenic shift and drift. , antigenic drift results in changes in the antigenic site (a minor change) while antigenic shift results in a new virus subtype. hemagglutinin and neuraminidase are the two enzymes dictating the antigenic properties of the viruses. while inside its host, defined host proteases break the peptide bonds in the hemagglutinin molecule to form hemagglutinin and subunits. virulence tendencies are decreased when the amino acids at the cleavage sites are lipophobic, the virus exhibits high virulence tendencies. the surface glycoprotein can be regarded as antigen and hence can serve as a target for the immune system which if sequenced, using the new immunoinformatics approach and a common site for the varying proteins identified, a potent vaccine can be developed which can accommodate the antigenic drift/shifts of the virus. influenza viruses are able to thrive for a long while in a given human population. , the virus has a high mutation rate such that a once effective vaccine can easily lose efficacy. antigenic variability is only one of the evidences of phenotypic variation in the biology of the influenza virus. the use of immunoinformatics in vaccine development has been accelerated toward the design of a multiepitope vaccine construct which has and will fully address the challenges faced with pathogens with mutagenic antigens. previous vaccines developed by conventional approaches consist of several proteins or a whole pathogen. this constitutes unwarranted antigenic load and increases the chances of inducing allergy. the use of peptide-based vaccines surmounts these challenges. the vaccines are made from short peptide fragments capable of eliciting highly specific immune responses, precision targeting and multiepitope constructs, in the case of varying antigenic peptides, which has been made feasible with the advancements in the field of computational biology. [ ] [ ] [ ] [ ] vaccines for pathogens with immune escape potentials can basically be constructed by using most, if not all, of their immunogenic peptides , because such vaccines prove to be better than single-epitope and classical vaccines. multiepitope vaccines enjoy the following advantages over single-epitope and classical vaccines: a) they are an assemblage of several epitopes obtained from distinct protein targets/antigens of an intended infection; b) the multiple t-cell receptors (tcrs) in the vaccine recipient can easily recognize vaccines with multiple hla epitopes; c); they can be easily adjuvanted to improve their immunogenicity; d) they can activate antibody-mediated and cell-mediated immunological responses because of their overlapping helper t lymphocytes (htl), cd + t-cell and b-cell epitopes; and e) unwanted protein antigens are excluded in such construct thereby reducing the chances of untoward effects and/ or immune responses likely to cause disease(s). [ ] [ ] [ ] [ ] [ ] [ ] thus producing a vaccine with these qualities can provide chances of combating most infections such as streptococcus pneumoniae and hiv infections. immunoinformatics can be employed in the docking of single and multiepitope vaccines and subsequently to predict their properties (physicochemical, allergenic and antigenic). this approach has seen the use of diverse tools and database in the analysis of ligands with their targets and has greatly helped to predict the binding score of antigenic peptides with the immune proteins like hla. peptides and hla allele modeling can be done by the d structural designing of the epitopes using pepfold (an online server), retrieving from protein data bank (pdb) the x-ray crystallographic structure of human population most occurring hla alleles (hla-drb : , : and hla-a : ) followed by filtration of previously bound ligands. the following is a step-wise detail on how to construct a single or multiepitope-based vaccine and its property prediction; • molecular docking analysis: to determine the interaction pattern of the screened out epitopes with the hla alleles by employing cluspro v. (a proteinprotein docking web server). this server performs this task by energy minimization, calculation of both the binding energy scores of the docked complex and electrostatic/shape complementarity. • target-protein comparative modeling and associated structure validation: the sequence of the amino acid in the target protein (e.g., tlr- ) can be retrieved from uniprot and the tertiary structure with raptor-x and i tasser (online comparative modeling tools). the server constructs and creates a d model immunotargets and therapy : (mathematical representation) of the target protein using hierarchical algorithms. personalized vaccine refers to vaccines "targeted" toward an optimized outcome. immunogenicity is maximized while either the risk of vaccine failure or reactogenicity and side effect is minimized. personalized are developed in the following cases; the individual level vaccines are developed to take care of haplotype and polymorphism knowing that they can retard the formation of a protective immune response or become pointers to the risk of an adverse vaccine reaction. this is needed when it is clear that females produce a higher antibody titre against a particular vaccine antigen than do their male counterparts. where it is clear that a particular human race or ethnic group has a higher or lower immune response to a particular vaccine antigen. personalized vaccines arise due to known complex interactions between host environmental, genetic and some other factors that may be influencing the vaccine immune responses. the associations between the immune response gene polymorphisms and variations in immune responses to a particular gene must be pine-pointed when it is clear that a particular drug either suppresses or augments the transcription of an immune response gene. , this could help in designing vaccines or vaccine adjuvants that can circumvent restrictions due to immunological differences arising from varying genetic compositions. , personalized vaccines stem from our understanding of how, within the human leukocyte antigen (hla) systemalso referred to as the major histocompatibility complex (mhc), the t cells are able to recognize peptides of pathogenic origin. , hla molecules enjoy the double advantages of having stable polymorphisms and being fully characterized. these advantages make good candidates for personalized vaccine design. hla polymorphism, although stable, is complex. for instance, more than , alleles of hla class i molecules and greater than class ii molecules have been identified among human populations. , hla class i and ii molecules have heterodimeric character comprising of α and β chains, three highly variable extracellular domains (α , α , and α ) and then transmembrane and intracytoplasmic domains that are less variable. , hla genes contain eight exons. exon is responsible for producing the leader peptide; exons , , produce α , α , and α extracellular domains, respectively, for mhc class or α , β and α , respectively, for mhc class ii; exon encodes transmembrane anchor; exons and encodes the cytoplasmic tail while exon encodes the ʹ-untranslated region. most of the several forms associated with the class i and ii genes are seen in α- and α- (as known as class i) and in α- and β- (as known as class ii) domains. mhc i and ii bind and present the peptide to t cells. t cell responses to viral pathogens differ from one patient to the other, basically because of the expression of differing hla (mhc) alleles which determine the several viral amino acid sequence brought to the t cells to read. , it is most likely that during an infection, diverse epitopes are usually presented to the t cells to read owing to the several forms of hla alleles and also because each human person expresses six hla class i and six hla class ii alleles. now, antibody-binding sites in a given hla (mhc) molecule are mostly prediction-servers predetermined on the basis of particular binding motifs and the anchor residues. , these residues refer to known amino acids located at defined locations in the peptide chain and which are peculiar to each mhc molecule. , prediction-server database of peptide motifs and/or of mhc ligands may be obtained from web-based and/or from prediction-servers dedicated to netmhc family. , in another example, sequence analysis of lassa fever virus (the lasv) and other viruses' immunoproteomic was used to identify the best immunogenic protein predicting t-cell as well as b-cell epitopes and also target sequence and binding sites. , the ssnlykgvy peptide sequence at aa - of glycoprotein (produced by the l segment) was the best candidate epitope for the induction of humoral as well as the cell-mediated immunity for lassa fever vaccine construct. hla-i and hla-ii molecules have been proven in sizable african populations and their combination with the ssnlykgvy peptide sequence may prove useful in such lassa fever virus endemic areas. this approach will strongly improve individualized vaccination and help combat emerging infections. the hla region is suspected to contribute, to a large extent, to genetic propensity to infections and differences in vaccine expected immune responses. , studies show that females exhibit stronger immune responses to immunization compared to males. , there are differential antibody responses to rubella and measles viral protein between males and females and that both hormonal and genetic difference may be influencing the immune responses. , , practical issues may stand in the way of achieving this new development (personalized vaccinology). having to use different vaccines for different persons based because of personal genetic composition requires more time and labor during the vaccination process. also, screening for individual factors for targeted vaccination can significantly increase vaccination cost. but with all these challenges, personalized vaccination is the new age approach in achieving an optimum immunization that takes into consideration the individual immune differences in a particular population and it is a new dawn for vaccine development. personalized vaccine development is strongly improved by vaccinomics. the field of vaccinomics looks at how immune response gene polymorphisms affect the cellmediated, humoral and innate immune responses to vaccine antigens at population and specific individual levels. "vaccinomics" encompasses both immunogenomics and immunogenetics as it concerns immune responses to vaccine antigen. the fields of personalized vaccinology and vaccinomics were the products of phase i of the international hapmap and that of the human genome project. also, modern molecular assay techniques permitting highthroughput detection of variations at gene level, in particular linkage disequilibrium maps and single nucleotide polymorphism (snp), played significant roles in the development of personalized vaccinology and vaccinomics. it has also been shown that polymorphisms at vital immune response genes can bring about differing immune responses to biopharmaceutical products including vaccines. [ ] [ ] [ ] newer, accurate, cheap and reproducible sequencing technologies; validated databases containing genotypephenotype data; statistical and bioinformatics tools are needed in order to analyze and interpret data that will help and improve vaccine adverse and immune response quantifiability and predictability. the information will enhance clinical practice and accelerate rational and directed vaccine development. safe vaccines are a critical requirement for any immunization program. conventional vaccination has been an approach targeted at all groups and individuals but has failed toward the enrolment of pregnant women into vaccination programs because of presumed fetal and maternal harms. , evidence on the safety of vaccination in pregnancy is very small because pregnant women were usually excluded from participating in vaccine trials. pregnancy can alter the maternal as well as fetal immunological responses. it is pertinent to explore research opportunities presented in advanced vaccine designs such as immunoinformatics (multiepitope vaccine docking) by studying human immune system functions and responses specific to pregnant women and their unborn children. according to a report from the dominican republic of congo, during the - zika virus outbreaks, over a thousand pregnant women were suspected of being infected with the virus and a sizable number were at their first trimester. the report further stated that fetal loss was approximately one-tenth of the pregnancies and that there were up to cases of fetus with head circumferences smaller than normal. the widespread morbidity during the epidemic showed that zika virus infection adversely affects pregnancy outcome. , currently, there is no proof that pregnancy predisposes to ebola virus infections in comparison with the non-pregnant population, but there is some evidence suggesting pregnancy to worsen the disease prognosis including fetal loss. also, evidence showed that the virus can pass the placental barriers to establish infection in the unborn child. designing, implementing and enrolling pregnant women as well as perspective pregnant women into vaccine trials and programs is imperative in order to protect this group and ensure good vaccine uptake by them during infection outbreaks and epidemics. , these recommendations will give an informed decision to be investigated using the immunoinformatic tools to determine the immunogenic responses worthy of safe vaccine development for the pregnant women and perspective pregnant women group. maternal immunization offers palpable benefits in several ways. some vaccines are primarily administered to shield these pregnant women from morbid conditions and/ or death including fetal death. , pregnant women stand the risk of being exposed to virulent pathogens and may be at a higher risk of morbidity and/or mortality when compared to the general population. there has been an explosion of new immunological data (table ) due to an increase in research to understand the immune system pathway in infectious disease pathogenesis and the application of the knowledge of bioinformatics has led to a better exposition of the immune system importance through immunoinformatics. the knowledge of immune system and the cost-effective, specific and effective approach like immunoinformatics, the concerns for emerging and re-surging diseases caused by pathogenic organisms, antigenic variability/complex lifecycle of pathogens and the need of personalized vaccination can be combated on a molecular level. the future of immunological research is sharpened by the ability to make discoveries in biologics (e.g., vaccines) more effectively and efficiently. this will mean reduction and better targeting of wet laboratory experiments and will only be possible if wet laboratory experimentation is combined with bioinformatics techniques. • immunoinformatics depends on experimental science (wet laboratory) to produce raw data for analysis. the predictions are not formal proofs of any concepts. they do not replace the traditional experimental research methods of actually testing hypotheses. • the quality of immunoinformatics predictions depends on the quality of data and the sophistication of the algorithms being used. sequence data from high-throughput analysis often contain errors. if the sequences are wrong, or annotations incorrect, the results from the downstream analysis are misleading as well. immunotargets and therapy is an international, peer-reviewed open access journal focusing on the immunological basis of diseases, potential targets for immune based therapy and treatment protocols employed to improve patient management. basic immunology and physiology of the immune system in health, and disease will be also covered. in addition, the journal will focus on the impact of management programs and new therapeutic agents and protocols on patient perspectives such as quality of life, adherence and satisfaction. the manuscript management system is completely online and includes a very quick and fair peer-review system, which is all easy to use. immunotargets and therapy : time for t? immunoinformatics addresses vaccine design for neglected tropical and emerging infectious diseases vaccinology in the third millennium: scientific and social challenges antigenic variability: obstacles on the road to vaccines against traditionally difficult targets complex immune correlates of protection in hiv- vaccine efficacy trials immunoinformatics approach to design a novel epitope-based oral vaccine against helicobacter pylori immunoinformatics approach for multiepitopes vaccine prediction against glycoprotein b of avian infectious laryngotracheitis virus transcriptome and proteome: the rise of omics data and their integration in biomedical sciences neoantigen vaccine: an emerging tumor immunotherapy designing of cd + and cd + -overlapped cd + epitope vaccine by targeting late and early proteins of human papillomavirus immunization with a recombinant antigen composed of conserved blocks from tsa provides broad genotype protection against scrub typhus vaccine mediated protection against zika virus-induced congenital disease vaccination with sporozoites: models and correlates of protection novel approaches for the design, delivery and administration of vaccine technologies vaccination to gain humoral immune memory humoral and cell-mediated immune responses after a booster dose of hbv vaccine in hiv-infected children, adolescents and young adults an evaluation of the cold chain technology in south-east, nigeria using immunogenicity study on the measles vaccines a review of the immunological mechanisms following mucosal vaccination of finfish reverse vaccinology . : human immunology instructs vaccine antigen design large screen approaches to identify novel malaria vaccine candidates reverse vaccinology and subtractive genomics reveal new therapeutic targets against mycoplasma pneumoniae: a causative agent of pneumonia evolution of the immune system in humans from infancy to old age the twilight of immunity: emerging concepts in aging of the immune system pneumococcal vaccination strategies. an update and perspective progress and challenges in tb vaccine development vaccines for the elderly: current use and future challenges remembrance of things past: long-term b cell memory after infection and vaccination global practices of meningococcal vaccine use and impact on invasive disease new vaccine technologies to combat outbreak situations what are the most powerful immunogen design vaccine strategies? reverse vaccinology . shows great promise the new malaria vaccine program for african children is promising but still quite limited. quartz africa merck's ebola vaccine helps combat deadly outbreak in the congo as the virus spreads. biotech and pharma innovation for the 'bottom million': eliminating neglected tropical diseases in the americas emerging and neglected infectious diseases: insights, advances, and challenges exploitation of reverse vaccinology and immunoinformatics as promising platform for genome-wide screening of new effective vaccine candidates against plasmodium falciparum the use of databases, data mining and immunoinformatics in vaccinology: where are we? systems vaccinology and big data in the vaccine development chain biochemical functional predictions for protein structures of unknown or uncertain function reverse vaccinology: the pathway from genomes and epitope predictions to tailored recombinant vaccines universal genotyping for tuberculosis prevention programs: a -year comparison with on-request genotyping integrating whole-genome sequencing data into quantitative risk assessment of foodborne antimicrobial resistance: a review of opportunities and challenges bioinformatics and drug discovery new technologies in predicting, preventing and controlling emerging infectious diseases how genomics can be used to understand host susceptibility to enteric infection, aiding in the development of vaccines and immunotherapeutic interventions in silico analysis of epitope-based vaccine candidates against hepatitis b virus polymerase protein advances in designing and developing vaccines, drugs and therapeutic approaches to counter human papilloma virus computational approaches and challenges to developing universal influenza vaccines. vaccines (basel) experimental and computational analyses reveal that environmental restrictions shape hiv- spread in d cultures dynamic computational model of symptomatic bacteremia to inform bacterial separation treatment requirements a model of plasmodium vivax concealment based on plasmodium cynomolgi infections in macaca mulatta a computational modeling approach predicts interaction of the antifungal protein afp from aspergillus giganteus with fungal membranes via its γ-core motif. msphere an overview of bioinformatics tools for epitope prediction: implications on vaccine development bioinformatics tools for identifying class i-restricted epitopes in silico-accelerated identification of conserved and immunogenic variola/vaccinia t-cell epitopes nhlbi-abdesigner: an online tool for design of peptide-directed antibodies ivax: an integrated toolkit for the selection and optimization of antigens and the design of epitope-driven vaccines from genome to vaccine: in silico predictions, ex vivo verification designing string-of-beads vaccines with optimal spacers hiv vaccine development by computer assisted design: the gaia vaccine nerve: new enhanced reverse vaccinology environment jennerpredict server: prediction of protein vaccine candidates (pvcs) in bacteria based on host-pathogen interactions genome-wide prediction of vaccine target of human herpes simplex viruses using vaxign rv vaxijen: a server for prediction of protective antigens, tumour antigens and subunit vaccines vacsol: a high throughput in silico pipeline to predict potential therapeutic targets in prokaryotic pathogens using subtractive reverse vaccinology panrv: pangenome-reverse vaccinology approach for identifications of potential vaccine candidates in microbial pangenome understanding emerging and re-emerging infectious diseases. bethesda (md: national institutes of health (us) the consequences of human actions on risks for infectious diseases: a review institute of medicine (us) forum on microbial threats. microbial evolution and co-adaptation: a tribute to the life and scientific legacies of joshua lederberg: workshop summary trends in antimicrobial resistance of bacterial pathogens in harare understanding emerging and re-emerging infectious diseases. bethesda (md): national institutes of health (us) evolution and emergence of infectious diseases in theoretical and real-world networks carbapenem-resistant enterobacteriaceae posing a dilemma in effective healthcare delivery annual review of diseases prioritized under the research and development blueprint the role of nanotechnology in the treatment of viral infections proteomics for development of vaccine computational methods in drug discovery computational approaches for prediction of pathogen-host protein-protein interactions new approaches to understanding the immune response to vaccination and infection adaptive immune responses mediated by natural killer cells friend retrovirus studies reveal complex interactions between intrinsic, innate and adaptive immunity from entry to early dissemination-toxoplasma gondii's initial encounter with its host evolution and divergence of h n equine influenza viruses circulating in the united kingdom from neisseria genomics: current status and future perspectives antigenic variation in plasmodium falciparum malaria involves a highly structured switching pattern neisseria gonorrhoeae muts affects pilin antigenic variation through mismatch correction and not by pile guanine quartet binding antigenic variation in bacterial pathogens mechanisms and regulation of extracellular dna release and its biological roles in microbial communities mobile genetic elements in neisseria gonorrhoeae: movement for change. pathog dis genomic characterization of novel neisseria species the pneumococcus: epidemiology, microbiology, and pathogenesis. cold spring harb perspect med purification of capsular polysaccharides of streptococcus pneumoniae: traditional and new methods pneumococcal capsules and their types: past, present, and future whole-genome sequencing of bacterial pathogens: the future of nosocomial outbreak analysis human genomic loci important in common infectious diseases: role of high-throughput sequencing and genome-wide association studies immune response and evasion mechanisms of plasmodium falciparum parasites plasmodium falciparum pfemp modulates monocyte/macrophage transcription factor activation and cytokine and chemokine responses plasmodium falciparum proteins involved in cytoadherence of infected erythrocytes to chemokine cx cl effect of mature blood-stage plasmodium parasite sequestration on pathogen biomass in mathematical and in vivo models of malaria controlled human malaria infection with plasmodium falciparum demonstrates impact of naturally acquired immunity on virulence gene expression multiple plasmodium falciparum erythrocyte membrane protein variants per genome can bind igm via its fc fragment fcμ plasmodium falciparum malaria parasite var gene expression is modified by host antibodies: longitudinal evidence from controlled infections of kenyan adults with varying natural exposure expression of the plasmodium falciparum clonally variant clag genes in human infections vardb: a pathogen-specific sequence database of protein families involved in antigenic variation var csa binding phenotype has ancient origin and arose before plasmodium falciparum crossed to humans: implications in placental malaria vaccine design. sci rep global genetic diversity of var csa in plasmodium falciparum with implications for malaria in pregnancy and vaccine development how does the vsg coat of bloodstream form african trypanosomes interact with external proteins? variant surface glycoprotein density defines an immune evasion threshold for african trypanosomes undergoing antigenic variation towards a universal influenza vaccine: different approaches for one goal the emerging influenza virus threat: status and new prospects for its therapy and control influenza a virus hemagglutinin antibody escape promotes neuraminidase antigenic variation and drug resistance a comprehensive review on equine influenza virus: etiology, epidemiology, pathobiology, advances in developing diagnostics, vaccines, and control strategies novel platforms for the development of a universal influenza vaccine computer aided epitope design as a peptide vaccine component against lassa virus genome-wide identification of novel vaccine candidates for plasmodium falciparum malaria using integrative bioinformatics approaches. biotech in-silico design of a multi-epitope vaccine candidate against onchocerciasis and related filarial diseases antigenic variation and immune escape in the mtbc vaccinomics approach for designing potential peptide vaccine by targeting shigella spp. serine protease autotransporter subfamily protein siga designing a multi-epitope based vaccine to combat kaposi sarcoma utilizing immunoinformatics approach efficient control of chronic lcmv infection by a cd t cell epitope-based heterologous prime-boost vaccination in a murine model a novel multi-epitope vaccine from mmsa- and dkk for multiple myeloma immunotherapy evaluation of tandem chlamydia trachomatis momp multi-epitopes vaccine in balb/c mice model development of a multi-epitope peptide vaccine inducing robust t cell responses against brucellosis using immunoinformatics based approaches identification of a cd t-cell epitope in the hemagglutinin stalk domain of pandemic h n influenza virus and its antigen-driven tcr usage signature in balb/c mice pep-fold : faster de novo structure prediction for linear peptides in solution and in complex the protein model portal-a comprehensive resource for protein structure and model information exploring leishmania secretory proteins to design b and t cell multi-epitope subunit vaccine using immunoinformatics approach ap- transcription factors as regulators of immune responses in cancer vaccinomics and personalized vaccinology: is science leading us toward a new path of directed vaccine development and discovery? milieu intérieur consortium. distinctive roles of age, sex, and genetics in shaping transcriptional variation of human immune responses to microbial challenges evaluation of levamisole as an adjuvant for typhoid fever vaccine formulation hla and infectious diseases predicting antigen presentation-what could we learn from a million peptides? front immunol development of a human leukocyte antigen-based hiv vaccine hla dna sequence variation among human populations: molecular signatures of demographic and selective events major histocompatibility complex: antigen processing and presentation harnessing the power of t cells: the promising hope for a universal influenza vaccine. vaccines (basel) recalling the future: immunological memory toward unpredictable influenza viruses human leukocyte antigen (hla)-binding epitopes dataset for the newly identified t-cell antigens of mycobacterium immunogenum fundamentals and methods for t-and b-cell epitope prediction human leukocyte antigen (hla) and immune regulation: how do classical and non-classical hla alleles modulate immune response to human immunodeficiency virus and hepatitis c virus infections? front immunol amino acid signatures of hla class-i and ii molecules are strongly associated with sle susceptibility and autoantibody production in eastern asians predicting hla class i non-permissive amino acid residues substitutions major histocompatibility complex class i binding predictions as a tool in epitope discovery prediction of mhc class i binding peptides using profile motifs design of peptide-based epitope vaccine and further binding site scrutiny led to groundswell in drug discovery against lassa virus immunoinformatics approach for epitope-based peptide vaccine design and active site prediction against polyprotein of emerging oropouche virus polymorphisms of immunoglobulin receptors and the effects on clinical outcome in cancer immunotherapy and other immune diseases: a general review sex differences in vaccine-induced humoral immunity sex differences in older adults' immune responses to seasonal influenza vaccination biological sex affects vaccine efficacy and protection against influenza in mice differential antibody responses to rubella virus infection in males and females genomics of immune response to typhoid and cholera vaccines human immune system variation omic technologies and vaccine development: from the identification of vulnerable individuals to the formulation of invulnerable vaccines advances in genetics and genomics: use and limitations in achieving malaria elimination goals. pathog glob health safety evaluation in mice of the childhood immunization vaccines from two south-eastern states of nigeria prevent working group. pregnant women & vaccines against emerging epidemic threats: ethics guidance for preparedness, research, and response vaccinations for pregnant women efficacy and safety of pertussis vaccination for pregnant women -a systematic review of randomised controlled trials and observational studies beyond passive immunity: is there priming of the fetal immune system following vaccination in pregnancy and what are the potential clinical implications? front immunol zika virus epidemic in pregnant women ebola virus disease in pregnancy: clinical, histopathologic, and immunohistochemical findings the ethics working group on zikv research & pregnancy. pregnant women & the zika virus vaccine research agenda: ethics guidance on priorities, inclusion, and evidence generation maternal immunization maternal immunization: where are we now and how to move forward strengthening maternal immunisation to improve the health of mothers and infants immunoinformatics: in silico approaches and computational design of a multi-epitope, immunogenic protein while this review has not been funded directly by them, we gratefully acknowledge the drug design laboratory of faculty of pharmaceutical sciences, nnamdi azikiwe university, nigeria, and drug discovery africa. all the needed data are included in this manuscript. all authors contributed to data analysis, drafting or revising the article, gave final approval of the version to be published, and agree to be accountable for all aspects of the work. the authors report no funding and no conflicts of interest in this work. key: cord- -dt zvebj authors: he, yonghua; schmidt, monica a.; erwin, christopher; guo, jun; sun, raphael; pendarvis, ken; warner, brad w.; herman, eliot m. title: transgenic soybean production of bioactive human epidermal growth factor (egf) date: - - journal: plos one doi: . /journal.pone. sha: doc_id: cord_uid: dt zvebj necrotizing enterocolitis (nec) is a devastating condition of premature infants that results from the gut microbiome invading immature intestinal tissues. this results in a life-threatening disease that is frequently treated with the surgical removal of diseased and dead tissues. epidermal growth factor (egf), typically found in bodily fluids, such as amniotic fluid, salvia and mother’s breast milk, is an intestinotrophic growth factor and may reduce the onset of nec in premature infants. we have produced human egf in soybean seeds to levels biologically relevant and demonstrated its comparable activity to commercially available egf. transgenic soybean seeds expressing a seed-specific codon optimized gene encoding of the human egf protein with an added er signal tag at the n’ terminal were produced. seven independent lines were grown to homozygous and found to accumulate a range of . +/- . to . +/- . μg egf/g of dry soybean seed. proteomic and immunoblot analysis indicates that the inserted egf is the same as the human egf protein. phosphorylation and immunohistochemical assays on the egf receptor in hela cells indicate the egf protein produced in soybean seed is bioactive and comparable to commercially available human egf. this work demonstrates the feasibility of using soybean seeds as a biofactory to produce therapeutic agents in a soymilk delivery platform. each year in the united states, more than , babies, approximately % of total births, are born before full weeks of gestation [ ] . as a growing health issue the rate of premature birth has increased by percent since the early s. one of the major problems associated with prematurity is the development of a condition known as neonatal necrotizing enterocolitis (nec) [ ] . this is observed clinically as the abrupt development of bloody diarrhea, abdominal swelling, and tenderness in a premature infant who is otherwise doing well [ ] . current treatment often requires surgical removal of the damaged and dead intestine, often resulting in mortality (about %) or, if the infant survives, to manifest significant resulting lifetime problems [ ] [ ] [ ] . although the direct cause of nec is not known, the most significant contributing factor is premature birth. post-partum establishment of an abnormal gut microbiome creates the opportunity for bacterial invasion into gut due to immature intracellular junctions of the intestinal mucosa [ , ] . experimental and clinical evidence suggest that prematurity and nec is associated with deficient endogenous production of epidermal growth factor (egf), which is necessary for normal intestinal development and repair [ , ] . egf is a critical growth factor found in multiple fluids that bathe the developing intestine including amniotic fluid, fetal urine, breast milk, bile, and saliva [ , , ] . in the amniotic fluid, there is an increasing concentration of egf as gestation progresses [ ] . egf amounts in mother's milk is highest first days after parturition with mothers of extreme pre-term neonates having - % higher than mother's milk of full term infants [ ] . human studies have demonstrated that egf is resistant to proteolytic degradation across a range of gastric ph [ ] . while egf is produced to some extent in duodenal brunner's glands and kidney, the vast majority of egf is produced in the salivary glands [ ] . exogenous infusion of egf in utero has been shown to accelerate the maturation of intestinal enzyme activity as well as stimulate intestinal growth [ , ] . the importance of egf to gut development is highlighted by the fact that knockout of the egf receptor in some mice strains results in death due to a bloody diarrhea that is remarkably similar to human nec [ ] . transgenic mice directed to intestinally overexpress egf displayed a number of beneficial effects, including increased body weight and villus height, after a small bowel resection compared to nontransgenic mice [ ] . conversely, inhibition of egf receptors impairs intestinal adaption following a small bowel resection [ ] . a prospective, multi-center trial demonstrated that infants fed regular formula (not containing growth factors) were to times more likely to develop nec than infants fed breast milk [ ] . while a large number of biologically active peptides and growth factors have been identified in breast milk, egf is one of the major peptides present in significant concentrations [ ] . the concentration of egf in milk is found to be inversely proportional to the gestational age of the infant, therefore, the more premature the infant, the more egf is present in the breast milk [ ] . this may be a compensatory response to the premature removal of the fetus from the egf-rich amniotic fluid. it has been demonstrated in several animal models of nec that administration of exogenous egf has been shown to significantly reduce the severity of intestinal injury [ , ] . the proactive treatment of infants at nec risk with egf supplementation could therefore accelerate intestinal maturation, thus preventing the development of nec. if the proactive egf feeding strategy is effective to induce the maturation of the neonate intestinal tract then this simple approach may mitigate the development of nec with its resulting high costs in medical resources, pain and possible life-long debilitation and for the % infants with nec that proves fatal [ ] [ ] [ ] . to accomplish such a proactive therapeutic approach adapting infant formula for egf delivery would be simple and economic and mimic the delivery of egf in mother's milk. the need and potential delivery makes infant formula containing egf a good model for food-sourced plant biotechnology. soybean-derived formula encompasses a significant fraction of the total infant formula market. soybean milk and derived products are a potential food-therapy delivery platform that could include a variety of medically-necessary products including drugs such as egf but might also include oral vaccines [ , ] . the economy of production and simple conversion into therapeutic materials that has long-shelf life makes soybean biotech products a potentially desirable commodity for use in cost-sensitive scaled applications. addressing the devastating disease of nec through a simple proactive treatment protocol is an excellent platform to explore the potential of soybean-produced therapeutics. here we report the accumulation of human egf (hegf) in geneticallyengineered soybean seeds and show that the recombinant egf is indistinguishable from authentic human egf and is bioactive at stimulating egf receptor (egfr) activity. epidermal growth factor protein from humans was produced in soybean seeds by constructing a plant gene expression cassette that involved a synthetic codon optimized egf nucleotide sequence (protein sequence from genbank accession ccq ). this bp open reading frame was placed in-frame behind a -amino acid endoplasmic reticulum (er) signal sequence from the arabidopsis chitinase gene [ , ] . the er-directed egf encoding open reading frame was developmentally regulated by the strong seed-specific storage protein glycinin regulatory elements [ ] . the entire seed specific cassette to direct egf production was placed in a vector containing the hygromycin resistance gene under the strong constitutive expression of the potato ubiquitin regulatory elements as previously described [ ] [ ] [ ] . the result plasmid pgly::shegf was sequenced using a glycinin promoter primer ( ' tcattcac cttcctctcttc ') to ensure the egf open reading frame was placed correctly between the regulatory elements. somatic soybean (glycine max l. merrill cv jack (wild type)) embryos were transformed via biolistics using mg/l hygromycin b selection and regenerated as previously described [ ] . embryos from resistant lines were analyzed by genomic pcr to confirm the presence of inserted hygromycin cassette using primers specific to the hygromycin gene (hygf 'ctcactattcctttgccctc ' and hygr 'ctgacctattgcatctcccg '), cetyl trimethyl ammonium bromide (ctab) extraction genomic dna isolation and the following amplification conditions: ng genomic dna in μl total reaction containing nm primers and u taq polymerase (neb) and the following cycling parameters (initial °c min then cycles of °c s, °c s, °c s; followed by a final extension of °c min). dry seeds from two successive generations of pcr positive plants were analyzed by elisa for the expression of egf protein until all lines were confirmed to be homozygous. egf transgenic soybean plants along with nontransgenic control wild type cultivar plants were grown side by side in a greenhouse at °c under h daylight with μm - /s. total soluble protein was extracted from dry seeds of two homozygous egf lines and a nontransgenic control by repeated acetone washes followed by acetone precipitation with the protein pellet dissolved in water. proteins with molecular weight kda and under were isolated by separately passing each extract through an amicon ultra centrifugal filter (merck, kenilworth nj). the samples were each suspended in sample buffer ( mm tris hcl, ph . % sds (w/v), . m β-mercaptoethanol, . % (w/v) bromphenol blue and % (v/v) glycerol) and then denaturated min °c. protein content was determined by bradford assay [ ] . a % sds-page gel was used to separate μg protein for each of the three samples: negative control wild type, lines and of egf transgenic soybean dry seeds. commercially available human egf (gibco, life technologies,united kingdom) was used at . μg as positive control. gel was electroblotted onto immobilon p transfer membrane (millipore, bedford ma) and blocked with % milk solution in tbs for at least hr. primary antibody was a commercially available anti-egf (calbiochem, san diego ca) and was used in a : ratio in % bsa-tbs buffer overnight at room temperature. after washes of mins each with tbs buffer, the blot was incubated with a : , ratio in tbs of secondary antibody anti-rabbit igg fabspecific alkaline phosphatase conjugate (sigma, st. louis mo). after washes, the presence of the egf protein was detected by using a color substrate (bcip/nbt: final concentrations . % (w/v) -bromo- -chloro- -indoyl phosphate and . % (w/v) nitro blue tetrazolium in % (v/v) dementhylformadmide) (kpl, gaithersburg ma). total soluble protein was extracted from dry soybean seeds as described previously [ , ] from all lines of pgly::shegf transgenic plants along with nontransgenic seeds as a negative control. egf was quantitated by commercially available human egf elisa assay (quantikine elisa kit from r&d systems, minneapolis mn) according to the manufacturer's instructions. the provided positive control was used to create a standard curve in order to determine the amount of egf in each soybean protein extract. each homozygote egf transgenic line was assayed with three biological replicates and results displayed as mean +/-standard error. total soluble proteins were extracted, quantitated and suspended in sample loading buffer as previously described [ , ] . approximately μg of protein extract from dry seeds of homozygous egf lines were separated on a - % gradient sds-page gel (biorad, hercules ca) along with extract from a nontransgenic seed. the gel was subsequently stained with . % total soluble protein was extracted from biological egf transgenic soybean dry seed samples, lines , and . as described above, proteins with molecular weights lowers than kda were concentrated using an amicon ultra centrifugal filter (merck, kenilworth nj). non-transgenic seeds were used as a negative control and μg commercially available egf (as above in immunoblot section) was the positive control. protein was precipitated by adjusting the solution to % (v/v) trichloroacetic acid and allowed to sit at °c overnight. precipitated proteins were pelleted using centrifugation, washed twice with acetone and then dried using vacuum centrifugation. the commercial egf was not filtered or precipitated, only dried. dried pellets were rehydrated with the addition of μl mm dithiothreitol in mm ammonium bicarbonate and placed at °c for minutes to reduce disulphide bonds. samples were then alkylated with addition of μl iodacetamide in mm ammonium bromide and placed at room temperature in the dark for minutes. two μg trypsin in μl mm ammonium bromide was added to each samples and placed in °c overnight for enzymatic digestion. post trypsin digest samples were desalted using a peptide reverse phase microtrap (michrom bioresources, auburn ca), dried and ultimately resuspended in μl of % (v/v) acetonitrile, . % (v/v) formic acid. separation of peptides was performed using a dionex u splitless nanoflow hplc system operated at nl minute using a gradient from - % acetonitrile over minutes, followed by a minute wash with % acetonitrile and a minute equilibration with % acetonitrile. the c column, an in-house prepared μm by cm reverse phase column packed with halo . μm, Å c material (mac-mod analytical, chadds ford pa) was located in the ion source just before a silica emitter. a potential of volts was applied using a liquid junction between the column and emitter. a thermo ltq velos pro mass spectrometer using a nanospray flex ion source was used to analyze the eluate from the u . scan parameters for the ltq velos pro were one ms scan followed by ms/ms scans of the most intense peaks. ms/ms scans were performed in pairs, a cid fragmentation scan followed a hcd fragmentation scan of the same precursor m/z. dynamic exclusion was enabled with a mass exclusion time of min and a repeat count of within sec of initial m/z measurement. spectra were collected over the entirety of each minute chromatography run. raw mass spectra were converted to mgf format using msconvert, part of the proteowizard software library [ ] x!tandem . . . [ ] and omssa [ ] algorithms were employed via the university of arizona high performance computing center to perform spectrum matching. precursor and fragment mass tolerance were set to . daltons for both omssa and x!tandem. trypsin cleavage rules were used for both algorithms with up to missed cleavages. amino acid modifications search consisted of single and double oxidation of methionine, oxidation of proline, n-terminal acetylation, carbamidomethylation of cysteine, deamidation of asparagine and glutamine and phosphorylation of serine, threonine, and tyrosine. x!tandem xml and omssa xml results were filtered using perl to remove any peptide matches with an evalue > . as well as proteins identified by a single peptide sequence. the protein fasta database for glycine max was downloaded on august , from ncbi refseq with the addition of the egf amino acid sequence. a randomized version of the glycine max fasta was concatenated to the original as a way to assess dataset quality. the mass spectrometry proteomics data have been deposited to the proteomexchange constortium (http://proteomecentral. proteomexchange.org) via the pride partner repository [ ] with the dataset identifier pxd and . /pxd . hela cells (obtained from american tissue culture collection) were cultured in minimum essential media (mem) complemented with % fetal bovine serum (fbs), units/ml penicillin, and μg/ml streptomycin. for western blotting assay, cells grown in -well plate were kept in serum free mem media for hours. cells were then either kept in serum free medium (control) or stimulated with soy milk alone, soy egf or commercial recombined human egf for different time period as indicated. cells were lysed by directly adding × sds sample buffer ( mm tris-hcl, ph . , % glycerol, % sds and % β-me) to the cells after washing times with x pbs. egf bio-activity was determined via egfr phosphorylation and downstream akt phosphorylation. total egfr was also measured since egfr is known to undergo internalization when stimulated with egf. antibodies used in western blot are anti-p-egfr (tyr ) (# , cell signaling technology), anti-total egfr (# - , millipore), anti-p-akt (# , cell signaling technology) and anti-lamin b (# , cell signaling technology) [ ] . for immunocytochemistry assay, cells were grown on coverslip in -well plate and kept in serum free media for hours before stimulation, cells were then either kept in serum free media (control) or stimulated with human or soy egf for hours. cells were washed with pbs and fixed with % formalin. egfr was labeled using anti-egfr antibody (# , cell signaling technology) and detected with alexa fluor goat anti-rabbit igg (#a , life technology). the cell nucleus were shown using mounting medium with dapi (#h- , vectorshield). to produce hegf in soybean a strong soybean seed-specific promoter and terminator was used to regulate gene expression of a synthetic soybean codon optimized hegf (shegf) gene that included an n-terminal nucleotide er-signal sequence (fig a) . in the engineering strategy for the hegf expression in soybean, the components of the prepro portions of hegf were eliminated in preference to produce only the final recombinant hegf product. to facilitate the co-translational transfer of the egf into the er lumen for disulfide bond formation a plant signal sequence was added so that the hegf synthesized would be as a pre-hegf. the gly::shegf construct was used for biolistic transformation of soybean somatic embryo cells as outlined in [ ] [ ] [ ] [ ] . embryos were selected in liquid culture by hygromycin b and individual regenerated lines were separated, propagated, and induced to form cotyledonary embryos. the cotyledonary embryos were evaluated for hegf production using egf-specific elisa that indicated a variation of heterologous protein production (data not shown). the most promising egf expressing lines were moved forward for regeneration by desiccating and subsequent germination. the initial t generation egf transgenic plants were grown in the greenhouse and further selected by genomic pcr for an additional - generations. additionally, each generation of seeds produced by the selected lines were assayed for hegf content by elisa. the hegf content of each line in seeds representative of the homozygous population is shown in fig b. the lines varied in hegf content but seeds within each line had a narrow range of hegf accumulation. the egf transgenic line produced in excess of μg hegf per gm dry seed weight, a level calculated to be much in excess of potential therapeutic requirements. by comparison, yeast stains have been used as an expression system for both human egf [ ] and mouse egf [ ] with the highest levels produced being from a multicopy insert pichia pastoris clone secreting μg egf/ml. in both the mouse and human egf yeast production systems, truncated versions of the egf were detected. the hegf soybeans and nontransgenic soybeans were evaluated to determine the biochemical authenticity of the soybean-produced egf protein. using d sds/page and parallel immunoblots probed with anti-egf, the soluble low molecular weight (< kda) seed proteins and the mr of the soybean-produced hegf was evaluated. the total protein polypeptide of the hegf expressing lines appeared to be identical to the standard parental control (fig ) . immunoblots of the d sds/page probed with anti-egf showed a lack of an immunoreactive band in the nontransgenic soybean seed control and recognized a kda mr band in the hegf expressing lines and . the soybean-produced hegf has the same apparent mr as authentic recombinant hegf fractioned in an adjacent lane (fig ) . to further assess the soybean-synthesized hegf the seed lysates were enriched in low mr total proteins and concentrated. the crude low mr proteins were reduced, alkylated, and cleaved with trypsin prior to analysis by mass spectrometry. the resulting data was queried with the hegf sequence and exact matches for peptides encompassing the majority of the sequence of the complete mature hegf protein were obtained (fig ) . together the data shows that transgenic soybeans successfully produced and accumulated hegf that is the correct mr, is immunoreactive with antibodies directed at authentic egf in both elisa and immunoblot assay, and that a majority mass spectrometry of fragments of the soybean-produced hegf match the human egf sequence. the delivery of any biopharma product in the context of compositionally complex food presents the potential that the components of the food may act to modulate bioactivity. plantsource foods in particular pose problems because plant tissues often possess a wide range of intrinsic biologically active components including proteins and natural products. the natural products of food could mask or enhance the effects of an expressed biopharma product. to evaluate the potential of egf activity in soymilk delivery commercial recombinant human egf (rhegf) was added as a supplement to soymilk and the intrinsic activity of the egf was tested with a hela cell assay. fig shows the effects of soymilk on the display of the egf receptor (egfr) on hela cells and the effect of commercial rhegf supplement to soymilk. soymilk does not modify the display of egfr on hela cells showing that soymilk alone is biologically inactive. the binding of egf to egfr results in the decrease of displayed egfr as it is internalized into the hela cells. hela cells treated with commercially available recombinant rhegfsupplemented soymilk display the same decrease in egfr as cells treated with rhegf in media without soymilk. parallel time-course experiments show that the effect of rhegf binding to efgr is rapid with a reduction of displayed efgr occuring within min of treatment and continuing out to at least min (data not shown). together these assays show that soymilk has no apparent negative bioactivity with respect to both the binding of commercial rhegf to the hela cell egfr or the viability of the hela cells over the course of the assay. to assess the bioactivity of soybean-produced hegf, samples were prepared from both shegf transgenic soybean lines and nontransgenic controls that were used to stimulate hela cells to induce egfr internalization, degradation and phosphorylation. in results shown in fig , soybean-produced hegf induces the internalization, degradation and phosphorylation of egfr that is indistinguishable from the bioactivity of commercial rhegf delivered in control samples. in contrast, samples prepared from control nontransgenic soybeans exhibited no apparent bioactivity showing the degradation and phosphorylation of egfr is the result of egf binding of either commercial rhegf added to the media or from the hegf produced by the transgenic soybeans. together these results show that nontransgenic soybean seeds have no intrinsic egf-mimic activity able to induce egfr degradation or phosphorylation, while soybeans producing hegf have identical activity in comparison to commercial rhegf. soybean produced egf displayed comparable bioactivity to commercially available egf. panel a. soybean produced hegf induces a rapid phosphorylation of hela cell egfr. serum free media (sf) and sf media with soymilk alone does not induce egfr phosphorylation and degradation. soymilk from seeds producing shegf added at different concentrations ( . , . , . μg/ml) induced concentrationdependent egfr degradation comparable to the effect of rhegf. serum free media and serum free media with nontransgenic soybean soymilk (negative controls) showed no effect on inducing pegfr. in contrast soymilk from shegf soybeans given at different concentrations ( . , . , . μg/ml) induced pegfr comparable to control rhegf. pakt indicates the functional activation of egfr. lamin b was used as a loading control. panel b. exogenous commercial rhegf and shegf induces an internalization and degradation of egfr in hela cells shown as a decrease in abundance assayed by immunoblot. the results shown demonstrate that soymilk alone has no intrinsic bioactivity with respect to egfr abundance. the rhegf is not degraded in soymilk over hours having the same bioactivity as control recombinant rhegf.-ctrl-sf media alone. soy egf and rhegf are at . μg/ml. lamin b was used as a loading control. panel c. shown is an immunohistochemical assay of hela cells showing that shegf induces internalization of the egfr comparable to that from control rhegf. in c, the cells were first treated with soy/egf or human egf for hours, fixed and then immunostained with egfr antibody overnight. egfr shows red staining while nucleus was stained by dapi and shows blue staining. in developing a food-based delivery platform for biopharma it is important to address the question of whether there are significant collateral consequences in seed composition resulting from the genetic modification. ideally a consumption plant biotechnology platform, such as soymilk, should be fully equivalent to the standard type other than the intended modification. seeds in general, including soybeans, possess an inventory of bioactive proteins and small molecules that will affect the metabolism of consumers in both advantageous and disadvantageous manner. for soybeans some of the relevant molecules are allergens, anti-metabolite proteins, and small molecules especially isoflavones. to test for potential collateral composition in the hegf-producing soybeans, the shegf transgenic and nontransgenic control soybeans were analyzed by non-targeted proteomics and metabolomics. among the significant proteins identified include various well-documented allergens and anti-metabolite proteins. a comparison of standard soybeans with hegf-producing soybean lines showed that there was no significant difference (p = . ) between nontransgenic control and shegf transgenic soybeans aside from the targeted production of hegf for any other proteins of concern. this data is available in pride partner repository with the dataset identifier pxd and . /pxd . non-targeted small molecule metabolomics was used to conduct a parallel analysis of the nontransgenic and hegf soybeans. again there were insignificant differences between nontransgenic soybean seeds and the shegf transgenic seeds (fig ) with one notable exception. soybean highly regulates sulfur availability and its allocation into protein. from a nutritional perspective soybean is considered a somewhat sulfur deficient crop. there have been a number of biotechnology experiments to increase sulfur content be either modifying assimilation and biosynthesis pathways leading to methionine or over-expressing high-methionine proteins such as maize zeins. modifying sulfur by pathway or competition has an effect on sulfurresponsive proteins including the bowman-birk trypsin inhibitor (bbi) and beta chain of the storage protein conglycinin. egf is a high sulfur content protein that broadly mimics bbi as a small globular protein synthesized by the er and presumptively competing for sulfur amino acid charge trna. expressing hegf in soybean has an effect on metabolites involved in sulfur amino acid metabolism that is consistent with producing a protein of egf's composition. a complete dataset of all metabolite abundance of the standard and hegf-expressing lines is available as an on-line spreadsheet (s table) . among the assayed molecules of particular note is the soybean molecule genistein, an isoflavone that has been shown to affect the activity of tyrosine phosphatase in the signal cascade associated with egf signaling [ ] [ ] [ ] [ ] . genistein levels were determined to be the same in both the nontransgenic and hegf-expressing soybean lines. this too demonstrates that the expression of hegf in soybeans does not produce any incidental collateral consequences of concern for its potential therapeutic use. since the inception of plant biotechnology its potential use for biopharma applications has been assessed [ ] [ ] [ ] [ ] . several different plant organs proposed as production for food/feed delivery systems [ ] [ ] [ ] . for many vaccine applications fruit are a highly advantageous delivery system providing a broadly accepted platform for even the most recalcitrant consumer (for example, [ , ] ). although fruit are perhaps one of the best delivery systems from the perspective of point of delivery, fruit also has logistical issues with relatively short time that ripened fruit are palatable requiring a tightly coordinated effort to produce, distribute, and use biopharma product that could be challenging for deployment in mass quantities. an alternative fig . relative proportion of non-targeted metabolites detected in soybean seeds shown as amount in egf transgenic compared to nontransgenic (wt). complete list of non targeted metabolites quantitated in samples in s table. doi: . /journal.pone. .g transgenic soybean production of bioactive human epidermal growth factor (egf) is to develop a biopharma platform that is broadly acceptable for food and feed delivery but can be lightly processed to preserve bioactivity and can be massively scaled to maximize the distribution potential of the product. soybean is a potentially useful biopharma platform that could have broad application in both food and feed end uses [ , , , ] . soybean has been demonstrated as a platform to produce heterologous proteins at a standard that far exceeds the levels typically needed for biopharma [ ] . soybeans can be used to produce both soymilk and formula for potential delivery to human infants or children as well as for production animals such as swine and calves. soybean can also be used to produce protein concentrates for inclusion in industrial food and feed or more simply as protein aggregates as tofu. soybean production is efficient and economic that can be massively scaled if needed. recently developed technology makes it feasible to increase the amount of recombinant protein product by silencing and exchange with a storage protein(s) [ , , ] . as a platform, soybean is an industrial crop with vast majority of its total production being directed toward products including processed food, protein used as animal feed, and its oil for food, feed, fuel, and chemical feedstock uses. many of the goals of further enhancing and modifying soybeans are largely directed at improving its utilization for industry products rather than direct food use. as a biopharma platform to produce soymilk derived products soybean seeds can be stored for years anticipating future needs while retaining the potential to be rapidly processed into formula/milk or tofu using adaptations of traditional technology in use for over a millennium. soybeans like many other seeds produce an array of intrinsic small globular proteins with secondary disulfide bonds accumulated at relatively high levels (> % of total proteins). soybean in particular accumulates the bowman-birk trypsin inhibitor that is . kda with intra-chain disulfide bonds [ ] . this suggests that soybean seeds are optimized as a potential bioreactor to produce and store proteins like egf, a . kda protein with intra chain disulfide bonds paralleling intrinsic seed proteins. in a predecessor experiment a mutant inactive bbi was expressed in transgenic soybeans showing that alternate small proteins can be expressed in soybean [ ] . expression of a construct encoding shegf regulated by the soybean seed storage protein promoter results in the accumulation of hegf at > μg /gm of dry soybean seed, a level to be many fold over the estimated therapeutic requirements of μg/kg weight of treated individual [ ] . soybean-produced hegf appears to be completely comparable to authentic hegf in its mr, immunoreactivity with specific antibodies, correspondence of fragment sequence in mass spectrometry assay, and in bioactivity inducing the internalization, degradation and phosphorylation of efgr. together the results shown demonstrate that soybean seeds will produce hegf at proto-therapeutic levels and the derived hegf from these seeds are bioactive for egf activity in a model hela cell assay. the expression of hegf in soybean has little collateral impact on seed composition soybeans have been used as an expression platform for a wide variety of heterologous proteins with investigative as well potential food/feed and biopharma goals [ , [ ] [ ] [ ] [ ] [ ] . prior biopharma expressions have included prototype expression of vaccine models [ ] as well as proinsulin and fibroblast and human growth factor [ , ] . in this study potential collateral changes in prototype product mature soybean seeds resulting from hegf expression was evaluated by non-targeted proteomics and metabolomics to assess both large and small molecules. these assessments showed that there was no significant difference in the seed proteome of the egf transgenics compared to nontransgenics. this is a pertinent result as soybeans are regulated in the us under falpa (the food and allergen labeling protection act) and unintended alterations of any of the known seed allergens or anti-nutritional proteins can be of concern. similarly the non-targeted metabolomics of the soybean seeds showed a significant lack of alteration of the small molecule profile in response to hegf accumulation. among the molecules assessed the lack of change in genistein is among the most significant as this isoflavonoid has been shown to have activity with tyrosine phosphatase that is in the signal cascade of animal and human cells that includes egf/egfr signaling [ , , ] . in the hela cell assessments there was no synergistic effect of standard soybean milk and authentic egf on egfr activity indicating that the identical genistein concentration in the standard and hegf expressing soybeans is below the threshold of effect in the assays conducted. the one significant alteration in the small molecule profile was in methionine-related metabolism. egf is a sulfur rich protein containing three disulfide bonds that has some general resemblance to the soybean bowman-birk inhibitor. soybean is a relatively sulfur deficient crop and much effort has been made to increase its sulfur amino acid content by either the co-expression of sulfurrich proteins such as zeins [ ] or by increasing the sulfur flux by altering the sulfur amino acid pathways [ ] [ ] [ ] . these studies have shown that within limits the increase of a sulfur sink such as expressing a high-sulfur content protein will collaterally induce modest increases in sulfur amino acid source. the results of increases in sulfur amino acid metabolites accompanying hegf expression in soybean is in accord with these prior experiments. together the results of the non-targeted proteome and metabolome assessments show that converting soybean into a prototype biopharma delivery platform of hegf does not result in any adverse alterations of the soybean seed's composition. soybeans could be used to produce biopharma products that might be delivered as milk or formula. as a test of this concept human epidermal growth factor (hegf) has been produced in soybeans to potentially address the devastating disease of neonatal necrotizing enterocolitis. this is a disease of premature infants of low birth weight. these infants have underdeveloped organs including the intestinal tract. the resulting gangrenous infection is treated by emergency surgery to remove dead portions of the intestinal tract that even under most optimistic situations has a high mortality rate and high cost of treatment. an alternative approach is to proactively treat infants at risk immediately post-partum to attempt to improve the integrity and maturity of the lining epithelial cells. the bioactivity results with model hela cells shows that hegf can be produced and accumulated in soybean seeds and as crude soy-milk lysate is capable of stimulating a response from the egf receptor (egfr) that occurs on epidermal surfaces such as the intestinal tract. soybean-produced hegf has potential other applications in cosmetics, burn and injury treatment, stimulating improved adaptation of the bowel to massive intestinal loss. supporting information s table. non-targeted metabolome set. births: final data for role of human milk in extremely low birth weight infants' risk of necrotizing enterocolitis or death necrotizing enterocolitis: the search for a unifying pathogenic theory leading to prevention neonatal necrotizing enterocolitis: a nine year experience: ii outcome assessment long-term survival and parenteral nutrition dependence in adult patients with the short bowel syndrome s rrna gene-based analysis of fecal microbiota from preterm infants with and without necrotizing enterocolitis fecal microbiota in premature infants prior to necrotizing entercolitis establishment of the gut microbiota in western infants bacterial colonization and gut development in preterm neonates epidermal growth factor: biology and mechanism of action early human milk feeding is associated with a lower risk of necrotizing enterocolitis in very low birth weight infants epidermal growth factor (egf) concentrations in amniotic fluid and maternal urine during pregnancy increased epidermal growth factor levels in human milk of mothers with extremely premature infants human epidermal growth factor: isolation and characterization and biological properties immunocytochemical localization of human epidermal growth factor/urogastrone in several human tissues transforming growth factor alpha and epidermal growth factor in protection and healing of gastric mucosal injury epidermal growth factor enhances intestinal adaptation after massive small bowel resection epithelial immaturity and multiorgan failure in mice lacking epidermal growth factor receptor intestinal overexpression of egf in transgenic mice enhances adaptation after small bowel resection selective inhibition of the epidermal growth factor receptor impairs intestinal adaptation after small bowel resection breast milk and neonatal necrotizing enterocolitis human milk for the premature infant epidermal growth factor reduces the development of necrotizing enterocolitis in a neonatal rat model intestinal barrier failure during experimental necrotizing enterocolitis: protective effect of egf treatment short bowel syndrome management of the short bowel syndrome in the pediatric population pediatric shortbowel syndrome: the cost of comprehensive care plant made pharmaceuticals: from edible vaccines to ebola therapeutics towards using biotechnology to modify soybean seeds as protein bioreactors. in, recent advancements in plant expression in crop plants production of escherichia coli heat labile toxin (lt) b subunit in soybean seed and analysis of its immunogenicity as an oral vaccine the collateral protein compensation mechanism can be exploited to enhance foreign protein accumulation in soybean seeds a rnai knockdown of soybean kda oleosin results in the formation of micro-oil bodies that aggregate to form large complexes of oil bodies and er containing caleosin transgenic soybean seeds accumulating β-carotene exhibit the collateral enhancements of high oleate and high protein content traits towards normalization of soybean somatic embryo maturation a rapid and sensitive method for the quantitation of microgram quantities of protein utilizing the principle of protein dye binding proteowizard: open source software for rapid proteomics tools development tandem: matching proteins with tandem mass spectra open mass spectrometry search algorithm retinoblastoma protein (prb), but not p or p , is required for maintenance of enterocyte quiescence and differentiation in small intestine the proteomics identifications (pride) database and associated tools: status in characterization of recombinant human epidermal growth factor produced in yeast production of mouse epidermal growth factor in yeast: high level secretion using pichia pastoris strains containing multiple gene copies genistein, a tyrosine kinase inhibitor, reduces egf-induced egf receptor internalization and degradation in human hepatoma hepg cells genistein alters growth factor signaling in transgenic prostate model (tramp) genistein enhances relaxation of the spontaneously hypertensive rat aorta by transactivation of epidermal growth factor receptor following binding to membrane estrogen receptors-α and activation of g protein-coupled, endothelial nitric acid synthase-dependent pathway genistein increases epidermal growth factor receptor signaling and promotes tumor progression in advanced human prostate cancer the production of recombinant pharmaceutical proteins in plants sowing the seeds of success: pharmaceutical proteins from plants recombinant pharmaceuticals from plants: the plant endomembrane system as bioreactor commercialization of biopharmaceutical and bioindustrial proteins from plants efficacy of a food plant-based oral cholera toxin β subunit vaccine oral immunization with hepatitis β surface antigen expressed in transgenic plants immunogenicity of recombinant lt-b delivered orally to humans in transgenic corn expression of hepatitis b surface antigen (hbsag) gene in transgenic banana (musa sp) severe acute respiratory syndrome (sars) s protein production in plants: development of recombinant vaccine expression and immunogenicity of an escherichia coli k fimbriae subunit antigen in soybean protein expression systems: why soybean seeds?" in soybean-molecular aspects of breeding cosuppression of the α-subunits of β-conglycinin in transgenic soybean seeds induces the formation of endoplasmic reticulum-derived protein bodies silencing of soybean seed storage proteins results in a rebalanced protein composition preserving seed protein content without major collateral changes in the metabolome and transcriptome reduction of protease inhibitor activity by expression of a mutant bowman-birk gene in soybean seed epidermal growth factor augments adaptation following small bowel resection: optimal dosage, route, and timing of administration expression of functional recombinant human growth hormone in transgenic soybean seeds processing and localization of bovine β-casein expressed in transgenic soybean seeds under control of a soybean lectin expression cassette embryo-specific silencing of a transporter reduces phytic acid content of maize and soybean seeds metabolically engineered oilseed crops with enhanced seed tocopherol an alternative to fish oils: metabolic engineering of oilseed crops to produce omega long chain polyunsaturated fatty acids correct targeting of proinsulin in protein storage vacuoles of transgenic soybean seeds high-level expression of basic fibroblast growth factor in transgenic soybean seeds and characterization of its biological activity expression of correctly processed human growth hormone in seeds of transgenic tobacco plants genistein analogues: effects on epidermal growth factor receptor tyrosine kinase and on stress-activated pathways increased sulfur amino acids in soybean plants overexpressing the maize kd zein protein enhanced levels of methionine and cysteine in transgenic alfalfa (medicago sativa l.) plants over-expressing the arabidopsis cystathionine γ-synthase gene transgenic soybean plants overexpressing o-acetylserine sulfhydrylase accumulate enhanced levels of cysteine and bowman-birk protease inhibitor in seeds effects of proteome rebalancing and sulfur nutrition on the accumulation of methionine rich δ-zein in transgenic soybeans front key: cord- -ntqyjqqk authors: huang, chih-wei; lin, hui-chen; chou, chi-yuan; kao, wei-chuo; chou, wei-yuan; lee, hwei-jen title: lys- at the interfaces of diagonal subunits of δ-crystallin plays a critical role in the reversibility of folding and subunit assembly date: - - journal: plos one doi: . /journal.pone. sha: doc_id: cord_uid: ntqyjqqk δ-crystallin is the major structural protein in avian eye lenses and is homologous to the urea cycle enzyme argininosuccinate lyase. this protein is structurally assembled as double dimers. lys- is the only residue which is arranged symmetrically at the diagonal subunit interfaces to interact with each other. this study found that wild-type protein had both dimers and monomers present in – m urea whilst only monomers of the k a mutant were observed under the same conditions, as judged by sedimentation velocity analysis. the assembly of monomeric k a mutant was reversible in contrast to wild-type protein. molecular dynamics simulations showed that the dissociation of primary dimers is prior to the diagonal dimers in wild-type protein. these results suggest the critical role of lys- in stabilization of the diagonal dimer structure. guanidinium hydrochloride (gdmcl) denatured wild-type or k a mutant protein did not fold into functional protein. however, the urea dissociated monomers of k a mutant protein in gdmcl were reversible folding through a multiple steps mechanism as measured by tryptophan and ans fluorescence. two partly unfolded intermediates were detected in the pathway. refolding of the intermediates resulted in a conformation with greater amounts of hydrophobic regions exposed which was prone to the formation of protein aggregates. the formation of aggregates was not prevented by the addition of α-crystallin. these results highlight that the conformational status of the monomers is critical for determining whether reversible oligomerization or aggregate formation occurs. introduction δ-crystallin is a taxon-specific eye lens protein. it is the major soluble protein in the eye lens of reptiles and birds and functions as a structural protein to maintain the refraction properties of the lens [ , ] . δ-crystallin and argininosuccinate lyase (asl) are homologous proteins. asl is in this study, the effects of this interaction on the folding pathway of wild-type and mutant proteins were investigated using urea as a denaturant. the different distributions of dissociated component from wild-type and mutant proteins, as measured by sedimentation velocity experiment, suggests the quaternary structure dissociates in different ways for wild-type and mutant proteins. structural simulation supports different dissociation processes for the two proteins. these results highlight the critical role of k in stabilizing the quaternary structure of δ-crystallin. the residue appears to control both the dissociation of dimers into monomers and the stability of the produced monomers. the monomers dissociated from the k a mutant protein with a stable and compact conformation provided a good model for studying the folding mechanism of the δ-crystallin. this study reveals the conformational status of the monomers, which determines whether functional protein or aggregates are formed. the recombinant wild-type and the k a mutant δ-crystallin or αa-crystallin plasmid were transformed and over-expressed in e. coli bl (de ) with induction by isopropyl-β-d-thiogalactopyranoside (iptg). proteins were purified as previously described [ , ] . the supernatants of crude cell extracts were loaded onto a q-sepharose anion exchange column (hiprep / q xl, ge healthcare) pre-equilibrated in buffer a ( mm tris-hcl buffer, ph . ) and eluted with a linear gradient of to . m nacl in buffer a. recombinant protein was eluted at approximately . m nacl. the eluted protein was pooled and treated with ammonium sulfate to . m. after filtration, the sample was loaded onto a hydrophobic interaction column (source™ phe) pre-equilibrated in buffer a containing % (v/v) glycerol and . m ammonium sulfate and eluted with a linear gradient to the same buffer lacking ammonium sulfate. the retained proteins were eluted at~ . m ammonium sulfate. fractions were pooled and loaded onto s- sephacryl column ( mm x cm) pre-equilibrated in mm tris-acetic acid buffer, ph . . fractions were analyzed by sds-page and protein concentrations determined by the method of bradford [ ] . proteins possessing a c-terminal his tag were purified on ni affinity column (chelating sepharose ff, ge healthcare) then desalted using a sephadex g- column ( mm x cm) as previously reported [ ] . equilibrium unfolding experiments were carried out by overnight incubation of δ-crystallin with various concentrations of urea or gdmcl in mm tris-acetic acid buffer, ph . at °c. the refolding experiments were undertaken by dilution of equilibrium-denatured δ-crystallin to a series of urea or gdmcl concentrations in the same buffer. the experiments for equilibrium unfolding of monomeric δ-crystallin were undertaken by overnight incubation of δ-crystallin in . m urea at °c followed by addition of various concentrations of gdmcl to the solution followed by incubation for hrs. the refolding experiments were carried out by dilution of the denatured δ-crystallin into a solution containing . m urea and . , or m gdmcl in mm tris-acetic acid buffer, ph . . to analyze the conformation and quaternary structure of the refolded δ-crystallin, the refolding experiments were undertaken by -fold dilution of the denatured monomeric δ-crystallin with buffer. the transition region in fig b for the reversible dissociation of k a mutant δ-crystallin in urea solution was analyzed using the following method. the unfolding process was described as a two-state transition for the conversion of the tetramer (t) into monomers (m). the thermodynamic parameters were obtained by global fitting of the tryptophan fluorescence signal to eq [ ] : where y is the observed signal from tryptophan fluorescence, y n and y u are the signals in the folded and unfolded states, and m f and m u are the slopes of the baselines preceding and following the transition region. Δg u is the free energy difference in the absence of urea and m is the variation in the free energy of unfolding with the urea concentration. reversible unfolding of monomeric k a mutant δ-crystallin were described as a four state process (m$i $i $u). the unfolding curve from tryptophan fluorescence was analyzed using a four-state unfolding model described as the transition from n to u with two intermediates, i and i , in the process. the thermodynamic parameters were calculated by fitting the data to eq [ , ] : where y i and y i are the signals in the i and i states. Δg , Δg and Δg are the free energy differences in the absence of gdmcl for n to i , i to i and i to u transition, respectively, and m , m and m are the variation in the free energy of unfolding with the gdmcl concentration. enzymatic activity assay δ-crystallin was assayed for asl activity by monitoring the absorption of fumarate at nm in a perkin-elmer lambda spectrophotometer. assays were performed at least in triplicate in mm tris-hcl buffer (ph . ) with mm sodium argininosuccinate as substrate. a molar absorption coefficient of . x m - cm - was used for all calculations [ ] . the fluorescence spectra were measured on a perkin-elmer ls- luminescence spectrophotometer at °c. intrinsic tryptophan fluorescence spectra of the protein were recorded with excitation wavelength set at nm and using nm band-width for both excitation and emission wavelength. all spectra were corrected for buffer or denaturant absorption. the average emission wavelength was utilized for data analysis [ ] . the ans ( -anilinonaphthalene- -sulfonic acid) (molecular probes; eugene, oregon) was used as probe to bind with proteins and the fluorescence was recorded from to nm the circular dichroism (cd) spectra were measured on a jasco j- spectropolarimeter at °c. experiments were performed in mm tris-acetic acid buffer (ph . ) with a mm path-length over a wavelength range from to nm. all spectra were averaged from three accumulations and were buffer corrected. the observed ellipticity (θ) (degrees) was converted into the mean residue ellipticity [θ] by the equation: [Θ] = Θ×m mrw / ×d×c [ ] , where m mrw is the mean residue weight, d is the light path (cm), and c is the concentration of protein in g/ml. the protein sedimentation were performed on a beckman-coulter (palo alto, ca) xl-a analytical ultracentrifuge (auc) with an an ti rotor at °c with g in standard double sectors aluminum centerpieces. the radial scans were recorded with a time interval of -min and a step size of . cm. all data were fitted to the continuous c(s) distribution model and a continuous size-distribution with respect to frictional ratio (f/f o ) model using the sedfit program [ , ] . the partial specific volume of the protein, solvent density and viscosity were calculated using the sednterp program [ ] . the solvent density and viscosity of varied urea concentrations were calculated and included in the fitting. electrophoresis was performed using a phastsystem (ge healthcare). the samples were subjected a phastgel - % gradient gel which contains the native buffer strip ( . m l-alanine, . m tris/hcl ph . ) attached to the surface of the gel and both electrodes. the electrophoresis was carried out at ma and °c for vh. after electrophoresis, the gel was fixed in % (w/v) trichloroacetic acid and stained with coomassie brilliant blue r . the turbidity of protein solution was measured using a perkinelmer lambda spectrophotometer equipped with a peltier temperature control accessory to monitor the light scattering at nm. all measurements were carried out at °c. spectra were corrected using measurements recorded for native protein in the absence of denaturants. the aggregation rate was calculated by fitting the data to the single or double exponential equation: where y t is the signal at time t, y is the signal of the final state, y i is the change in amplitude, and k i is the rate constant for aggregation. data for the linear increase in signals was fitted to a linear equation using sigmaplot . the assay was performed by setting the excitation wavelength at nm and measuring the emission spectrum from to nm. proteins unfolded in denaturant were diluted into mm tris-acetic acid buffer (ph . ) to give . mg/ml of protein in the presence of thioflavin t (tht) ( μm). the spectrum of tht alone was used as a correction for each assay. the crystal structure of goose δ-crystallin (pdb code: xwo) with all water molecule removed was subjected to the charmm force field and energy minimized with the smart minimization algorithm to satisfy (rms gradient~ . kcal/mol/Å). the implicit solvent model of generalized born was included in the calculation. the structural model was used as a template to build the k a mutant model using the build mutant protocol (accelrys discovery studio . , accelrys inc.). the best scoring model conformation was selected for energy minimization. molecular dynamics (md) simulations were performed using the standard dynamics cascade protocol. the structures of wild-type and mutant δ-crystallin in the charmm force field were subjected to initial energy minimization for steps by steepest descent followed by a conjugate gradient for steps. the minimized models were then heated from to k in ps md simulations followed by equilibration for ps at k in the absence of any structural restraint. finally, the equilibrated models were submitted to md simulations for ps at nvt (constant number of particles, volume, and temperature) using the berendsen coupling algorithm [ ] . wild-type and k a mutant δ-crystallin purified to near homogeneity were used for all analysis. equilibrium unfolding experiments were conducted by incubation of proteins in buffer supplemented with different gdmcl or urea concentrations overnight. tryptophan fluorescence was used to monitor the conformational changes during the unfolding process in the microenvironment around the tryptophan residues (fig a and b ). there are two tryptophan residues, w and w , in the structure of δ-crystallin. they are located in the solvent accessible domain and the helix bundle of domain , respectively ( fig a and b) . unfolding of the wild-type and k a mutant protein follows a multistep process in gdmcl solution and was not reversible after -fold dilution (fig a) . as shown in fig a, the dramatic changes in the signal for the first transition were due to subunit dissociation as reported previously, with the gdmcl concentrations for half transition ([d] / ) at ± . and . ± . m for wildtype and k a mutant protein, respectively [ ] . unfolding of the two proteins in urea solution followed a three-state process, with the [d] / values in the first transition at . ± . and . ± . m urea, respectively ( fig b) [ , ] . the differences in the denaturant concentration required for the half transition clearly show the potency of gdmcl when disrupting of the conformation of δ-crystallin [ ] . in the presence of urea, the k a mutant protein was dissociated and stayed in a stable conformation between and m urea. the conformation was more stable than for wild-type protein at urea concentrations exceeding m. when the urea was removed by -fold dilution into buffer, the denatured mutant protein was able to refold into a conformation similar to the original state (fig b) . in contrast, dilution of . or m urea denatured wild-type δ-crystallin or m urea denatured k a mutant protein did not result in the restoration of the properly folded conformation. these results suggest that the reversible assembly of the quaternary structure of δ-crystallin is correlated with the conformation of the dissociated monomers. the exposure of hydrophobic surfaces in the presence of urea was investigated using ans titration. a dramatic increase in fluorescence was observed at around m and m urea, for wild-type and the k a mutant, respectively, indicating that subunit dissociation had occurred due to the exposure of hydrophobic areas (figs c and ) . the result is consistent with the observed changes in the tryptophan micro-environment as probed by tryptophan fluorescence ( fig b) . the highest signals occurred around and m urea for the mutant and wildtype protein, respectively. dilution of the . m urea denatured k a mutant protein resulted in restoration of a native-like conformation. however, dilution of or m urea denatured wildtype or mutant protein, respectively, resulted in higher levels exposure of hydrophobic areas. since α-helices are the major secondary structure in δ-crystallin, the ellipticity at nm was used to analyze the structural changes induced by the presence of urea (fig b) . the results showed both proteins retaining relatively stable α-helical structure at concentrations of urea below m. there was about % and % loss of the structure at m and m urea, respectively. effect of urea on the size-and-shape changes of δ-crystallin variants the size and size-and-shape changes of wild-type and k a mutant protein in different urea concentrations were determined by sedimentation velocity measurements and using continuous c(s) distribution and c(s, f/f ) distribution analysis, respectively (figs and ) (s and s figs) [ ] . in the absence of urea, the two proteins appeared as one major component with sedimentation coefficients about . and . s, respectively (fig ) . this peak corresponds to tetrameric δ-crystallin [ ] . they possessed the native conformation as judged from the friction ratio (f/f ) distribution profile (with the centre region below . as shown by red in the contour) (fig a and b ). at . m urea two components were observed for wild-type protein with sedimentation coefficients about . and . s, and these are assumed to be the dissociated dimeric and monomeric forms. the s values of the two peaks decreased with increasing urea concentration. the proportion of the second (monomeric) peak increased from % to % to % in the presence of . , . to . m urea, respectively. dilution of the denatured wild-type protein at . m urea resulted in refolding into one major component with an s value of . (fig a) . measurement of asl activity showed that around % activity was recovered following refolding (table ) . subunit dissociation was observed at about . m urea for the k a mutant protein, with s values for the major peak of . and a shoulder at about . s (fig b) . a single peak with an s value of . was observed for the mutant protein at~ . m urea. this species is thought to be dissociated monomers possessing the native conformation as judged from the friction ratio (f/ f ) distribution profile (fig c) . the monomers were reassembled into a similar quaternary structure of wild-type protein after removing the urea (fig d) . when the urea concentration was increased to . m, the s values for the single component were gradually decreased to about . ( fig b) . dilution of the protein denatured with . m urea resulted in refolding into one major peak with a s value of . s. the refolded protein showed around % asl activity was recovered (table ) . since k a mutant protein was reversible dissociated into stable monomers at . m urea, the conformational reversibility of monomers was investigated. monomeric k a mutant protein that had been produced by equilibrium incubation of the native protein in . m urea was treated with various gdmcl concentrations. unfolding of monomeric k a mutant proteins followed a multistep pathway as measured by both tryptophan and ans fluorescence ( fig a and b ). stable intermediates were identified from the unfolding curve of ans fluorescence which included the highest ans fluorescence region at around m gdmcl and a steady state at~ m gdmcl (fig b) . the monomeric protein had lost about and % of its secondary structures at and m gdmcl, respectively. the changes in tryptophan fluorescence were dilution of monomeric k a mutant protein denatured in m gdmcl resulted in refolding to a similar conformation as the original monomeric state (fig a and b) . however, dilution of and m gdmcl denatured monomeric protein resulted in the increasing of the ans fluorescence, indicating higher exposure of hydrophobic area (fig b) . the results suggest the conformation of the partly unfolded intermediate could affect the folding reversibility of the monomeric k a δ-crystallin mutant. k a mutant δ-crystallin that denatured in m urea was reversible folded back to the original conformation after dilution (fig b) . signal changes in the tryptophan fluorescence with different urea concentration were used to calculate the thermodynamic parameters by directly fitted to the two-state mechanism (eq ) [ ] . the free energy difference in the absence urea (Δg ) for the transition was determined to be . ± . kcal/mol ( table ) . the changes of tryptophan fluorescence as a function of gdmcl concentration were used to calculate the thermodynamic parameters for the reversible unfolding of the monomeric k a δ-crystallin mutant (fig a) . the unfolding curve was best fitted into a four-state model [ , ] . the [gdmcl] / for the transitions from the m to i , i to i and i to u (denaturation) were about . ± . m, . ± . m and . ± . m, respectively. the thermodynamic parameters determined are summarized in table . the total free energy difference (Δg ) for folding of monomeric k a δ-crystallin mutant was determined to be . ± . kcal/mol. to determine the ability about refolding of denatured monomeric k a mutant followed by reassembly to a tetrameric protein, the monomeric proteins that denatured by gdmcl were diluted with buffer to remove both of the urea and gdmcl. the results showed that the quaternary structure of k a mutant protein was recovered from the denatured monomers instantly after dilution. the amount of the assembled protein was increased with the incubation time possessing about % of the activity recovered ( fig c and table ). in contrast, without denaturant treatment, respectively. samples from refolding of m gdmcl denatured wild-type and mutant protein for time period of zero or overnight are shown in lanes - and - , respectively. the protein concentrations used in the assays were . , . and . mg/ml for (a), (b) and (c), respectively. doi: . /journal.pone. .g is the concentration of denaturant at which the transition is half completed. the data was calculated by global fitting to eq . b the data were fitted to a -state unfolding model (eq ) described as the transition from m to u. two intermediates, i and i , were assumed in the process before denatured species (u). these data are the mean ± sd of at least three independent experiments. doi: . /journal.pone. .t dilution of m gdmcl denatured wild-type δ-crystallin, the quaternary structure was recovered after overnight incubation with no detectable activity ( table ). the similar result was also shown for m gdmcl denatured k a mutant protein. these results suggested the different pathway for protein folding that seems due to the distinct conformation of the denatured protein caused by different means of denaturation. since refolding of partly unfolded monomeric mutant δ-crystallin resulted in a conformation with high exposure of hydrophobic regions, the occurrence of protein aggregation in the process was determined using light scattering measurement. no protein aggregation was detected upon dilution of . , and m gdmcl denatured monomeric mutant protein into buffer containing . m urea. however, when . and m gdmcl denatured monomeric mutant protein was diluted into buffer, protein aggregation was occurred. the rates for aggregate formation were calculated to be ca. . and . min - , respectively (fig a) . when αa-crystallin, the chaperone protein, was added in a : ratio to . m gdmcl denatured monomeric mutant protein in folding buffer, no change in the rate of protein aggregation was observed. formation of aggregates by αa-crystallin alone did not occur under the same conditions. it is notable that upon dilution of m gdmcl denatured monomeric mutant protein into buffer, no aggregation occurred. the structural features of the protein aggregate were investigated using the thioflavin t assay [ ] . an increase in fluorescence intensity resulting from binding of tht with the aggregates over time was observed following dilution of . and m gdmcl denatured monomeric mutant δ-crystallin into buffer (fig b) . the results suggest the possible formation of ordered aggregates. no changes in the signal were observed during the incubation period upon refolding of m gdmcl denatured monomeric mutant protein. to determine the effect of the interactions provided by k at the diagonal subunits in disassembly of the quaternary structure, a md simulation were run for ps for wild-type and mutant δ-crystallin in the absence of any structural restraints. from the simulation trajectory, the dynamic motion for disassembly of the quaternary structure and conformational changes in the tertiary structure were elucidated. the distances between the c α of d and r , r and e and the two k or a residues were measured to evaluate the extent for subunit dissociation between the a-c, a-d and a-b dimeric pairs, respectively (fig a) . these residues interact with each other by hydrogen bonding or salt bridges at the dimeric pair interface in the native structure. these interactions are lost on replacement of k with a . the results showed that the distances between d and r , r and e and the two a residues increased linearly at a similar rate, except that the rate of change for the r -e interaction in wild-type protein was about half of that for the mutant protein. in contrast, no changes in the distance between the k residues were observed before ps. inspection of the timecourse at ps in wild-type protein showed that the primary dimers of subunit a and c or b and d showed were separated, while the diagonal dimers of subunit a and b or c and d were connected by the interactions of residue k (fig b) . however, the subunits for both of the primary and diagonal dimers were separated from each other in the mutant protein. the results suggest a different disassembly process for tetrameric wild-type and mutant proteins. the quaternary structure of δ-crystallin is assembled as two pairs of closely associated dimers. previous mutagenesis studies have used to investigate the interactions at the interfaces of double dimers to elucidate their role in the stabilization of the quaternary structure [ ] . the unique stable conformation from unfolding of k a mutant protein in the presence of urea suggests that the interactions provided by this residue at the interfaces may play a critical role in stabilization of the quaternary structure of δ-crystallin. lys- is the only residue which is arranged symmetrically at the diagonal subunit interfaces (fig b) . the ε-amino group in the side-chain of this residue forms hydrogen bonds with the carbonyl groups of m , v and k within the symmetric subunit (from pisa analysis: http://www.ebi.ac.uk/msd-srv/prpt_int/cgi-bin/piserver) (fig d) . substitution of this residue by alanine reduces the structural stability of the protein. the results from the previous study showed about a °c reduction in the thermal stability of the secondary structure and the changes in the micro-environment surrounding the tryptophan residues [ ] . this mutant protein was also more susceptible to chemical denaturation, since about half of the concentration of denaturant was required to disrupt its quaternary structure compared to wild-type protein. both of the wild-type and mutant protein showed similar and not reversible denaturation in the presence of gdmcl. however, differences in the denaturation pathway were observed when urea was used as the denaturant. the results suggest that the non-covalent interactions between the intra and inter-subunits might be disrupted by the ionic character of gdmcl, with subunit dissociation and denaturation occurring simultaneously for both proteins [ ] . the previous studies showing the presence of only a monomeric molten globule intermediate in the dissociation pathway of wild-type δ-crystallin in gdmcl supports this assumption [ , ] . thus, the role of k in the folding process of δ-crystallin cannot be distinguished under the strong denaturant. around ~ m urea, the k a mutant δ-crystallin is in a stable conformation, as judged by tryptophan fluorescence measurements (fig b) . subunit dissociation occurs under these conditions, resulting in the exposure of hydrophobic regions (figs c and b ). only monomers were identified in this state, as measured by sedimentation velocity analysis. the monomers that dissociated from the tetramers at~ . m urea possessed a nearly identical content of secondary structure as native protein and native-like conformation (figs b and c ). monomers in this conformation were able to refold and re-associate into tetramers with a similar conformation as the native protein, and possessed significant asl activity. when the urea concentration was increased to . m, the conformational changes led to further exposure of hydrophobic regions. however, following this conformational change at higher urea concentrations, protein folding was not reversible. the contrasting result found for wild-type protein was that the dissociation and denaturation were concurrent under the effect of urea (fig a) . in this condition, the conformation of the dissociated monomers was partly unfolded, as judged from the reduced level of α-helix (fig b) . the dissociated monomers seem to refold into alternative conformations then re-associate into tetramers with only part of the catalytic activity recovered (table ). these results indicated that the conformation of the monomers seems related to the assembly pattern for functional protein. it is interesting that the interactions provided by k at the interfaces seem to affect the disassembly pathway of the quaternary structure of wild-type protein. it was found that both of the dimers and monomers were dissociated from the wild-type protein at around to m urea, as measured by sedimentation velocity while only monomers were dissociated from the mutant protein (fig ) . these results suggest that the interactions provided by k at the interfaces seem to increase the energy barrier for dimeric dissociation. when these interactions were disrupted by mutation, the monomers could be isolated with intact conformation from the tetramers earlier than for the wild-type protein in urea. thus, the steps for dissociation and denaturation could be distinguished in the unfolding process of the k a mutant δ-crystallin. in this study, the dynamic motion of protein structure in the process was simulated for elucidation the dissociation mechanism of δ-crystallin. from the trajectory, the tetramers were found to disassemble at the early stage. due to the interactions by k , the interfaces between the diagonal dimers remain connected while distances increase between the subunits of the primary dimers in wild-type protein. in contrast, the subunits between both of the diagonal and primary dimers were dissociated in the k a mutant protein (fig ) . as δ-crystallin was assembled by two close contact dimers as transthyretin, dissociation at the two primary dimer interfaces would be expected to occur at the initial stage [ , ] . the simulation result provides a novel pattern for dissociation of the double dimeric protein consistent with the results of sedimentation velocity experiments. a possible explanation for this earlier dissociation of the subunit from the primary dimers compared to diagonal dimers is differences in solvent accessibility. unlike the location of the interfaces between the subunits of the primary dimer, the position of k is buried at the interior interfaces away from solvent. thus, the interactions of k at the interfaces of the protein seem to elevate the stability of the quaternary structure. for wild-type protein, two diagonal dimers were presumed to disassemble initially from the tetramers followed by subunit dissociation of the diagonal dimers. however, dissociation at the interfaces of two primary dimers would assume to be the first step in the unfolding process of the k a mutant protein (fig ) . the detail mechanism for folding of monomeric protein remains elusive due to the monomers dissociated from wild-type δ-crystallin were in a molten globule conformation. thus, the monomers that reversible dissociated from k a mutant δ-crystallin with a stable conformation and possessing similar level of secondary structure as the original state, and this would be a good model for studying the folding process. the monomeric protein was reversible denatured in a four-state mechanism in the presence of gdmcl and two intermediates were detected in the process (fig ) . refolding of the partly unfolded intermediate was not reversible which in turn resulted in a conformation with more exposure of hydrophobic regions. only denatured δ-crystallin was reversible folded into the monomers with a similar conformation to the original state. it is interesting that the refolded monomers were able to reassemble into tetramer instantly upon dilutions, with substantial recovery of activity (fig and table ). this contrasts with the slow refolding of gdmcl denatured wild-type protein into its tetrameric form with no detectable activity. the slow recovery of the quaternary structure for the latter protein is due to an energy barrier for the appropriate assembly of double dimers, as reported previously [ ] . the results suggest that the conformation of the denatured monomers which the tetrameric wild-type (t) and k a mutant (t*) δ-crystallin was dissociated through the diagonal dimer (d) and primary dimer (d*) to monomers with partial unfolded (m) and stable (m*) conformation, respectively. monomers of the wild-type or the mutant protein were then denatured through intermediate (i or i*) into respective unfolded form (u or u*). the monomers (m) of wild-type protein in partial unfolded conformation was associated in alternative pathway to form dimers (d ), and then assembled into tetramers (t ) or aggregates (a). the aggregation was prevented by αa-crystallin. refolding followed by assembly of the intermediates (i * and i *) of the mutant protein resulted in the aggregates (a and a ) formation and the chaperone function of αa-crystallin was invalid in this process. doi: . /journal.pone. .g was unfolded by stepwise dissociation or directly unfolded with m gdmcl could be different. the consequence of this might be that protein folding occurs via different pathways leading to the refolded monomers with different conformations to associate into native structure or alternative structures without function. protein aggregates are prone to form during the reassembly process from refolding of partly unfolded monomeric intermediates of δ-crystallin. the intermediate with the highest exposure of hydrophobic conformation is particularly prone to aggregate formation (fig ) . aggregate formation by monomeric intermediates with defined conformations was also reported for transthyretin under mildly acidic conditions [ ] . the result implies that the conformational status of the monomers influences subunit association. it is interesting that the presence of αcrystallin seems to increase the formation of aggregates from the monomeric intermediates of δ-crystallin with partly unfolded conformation, while α-crystallin alone was not affected under these conditions (fig ) . the studies for αa peptide which induces the aggregation of soluble α-crystallin suggested that the mechanism for aggregate formation might due to the changes in the hydrophobicity of α-crystallin induced by the peptide [ , ] . our previous study reported that aggregate formation during refolding of gdmcl denatured wild-type δ-crystallin was due to the improper assembling of double dimers and was prevented by the presence of α-crystallin [ ] . in this study, the aggregate formation was caused by assembly of the refolded monomeric intermediate which facilitated the aggregate formation of α-crystallin. it thus postulated that the electrostatic interaction with the substrate seems to be key factors to determine the chaperon-like or anti-chaperone activity of the α-crystallin [ , ] . the underlying mechanism requires further investigation. nonetheless, the result highlights the conformational status of the monomers which play a critical role in the folding pathway for reversible oligomerization or aggregate formation. in conclusion, the folding pathways of wild-type and mutant δ-crystallin are summarized as the working models shown in fig . this model depicts the key interactions from k at the interfaces of diagonal subunits not only to stabilize the quaternary structure of δ-crystallin but also to act as the energy barrier for dissociation of stable monomers. the stability might be one of the reasons for recruitment of the metabolic enzyme asl into the lens as a crystallin protein [ ] . the single polypeptide chain of δ-crystallin after translation would be assumed to fold into functional tetramers as the proposed refolding pathway for k a mutant. however, due to the interactions by k , the tetrameric protein would be assumed to dissemble in an alternative manner to form the diagonal dimers, followed by simultaneous subunit dissociation and denaturation. monomers in this status might associate into dimers via a different pathway which then assemble slowly into a non-native tetrameric form or self-associate into aggregates which can be prevented by the presence of α-crystallin. the reversible folding of the monomers that dissociated from the k a mutant protein with near native conformation provided the folding mechanism of the δ-crystallin. in this process, the ordered aggregate formation from re-association of the partly unfolded intermediate reveals a specific status of the protein to avoid the chaperone function of α-crystallin. this model proposes a possible mechanism about the aggregate formation for lens protein under stress effect and their interaction with α-crystallin. this study reveals the key role of monomers that dissociated from the oligomeric crystallin; their conformational status determines the levels of aggregate formation. , the panels show the raw sedimentation and theoretical fitted data (solid lines), and the fitting residual, respectively. (c) grayscale of residual bitmap. the raw sedimentation data were fitted to the continuous size distribution model using the sedfit program [ ] . (tif) lens crystallins: gene recruitment and evolutionary dynamism the recruitment of crystallins: new functions precede gene duplication lens crystallins. innovation associated with changes in gene regulation the structure of avian eye lens δ-crystallin reveals a new fold for a superfamily of oligomeric enzymes human argininosuccinate lyase: a structural basis for intragenic complementation evidence for neutral and selective processes in the recruitment of enzyme-crystallins in avian lenses structural comparison of the enzymatically active and inactive forms of δ crystallin and the role of histidine the effect of n-terminal truncation on double-dimer assembly of goose δ-crystallin structural studies of duck δ and δ crystallin suggest conformational changes occur during catalysis crystal structure of an inactive duck δii crystallin mutant with bound argininosuccinate substitution of residues at the double dimer interface affects the stability and oligomerization of goose δ-crystallin guanidine hydrochloride induced reversible dissociation and denaturation of duck δ -crystallin monomeric molten globule intermediate involved in the equilibrium unfolding of tetrameric duck δ -crystallin kinetic refolding barrier of guanidinium chloride denatured goose δ-crystallin leads to regular aggregate formation disruption of a salt bridge dramatically accelerates subunit exchange in duck δ crystallin guanidine hydrochloride-induced denaturation and refolding of transthyretin exhibits a marked hysteresis: equilibria with high kinetic barriers distinct interactions of αa-crystallin with homologous substrate proteins, δ-crystallin and argininosuccinate lyase, under thermal stress. biochimie a rapid and sensitive method for the quantitation of microgram quantities of protein utilizing the principle of protein-dye binding determination and analysis of urea and guanidine hydrochloride denaturation curves equilibrium unfolding of bombyx mori glycyl-trna synthetase metabolism of amino acids and amines resolution of the fluorescence equilibrium unfolding profile of trp aporepressor using single tryptophan mutants the application of circular dichroism to studies of protein folding and unfolding macromolecular size-and-shape distributions by sedimentation velocity analytical ultracentrifugation determination of the sedimentation coefficient distribution by least-squares boundary modeling analytical ultracentrifugation in biochemistry and polymer science molecular dynamics with coupling to an external bath d g transthyretin is monomeric, aggregation prone, and not detectable in plasma and cerebrospinal fluid: a prescription for central nervous system amyloidosis? reversible unfolding of the severe acute respiratory syndrome coronavirus main protease in guanidinium chloride mechanism of thioflavin t binding to amyloid fibrils protein denaturation with guanidine hydrochloride or urea provides a different estimate of stability depending on the contributions of electrostatic interactions kinetic stabilization of the native state by protein engineering: implications for inhibition of transthyretin amyloidogenesis the acid-mediated denaturation pathway of transthyretin yields a conformational intermediate that can self-assemble into amyloid mechanism of the chaperone-like and antichaperone activities of amyloid fibrils of peptides from αa-crystallin the αa - peptide interacts with soluble αcrystallin and induces its aggregation and precipitation: a contribution to age-related cataract formation amyloid-induced aggregation and precipitation of soluble proteins: an electrostatic contribution of the alzheimer's beta( - ) amyloid fibril gene sharing by δcrystallin and argininosuccinate lyase we thank dr m. d. lloyd (university of bath, u. k.) for reading of this manuscript before publication. conceived and designed the experiments: hjl wyc. performed the experiments: cwh hcl wck cyc. analyzed the data: cwh cyc. wrote the paper: cwh hjl. key: cord- -r dtiu authors: liu, bin; zhang, deyuan; xu, ruifeng; xu, jinghao; wang, xiaolong; chen, qingcai; dong, qiwen; chou, kuo-chen title: combining evolutionary information extracted from frequency profiles with sequence-based kernels for protein remote homology detection date: - - journal: bioinformatics doi: . /bioinformatics/btt sha: doc_id: cord_uid: r dtiu motivation: owing to its importance in both basic research (such as molecular evolution and protein attribute prediction) and practical application (such as timely modeling the d structures of proteins targeted for drug development), protein remote homology detection has attracted a great deal of interest. it is intriguing to note that the profile-based approach is promising and holds high potential in this regard. to further improve protein remote homology detection, a key step is how to find an optimal means to extract the evolutionary information into the profiles. results: here, we propose a novel approach, the so-called profile-based protein representation, to extract the evolutionary information via the frequency profiles. the latter can be calculated from the multiple sequence alignments generated by psi-blast. three top performing sequence-based kernels (svm-ngram, svm-pairwise and svm-la) were combined with the profile-based protein representation. various tests were conducted on a scop benchmark dataset that contains families and superfamilies. the results showed that the new approach is promising, and can obviously improve the performance of the three kernels. furthermore, our approach can also provide useful insights for studying the features of proteins in various families. it has not escaped our notice that the current approach can be easily combined with the existing sequence-based methods so as to improve their performance as well. availability and implementation: for users’ convenience, the source code of generating the profile-based proteins and the multiple kernel learning was also provided at http://bioinformatics.hitsz.edu.cn/main/∼binliu/remote/ contact: bliu@insun.hit.edu.cn or bliu@gordonlifescience.org supplementary information: supplementary data are available at bioinformatics online. by march , experimentally determined protein structures were deposited in the protein data bank (berman et al., ) . however, this number is only about one-sixth of , the number of protein sequences held in the uniprotkb/swiss-prot database (wu et al., ) . to timely use such vast amount of structure-unknown protein sequences for basic research and drug development, it is highly desired to determine their d structures and functions by means of homology approaches (chou, ) . unfortunately, protein remote homology detection is still a challenging problem in bioinformatics. the early methods in dealing with this problem were based on the pairwise sequence comparison approaches, such as blast (altschul et al., ) and smith-waterman local alignment algorithm (smith and waterman, ) . however, in many cases, this kind of sequence alignment method failed to detect remote homologies due to the low sequence similarities. later methods to challenge this problem were based on the generative models to induce a probability distribution over the protein family, and then to generate the unknown proteins as new members of the family from the stochastic model. for example, the hidden markov model (hmm) (karplus et al., ) can be trained iteratively in a semi-supervised manner by using both positively labeled and unlabeled samples of a particular family to generate the positive set (qian and goldstein, ) . recently, the discriminative methods, such as support vector machine (svm) (vapnik, ) , were used to address this problem by focusing on the differences between protein families. the key of the svm methods is the kernel function by which to compute the inner product between two samples in the feature space. the most straightforward approach to generate the kernels was based on the features extracted from protein sequences. svm-ngram (dong et al., ) , svm-pairwise (liao and noble, ) and svm-la (saigo et al., ) were three of the most successful sequencebased kernels. svm-ngram (dong et al., ) was based on the feature space that contains all short subsequence of length n. in svm-pairwise (liao and noble, ) , a protein sequence was represented as a vector of pairwise similarities to all protein *to whom correspondence should be addressed. sequences in the training set, and then inner product between these vector-space representations was taken as the kernel. svm-la (saigo et al., ) measured the similarity between a pair of proteins by taking all the optimal local alignment scores with gaps between all possible subsequences into account. besides these kernels, several other sequence-based kernels were also proposed, such as mismatch (leslie et al., ) and svm-balsa (webb-robertson et al., ) . the profile-based kernels could further improve the performance by using the evolutional information extracted from the profiles. for example, top-n-grams (liu et al., ) extracted the profile-based patterns by considering the most frequent elements in the profiles; profile kernel (kuang et al., ) extracted the short substrings according to the profile-based ungapped alignment scores; some profile-based methods improved the predictive performance by developing more sensitive profiles. hhsearch method (so¨ding, ) was based on a novel profile using the hmm. in compass (sadreyev et al., ) , numerical profiles were generated to construct optimal profile-profile alignments and to estimate the statistical significance of the corresponding alignment scores. in the meantime, some other features and techniques have been applied to this field to further improve the predictive performance. for instance, the kernel combination methodology (vbkc) (damoulas and girolami, ) used a single multiclass kernel machine to combine various kernels based on different feature spaces; svm-physicochemical distance transformation (pdt) (liu et al., ) combined the amino acid physicochemical properties and the profile features via pdt to incorporate the local sequence-order information of the entire protein sequences. also, based on the similarities between protein sequences and natural languages, the natural language processing techniques were applied to this field. it was shown that the performance of building-block-based methods could be improved by using the latent semantic analysis (lsa) (dong et al., ) . moreover, p rot e mbed (melvin et al., ) detected protein remote homology by embedding protein sequences into a low-dimensional semantic space. as we can see from the aforementioned introduction, most of the top-performing methods were developed based on the features extracted from profiles. this is consistent with the fact that a profile is much richer than an individual sequence in encoding information. also, biology is a natural science with historic dimension. all biological species have developed beginning from a limited number of ancestral species. it is true for protein sequence as well (chou, ) . their evolution involves changes of single residues, insertions and deletions of several residues, gene doubling and gene fusion (chou, ) . with these changes accumulated for a long period, many similarities between initial and resultant amino acid sequences are gradually eliminated, but the corresponding proteins may still share many common features, such as having basically the same biological function (loewenstein et al., ) , folding topology, subcellular location and other attributes (chou, ) . accordingly, the key to improve the performance of these methods is to find a suitable approach to extract the evolutionary information from the profiles. in view of this, the current study was initiated in an attempt to propose a profile-based protein representation by extracting the evolutionary information from the frequency profiles. as shown by a series of publications liu et al., ; xiao et al., ; xu et al., ) and summarized in a comprehensive review , to develop a useful statistical prediction method or model for a biological system, one needs to engage the following procedures: (i) construct or select a valid benchmark dataset to train and test the predictor; (ii) formulate the samples with an effective mathematical expression that can truly reflect their intrinsic correlation with the target to be predicted; (iii) introduce or develop a powerful algorithm (or engine) to operate the prediction; (iv) properly perform cross-validation tests to objectively evaluate the anticipated accuracy of the predictor; and (v) provide the downloadable source code or a web-server for the prediction method. below, let us describe how to deal these procedures. suppose s is a remote homology system investigated in the current study that contains protein sequences, which were taken from (liao and noble, ) at http://noble.gs.washington.edu/proj/svm-pairwise/. these proteins were selected from scop version . and extracted from the astral database (brenner et al., ) . none of the proteins has sequence pairwise similarity to any other with higher than À in the e-value [for more about the e-value and its implication in reducing homology bias and redundancy, see (brenner et al., ) ]. these proteins were also used by others (dong et al., ; lingner and meinicke, ; saigo et al., ) to study remote homology detection. the proteins in s can be classified into superfamilies and families; i.e. where s f i ði ¼ , , . . . , Þ is the ith superfamily, s f k ðk ¼ , , . . . , Þ is the kth family and the symbol [ represents the 'union' in the set theory. for readers' convenience, the codes of the proteins and their sequence as well as the attributes of their families and superfamilies are given in supplementary material s . because some families and superfamilies in s do not contain significant number of protein sequences, and also because the negative dataset for each protein family can be any proteins except those belonging to its own superfamily, it is not so straightforward but a little more complicated and subtle for how to select protein samples from s to define the training and testing datasets. to provide a clear description, let us consider a different manner to address this. as demonstrated by many previous studies on a series of important biological topics, such as enzymecatalyzed reactions (zhou and deng, ) , inhibition of human immunodeficiency virus- reverse transcriptase (althaus et al., ) , drug metabolism systems (chou, ) and applying wenxiang diagram or graph to study protein-protein interactions (zhou, ; zhou and huang, ) , using graphical approaches to study complicated problems can provide an intuitive picture and useful insights for in-depth studying and analyzing various complicated relations in these systems (lin and lapointe, ) . in view of this, let us also use graphic approach to describe the feature and relation of the families and superfamilies in s, as shown in figure , where the open circles denote the families or superfamilies that have significant number of protein sequences and the gray circles denote those that do not. of the families in s (cf. equation ), contain significant number of proteins (see the third row of fig. ) and form a target family set s f target ; i.e. of the superfamilies in s, contain at least one target family (see the open circles in the second row of fig. ) and form a target superfamily set s f target ; i.e. thus, we have meaning that s f target is the subset of s f target , and s f target is the subset of s; each of the three contains , and proteins, respectively. now, for each of the families in the target family set s f target , we can define a training dataset and testing dataset given by where the positive training dataset s þ train ðkÞ contains at least of its superfamily members, none of which belongs to the kth family, and the positive testing dataset s þ test ðkÞ contains at least five protein domains within the family. the proteins in the negative training and testing datasets, s À train ðkÞ and s À test ðkÞ, were picked from s by excluding the superfamily of the kth family and randomly split between the two in the same ratio as the positive ones. the training and testing datasets thus obtained are given in the supplementary materials s and s , respectively. the frequency profile m for protein p with l amino acids can be represented by where is the total number of standard amino acids; m i,j ( m i,j ) is the target frequency, which reflects the probability of amino acid i (i ¼ , , . . . , ) occurring at the sequence position j (j ¼ , , . . . ,l) in protein p during evolutionary processes. for each column in m, the elements add up to . the target frequency is calculated from the multiple sequence alignments generated by running psi-blast (altschul et al., ) against the ncbi's nr with default parameters except that the number of iterations was not set at but was set at in the current study. the target frequency of amino acid i in sequence position j is calculated as: where f ij is the observed frequency of amino acid i in column j; is a free parameter set to a constant value of , which is initially used by psi-blast; is the number of different amino acids in column j À ; and g ij is the pseudo-count for amino acid i in protein sequence position j, which can be calculated as: where p k is the background frequency of amino acid k, and q ik is the score of amino acid i being aligned to amino acid k in blosum substitution matrix, which is the default score matrix of psi-blast (altschul et al., ) . although the methods by using amino acid sequence information achieve certain degree of success, only using sequence information cannot accurately detect protein remote homology. recent studies demonstrated that the profile-based methods would show better performance because a profile is richer than an individual sequence as far as the encoding information is concerned. however, a profile is a d matrix, whereas a protein sequence is an amino acid string. therefore, the d evolutionary profile information cannot be directly incorporated into the sequence-based methods for prediction. to deal with this problem, we propose an approach to convert the frequency profiles into a series of profile-based proteins. thus, the existing sequence-based methods can be directly performed on these proteins for the prediction. the target frequencies in the frequency profiles reflect the probabilities of the corresponding amino acids appearing in the specific sequence positions. the higher the frequency is, the more likely the corresponding amino acid occurs. it is reasonable to use the nth most frequent amino acids in the frequency profiles to represent the protein sequences. below is the description on how to convert frequency profiles into profile-based proteins. given the frequency profile m for protein p (equation ), we can sort the amino acids in each column according to a descending order. the frequency profile thus obtained by the sorting operation is called the sorted frequency profile and denoted by m . for each row in m , the amino acids are combined to produce the profile-based protein. by following this approach, the frequency profile m is converted into profile-based proteins p , p , . . . , p ( supplementary fig. s in supplementary material s ), which contain the evolutionary information in the frequency profile. these proteins have different importance. during evolutionary process, protein p is preferred to transform into p , but not preferred to transform into p . for reader's convenience, the source code for generating the profile-based proteins is accessible by clicking the link at http://bioinformatics.hitsz.edu.cn/main/ *binliu/ remote/. three state-of-the-art sequence-based kernels [svm-ngram (dong et al., ) , svm-pairwise (liao and noble, ) and svm-la (saigo et al., ) ] were used to validate whether the proposed approach could improve their performance. in svm-ngram (dong et al., ) method, the ngrams were the set of all possible subsequences of amino acids of a fixed length. a protein sequence was mapped to a feature vector by the occurrence frequency of each ngram. the value of n was set at as suggested by the authors (dong et al., ) , and therefore the dimension of the vector is . at the heart of the svm is a kernel function that acts as a similarity score between pairs of vectors. the kernel was normalized so that each vector had length in the feature space: where x and y are two proteins in the dataset. this normalized step was also used by svm-pairwise (liao and noble, ) and svm-la (saigo et al., ) . the normalized kernel thus obtained was then transformed into a radial basis kernel. in the svm-pairwise (liao and noble, ) method, the feature vector was a list of pairwise sequence similarity scores, computed with respect to all of the sequences in the training set. the radial basis function was used as the kernel. the rest steps were the same as the ones used in svm-ngram (dong et al., ) . in the svm-la (saigo et al., ) , the kernel was calculated by summing up scores obtained from the local alignments with gaps between the two sequences, computed by smith-waterman dynamic programming algorithm. such kernel might not be a positive definite kernel and the authors (saigo et al., ) provided two solutions for this problem. owing to its performance and simplicity, we implemented one of the two methods, namely, the la-ekm kernel. the parameters of la-ekm kernel took the optimal values ( ¼ . , d ¼ À , e ¼ À ). the kernel described in the previous section can be used by kernel methods to train the svm classifier. each kernel contains different discriminative information, and therefore combining the kernels automatically is a promising way to improve the performance. in machine learning field, this approach is called multiple kernel learning (mkl) (cortes et al., ; varma and babu, ) , which has attracted a lot of attention recently. the mkl technique aimed to combine different kernels to improve the performance, and showed the state-of-the-art results on image classification field (varma and babu, ) . in this article, we focused on the weighted linear combination of kernels. the weight of each kernel can be optimized based on different criterion, which can be categorized by two groups. one group is the one-stage kernel learning methods, which optimize the weight and the svm objective function simultaneously (varma and babu, ). these methods suffer from the high training complexity. the other group is two-stage kernel learning methods, which optimize the weight by using a criterion first and then train the svm classifier using the kernel combined by the learned weight of each kernel. compared with one-stage learning methods, the two-stage kernel learning methods showed better performance with reduced training cost. therefore, in this study, we adopted the two-stage kernel learning method. specifically, the kernel target alignment (kta) objective function was used to optimize the weight of each kernel, which showed theoretical guarantees and can improve the performance in practice (cortes et al., ; varma and babu, ) . given m training samples x , x , . . . , x m and their corresponding labels y , y , . . . , y m , the ideal kernel matrix can be formulated as k y ¼ y t y, where y is the vector of labels [y , y , . . . , y m ]. for the given n kernels k , k , . . . k n , the aim is to learn the weight of each kernel. to avoid the kernel scaling problem, we center kernel k k and the corresponding ideal kernel k y in feature space by the following equation: where k ck is normalized by: following the above steps, each kernel is normalized, and then these n kernels are linearly combined by the following equation: where w k ( w k , p n k¼ w k ¼ ) is the weight of kernel k k . the weight is learned by kta objective function, which maximizes the alignment between k comb and the centered ideal kernel k cy . this leads to a quadratic program problem and can be solved quite efficiently. for implementation details, please refer to (cortes et al., ) . in this study, three kernels (k p , k p and k p ) for each selected sequence-based method were linearly combined by using the above kta approach to further improve the performance. for reader's convenience, the source code of the mkl is accessible by clicking the link at http://bioinformatics.hitsz.edu.cn/main/ *binliu/remote/. svm is a class of supervised learning algorithms first introduced by vapnik ( ) . svm-based machine learning algorithm has been successfully used to investigate various problems in molecular biology, such as identifying dna recombination spots , membrane protein types (cai et al., ) and heat-shock protein functions (feng et al., ) , among many others. in this study, the publicly available gist svm package (http://www.chibi.ubc.ca/gist/) was used. because the test sets have many more negative than positive samples, simply measuring error-rates will not give a good evaluation of performance. for the cases in which the positive and negative samples are not evenly distributed, the best way to evaluate the trade-off between the specificity and sensitivity is to use a receiver operating characteristic (roc) score (gribskov and robinson, ) . an roc score is the normalized area under a curve that plots true positives against false positives for different classification thresholds. a score of means perfect separation of positive samples from negative ones, whereas a score of means that none of the sequences selected by the algorithm is positive. another performance measure is roc score, which is the area under the roc curve up to the first false positives. the frequency profile of a protein p can be converted into profile-based proteins (p , p , . . . , p ) by using the proposed approach (see section for details). these proteins have different importance. p is the most important protein, as it is the combination of the top frequent amino acids in frequency profile, whereas p is the profile-based protein to which protein p is the most unlikely to convert because it is the combination of the amino acids with lowest frequencies in frequency profile. if all the profile-based proteins are used in the prediction, the computational cost is relatively high. in this study, only the top n most important profile-based proteins (p , . . . , pn) were used in the prediction. to select the value of n, the following experiment was conducted. the frequencies of standard amino acids in each column of a frequency profiles add up to . therefore, the average frequency is . ( / ¼ . ). if an amino acid with frequency . , it is likely to occur during evolutionary process; otherwise, it is not likely to occur. the percentage of the amino acids with frequencies . in each profile-based protein on the scop benchmark was calculated, and the results are shown in figure . as we can see from the figure, such amino acids are abundant in profile-based proteins p , p and p ( . %, . % and . %, respectively), but for the other profile-based proteins, the percentage decreases significantly (from . to %). therefore, in this study, only the top three profile-based proteins were used in the prediction. these profile-based proteins were combined with three state-ofthe-art methods based on sequence composition, including svm-ngram (dong et al., ) , svm-pairwise (liao and noble, ) and svm-la (saigo et al., ) , and the results are shown in supplementary table s of supplementary material s . for each of the three methods, the best performance was achieved for the top important protein p . compared with the methods performed on the raw protein sequence p, the performance of the proposed methods can be improved by . $ . % and . $ . % in terms of average roc and roc scores, respectively, indicating that the proposed profile-based protein representation is useful for protein remote homology detection. the performance of the methods performed on p is similar as that of the methods performed on the raw protein p. the predictive results of the methods performed on p were the lowest. these results are consistent with the different importance of the three profile-based proteins p , p and p . besides the current method, there are some other methods for predicting protein remote homologies based on profiles, such as svm-top-n-gram-combine-lsa (liu et al., ) , svm-pdt-profile (liu et al., ) , profile (kuang et al., ) , biosvm- l (muda et al., ) and hhsearch (so¨ding, ) . svm-top-n-gram-combine-lsa (liu et al., ) extracted the building blocks of proteins from the frequency profiles, which could be treated as the 'words' of protein language. the lsa (dong et al., ) was applied to further improve the performance of this method. svm-pdt-profile (liu et al., ) combined the amino acid physicochemical properties in the amino acid index (aaindex) (kawashima et al., ) with the frequency profiles for the prediction. the feature vector of profile method (kuang et al., ) was constructed by the short subsequences whose pssm-based ungapped alignment score was above a predefined threshold. biosvm- l constructed two-layer svm classifiers with profile-based kernels (muda et al., ) . all the above three methods were based on svm, and the difference among them was in the extracted features. hhsearch (so¨ding, ) was one of the best protein remote homology detection methods, which used a novel profile-based hmm. the results obtained by these four methods on the scop benchmark are listed in supplementary table s of supplementary material s , from which we can see that the current method outperforms svm-top-n-gram-combine-lsa (liu et al., ) , svm-pdt-profile (liu et al., ) and biosvm- l (muda et al., ) and is highly comparable with profile (kuang et al., ) and hhsearch (so¨ding, ) , indicating that the profile-based protein representation is a promising approach to extract the evolutionary information from frequency profiles for protein remote homology detection. as mentioned above, the approaches based on the top two profile-based proteins p , p and the raw protein p are among the top performing methods. it is interesting to investigate whether these methods can be combined to further improve the performance. in this study, the mkl framework was used to combine these methods. the kta method was used to automatically optimize the weight of each kernel on the training set, and then these kernels are combined with weights into a single kernel for the svm-based prediction. the results are shown in supplementary table s of supplementary material s as well as supplementary materials s -s . the mkl approach can improve the performance of svm-ngram (dong et al., ) , but only has minor impact on the svm-pairwise (liao and noble, ) and svm-la (saigo et al., ) . to uncover the reason, the weight of each kernel was analyzed. for each kernel, the average weight on all the protein families is shown in supplementary table s of supplementary material s . for these three methods, the p -based kernel was weighted most heavily. for svm-pairwise (liao and noble, ) and svm-la (saigo et al., ) , the weight values of their corresponding p and p kernels are . , indicating these kernels only have minor impact on the final results, and hence the performance improvement is modest. vbkc (damoulas and girolami, ) is another method based on the mkl, which combined four string kernels: svm-pairwise (liao and noble, ) , svm-la (saigo et al., ) , svm-mm (leslie et al., ) and svm-mono (lingner and meinicke, ) . our proposed svm-pairwise-kta and svm-la-kta outperform vbkc (damoulas and girolami, ) by . $ . % and . $ . % according to the average roc and roc scores, respectively. the obvious performance improvement is mainly due to the proposed profile-based protein representation and mkl approach. the svm-ngram (dong et al., ) method is based on the explicit feature space representation, which provides the possibility to measure the correlations between ngrams and protein families. the sequence-specific weight learnt from the svm training process can be used to calculate the discriminant weight for each ngram, which indicates the importance of the corresponding ngram. by following lingner and meinicke's approach (lingner and meinicke, ) , given the weight vector of a set of m sequences obtained from the kernel-based training process ¼ [ , , , . . . , m ], the discriminant weight vector w in the feature space can be calculated by the following equation: where f is the matrix of sequence representatives. the magnitude of the element in w represents the discriminative power of the corresponding feature. in most protein families, kernel p is weighted more heavily than kernel p and kernel p . two such protein families (scop id: . . . and . . . ) were selected from the scop benchmark for further study, and the results are shown in supplementary tables s and s of supplementary material s , respectively. for each kernel, the top most discriminative ngram features calculated by equation are shown in the tables too. for protein family . . . , kernel p and kernel p share some common most discriminative ngrams, such as 'mtm', 'yty', 'mtf' and 'wwf', indicating these ngrams remain stable during evolutionary process and therefore these ngrams would be the important sequence patterns for maintaining the structure and function of this protein family (supplementary table s of supplementary material s ). however, there are a few common most discriminative ngrams between kernel p and kernel p in protein family . . . . the top most discriminative ngrams of kernel p are all different from those in kernel p (supplementary table s of supplementary material s ). these ngrams would contribute to the higher discriminative power of kernel p for this protein family. although in most cases, kernel p was weighted most heavily, some exceptions were observed. for example, for protein family . . . , kernel p is the most discriminative kernel with weight value of nearly , while the other two kernels only have little contribution to the mkl (supplementary table s of supplementary material s ). the top most discriminative ngrams for each kernel were investigated, and the results are shown in supplementary table s of supplementary material s , from which some interesting patterns can be observed. the ngrams containing amino acids 'n' and 'f' tended to show strong discriminative power in both kernel p and kernel p , whereas amino acid 'a' was abundant in the top discriminative ngrams in kernel p , indicating the ngrams with amino acid 'a' could better describe the prosperities of protein family . . . in the evolutionary process. . application of the proposed remote homology detection methods for studying the d structure of nck a in addition to provide useful insights for evolution study, protein remote homology detection is useful for drug development as well. as is well known, many drug-targeted proteins are still without x-ray or nuclear magnetic resonance structure. pharmaceutical scientists have to resort to the homology modeling technique or structural bioinformatics tools (chou, ) to timely develop their d structures, so as to be able to conduct molecular docking study (chou et al., ; wang et al., ) , one of the key steps in structure-based drug design. however, a reliable template, or a structure-known protein homologous to the target protein, is the necessary prerequisite in this regard (chou, ) . unfortunately, many target proteins did not have significant sequence similarity with any structure-known proteins, and hence it was hard to find a proper template to develop their d structures. actually, many of them did have structure-known homologous proteins, but the problem was how to detect them. for example, the sequence similarity between nck a and cyclina was % (chou et al., ) and hence their homologous relationship could not be detected by the simple sequence alignment technique (mohabatkar, ) . now let us see what will happen if the current remote homology detection technique is applied. to realize this, a dataset was constructed based on scop, from which proteins were selected as the positive samples in the cyclin family (scop id: a. . . ), while negative samples were selected from the scop version . by excluding all the proteins within the cyclin-like superfamily. none of these proteins shares % sequence similarity. trained with such positive proteins and negative proteins, the proposed best performing method svm-la (p ) was used to predict nck a. it was found that nck a is homologous to cyclina, fully consistent with the experimental results obtained by the site-directed mutagenesis studies (tang et al., ) . actually, chou et al. ( ) did use cyclina as a template to construct the d structure of activation domain of nck a, one of the important parts of tau protein kinase ii, an important therapeutic target against alzheimer's disease. furthermore, based on the computed structure thus obtained, the molecular truncation experiments (zhang et al., ) were conducted with an outcome that confirmed and validated the structure computed by using such a remote homologous protein as a template. therefore, it is anticipated that the proposed method for detecting remote homology proteins will certainly enhance the power of homology modeling, and hence have impacts on drug development as well. discriminative methods based on svm are the most effective and accurate methods for protein remote homology detection. the performance of the svm-based methods depends on the kernel function, which measures the similarity between the samples in any pair. varieties of kernels based on sequence composition have been proposed. however, these methods often fail to accurately predict the proteins sharing low sequence similarity. recently, methods using the evolutionary information extracted from profiles achieved great success, such as profile (kuang et al., ) , sw-pssm (rangwala and karypis, ) , svm-top-ngram (liu et al., ) and svm-acc (liu et al., ) . a key step to improve the performance of these methods is in how to find a suitable approach to incorporate the evolutionary information extracted from the profiles for prediction. in this article, we proposed a method that can convert the frequency profile into a series of profile-based proteins. three state-of-the-art sequence-based kernels, i.e. svm-ngram (dong et al., ) , svm-pairwise (liao and noble, ) and svm-la (saigo et al., ) , were selected for demonstration on a well-known benchmark. it was shown that the methods based on the profile-based proteins p and p achieved the best performance, outperforming the original three string kernels by . $ . % and . $ . %, respectively, according to the average roc and roc scores. these results are fully consistent with our previous findings that the top two most frequent amino acids show stronger discriminative power than the other low frequent amino acids in the frequency profiles (liu et al., ) , further confirming that the proposed profile-based protein representation is a promising approach in extracting the evolutionary information from frequency profiles for protein remote homology detection. it has not escaped our notice that the current approach can be easily combined with sequence-based methods, and hence, with the development of the sequence-based kernels, the currently proposed method can be further improved accordingly. it is instructive to point out that since the concept of pseudo amino acid composition, or chou's pseaac (lin and lapointe, ) , was introduced in (chou, ) , it has been successfully used to predict various attributes of proteins (e.g. chen and li, ; chou, ; georgiou et al., ; huang and yuan, ; khosravian et al., ; mohabatkar, ; mohabatkar et al., mohabatkar et al., , mohammad beigi et al., ; nanni et al., ; sahu and panda, ; zhang et al., ; zhou et al., ; liu et al., ) . accordingly, the potential would be high to develop a powerful method for protein remote homology detection by combing pseaac with profile-based protein representation. in the original pseaac, it only uses three indices, including the hydrophobicity index, hydrophilicity index and side-chain mass index. because protein remote homology detection is a more difficult problem, proteins in the dataset only share low sequence similarity. only these three indices would not be enough to capture the different properties of various proteins. therefore, our further research will focus on incorporating new amino acid indices into pseaac and applying it to protein remote homology detection. conflict of interest: none declared steady-state kinetic studies with the non-nucleoside hiv- reverse transcriptase inhibitor u- e basic local alignment search tool gapped blast and psi-blast: a new generation of protein database search programs the worldwide protein data bank (wwpdb): ensuring a single, uniform archive of pdb data the astral compendium for sequence and structure analysis support vector machines for predicting membrane protein types by using functional domain composition irspot-psednc: identify recombination spots with pseudo dinucleotide composition predicting membrane protein types by incorporating protein topology, domains, signal peptides, and physicochemical properties into the general form of chou's pseudo amino acid composition the convergence-divergence duality in lectin domains of the selectin family and its implications prediction of protein cellular attributes using pseudo amino acid composition review: structural bioinformatics and its impact to biomedical science using amphiphilic pseudo amino acid composition to predict enzyme subfamily classes graphic rule for drug metabolism systems some remarks on protein attribute prediction and pseudo amino acid composition ( th anniversary year review) some remarks on predicting multi-label attributes in molecular biosystems a model of the complex between cyclin-dependent kinase (cdk ) and the activation domain of neuronal cdk activator binding mechanism of coronavirus main proteinase with ligands and its implication to drug design against sars wenxiang: a web-server for drawing wenxiang diagrams two-stage learning kernel algorithms probabilistic multi-class multi-kernel learning: on protein fold recognition and remote homology detection application of latent semantic analysis to protein remote homology detection ihsp-pseraaac: identifying the heat shock protein families using pseudo reduced amino acid alphabet composition use of fuzzy clustering technique and matrices to classify amino acids and its impact to chou's pseudo amino acid composition use of receiver operating characteristic (roc) analysis to evaluate sequence matching a multilabel model based on chou's pseudo-amino acid composition for identifying membrane proteins with both single and multiple functional types hidden markov models for detecting remote protein homologies aaindex: amino acid index database, progress report predicting antibacterial peptides by the concept of chou's pseudo-amino acid composition and machine learning methods profile-based string kernels for remote homology detection and motif extraction mismatch string kernels for discriminative protein classification combining pairwise sequence similarity and support vector machines for detecting remote protein evolutionary and structural relationships theoretical and experimental biology in one remote homology detection based on oligomer distances word correlation matrices for protein sequence analysis and remote homology detection a discriminative method for protein remote homology detection and fold recognition combining top-n-grams and latent semantic analysis prediction of protein binding sites in protein structures using hidden markov support machine using amino acid physicochemical distance transformation for fast protein remote homology detection protein remote homology detection by combining chou's pseudo amino acid composition and profile-based protein representation protein remote homology detection based on auto-cross covariance transformation protein function annotation by homology-based inference detecting remote evolutionary relationships among proteins by large-scale semantic embedding prediction of cyclin proteins using chou's pseudo amino acid composition prediction of gaba(a) receptor proteins using the concept of chou's pseudo-amino acid composition and support vector machine prediction of allergenic proteins by means of the concept of chou's pseudo amino acid composition and a machine learning approach prediction of metalloproteinase family based on the concept of chou's pseudo amino acid composition using a machine learning approach remote protein homology detection and fold recognition using two-layer support vector machine classifiers identifying bacterial virulent proteins by fusing a set of classifiers based on variants of chou's pseudo amino acid composition and on evolutionary information performance of an iterated t-hmm for homology detection profile-based direct kernels for remote homology detection and fold recognition compass server for homology detection: improved statistical accuracy, speed and functionality a novel feature representation method based on chou's pseudo amino acid composition for protein structural class prediction protein homology detection using string alignment kernels identification of common molecular subsequences protein homology detection by hmm-hmm comparison cyclin-dependent kinase (cdk ) activation domain of neuronal cdk activator. evidence of the existence of cyclin fold in neuronal cdk a activator statistical learning theory more generality in efficient multiple kernel learning insights from investigating the interactions of adamantanebased drugs with the m proton channel from the h n swine virus svm-balsa: remote homology detection based on bayesian sequence alignment the universal protein resource (uniprot): an expanding universe of protein information icdi-psefpt: identify the channel-drug interaction in cellular networking with pseaac and molecular fingerprints isno-aapair: incorporating amino acid pairwise coupling into pseaac for predicting cysteine s-nitrosylation sites in proteins identification of the n-terminal functional domains of cdk by molecular truncation and computer modeling using the concept of chou's pseudo amino acid composition to predict protein subcellular localization: an approach by incorporating evolutionary information and von neumann entropies an extension of chou's graphic rules for deriving enzyme kinetic equations to systems involving parallel reaction pathways the disposition of the lzcc protein residues in wenxiang diagram provides new insights into the protein-protein interaction mechanism using chou's amphiphilic pseudo-amino acid composition and support vector machine for prediction of enzyme subfamily classes the ph-triggered conversion of the prp(c) to prp(sc.) key: cord- -erh ggh authors: bentley, william e.; wang, min‐ying; vakharia, vikram title: development of an efficient bioprocess for poultry vaccines using high‐density insect cell culture date: - - journal: ann n y acad sci doi: . /j. - . .tb .x sha: doc_id: cord_uid: erh ggh nan infectious bursa disease virus (ibdv) is a pathogen of major economic importance to the world's poultry industries. it causes severe immunodeficiency in young chickens by destroying the precursors of antibody-producing b cells in the bursa of fabricius.' ibdv is a member of the birnaviridae family whose genome consists of two segments of double-stranded rna. presently, the principle method of controlling ibdv infection in young chickens is by vaccination with an avirulent strain of ibdv or by the transfer of high levels of maternal antibody to breeder hens.* recently, virulent strains that are antigentically different from previously established ibdvs have been isolated from vaccinated flocks on the delmarva peninsula. consequently, present vaccines afford only partial protection against infection. in figure , the large segment indicated is comprised of a precursor polyprotein that is processed into mature vp , vp , and vp . vp and vp are the major structural proteins of the virion. vp is the major host-protective immunogen of ibdv and contains the antigenic regions responsible for the induction of neutralizing antibodies. for this reason, we have focused on vp as a potential vaccine. when expressed in baculovirus as one of a cassette (vp , vp , and vp ), the coat proteins self-assemble into empty virions, which subsequently afford protection in challenged chickens (fig. ) : this article presents the current status of our efforts to develop a cost-efficient vaccine for ibdv using the recombinant baculovirus and insect cell culture process. although the baculovirus expression system has proven quite successful for use in laboratories, its use as an industrial expression system has yet to occur, although products for human use are in clinical trials. briefly, the system entails replacing the gene for the acnpv coat protein, polyhedrin, with the gene for the protein of interest. this coat protein is not required for the replication and maintenance of the virus in cell culture due to its biphasic life cycle. about three days after infection with a recombinant virus, the protein of interest is expressed instead of polyhedrin, which would normally constitute the majority of the total cellular protein. the system is attractive because it is nonpathogenic towards humans, and it involves the in virro propagation of cells for which technology has advanced due to previous mammalian and hybridoma culturing techniques. it provides for an abundant supply of the protein of interest in part due to the strong polyhedrin promoter and also because insect cell lines reported to date provide for glycosylation and other posttranslational processing. in addition, the construction of recombinant baculoviruses has become simplified due to commercial expression kits. several general articles, manuals, and review articles have been published that describe both the expression system'" and novel process engineering aspects.%" in this work, we subdivide the entire bioprocess into three key areas: ( ) metabolism of infected cells and specific heterologous protein yield; ( ) bioreactor configuration and high cell density continuous culture; and ( ) integration of expression and product separation. in each of the results subsections, we briefly review the recent literature as well as present our efforts to understand and advance these areas. the stock virus solutions were developed as described previ~usly.~ virus titer was determined by the end-point dilution method. the infections were performed by adding different volumes of virus solution after viable cell count. thus, the time postinfection commences from the addition of the virus solution. the multiplicity of infection (moi) was evaluated separately in spinner flask culture. samples ( ml) were taken from fermentors during the infection process, divided in two tubes, and centrifuged for min at , rpm. the supernatants were separated from cell pellets and stored at - °c until measurement of glucose and lactate. the cell pellet was resuspended in ml mm kh,po, (ph . ) buffer solution and stored at - °c until assayed (page) for quantification. another two ml samples were centrifuged for min at x g. the cell pellets were stored at - °c without resuspending in any buffer and used for the measurement of proteolytic activity and total protein. total cell counts were performed with a vwr-brand hemacytometer, and viability was determined by trypan blue dye exclusion using a . % solution (sigma). cell pellets resuspended in ml ehbs were sonicated on ice for s with a microtip and a % pulsed duty cycle. glucose and lactate were measured using a ysi model analyzer (yellow springs instruments), and ammonia was determined by an enzyme-based assay kit (sigma, no -uv). total protein was measured using a biorad protein assay kit i. alkaline protease activity was assayed using the synthetic substrate azocasein (sigma) at a concentration of % (w/v) in . m tris/hci buffer (ph . ). the cell pellets were resuspended using ml of . m tris/hci buffer (ph . ) and sonicated. one milliliter of azocasein solution was added to this ml sample, and this mixture was incubated at °c for . hours. the reaction was stopped by the addition of % trichloroacetic acid ( . ml). after standing at °c for min, the mixture was filtered, and . ml . m naoh was added to . ml of the filtrate. one unit of alkaline protease activity is defined as the amount of enzyme that gives an increase in am of . in min at °c. cells were harvested by centrifuging ml of culture and then resuspending them in ml native binding buffer ( mm sodium phosphate, mm sodium chloride, ph . ). egg white lysozyme ( pg/ml) was added to this mixture, which was incubated for minutes on ice. the cells were sonicated on ice three times for s with a % pulsed duty cycle (branson sonic power, co.) and then flash frozen in a - °c freezer. after thawing the sonicated cells at "c, a final concentration of pg/ml, rnase was added to the crude cell lysate and incubated on ice for minutes. insoluble cell debris was removed by centrifugation at , g for minutes. the column resin (invitrogen, ca) was washed with distilled water and then equilibrated with native binding buffer. cell lysate ( ml) was then loaded onto the column and washed by native binding buffer until od,,, was less than . . a step down to ph . was performed using native wash buffer ( mm sodium phosphate, mm sodium chloride, ph . ), and the od,,, of supernatant was monitored until its value was smaller than . . protein was then eluted by loading native-ph elution buffer ( mm sodium phosphate, mm sodium chloride, ph . ) onto the column and collecting ml fractions. hundreds of laboratories have expressed heterologous proteins using recombinant acnpv and sf- or bm- cell lines, yet only several laboratories have focused attention on maximizing yield by examining cellular functions during the growth and expression phases. factors such as moi, timing of infection, and cell density have been well documented. for example, neutra et al. l examined the effects of reactor volume, infection timing, moi, and cell density on p-galactosidase production in shake flasks. king et al. found that low moi was favorable for a temperaturesensitive baculovirus. conversely, for a more typical baculovirus with polyhedrin promoter control and an rcd product, lazarte et alt found high moi was most favorable. perhaps these systems differed by changes in cell metabolism that resulted from medium condition and infection temperature. in the lazarte et a .i study, fresh medium added at infection tripled the yield. indeed, several groups have demonstrated that infected cells retain significant metabolic activity, and this is essential for producing the recombinant protein. caron et al. noted that postinfection cell physiology was critical and suggested that different growth and production medias might be required. zhang et a . noted that for bm- cells, serum concentrations for optimal growth were different than for viral production. additionally, kamen et al. "quantified differences in principal nutrients and waste products during the growth and production phases. reports demonstrating the effects of nutrient limitation and by-product inhibition on protein yield have been limited, however. wang et a .i demonstrated that apparent glucose limitations could be offset by subsequent sucrose decomposition. lindsay and betenbaughi noted that the oxygen demand was higher for infected cells and adjusted aeration properties, which facilitated dramatic increases in yield. likewise, scott et a . " reported a significant increase in protein yield by maintaining dissolved oxygen tension at % of saturation. however, at high cell densities, maintaining oxygen was not sufficient for increasing yield. caron el al." restored recombinant protein production at higher cell density by renewing the medium at the time of infection. feeding the specific rate-limiting nutrients is less expensive, however, and aides in elucidating insect cell physiology under viral infection. two batch cultures were conducted to examine the effects of do during viral replication and protein production.*' the first batch culture was in a fermentor without do control (fig. ) , while the other was conducted with do regulated near % (fig. ) . virus infection was performed for the fermentor without do control by adding ml virus solution (moi ) at hours, after the cells had attained a concentration of . x lo cells/ml. this moi and cell density were chosen so that other factors, such as nutrient depletion, were eliminated and differences due to oxygen level were discernible. shortly after infection, the do dropped to zero and remained zero for another hours. the viable cell concentration remained near . x l@ cells/ml and then started to decrease simultaneously with an increase in do after hours (fig. a) . the onset of cell lysis, however, is marked by both the increment in lactate dehydrogenase (ldh) activity and decrement in cell viability near hours (fig. b) . note that after the do dropped to zero, the lactate increased to a maximum of . mm, reflecting the inadequacy of oxidative phosphorylation to supply the required atp. in these energy-limited cells, the final eh activity was . units per milliliter. for the batch culture with regulated do, the growth kinetics are shown in figure . virus infection was also performed at hours and . x cells/ml. unlike the oxygen-uncontrolled culture, the viable cell concentration continued to increase to a maximum of . x lo cells/ml. the increased demand in oxygen immediately following infection was met in this experiment, and the do was maintained above % and averaged % for the remainder of the experiment. the continuous addition of oxygen likely provided the requirement for continued cell growth. note that the lactate concentration remained below mm for the entire experiment, whereas the glucose concentration dropped almost %, from mm to mm after infection. the epoxide hydrolase activity reached . u/ml, which was % higher than in the oxygen-uncontrolled experiment. the specific productivity increased by % as well. this increase in protein synthesis is accomplished only by continued glucose uptake after infection. in the uncontrolled experiment, infected cells consumed little glucose (- mm) after the do dropped to zero. on the other hand, both the uninfected and infected cells in the do-controlled experiment continued to uptake glucose and consumed almost g/l after infection for cell growth and viral replication. note, therefore, that the effects of oxygen, like moi and cell density, are delineated and realized when sufficient nutrients are available in the extracellular environment. three experiments were performed to evaluate glucose and glutamine feeding in spinner flasks at high cell density (fig. ) . the use of ml spinner flasks with ml working volume prevents oxygen limitation. the cells were grown in a ml spinner flask with ml working volume until a density of . x lo cells/ml was attained; they were then divided and infected in three smaller spinner flasks at an moi of and a cell density of . x lo cells/ml. one of the spinner flasks was diluted :l with fresh medium as a control (total working volume, ml). previously, this method was reported as resulting in the highest specific yield attainable in spinner flasks.l .iy this spinner flask is denoted an oxygen and substrate-sufficient control. one spinner flask was then fed . ml of mm glutamine and pl of m glucose. subsequent daily glucose feeding maintained glucose supply (fig. a) . the last spinner flask was a control high cell density culture, without glucose/glutamine feeding. the epoxide hydrolase activity (fig. b) continued to increase for the culture with fresh medium (oxygenhubstrate control) until hours postinfection (hpi). by contrast, in the high cell density cultures, epoxide hydrolase activity increased more rapidly initially and then decreased at hpi for the glucose/glutamine fed culture, and at hpi for the unfed culture. a significant improvement in protein production from glucose and glutamine feeding was seen as early as hpi. coincidentally, the glucose concentration in the unfed culture had dropped dramatically by this time (fig. a) . the increase in activity continued well into the production phase and because the viable cell concentration in the fed culture was significantly lower after hpi, the increase in activity was due primarily to an increase in specific productivity. when comparing the overall performance of the effects of oxygen, glucose, and glutamine feeding, the total protein activity of the do-controlled culture was three times that of the uncontrolled culture. further, the glucose/glutamine fed culture was twofold higher in yield than the do-controlled culture. ultimately, the epoxide hydrolase yield was - milligrams per liter. as seen above, cell metabolism may play a very important role in recombinant protein production when cells are infected at high cell density. the metabolic stress responses of bacterial and mammalian cells to oxidation, heat shock, and amino acid starvation have been well documented, and many stress proteins have been ider~tified.~"?~ insect stress, however, has been quantified by previous researchers only by measuring the bulk proteolytic activity of whole insect homogenates. , in those reports, increased proteolytic activity corresponded to increased cellular stress. in many recent reports, degradation of the recombinant product has been s i g n i f i~a n t .~~.~~.~~ thus, another important aspect of cell metabolism concerns the elicitation of protease activity. we have used azocasein to investigate the appearance of alkaline proteases that may breakdown the product, particularly if allowed to follow in the separation and purification processes. in another fermentor experiment, we controlled the dissolved oxygen to % and infected cells (moi ) by the addition of virus stock solution when the cell density was about . ~ ~ cells/ml. this lower cell density was chosen in order to avoid other cellular stress effects (for example, nutrient starvation). the cells (fig. ) kept growing after viral infection due to a low mo - for a period of hours and then the viable cell density decreased later due to a secondary infection. the total cell density decreased due to cell lysis, and the viability was % at cell harvest. two recombinant protein activities and the total specific protease activity are also shown. in this system, p-galactosidase expression is under promoter p control; epoxide hydrolase (eh) is our desired recombinant product and is under the polyhedrin promoter control. -galactosidase is a cytoplasmic protein and will be liberated to the medium during expression and lysis. however, eh is membrane bound and not released to the medium. expression of both proteins (fig. ) started simultaneously. the p-galactosidase activity increased until hpi, was constant for a short time, and then decreased by % until harvest. on the other hand, eh increased more slowly initially, remained constant for hours and increased slowly by % until harvest. the specific protease activity was shown to increase dramatically starting at about hpi. this time just corresponds to the reduction of p-galactosidase activity and the slowdown in eh production. we should note that the viability was near % at this moment. inasmuch as the viability was high and.there was no nutrient limitation (not shown), there was no apparent reason for cells to stop producing recombinant protein. the protease (measured as specific activity in the cell pellet) increased at least -fold even when considering partial loss from cell lysis. we also measured proteolytic activity in supernatant, and no increment was found, which suggests that the proteolytic activity was either associated with the cell membrane or was too dilute to detect in the supernatant. although there is no specific connection between the protease activity and the shift in protein activities (p-gal, eh), the coincident timing warrants further consideration. because it is unlikely that protease inhibitors will be used in a commercial process, the specific nature and timing of this proteolytic activity is significant. hence, characterization of the identity and consequences of this and other proteases is under investigation. in addition to the many productivity advantages that continuous suspension culture has over stationary (t-flask) or batch-suspension culture, continuous culture provides homogeneous and constant environmental conditions that facilitate the modeling of cell growth and metabolism. research with hybridoma cells has demonstrated this p~i n t .~' .~~ due to infected cell lysis, most continuous insect cell systems have employed two reactor stages: one for cell growth and the other for virus propagation and recombinant protein expre~sion."*~*~~ however, it has been re- ported that there is eventually an almost complete loss of productivity from continuous insect cell culture with a continuously passaged v i r~s .~~.~~.~~ limited stability of the virus upon serial passage may limit the application of two-stage continuous reaction systems. this may be improved, however, by a three-stage continuous bioreactor system proposed by taticek and shuler, x or by a cycling-batch reactor scheme." at present, batch and perfusion systems have achieved significantly higher viable cell density than continuous cultures. further, there is some question as to the long-term productivity of the cell lines in continuous cultures, independent of the virus.ls consequently, we have investigated methods for maximizing cell number in extended continuous cultures. y dilution rates were examined from . to . hr-l, which is close to the maximum specific growth rate obtained in batch cultures.'' in figure , our small-volume glass-jacketed spinner flask reactorjy is depicted. cultures were stirred at rpm and contained . % pluronic f- . the airsparging system was cyclic*with min on and min off. two volummetric air flow rates were examined: . ml/min ( . vvm) and . ml/min ( . vvm) for low-and high-sparging rates, respectively. the viable and total cell densities for both sparging rates are plotted as a function of dilution rate in figure . at the low sparging rate, a monotonic decrease in viable cell density with increasing dilution rate was observed. this is consistent with the semicontinuous culture of mouse ls cells (without ph and do control) by s i n~l a i r .~~ the high sparging rate exhibited a maximum in total and viable cell densities at a dilution rate of . hr-i. the decrease in viable cell density at lower dilution rate ( . hr-i) may have been due to low nutrient and/or high toxin concentrations."note that at . hr-l, the cell density for the high sparging rate ( . vvm) was five times higher ( . ~ ~ vs. x cells/ml) than the low sparging rate. thus, this sparging rate provided much better oxygenation for cell growth with little contribution to cell death. the trends shown in figure^ are also similar to a continuous sp / -derived mouse hybridoma cell culture with ph and do control as well as an oxygen-limited continuous culture (ph control only) of mouse nbi h y b r i d o m a~.~~ the data depicted was collected in one continuous culture of over hours in duration. at several times, the dilution rate was reset to a previous setting, and the cell density returned to the previous steady state value. y thus, the spinner flask reactor performed well, was reproducible, and will be very useful for further development of the additional protein expression stages. in figure , we demonstrated that a cell density of . x loh cells/ml with % viability could be achieved at a dilution rate of . hr-i. in order to check the protein expression level of infected cells grown continuously for an extended period, cells at dilution rate of . hr-l with low ( . vvm, h) and high sparging rate ( . vvm, hr) were taken from the bioreactor and directly infected with an moi of in two ml spinner flasks ( ml working volume). the maximum recombinant protein yields were . u/ml at hpi and . u/ml at hpi for cells from the hr and hr samples, respectively. the maximum cell densities reached were . x lo and . x loh cells/ml for the hr and hr samples. flow diagram and setup of continuous bioreactor system. the total volume is ml, with working volume up to ml. air is introduced by way of peristaltic pump and hplc solvent porous sparger. air is introduced in on/off mode with a home timer. temperature is maintained by water bath and glass jacket. agitation is provided by magnetic stir plate (bellco). " the average specific eh activity, calculated as the maximum enzyme activity divided by the maximum cell density, was slightly lower for cells from the lower sparging rate ( . vs. . x lo--' u/ q cells). perhaps this was due to a lower growth rate ( . hr-i vs. . hr-i) or lower oxygen supply during the continuous culture. however, the specific eh activity from these long-term cultured cells is comparable to that from freshly prepared cells ( . - x lo-' u/ @ cells"). this demonstrates that high cell density (more than x loh cells/ml) can be achieved in continuous culture, even at a dilution rate of . hr-i. in addition, the specific protein yield from long-term cultured cells (over three months) is not compromised. the results of this work can be combined with the previous discussion on maximizing specific productivity in designing the best reactor strategy. this may involve incorporating feeding policies in the second or third stage of a multistage continuous system, or the inclusion of several alternating batch reactors that are filled by one continuous or semicontinuous reactor. an attractive feature of the baculovirus system has been its potential for disease diagnostics and vaccine development. again, this is due to the high expression level, posttranslational processing, and nonpathogenicity. in addition, recent reports have shown that expression of virus coat proteins often results in self-assembled virus-like particles (vlp) that are essentially empty whole virion^.^,^"' of these vlp-producing systems, vaccines have been proposed for p~liovirus,"~ parvovirus, " ibdv, flock house virus,s' and bluetongue virus. in addition, vaccines have been proposed or tested from non-vlp proteins, including hiv,s yellow fever," feline herpes trypanosoma cru~i,'~ mokola and rabies viruses:h newcastle disease vi-us,^^ japanese encephalitis:s hepatitis c, and bovine coronavirus. of these, only two research groups have considered the advantages of integrating initial separation with product expression by employing imac. *h' we have developed the bioprocess so that imac can be employed as a potentially single-step separation procedure. imac was introduced for the selective adsorption of protein by porath et ~ . ~~ who used divalent cations, zn + and cu +, chelated to a chromatographic matrix to fractionate serum proteins. metal ion ligands most often used in imac are first-row transition metals (zn, ni, cu, and fe) chelated by iminodiacetate (ida). in resin, they distinguish affinities exhibited by functional groups on the surface of proteins. these interactions between particular surface amino acids and immobilized metal ions provide the basis for metal-affinity protein to a first approximation, proteins are retained on metal-affinity columns according to the number of accessible his ti dine^.^*^' histidine is not a common residue in protein sequences and it accounts for only . % of amino acids in globular proteins.hr thus, the chance that a desired protein, engineered to contain additional histidines at its c-or n-terminus, will be preferentially separated from other proteins is quite good. a description of the construction of the vp baculovirus that expresses his,-vp (denoted vp h) is given elsewhere. ' monoclonal antibodies (mabs) recognizing ibdv were produced and characterized using protocols previously d e s~r i b e d .~~,~~' identification of ibdv antigens by modified antigen-capture elisa (ac-elisa) was also carried out as described by snyder et al." recombinant vp proteins from insect cells infected with vedlh- (for vp h) and vedl- (for vp ) were identical using the array of mabs and ac-elisa. however, these two proteins (vp , vp h) were distinguished and resolved on a . % sds-polyacrylamide gel, and detected immunologically following western blotting with polyvalent chicken anti-ibdv ~e r u m .~~,~* this indicated that vp protein was fused with histidine residues. the vp h was purified by affinity chromatography using a niz+ immobilized resin column provided in a protein purification kit as described in the methods section (invitrogen, ca). the final elution was monitored by odzs readings of the fractions, using the native-ph elution buffer as a blank. in f l g u r e , the optical density (odzso) of the column eluate is shown for the wash and vp eluate steps. the eluted pool (fractions - ) from the ph buffer was collected, concentrated, and desalted by a kda membrane (amicon) and checked by western blotting!' in figure a, the purified vp h protein (molecular weight near kda, lane ) comigrated with two vp proteins: one derived from gls ibdv (lane ) and the other from sf- cells infected by ibdv- recombinant baculovirus encoding entire vp , vp , vp , vp x proteins of ibdv (lane ). this demonstrates that the vp h protein can bind the ni + ion on the resin and was effectively eluted by the ph elution buffer. the purity of purified vp h protein was also checked by sds-page (fig. lob) . the intense band with a molecular weight of about kda showed that this vp h protein was dominant in the mixture (roughly %). some small proteins or peptides were still in this purified mixture which may have been due to partial degradation of vp h (data not shown) or additional elution by this strong low-ph buffer. in this work, a step change from ph . to ph , which is much lower than the pka value commonly observed for surface histidines (- . ): may have coeluted some peptides with a stronger binding ~apacity.'~ because we demonstrated that the recombinant viruses express immunoreactive vp proteins, the next step was bench-scale production of vp h protein. i proteins were about and mg/l, respectively. although we have not used imac for a large volume purification, we have obtained sufficient data to estimate protein production costs using the bioprocess presented. advantages for the recombinant poultry vaccine, as compared to the injection of an avirulent strain or the transfer of maternal antibody to breeder hens, include potential for complete and passive protection, higher reactivity per milligram of vaccine protein, smaller dose quantities, and more convenient processing. at this point, results on the optimum dose composition and quantity are incomplete, so a direct economic comparison between the processes is unavailable. however, given both the attractiveness of the baculovirus insect cell expression system and the optimization research activity, one might anticipate commercialization of ibdv vaccines and other baculovirus-based products in the near future. the work reviewed and presented in this paper highlights specific areas where further advancements will lead to increased productivity and therefore lower processing cost, namely maintaining the maximum cell number and specific protein productivity, developing convenient and stable bioreactor operating strategies, and integrating unique lowcost separations steps. structural and growth characteristics of infectious bursal disease virus maternally derived antibody-effect on susceptibility of chicks to infectious bursal disease genomic structure of the large rna segment of infectious bursal disease virus infectious bursal disease virus structural proteins expressed in a baculovirus recombinant confer protection in chickens insect cell culture technology in baculovirus expression systems expression of foreign genes in insects using baculovirus vectors baculovirus as vectors for foreign gene expression in insect cells baculovirus expression vectors: a laboratory manual large-scale insect cell culture for recombinant protein production production of (recombinant) baculoviruses in insect-cell bioreactors optimization of protein-production by the baculorivus expression vector system in shake flasks assessment of virus production and chloramphenicol-acety i-transferase expression by insect cells in serum-free and serum-supplemented media using a temperature-sensitive baculovirus optimization of the production of full-length rcd in baculovirus-infected sf cells high-level recombinant protein production in bioreactors using the baculovirus insect cell expression system a two-stage bioreactor system for the production of recombinant proteins using a genetically engineered baculovirus/insect cell system culture of insect cells in a helical ribbon impeller bioreactor expression of epoxide hydrolase in insect cells: a focus on the infected cell quantification of cell culture factors affecting recombinant protein yields in baculovirus-infected insect cells effects of oxygen on recombinant protein production by suspension cultures of spodoptera fiugiperda (sf- ) insect cells effects of oxygen/glucose/glutamine feeding on insect cell baculovirus protein expression: a study on epoxide hydroxylase production interaction of hepatic microsomal epoxide hydrolase derived from a recombinant baculovirus expression system with an azarene oxide and an aziridine substrate analogue escherichia coli and sulrnonella typhirnuriurn cellular and molecular biology coordinated regulation of a set of genes by glucose and calcium ionophores in mammalian cells alterations in proteases, protease inhibitors and ecdysone levels: a profile of stress in insects effect of developmental derangements on the proteolytic and protease-inhibitory activities in galleria rnellonella (insecta) ecdysteroids increase the yield of recombinant protein produced in baculovirus insect cell expression system production of recombinant sarcotoxin ia in bonbyx rnori cells kinetic analysis of protease activity, recombinant protein production, and metabolites of infected sf- cells under different do levels factors influencing recombinant protein yields in an insect cell-baculovirus expression system: multiplicity of infection and intracellular protein degradation a kinetic analysis of hybridoma growth and metabolism in batch and continuous suspension culture: effect of nutrient concentration, dilution rate, and ph a structured kinetic modeling framework for the dynamic hybridoma growth and monoclonial antibody production in continuous suspension culture a model for baculovirus production with continuous insect cell cultures continuoue production of baculovirus in a cascade of insect-cell reactors continuous p-galactosidase production with a recombinant baculovirus insect-cell system in bioreactors a structured dynamic model for the baculovirus infection process in insect-cell reactor configurations a continuous process for the production of baculovirus using insect-cell cultures a continuous flow bioreactor system for the production of recombinant proteins using the insect cell-baculovirus expression system continuous insect cell (sf- ) culture with aeration through sparging in a novel low-volume bioreactor scale-up of insect cell cultures: protective effects of pluronic f- response of mammalian cells to controlled growth rates in steady-state continuous culture the baculovirus-integrated retrotransposon ted encodes gag and pol proteins that assemble into virus-like particles with reverse transcriptase development of baculovirus triple and quadruple expression vectors: co-expression of three or four bluetongue virus proteins and the synthesis of bluetongue virus-like particles in insect cells three-dimensional reconstruction of baculovirus expressed bluetongue virus core-like particles by cryo-electron microscopy synthesis of bluetongue virus (btv) corelike particles by a recombinant baculovirus expressing the two major structural core proteins of btv analyses of the requirements for the synthesis of virus-like particles by feline immunodeficiency virus gag using baculovirus vectors assembly and release of hiv- precursor pr eg virus-like particles from recombinant baculovirus-infected insect cells synthesis of immunogenic, but non-infectious, poliovirus particles in insect cells by a baculovirus expression vector canine parvovirus empty capsids produced by expression in a baculovirus vector: use in analysis of viral properties and immunization crystallization of virus-like particles assembled from flock house virus coat protein expressed in a baculovirus system cell-surface expression and purification of human cd produced in baculovirus-infected insect cells d yellow fever vaccine virus envelope protein expressed by recombinant baculovirus is antigenically indistinguishable from authentic viral protein the use of feline herpesvirus and baculovirus as vaccine vectors for the gag and env genes of feline leukemia virus trypanosoma cruzi flagellar repetitive antigen expression by recombinant baculovirus: towards an improved diagnostic reagent for chagas' disease structure and expression in baculovirus of the mokola virus glycoprotein: an efficient recombinant vaccine vaccination against newcastle disease with a recombinant baculovirus hemagglutinin-neuraminidase subunit vaccine protection of mice against lethal japanese encephalitis with a recombinant baculovirus vaccine secretion and purification of hepatitic c virus nsl glycoprotein produced by recombinant baculovirus-infected insect cells primary structure of the s peplomer gene of bovine coronavirus and surface expression in -insect cells purification of a recombinant protein produced in a baculovirus expression system by immobilized metal affinity chromatography metal chelate affinity chromatography, a new approach to protein fractionation imac-immobilized metal ion affinity based chromatography purification of proteins by imac the sage of imac and mit cu(i )-binding properties of a cytochrome c with a synthetic metal-binding site: his-x -his in an a-helix evaluation of the interaction of peptides with cu(ii), ni(ii), and zn(i ) by high-performance immobilized metal ion affinity chromatography the independent distribution of amino acid near neighbor pairs into polypeptides group and strainspecific neutralization sites of ibdv defined with monoclonal antibodies differentiation of infectious bursal disease viruses directly from infected tissues with neutralizing monoclonal antibodies: evidence of a major antigenic shift in recent field isolates naturally occurring-neutralizing monoclonal antibody escape variants define the epidemiology of infectious bursal disease viruses in the united states comparative studies on structural and antigenic properties of two serotypes of infectious bursal disease virus a mathematical model for metal affinity protein partitioning metal affinity precipitation of protein carrying genetically attached polyhistidine affinity tails we thank gerard h. edwards and sarah milczanowski for technical assistance. key: cord- -c qcjve authors: faaberg, k. s.; plagemann, p. g. w. title: membrane association of the c-terminal half of the open reading frame a protein of lactate dehydrogenase-elevating virus date: journal: arch virol doi: . /bf sha: doc_id: cord_uid: c qcjve orf a of lactate dehydrogenase-elevating virus, strain p (ldv-p), encodes a protein of amino acids. eisenberg hydrophobic moment analysis of the protein predicted the presence of eleven transmembrane segments in the c-terminal half of the molecule (amino acids – ) that flank the serine protease domain. cdnas encoding orf a protein segments encompassing transmembrane segments to and its amphipathic c-terminal end as well as the n-terminal amino acids of the downstream orf b protein were transcribed and the transcripts in vitro translated in the absence and presence of microsomal membranes. the synthesis of the protein products with putative transmembrane segments was enhanced by the presence of the microsomal membranes and the proteins became membrane associated. when synthesized in the absence of membranes they were recovered in the supernatant upon ultracentrifugation of the translation reaction mixtures, whereas they were recovered in the membrane pellet when synthesized in the presence of membranes. furthermore, the latter proteins were not released from the membranes by disruption of the membrane vesicles in carbonate buffer, ph . , and large portions of the proteins were resistant to digestion by trypsin, chymotrypsin and proteinase k. no n-glycosylation was observed and only little, if any, processing of the protein by the putative serine protease. the results indicate that the c-terminal half of the orf a protein represents a non-glycosylated integral membrane protein. potential modes of synthesis and function of the protein are discussed. in addition, the results showed that the synthesis of the orf a protein was generally terminated at its termination codon, but that read-through into the orf b gene occurred with low frequency. summary. orf la of lactate dehydrogenase-elevating virus, strain p (ldv-p), encodes a protein of amino acids. eisenberg hydrophobic moment analysis of the protein predicted the presence of eleven transmembrane segments in the c-terminal half of the molecule (amino acids - ) that flank the serine protease domain, cdnas encoding orf la protein segments encompassing transmembrane segments to and its amphipathic c-terminal end as well as the n-terminal amino acids of the downstream orf lb protein were transcribed and the transcripts in vitro translated in the absence and presence of microsomal membranes. the synthesis of the protein products with putative transmembrane segments was enhanced by the presence of the microsomal membranes and the proteins became membrane associated. when synthesized in the absence of membranes they were recovered in the supernatant upon ultracentrifugation of the translation reaction mixtures, whereas they were recovered in the membrane pellet when synthesized in the presence of membranes. furthermore, the latter proteins were not released from the membranes by disruption of the membrane vesicles in carbonate buffer, ph . , and large portions of the proteins were resistant to digestion by trypsin, chymotrypsin and proteinase k. no n-glycosylation was observed and only little, if any, processing of the protein by the putative serine protease. the results indicate that the c-terminal half of the orf la protein represents a non-glycosylated integral membrane protein. potential modes of synthesis and function of the protein are discussed. in addition, the results showed that the synthesis of the orf la protein was generally terminated at its termination codon, but that read-through into the orf lb gene occurred with low frequency. lactate dehydrogenase-elevating virus (ldv) belongs to a new group of positive-strand rna viruses, presently classified as genus arterivirus [ ] , which also includes equine arteritis virus (eav), simian hemorrhagic fever virus, and porcine reproductive and respiratory syndrome virus (prrsv) [ , ] . expression of the viral genomes of - kb in infected cells is via the formation of 'coterminal nested sets of or subgenomic mrnas. the major structural proteins of these viruses are translated from subgenomic mrnas , , and . the ' three-quarters of the viral genome encodes two large proteins, la ( - amino acids) and lb ( - amino acids) which are translated from genomic rna. the orf lb protein is expressed via a frameshift mechanism involving a slippery sequence and a pseudoknot [ ] . the orf lb proteins of these viruses possess several common functional motifs, i.e. replicase, helicase, and zinc finger motifs, and probably represent the rna replicases of these viruses. the orf la protein of eav possesses an n-terminal papain-like cysteine proteinase (pcp) that autocatalytically cleaves off the n-terminal -kda end of the protein; (nonstructural protein-l; nsp- ) downstream of the catalytic pcp residues [ ] . the orf la proteins of ldv and prrsv possess two pcps. both are functionally active in cleaving off n-terminal products of about (nsp-la) and about kda (nsp-l[ ), respectively [ ] (see fig. ). in the case of eav, a kda product (nsp- ) is then removed by another cysteine proteinase cp [ ] . in addition, the orf la proteins of these viruses possess a serine proteinase (sp) motif in the c-terminal half of the molecule (see fig. ), and the sp has been suggested to be responsible for the further cleavage of the protein into functional units [ , ] . the function of the orf la proteins is unknown. however, hydrophobic moment analyses of the orf la proteins of ldv, eav, and prrsv have identified similar or potential transmembrane segments in their c-terminal segments of about amino acids [ ] which flank the sp motif, to segments on either side (see fig. ). the predicted structure is unique among virus-encoded proteins and implies some specific function of the orf la proteins in arterivirus replication. the only other proteins with or more transmembrane segments are membrane-associated transport proteins [ , , , ] . in the present study we provide strong evidence that the segment of ldv orf la protein with transmembrane segments - (see fig. ) becomes intimately associated with endoplasmic reticulum (er) membranes during synthesis and that none of its potential n-glycosylation sites becomes glycosylated. our approach has been to in vitro translate mrnas representing portions ofldv orf la in the absence and presence of microsomal membranes and then to examine the membrane association of the protein products and their susceptibility to degradation by proteinases. at the same time we have examined the efficiency of termination of the orf la protein and of frameshift/readthrough at the slippery sequence at the end of ldv-p orf la ( gcu uua aac uguuga... ; slippery sequence and orf la termination codon are underlined; [ ] ). plasmids pbsk-a , pbks ÷ - (g ), and pbks ÷ - carry overlapping segments of the ' end of ldv-p orf la (see fig. ; nt - , - and - , respectively; genbank accession number u ) and have been generated in previous studies [ ] . cdna - also encompasses the y-terminal end of orf lb including the putative pseudoknot that is postulated to play a role in frame-shifting between orf la and lb. cdna - was excised from its bluescript vector with restriction endonucleases bamhi and aatii, incubated with klenow dna polymerase in the presence of deoxynucleotide triphosphates to obtain blunt ends and ligated into the msci site of the pcite- vector (novagen, madison, wi) which supplied an aug initiation codon nucleotides upstream of the orf-la segment in pc . . the dnas of pbsk-a , pbsk÷ - and pc . were linearized and transcribed with t rna polymerases or with t rna polymerase in the case of pbsk-a as described previously [ ] . the integrity of the rna was verified by agarose gel electrophoresis (see later, fig. a ). the transcribed rnas were in vitro translated in a rabbit reticulocyte lysate system in the absence and presence of canine pancreatic microsomal membranes as also described previously [ , ] . for investigating their association with microsomal membranes, the products of translation were incubated and centrifuged under different buffer conditions - , , ] . the translation reaction mixtures were diluted about t -fold with tris-buffered sucrose ( ram sucrose, mm tris-hct, ph . ) or mm sodium carbonate, ph . . the mixtures were incubated on ice for h and then centrifuged for h at about x g at °c. when incubated in isotonic sucrose buffer, the microsomal membranes remain closed vesicles and all proteins associated with the membrane and those located within the vesicles will be located in the membrane pellet (p), whereas proteins synthesized on free ribosomes will be recovered in the supernatant (s). in contrast, when translation products are incubated in mm na÷-carbonate at ph . , the membrane vesicles are disrupted and only integral membrane proteins remain associated with the pelleted membranes, whereas secreted glycoproteins and peripheral proteins are recovered in the supernatant [ ] . after centrifugation, the proteins in the supernatant (s) were precipitated with trichloroacetic acid, washed with acetone, and air dried. the pelleted material (p) was washed once in sterile water. the s and p proteins were suspended in reducing sample buffer and analyzed by tricine sodium dodecylsulfate polyacrylamide gel electrophoresis (sds-page) [ ] using . %t: %c or . %t: %c gels (t refers to total percentage of acrylamide and bisacrylamide monomers and c refers to the percentage of crosslinker relative to t [ ] ) as described previously [ ] . only the orf la segment present in each transcript could yield a protein of significant size since the other two reading h-ames in each transcript did not encode any protein longer than amino acids. the main product of the a transcript whether translated in the absence or presence of microsomat membranes was a protein of about kda ( fig. a, lanes and ) . however, the presence of membranes enhanced translation of the a transcript and the product was almost exclusively recovered in the membrane pellet and thus associated with membranes, whereas the protein synthesized in the absence of membranes was mainly recovered in the s fraction. incubation in carbonate buffer, ph . , did not cause a significant release of the protein from the membranes (fig. b, lanes and ) . thus the protein product became integrated into the membranes during membrane-associated synthesis. translation must have been initiated at one of the three aug codons of orf la located close together at the ' end of the transcript, but which one is not clear. none of the three augs are in favorable context for translation [ ] with c rather than a purine at the - position, relative to the a(+ ) of the aug codon but the same is the case for the initiation codons of most of orfs - which function efficiently and specifically in translation initiation [ ] . the termination codon was provided by the vector so that the six terminal amino acids in the protein product were derived from vector sequences. initiation at one of the three aug codons would yield proteins with , or amino acids, respectively. another minor product of about -kda was consistently produced from the a transcript with properties similar to those of the about -kda protein. its nature is unknown. the results for the translation of the - transcript, which also encodes a protein sequence with potential transmembrane segments, were similar to those described for the a transcript. the main product of about kda was recovered in the s fraction when synthesized in the absence of membranes, whereas it was recovered in the p fraction when synthesized in the presence of membranes ( fig. a, lanes and ) . incubation in carbonate buffer, ph . , did not cause a significant dissociation of the protein from the membranes (fig. b, lanes and ) . translation was markedly enhanced by the presence of membranes. since the molecular weight of the product was the same whether the protein was syl~thesized in the absence or in the presence of membranes, none of three potential n-glycosylation sites in the predicted protein (see fig. ) became fig. . analysis of the membrane association of the in vitro translated proteins a , - , and - . the transcripts of the appropriate plasmids were in vitro translated in the absence and presence of pancreatic membranes (-or +, respectively). the reaction mixtures were further incubated in trissucrose, ph . (a), or carbonate buffer, ph . (b) and centrifuged. the proteins in the supernatant (s) and the pellet (p) were analyzed by tricine sds-page using . %t: % gels glycosylated. in contrast, we have shown previously [ , ] , that under the same experimental conditions the orf - glycoproteins of ldv-p become nglycosylated in vitro when synthesized in the presence of membranes. there were also membrane-associated proteins of about , , and kda synthesized from the - transcript (fig. a, lane ) . since the putative transmembrane segments fall in the middle of the expected product (see fig. ), the smaller products could have resulted from premature termination or initiation at more downstream aug codons. the former was probably the case, since there is only one additional in-frame aug codon that could yield a product with putative membrane segments and approximately the size ( kda) of one of the smaller products and it is in very unfavorable context for initiation. processing by the serine proteinase could also not play a role in generating the smaller products since the potential - product does not contain the complete sp motif. the primary findings concerning the a and - proteins were that they are, as predicted by their hydrophobicity profiles, integral membrane proteins and that they are not n-glycosylated. as also predicted by the hydrophobic moment analyses (fig. i) , the pc . product did not become membrane-associated. a transcript of pc . could potentially yield two protein products if a frame-shift occurred at the slippery sequence at nt - (see earlier and fig. ). one protein would be composed of amino acids ( . kda), nterminal amino acids encoded by the vector plus the amino acid long c-terminal end of the orf la protein. the second protein would possess an additional amino acids (about kda), representing the n-terminal end of the orf lb protein plus c-terminal amino acids encoded by the vector. both products were generated. since the kda product possessed twice as many met residues as the kda protein, namely six, the level of radioactivity in the two proteins indicates that the read-through product was synthesized in considerably lower amounts than the terminated orf la protein ( fig. a, lane ). since significant amounts of a kda protein were not produced, little, if any, cleavage of the kda orf lb protein segment from the kda read-through product occurred. the presence of membranes had no significant effect on the translation of the pc . transcript and the products were recovered in the soluble supernatant fraction whether synthesized in the presence or absence of membranes ( fig. a, lanes - ) . the predicted catalytic triad of the sp of the ldv-p orf la protein consists of his- , asp- , ser- [ , ] . his- and asp- are present in the c-terminal end of the a protein. ser- is encoded in cdna - . in order to examine the functionality of the sp and to further examine the membrane association of the orf la protein, we have joined a and - at a styi site in their overlapping segment (see fig. ) and in vitro transcribed/translated the construct. a was excised from pbsk-a using restriction endonucleases styi and ncoi releasing a segment of nt (nt - ). pbks ÷ - was cut with styi and the a segment ligated to it resulting in a clone which represented ldv nts - followed by ldv nts - . this clone was then digested with bamhi and ncol (removing ldv nt - from the ' end), blunt ended and ligated. the resulting construct pbks ÷ a / - possessed the r terminal aug codons of a and potentially encoded a protein of amino acids ( . kda), orf la amino acids - plus c-terminal amino acids encoded by the vector. it encompassed the sp-motif and putative transmembrane segments --- (see fig. ). transcription of the construct yielded a transcript of the appropriate size (fig. a) . translation of the transcript yielded in some experiments a protein of the expected size of about kda (see fig. ), but in many others, the main product had a size of about kda whether or not membranes were present (fig. b, lanes and ) . the reason for this difference is not known. the -kda protein could have resulted from processing of the -kda protein, but, if this was the case, processing must have occurred very rapidly since a time course experiment showed that maximum amounts of product were produced by min fig. . agarose gel electrophoretic analysis of the transcripts of pbks+a / - , pbsk a , pbks÷ - and pc . (a) and in vitro synthesis of protein a / - and analysis of its membrane association (b). the a / - proteins synthesized in the absence and presence of microsomal membranes were analyzed as described in the legend to fig. of incubation. the protein profile changed little, if at all, during the - min period of translation (data not shown) and there was no indication of the formation of an - kda protein that would have been expected to be formed in such a processing step (figs. b and ) . only a minor product of to kda was consistently produced (figs. b and ) . furthermore, the formation of the protein products was the same when all proteinase inhibitors ( gg pepstatin a, gg leupeptin and gg aprotonin per ml, and iam phenylmethylsulfonyl fluoride) were omitted from the translation reaction mixture (data not shown). thus very little, if any, processing by the sp was observed. regardless, when synthesized in the absence of membranes the protein products were recovered in the supernatant fraction, whereas when synthesized in the presence of pancreatic membranes they were recovered in the membrane fraction whether the reaction products were incubated in the buffered sucrose solution or in carbonate buffer, ph . , prior to centrifugation (fig. b, lanes and and and , respectively) . the results clearly indicate that the primary protein products become membrane associated when synthesized in the presence of membranes. furthermore, the lack of increase in size of the primary products when synthesized in the presence of membranes indicates that the proteins were not significantly glycosylated at any of three potential glycosylation sites. in order to further investigate the membrane association of the a / - protein we determined its proteinase resistance after synthesis in the presence of microsomal membranes. large portions of the protein seemed to be protected by membrane association from digestion by typsin, chymotrypsin, or proteinase k. the main digestion product was about kda but there were lower amounts of products with apparent molecular weights of about kda and kda k.s. faaberg and p. g. w. ptagemann fig. . sensitivity of the membraneassociated a / -t proteins to proteinase digestion. the products synthesized in vitro in the presence of membranes were incubated in trissucrose, ph . (a), or carbonate, ph . (b), and then digested for h on ice with trypsin, chymotrypsin or proteinase k (each at a concentration of . ~tg/ml), or without them (control) before analysis by tricine sds-page using a . %t: %c gel (fig. , lanes - ) . the same products were generated by proteinase digestion after incubation of the primary product in carbonate buffer, ph . (fig. b) . the lower recovery of some products after the carbonate treatment does not indicate additional protein digestion since a variable loss was also frequently observed with non-proteinase-treated proteins after carbonate, ph . , treatment, apparently resulting from sticking of the proteins to test tube walls [ ii. the smallest proteinase digestion products varied in size depending on the proteinase used (fig. ) ; it was the smallest after proteinase k digestion. the proteinase digestion products, except for the - -kda proteins, were larger than the segments making up the transmembrane segments - (about llkda) or -- (about kda). this finding indicates that portions of the a / - protein in addition to the transmembrane segments became proteinase resistant as a result of membrane association of the proteins. similar findings have been reported for coronavirus m proteins; protein segments upstream and downstream of their transmembrane segments seem to be resistant to chymotrypsin and protein k attack [ , ] . the same applies to the m/vp- and vp- p proteins of ldv [ ] . in contrast, the ldv proteins when synthesized in vitro in the absence of membranes were completely digested by the three proteinases under the same experimental conditions [ ] . the authenticity of the in vitro synthesized proteins as ldv proteins was confirmed by immunoprecipitation of the products by anti-ldv antibodies. both the about kda and - kda products of the a / - rna were precipitated by incubation with plasma from -month ldv-infected mice (imp) but not by normal mouse plasma (nmp), or monoclonal antibodies to ldv n/vp- or vp- (fig. , lanes - ) . the same applied to the about kda and fig. a , b. immunoprecipitation of in vitro synthesized products of the a / - transcript mouse plasma (imp), anti-vp- mab c . or anti-vp- pmab - as described previously [ ] kda products of the . rna (fig. , lanes - ) . the results are also of interest in that they demonstrate that mice mount antibody responses to the c-terminal half of the orf la protein. in conclusion, the present study shows that at least a large portion of the c-terminal half of the ldv orf la protein represents an integral membrane protein(s). the in vitro synthesis of the portion of the orf la protein with the putative transmembrane segments is enhanced by the presence of microsomal membranes and the protein(s) synthesized in the presence of the membranes becomes inserted into the membrane in a form that is not released by disruption of the membrane vesicles in a carbonate buffer, ph . , and that is largely protected from proteinase digestion. much larger portions of the protein seem to become proteinase resistant than strictly the portions making up the putative transmembrane segments, suggesting intimate association with the membrane. the membrane association of a large portion of the orf la protein suggests that the orf la protein or at least its c-terminal half is synthesized in vivo on rough er. one scenario suggests that the synthesis of the orf la protein starts on free ribosomes, that the n-terminal end is processed autocatalyticalty by the two paps releasing products of and kda, (fig. ; nsp-l~ and , respectively) [ - . this might be followed by the autocatalytic release of another protein (nsp- ) by the action of cysteine protease [ ] . one of the putative transmembrane segments then could function as a processed or uncleaved signal peptide so that the synthesis of the remainder of the orf la protein occurs on membrane vesicles, however, in a manner that prevents n-glycosylation. in contrast, all the n-glycosylation sites in the ectodomains of the orf and orf glycoproteins become glycosylated during membrane-associated in vitro synthesis and the oligosaccharide chains seem to become processed during traverse through golgi membranes in vivo [ , ] . nothing is known about the functions of the orf la protein. however, the integration of at least a large portion of the protein into er membranes suggests one possibility, namely that this process results in the formation of unique double-membrane vesicles that are invariably associated with the replication of all arteriviruses regardless of the nature of the host cell [ , , ] . electron micrographs suggest that these double membrane vesicles originate from rough er by protrusion and detachment [ ] . their formation is a very early event in ldv replication in macrophages and free nucleocapsids first appear among these vesicles before budding into single-membrane cisternae located generally next to golgi membranes [ , ] . these findings combined with the uniqueness of these structures suggests that these double membrane vesicles may play an important role in virus replication. the possibility that comes to mind is a role in the synthesis of viral genomic and subgenomic lnrnas since viral rna synthesis in general seems to be membrane associated. the function of membranes and their associated proteins in viral nucleic acid synthesis is not known, but they might supply some organizational component to the replication complex and/or facilitate rna folding as suggested for rna chaperones [ ] . in the case of the arteriviruses, the orf lb replicase protein is not an integrai membrane protein and probably supplies the catalytic rna replicase functions since it possesses helicase, replicase, and zinc binding motifs [ ] . our results as well as those for eav [ ] suggest that the orf lb protein is synthesized in much smaller amounts than the orf la protein which is typical for ribosomal frame shifting [ ] . our results are consistent with the hypothesis that the orf la and lb proteins provide structural and catalytic functions, respectively. ribosomal frame shifting on viral rnas structure and function ofcytochrome c oxidase revision of the taxonomy of the coronavirus, torovirus and arterivirus genera coronavirus ibv glycopolypeptides: locational studies using proteases and saponin, a membrane permeabilizer new variation on the translocation of proteins during early biogenesis of apolipoprotein b structure and function of mammalian facilitated sugar transporters processing and evolution of the n-terminal region of the arterivirus replicase orf a protein: identification of two cysteine proteases equine arteritis virus is not a togavirus but belongs to a coronavirus-like superfamily analysis of membrane and surface protein sequencers with the hydrophobic moment plot disulfide bonds between two envelope proteins of lactate dehydrogenase-elevating virus are essential for viral infectivity the envelope proteins of lactate dehydrogenaseelevating virus and their membrane topography open reading frame of lactate dehydrogenaseelevating virus encodes a soluble, non-structural highly glycosylated and antigenic protein isolation of intracellular membranes by means of sodium carbonate treatment: application to endoplasmic reticulum complete genomic sequence and phylogenetic analysis of the lactate dehydrogenaseelevating virus (ldv) the multidrug transporter, a double-edged sword rna chaperones and the rna folding problems molecular-sieve" chromatography of proteins on columns of cross-linked polyacrylamide the scanning model for translation: an update the high amnity na+/glucose cotransporter sequence ofgenome of lactate dehydrogenase-elevating virus: heterogeneity between strains p and c a former amino terminal signal sequence engineered to an internal location directs translocation of both flanking protein domains lactate dehydrogenase-elevating virus and related viruses lactate dehydrogenase-elevating virus, equine arteritis virus, and simian hemorrhagic fever virus: a new group of positive-strand rna viruses assembly in vitro of a spanning membrane protein of the endoplasmic reticulum: the e glycoprotein of coronavirus mouse hepatitis virus a orf la protein ldv tricine-sodium dodecyl sulfate-polyacrylamide gel electrophoresis for the separation of proteins in the range from to kda proteolytic processing of the replicase orf la protein of equine arteritis virus replication of lactate dehydrogenase-elevating virus in macrophages. . evidence for cytocidal replication electron microscopic studies on the morphogenesis of prrsv in infected cells -comparative studies we thank brent langley for competent secretarial help. during part of the work ksf was supported by usphs training grant ca . received december , key: cord- -fewy y a authors: wang, ming-yang; liang, jing-wei; mohamed olounfeh, kamara; sun, qi; zhao, nan; meng, fan-hao title: a comprehensive in silico method to study the qstr of the aconitine alkaloids for designing novel drugs date: - - journal: molecules doi: . /molecules sha: doc_id: cord_uid: fewy y a a combined in silico method was developed to predict potential protein targets that are involved in cardiotoxicity induced by aconitine alkaloids and to study the quantitative structure–toxicity relationship (qstr) of these compounds. for the prediction research, a protein-protein interaction (ppi) network was built from the extraction of useful information about protein interactions connected with aconitine cardiotoxicity, based on nearly a decade of literature and the string database. the software cytoscape and the pharmmapper server were utilized to screen for essential proteins in the constructed network. the calcium-calmodulin-dependent protein kinase ii alpha (camk a) and gamma (camk g) were identified as potential targets. to obtain a deeper insight on the relationship between the toxicity and the structure of aconitine alkaloids, the present study utilized qsar models built in sybyl software that possess internal robustness and external high predictions. the molecular dynamics simulation carried out here have demonstrated that aconitine alkaloids possess binding stability for the receptor camk g. in conclusion, this comprehensive method will serve as a tool for following a structural modification of the aconitine alkaloids and lead to a better insight into the cardiotoxicity induced by the compounds that have similar structures to its derivatives. the rhizomes and roots of aconitine species, a genus of the family ranunculaceae, are commonly used in treatment for various illnesses such as collapse, syncope, rheumatic fever, joints pain, gastroenteritis, diarrhea, edema, bronchial asthma, and tumors. they are also involved in the management of endocrinal disorders such as irregular menstruation [ , ] . however, the usefulness of this aconitine species component intermingles with toxicity after it is administered to a diseased patient. so far, few articles have recorded the misuse of aconitine medicinals with strong emphasis and thus have referenced that the misuse of this medicinal can result in severe cardio-and neurotoxicity [ ] [ ] [ ] [ ] [ ] . in our past research, it was evidenced that the aconitine component is the main active ingredient in this species' root and rhizome, and is responsible for both therapeutic and toxic effects [ ] . this medicinal has been tested for cancerological and dermatological activities. its application to disease conditions proved to exhibit an activity that slowed down cancer tumor growth and to cure serious cases of dermatosis. it was also found to have an effect on postoperative analgesia [ ] [ ] [ ] [ ] . however, a previous safety study has revealed that aconitine toxicity is responsible for its restriction in clinical settings. further studies are needed to explain the cause of aconitine toxicity as well as to show whether the toxicity supersedes its usefulness. a combined network analysis and in silico study was once performed to obtain insight on the relationship between aconitine alkaloid toxicity and the aconitine structure, and it was found that the cardiotoxicity of aconitine is the primary cause of patient death. the aconitine poison is similar to the poison created by some pivotal proteins such as the ryanodine receptor (ryr and ryr ), the gap junction α- protein (gja ), and the sodium-calcium exchanger (slc a ) [ ] [ ] [ ] [ ] . however, among all existing studies about the aconitine medicinal, no one has reported detail of its specific binding target protein linked to toxicity. protein-protein interactions (ppis) participate in many metabolic processes occurring in living organisms such as the cellular communication, immunological response, and gene expression control [ , ] . a systematic description of these interactions aids in the elucidation of interrelationships among targets. the targeting of ppis with small-molecule compounds is becoming an essential step in a mechanism study [ ] . the present study was designed and undertaken to identify the critical protein that can affect the cardiotoxicity of aconitine alkaloids. a ppi network built by the string database is a physiological contact for the high specificity that has been established for several protein molecules and has stemmed from computational prediction, knowledge transfer between organisms, and interactions aggregated from other databases [ ] . the analysis of the ppi network is based on nodes and edges and is always performed via cluster analysis and centrality measurements [ , ] . in cluster analysis, highly interconnected nodes and protein target nodes are divided and used to form sub-graphs. the reliability of the ppi network is identified by the content of each sub-graph [ ] . the variability in centrality measurements is connected to the quantitative relationship between the protein targets and its weightiness in the network [ ] . hence, ppi networks with protein targets related to aconitine alkaloid cardiotoxicity must enable us to find the most relevant protein for aconitine toxicity and to understand the mechanism at the network level. in our research, the evaluation and visualization analysis of essential proteins related to cardiotoxicity in ppis were performed by the clusterone and cytonca plugins in cytoscape . , designed to find the potential protein targets via combination with conventional integrated pharmacophore matching technology built in the pharmmapper platform. structural modification of a familiar natural product, active compound, or clinical drug is an efficient method for designing a novel drug. the main purpose of the structural modification is to reduce the toxicity of the target compound while enhancing the utility of the drug [ ] . the identification of the structure-function relationship is an essential step in the drug discovery and design, the determination of the d protein structures was the key step in identifying the internal interactions in the ligand-receptor complexes. x-ray crystallography and nmr were the only accepted techniques of determining the d protein structure. although the d structure obtained by these two powerful techniques are accurate and reliable, they are time-consuming and costly [ ] [ ] [ ] [ ] [ ] . with the rapid development of structural bioinformatics and computer-aided drug design (cadd) techniques in the last decade, computational structures are becoming increasingly reliable. the application of structural bioinformatics and cadd techniques can improve the efficiency of this process [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] . the ligand-based quantitative structure-toxicity relationship (qstr) and receptor-based docking technology are regarded as effective and useful tools in analysis of structure-function relationships [ ] [ ] [ ] [ ] . the contour maps around aconitine alkaloids generated by comparative molecular field analysis (comfa) and comparative molecular similarity index analysis (comsia) were combined with the interactions between ligand substituents and amino acids obtained from docking results to gain insight on the relationship between the structure of aconitine alkaloids and their toxicity. scores from functions were used to evaluate the docking result. the value-of-fit score in moe software reflects the binding stability and affinity of the ligand-receptor complexes. when screening for the most potential target for cardiotoxicity, the experimental data was combined with the value-of-fit score by the ndcg (normalized discounted cumulative gain). the possibility of a protein being a target of cardiotoxicity corresponds with the consistency of this experimental data. since the pioneering paper entitled "the biological functions of low-frequency phonons" [ ] was published in , many investigations of biomacromolecules from a dynamic point of view have occurred. these studies have suggested that low-frequency (or terahertz frequency) collective motions do exist in proteins and dna [ ] [ ] [ ] [ ] [ ] . furthermore, many important biological functions in proteins and dna and their dynamic mechanisms, such as cooperative effects [ ] , the intercalation of drugs into dna [ ] , and the assembly of microtubules [ ] , have been revealed by studying the low-frequency internal motions, as summarized in a comprehensive review [ ] . some scientists have even applied this kind of low-frequency internal motion to medical treatments [ , ] . investigation of the internal motion in biomacromolecules and its biological functions is deemed as a "genuinely new frontier in biological physics," as announced in the mission of some biotech companies (see, e.g., vermont photonics). in order to consider the static structural information of the ligand-receptor complex, dynamical information should be also considered in the process of drug discovery [ , ] . finally, molecular dynamics was carried out to verify the binding affinity and stability of aconitine alkaloids and the most potential target. this present study may be instrumental in our future studies for the synergism and attenuation of aconitine alkaloids and for the exploitation of its clinical application potential. a flowchart of procedures in our study is shown in figure . of-fit score by the ndcg (normalized discounted cumulative gain). the possibility of a protein being a target of cardiotoxicity corresponds with the consistency of this experimental data. since the pioneering paper entitled "the biological functions of low-frequency phonons" [ ] was published in , many investigations of biomacromolecules from a dynamic point of view have occurred. these studies have suggested that low-frequency (or terahertz frequency) collective motions do exist in proteins and dna [ ] [ ] [ ] [ ] [ ] . furthermore, many important biological functions in proteins and dna and their dynamic mechanisms, such as cooperative effects [ ] , the intercalation of drugs into dna [ ] , and the assembly of microtubules [ ] , have been revealed by studying the low-frequency internal motions, as summarized in a comprehensive review [ ] . some scientists have even applied this kind of low-frequency internal motion to medical treatments [ , ] . investigation of the internal motion in biomacromolecules and its biological functions is deemed as a "genuinely new frontier in biological physics," as announced in the mission of some biotech companies (see, e.g., vermont photonics). in order to consider the static structural information of the ligand-receptor complex, dynamical information should be also considered in the process of drug discovery [ , ] . finally, molecular dynamics was carried out to verify the binding affinity and stability of aconitine alkaloids and the most potential target. this present study may be instrumental in our future studies for the synergism and attenuation of aconitine alkaloids and for the exploitation of its clinical application potential. a flowchart of procedures in our study is shown in figure . the whole framework of the comprehensive in silico method for screening potential targets and studying the quantitative structure-toxicity relationship (qstr). the compounds were aligned over, under the superimposition of the common moiety and template compound . the statistical parameters for database alignment-q , r , f, and see-were the whole framework of the comprehensive in silico method for screening potential targets and studying the quantitative structure-toxicity relationship (qstr). the compounds were aligned over, under the superimposition of the common moiety and template compound . the statistical parameters for database alignment-q , r , f, and see-were table . the comfa model with the optimal number of components presented a q of . , an r of . , an f of . , and an see of . , and contributions of the steric and electrostatic fields were . and . , respectively. the comsia model with the optimal number of components presented a q of . , an r of . , an f of . , and an see of . , and the contributions of steric, electrostatic, hydrophobic, hydrogen bond acceptor, and hydrogen bond donor fields were . , . , . , . , and . , respectively. the statistical results proved that the aconitine alkaloids qstr model of comfa and comsia under the database alignment have adequate predictability. experimental and predicted pld values of both the training set and test set are shown in figure , and the comfa ( figure a ) and comsia ( figure b ) model gave the correlation coefficient (r ) value of . and . , respectively, which demonstrated the internal robustness and external high prediction of the qstr models. experimental and predicted pld values of both the training set and test set are shown in figure residuals vs. leverage williams plots of the aconitine qstr models are shown in figure a ,b. all values of standardized residuals fall between σ and − σ, and the values of leverage are less than h*, so the two models demonstrate potent extensibility and predictability. residuals vs. leverage williams plots of the aconitine qstr models are shown in figure a ,b. all values of standardized residuals fall between σ and − σ, and the values of leverage are less than h*, so the two models demonstrate potent extensibility and predictability. under mesh (medical subject headings), a total of articles ( articles were received from web of science, and others were received from pubmed) were retrieved. after selecting cardiotoxicity-related and excluding repetitive articles, articles were used to extract the correlative proteins and pathways for building a ppi network in the string server. the correlative proteins or pathways are shown in table . all proteins were taken as input protein in the string database to find its direct and functional partners [ ] , and proteins and its partners were then imported into the cytoscape . to generate the ppi network with nodes and edges ( figure ). potassium voltage-gated channel h scn a sodium voltage-gated channel type , scn a sodium voltage-gated channel type scn a sodium voltage-gated channel type scn a sodium voltage-gated channel type scn a sodium voltage-gated channel type kcnj potassium inwardly-rectifying channel j during the case of screening of the essential proteins in ppi network, three centrality measurements (subgraph centrality, betweenness centrality, and closeness centrality) in cytonca were utilized to evaluate the weight of nodes. after removing the central node "ac," the centrality measurements of nodes were calculated by cytonca and documented in table s . the top % of three centrality measurement values of all node are painted with a different color in figure a . to screen the node with the high values of each three centrality measures, nodes with three colors were overlapped and merged into sub-networks in figure b . under mesh (medical subject headings), a total of articles ( articles were received from web of science, and others were received from pubmed) were retrieved. after selecting cardiotoxicity-related and excluding repetitive articles, articles were used to extract the correlative proteins and pathways for building a ppi network in the string server. the correlative proteins or pathways are shown in table . all proteins were taken as input protein in the string database to find its direct and functional partners [ ] , and proteins and its partners were then imported into the cytoscape . to generate the ppi network with nodes and edges ( figure ). table . proteins related to aconitine alkaloids induced cardiotoxicity extracted from articles. classification frequency ryanodine receptor ryr ryanodine receptor gja gap junction α- protein (connexin ) slc a sodium/calcium exchanger atp a calcium transporting atpase fast twitch kcnh potassium voltage-gated channel h scn a sodium voltage-gated channel type , scn a sodium voltage-gated channel type scn a sodium voltage-gated channel type scn a sodium voltage-gated channel type scn a sodium voltage-gated channel type kcnj potassium inwardly-rectifying channel j during the case of screening of the essential proteins in ppi network, three centrality measurements (subgraph centrality, betweenness centrality, and closeness centrality) in cytonca were utilized to evaluate the weight of nodes. after removing the central node "ac," the centrality measurements of nodes were calculated by cytonca and documented in table s . the top % of three centrality measurement values of all node are painted with a different color in figure a . to screen the node with the high values of each three centrality measures, nodes with three colors were overlapped and merged into sub-networks in figure b . in the sub-networks, the voltage-gated calcium and sodium channel accounted for a large proportion, which is consistent with our research in clustering the network (clusters , , and ). all proteins in the sub-networks will be utilized to predict the results of the pharmmapper server to receive the potential target of cardiotoxicity induced by aconitine alkaloids (in figure a ,b). in the meantime, v o (camk g) and vz (camk a) were identified as the potential targets with higher fit scores. in the sub-networks, the voltage-gated calcium and sodium channel accounted for a large proportion, which is consistent with our research in clustering the network (clusters , , and ). all proteins in the sub-networks will be utilized to predict the results of the pharmmapper server to receive the potential target of cardiotoxicity induced by aconitine alkaloids (in figure a ,b). in the meantime, v o (camk g) and vz (camk a) were identified as the potential targets with higher fit scores. all compounds were docked into three potential targets. the values of ndcg are shown in table . the dock study of three proteins with an ndcg of . and . , respectively (the detailed in the sub-networks, the voltage-gated calcium and sodium channel accounted for a large proportion, which is consistent with our research in clustering the network (clusters , , and ). all proteins in the sub-networks will be utilized to predict the results of the pharmmapper server to receive the potential target of cardiotoxicity induced by aconitine alkaloids (in figure a ,b). in the meantime, v o (camk g) and vz (camk a) were identified as the potential targets with higher fit scores. all compounds were docked into three potential targets. the values of ndcg are shown in table . the dock study of three proteins with an ndcg of . and . , respectively (the detailed all compounds were docked into three potential targets. the values of ndcg are shown in table . the dock study of three proteins with an ndcg of . and . , respectively (the detailed docking result is shown in table s ) proves that the result of the dock study of v o is consistent with the experimental pld , so the protein v o was utilized for the ligand interaction analysis. table . ranking results by experimental and predicted pld and fit score. experimental pld fit score ( v o) fit score ( vz ) ndcg . . the d-qstr contour maps were utilized to visualize the information on the comfa and comsia model properties in three-dimensional space. these maps used characteristics of compounds that are crucial for activity and display the regions around molecules where the variance of activities is expected based on physicochemical property changes in molecules [ ] . the analysis of favorable and unfavorable regions of steric, electrostatic, hydrophobic, hbd, and hba atom fields contribute to the realization of the relationship between the aconitine alkaloid's toxic activity and its structure. steric and electrostatic contour maps of the comfa qstr model are shown in figure a ,b, respectively. hydrophobic, hbd, and hba contour maps of the comsia qstr model are shown in figure c -e. compound has the most toxic activity, so it was chosen as the reference structure for the generation of the comfa and comsia contour maps. in the case of the comfa study, the steric contour map around compound is shown in figure a . the yellow regions near r , r , and r showed the substituents of the molecule, which proved that these positions were not ideal for sterically favorable functional groups. therefore, compounds , , and (with pld values of . , . , and . , respectively), which consist of sterically esterified moieties at positions r and r , were less toxic than compounds and (with pld values of . and . ), which were substituted by a small hydroxyl group, and compound (with a pld value of . ) has less toxic activity due to the esterified moiety in r . the green regions, sterically favorable the comfa electrostatic contour map is shown in figure b . the blue region near the r and r substitution revealed that the replacement of electropositive groups is in favor of toxicity. this can be proven by the fact that the compounds with hydroxy in these two positions had higher pld values than the compound with acetoxy or no substituents. the red region surrounding molecular scaffolds was not distinct, which revealed that there was no connection between the electronegative and the toxicity. the comsia hydrophobic contour map is shown in figure c . the r , r , and r around the white region indicated that the hydrophobic groups were unfavorable for the toxicity, so the esterification of hydrophilic hydroxyl or dehydroxylation decreased the toxicity, which is consistent with the steric and electrostatic contour map. the yellow contour map near the r manifested that the hydrophilic hydroxy was unfavorable to the toxicity, which can be validated by the fact that aconitine alkaloids with hydroxy substituents in r (compound , with a pld the ppi network of aconitine alkaloids cardiotoxicity was divided into nine clusters using clusterone. statistical parameters are shown in figure . six clusters, namely clusters , , , , , and , which possess quality scores higher than . , a density higher than . , and a p-value less than . , were selected for further analysis (in figure ) . clusters , , and consisted of proteins mainly involved in the effects of various calcium, potassium, and sodium channels. cluster mainly the comsia contour map of hbd is shown in figure d . the cyan regions at r , r , and r represented a favorable condition for the hbd atom, which clearly validated the fact that the compounds with hydroxy in this region show potent toxicity. a purple region was found near r , which proved that the hbd atom (hydroxyl) in this region has an adverse effect on toxicity. the hba contour map is shown in figure . the magenta region around r substitution proved that this substitution was favorable to the hba atom, so compounds , , , and with the hba atom in the r substitution exhibit more potent toxicity (with pld values of . , . , . , and . ) than compounds with methoxymethyl substituents (compounds , , and with pld values of . , . , and . ). the red contour map where hba atoms are unfavorable for the toxicity was positioned around r and r . these contours were well validated by the lower pld value of compounds with carbonyl in these substitutions. the ppi network of aconitine alkaloids cardiotoxicity was divided into nine clusters using clusterone. statistical parameters are shown in figure . six clusters, namely clusters , , , , , and , which possess quality scores higher than . , a density higher than . , and a p-value less than . , were selected for further analysis (in figure ) . clusters , , and consisted of proteins mainly involved in the effects of various calcium, potassium, and sodium channels. cluster mainly consisted of three channel types related to the cardiotoxicity of aconitine alkaloids, cluster contained calcium and sodium channels and some channel exchangers (such as ryr and ryr ), and cluster mainly consisted of various potassium channels. all of these findings are consistent with previous research about the arrhythmogenic properties of the toxicity of aconitine alkaloids: the aconitine binds to ion channels and affects their open state, and thus the corresponding ion influx into the cytosol [ ] [ ] [ ] . the channel exchangers play a crucial role in keeping the ion transportation and homeostasis inside and outside of the cell. cluster contained some regulatory proteins that can activate or repress the ion channels through the protein expression level. atp a , ryr , ryr , cacna c, cacna d, and cacna s mediate the release of calcium, thereby playing a key role in triggering cardiac muscle contraction and maintaining the calcium homeostasis [ , ] . aconitine may cause aberrant channel activation and lead to cardiac arrhythmia. clusters and consisted of camp-dependent protein kinase (capk), cgmp-dependent protein kinase (cgpk), and guanine nucleotide binding protein (g protein). they have not been fully studied to prove whether the cardiotoxicity induced by aconitine alkaloids is linked to the capk, cgpk, and g proteins; however, some studies have shown that cardiotoxicity-related protein kcnj (potassium inwardly-rectifying channel) is controlled by g proteins and the cardiac sodium/calcium exchanger and is said to be regulated by capk and cgpk [ , ] . the result of clusterone indicated that the constructed network is consistent with existing studies and that the network can be used to screen essential proteins in the cytonca plugin. the protein v o belonging to the camkii (calcium/calmodulin (ca + /cam)-dependent serine/threonine kinases ii) isozyme protein family plays a central role in cellular signaling by transmitting ca + signals. the camkii enzymes transmit calcium ion (ca + ) signals released inside the cell by regulating signal transduction pathways through phosphorylation. ca + first binds to the small regulatory protein cam, and this ca + /cam complex then binds to and activates the kinase, which then phosphorylates other proteins such as ryanodine receptor and sodium/calcium exchanger. thus, these proteins are related to the cardiotoxicity induced by aconitine alkaloids [ ] [ ] [ ] . the excessive activity of camkii has been observed in some structural heart disease and arrhythmias [ ] , and past findings demonstrate neuroprotection in neuronal cultures treated with inhibitors of camkii immediately prior to excitotoxic activation of the camkii [ ] . the acute cardiotoxicity of the aconitine alkaloids is possibly related to this target. based on the analysis of the ppi network above, camkii was selected as the potential target for further molecular docking and dynamic simulation. the dock result of v o is shown in figure a . compound has the highest fit scores, so it was selected as the template for conformational analysis. the mechanisms of camkii activation and inactivation are shown in figure b . compound affects the normal energy metabolism of the myocardial cell via binding in the atp-competitive site in figure c . the inactive state of the camkii was regulated by cask-mediated t /t phosphorylation, and this state can be inhibited by the binding of compound in the atp-competitive site. such binding moves camkii toward a ca + /cam-dependent activation active state and a ca + /cam-dependent activation through structural rearrangement of the inhibitory helix caused by ca + /cam binding and the subsequent autophosphorylation of t [ ] , which will induce the excessive activity of camkii and dynamic imbalance of the calcium ions in the myocardial cell, eventually leading to heart disease and arrhythmias. molecules , , x for peer review of channel) is controlled by g proteins and the cardiac sodium/calcium exchanger and is said to be regulated by capk and cgpk [ , ] . the result of clusterone indicated that the constructed network is consistent with existing studies and that the network can be used to screen essential proteins in the cytonca plugin. the protein v o belonging to the camkii (calcium/calmodulin (ca + /cam)-dependent serine/threonine kinases ii) isozyme protein family plays a central role in cellular signaling by transmitting ca + signals. the camkii enzymes transmit calcium ion (ca + ) signals released inside the cell by regulating signal transduction pathways through phosphorylation. ca + first binds to the small regulatory protein cam, and this ca + /cam complex then binds to and activates the kinase, which then phosphorylates other proteins such as ryanodine receptor and sodium/calcium the information of a binding pocket of a receptor for its ligand is very important for drug design, particularly for conducting mutagenesis studies [ ] . as has been reported in the past [ ] , the binding pocket of a protein receptor to a ligand is usually defined by those residues that have at least one heavy atom within a distance of Å from a heavy atom of the ligand. such a criterion was originally used to define the binding pocket of atp in the cdk -nck a complex [ ] , which was later proved to be very useful in identifying functional domains and stimulating the relevant truncation experiments. a similar approach has also been used to define the binding pockets of many other receptor-ligand interactions important for drug design [ , , , [ ] [ ] [ ] [ ] . the information of a binding pocket of camkii for the aconitine alkaloids will serve as a guideline for designing drugs with similar scaffolds, particularly for conducting mutagenesis studies. in figure a , four top fit scores-compounds , , , and -generated similar significant interactions with amino acid residues around the atp-competitive binding pocket. four compounds formed with many van der waals interactions within the noncompetitive inhibitor pocket through amino acid residues such as asp , lys , glu , lys , and leu . the ligand-receptor interaction showed that the hydroxy in r formed a side chain donor interaction with asp . in addition, the hydroxy in r and r also formed a side chain acceptor interaction with glu and ser , respectively (the docking result of compounds and in figure a ). these results correspond to the comfa and comsia contour maps. however, the small electropositive and hydrophilic group in r , r , and r possess a certain enhancement function to toxicity. there were aromatic interactions between the phenyl group in r and amino acid residues. the phenyl group in r formed aromatic interactions with leu , leu , and phe , while the small group hydroxyl did not form any interaction with asp , which demonstrate that bulky phenyl group is crucial to this binding pattern and toxicity. this was mainly equal to the comfa steric contour map, where r was ideal for sterically favorable groups. the methoxymethyl r generated backbone acceptor with lys , which correspond to the comsia hba contour map, where r was favorable for the hba atom. compound docked into v o, and the atp-competitive pocket was painted green; the t , t , and t phosphorylation sites were painted green, orange, and yellow, respectively; the inhibitory helix was painted red. the result of md simulation is shown in figure . the red plot represented the rmsd values of the docked protein. the values of rmsd reached . Å in . ns and then remained between and . Å throughout the simulation for up to ns. the averaged value of the rmsd was . Å. the md simulation demonstrated that the ligand was stabilized in the active site. finally, we combined the ligand-based d-qstr analysis with the structure-based molecular docking study to identify the necessary moiety related to the cardiotoxicity mechanism of the aconitine alkaloids (in figure ). finally, we combined the ligand-based d-qstr analysis with the structure-based molecular docking study to identify the necessary moiety related to the cardiotoxicity mechanism of the aconitine alkaloids (in figure ). to build the ppi network of aconitine alkaloids, literature from january to february was retrieved from pubmed (http://pubmed.cn/) and web of science (http://www.isiknowledge.com/) with the mesh word "aconitine" and "toxicity" and without language restriction. all documents about cardiotoxicity caused by aconitine alkaloids were collected. the proteins related to the aconitine alkaloids cardiotoxicity of this decade were gathered and taken as the input protein in the string (https://string-db.org/) database [ , ] , used to search for related proteins or pathways that had been reported. finally, all the proteins and its partners were recorded in excel in order to import information and build a ppi network in cytoscape software. cytoscape is a free, open-source, java application for visualizing molecular networks and integrating them with gene expression profiles [ , ] . plugins are available for network and molecular profiling analyses, new layouts, additional file format support, making connections with figure . crucial requirement of cardiotoxicity mechanism was obtained from the ligand-based d-qstr and structure-based molecular docking study. to build the ppi network of aconitine alkaloids, literature from january to february was retrieved from pubmed (http://pubmed.cn/) and web of science (http://www.isiknowledge.com/) with the mesh word "aconitine" and "toxicity" and without language restriction. all documents about cardiotoxicity caused by aconitine alkaloids were collected. the proteins related to the aconitine alkaloids cardiotoxicity of this decade were gathered and taken as the input protein in the string (https://string-db.org/) database [ , ] , used to search for related proteins or pathways that had been reported. finally, all the proteins and its partners were recorded in excel in order to import information and build a ppi network in cytoscape software. cytoscape is a free, open-source, java application for visualizing molecular networks and integrating them with gene expression profiles [ , ] . plugins are available for network and molecular profiling analyses, new layouts, additional file format support, making connections with databases, and searching within large networks [ ] . clusterone (clustering with overlapping neighborhood expansion) of cytoscape was utilized to cluster the ppi network into overlapping sub-graphs of highly interconnected nodes. clusterone is a plugin for detecting and clustering potentially overlapping protein complexes from ppi data. the quality of a group was assessed by the number of sub-graphs, p-values, and density. the cluster was discarded when the number of sub-graphs was smaller than , the density was less than . , the quality was less than . , and the p-value was under . [ ] . the clustering results of the clusterone are instrumental to understanding how the reliability of the ppi network relates to aconitine alkaloids' cardiotoxicity. cytonca is a plugin in cytoscape integrating calculation, evaluation, and visualization analysis for multiple centrality measures. there are eight centrality measurements provided by cytonca: betweenness, closeness, degree, eigenvector, local average connectivity-based, network, subgraph, and information centrality [ ] . the primary purpose of the centrality analysis was to confirm the essential proteins in the pre-built ppi network. the three centrality measurements in the cytonca plugin-subgraph centrality, betweenness centrality, and closeness centrality-were used for evaluating and screening the essential protein in the merged target network. the subgraph centrality characterizes the participation of each node in all subgraphs in a network. smaller subgraphs are given more weight than larger ones, which makes this measurement an appropriate one for characterizing network properties. the subgraph centrality of node "u" can be calculated by [ ] µ l (u) is the uth diagonal entry of the lth power of the weight adjacency matrix of the network. v , v , . . . , v n is be an orthonormal basis composed of r n composed by eigenvectors of a associated to the eigenvalues λ , λ , . . . , λ n v u v , which is the uth component of v v [ ] . the betweenness centrality finds a wide range of applications in network theory. it represents the degree to which nodes stand between each other. betweenness centrality was devised as a general measure of centrality. it is applicable to a wide range of problems in network theory, including problems related to social networks, biology, transport, and scientific cooperation. the betweenness centrality of a node u can be calculated by [ ] ρ (s, t) is the total number of shortest paths from node s to node ρ (s, u, t), which is the number of those paths that pass through u. closeness centrality of a node is a measure of centrality in a network, calculated as the sum of the length of the shortest paths between the node and all other nodes in the graph. thus, the more central a node is, the closer it is to all other nodes. the closeness centrality of a node u can be calculated by [ ] |nu| is the number of node u's neighbors, and dist (u, v) is the distance of the shortest path from node u to node v. pharmmapper serves as a valuable tool for identifying potential targets for a novel synthetic compound, a newly isolated natural product, a compound with known biological activity, or an existing drug [ ] . of all the aconitine alkaloids in this research, compounds , , and exhibited the most toxic activity and were used for the potential target prediction. the mol format of three compounds was submitted to the pharmmapper server. the parameters of generate conformers and maximum generated conformations was set as on and , respectively. other parameters used default values. finally, the result of the clusterone and pharmmapper will be combined together to select the potential targets for the following docking study [ ] . comparative molecular field analysis (comfa) and comparative molecular similarity index analysis (comsia) are efficient tools in ligand-based drug design and are in use for contour map generation and identification of favorable and unfavorable regions in a moiety [ , ] . the comfa consists of a steric and electrostatic contour map of molecules that are correlated with toxic activity, while the comsia consists of hydrophobic field, hydrogen bond donor (hbd)/hydrogen bond acceptor (hba) [ ] , and steric/electrostatic fields that are correlated with toxic activity. the comfa and comsia have been utilized to generate a d-qstr model [ ] . all molecule models and the generation of d-qstr were performed with sybyl x . . alkaloids in mice with ld values listed in table were extracted from recent literature [ ] . the ld values of all aconitine alkaloids were converted into pld with a standard tripos force field. these pld values were used as a dependent variable, while comfa and comsia descriptors were used as an independent variable. the sketch function of sybyl x . was utilized to illustrate structure and charges, and was calculated by the gasteiger-huckel method. additionally, the tripose force field was utilized for energy minimization of these aconitine alkaloid molecules [ ] . the molecules were divided into a ratio of : . the division was done in a way that showed that both datasets are balanced and consist of both active and less active molecules [ ] . the reliability of the d-qstr model depends on the database molecular alignment. the most toxic aconitine alkaloids (compound ) was selected as the template molecule, and the tetradecahydro- h- , , -(epiethane [ , , ] triyl)- , -methanonaphtho [ , -b] azocine was selected as the common moiety. pls (partial least squares) techniques are associated with field descriptors with activity values such as [ ] leave one out (loo) values, the optimal number of components, the standard error of estimation (see), cross-validated coefficients (q ), and the conventional coefficient (r ). these statistical data are pivotal in the evaluation of the d-qstr model and can be worked out in the pls method [ ] . the model is said to be good when the q value is more than . and the r value is more than . . the q and r values reflect a model's soundness. the best model has the highest q and r values, the lowest see, and an optimal number of components [ , , ] . in the case of comfa and comsia analysis, the values of the optimal number of components, see, and q can be worked out by loo validation, with use sampls turned on and components set to , while in the process of calculating r , the use sampls was turned off and the column filtration was set to . kcal mol − in order to speed up the calculation without the need to sacrifice information content [ ] [ ] [ ] [ ] . therefore, components were set to and , respectively, which were optimal numbers of components calculated by performing a sampls run. see and r were utilized to assess the non-cross validated model. the applicability domain (ad) of the topomer comfa and comsia model was confirmed by the williams plot of residuals vs. leverage. leverage of a query chemical is proportional to its mahalanobis distance measure from the centroid of the training set [ , ] . the leverages are calculated for a given dataset x by obtaining the leverage matrix (h) with the equation below: x is the model matrix, while xt is its transpose matrix. the plot of standardized residuals vs. leverage values was drawn, and compounds with standardized residuals greater than three standard deviation units (± σ) were considered as outliers [ ] . the critical leverage value is considered p/n, where p is the number of model variables plus one, and n is the number of objects used to calculate the model. h > p/n mean predicted response is not acceptable [ ] [ ] [ ] . (cadd) software program that incorporates the functions of qsar, molecular docking, molecular dynamics, adme (absorption, distribution, metabolism, and excretion), and homologous modeling. all of these functions are regarded as conducive instruments in the field of drug discovery and biochemistry. the molecular docking and dynamics technology were performed in moe software to detect the stability and affinity between the ligands and predictive targets [ , ] . the docking process involves the prediction of ligand conformation and orientation within a targeted binding site. docking analysis is an important step in the docking process. it has been widely used to study the reasonable binding mode and obtain information of interactions between amino acids in active protein sites and ligands. the molecular docking analysis was carried out to determine the toxicity-related moiety of aconitine alkaloids through the ligand-amino-acid interaction function in moe . the pdb format of v o and vz was downloaded from the pdb (protein data bank) database (https://www.rcsb.org/), and the mol format of compounds was from the sybyl software of qstr research. the structure preparation function in moe software will be carried out to minimize the energy and optimize the structure of the protein skeleton. based on the london dg score and induced fit refinement, all compounds will be docked into the active site of every potential target by taking score values as the scoring function [ ] . the dcg (discounted cumulative gain) algorithm was utilized to examine the consistency between the ranking result of pld and our research (fit scores of dock study). they rely on the formula that refers to pld . the idcg (ideal dcg) refers to the ordered pld values. the closer the normalized discounted cumulative gain (ndcg) value is to , the better the consistency [ ] . preliminary md simulations for the model protein were performed using the program namd (nanoscale molecular dynamics program, v . ), and all files were generated using visual molecular dynamics (vmd). namd is a freely available software designed for high-performance simulation of large biomolecular systems [ ] . during the md simulation, minimization and equilibration of original and docked proteins occurred in a Å size water box. a charmm force field file was applied for energy minimization and equilibration with gasteiger-huckel charges using boltzmann initial velocity [ , ] . integrator parameters also included fs/step for all rigid bonds and nonbonded frequencies were selected for Å and full electrostatic evaluations for Å were used with steps for each cycle [ ] . the particle mesh ewald method was used for electrostatic interactions of the simulation system periodic boundary conditions with grid dimensions of . Å [ ] . the pressure was maintained at . kpa using the langevin piston and the temperature was controlled at k using langevin dynamics. covalent interactions between hydrogen and heavy atoms were constrained using the shake/rattle algorithm. finally, ns md simulations for original and docked protein were carried out for comparing and verifying the binding affinity and stability of the ligand-receptor complex. the method combining network analysis and the in silico method was carried out to illustrate the qstr and toxic mechanisms of aconitine alkaloids. the d-qstr was built in sybyl with internal robustness and external high prediction, enabling identification of pivotal molecule moieties related to toxicity in aconitine alkaloids. the comfa model had q , r , optimum component, and correlation coefficient (r ) values of . , . , , and . , respectively, and the comsia model had q , r , optimum component, and correlation coefficient (r ) values of . , . , , and . . the network was built with cytoscape software and the string database, which demonstrated the reliability of cluster analysis. the v o and vz proteins were identified as potential targets with the cytonca plugin with pharmmapper server for interactions between the aconitine alkaloids and key amino acids in the dock study. the result of the dock study demonstrates the consistency of the experimental pld . the md simulation indicated that aconitine alkaloids exhibit potent binding affinity and stability to the receptor camk g. finally, we incorporate pivotal molecule moieties and ligand-receptor interactions to realize the qstr of aconitine alkaloids. this research serves as a guideline for studies of toxicity, including neuro-, reproductive, and embryo-toxicity. with a deep understanding of the relationship between toxicity and structure of aconitine alkaloids, subsequent structural modification of aconitine alkaloids can be carried out to enhance their efficacy and to reduce their toxic side effects. based on such research, aconitine alkaloids can bring us closer to medical and clinical applications. in addition, as pointed out in past research [ ] , user-friendly and publicly accessible web servers represent the future direction of reporting various important computational analyses and findings [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] . they have significantly enhanced the impacts of computational biology on medical science [ , ] . the research in this paper will serve as a foundation for constructing web servers for qstr studies and target identifications of compounds. immunomodulating agents of plant origin. i: preliminary screening chinese drugs plant origin aconitine poisoning: a global perspective ventricular tachycardia after ingestion of ayurveda herbal antidiarrheal medication containing aconitum fatal accidental aconitine poisoning following ingestion of chinese herbal medicine: a report of two cases five cases of aconite poisoning: toxicokinetics of aconitines a case of fatal aconitine poisoning by monkshood ingestion determination of aconitine and hypaconitine in gucixiaotong ye by capillary electrophoresis with field-amplified sample injection a clinical study in epidural injection with lappaconitine compound for post-operative analgesia therapeutic effects of il- combined with benzoylmesaconine, a non-toxic aconitine-hydrolysate, against herpes simplex virus type infection in mice following thermal injury aconitine: a potential novel treatment for systemic lupus erythematosus aconitine-containing agent enhances antitumor activity of dichloroacetate against ehrlich carcinoma complex discovery from weighted ppi networks prediction and analysis of the protein interactome in pseudomonas aeruginosa to enable network-based drug target selection the string database in : quality-controlled protein-protein association networks, made broadly accessible identification of functional modules in a ppi network by clique percolation clustering united complex centrality for identification of essential proteins from ppi networks the ppi network and cluster one analysis to explain the mechanism of bladder cancer the progress of novel drug delivery systems mitochondrial uncoupling protein structure determined by nmr molecular fragment searching structural basis for membrane anchoring of hiv- envelope spike unusual architecture of the p channel from hepatitis c virus architecture of the mitochondrial calcium uniporter structure and mechanism of the m proton channel of influenza a virus computer-aided drug design using sesquiterpene lactones as sources of new structures with potential activity against infectious neglected diseases successful in silico discovery of novel nonsteroidal ligands for human sex hormone binding globulin in silico discovery of novel ligands for antimicrobial lipopeptides for computer-aided drug design structural bioinformatics and its impact to biomedical science coupling interaction between thromboxane a receptor and alpha- subunit of guanine nucleotide-binding protein prediction of the tertiary structure and substrate binding site of caspase- study of drug resistance of chicken influenza a virus (h n ) from homology-modeled d structures of neuraminidases insights from investigating the interaction of oseltamivir (tamiflu)with neuraminidase of the h n swine flu virus prediction of the tertiary structure of a caspase- /inhibitor complex design novel dual agonists for treating type- diabetes by targeting peroxisome proliferator-activated receptors with core hopping approach heuristic molecular lipophilicity potential (hmlp): a d-qsar study to ladh of molecular family pyrazole and derivatives fragment-based quantitative structure & ndash; activity relationship (fb-qsar) for fragment-based drug design investigation into adamantane-based m inhibitors with fb-qsar hp-lattice qsar for dynein proteins: experimental proteomics ( d-electrophoresis, mass spectrometry) and theoretic study of a leishmania infantum sequence the biological functions of low-frequency phonons: . cooperative effects low-frequency collective motion in biomacromolecules and its biological functions quasi-continuum models of twist-like and accordion-like low-frequency motions in dna collective motion in dna and its role in drug intercalation biophysical aspects of neutron scattering from vibrational modes of proteins biological functions of soliton and extra electron motion in dna structure low-frequency resonance and cooperativity of hemoglobin solitary wave dynamics as a mechanism for explaining the internal motion during microtubule growth designed electromagnetic pulsed therapy: clinical applications steps to the clinic with elf emf molecular dynamics study of the connection between flap closing and binding of fullerene-based inhibitors of the hiv- protease molecular dynamics studies on the interactions of ptp b with inhibitors: from the first phosphate-binding site to the second one the cambridge structural database: a quarter of a million crystal structures and rising molecular similarity indices in a comparative analysis (comsia) of drug molecules to correlate and predict their biological activity single channel analysis of aconitine blockade of calcium channels in rat myocardiocytes conversion of the sodium channel activator aconitine into a potent alpha -selective nicotinic ligand aconitine blocks herg and kv . potassium channels inactivation of ca + release channels (ryanodine receptors ryr and ryr ) with rapid steps in [ca + ] and voltage targeted disruption of the atp a gene encoding the sarco(endo)plasmic reticulum ca + atpase isoform (serca ) impairs diaphragm function and is lethal in neonatal mice cyclic gmp-dependent protein kinase activity in rat pulmonary microvascular endothelial cells different g proteins mediate somatostatin-induced inward rectifier k + currents in murine brain and endocrine cells cardiac myocyte calcium transport in phospholamban knockout mouse: relaxation and endogenous camkii effects inhibition of camkii phosphorylation of ryr prevents induction of atrial fibrillation in fkbp . knock-out mice regulation of ca + and electrical alternans in cardiac myocytes: role of camkii and repolarizing currents the role of calmodulin kinase ii in myocardial physiology and disease excitotoxic neuroprotection and vulnerability with camkii inhibition structure of the camkiiδ/calmodulin complex reveals the molecular mechanism of camkii kinase activation a model of the complex between cyclin-dependent kinase and the activation domain of neuronal cdk activator binding mechanism of coronavirus main proteinase with ligands and its implication to drug design against sars an in-depth analysis of the biological functional studies based on the nmr m channel structure of influenza a virus molecular therapeutic target for type- diabetes novel inhibitor design for hemagglutinin against h n influenza virus by core hopping method the string database in : functional interaction networks of proteins, globally integrated and scored cytoscape: a software environment for integrated models of biomolecular interaction networks detecting overlapping protein complexes in protein-protein interaction networks cytonca: a cytoscape plugin for centrality analysis and evaluation of protein interaction networks subgraph centrality and clustering in complex hyper-networks ranking closeness centrality for large-scale social networks enhancing the enrichment of pharmacophore-based target prediction for the polypharmacological profiles of drugs comparative molecular field analysis (comfa). . effect of shape on binding of steroids to carrier proteins sample-distance partial least squares: pls optimized for many variables, with application to comfa a qsar analysis of toxicity of aconitum alkaloids recent advances in qsar and their applications in predicting the activities of chemical molecules, peptides and proteins for drug design unified qsar approach to antimicrobials. . multi-target qsar modeling and comparative multi-distance study of the giant components of antiviral drug-drug complex networks comfa qsar models of camptothecin analogues based on the distinctive sar features of combined abc, cd and e ring substitutions applicability domain for qsar models: where theory meets reality comparison of different approaches to define the applicability domain of qsar models molecular docking and qsar analysis of naphthyridone derivatives as atad bromodomain inhibitors: application of comfa, ls-svm, and rbf neural network concise applications of molecular modeling software-moe medicinal chemistry and the molecular operating environment (moe): application of qsar and molecular docking to drug discovery qsar models of cytochrome p enzyme a inhibitors using comfa, comsia and hqsar estimating a ranked list of human hereditary diseases for clinical phenotypes by using weighted bipartite network biomolecular simulation on thousands processors molecular dynamics and docking investigations of several zoanthamine-type marine alkaloids as matrix metaloproteinase- inhibitors salts influence cathechins and flavonoids encapsulation in liposomes: a molecular dynamics investigation review: recent advances in developing web-servers for predicting protein attributes irna-ai: identifying the adenosine to inosine editing sites in rna sequences iss-psednc: identifying splicing sites using pseudo dinucleotide composition irna-pseu: identifying rna pseudouridine sites ploc-mplant: predict subcellular localization of multi-location plant proteins by incorporating the optimal go information into general pseaac ploc-mhum: predict subcellular localization of multi-location human proteins via general pseaac to winnow out the crucial go information iatc-misf: a multi-label classifier for predicting the classes of anatomical therapeutic chemicals psuc-lys: predict lysine succinylation sites in proteins with pseaac and ensemble random forest approach irnam c-psednc: identifying rna -methylcytosine sites by incorporating physical-chemical properties into pseudo dinucleotide composition ikcr-pseens: identify lysine crotonylation sites in histone proteins with pseudo components and ensemble classifier iacp: a sequence-based tool for identifying anticancer peptides ploc-meuk: predict subcellular localization of multi-label eukaryotic proteins by extracting the key go information into general pseaac iatc-mhyb: a hybrid multi-label classifier for predicting the classification of anatomical therapeutic chemicals ihsp-pseraaac: identifying the heat shock protein families using pseudo reduced amino acid alphabet composition irna-psecoll: identifying the occurrence sites of different rna modifications by incorporating collective effects of nucleotides into pseknc impacts of bioinformatics to medicinal chemistry an unprecedented revolution in medicinal chemistry driven by the progress of biological science this article is an open access article distributed under the terms and conditions of the creative commons attribution (cc by) license key: cord- - lnpujip authors: anthonsen, henrik w.; baptista, antónio; drabløs, finn; martel, paulo; petersen, steffen b. title: the blind watchmaker and rational protein engineering date: - - journal: j biotechnol doi: . / - ( ) -x sha: doc_id: cord_uid: lnpujip in the present review some scientific areas of key importance for protein engineering are discussed, such as problems involved in deducting protein sequence from dna sequence (due to posttranscriptional editing, splicing and posttranslational modifications), modelling of protein structures by homology, nmr of large proteins (including probing the molecular surface with relaxation agents), simulation of protein structures by molecular dynamics and simulation of electrostatic effects in proteins (including ph-dependent effects). it is argued that all of these areas could be of key importance in most protein engineering projects, because they give access to increased and often unique information. in the last part of the review some potential areas for future applications of protein engineering approaches are discussed, such as non-conventional media, de novo design and nanotechnology. nature has evolved using several types of random mutations in the genetic material as a fundamental mechanism, thereby creating new versions of existing proteins. by natural selection nature has given a preference to organisms with proteins which directly or indirectly made them better adapted to their environment. thus nature works like a blind watchmaker, trying out an endless number of combinations. this may seem to be an inefficient approach by industrial standards, but nevertheless nature has been able to develop some highly complex and sophisticated designs, simply by the power of natural selection over millions of years, occurring in a large number of parallel processes. by virtue of reproduction several copies of each organism have been able to test the effect of different mutations in parallel. it is quite probable that the mutation frequency was higher in ancient species (doolittle, ) , although it is still possible to find highly mutable loci in genes involved in adaptation to the environment (moxon et al., ) . enzymes have been used by man for thousands of years for modification of biological molecules. the use of rennin (chymosin) in rennet for cheese production is a relevant example. and with increased knowledge about proteins, genes and other biological macromolecules scientists started starting with a protein with known sequence and properties, we make a -d model of the protein from experimental structure data or by homology. by modelling and simulation we identify mutations that will modify selected properties of the protein (the design part of the process), these mutations are implemented at the dna level and expressed in a suitable organism (the production part of the process), and the success of the design is verified by experimental methods. to look at methods for making modified proteins with new or improved properties. at first this was done by speeding up nature's own approach, by increasing the number of mutations (e.g., by using chemicals or radiation) and by using a very strong selection based on tests for specific properties. with the introduction of new and powerful techniques for structure determination and site directed mutagenesis, it is now possible to do rational protein modification. rather than testing out a large number of random mutations, it has become feasible to identify key residues within the protein structure, to predict the effect of changing these residues, to implement these changes in the genetic material, and finally to produce large amounts of modified proteins. this is protein engineering. there are several reviews describing the fundamental ideas in protein engineering, see fersht and winter ( ) for a recent one. the basic protein engineering process is shown in fig. (see also petersen and martel ( ) ). in most cases it starts out with an unmodified protein with well-characterised properties. for some reason we want to modify this protein. in the case of an enzyme we may want to make it more stable, alter the specificity or increase the catalytic activity. first we enter the design part of the protein engineering process. based on structural data we create a computer model of the protein. by a combination of molecular modelling and experimental methods the correlation between relevant properties and structural features is established, and changes affecting these properties can be identified and evaluated for implementation. in more and more cases the effect of these changes can be simulated, and the modifications can be optimised with respect to these simulations. as soon as a new design has been es~tablished we may enter the production part of the process. the necessary mutations must be implemented in the genetic material, this genetic material is introduced into a production organism, and the resulting modified protein can (in most cases) be extracted from a bioprocess. this protein can be tested with respect to relevant properties, and if necessary it may be used as a basis for re-entering the design part of the protein engineering process. after a few iterations we may reach an optimal design. there are several examples of successful protein engineering projects. protein engineering may be used to improve protein stability (kaarsholm et al., ) , enhance or modify specificity (getzoff et al., ; witkowski et al., ) , adapt proteins to new environments (arnold, ; gupta, ) , or to engineer novel regulation into enzymes (higaki et al., ) . in some cases even de novo design of new proteins may be relevant, using knowledge gained from existing structures (kamtekar et al., ; shakhnovich and gutin, ; ghadiri et al., ; ball, ) . in a truly multidisciplinary project chymosin mutants with optimal activity at increased ph values compared to wild-type chymosin was designed and produced (pitts et al., ) . point mutations changing the charge distribution of superoxide dismutase have been used to increase reaction rate by improved electrostatic guidance (getzoff et al., ) . a project on converting trypsin into chymotrypsin has been important for understanding the role of chymotrypsin surface loops (hedstrom et al., ) , a serine active site hydrolase has been converted into a transferase by point mutations (witkowski et al., ) , and mutations in insulin aiming at increased folding stability have given an insulin with enhanced biological activity (kaarsholm et al., ) . an example of a rational de novo project (as opposed to the random approach used, e.g., in generation of catalytic antibodies) is the design of an enzymatic peptide catalysing the decarboxylation of oxaloacetate via an imine intermediate, in which a very simple design gave a three to four orders of magnitude faster formation of imine compared to simple amine catalysts . in some cases it may also be an interesting approach to incorporate nonpeptidic residues into otherwise normal proteins (baca et al., ) , or to build de novo proteins by assembling peptidic building blocks on to a nonpeptidic template (tuchscherer et al., ) . it has been shown that incorporation of nonpeptidic residues into e-turns of hiv- protease gives a more stable enzyme (baca et al., ) . the main problem with this approach is how to incorporate the non-standard residues. in the hiv- protease case solid-phase peptide synthesis combined with traditional organic synthesis was used, others have suggested that the degeneracy of the genetic code may be used to incorporate novel residues via the standard protein synthesis machinery of the cell (fahy, ) . in the present review we will look at the design part of the protein engineering process, with emphasis on some of the more difficult steps, especially homology based modelling in cases with very low sequence similarity, nuclear magnetic resonance (nmr) of very large proteins and modelling of electrostatic interactions. in the last part of the review we will discuss some possible future directions for protein engineering and protein design. any protein engineering project is based on information about the protein sequence. this information may stem from either direct protein sequencing or a deducted translation of the dna/rna sequence. the amount of information on protein and nucleic acid sequences, as well as on relevant data like -d structures and disease-related mutations, is growing at a very rapid pace, and novel databases and computer tools give increased access to these data (coulson, ) . it is very reasonable to expect that projects like the human genome project will succeed in providing us with sequence information about every single gene in our chromosomes within the next decade. this information will be after transcription, the mrna may be edited, a process that now has been reported in man, plants and primitive organisms (trypanosoma brucei). the mrna is then translated into a protein sequence. this protein sequence can subsequently be modified, leading to n-or o-glycosylation, phosphorylation, sulfation or the covalent attachment of fatty acid moieties to the protein. at this stage the protein is ready for transport to its final destination -which may be right where it is at the time of synthesis, but the destination may also be extracellular or in secluded compartments such as the mitochondria or lysosomes. in this case the protein is equipped with a signal sequence. after arrival to its destination the protein is processed, often involving proteolytic cleavage of the signal sequence. shorter routes to the functional protein with fewer steps undoubtedly exist as well as routes with interchanged steps of processing. finally, the catabolism of the protein is also part of the process, but has been left out in the figure. of key importance for our understanding of the biology, development and evolution of man. it should, however, be kept in mind that the sequence itself may give us little information about regulation of gene expression, i.e., under what conditions genes are expressed, if they are expressed at all. most protein sequences have been deducted from gene sequences. it is in most cases a priori assumed that a trivial mapping exists between the two sets of information. however, this may not necessarily be the case. in fig. , the various steps currently recognised as being of importance for the production of the mature enzyme are shown, and several of these steps may affect the mapping from gene to protein. posttranscriptional editing is modifications at the mrna level affecting the mapping of information from gene to protein, often involving modification, insertion or deletion of individual nucleotides at specific positions (cattaneo, ) . currently only speculative models exist for the underlying molecular mechanism(s) for posttranscriptional editing. in the case of mammalian apolipoprotein b two forms exist, both originating from a single gene. the shorter form, apo b , arises by a posttranscriptional mrna editing whereby cytidine deamination produces an uaa termination codon (teng et al., ) . in the ampa receptor subunit glur-b mrna editing is responsible for changing a glutamine codon (cag) into a arginine codon (cgg) (higuchi et al., ) . this editing has a pronounced effect on the ca + permeability of the ampa receptor channel, and it seems to be controlled by the intron-exon structure of the rna. similar mrna editing has been reported in the related kainate receptor subunits giur- and giur- , where two additional codons in the first trans-membrane region are altered (sommer et al., ; k hler et al., ) . it is also interesting in this context that certain human genetic diseases have been related to reiteration of the codon cag (green, ) . mrna editing in plant mitochondria and chloroplasts has also been reported (gray and covello, ) . here the posttranscriptional mrna editing consists almost exclusively of c to u substitutions. editing occurs predominantly inside coding regions, mostly at isolated c residues, and usually at the first or second position of the codons, thus almost always changing the amino acid compared to that specified by the unedited codon. in trypanosoma brucei some extensive and well-documented posttranscriptional cases of editing have been reported (read et al., ; harris et al., ; adler et al., ) . the editing takes place at the mitochondrial transcript level where a large number of uridine nucleic acid bases are added or deleted from the mrna, which then subsequently is translated. several non-editing processes affecting the transcription/translation steps are also known. although the ribosomes in an almost perfect manner translate the message provided by the mrna (with error rates less than x - per amino acid incorporated), it appears as if the mrna in certain cases contain information, that forces the ribosome to read the nucleic acid information in a non-canonical fashion (farabaugh, ) . a special case, that may deserve some attention as well, is the seleno proteins, were seleno cystein is introduced into the protein by an alternative interpretation of selected codons (b ck et al., ; yoshimura et al., ; farabaugh, ) . translational frameshifting has been found in retroviruses, coronaviruses, transposons and a prokaryotic gene, leading to different translations of the same gene. two cases of translational 'hops' have been reported, where a segment of the mrna is being skipped by all ribosomes, in the two cases and nucleotides were skipped, respectively (farabaugh, ) . to our knowledge posttranscriptional editing and related processes are uncommon but definitely present in humans. it is, therefore, important to understand precisely how these mechanisms work, in order to correctly deduct the protein sequence from the gene sequence. the most common posttranslational modifications are side chain modifications like phosphorylations, glycosylations and farnesylations, as well as others. however, some modifications may also affect the (apparent) gene to protein mapping. posttranslational processing may involve removal of both terminal and internal protein sequence fragments. in the latter case an internal protein region is removed from a protein precursor, and the external domains are joined to form a mature protein (hodges et al., ; xu et al., ) . interestingly, all intervening protein sequences reported so far have sequence similarity to homing endonucleases (doolittle, ) , which also can be found in coding regions of group i introns (grivell, ) . posttranslational modifications like phosphorylation, glycosylation, sulfation, methylation, farnesylation, prenylation, myristylation and hydroxylation should also be considered in this context. they modify properties of individual residues and of the protein, and may thus make surface prediction, dynamics simulations and structural modelling in general more complex. the residues that are specifically prone to such modifications are tyrosines (phosphorylation and sulfation), serine and threonine (o-glycosylation), asparagine (nglycosylation), proline and lysine (hydroxylation) and lysine (methylation). in addition glutamic acid residues can become y-carboxylated leading to high affinity towards calcium ions (alberts, ) . specific transferases are involved in the modification, e.g., tyrosylprotein sulfotransferases (suiko et al., ) and farnesyl-protein transferases (omer et al., ) . phosphorylation of amino acid residues is an important way of controlling the enzymatic function of key enzymes in the metabolic and signalling pathways. tyrosine kinases phosphorylate tyrosine residues -thus introducing an electrostatic charge at a residue, which under normal physiological ph is uncharged. phosphorylation is central to the function of many receptors, such as the insulin and insulin-like growth factor i receptors. given the possibility that several modifications may be introduced in the sequence when we move from gene to mature protein, the task of deducting a protein sequence from the gene sequence may be more complex than we normally assume. although the protein sequence itself is a valuable starting point, the optimal basis for a rational protein engineering project will be a full structure determination of the protein. in many cases this turns out to be an expensive and timeconsuming part of the project. most structure determinations are based on x-ray crystallography. this approach may give structures of atomic resolution, but is limited by the fact that stable high quality crystals are needed. many proteins are very difficult to crystallise, in particular many structural and membrane-associated proteins. a large number of important x-ray structures have been published over the last few years, and the structures of the hhai methylase (klimasauskas et al., ) , the tbp/tata-box complex (kim et al., a; kim et al., b) and the porcine ribonuclease inhibitor (kobe and deisenhofer, ) are mentioned as examples only. nmr may be an alternative in many cases, as the proteins can be studied in solution, and for some experiments they can even be membrane associated. however, nmr is limited to relatively small molecules, and even with incorporation of labelling in the protein the upper limit for a full structure determination using current state of the art methods seems to be close to kda. some novel techniques for studying structural aspects of larger proteins will be discussed (vide infra). representative examples of important nmr structures may be interleukin / (clore et al., a) , the glucose permease iia domain (fairbrother et al., ) and the human retionic acid receptor-/ dna-binding domain (knegtel et al., ) . cryo electron microscopy (cem) is a relatively new approach to protein structure determination. the resolution of the structures are still lower than the corresponding x-ray structures, and a -dimensional crystal is a prerequisite. however, despite this cem appears to be a very promising approach to structure determination of membrane associated proteins that can form -dimensional crystals. cem has been used to study the nicotinic acetylcholine receptor at .~ resolution (unwin, ) and the atp-driven calcium pump at a resolution (toyoshima et al., ) , and in a combined approach using high resolution x-ray data superimposed on cem data the structure of the actin-myosin complex (rayment et al., ) and of the adenovirus capsid (stewart et al., ) has been studied. the recent structure by kiihlbrandt et al. ( ) of the chlorophyll a/b-protein complex at . a resolution shows that the resolution of cem rapidly is approaching the resolution of most x-ray protein data. scanning tunnelling microscopy (stm) is another new approach for studying protein structures (amrein and gross, ; lewerenz et al., ; haggerty and lenhoff, ) . the method is interesting because of a very high sensitivity, as individual molecules may be examined. the method will give a representation of the surface of the molecule, rather than a full structure determination. however, it is possible that both cem and stm can be used for identification of protein similarity. if data from these methods show that the overall shape of a protein is similar to some other known high resolution protein structure, then the known structure may be evaluated as a potential template for homology based modelling. we believe that such a model can either be used as an improved starting point for a full structure determination (i.e., for doing molecular replacement on x-ray data), or as a low resolution structure determination by itself. in homology based modelling a known structure is used as a template for modelling the structure of an homologous sequence, based on the assumption that the structures are similar. this is a very simple and rapid process, compared to a full structure determination. the sequences may be homologous in the strict sense, meaning that there is an evolutionary relationship between protein data banks the sequences. the same approach may obviously also be used for sequences that are similar, but not necessarily evolutionary related, and in that case we probably should talk about similarity based modelling. however, in this paper we will use homology based modelling as a general term, especially since the distinction between homology and similarity may be difficult in many cases. homology based modelling may turn out to be essential for the future of protein engineering. in fig. , the number of entries in the swissprot protein sequence database (bairoch and boeckmann, ) and the brookhaven protein structure database (bernstein et al., ; abola et al., ) are shown as a function of time. as we can see, there is a very significant gap between the number of sequences and the number of structures. this gap is in fact even larger than shown in fig. , as not all entries in the brookhaven database are unique structures. a large number of entries are mutants of other structures or identical proteins with different substrates or inhibitors. there has been an exceptional growth in the number of protein structures over the last - years. however, it is unrealistic to assume that we will be able to get high resolution experimental structures of all known proteins. the structure determination process is too time consuming, and the sequence databases are growing at a far faster pace, as shown in fig. , especially as a consequence of several large-scale genome sequencing projects. on the other hand, it may not really be necessary to do experimental structure determination of all proteins (ring and cohen, ) . the assumption that similar sequences have similar structures (see fig. ) has been proved valid several times and it seems to be true even for short peptide sequences as long as they come from proteins within the same general folding class ). an interesting case which is to some degree an exception to this rule is the structure of hiv- reverse transcriptase (kohlstaedt et al., ) . two units with identical sequence have similar secondary structure, but very different tertiary structure. however, this seems to be a rather exceptional case. new approaches to general structure alignment (orengo structure distance sequence distance fig. . sequence and structure similarity. in most cases similar sequences have similar structures (region ), and dissimilar sequences (i.e., measured by a standard mutation matrix) have dissimilar structures (region ). in several cases quite dissimilar sequences have been shown to have very similar structures, at least with respect to individual domains (region ). in very special cases we may have similar sequences with different structures (region ), at least with respect to tertiary structure, showing that environment and binding to other proteins may be essential for the final conformation in some cases. however, in most cases it seems to be safe to assume that structures can be found in the lower grey triangle of this graph, indicating that structure is better conserved than sequence. holm et al., ; alexandrov and go, ; lessel and schomburg, ) have made it possible to search for structurally conserved domains in proteins with very low sequence similarity (swindells, ) . this is an important approach, as structure normally is better conserved than sequence (doolittle, ) . several cases have been identified where the sequences are very different (at least by traditional similarity measures), whereas the three-dimensional structures are surprisingly similar. the identification of a globin fold in a bacterial toxin (holm and sander, ) , and the similarity between the dsba protein and thioredoxin (martin et al., ) are relevant examples. recently the structure of the human serum amyloid p component was shown to be similar to concanavalin a and pea lectin, despite only % sequence identity (emsley et al., ) , and the similarity between hen egg-white lysozyme and a lysozyme-like domain in bacterial muramidase "is remarkable in view of the absence of any significant sequence homology", as noted by thunnissen et al. ( ) . this shows that there probably is a limited number of protein folds, and this number must be lower than the number of sequence classes, defined as groups of similar protein sequences. recent estimates show that this number probably is close to different protein folds (chothia, ) , and approx. of these folds are known so far (burley, ; orengo et al., ) . this means that rather than full structure determination of a very large number of proteins, it may be sufficient to do structure determination of only a few selected examples of each protein fold, and use this as a basis for homology based modelling of other proteins shown to have the same fold. homology based modelling of the -d structure of a novel sequence can be divided into several steps. first, one or more templates must be identified, defined as known protein structures assumed to have the same fold as the trial sequence. then a sequence alignment between trial sequence and template is defined, and based upon this alignment an initial trial model can be built. this initial model must be refined in several steps, taking care of gap splicing, loops, side chain packing etc. the final model can be evaluated by several quality criteria for protein structures. an example of homology based modelling is the modelling of cinnamyl alcohol dehydrogenase based on the structure of alcohol dehydrogenase (mckie et al., ) . the protein folding problem is a fundamental problem in structural biology. this problem can be defined as the ab initio computation of a protein's tertiary structure starting from the protein sequence. this problem has not been solved and appears to be extremely difficult. if we want to solve the problem by computing an energy term for all conformations of a protein, defined by rotation around the ~b and ~o backbone angles of n residues in degree steps, we have to evaluate (n-d alternatives, even without considering the side chains. for a peptide with residues this corresponds to conformations. a hypothetical computer with processors, each processor running at hz (the frequency of uv light) and completing the energy evaluation of one conformation per cycle would need x years in order to test all conformations. the estimated age of the universe is x ~ years. a more realistic approach is the use of molecular dynamics or monte carlo methods for simulation of protein folding. however, it is still very difficult to use this as an ab initio approach, both because folding is a very slow process compared to a realistic simulation time scale, and also because it is very difficult to distinguish between correctly and incorrectly folded structures using standard molecular mechanics force fields (novotny et al., ) . a possible alternative approach may be to generate potential folds on a simplified lattice representation of possible residue positions (covell and jernigan, ; crippen, ) . however, this approach is still very experimental. some progress has been achieved in the area of secondary (rather than tertiary) structure prediction (benner and gerloff, ) . studies of local information content indicate that % match may be an upper limit for single-sequence prediction methods (rao et al., ) , whereas methods taking homology data into account may probably raise this limit to approx. %. methods based on neural networks and combinations of several prediction schemes seem to give good predictions, and especially methods using homology data from multiple alignments may give predictions at % match or better in many cases (salzberg and cost, ; boscott et al., ; rost and sander, a; rost et al., ; levin et al., ) . also methods taking potential residue-residue interactions into account, like the hydrophobic cluster analysis (hca), may be used for identification of potential secondary structure elements (woodcock et al., ) . it has been shown that by restricting the prediction to a consensus region with stable conformation it is possible to make very reliable predictions (rooman and wodak, ) . in one case, neural networks were shown to be capable of returning a limited amount of information on the tertiary structure (bohr et al., ) . . structure retrieval by secondary structure. a flow chart for structure retrieval by secondary structure (right side) compared to retrieval by sequence (left side). please see the text for details. in this example the secondary structure library was generated using dssp (kabsch and sander, ) , the secondary structure was predicted with the phd program (rest et al., ), and fasta (pearson, ; pearson and lipman, ) was used to search the secondary structure library and the nrl- -d databases (namboodiri et al., ; george et al., ) . only the secondary structure based method was able to identify the hla class i structure as similar to the class ii structure. the ribbon representation of the hla class i antigen binding region used in this figure was generated with molscript (kraulis, ) . be inconsistent, compared to the more sophisticated classification which can be achieved by a trained expert. recent studies show that the average agreement between alternative assignment methods used on identical structures is close to % for three methods (colloc'h et al., ) , or % if only two methods are compared (woodcock et al., ) . vadar is a new classification method which is aiming at a better agreement between manual and automatic assignment (wishart et al., ) , to what degree this may have influence on prediction systems remains to be seen. over the last few years it has been realised that the inverse folding problem is much easier to solve (bowie et al., ; blundell and johnson, ; bowie and eisenberg, ) . the inverse folding problem can be defined as follows: given a known protein structure, identify all protein sequences which can be assumed to fold in the same way. a large number of protein structures must be available in order to use this as a general approach, as the relevant protein fold has to be represented in the database in order to be identified. however, with a limited number of possible folds actually used by nature, a complete database of all folds appears to be possible. some information about possible folding classes can be derived from experimental data. circular dichroism can be used as a crude way of measuring the relative amounts of secondary structure in a protein. classification methods based on amino acid composition can be used for classification of proteins into broad structural classes zhou et al., ; chou and zhang, ; dubchak et ai., ) . this information may limit the number of different folds which have to be evaluated. it is also possible that such information may be used to improve the performance of other methods, although data on secondary structure prediction of all-helical proteins seems to indicate that the gain may be small (rost and sander, b) . however, for a unique identification of folding class more sensitive methods are needed, and the most useful one is probably some kind of protein sequence library search. in order to identify the folding class we have to search a database of known protein structures with our trial sequence. the problem is that standard methods for sequence retrieval may not be sensitive enough in all cases. if the sequences are similar, then retrieval is trivial. however, we know that there are cases where structures are known to be similar despite very different sequences. how can these cases be identified in a reliable way? the most promising approaches are based on methods for describing the environment of each residue (bowie et al., ; eisenberg et al., ; overington et al., ; ouzounis et al., ; wilmanns and eisenberg, ; liithy et al., ) . this description can be used for generating a profile, showing to what degree each residue is found in a similar environment in other structures, and this profile can be used as a basis for sequence alignment and library searches. similar property profiles can also be used for searching database systems of protein structures (vriend et al., ) . a very simple approach can be used if we accept the hypothesis that protein sequences representing structures with a similar linear distribution of secondary structure elements may fold in a similar way. we can then create a sequence type library of known structures where the residues are coded by secondary structure codes rather than residue codes (see fig. ). given the sequence of a protein with unknown -d structure, we can use a secondary structure prediction method and translate the sequence into a secondary structure description. if we define a suitable 'mutation' matrix describing the probability of inter conversion between different secondary structure elements, then a standard library search program like fasta (pearson and lipman, ; pearson, ) can be used in order to identify potential template structures. the example shown in fig. is the identification of hla class i as a suitable candidate for homology modelling of hla class ii. the sequence similarity is very low, % sequence identity in the antigen binding region (based on alignment of the structures), and especially for this region most sequence based methods will retrieve a large number of alternative sequences before any of the class i molecules. bly improve the performance to use a positiondependent gap penalty, where most gaps are placed in loop regions rather than in helices or strands. however, the method is very simple to implement and test, as necessary tools and data already are available in most labs. however, for the secondary structure based approach the hla class i sequences are retrieved as top candidates. the structure prediction did not include any information about the hla class ii structure, which recently has been published (brown et al., ) . it should be mentioned that the % sequence identity score is not significantly higher than the score from a random alignment of sequences. if we, for each sequence in the swissprot protein sequence library, align it against a sequence selected at random from the same library (alignment without gaps, using the full length of the shortest sequence, and start the alignment at a random position within the longest sequence), then the average percentage of identical residues is ( _+ )% at standard deviations. the identification method using secondary structure is based on an assumption which has to be examined more closely, and the implementation of it is very crude. much work can be done on the secondary structure prediction, the 'mutation' matrix and the search method. it will proba- . . sequence alignment pip _rat as described in the introduction a crucial feature in molecular evolution has been the parallel exploration of several different mutations. and although mechanisms like horizontal gene transfer and intragenic recombination may have been important as key steps in the evolution of new proteins, the most common mechanism seems to have been gene duplication followed by mutational modification (doolittle, ) . this means that especially multiple sequence alignment can give essential information about the mutation studies already performed by nature. conserved residues are normally conserved because they . multim alignment. alignment of inositol triphosphate specific phospholipase c / from rat (pip rat) against three other pip sequences from rat. each horizontal bar represents a sequence, marked in residue intervals. black lines connecting the bars represent well conserved motifs found in all sequences, in this case subsequences of residues where at least residues are completely conserved in all sequences. it is very easy to identify two well conserved regions, annotated as region x and y in the swissprot entries, despite a residue insertion in two of the sequences. this insertion contains sh and sh domains (pawson, ) . it is an interesting observation that the extra c-terminal domain of the pipi_rat sequence shows a weak similarity to myosin and tropomyosin sequences. have an important structural or functional role in the protein, and identification of such residues will thus give vital information about structure and activity of a protein. several tools have been developed for multiple alignment. a very attractive one is macaw (schuler et al., ) , which will generate several alternative alignments of a given set of regions, and in a very visual way help the user to identify a reasonable combination of (sub)alignments. an even more general tool is muitim (drablcs and . here all possible alignments, based on short motifs, are shown simultaneously, and the user is free to identify potential similarities even in cases with low sequence identity and very disperse motifs. this is possible because of the superior classification potential of the human brain compared to most automatic approaches. the method includes an option for probability based filtering of motifs, and an example of a multim alignment is shown in fig. . however, it is important to realise that in standard sequence alignment we are trying to solve a three-dimensional problem (residue interactions) by using an essentially one-dimensional method (alignment of linear protein sequences). as a consequence important conserved throughspace interactions may not be evident from a standard sequence alignment. a good example can be found in the alignment of lipases (schrag et al., ) . in fig. , the sequence alignment of residues in a structurally conserved core of three lipases (rhizomucor miehei lipase (derewenda et al., ) , candida antarctica b lipase (a. jones, personal communication) and human pancreatic lipase (winkler et al., ) is shown. the active site residues, ser (s), asp (d) and his (h), are shown as black boxes. the ser and his residues are at identical positions. however, the asp residue of the pancreas lipase is at a very different sequence position compared to the other two lipases. it would be very difficult to identify this as the active site asp from a sequence alignment. if we look at the structural alignment in fig. , we see that the positions are structurally equivalent, it is possible for all three lipases to have highly similar relative orientation of the active site atoms, despite the fact that the alternative asp positions are located at the end of two different / -strands. an improved alignment may be generated if we can incorporate -d data for at least one of the sequences in the linear alignment (gracy et al., ) . however, in order to get a reliable alignment of sequences with low sequence similarity, we have to take true three-dimensional effects into account. this means that if we are able to identify a known -d structure as a potential basis for modelling, then the sequence alignment should be done in -d using this structure as a template. this can be done by threading the sequence through the structure and calculating pairwise interactions (jones et al., ; bryant and lawrence, ) . as soon as a template has been identified, and an alignment between this template and a sequence has been defined, a -d model of the protein can be generated. we can either use the template coordinates directly, combined with dif- fig. . sequence alignment of lipases. alignment of structurally conserved regions of three lipases. for each lipase the solvent accessible surface in % compared to the gxg standard state (grey scale, white is buried and black is exposed), the secondary structure as defined by the dssp program, and the sequence is shown. the position of each subsequence in the full sequence is also shown. the active site residues are shown in white on black. please observe the shift of the active site asp (d) between two very different positions. the alignment was generated using alscript (barton, ) . ferent modelling approaches for the ill-defined regions, or the template can be used as a more general basis for folding the protein by distance geometry (srinivasan et al., ) or general molecular dynamics methods. loop regions are often highly variable, and must be treated with special approaches (topham et al., ) . it is also necessary to consider the orientation of side chains. although the backbone may be well conserved, many residues especially at the protein surface will be mutated, as shown in fig. . the stability of a protein depends upon an optimal packing of residues, and it is important to optimise side chain conformation if we want to study protein stability and complex formation. a very common approach is the use of rotamer libraries combined with molecular dynamics refinement. recent studies show that this step of the modelling in fact may be less difficult than has been assumed . and important interactions, and exposed regions may to some degree be identified by using antibodies. however, in many cases the rational for modelling by homology is the very lack of experimental data related to structure, and we have to use other more general methods for evaluation of models. some of the approaches we already have described for sequence alignment can obviously also be used for evaluation of models. in general, model evaluation can be based on -d profiles (lfithy et al., ) , contact profiles (ouzounis et al., ) or more general energy potentials (hendlich et al., ; jones et al., ; nishikawa and matsuo, ) . some of these approaches have been implemented as programs for evaluation of structures or models, like procheck (laskowski et al., ) and prosa (sippl, a, b) . however, in general no model (or even experimental structure) should be trusted beyond what can be verified by experimental methods. a protein model based on homology (or similarity) has to be verified in as many ways as possible, and experimental methods should always be preferred. mutation studies may give valuable information about active site residues a prerequisite for rational protein engineering is -d structure information about the protein. in fig. , including parts of the sequences connecting the core regions. the active site asp is able to maintain a similar relative orientation, despite very different sequence positions. the alignment was generated using insight (biosym technologies). adddition to x-ray crystallography, nmr is the most important method for protein structure determination. x-ray crystallography has several advantages when compared to nmr. solving the crystal structure by x-ray crystallography is usually fast as soon as good crystals of the protein are obtained (even if it may not be so easy to obtain these crystals). it is also possible to determine the structure of very big proteins. the major disadvantage of x-ray crystallography is that it is the crystal structure that is determined. this implies that crystal contacts may distort the structure (chazin et al., ; wagner et al., ) . since active sites and other binding sites usually are located on the surface of the proteins, very important regions of the protein may be distorted. some structures even show large differences between nmr and x-ray structure (frey et al., ; klevit and waygood, ) the advantage of nmr is that it is dealing with protein molecules in solution, usually in an environment not too different from its natural one. it is possible to study the protein and the dynamical aspects of its interaction with other molecules like substrates, inhibitors, etc. it is also possible to obtain information about apparent pk a values, hydrogen exchange rates, hydrogen binding and conformational changes. all nuclei contain protons, and therefore they carry charge. some nuclei also possess a nuclear spin. this creates a magnetic dipole, and the nuclei will be oriented with respect to an external magnetic field. the most commonly studied nuclei in protein nmr ( h, c and n) have two possible orientations, representing high and low energy states. the frequency of the transition between the two orientations is proportional to the magnetic field. at a magnetic field of . tesla the energy difference corresponds to about mhz for protons. in an undisturbed system there will be an equilibrium population of the possible orientations, with a small difference in spin population between the high and low energy orientation. the equilibrium population can be perturbed by a radio frequency pulse of a frequency at or close to the transition frequency. in addition, the spins will be brought into phase coherence (concerted motion) and a detectable magnetisation will be created. the intensity of the nmr signal is proportional to the population difference between the levels the nuclei can possess. nuclei of the same type in different chemical and structural environments will experience different magnetic fields due to shielding from electrons. the shielding effect leads to different resonance frequencies for nuclei of the same type. the effect is measured as a difference in resonance frequency (in parts per million, ppm) between the nuclei of interest and a reference substance, and this is called the chemical shift. in molecules with low internal symmetry most atoms will experience different amounts of shielding, the resonance signals will be distributed over a well-defined range, and we get a typical nmr spectrum. the process that brings the magnetisation back to equilibrium may be divided into two parts, longitudinal and transverse relaxation. the longitudinal or t relaxation describes the time it takes to reach the equilibrium population. the transverse or t relaxation describes the time it takes before the induced phase coherence is lost. for macromolecules the t relaxation is always shorter than the t a relaxation. short t relaxation leads to broad signals because of poor definition of the chemical shift. most molecules have dipoles with magnetic moment, and the most important cause of relaxation is fluctuation of the magnetic field caused by the brownian motion of molecular dipoles in the solution. how effective a dipole may relax the signal depends upon the size of the magnetic moment, the distance to the dipole, and the frequency distribution of the fluctuating dipoles. a nucleus may also detect the presence of nearby nuclei (less than three bonds apart), and this will split the nmr signal from the nucleus into more components. several nuclei in a coupling network is called a spin system. by applying radio frequency pulses it is possible to create and transfer magnetisation to different nuclei. it is, as an example, possible to create magnetisation at one nucleus, and transfer the magnetisation through bonds to other nuclei where it may be detected. the pulses are applied in a so-called pulse sequence (ernst, ; kessler et al., ) . the methodology for determination of protein structure by two-dimensional nmr is described in several textbooks and review papers (wagner, ; wiithrich, ; wider et al., ) . the standard method is based on two steps, sequential assignment: assignment of resonances from individual amino acids, and distance information: assignment of distance correlated peaks between different amino acids. the first step involves acquiring coupling correlated spectra (cosy, tocsy) in deuterium oxide to determine the spin system of correlated resonances. some amino acids have spin systems that in most cases make them easy to identify (gly, ala, thr, ile, val, leu). the other amino acids have to be grouped into several classes, due to identical spin systems, even though they are chemically different. the spin systems can be correlated to the nh proton by acquiring cosy and tocsy spectra in water. the assigned nh resonance is then used in distance correlated spectra (noesy) to assign correlations to protons (nh, h,, h~) at the previous amino acid residue (fig. ) . by combining the knowledge of the primary sequence (which gives the spin system order) with the nmr data collected it is possible to complete the sequential assignment. when the sequential assignment is done the assignment of short range noe (up to four residues) will give information about secondary structure (a-helix, fl-strand). long-range correlations will serve as constraints (together with scalar couplings) to determine the tertiary structure of the protein. excellent procedures describing these steps are available (roberts, ; wiithrich, ) . with large proteins there will be spectral overlap of resonance lines. the problem is partially solved by labelling the protein with c and n isotopes. triple resonance multidimensional nmr methods (griesinger et al., ; kay et al., ) may then be applied. the resonances will then be spread out in two more dimensions ( c and ~sn) and the problem with overlap is reduced. these methods depend upon the use of scalar couplings to perform the sequential assignment, the sequential assignment procedure will then be less prone to error. the noesy spectra of such large proteins are often very crowded, but four-dimensional experiments like the c-~ c edited noesy spectrum (clore et al., b) have been designed. such experiments will spread the proton-proton distance correlated peaks by the chemical shift of its corresponding c neighbour and reduce the spectral overlap. secondary structure elements may also be predicted from the chemical shift of h and c (spera and bax, ; williamson and asakura, ; wishart et al., ) . obtaining nmr-spectra of proteins has some aspects that should be considered. spectral overlap. as we move to larger proteins the probability of overlap of resonance lines increases. at some point it will become impossible to do sequential assignment due to this overlap. application of -d and -d multiresonance nmr has made it possible to assign proteins in the kda range (foght et al., ; stockman et al., ) . fast relaxation. as the size of the protein is increased the rate of tumbling in solution is re-duced. this leads to a reduced transverse relaxation time (t ), and broadening of the resonance lines in the nmr spectra. the intensities of the peaks are reduced and they may be difficult to detect. the short transverse relaxation time will also limit the length of the pulse sequences it is possible to apply (because there will be no phase coherence left), and multidimensional methods become difficult. it is possible to determine a -d structure by nmr or x-ray crystallography are probably a subset of all proteins (wagner, ) . proteins may have regions with mobility and few cross peaks. the effective size of a protein is often increased by aggregation. the amount of aggregation can often be reduced by reducing the protein concentration. thus, very often the degree of aggregation will determine whether it is possible to assign and solve a protein structure by nmr, by limiting the maximum concentration that may be used. the stability of the proteins is also a major issue. a sample may be left in solution for days, often at elevated temperatures, so denaturation may become a problem. photo-cidnp (chemically induced nuclear polarisation) is an interesting technique for the study of surface positioned aromatic residues in proteins (broadhurst et al., ; cassels et al., ; hore and kaptain, ; scheffier et al., ) . by introducing a dye and exciting it with a laser, it is possible to transfer magnetisation to aromatic residues, where it can be observed. in addition to high-resolution nmr, solid state nmr has also been applied to studies of proteins. studies of active sites and conformation of bound inhibitors yields interesting information. the stability of proteins may be monitored under different conditions by detecting signals from transition intermediates bound to the active site (burke et al., ; gregory et al., ) . structural constraints on transition state conformation of bound inhibitors can be obtained (auger et al., ; christensen and schaefer, ) . structural constraints of the fold and conformation of the amino sequence may be gathered by setting upper and lower distances for lengths between specific amino acids (mcdowell et al., ) . using solid-state nmr it is also possible to study membrane proteins and their orientation with respect to their membrane (killian et al., ; ulrich et al., ) . we expect such studies to give insight into ion channels in membranes (woolley and wallace, ). an important mechanism for relaxation in high-resolution nmr is dipolar relaxation. usually this is induced by the spin of nuclei in the immediate vicinity, and it is a function of the size of the dipole. the electron is also a magnetic dipole, and the magnitude of this dipole is about -times that of a proton. paramagnetic compounds have an electron that will interact with . the paramagnetic relaxation method. outline of the paramagnetic relaxation method. the protons located at the protein surface will be closer to the dissolved paramagnetic relaxation agent than the protons located inside the protein core, hence the resonance lines from protons at the surface will be broadened more than resonance lines stemming from protons located inside the protein. nearby protons and increase the relaxation rate of these protons. the widest use of paramagnetic compounds has been of gd ÷ bound to specific sites in a protein , but also other compounds have been used (chang et al., ; hernandez et al., a, b ). this will make it possible to identify resonance lines from residues in the vicinity of the binding site. it is also possible to calculate distances from the paramagnetic atom as the relaxation effect is distance dependent. the paramagnetic broadening effect can also be used with a compound moving freely in solution (drayney and kingsbury, ; esposito et al., ; petros et al., ) . in this way residues located on or close to the protein surface will give broadened resonance lines compared to residues in the interior of the protein. this method can be used to measure important noe and chemical shifts inside the protein directly, or it can be used as a difference method to identify resonances at the surface by comparing spectra acquired with and without the paramagnetic relaxation agent (fig. ) . we have used the paramagnetic compound gadolinium diethylenetriamine pentaacetic acid (gd-dtpa) as a relaxation agent. gd-dtpa will increase both the longitudinal and the transverse relaxation rates of protons within the influence sphere. suitable nmr experiments to highlight the relaxation effect may be noesy, roesy and tocsy (bax and davis, ; braunschweiler and ernst, ) gd-dtpa is widely used in magnetic resonance imaging (mri) to enhance tissue contrast. it is assumed to be non-toxic and we do not expect it to bind to proteins. we used the wellstudied protein hen egg-white lysozyme as a test protein. both the structure and the nmr spectra of this protein are known (diamond, ; redfield and dobson, ) , and the protein is extremely well suited for nmr experiments. in fig. , the -d ih-nmr spectrum recorded in the presence and absence of gd-dtpa is shown. although it is evident that there is a selective broadening in the -d spectrum, it is also clear that there are problems with overlapping spectral lines. we therefore applied two-dimensional nmr methods, and shown in fig. is the low field region of a noesy spectrum of lysozyme. the region corresponds to the same region as shown in fig. . from fig. we see that the signals from w , w and w disappear with addition of gd-dtpa, while the signals from w , w and wlll still are observable. by examination of the solvent accessible surface of lysozyme it is evident that the indole nh of w , w and w is exposed to solvent, while the indole nh of w , w and wlll is not exposed. this shows that the changes in the spectrum are as expected from the structure data. the appearance of the nh-nh region of the spectrum (fig. ) also shows the reduction in the number of signals in the gd-dtpa exposed spectrum. this shows that the paramagnetic broadening effect can be used for selective identification of signals from solvent exposed residues in a protein. one of the fundamental steps in the protein engineering process shown in fig. is the design step, where a correlation between structure and properties is established in order to select potential structural candidates that match new functional profiles. the understanding of this correlation implies a realistic modelling of the physical chemical properties involved in the functional features to be engineered. these features are basically of two types: diffusional and catalytic. any ligand binding to a protein, whether ligandreceptor or substrate-enzyme, is essentially a diffusional encounter of two molecules. electrostatic interactions are the strongest long-range forces at the molecular scale and, thus, it is not surprising that they are one of the determinant effects in the final part of the encounter (berg and von hippel, ) . in the case of substrateenzyme interactions the catalytic step that follows the binding of the substrate seems to be possible mainly by the presence of electrostatic forces that stabilise the reaction intermediates in the binding site (warshel et al., ) , from which the product formation may proceed. another and much more basic necessary condition for a successfully engineered protein is that a functional folded conformation is maintained. solvation of charged groups is one of the determinants in protein folding (dill, ) , so that even the conformation of the protein is electrostatically driven. given the ubiquitous role of electrostatic interactions, it is then obvious that their accurate modelling is an essential prerequisite in the design of engineered proteins. several good reviews exist on protein electrostatics (warshel and russel, ; matthew, ; rogers, ; harvey, ; davies and mccammon, ; sharp and honig, ) . this section intends to give a brief overview of the subject. we start by presenting the methods one can use to model electrostatic interactions. the most familiar methodology in biomolecular modelling is certainly molecular mechanics (mm) (either through energy minimisations or molecular dynamics (md)). we point out some of the limitations of mm in the treatment of electrostatic interactions, and the need to use alternative ways of describing the system, such as continuum methods. the computation of ph-dependent properties and some potential extensions of mm are also discussed. finally, we refer some applications of electrostatic methods relevant to protein engineering. in mm simulations, electrostatic interactions are usually described with a pairwise coulombic term of the form qlq /dr ,were ql and q are the charges of the pair of atoms, r their distance, and d the dielectric constant. d is usually set equal to when the solvent is included. a complete simulation in a sufficiently big box with water molecules should, in principle, give a realistic description of the protein molecule (harvey, ) . this would be specially true if a force field including electronic polarizability effects (see . .) was available for use with biomolecular systems, which unfortunately is not the case (harvey, ; davis and mccammon, ) . we use the term force field in this context as including both the functional form and parameters describing the energetics of the system, from which the forces are derived. simulations where solvent molecules are not treated explicitly are naturally appealing, since the computation time increases with the square of the number of atoms. several methods have been proposed that attempt to account for solvent effects. the more popular approach is an ad hoc dielectric 'constant' proportional to the distance (e.g., mccammon and harvey, ) but different distance dependencies can be used (e.g., solmajer and mehler, ) . a variety of more elaborated methods were also suggested (northrup et al., ; still et al., ; gilson and honig, ) . all these methods should be viewed as attempts of including solvent screening effects in a simplified way. they can be useful when inclusion of water is computationally prohibitive, but they cannot substitute for an explicit inclusion of solvent since, e.g., the existence of hydrogen bonding with the solvent is not properly described by these approaches. mm of biomolecules has, in general, heavy computation needs. the number of water molecules that should be included in order to simulate a typical protein in a realistic way is quite large, especially if one wants to perform md. also, each pair of atoms has its own electrostatic interaction and the number of pairs cannot be lowered by a short cut-off distance (e.g., a) as in van der waals interactions, since electrostatic interactions are very long range, typically up to ~,. mm simulations have also some limitations on the description of the system, since ph and ionic strength effects usually are difficult or impossible to include. the only way to include ph effects is through the protonation state of the residues. each titrable group (in asp, glu, his, tyr, lys, arg, c-and n-terminal) in the protein have two states, protonated or unprotonated. thus, a protein with n titrable groups will have n possible protonation charge sets. the best we can do is to choose the set corresponding to the protonation states of model compounds at the desired ph. free ions can be included in md simulations of proteins (levitt, ; mark et al., ) , but it is not clear if the simulated time intervals are long enough to realistically reflect ionic strength effects. another problem with mm is that the understanding it provides of the system (through energy minimisation or md) does not include entropic aspects explicitly, i.e., it does not give free energies directly. there are methods to calculate free energies based on mm potentials (beveridge and dicapua, ) , but even though several applications have been made on biomolecular systems (for a review see beveridge and dicapua, ) , they are still too demanding for routine use in systems of this size. then, when the properties under study are related to free energies rather than energies (which is often the case), mm by itself can only be seen as a first approach. in summary, although mm simulations can provide some unique information on the structural and dynamical behaviour of biomolecular systems, some limitations exist due to both conceptual and practical reasons, in particular regarding the treatment of electrostatic interactions. fortunately, other methods exist that can provide insight on aspects whose modelling is poor or absent in mm simulations, although at the cost of the atomic detail in the description. there is no 'best' modelling method and we should resort to the several methods available in order to gain an understanding of the system that is as complete as possible. the so-called continuum or macroscopic models assume that electrostatic laws are valid at the protein molecular level and that macroscopic concepts such as dielectric properties are applicable. protein and solvent are treated as dielectric materials where charges are located. these charges may be titrable groups (whose protonation state may vary), permanent ions (structural and bound ions, etc.) or, more recently, permanent partial charges of polar groups. given the dielectric description of the system and the placement of the charges, the problem can be reduced to the solution of the poisson equation (or any equivalent formulation), as can any problem of electrostatics (e.g., jackson, ) . the electrostatic potential thus obtained can be used to study diffusional processes or visually compare different molecules (see . .). the simplest continuum model assumes the same dielectric constant inside and outside the protein. typically, a value somewhere between the protein and solvent dielectric constants has been used (sheridan and allen, ; koppenol and margoliash, ; hol, ) . this approach completely ignores the effects of having two very different dielectric regions, but can be used for a first qualitative computation. the more common continuum models treat the protein as a low dielectric cavity immersed in a high dielectric medium, the solvent. the way the charges are placed in this cavity and the way the electrostatic problem is solved vary with the particular method. analytical solutions can be obtained for the simplest shapes, such as spheres, but in general the more complex shapes require numerical techniques. in the first cavity model the protein was assumed to be a sphere with the charge uniformly distributed over its surface (linderstr~m-lang, ) . tanford and kirkwood ( ) proposed a more detailed model in which each charge has a fixed position below the surface. assuming a spherical geometry allows for a simple solution to the electrostatic problem. it is even possible to include an ionic atmosphere that accounts for ionic strength effects (leading to the poisson-boltzman equation). the effect of ph occurs naturally in the formalism. the energy cost of burying a charge inside the low-dielectric protein (self-energy) is taken to be the same as in small model compounds, since at the time when this method was developed (before protein crystallography) charges were believed to be restricted to the protein surface. this limits the method to proteins without buried charges, unless we have some estimate on the self-energy. there are, obviously, some problems in fitting real, irregularshaped proteins to a spherical model. some solutions to this problem were proposed, including an ad hoc scaling of interactions based on solvent accessibility (shire et al., ) , and the placing of more exposed charges in the solvent region (states and karplus, ) . the inclusion of non-spherical geometries im-plies the use of numerical techniques, as referred above. warwicker and watson ( ) and used the finite differences technique to solve, respectively, the poisson and poisson-boltzman equations. self-energies can be included (gilson and honig, ) , such that the method is fully applicable when buried charges exist. the intrinsic discretization of the system in the finite differences technique, makes these methods readily applicable to any kind of spatial dependency on any of the properties involved. the inclusion of a spatially-dependent dielectric constant, for instance, will be relatively simple. other extensions such as additional dielectric regions (ligands, membranes, etc.), eventually with charges, should also be possible. alternative numerical techniques for solving the poisson or poisson-boltzman equations have also been used, including finite elements (orttung, ) and boundary elements (zauhar and morgan, ) . the dielectric constant in a region comes from the existence of dipoles in that region, permanent or induced. permanent dipoles are due to atomic partial charges (e.g., water dipole, peptide bond dipole). induced dipoles are due to the polarizability of electron clouds. warshel and levitt ( ) represented this electronic polarizability by using point dipoles in the atoms. as pointed out by davies and mccammon ( ) this representation is roughly equivalent to a spatially-dependent dielectric constant. this approach is usually combined with a simplified representation ot water by a grid of dipoles (warshel and russel, ) . ionic strength and ph effects are not considered. all the above methods deal with a particular charge set (see . .), even when ph effects are considered. however, a protein in solution does not exist in a single charge set. we are usually interested in the properties of a protein at a given ph and ionic strength, not at a particular charge set. moreover, if we want to test the available methods, we have to test them against experimental results which usually do not correspond to a specific charge set. a common test on the accuracy of electrostatic models is their ability in predicting pk a values of titrable groups in a protein (see . .), obtained via titrations, nmr, etc. these values can be quite different from the ones of model compounds, due to environment of the groups in the protein. this difference (pk a shift) can be of several pk units. the experimentally determined apparent pka (pkap p) is determined as the ph value at which half of the groups of that residue are protonated in the protein solution, i.e., when its mean charge is / (thus, the equivalent notation pk / ). then, if we can devise a method to compute the mean charge of the titrable groups at several ph values, we can predict their pkap p values. as mentioned above (see . .), we have n possible charge sets. any structural property can, in principle, be computed through a boltzman sum over all those sets, with each one contributing according to its free energy (taken as the electrostatic energy) (tanford and kirkwood, ; bashford and karplus, ) . the property thus computed is characteristic of the chosen ph value (and ionic strength, if considered) instead of a specific charge set. we are particularly interested in computing the mean charges at a given ph (see last paragraph). a sum with n terms is not, however, a trivial calculation in terms of computer time. tanford and roxby ( ) avoided the boltzman sum by placing the mean charges directly on the titrable groups, instead of using one of the integer sets. this corresponds to considering the titration of the different groups as independent (a mean field approximation; bashford and karplus, ) . other alternatives to the boltzman sum are the monte carlo method (beroza et al., ) , less drastic mean field approximations (yang et al., ; gilson, ) , the 'reduced site' approximation (bashford and karplus, ) , or even assume that the predominant charge set is enough to describe the system (gilson, ) . since electrostatic interactions in proteins are typically dominated by titrable groups whose charge is affected by ph, no electrostatic treat-ment can be complete without taking this effect into account. a simple, although effective, way of doing this is to: (i) compute the electrostatic free energies (e.g., by a continuum method); (ii) compute the mean charge of each titrable group at a given ph (e.g., by a mean field approximation); (iii) use those charges to compute the electrostatic potential (e.g., by a continuum method), which can be displayed together with the protein structure (see the human pancreatic lipase example in section . .). in this way a ph-dependent electrostatic model of the protein can be obtained, which is not possible with usual mm-based modelling techniques. as stated above (see . .), electronic polarizability is not explicitly considered in common force fields. van belle et al. ( ) included the induced dipole formalism (warshel and levitt, ) in mm calculations. the electrostatic interactions in the applied force field were simply 'corrected' with additional terms due to inducible dipoles. however, it should be noted that a force field fitted to experimental data without polarizability terms, should be fitted again if those terms are included. the protein conformation used in molecular modelling is usually an experimentally based (xray, nmr) mean conformation, characteristic of those particular experimental conditions. that conformation may, however, be inadequate for modelling the protein properties at different conditions. in particular, proteins are known to denaturate at extreme ph conditions. thus, ph-dependent methods such as the continuum methods may give incorrect results when using one single conformation over the whole ph range. actually, md simulations have shown that the results can be highly dependent on side chain conformation (wendoloski and matthew, ) . although overall properties like titration curves did not seem to be very sensitive, individual pka's showed variations up to . pk units. as mentioned in section . , mm has the problem of what charge set to use in simulations. instead of using a charge set corresponding to model compounds at the intended ph, one may use the predominant charge set of the protein, determined, e.g., by a continuum method, as suggested by gilson ( ) . a different approach to this problem would be to devise a way of including the averaged effect of all charge sets in the mm simulation. we have recently developed a method where a force field is derived which includes the proper averaged effect of all charge sets (a potential of mean force) (to be published). the method depends on the calculation of electrostatic free energies obtained from, e.g., a continuum method. the electrostatic potential, computed in some of the referred methods, can help to understand the contribution of electrostatic interactions in the diffusional encounters of proteins with ligands (substrates or not). the diffusional process driven by the electrostatic field can be simulated through brownian dynamics (bd) and diffusion rates may be computed (for references see, e.g., davies and mccammon, ) . the effect of mutations on the diffusion of superoxide ion into the active site of superoxide dismutase has been studied by this technique (sines et al., ) and faster mutants showing - -fold increase in reaction rate could be designed (getzoff et al., ) , although this enzyme usually is considered to be 'perfect'. electrostatically driven bd simulations can help to reveal steric 'bottlenecks' (reynolds et al., ) and orientational effects (luty et al., ) . this method can also be applied to study the encounter of two proteins (northrup et al., ) . visual comparison of electrostatic fields can also provide useful information. soman et al. ( ) showed that rat and cow trypsins have similar electrostatic potentials near the active site, despite a total charge difference of . units. as an illustration of such type of comparisons, using ph-dependent electrostatics, we have applied the solvent accessibility-modified tanford-kirkwood method (see . .) to the human pancreatic lipase structures with both closed (van tilbeurgh et al., ) and open lid (van tilbeurgh et al., ) , as shown in fig. a and b. fig. c-f shows surfaces corresponding to an electrostatic potential equal to + . kt/e (where k is the boltzman constant, t the absolute temperature and e the proton charge). these surfaces correspond to regions were the electrostatic interactions on a charge are roughly of the same magnitude as the thermal effects due to the surrounding solvent, i.e., where charged molecules in solution start to feel electrostatic steering or repulsion. at ph clear differences exist between the closed and open forms, the latter showing a dipolar groove in the presumed binding site region. at pi-i the molecule is strongly positively charged and most electrostatically differentiated regions have disappeared. given the role of electrostatic interactions on molecular orientation and association (see the beginning of this section ( )), this is expected to markedly affect the interaction with the lipid-water interface. for enzymes the catalytic activity involving a charged residue can be modulated by shifting the pk a of that residue. the pk a shifts of the active site histidine has been successfully predicted for a number of mutants of subtilisin loewenthal et al., ) . one of the main reasons why enzymes are good catalysts is because they stabilise the transition state intermediate (fersht, ) . for enzymatic reactions that are not diffusion limited, engineering leading to an enhanced stabilisation of the intermediate will result in an increased activity. the induced dipole method was used to compute the activation free energy for different mutants of trypsin and subtilisin (warshel et al., ) , with some qualitative agreement with the experimental results. the prediction of changes introduced by mutations on redox potentials could also be of interest to protein engineering. prediction of redox potentials has been made with some success (rogers et al., ; durell et al., ) . in plastocyanin the effect of chemically modifying charged groups was also considered (durell et al., ) . the effect of mutations could also be analysed, as has been done for pk a shift calculations (see above). the above examples clearly show that, whatever the particular method used, the modelling of a c e d t electrostatic interactions in proteins has an important role to play in protein engineering. a highly relevant example is the design of a faster 'perfect' enzyme (getzoff et al., ) , which also illustrates the combination of different methods (bd and electrostatic continuum methods) that can sometimes be determinant in a modelling study. the science of protein engineering is advancing rapidly, and is emerging in many new contexts, such as metabolic engineering. rational protein engineering is a complex undertakingand only the groups with sufficient understanding of sequences and -d structures can handle the complex underlying problems. predicting protein structure may be difficult -but predicting future developments in a very active branch of science can be hazardous at the best. however, we will review a few of the more recent research aspects that we are convinced will be of key importance in the future development of protein engineering. often the substrates or products in an enzymatic process are poorly soluble in an aqueous medium. this may lead to poor yields and difficult or expensive purification steps. the potential of using other solvents, either pure or in mixture, where substrates and/or products may be soluble has attracted a great deal of attention (tramper et al., ; arnold, ) . dissolving the protein in organic solvents will alter the macroscopic dielectric constant and lead to a much less pronounced difference between the interior and exterior static dielectric behaviour. protein function in such media may be altered and is poorly understood; we can expect a significant development in the future. despite the often dramatic change in dielectric constant when changing the solvent from, e.g., water to an organic substance, the protein -d structure can remain virtually intact, as has been documented in the case of subtilisin carlsberg dissolved in anhydrous acetonitrile (fitzpatrick et al., ) . the hydrogen bonding pattern of the active site environment is unchanged, and of the enzyme-bound structural water molecules are still in place. one-third of the enzymebound acetonitrile molecules reside in the active site. many enzymes remain active in organic solvents and in the case of enzyme reactions where the substrate has very poor water solubility, a change to organic solvent can be of major importance (gupta, ). an extreme case of a non-conventional medium for enzymatic action is the gas phase. certain enzymes, immobilised on a solid bed, have been shown to be active at elevated temperatures towards selected substrates in the gas phase (lamare and legoy, ) . obviously the range of substrates that potentially can be used is limited to those that actually can be brought into the gas phase under conditions where the enzyme is still active. enzymes for which such reactions have been studied include hydrogenase, alcohol oxidase and lipases. the fact that even interfacially activated lipases (such as the porcine pancreatic and the candida rugosa lipases) function with gas phase carried substrate molecules opens up the interesting possibility of studying the role of water in this reaction. protein engineering may be used to enhance enzyme activity in organic solvents (arnold, ; fig. . electrostatic maps of hpl with closed and open lid. ribbon models of human pancreatic lipase with colipase are shown with closed (left: a,c,e) and open (right: b,d,f) lid. the colipase is shown in blue and the mainly a-helical 'lid' region is highlighted in cyan. the residues of the active site are shown in green. access to the active site pocket seems to be controlled by the conformational st'ate of the lid. electrostatic isopotential contours of + . kt/e are shown at ph (c,d) and ph (e,f). the negative surfaces are represented in red and the positive surfaces in blue. the models and isopotentiai contours were produced with insight h and delphi (biosym technologies, san diego). the ph-dependent charge sets were computed with titra (to be published). chen and arnold, ) . when dissolving subtilisin e in % dimethylformamide (dmf) the kcat/k m for the model substrate suc-ala-ala-pro-met-p-nitroanilide drops -fold. after ten mutations were introduced, the activity in dmf was restored almost to the level of the native enzyme in water. all metabolic conversions in micro-organisms are carried out directly or indirectly by proteins. our ability to manipulate single genes has opened up for the actual control of such processes. we may alter the efficacy of a certain pathway or we may introduce totally new pathways. thus, escherichia coli can be modified in such a way that one can use i>glucose in the e. coli based manufacture of hydroquinone, benzoquinone, catechol and adipic acid (dell and frost, ; draths and frost, ; frost, ) . presently such compounds are produced through organic chemical synthesis using aromatics as one of the reactants. the prospect of producing the same compounds using only microbes and glucose thus has some obvious environmental benefits. we expect to see a virtual surge in the engineering of microorganisms towards the production of rare chemical or biochemical compounds or compounds for which the current synthetic route is costly either economically or from an environmental perspective. the perspective of designing and producing functional protein molecules from scratch is extremely attractive to many visionary scientists. some central questions arise: do we know enough to undertake such tasks, and what goals can we define? screening mutation studies of protein interfaces show that the majority of mutations reduce activity or binding affinity (cunningham and wells, ) , indicating that most proteins already represent highly optimised designs. the groups active in this area have aimed at constructing certain -dimensional folds such as the four helix bundle (felix) (hecht et al., ) and histidine-based metal binding sites (arnold, ) and even the observation of limited enzymatic activity is regarded as a successful result . protein de novo design of helix bundles may even follow a very simple binary pattern of polar and nonpolar amino acids as was concluded in a study of four-helix bundle proteins (kamtekar et al., ) . the helix-helix contact surfaces are mainly hydrophobic, whereas the solvent exposed regions are hydrophilic. many variants conforming to this hydrophobic pattern were generated and two of these proteins were stabilised with . and . kcal tool -~ relatively to the unfolded form, thus approaching what is found for many natural proteins. the authors suggest that such a binary pattern may have been important in the early stages of evolution. in our laboratory we have results supporting this conclusion for the trypsin family of proteins, which is predominantly in a / -strand based fold . fusion and hybrid proteins may be produced by fusing the genes or gene fragments including a proper linking region between the two genes (argos, ) . this in principle may allow for combining properties from two different proteins. thus artificial bifunctional enzymes have been produced by fusing the genes for the proteins, e.g., / -galactosidase and galactokinase (bulow, ). in a recent paper an elegant hybrid protein concept is described. a hybrid antibody fragment was designed to consist of a heavy-chain variable domain from one antibody connected through a linker region of - residues to a short lightchain variable domain from another antibody (holliger et al., ) . the antibody fragments displayed similar binding characteristics as the parent antibodies. the prospect of engineering multifunctional antibodies for medical applications is imminent. a hybrid protein between the glucose transporter and the n-acetylglucosamine transporter of e. coli have been produced. the two proteins displayed % residue identity. the hybrid protein consisted of the putative transmembrane do-main from the glucose transporter and the two hydrophilic domains from the n-acetylglucosamine transporter. the hybrid protein was, somewhat surprisingly, still specific for glucose (hummel et al., ) . interestingly, several naturally occurring proteins themselves seem to have originated through gene fusion. in the case of hexokinase it is proposed that it originated from a duplication of the glucokinase gene maintaining even the gene organisation (kogure et al., ) . several other proteins such as receptor proteins of the insulin family can best be understood as gene fusion products of a kinase domain onto the rest of the receptor (which in itself may consist of several fragments). with potential medical applications, proteinnucleic acid hybrids have been constructed, where the nucleic acid fragment complemented the sequence of a fragment of mrna that the rnase should be targeted towards. the results obtained confirmed that this approach indeed worked (kanaya et al., ) . the potentials for generating anti-viral agents against, e.g., hiv are obvious. as a consequence of the enormous growth in our understanding of molecular biology and material technology, a new technological sector is emerging which takes aim at exploring the possible advantages in creating micro-machines and switchable molecular entities. this concept is currently known as nano technology (birge, ) . two concepts that we find particularly interesting are described briefly below. rhodopsin is a very ancient molecular construct -we find rhodopsin like molecules in a range of roles, all of them associated with its membrane location. proton transport and receptor functions are particularly interesting. bacteriorhodopsin from halobacterium halobium maintains a large ph gradient across the bacterial membrane. this protein complex is coloured, and its colour can be changed by exposing the protein to light of an appropriate frequency. the lifetime of the excited state can be adjusted by adjusting the physical chemical parameters of the medium the rhodopsin is embedded in (birge, ) . this protein can be used as a molecular switch in a very broad sense, e.g., as part of a high density memory device. however, changing the colour of a protein molecule is just one example that could be considered. another molecular based switch concept involves the transfer of a molecular ring (paraquat-derived rotaxane ring) between two binding sites (bradley, ) . currently the transfer is induced by a solvent change, but it is believed that an electrochemical transfer mechanism can be developed as well. similar concepts can probably also be developed for proteins. the present paper reviews some of many new developments in protein engineering. the review is not exhaustive -it is simply not possible to do this properly within the limits of this paper. we have tried to review some selected scientific areas of key importance for protein engineering, such as the validity of protein sequence information as well as structural information. sometimes the translation of a gene sequence to amino acid sequence is not trivial -a range of posttranscriptional editing and splicing events may occur, leading to a functional protein, where the amino acid sequence cannot be directly deducted from the gene sequence. in addition, posttranslational modification may provide triggers for other parts of the cells molecular machinery. we are thus in a situation where the full benefits and profits from projects such as the human genome project may escape us for a while. we have covered some of the recent developments in the modelling of protein structure by homology, which we regard as one of the most strategic areas of development. we will be flooded with sequence information deducted from gene sequences, and in the cases where the deducted amino acid sequences are assumed valid, we have to use homology based structure prediction in most cases. given that the number of protein structure families is expected to be limited the task is durable. here we should again caution the reader. we have no a priori reason to assume that non-soluble proteins, such as structural proteins, have structures that can be predicted from our limited library of mostly globular, soluble proteins. some structural proteins are gigantic, the cuticle collagen in the riftia worms from deep sea hydrothermal vents have a molecular mass of . kda (gaill et al., ) . it is extremely unlikely that a -d structure at atomic resolution of such a protein will ever be determined using methods we have available today. nmr has emerged with surprising speed as a structure determination tool. many excellent reviews have been written on this topic. we have decided to direct the readers attention to some recent developments that we believe will be of significant importance to the usage of nmr in protein engineering projects. the potential of using nmr to study the solvent exposed outer shell of larger proteins, that by far exceed the kda limit mentioned earlier is intriguing. this is particularly so, since most functionality of a protein is a feature of exactly the residues in the outer shell. thus, we can 'peel' the protein, and thereby isolate the spectral information that pertains to the surface only. this simplifies the spectra, and in some cases even allows for a partial assignment of specific residues. recent developments in ph-dependent protein electrostatics have been given special attention here. the similarities and differences within a family of structurally related proteins can only be understood if we are capable of interpreting the consequences of the substitutions, insertions and deletions that mostly occur at the surface of the proteins. when such changes are found and they involve charged residues, this will effect the extent or polarity of the electrostatic fields that the protein molecule is embedded in. we believe that the consequences of charge mutations to a large extent can be predicted through the use of ph-dependent electrostatics although practical examples are still lacking. to our knowledge the results on the electrostatic consequences of the lid motion in the human pancreatic lipase (vide supra) are among the first such reported. the story of molecular biology is continuously unfolding -and our understanding of our own biology, development and evolution is becoming ever deeper and more detailed. but we are also, once again, discovering that one of the many qualities of nature is endless complexity. protein data bank. crystallographic databases -information content, software systems, scientific applications. bonn/cambridge/chester, data commission of the international union of crystallography modification of trypanosoma brucei mitochondrial rrna by posttranscriptional ' polyuridine tail formation significance of similarities in protein structures (in abstracts of the th annual meeting of the protein engineering society of japan) scanning tunneling microscopy of biological macromolecular structures coated with a conducting film an investigation of oligopeptides linking domains in protein tertiary structures and possible candidates for general gene fusion engineering proteins for nonnatural environments solid-state c nmr study of a transglutaminaseinhibitor adduct structural engineering of the hiv- protease molecule with a/ -turn mimic of fixed geometry the swlss-prot protein sequence data bank polymers made to measure alscript: a tool to format multiple sequence alignments pka's of ionizable groups in proteins: atomic detail from a continuum electrostatic model multiple-site titration curves of proteins: an analysis of exact and approximate methods for their calculation mlev- -based two-dimensional homonuclear magnetization transfer spectroscopy predicting the conformation of proteins. man versus machine diffusion-controlled macromolecular interactions the protein data bank: a computer-based archival file for macromolecular structures protonation of interacting residues in a protein by a monte carlo method: application to lysozyme and the photosynthetic reaction center of rhodobacter sphaeroides free energy via molecular simulation: application to chemical and biomolecular systems research and perspectives catching a common fold seleno protein synthesis: an expansion of the genetic code protein structures from distance inequalities secondary structure prediction for modelling by homology inverted protein structure prediction a method to identify protein sequences that fold into a known three-dimensional structure will future computers be all wet? coherence transfer by isotropic mixing: application to proton correlation spectroscopy a photochemically induced dynamic nuclear polarization study of denatured states of lysozyme three-dimensional structure of the human class ii histocompatibility antigen hla-dr an empirical energy function for threading protein sequence through the folding motif preparation of artificial bifunctional enzymes by gene fusion solidstate nmr assessment of enzyme active center structure under nonaqueous conditions forward to the fundamentals study of the tryptophan residues of lysozyme using h nuclear magnetic resonance rna duplexes guide base conversions ph dependence of relaxivities and hydration numbers of gadolinium(ill) complexes of linear amino carboxylates ih nmr studies of human c a anaphylatoxin in solution: sequential resonance assignments, secondary structure, and global fold tuning the activity of an enzyme for unusual environments: sequential random mutagenesis of subtilisin e for catalysis in dimethylformamide proteins. one thousand families for the molecular biologist a correlation-coefficient method to predicting protein-structural classes from amino acid compositions solid-state nmr determination of intra-and intermolecular p- c distances for shikimate -phosphate and [ -i c]glyphosate bound to enolpyruvylshikimate- -phosphate synthase four-dimensional c/ c-edited nuclear overhauser enhancement spectroscopy of a protein in solution: application to interleukin / high-resolution three-dimensional structure of interleukin / in solution by three-and four-dimensional nuclear magnetic resonance spectroscopy origins of structural diversity within sequentially identical hexapeptides comparison of three algorithms for the assignment of secondary structure in proteins: the advantages of a consensus assignment extracting the information -sequence analysis software design evolves conformations of folded proteins in restricted spaces prediction of protein folding from amino acid sequence over discrete conformation spaces comparison of a structural and a functional epitope electrostatics in biomolecular structure and dynamics identification and removal of impediments to biocatalytic synthesis of aromatics from d-glucose: rate-limiting enzymes in the common pathway of aromatic amino acid biosynthesis the crystal and molecular structure of the rhizomucor miehei triacylglyceride lipase at . a resolution real-space refinement of the structure of hen egg white lysozyme dominant forces in protein folding complete assignment of aromatic h nuclear magnetic resonances of the tyrosine residues of hen lysozyme stein and moore award address. reconstructing history with amino acid sequences the comings and goings of homing endonucleases and mobile introns multim -tools for multiple sequence analysis genomic direction of synthesis during plasmid-based biocatalysis free radical induced nuclear magnetic resonance shifts: comments on contact shift mechanism prediction of protein folding class from amino acid composition modeling of the electrostatic potential field of plastocyanin three-dimensional profiles for analysing protein sequence -structure relationships a method to configure protein side-chains from the main-chain trace in homology modelling structure of pentameric human serum amyloid p component nuclear magnetic resonance fourier transform spectroscopy (nobel lecture) probing protein structure by solvent pertubation of nuclear magnetic resonance spectra molecular nanotechnology low resolution solution structure of the bacillus subtilis glucose permease iia domain derived from heteronuclear three-dimensional nmr spectroscopy alternative readings of the genetic code enzyme structure and mechanism. freeman protein engineering enzyme crystal structure in a neat organic solvent ih, c and lsn nmr backbone assignments of the -residue serine protease pb from bacillus alcalophilus polypeptide -metal cluster connectivities in metallothionein by novel i h- cd heteronuclear two-dimensional nmr experiments design and use of heterologous microbes for conversion of d-glucose into aromatic chemicals. enzyme engineering xii molecular characterization of the cuticle and interstitial collagens from worms collected at deep sea hydrothermal vents the protein identification resource (pir) faster superoxide dismutase mutants designed by enhancing electrostatic guidance self-assembling organic nanotubes based on a cyclic peptide architecture multiple-site titration and molecular modeling: two rapid methods for computing energies and forces for ionizable groups in proteins calculation of the total electrostatic energy of a macromolecular system: solvation energies, binding energies, and conformational analysis the inclusion of electrostatic hydration energies in molecular mechanics calculations calculations of electrostatic potentials in an enzyme active site calculating the electrostatic potential of molecules in solution: method and error assessment improved alignment of weakly homologous protein sequences using structural information rna editing in plant mitochondria and chloroplasts human genetic diseases due to codon reiteration: relationship to an evolutionary mechanism the influence of hydration on the conformation of lysozyme studied by solid-state c-nmr spectroscopy three-dimensional fourier spectroscopy. application to high-resolution nmr invasive introns enzyme function in organic solvents analysis of ordered arrays of adsorbed lysozyme by scanning tunneling microscopy specific cleavage of pre-edited mrnas in trypanosome mitochondrial extracts treatment of electrostatic effects in macromolecular modeling de novo design, expression and characterization of felix: a four-helix bundle protein of native like sequence converting trypsin to chymotrypsin: the role of surface loops identification of native protein folds amongst a large number of incorrect models. the calculation of low energy conformations from potentials of mean force nuclear magnetic relaxation in aqueous solutions of the gd(hedta) complex proton magnetic relaxation dispersion in aqueous glycerol solutions of gd(dtpa) -and gd(dota) engineered metalloregulation in enzymes rna editing of ampa receptor subunit giur-b: a base-paired intron-exon structure determines position and efficiency protein splicing removes intervening sequences in an archaea dna polymerase the role of the a-helix dipole in protein function and structure diabodies': small bivalent and bispecific antibody fragments globin fold in a bacterial toxin a database of protein structure families with common folding motifs proton nuclear magnetic resonance assignment and surface accessibility of tryptophan residues in lysozyme using photochemically induced dynamic nuclear polarization spectroscopy a functional protein hybrid between the glucose transporter and the n-acetylglucosamine transporter of escherichia coli classical electrodynamics synthesis, structure and activity of artificial, rationally designed catalytic polypeptides a new approach to protein fold recognition engineering stability of the insulin monomer fold with application to structure-activity relationships dictionary of protein secondary structure: pattern recognition of hydrogenbonded and geometrical features protein design by binary patterning of polar and nonpolar amino acids a hybrid ribonuclease h. a novel rna cleaving enzyme with sequence-specific recognition four-dimensional heteronuclear triple-resonance nmr spectroscopy of interleukin- / in solution two-dimensional spectroscopy: background and overview of the experiments orientation of the valine- side chain of the gramicidin transmembrane channel and implications for channel functioning. a h nmr study co-crystal structure of tbp recognizing the minor groove of a tata element crystal structure of a yeast tbp/tata-box complex two-dimensional h nmr studies of histidine-containing protein from escherichia coli. secondary and tertiary structure as determined by nmr hhaimethyltransferase flips its target base out of the dna helix the solution structure of the human retinoic acid receptor-/ dna-binding domain crystal structure of porcine ribonuclease inhibitor, a protein with leucine-rich repeats evolution of the type ii hexokinase gene by duplication and fusion of the glucokinase gene with conservation of its organization determinants of ca + permeability in both tm and tm of high affinity kainate receptor channels: diversity by rna editing crystal structure at . ,~ resolution of hiv- reverse transcriptase complexed with an inhibitor the asymmetric distribution of charges on the surface of horse cytochrome c molscript: a program to produce both detailed and schematic plots of protein structures atomic model of plant light-harvesting complex by electron crystallography biocatalysis in the gas phase procheck: a program to check the stereochemical quality of protein structures a new procedure for the detection and evaluation of similar substructures in proteins quantification of secondary structure prediction improvement using multiple alignments molecular dynamics of macromolecules in water direct observation of reverse transcriptases by scanning tunneling microscopy on the ionization of proteins long-range surface charge-charge interactions in proteins assessment of protein models with three-dimensional profiles improving the sensitivity of the sequence profile method brownian dynamics simulations of diffusional encounters between triosephosphate isomerase and glyceraldehyde phosphate: electrostatic steering of glyceraldehyde phosphate conformational flexibility of aqueous monomeric and dimeric insulin: a molecular dynamics study crystal structure of the dsba protein required for disulphide bond formation in vivo electrostatic effects in proteins dynamics of proteins and nucleic acids inter-tryptophan distances in rat cellular retinol binding protein ii by solid-state nmr a molecular model for cinnamyl alcohol dehydrogenase, a plant aromatic alcohol dehydrogenase involved in lignification adaptive evolution of highly mutable loci in pathogenic bacteria automated protein structure data bank similarity searches and their use in molecular modeling with development of pseudoenergy potentials for assessing protein -d- -d compatability and detecting weak homologies brownian dynamics of cytochrome c and cytochrome c peroxidase electron transfer proteins molecular dynamics of ferrocytochrome c. magnitude and anisotropy of atomic displacements an analysis of incorrectly folded protein models. implications for structure predictions characterization of recombinant human farnesyl-protein transferase: cloning, expression, farnesyl diphosphate binding, and functional homology with yeast prenyl-protein transferases fast structure alignment for protein databank searching identification and classification of protein fold families direct solution of the poisson equation for biomolecules of arbitrary shape, polarizability density, and charge distribution prediction of protein structure by evaluation of sequencestructure fitness. aligning sequences to contact profiles derived from three-dimensional structures environment-specific amino acid substitution tables: tertiary templates and prediction of protein folds sh and sh domains rapid and sensitive sequence comparison with fastp and fasta improved tools for biological sequence comparison gene duplication and the origin of trypsin protein engineering -new or improved proteins for mankind nmr identification of protein surfaces using paramagnetic probes multidisciplinary cycles for protein engineering: site-directed mutagenesis and x-ray structural studies of aspartic proteinases. scand the local information content of the protein structural database structure of the actin -myosin complex and its implications for muscle contraction extensive editin~ of both processed and preprocessed maxicircle cr transcripts in trypanosoma brucei sequential h-nmr assignments and secondary structure of hen egg white lysozyme in solution electrostatics and diffusional dynamics in the carbonic anhydrase active site channel identification of structural motifs from protein coordinate data: secondary structure and first-level supersecondary structure modeling protein structures: construction and their applications nmr of macromolecules. a practical approach the modelling of electrostatic interactions in the function of globular proteins electrostatic interactions in globular proteins: calculation of the ph dependence of the redox potential of cytochrome c i extracting information on folding from the amino acid sequence: consensus regions with preferred conformation in homologous proteins prediction of protein secondary structure at better than % accuracy secondary structure prediction of all-helical proteins in two states phd -an automatic mail server for protein secondary structure prediction progress in protein structure prediction? predicting protein secondary structure with a nearest-neighbor algorithm database of homologyderived protein structures and the structural meaning of sequence alignment an winexpensive, versatile sample illuminator for photo-cidnp on any nmr spectrometer pancreatic lipases: evolutionary intermediates in a positional change of catalytic carboxylates? a workbench for multiple alignment construction and analysis a new approach to the design of stable proteins electrostatic interactions in macromolecules: theory and applications the electrostatic potential of the alpha helix electrostatic effects in myoglobin. hydrogen ion equilibria in sperm whale ferrimyoglobin point charge distributions and electrostatic steering in enzyme/substrate encounter: brownian dynamics of modified copper/zinc superoxide dismutases boltzmann's principle, knowledge based mean fields and protein folding recognition of errors in three-dimensional structures of proteins describing protein structure: a general algorithm yielding complete helicoidal parameters and a unique overall axis electrostatic screening in molecular dynamics simulations electrical potentials in trypsin isozymes rna editing in brain controls a determinant of ion flow in glutamate-gated channels empirical correlation between protein backbone conformation and c a and ct c nuclear magnetic resonance chemical shifts an automated method for modeling proteins on known templates using distance geometry a model for electrostatic effects in proteins difference imaging of adenovirus: bridging the resolution gap between x-ray crystallography and electron microscopy semianalytical treatment of solvation for molecular mechanics and dynamics sequencespecific h and n resonance assignment for human dihydrofolate reductase in solution posttranslational modification of protein by tyrosine sulfation: active sulfate paps is the essential substrate for this modification finding your fold (commentary) theory of protein titration curves. i. general equations for impenetrable spheres interpretation of protein titration curves. application to lysozyme molecular cloning of an apolipoprotein b messenger rna editing protein fragment ranking in modelling of protein structure. conformationally constrained environmental amino acid substitution tables three-dimensional cryo-electron microscopy of the calcium ion pump in the sarcoplasmic reticulum membrane biocatalysis in non-conventional media total chemical synthesis, characterization, and immunological properties of an mhc class i model using the tasp concept for protein de novo design doughnut-shaped structure of a bacterial muramidase revealed by x-ray crystallography structure determination of the cyclohexene ring of retinal in bacteriorhodopsin by solid-state deuterium nmr nicotinic acetylcholine receptor at ~, resolution calculations of electrostatic properties in proteins interfacial activation of the lipase -procolipase complex by mixed micelles revealed by x-ray crystallography structure of the pancreatic lipase -colipase complex a novel search method for protein sequence -structure relations using property profiles nmr investigations of protein structure prospects for nmr of large proteins protein structures in solution by nuclear magnetic resonance and distance geometry theoretical studies of enzymic reactions calculation of electrostatic interactions in biological systems and in solution how do serine proteases really work? calculation of the electric potential in the active site cleft due to a-helix dipoles molecular dynamics effects on protein electrostatics homonuclear two-dimensional h nmr of proteins. experimental procedures calculation of chemical shifts of protons on alpha carbons in proteins three-dimensional profiles from residue-pair preferences: identification of sequences with beta/alpha-barrel fold structure of human pancreatic lipase the chemical shift index: a fast and simple method for the assignment of protein secondary structure through nmr spectroscopy reengineering the specificity of a serine active-site enzyme. two active-site mutations convert a hydrolase to a transferase detection of secondary structure elements in proteins by hydrophobic cluster analysis model ion channels: gramicidin and alamethicin nmr of proteins and nucleic acids in vitro protein splicing of purified precursor and the identification of a branched intermediate on the calculation of pka's in proteins molecular cloning of cdna coding for rat plasma glutathione peroxidase a new method for computing the macromolecular electric potential an optimization approach to predicting protein structural class from amino acid composition a weighting method for predicting protein structural class from amino acid composition we want to thank christian cambillau, cnrs, marseille, for kindly providing us with pre-release -d data of human pancreatic lipase, jerry h. brown, harvard university, for sending us a prerelease dataset for the hla ii structure, alwyn jones, uppsala university, for pre-release -d data of candida antarctica b lipase, and johnmccarthy, brookhaven national laboratory, for helping us with data on previous pdb releases. the french norwegian foundation (fns ) and the norwegian research council (bp ) have contributed with financial support to some of the research activities described in this paper. a.b. and p.m. thank junta nacional de investi-ga~o cientlfica, portugal, for their grants. key: cord- -k m uxw authors: braun, elisabeth; sauter, daniel title: furin‐mediated protein processing in infectious diseases and cancer date: - - journal: clin transl immunology doi: . /cti . sha: doc_id: cord_uid: k m uxw proteolytic cleavage regulates numerous processes in health and disease. one key player is the ubiquitously expressed serine protease furin, which cleaves a plethora of proteins at polybasic recognition motifs. mammalian substrates of furin include cytokines, hormones, growth factors and receptors. thus, it is not surprising that aberrant furin activity is associated with a variety of disorders including cancer. furthermore, the enzymatic activity of furin is exploited by numerous viral and bacterial pathogens, thereby enhancing their virulence and spread. in this review, we describe the physiological and pathophysiological substrates of furin and discuss how dysregulation of a simple proteolytic cleavage event may promote infectious diseases and cancer. one major focus is the role of furin in viral glycoprotein maturation and pathogenicity. we also outline cellular mechanisms regulating the expression and activation of furin and summarise current approaches that target this protease for therapeutic intervention. the human genome encodes more than proteases. these molecular scissors play important roles in essentially all physiological processes. they digest the proteins in our food, degrade misfolded or unwanted proteins and regulate the trafficking and activity of numerous cellular factors. proteolytic cleavage is certainly one of the most important post-translational modifications, generating a plethora of bioactive proteins and peptides with key roles in cell proliferation, immunity and inflammation. not surprisingly, mutations in proteases and/or aberrant protease activity are associated with numerous pathological processes including cancer, cardiovascular disorders and autoimmune diseases. intriguingly, also many viral pathogens exploit cellular proteases for the proteolytic processing and maturation of their own proteins. similarly, activation of bacterial toxins frequently requires cleavage by proteases of the infected or intoxicated host. in recent years, modulation of protease activity has therefore emerged as a potential therapeutic approach in a variety of infectious and noninfectious diseases. one particularly promising target for therapeutic intervention is the cellular protease furin. this protease most likely cleaves and activates more than mammalian, viral and bacterial substrates. among them are viral envelope glycoproteins and bacterial toxins, as well as cellular factors that promote tumor development and growth if they are hyperactivated ( figure ). in this review, we summarise our current knowledge of furinmediated protein processing in health and disease with a focus on the role of furin in viral protein processing. furthermore, we describe cellular mechanisms regulating furin activity at the transcriptional and post-transcriptional level. finally, we present approaches that aim at modulating furin activity for therapeutic purposes and discuss their suitability for clinical application. furin is a member of the evolutionarily ancient family of proprotein convertases. their similarity with bacterial subtilisin and yeast kexin proteases has coined the abbreviation pcsk (proprotein convertase subtilisin/kexin type). humans encode nine members of this protease family (pcsk - ), with pcsk representing furin (table ) . pcsks are well known for their ability to activate other cellular proteins. the proteolytic conversion of inactive precursor proteins into bioactive molecules has already been described in the s. however, it took more than years until furin was identified as the first mammalian proprotein convertase. , to date, more than cellular substrates of pcsks have been described, including hormones, receptors, growth factors and adhesion molecules. although pcsks are frequently coexpressed in the same cell and may cleave the same substrates, there is no complete redundancy and the inactivation of individual pcsks results in specific knock-out phenotypes in mice. pcsk - cleave their substrates after basic residues, with the typical recognition motif k/r-x n -k/r↓ ( table ). in contrast, pcsk cleaves after nonbasic residues and is best known for its regulation of cholesterol and lipid metabolism by activating sterol regulatory element-binding protein (srebp) transcription factors. , like pcsk , pcsk also plays a key role in cholesterol metabolism as it regulates low-density lipoprotein (ldl) particle levels in the blood. however, this effect does not involve proteolytic cleavage of a specific substrate, but is mediated by a direct binding of pcsk to ldl receptors. with two fda-approved inhibitors for the treatment of hypercholesterolaemia, pcsk is also the prime example of a protease that is successfully targeted for therapy. the prototypical and best-characterised member of the pcsk family is furin/pcsk . since it cleaves basic amino acid motifs, it has also been termed pace (paired basic amino acid cleaving enzyme). furin is expressed by the fur (fes upstream region) gene on chromosome . although furin is ubiquitously expressed, its mrna and protein levels vary depending on the cell type and tissue. high levels can be found in salivary glands, liver and bone marrow, whereas muscle cells express relatively low amounts of furin. three promoters (p , p a and p b), each harbouring an alternative transcription start site, have been described ( figure ). however, the respective transcripts differ only in the first untranslated exon and are therefore predicted to express the same protein. while the p a and p b promoters resemble those of constitutively expressed housekeeping genes, the p promoter binds the transcription factor c/ebpb and can be trans-activated upon cytokine stimulation. in line with this, ifnc, tgfb, il- and pma induce furin expression. [ ] [ ] [ ] [ ] upon translation of the mrna, furin enters the secretory pathway as an inactive proenzyme and is integrated into the er membrane via its cterminal transmembrane domain ( figure ). like most type i transmembrane proteins, it harbours a short n-terminal signal peptide that is cleaved off cotranslationally. similar to other proprotein convertases, furin contains an inhibitory nterminal -amino acid propeptide, whose chaperone function is required for correct folding of the catalytic domain. during the transition of furin from the er to the trans-golgi network (tgn), the inhibitory propeptide is removed in a two-step autoproteolytic process and furin gains its enzymatic activity. at the same time, n-linked oligosaccharides are added and trimmed. although furin accumulates in the tgn, it can be further transported to the cell surface and back via the endosomal pathway. , finally, furin can also be shed and released into the extracellular space upon proteolytic separation of the catalytic domain from the membrane-bound c terminus. whether this cleavage step is mediated by furin itself or another protease remains to be determined. the presence of furin in the tgn and endosomal compartments, at the cell surface and in the extracellular space, may explain its ability to process a large variety of intra-and extracellular substrates. the canonical furin cleavage site is frequently described as r-x-k/r-r↓. however, variations of this motif may also be recognised and a stretch of amino acids surrounding the cleavage site as well as post-translational modifications determine interaction with the furin binding pocket. bioinformatic analyses and functional studies uncovered more than furin cleavage sites in mammalian proteins. these comprise growth factors and cytokines (e.g. igf , igf , tgfb, pdgfa, pdgfb, vegf-c, ngf, cxcl ), hormones (e.g. pth, trh, ghrh), adhesion molecules (e.g. integrins, vitronectin), collagens, metalloproteinases, coagulation factors, receptors, membrane channels and albumin. while most of these target factors are activated upon furin-mediated cleavage, furin also exerts inactivating cleavage steps. for example, the furin paralogue pcsk and endothelial lipase can be inactivated by furin. maturation of the cellular protease furin. furin expression is driven by three different promoters, sharing characteristics of either cytokine-activated (p ) or housekeeping gene (p a and p b) promoters. during translation, furin is integrated into er membranes and glycosylated. after the n-terminal signal peptide (red) is removed, an autocatalytic cleavage event occurs, generating a short propeptide (light blue). this propeptide remains associated with furin and acts as an intramolecular chaperone and inhibitor. after transit to the golgi complex, the propeptide is removed and glycans are trimmed before furin gains its proteolytic activity. furin accumulates in the trans-golgi network (tgn), but can also traffic to the plasma membrane and cycle between these two compartments via endosomes. proteolytic cleavage at the c terminus of furin separates the transmembrane domain (orange) from the catalytically active domain. as a result, furin can be shed into the extracellular space as an active enzyme. the physiological importance of furin is reflected by furin knock-out mice, which die at embryonic day because of cardial ventral closure defects and hemodynamic insufficiency. similarly, endothelial cell-specific knock-out of furin results in cardiac malformation and death shortly after birth. even mutations in the cleavage site of a single furin target protein may have detrimental effects and result in genetic disorders such as haemophilia b or x-linked hypohidrotic ectodermal dysplasia. , because of its pleiotropic effects, variations in furin expression levels and/or its enzymatic activity may have detrimental effects and promote the pathogenesis of a variety of disorders, including rheumatoid arthritis, amyloid dementia and cancer. furin has been termed a 'master switch of tumor growth and progression' , as its aberrant expression or activation can promote the formation and progression of various malignancies including colon carcinoma, rhabdomyosarcoma, head and neck cancers, lung, skin and brain tumors. in some cases, furin levels positively correlate with aggressiveness, and increased furin expression has been proposed as prognostic marker for advanced cancers. the oncogenic and prometastatic activity of furin has been ascribed to its ability to activate proteins that promote cell proliferation, angiogenesis, migration and tissue invasion. for example, furin cleaves and activates growth factors such as igfs, pdgfs or ngf that enhance cell proliferation and consequently tumor growth. similarly, angiogenic and lymphangiogenic factors such as vegf-c and vegf-d may be hyperactivated and promote the vascularisation and growth of solid tumors. notably, furin expression is induced by hypoxia, as all three fur promoters harbour binding sites for the hypoxia-inducible factor- (hif- ). thus, furin-mediated vascularisation may preferentially occur in otherwise growth-restricted hypoxic tumors. interestingly, hypoxia also results in subcellular relocalisation of furin to the cell surface, which may further enhance processing of growth factors and other extracellular tumorigenic precursor proteins. besides effects on tumor growth, furin can also promote migration and extravasation of malignant cells as it processes adhesion molecules mediating cell-cell and cell-matrix interactions. the cleavage of integrins may be particularly relevant as they not only mediate adhesion of cells to the extracellular matrix, but also act as signal transducers regulating cell growth, division and survival. in addition, furin activates matrix metalloproteinases (e.g. mmp ) that facilitate metastasis by degrading components of the extracellular matrix. finally, increased furin activity may also promote cancer development by suppressing protective antitumor mechanisms. for example, increased furin-mediated activation of tgfb reduces immune surveillance by promoting the development of suppressive t reg cells and inhibiting effector t-cell functions. the key role of furin in immunity is highlighted by t-cellspecific furin knock-out mice, which harbour inherently over-reactive effector t cells that secrete reduced levels of active tgfb. notably, positive feedback loops can further enhance the oncogenic potential of furin. for example, the furin substrate tgfb not only increases furin mrna expression, but also enhances its proteolytic activity by an unknown mechanism. similarly, furin enhances the secretion of ifnc, which in turn activates the fur promoter. , this mutual enhancement seems particularly important given the key role of ifnc in tumor development and progression. on the one hand, furin-driven ifnc release may have beneficial effects as it boosts the tumorlytic activity of natural killer cells and cytotoxic t lymphocytes. furthermore, ifnc may act as an antiangiogenic factor and directly inhibit tumor cell proliferation by inducing the expression of tumor suppressors such as p or p . on the other hand, however, recent evidence suggests that ifnc may also exert tumor-promoting effects, for example by selecting for immune evasive phenotypes and promoting an immunosuppressive tumor microenvironment. intriguingly, a study on laryngeal cancer patients suggests that the ifnc-furin feedback loop may be further boosted iatrogenically since radiotherapy increased furin expression in some patients. in summary, aberrant furin activation promotes several steps of cancer development, including cell proliferation, vascularisation, metastasis and antitumor immunity. however, the relative contribution of individual furin substrates to tumor progression and the role of other proprotein convertases remain largely unclear. furin may not only promote disease upon aberrant expression, but also by activating a variety of pathogen-derived proteins. one example for cleavage of unwanted proteins is the proteolytic activation of bacterial toxins. particularly, the group of ab toxins comprises several well-described furin substrates. these exotoxins are secreted by bacteria and exert their effect in the cytoplasm of the target cell. they usually consist of an enzymatically active a subunit and a b subunit that mediates membrane binding and translocation. to exert its toxic effect, the a subunit has to be separated from the membrane-associated b subunit by proteolytic cleavage. in case of diphtheria toxin and pseudomonas exotoxin a, furin cleaves r-v-r-r↓ and r-q-p-r↓ target sequences, respectively. , cleavage most likely occurs in endosomes before the a subunit translocates into the nucleus where it inhibits protein synthesis by inhibiting the elongation factor eef . in line with an important role of furin in bacterial virulence, toxicity is highest if an optimal furin cleavage site is present. similarly, furin cleaves shiga and shiga-like toxins expressed by certain shigella spp. and escherichia coli strains and enhances their ability to halt protein synthesis. although a furin target sequence is conserved among all shiga-like toxins, mutational analyses suggested that furin-mediated cleavage augments toxin activity, but is not essential. another wellcharacterised example is anthrax toxin, a threeprotein exotoxin consisting of the receptor binding protective antigen (pa) and the enzymatically active components oedema factor (ef) and lethal factor (lf). upon binding to its receptor, pa is cleaved by furin at the cell surface. this cleavage step triggers the oligomerisation of pa into a prepore that binds ef and lf. subsequently, this toxin complex is endocytosed and pa forms a channel that allows the translocation of ef and lf into the cytoplasm. although pa can be activated by different proprotein convertase family members, furin seems to be the major protease activating anthrax toxin. these examples illustrate that several bacterial pathogens exploit furin and related convertases for the activation of their exotoxins. strictly speaking, however, some toxins produced by bacteria (e.g. diphtheria toxin and shiga toxins) represent viral gene products as they are encoded by bacteriophages. in these cases, the term 'viral exotoxin' may be more appropriate. this strongly suggests that furin-mediated toxin activation confers a selection advantage to both, the bacterium and its phage. for example, induction of cell death by furin-activated toxins may promote tissue invasion, increase transmission rates (e.g. by causing diarrhoea) or suppress cellular immune responses. without the proteolytic activation of exotoxins, diseases such as dysentery or diphtheria would not occur. like bacterial and viral exotoxins, most viral envelope glycoproteins need to be proteolytically cleaved before they can mediate viral entry into host cells. in many cases, viruses exploit cellular trypsin-or subtilisin-like endoproteases for this purpose. while subtilisin-like proteases such as furin require polybasic cleavage sites, trypsin-like proteases also recognise monobasic motifs and cleave after single arginine or lysine residues. notably, the dependency on specific proteases can also be an important determinant of tissue tropism and viral spread in an infected organism. for example, avirulent newcastle disease virus (ndv) strains harbour a monobasic cleavage site in their fusion (f) protein and result only in local infections (mainly in the respiratory tract) since expression of the respective host proteases is limited to a few cell types. in contrast, the f proteins of virulent ndv strains can be cleaved by furin or related proprotein convertases that are ubiquitously expressed. consequently, these viruses are able to spread systemically and cause high rates of mortality in infected birds. another well-described example is the cleavage of influenza a virus hemagglutinin (ha). in contrast to low pathogenic avian influenza a viruses, their highly pathogenic counterparts harbour a polybasic furin cleavage site in the ha protein. thus, the ability of viruses to exploit furin may have drastic effects on their pathogenicity. to date, furin-mediated cleavage has been described for envelope glycoproteins encoded by numerous evolutionarily diverse virus families, including herpes-, corona-, flavi-, toga-, borna-, bunya-, filo-, orthomyxo-, paramyxo-, pneumoand retroviridae (table ) . although viral furin substrates generally harbour the canonical polybasic cleavage site, timing and subcellular localisation of furin-mediated activation may differ substantially between virus families. since furin and viral glycoproteins both enter the secretory pathway, proteolytic activation can occur at different steps of the viral replication cycle. while the envelope proteins of some viruses are cleaved in the producer cell, others are processed in the extracellular space or during entry into their target cells (figure ). the following sections highlight the characteristics of a few selected viral glycoproteins, their processing by furin and the importance of furin-mediated cleavage for infection and pathogenicity. retroviral glycoprotein trimers, such as those of human immunodeficiency, rous sarcoma or murine leukaemia viruses, are proteolytically processed and activated in the producer cells (figure a , left panel). in case of hiv- , the gp precursor of the viral envelope protein (env) is cleaved into gp and membrane-anchored gp that remain associated through noncovalent interactions. cleavage occurs in intracellular compartments, before the assembly of virions at the plasma membrane. notably, proteolytic processing of env depends on correct n-linked glycosylation as aberrant carbohydrate side chains may result in subcellular mistrafficking or sequestration of env. most likely, hiv- takes advantage of the redundancy of several proprotein convertases recognising the polybasic cleavage motif in env. furin, pcsk , pcsk and pcsk have all been shown to cleave gp in cells, albeit with different efficiencies. notably, in vitro cleavage experiments using recombinant proteases did not always reflect cleavage efficiency in transfected cells and the relative contribution of individual pcsks to hiv- maturation in vivo remains unclear. interestingly, hiv- env harbours a second polybasic cleavage site, about eight amino acid residues upstream of the major one. although cleavage at this site does not result in fusiogenic env species, about % of all gp molecules are cleaved at this position, at least in case of the cellculture-adapted hiv- clone lai. , in some cases, gp escapes intracellular cleavage and may be incorporated as an unprocessed precursor into budding virions. whether these env molecules may be processed extracellularly by shed furin is unknown. in this context, it is noteworthy that membrane-bound plasmin has been shown to convert extracellular gp into gp and gp . however, it remains to be determined whether plasmin-mediated cleavage results in fully infectious hiv- particles. influenza a virus hemagglutinin (ha) can be cleaved and primed by a variety of cellular proteases. even bacterial proteases may promote influenza virus spread by cleaving ha into ha and ha . both subunits remain linked via disulphide bonds and form trimeric structures. while ha binds to the sialic acid receptor on viral target cells, ha harbours the fusion peptide that mediates fusion of viral and cellular endosomal membranes. ha cleavage can occur within producer cells, upon release of virions from infected cells or directly prior to entry into new target cells. as a general rule, hemagglutinins of mammalian and low pathogenic avian influenza a viruses cannot be cleaved by furin as they usually only harbour a mono-or dibasic cleavage site. instead, they depend on trypsin-like proteases such as transmembrane protease serine s member (tmprss ) or human airway trypsin-like protease (hat). expression of such trypsin-like proteases is largely restricted to the respiratory and gastrointestinal tract. in contrast, h and h hemagglutinins of a large number of highly pathogenic avian influenza a viruses (hpaiv) can be cleaved by furin or pcsk , which are present in many cell types. , this is because they acquired a polybasic cleavage site upon insertion of additional lysine and/or arginine residues. duplication of lysine and arginine residues in ha is facilitated by polymerase slippage as these amino acids are encoded by purine-rich codons. instead of the prototypical r-x-k/r-r↓ motif, some hpaivs harbour a suboptimal k-x-k/r-r↓ cleavage motif. proteolytic cleavage at this site is only efficient if additional positively charged amino acids upstream of this cleavage motif are present or if attachment sites for masking oligosaccharide chains are missing. , notably, a subset of h n lowly pathogenic avian influenza a virus strains also harbour r-s-k-r↓ or r-s-r-r↓ sites that are not only cleaved by trypsin-like proteases, such as tmprss or hat, but also by pcsks. however, their cleavage is only efficient in the presence of very high amounts of furin or upon mutation of a glycosylation site in ha. thus, the ability to exploit furin for efficient ha cleavage and the associated increase in pathogenicity are not only determined by the presence of a furin consensus target site, but also by adjacent residues and the absence of masking oligosaccharide chains. furin cleaves the glycoproteins (gps) of marburg virus (marv) and all five ebolavirus species into a large n-terminal subunit (gp ) that mediates receptor binding and a small membrane-anchored c-terminal part (gp ) that contains the fusion , the viral envelope (env) glycoprotein precursor (green) migrates through the er to the golgi complex where it is cleaved by furin (pink scissors) into the functional mature env glycoprotein (blue). processed env glycoproteins are transported to the cell surface and incorporated into assembling viral particles. in contrast, dengue viruses bud into the er lumen and incorporate the uncleaved premembrane protein (prm) (right panel). during virus particle transit through the secretory pathway, virion-associated prm proteins can be cleaved by furin (dark blue to light blue). (b) some prm molecules escape furin-mediated cleavage in the producer cells resulting in the release of immature or partially mature dengue virus particles. in this case, processing can also occur in endosomes of new target cells, upon receptor-mediated endocytosis (left panel). during human papillomavirus (hpv) infection (right panel), attachment to heparan sulphate proteoglycans induces a conformational change that allows proteolytic processing of the minor capsid protein l (red) by furin, which is present at the cell surface. furin processing induces a structural rearrangement that allows binding to a secondary receptor and subsequent receptor-mediated endocytosis. peptide. , marv and human pathogenic ebolavirus species harbour canonical furin cleavage sites (r-x-k/r-r↓). in contrast, the gp of the closely related reston virus, which causes asymptomatic infections in humans, is processed less efficiently by furin as it carries the suboptimal cleavage site k-q-k-r↓. surprisingly, however, uncleavable gp mutants of highly pathogenic ebola virus (ebov) are able to mediate infection and furin-mediated cleavage is not required for replication in cell culture. furthermore, an ebov mutant lacking the furin cleavage site replicated efficiently in nonhuman primates and showed no differences in disease progression or lethality compared to wild-type viruses. thus, the high conservation of the furin cleavage site among different ebolavirus species is surprising and it remains to be determined whether furin-mediated gp processing plays a role in the natural reservoir hosts of these viruses. notably, the ebov gp gene harbours an rna editing site sequence and may not only express fulllength gp, but also a soluble form of the glycoprotein (pre-sgp) that lacks the c-terminal transmembrane domain. intriguingly, pre-sgp harbours another furin recognition site and is cleaved into mature sgp and a short so-called d peptide. both of them are ultimately released from infected cells. although pre-sgp is produced in higher amounts than gp, its role in viral replication is under debate. among others, sgp has been suggested to serve as a decoy antigen, to act as a structural substitute for gp and to induce apoptosis of uninfected lymphocytes. flaviviruses flavivirus rna is translated into a single large polyprotein that is cleaved by cellular and viral proteases into all structural and nonstructural proteins of the virus. the structural proteins comprise the envelope proteins prm and e that are incorporated as prm/e heterodimers into budding virions. prm acts as a chaperone and facilitates correct folding of the e glycoprotein. many flaviviruses bud into the lumen of the er and enter the secretory pathway. in the acidic milieu of the trans-golgi network (tgn), a furin cleavage site is exposed and prm can be cleaved into mature pr and m proteins. thus, furinmediated cleavage of the viral glycoprotein occurs only after its incorporation into newly formed virions (figure a, right panel) . this is in contrast to viruses such as hiv, whose envelope proteins are cleaved before assembly. the pr peptide remains associated with the e protein until the virion is released from the cell, thereby preventing premature unintended fusion with membranes of the producer cell. furin-mediated prm cleavage is essential for replication of flaviviruses such as tick-borne encephalitis or dengue virus. , in some cases, prm molecules escape furin processing in the producer cell and result in the release of immature or partially mature viral particles. partially mature virions are still infectious since low amounts of mature m are sufficient to mediate fusion with target cell membranes. furthermore, uncleaved prm may participate in virion attachment to target cells and can be cleaved by furin during the entry process, in the acidic milieu of endosomes , (figure b , left panel). the relative contribution of prm processing during viral entry into new target cells, however, remains to be determined. the ratio of mature to immature prm depends on a variety of factors, including the producer cell type and the flavivirus species. for example, dengue viruses are known to release high amounts of immature or partially mature viruses, most likely because of a conserved acidic residue within the furin recognition site (table ) . importantly, the content of prm in viral particles also affects antibody recognition and consequently antibody-dependent enhancement of dengue virus infection. thus, furin-mediated protein processing may once again markedly affect the outcome of infection. while furin plays a key role in activating envelope glycoproteins of a variety of viruses, its activity is also exploited for the cleavage of other viral proteins. one example is the cleavage of the l protein of papillomaviruses. together with the major capsid protein l , this minor capsid protein builds the viral capsid. the furin cleavage site is located close to the n terminus of l and highly conserved among different human papillomavirus (hpv) strains. cleavage is not required for virus assembly or release, but essential for infection of new target cells. for example, lovo and cho cells lacking furin expression are completely resistant to infection with pseudoviruses of hpv , , which is one of the high-risk hpv types causing cervical cancer. in contrast to flavivirus prm, which can be cleaved during egress and entry, papillomavirus l seems to be exclusively cleaved on target cells. a model has been proposed, in which attachment of papillomaviruses to heparan sulphate proteoglycans induces a conformational change in l that exposes the polybasic cleavage site. upon proteolytic processing of l , l may engage a secondary cellular receptor and mediate infection (figure b, right panel) . furthermore, interaction of cleaved l with an unknown intracellular receptor may be required for escape of l from the endosomal compartment and its ability to escort viral dna into the nucleus. this illustrates that also nonenveloped viruses have evolved the ability to exploit furin or related pcsks for their own purposes. another nonenvelope protein that is cleaved by furin is the external core antigen (hbeag) of hepatitis b virus (hbv). , cleaved hbeag is secreted from infected cells and exerts immunosuppressive effects. it has been suggested to act as a t-cell tolerogen that prevents killing of infected hepatocytes by cytotoxic t lymphocytes. in contrast, uncleaved hbeag may have the opposite effects as it is transported to the plasma membrane where it can trigger antiviral immune responses. thus, furin-mediated cleavage of hbeag may affect the outcome of infection. intriguingly, han chinese frequently harbour a single nucleotide polymorphism in the p promoter of the fur gene that is associated with increased risk of developing persistent hbv infection with detectable amounts of hbeag in the serum. this polymorphism increases the binding efficiency of the hepatic transcription factor nf-e , thereby most likely increasing furin expression. whether the observed increase in hbv persistence is the result of increased hbeag processing and/or other effects of furin remains to be determined. viral pathogens and their hosts are in a continuous arms race. although viruses have evolved sophisticated strategies to exploit the metabolism, protein synthesis and trafficking pathways of an infected cell, the host is not defenceless. besides innate and adaptive immune responses that directly target components of the virus, infected cells may also restrict viral spread by limiting the availability of cellular factors that are critical for viral replication, so-called 'virus dependency factors'. for example, the host protein samhd restricts replication of several viral pathogens by depleting cellular dntp levels. furthermore, ifi targets the cellular transcription factor sp to suppress viral gene expression. intriguingly, accumulating evidence suggests that inhibition of the virus dependency factor furin represents another efficient and broadly active mechanism of antiviral immunity. in , aerts and colleagues found that protease-activated receptor (par ), a g-proteincoupled receptor, interferes with the expression of furin and furin-mediated processing of the human metapneumovirus f protein. follow-up experiments revealed that par harbours an r xxxxr motif that mediates interaction with several pcsks. in line with this, soluble pc a/ pcsk and pcsk cleave par at r ↓ and abrogate its ability to induce calcium signalling upon thrombin-mediated cleavage at the plasma membrane. surprisingly, however, membranebound pcsks such as furin fail to cleave par at this position. instead, furin traps par in the trans-golgi network and prevents its anterograde transport to the cell surface (figure a ). at the same time, par also blocks the proteolytic activity of furin, inhibiting for example the maturation of hiv- env. this inhibitory activity is not shared by its paralogue par , which is efficiently cleaved by furin. notably, expression of par is induced in proinflammatory environments such as the brain of hiv- infected individuals suffering from hiv-associated neurocognitive disorders (hand). thus, par mediated furin inhibition may represent a mechanism of innate immunity limiting the spread of hiv- and potentially additional furindependent viral pathogens. a similar inhibitory activity was recently described for two ifnc-inducible gtpases, termed guanylate-binding proteins and (gbp and gbp ). initially, gbp was described in a screening for novel restriction factors of hiv and shown to interfere with the maturation of the retroviral env protein. , follow-up experiments revealed that this antiviral activity is shared by its paralogue gbp and that both proteins reduce the proteolytically active amount of furin. as a result, cleavage of the env precursor gp into mature gp and gp is reduced and newly formed virions are only poorly infectious (figure b ). since many viral pathogens rely on furin or related pcsks for the maturation of their own (glyco)proteins, gbp and gbp exert broad antiviral activity, inhibiting replication of highly pathogenic avian influenza a, measles and zika viruses. in contrast, gbp and gbp do not decrease infectivity of virions carrying the glycoprotein of vesicular stomatitis virus, which does not require a proteolytic activation step. notably, inhibition of furin in infected cells comes at a cost, since furin-mediated processing of matrix metalloproteinases and other cellular substrates is also reduced in the presence of increased gbp or gbp levels. future experiments will reveal whether par- -or gbpmediated inhibition of furin activity may also prevent the development or proliferation of certain cancers. remarkably, increased gbp / expression is associated with favorable outcome in patients suffering from melanoma or breast cancer. , thus, a better understanding of cellular mechanisms regulating furin activity will help to understand the pathogenesis of infectious diseases and cancer and may uncover novel targets for therapeutic intervention. because of the key role of furin in the pathogenesis of cancer and infectious diseases, its suitability as a therapeutic target has raised significant interest for several years. many laboratories have explored the possibility to limit tumor growth, viral replication or bacterial intoxication by reducing the amount or proteolytic activity of furin. initially, most studies focused on peptides or proteins that bind to the active site of furin and inhibit substrate binding in a competitive manner. for example, a variant of the naturally occurring serine protease inhibitor a- antitrypsin was modified to harbour a consensus furin cleavage site. this variant, termed a- antitrypsin portland (a -pdx), inhibits furin and pcsk and has been shown to prevent the processing of hiv- env and measles virus f in vitro. , similarly, peptides derived from the cleavage site of influenza a virus hemagglutinin and polyarginines compete with natural furin substrates. , even exogenous addition of the autoinhibitory propeptide of furin has been shown to reduce its enzymatic activity, limiting for example the activation of mmp in breast cancer cells. , however, the therapeutic potential of the propeptide has never been evaluated in vivo and the inhibitory effects are most likely limited as it is known to dissociate from furin in the tgn. several approaches, including incorporation of d-instead of l-amino acids, have been applied to increase the stability and hence efficacy of furin inhibitors. for example, hexa-d-arginine (d r), one of the first furin inhibitors, exhibits good stability and prevents the cytotoxic effects of pseudomonas exotoxin a in vitro and in vivo. similarly, topical application of nona-d-arginine (d r) has been shown to reduce corneal damage in mice infected with pseudomonas aeruginosa. interestingly, d r also showed direct bactericidal activity, probably because of its polycationic nature. besides d-amino acids, incorporation of amino acid analogs such as decarboxylated arginine mimetics or -amidinobenzylamide (amba) has been used to increase the stability of peptide-derived furin inhibitors. furthermore, the addition of a chloromethyl ketone (cmk) moiety to the c terminus of a polybasic cleavage motif has proven useful as it results in the alkylation of the active site of furin that irreversibly blocks its enzymatic activity. nevertheless, the cytotoxicity of cmkbased inhibitors and the instability of the cmk moiety may limit their use to topical applications such as the treatment of hpv skin infections. finally, the elucidation of the crystal structure of furin enabled the targeted modelling of nonpeptidic inhibitors such as streptamine-based compounds. upon addition of guanidine residues, streptamine derivatives mimic the cationic furin cleavage site and inhibit its enzymatic activity in the nanomolar range in vitro. dahms and colleagues describe an interesting example of a , dideoxystreptamine-derived inhibitor, where two molecules of the inhibitor form a complex with furin. while the first inhibitor molecule directly interferes with the conformation of the catalytic triad, the second molecule binds to an adjacent planar peptide stretch. besides stability, the subcellular localisation of furin inhibitors is a key determinant of their efficacy in vivo. notably, optimal localisation of the inhibitor strongly depends on the processing event targeted for therapeutic intervention. to inhibit activation of anthrax toxin, for example, the inhibitor does not need to enter the cells, as . par as well as gbp and gbp reduces hiv particle infectivity by inhibiting furin-mediated env processing. human immunodeficiency virus (hiv) particles containing functional mature envelope (env) glycoproteins fuse with the plasma membrane of the target cell to release the capsid core into the host cell cytoplasm. upon reverse transcription and integration of the retroviral genome, viral gene expression is initiated. (a) in a cytokine (e.g. il- b)-induced inflammatory state, furin and protease-activated receptor (par ) expression are induced. furin and par interact with each other and are trapped as inactive proteins in the trans-golgi network (tgn). as a consequence, par cannot traffic to the cell surface, where it is usually cleaved by thrombin to induce inflammatory signalling pathways. moreover, production of infectious hiv- particles is impaired because of reduced furin-mediated cleavage of hiv env. (b) at the same time, cells of the infected host may induce the expression of interferon-stimulated genes such as guanylate-binding proteins and (gbp and gbp ). both proteins colocalise with furin and inhibit its proteolytic activity. as a result, hiv env maturation is impaired and newly forming viral particles are poorly infectious since they incorporate immature env glycoproteins. ifn-i/ii, type i and type ii interferons; stat, signal transducers and activators of transcription. furin cleaves the toxin precursor at the cell surface. in contrast, penetration of the inhibitor into the cell is essential to prevent the maturation of hiv- env and other viral glycoproteins that are cleaved intracellularly. while some inhibitors (e.g. ha-derived peptides) efficiently enter cells, others were modified to increase their intracellular availability. for example, addition of a decanoyl moiety to cmk inhibitors increases their ability to penetrate cells. streptamine derivatives may be particularly promising for targeted therapy as the positioning of the guanidyl substituents determines the localisation of the inhibitor to distinct subcellular compartments such as endosomes or the golgi complex. while many of the inhibitors described above potently reduce furin activity both in vitro and in vivo, most of them also inhibit other proprotein convertases recognising the same or similar polybasic cleavage sites. this limitation is inherent to competitive inhibitors that aim at mimicking the target sequence of furin and may be overcome by allosteric inhibitors that bind furin-specific motifs outside the active site. one example is the nanobody nb , which binds to the c-terminal pdomain of furin, thereby blocking the access of larger substrates to the active site. notably, nb specifically binds to the p-domain of furin and does not recognise other pcsks. instead of targeting furin at the protein level, therapeutic approaches may also aim at targeting its rna. for example, the endogenous degradation of furin mrna by regnase- (zc h a) and/or roquin (rc h ) could be modulated to interfere with the expression and thus proteolytically active amount of furin. however, modulation of regnase- and roquin will most likely have off-target effects as both endoribonucleases also degrade additional mrnas. a more selective rna-based approach is the silencing of furin via shrna. in fact, shrnamediated suppression of furin expression is currently the clinically most advanced therapy targeting this protease. in a phase iii clinical trial, patients suffering from metastatic ewing's sarcoma family of tumors (esft) are treated with an immunotherapy that involves the silencing of furin and simultaneous overexpression of gm-csf. more specifically, tumor cells are extracorporeally transfected and reintroduced as so-called furin knock-down and gm-csf augmented (fang) cancer vaccine, also known as vigil. while gm-csf boosts the antitumor response by dendritic cells and t cells, knockdown of furin prevents the proteolytic activation of tgfb, which may otherwise revert the beneficial effects of gm-csf. [ ] [ ] [ ] in a phase ii clinical trial, the autologous fang/vigil vaccine has already proven successful as it increased relapse-free survival of ovarian cancer patients from to days and showed only limited adverse effects. the proprotein convertase furin has become an attractive target for the treatment of various infectious and noninfectious diseases as it regulates the activity of numerous mammalian, bacterial and viral proteins. in recent years, several peptidic and nonpeptidic inhibitors have been developed that block the activation of bacterial toxins, prevent the maturation of viral proteins and suppress tumor growth in vitro. although some of them also yielded promising results in mouse models, there have been only a limited number of clinical trials in humans. one major challenge is the redundancy of furin with related proprotein convertases that also recognise polybasic cleavage sites. on the one hand, selective inhibition of furin may be beneficial as it limits unwanted side effects due to inhibition of other pcsks. on the other hand, treatment of some diseases may require the simultaneous inhibition of several pcsks to efficiently block pathological substrate conversion. to advance current approaches, a better understanding of the relative contribution of individual pcsks to (nonphysiological) proteolytic protein processing is urgently needed. therapeutic intervention needs to specifically target the convertase(s) that drive disease progression. currently, the most promising approaches for selective inhibition are shrnamediated silencing and nanobodies as they show no or only little off-target effects. even if selective inhibition of individual pcsks can be achieved, systemic long-term inhibition will most likely have detrimental effects, as pcsks are required for the activation of hundreds of cellular substrates. thus, local applications such as targeted treatment of tumors or topical treatment of bacterial and viral infections may be more feasible than systemic therapy. finally, the ability of tumor cells or pathogens to evolve resistance or evasion mutations remains poorly investigated. for example, several substrates such as dengue virus prm harbour suboptimal furin target sequences and may optimise their cleavage sites upon therapy to enable sufficient cleavage in the presence of inhibitors. although the therapeutic application of furin inhibitors may be full of pitfalls, it is certainly a promising approach that should be further pursued. future studies will elucidate the role of individual pcsks and their substrates in disease progression and a better understanding of cellular pathways regulating furin activity may uncover additional targets for therapeutic intervention. pathophysiological aspects of proteases furindb: a database of -residue furin cleavage site motifs, substrates and their associated drugs insulin biosynthesis: evidence for a precursor evolutionary conserved close linkage of the c-fes/fps proto-oncogene and genetic sequences encoding a receptor-like protein furin is a subtilisin-like proprotein processing enzyme in higher eukaryotes the biology and therapeutic targeting of the proprotein convertases secreted site- protease cleaves peptides corresponding to luminal loop of sterol regulatory element-binding proteins catalytic activity is not required for secreted pcsk to reduce low density lipoprotein receptors in hepg cells tissue-based map of the human proteome expression of the dibasic proprotein processing enzyme furin is directed by multiple promoters proprotein convertase furin is preferentially expressed in t helper cells and regulates interferon c tgfb regulates gene expression of its own converting enzyme furin the convertases furin and pc can both cleave the human immunodeficiency virus (hiv)- envelope glycoprotein gp into gp (hiv- su) and gp (hiv-i tm) processing of human toll-like receptor by furin-like proprotein convertases is required for its accumulation and activity in endosomes intramolecular chaperones and protein folding activation of the furin endoprotease is a multiple-step process: requirements for acidification and internal propeptide cleavage furin at the cutting edge: from protein traffic to embryogenesis and disease intracellular trafficking and activation of the furin proprotein convertase: localization to the tgn and recycling from the cell surface maturation of the trans-golgi network protease furin: compartmentalization of propeptide removal, substrate cleavage, and cooh-terminal truncation shed" furin: mapping of the cleavage determinants and identification of its c-terminus a residues motif delineates the furin cleavage site and its physical properties may influence viral fusion proprotein convertases [corrected] are responsible for proteolysis and inactivation of endothelial lipase in vivo evidence that furin from hepatocytes inactivates pcsk failure of ventral closure and axial rotation in embryos lacking the proprotein convertase furin loss of endothelial furin leads to cardiac malformation and early postnatal death mutations within a furin consensus sequence block proteolytic release of ectodysplasin-a and cause x-linked hypohidrotic ectodermal dysplasia haemophilia b: database of point mutations and short additions and deletions proprotein convertases: "master switches" in the regulation of tumor growth and progression proprotein convertase inhibition: paralyzing the cell's master switches the proprotein convertase furin in tumour progression: the pc furin in tumour progression hypoxiaenhanced expression of the proprotein convertase furin is mediated by hypoxia-inducible factor- : impact on the bioactivation of proproteins hypoxia enhances cancer cell invasion through relocalization of the proprotein convertase furin from the trans-golgi network to the cell surface get a ligand, get a life: integrins, signaling and cell survival tgf-b in t cell biology: implications for cancer immunotherapy the dark side of ifnc: its role in promoting cancer immunoevasion radiotherapyassociated furin expression and tumor invasiveness in recurrent laryngeal cancer proteolytic activation of bacterial toxins: role of bacterial and host cell proteases evidence for involvement of furin in cleavage and activation of diphtheria toxin cell-mediated cleavage of pseudomonas exotoxin between arg and gly generates the enzymatically active fragment which translocates to the cytosol effect of diphtheria toxin on protein synthesis: inactivation of one of the transfer factors cellular processing of the interleukin- fusion toxin dab -il- and efficient delivery of diphtheria fragment a to the cytosol of target cells requires arg human furin is a calcium-dependent serine endoprotease that recognizes the sequence arg-x-x-arg and efficiently cleaves anthrax toxin protective antigen bacteriophage control of bacterial virulence host cell proteases controlling virus pathogenicity molecular biology of newcastle disease virus the molecular biology of influenza virus pathogenicity maturation of hiv envelope glycoprotein precursors by cellular endoproteases immunological analysis of human immunodeficiency virus type envelope glycoprotein proteolytic cleavage improved antigenicity of the hiv env protein by cleavage site removal the extracellular processing of hiv- envelope glycoprotein gp by human plasmin role of staphylococcus protease in the development of influenza pneumonia influenza virus activating host proteases: identification, localization and inhibitors as potential therapeutics proteolytic activation of influenza viruses by serine proteases tmprss and hat from human airway epithelium influenza virus hemagglutinin with multibasic cleavage site is activated by furin, a subtilisin-like endoprotease proprotein-processing endoproteases pc and furin both activate hemagglutinin of virulent avian influenza viruses virulenceassociated sequence duplication at the hemagglutinin cleavage site of avian influenza viruses different hemagglutinin cleavage site variants of h n in an influenza outbreak in chickens in mutations at the cleavage site of the hemagglutinin alter the pathogenicity of influenza virus a/chick/penn/ (h n ) hat, and tmprss activate the hemagglutinin of h n influenza a viruses a novel activation mechanism of avian influenza virus h n by furin processing of the ebola virus glycoprotein by the proprotein convertase furin proteolytic processing of marburg virus glycoprotein endoproteolytic processing of the ebola virus envelope glycoprotein: cleavage is not required for function proteolytic processing of the ebola virus glycoprotein is not critical for ebola virus replication in nonhuman primates gp mrna of ebola virus is edited by the ebola virus polymerase and by t and vaccinia virus polymerases Δ-peptide is the carboxy-terminal cleavage fragment of the nonstructural small glycoprotein sgp of ebola virus the multiple roles of sgp in ebola pathogenesis degrees of maturity: the complex structure and biology of flaviviruses fusion activity of flaviviruses: comparison of mature and immature (prm-containing) tick-borne encephalitis virions structure of the immature dengue virus at low ph primes proteolytic maturation association of the pr peptides with dengue virus at acidic ph blocks membrane fusion cleavage of protein prm is necessary for infection of bhk- cells by tick-borne encephalitis virus functional importance of dengue virus maturation: infectious properties of immature virions west nile virus discriminates between dc-sign and dc-signr for cellular attachment and infection immature dengue virus: a veiled pathogen? differential modulation of prm cleavage, extracellular particle distribution, and virus infectivity by conserved residues at nonfurin consensus positions of the dengue virus pr-m junction cleavage of the papillomavirus minor capsid protein, l , at a furin consensus site is necessary for infection the role of furin in papillomavirus infection mechanisms of human papillomavirus type neutralization by l cross-neutralizing and l type-specific antibodies characterization of genotype-specific carboxyl-terminal cleavage sites of hepatitis b virus e antigen precursor and identification of furin as the candidate enzyme proteolytic processing of the hepatitis b virus e antigen precursor. cleavage at two furin consensus sequences the secreted hepatitis b precore antigen can modulate the immune response to the nucleocapsid: a mechanism for persistence a function of the hepatitis b virus precore protein is to regulate the immune response to the core antigen the secretory core protein of human hepatitis b virus is expressed on the cell surface influence of a single nucleotide polymorphism in the p promoter of the furin gene on transcription activity and hepatitis b virus infection rules of engagement: molecular insights from host-virus arms races samhd restricts hiv- by reducing the intracellular pool of deoxynucleotide triphosphates the intracellular dna sensor ifi gene acts as restriction factor for human cytomegalovirus replication neuroinflammationinduced interactions between protease-activated receptor and proprotein convertases in hivassociated neurocognitive disorder hiv-induced neuroinflammation: impact of par and par processing by furin identification of potential hiv restriction factors by combining evolutionary genomic signatures with functional analyses guanylate binding protein (gbp) is an interferon-inducible inhibitor of hiv- infectivity the authors guanylate-binding proteins and exert broad antiviral activity by inhibiting furin-mediated processing of viral envelope proteins distinct prognostic value of mrna expression of guanylate-binding protein genes in skin cutaneous melanoma interferoninducible guanylate binding protein (gbp ) is associated with better prognosis in breast cancer and indicates an efficient t cell response inhibition of hiv- gp -dependent membrane fusion by a furindirected a -antitrypsin variant engineered serine protease inhibitor prevents furin-catalyzed activation of the fusion glycoprotein and production of infectious measles virus targeting host proteinases as a therapeutic strategy against viral and bacterial pathogens polyarginines are potent furin inhibitors the prosegments of furin and pc as potent inhibitors of proprotein convertases. in vitro and ex vivo assessment of their efficacy and selectivity opposing function of the proprotein convertases furin and pace on breast cancer cells' malignant phenotypes: role of tissue inhibitors of metalloproteinase- the furin inhibitor hexa-d-arginine blocks the activation of pseudomonas aeruginosa exotoxin a in vivo nona-d-arginine therapy for pseudomonas aeruginosa keratitis nona-d-arginine amide for prophylaxis and treatment of experimental pseudomonas aeruginosa keratitis potent inhibitors of furin and furin-like proprotein convertases containing decarboxylated p arginine mimetics the crystal structure of the proprotein processing proteinase furin explains its stringent specificity therapeutic uses of furin and its inhibitors: a patent review guanidinylated , -dideoxystreptamine derivatives as anthrax lethal factor inhibitors structural studies revealed active site distortions of human furin by a small molecule inhibitor inhibition of proteolytic activation of influenza virus hemagglutinin by specific peptidyl chloroalkyl ketones generation and characterization of non-competitive furin-inhibiting nanobodies regnase- and roquin nonredundantly regulate th differentiation causing cardiac inflammation and fibrosis vigil + irinotecan and temozolomide in ewing's sarcoma -full text view -clinicaltrials.gov generation of large numbers of dendritic cells from mouse bone marrow cultures supplemented with granulocyte/macrophage colony-stimulating factor processing of transforming growth factor b precursor by human furin convertase contrasting effects of tgf-b and tnf-a on the development of dendritic cells from progenitors in mouse bone marrow phase ii study of human cytomegalovirus strain towne glycoprotein b is processed by proteolytic cleavage identification and structure of the gene encoding gpii, a major glycoprotein of varicella-zoster virus epstein-barr virus glycoprotein homologous to herpes simplex virus gb coronavirus ibv: partial amino terminal sequencing of spike polypeptide s identifies the sequence arg-arg-phe-arg-arg at the cleavage site of the spike precursor propolypeptide of ibv strains beaudette and m cleavage inhibition of the murine coronavirus spike protein by a furin-like enzyme affects cell-cell but not virus-cell fusion nucleotide sequence of yellow fever virus: implications for flavivirus gene expression and evolution proteolytic activation of tick-borne encephalitis virus by furin nucleotide sequence of the s mrna of sindbis virus and deduced sequence of the encoded virus structural proteins furin processing and proteolytic activation of semliki forest virus mechanism of borna disease virus entry into cells processing of the borna disease virus glycoprotein gp by the subtilisinlike endoprotease furin crimean-congo hemorrhagic fever virus glycoprotein precursor is cleaved by furin-like and ski- proteases to generate a novel -kilodalton glycoprotein molecular analyses of the hemagglutinin genes of h influenza viruses: origin of a virulent turkey strain complete nucleotide sequence of an influenza virus haemagglutinin gene from cloned dna structural comparison of the cleavage-activation site of the fusion glycoprotein between virulent and avirulent strains of newcastle disease virus fusion glycoprotein of human parainfluenza virus type : nucleotide sequence of the gene, direct identification of the cleavage-activation site, and comparison with other paramyxoviruses cloning and sequencing of the mumps virus fusion protein gene the nucleotide sequence of the mrna encoding the fusion protein of measles virus (edmonston strain): a comparison of fusion proteins from several different paramyxoviruses fusion protein of the paramyxovirus simian virus : nucleotide sequence of mrna predicts a highly hydrophobic glycoprotein nucleotide sequence of the gene encoding the fusion (f) glycoprotein of human respiratory syncytial virus endoproteolytic cleavage of gp is required for the activation of human immunodeficiency virus structural characterization of the avian retrovirus reverse transcriptase and endonuclease domains the role of envelope glycoprotein processing in murine leukemia virus infection furin-mediated cleavage of the feline foamy virus env leader protein we thank frank kirchhoff and dominik hotter for their helpful comments. this work was funded by the dfg priority programme 'innate sensing and restriction of retroviruses' (spp ) to ds; eb was supported by the international graduate school in molecular medicine ulm (igradu). the authors declare no conflict of interest. eb and ds performed literature research. eb drafted the figures, and ds wrote the initial version of the manuscript.this is an open access article under the terms of the creative commons attribution-noncommercial license, which permits use, distribution and reproduction in any medium, provided the original work is properly cited and is not used for commercial purposes. key: cord- -q f el authors: farhadi, tayebeh; hashemian, seyed mohammadreza title: computer-aided design of amino acid-based therapeutics: a review date: - - journal: drug des devel ther doi: . /dddt.s sha: doc_id: cord_uid: q f el during the last two decades, the pharmaceutical industry has progressed from detecting small molecules to designing biologic-based therapeutics. amino acid-based drugs are a group of biologic-based therapeutics that can effectively combat the diseases caused by drug resistance or molecular deficiency. computational techniques play a key role to design and develop the amino acid-based therapeutics such as proteins, peptides and peptidomimetics. in this study, it was attempted to discuss the various elements for computational design of amino acid-based therapeutics. protein design seeks to identify the properties of amino acid sequences that fold to predetermined structures with desirable structural and functional characteristics. peptide drugs occupy a middle space between proteins and small molecules and it is hoped that they can target “undruggable” intracellular protein–protein interactions. peptidomimetics, the compounds that mimic the biologic characteristics of peptides, present refined pharmacokinetic properties compared to the original peptides. here, the elaborated techniques that are developed to characterize the amino acid sequences consistent with a specific structure and allow protein design are discussed. moreover, the key principles and recent advances in currently introduced computational techniques for rational peptide design are spotlighted. the most advanced computational techniques developed to design novel peptidomimetics are also summarized. different diseases may be caused by pathogens or malfunctioning organs, and using therapeutic agents to heal them has an old recorded history. small molecules are conventional therapeutic candidates that can be easily synthesized and administered. however, many of these small molecules are not specific to their targets and may lead to side effects. moreover, a number of diseases are caused due to deficiency in a specific protein or enzyme. thus, they can be treated using biologically based therapies that are able to recognize a specific target within crowded cells. under the biologic conditions, some macromolecules such as proteins and peptides are optimized to recognize specific targets. therefore, they can override the shortcomings of small molecules. recently, pharmaceutical scientists have shown interest in engineering amino acid-based therapeutics such as proteins, peptides and peptidomimetics. [ ] [ ] [ ] theoretical and experimental techniques can predict the structure and folding of amino acid sequences and provide an insight into how structure and function are encoded in the sequence. such predictions may be valuable to interpret genomic information and many life processes. moreover, engineering of novel proteins or redesigning the existing proteins has opened the ways to achieve novel biologic macromolecules with desirable therapeutic functions. protein sequences comprise tens to thousands of amino acids. besides, the backbone and side chain degrees of freedom lead to a large number of configurations for a single amino acid sequence. protein design techniques give minimal frustration through precise identification of sequences and their characteristics. [ ] [ ] [ ] [ ] considering energy landscape theory, the adequately minimal frustration in natural proteins occurs when their native state is adequately low in energy. the de novo design of a sequence is difficult because there are huge numbers of possible sequences: n for n-residue proteins with only natural amino acids. peptide design should incorporate computational approaches. it can benefit from searching the more advanced fields used for small molecules and protein design. however, the straightforward adoption of computational approaches employed to small-molecule and protein design has not be accepted as a reasonable solution to the peptide design problem. [ ] [ ] [ ] in the peptide drug design, the conformational space accessible to peptides challenges the small-molecule computational approaches. besides, the necessity for nonstandard amino acids and various cyclization chemistries challenges the available tools for protein modeling. furthermore, the aggregation of peptide drugs during production or storage can be an unavoidable problem in the peptide design procedure. rational design of a peptide ligand is also challenging because of the elusive affinity and intrinsic flexibility of peptides. peptide-focused in silico methods have been increasingly developed to make testable predictions and refine design hypotheses. consequently, the peptide-focused approaches decrease the chemical spaces of theoretical peptides to more acceptable focused "drug-like" spaces and reduce the problems associated with aggregation and flexibility. , for the discussions that follow, peptides can be defined as relatively small ( - residues) polymers of amino acids. in physiological conditions, several problems such as degradation by specific or nonspecific peptidases may limit the clinical application of natural peptides. moreover, the promiscuity of peptides for their receptors emerges from high degrees of conformational flexibility that can cause undesirable side effects. besides, some properties of therapeutic peptides, such as high molecular mass and low chemical stability, can result in a weak pharmacokinetic profile. therefore, peptidomimetic design can be a valuable solution to circumvent some of undesirable properties of therapeutic peptides. , in the biologic environment, peptidomimetics can mimic the biologic activity of parent peptides with the advantages of improving both pharmacokinetic and pharmacodynamic properties including bioavailability, selectivity, efficacy and stability. a wide range of peptidomimetics have been introduced, such as those isolated as natural products, synthesized from novel scaffolds, designed based on x-ray crystallographic data and predicted to mimic the biologic manner of natural peptides. using hierarchical strategies, it is possible to change a peptide into mimic derivatives with lower undesirable properties of the origin peptide. over the past years, computational methods have been developed to discover peptidomimetics. in a part of this review, novel computational methods introduced for peptidomimetic design have been summarized. peptidomimetics can be categorized as follows: peptide backbone mimetics (type ), functional mimetics (type ) and topographical mimetics (type ). the first generation of peptidomimetics (type ) mimics the local topography of amide bond. it includes amide bond isosteres, pyrrolinones or short fragments of secondary structure, such as beta-turns. such mimetics generally match the peptide backbone atom-for-atom, and comprise chemical groups that also mimic the functionality of the natural side chains of amino acids. a number of prosperous instances of type peptidomimetics have been reported. the second type of peptidomimetics is described as functional mimetics or type mimetics, which include small, non-peptide compounds that are able to identify the biologic targets of their parent peptide. at first, they were assumed to be conservative structural analogs of parent peptides. however, using site-directed mutagenesis, their binding sites to biologic targets were investigated. the results indicated that type peptidomimetics routinely bind to protein sites that are different from those selected by the original peptide. therefore, type mimetics maintain the ability to interfere with the peptide-protein interaction process without the necessity to mimic the structure of the natural peptide. type peptidomimetics reveal the best conception of peptidomimetics. they consist of the necessary chemical groups that act as topographical mimetics and contain novel chemical scaffolds that are unrelated to natural peptides. here, theoretical and computational techniques to design proteins, peptides and peptidomimetics are reviewed. however, the current review does not deeply highlight the computational aspects of amino acid-based therapeutic design, but only discusses the methods used to design the mentioned therapeutics. figure summarizes the key concepts presented in this study. as some examples, the structures of aldesleukin, leuprolide and spaglumic acid, important amino acid-based therapeutics approved by the us food and drug administration (fda), are shown in figure a computer-aided design of amino acid-based therapeutics figure a ) and leuprolide (pdb id: yy ; figure b ) were obtained from the protein data bank (pdb; http://www. rcsb.org/) and visualized by pymol tool. the structure of spaglumic acid was retrieved (in mol format) from pub-chem database (https://pubchem.ncbi.nlm.nih.gov/) with the pubchem id ( figure c ) and visualized using pymol. aldesleukin, a lymphokine, is a recombinant protein used to treat adults with metastatic renal cell carcinoma (https://www.drugbank.ca/drugs/db ). leuprolide, a synthetic nine-residue peptide analog of gonadotropin releasing hormone, is used to treat advanced prostate cancer (https://www.drugbank.ca/drugs/db ). spaglumic acid is used in allergic conditions such as allergic conjunctivitis. the drug belongs to a class of peptidomimetics known as hybrid peptides. hybrid peptides contain at least two dissimilar types of amino acids (alpha, beta, gamma or delta) linked to each other via a peptide bond (https://www.drugbank.ca/ drugs/db ). in the current study, all fda-approved therapeutics (in ) were retrieved from drugbank (https://www.drugbank. ca/biotech_drugs) and an analysis was conducted to compare their percentages. protein-based therapies, gene or nucleic acid-based therapies, vaccines, allergenics and cell transplant therapies made up . %, . %, . %, . % and . % of total approved therapeutics, respectively. small-molecule drugs made up . % of the approved therapeutics ( figure ). computational designing of proteins can be classified as follows: ) template-based designing in which three-dimensional ( d) farhadi and hashemian structure of a predefined template is adapted to design a sequence and ) de novo designing in which the amino acids' arrangement is changed to generate both sequence and d structure of a completely novel protein. the problem of predicting the fold of an unknown sequence could be solved by utilizing templates. since the fold is unaltered, the backbone atoms are directly located on this framework. moreover, to generate a functional protein, the side chains that can effectively stabilize the structure are added to the backbone. , routine concerns and methods for template-based protein design are reviewed below. selecting the template (scaffold) protein the template (also named as scaffold protein) contains a group of backbone atom coordinates. the coordinates can be retrieved from an available x-ray crystal structure or cautiously from a nuclear magnetic resonance (nmr) structure. computer-aided design of amino acid-based therapeutics fixing the backbone decreases the computational complication, but it may inhibit the main chain modifications to adjust sequence alternation. backbone flexibility can generate designed functionalities over the protein's normal function. the backbone flexibility is introduced through incorporating other closely associated conformations to an existing structure. [ ] [ ] [ ] recently, new functionalities were effectively introduced into the tim-barrel topology. this fold has been detected as one of the most shared structures in distinct protein superfamilies. sequence search and characterization in a design procedure, a protein sequence is selected such that it meets the energetic and geometric constraints established by the chosen fold. sequence search techniques sample different sequences and estimate their energies to gain the one owing the minimum energy. in order to identify the sequences subject to an objective function or a specific energy, a diverse strategies including optimization and probabilistic approaches have been developed. optimization processes may recognize candidate sequences using stochastic or deterministic methods. probabilistic approaches focus on characterizing the sequence space probabilistically. deterministic methods: to achieve a sequence folded into a global minimum energy conformation, deterministic methods search the whole sequence space and identify the global optima. , these methods include dead-end elimination (dee), self-consistent mean field, graph decomposition and linear programming. stochastic algorithms search the sequence space in an exploratory manner. these algorithms include monte carlo algorithms (simulated annealing), graph search methods and genetic algorithms. some of the most commonly used methods are discussed below. dee has been considered as a thorough search algorithm. to find and remove sequence-rotameric positions that are not portions of the global minimum energy conformation, dee compares two amino acid rotamers and removes the one with greater interaction energy. interaction energies are computed for each rotamer of the test amino acid, along with all rotamers of every other amino acid. the situation is repetitively examined for total amino acid states as well as their rotamers until it no longer holds true. , expanding the sequence length increases the combinatorial complication of dee exponentially. therefore, to design sequences of amino acids or larger, application of dee may be restricted. details of the theorem are explained elsewhere. , stochastic search algorithms: as mentioned before, deterministic approaches are perfect to design proteins with small sizes, but show the applied disadvantages with extension of sequence size. stochastic or heuristic methods are valuable to design large proteins. the most widely used method for protein design includes monte carlo sampling. , monte carlo method samples positions of complicated proteins in a way related to a selected probability distribution such as boltzmann distribution. boltzmann distribution specially weighs low-energy configurations. the monte carlo algorithm performs iterative series of calculations. at the primary step of each search, a partially accidental test sequence is generated, and its energy is calculated via a physical potential. during the primary step, both rotamer state and amino acid identity are adjusted and an efficient temperature controls the probable energy alterations. in the next step, named simulated annealing, the temperature gradually decreases and permits favorable sampling of lowerenergy configurations. multiple independent calculations are carried out to converge the system to a global minimum. , for more explanation about the theorems and details of the formulation of the probability distribution and weights, readers are referred to study previous reports. , probabilistic approach: probabilistic approaches are frequently employed when thorough information is not accessible for protein design. in a probabilistic approach, sitespecific amino acid probabilities may be utilized, rather than particular sequences. the procedure is partially motivated by the uncertainties to find sequences consistent with a specific structure. briefly, the backbone atoms are fixed or greatly constrained, side chain conformations are discretely handled, energy functions are estimated and solvation is handled by simple models. however, in order to offer valuable sequence information for design experiments and to find structurally significant amino acids, probabilistic techniques leverage structural characteristics of interatomic interactions. generally, monte carlo methods give a probabilistic sampling of sequences. , in addition, an entropy-based formalism has been defined to predict amino acid probabilities for a certain backbone structure. , the method employs concepts from statistical thermodynamics to assess the sitespecific probabilities. to address the whole space of existing compositions, the theory is not restricted by the computational enumeration and sampling. large protein structures with . variable residues can be supplied simply. sampling sequence space to generate conformations the chemical variability of a sequence and the number of various amino acids permitted at each position are defined as "degrees of freedom for each amino acid". moreover, each of the natural residues search the whole sequence space. drug design, development and therapy : submit your manuscript | www.dovepress.com to decrease the degrees of freedom for each amino acid and searching the sequence space, diverse approaches such as hydrophobic patterning have been proposed. monomers can be used to probe a protein structure and improve its function, other than the naturally occurring amino acids. sampling of side chain conformational space to form conformations side chain conformations are typically consistent with the energy minima of molecular potentials and can be obtained from a structural database. rotamer statuses are related to the repeatedly detected values of dihedral angles in the side chain of each amino acid. for example, the simplest amino acids including alanine and glycine have only one rotamer status, while the bigger amino acids have . diverse rotamer statuses. a variety of rotamer libraries including backbonedependent, secondary structure-dependent and backboneindependent libraries have been developed for protein design. , by using a rotamer library, one can discretize a meaningful state space to decrease the computational difficulty. rotamer libraries can be extended beyond the natural amino acids. the effective rotamers can model cofactors, ligands, water and posttranslational modifications. for example, to improve the modeling of protein-protein interactions and model water within proteins interiors, the structurally definite water molecules can be inserted as a solvated rotamer library. energy functions have been employed to quantify sequencestructure compatibilities. they include linear associations of hydrogen bonds made by backbone atoms, repulsion among atoms, hydrophobic attraction among non-polar groups and electrostatic interactions among sequential neighbors. the sequence of a protein is selected so that it can adjust the energetic and geometric constraints enforced by the favorite fold. constraints typically contain several intramolecular interactions such as van der waals, hydrophobic, polar and electrostatic interactions, as well as hydrogen bonds. generally, by using a scoring function, it is possible that energetic contributions of the mentioned parameters are taken into account. , , de novo design: designing the sequence and d structure through assembly of proteins fragments , or secondarystructure elements, , novel structures can be modeled de novo. in the design procedures, the backbone coordinates are generally constrained. summary and important findings of some proteins designed using computational approach including a retroaldol enzyme, the kemp elimination enzyme, a novel βαβ protein, a redesigned procarboxypeptidase, a novel α/β protein structure and the top are shown in table . peptide design methods have been categorized as ligand-and target-based design methods. in the ligand-based designing procedure, information derived from peptides is used to design novel therapeutic peptides. in the target-based method, information derived from target proteins is specifically utilized. typically, a hybrid approach including both ligand-and target-based design is utilized. ligand-based peptide design the ligand-based design has been classified as follows: ) sequence-based, ) property-based and ) conformationbased design. sequence-based approach uses the information of conserved regions and analyzes the multiple sequence alignments. this method is directed by the hypothesis that conserved regions are functionally and structurally significant. computational tools allow the ligand-based peptide design, although they lag behind bioinformatics strategies developed for protein designing. recently, using a method based on a pam matrix, the relationship between a series of collagen peptides and antiangiogenic activity including proliferation, migration and adhesion was analyzed. the pam matrix captured information of mutation rates among all pairs of amino acids. based on the results, regions at the c and n termini of the peptides were detected to be significant for an ideal activity and suggested as two distinct binding sites. the approach showed the potential worth of the sequence-based peptide design. in another report, a computational platform called sarvision was developed to support sequence-based design. sarvision signifies an important step for peptide sequence/activity relationship (sar) analysis. moreover, it pools the improved visualization abilities with advanced sequence/activity analysis. compared to small molecules, property-based design methods for peptides are in the early stages of development. in a recent study, the Δg decomposition per residue and the physicochemical characteristics of amino acids, such as hydrophilicity, hydrophobicity and volume, were used computer-aided design of amino acid-based therapeutics to model peptide binding to targets of interest. , finally, a model was built to estimate peptide Δg values for binding to the class i major histocompatibility complex (mhc) protein hla-a* . furthermore, in a wide range of studies, antimicrobial peptides were successfully analyzed by using the property-based approach. for example, a machinelearning method was employed to design novel antimicrobial peptides. the victory of the property-based methods with antimicrobial peptides may be explained by the fact that the desired biologic activity of membrane disruption is relatively nonspecific. in the case of conformation-based peptide design, computational techniques were developed to predict the conformational ensembles or structure of peptides and analyze the sars. , pep-fold is an online tool used to predict the d structures of peptides of length - residues. a remarkable suggestion from the data is that pep-fold seems to solve the conformational sampling problem. , in order to search conformational spaces of a peptide, long timescale molecular dynamic simulations have been employed. , besides, quantum mechanical calculations are promising to address the scoring deficiency in the peptide conformational examination. apparently, to affect the peptide design processes positively, improving the major theoretical and technical issues is necessary before such computationally sophisticated and costly procedures. conformation of a peptide may be modeled to generate a d pharmacophore hypothesis. a certain pharmacophore hypothesis is useful to determine the adme/tox activities or particular potencies of a peptide. for example, screening of a peptide library was jointed to generate a pharmacophore hypothesis to identify potent agonists of melanocortin- receptor isoforms. a combinatorial tetrapeptide library was screened, and sar and ligand-derived pharmacophore templates were generated. the pharmacophore hypothesis was proposed to allow continuous attempts in the rational design of melanocortin receptor molecules. target-based peptide design compared to ligand-based peptide design, target-based design appears to be in a more improved level. targetbased design is initiated with the computer-aided survey of a ligand-bound or unbound protein target to recognize its potential binding sites, prospective specificity surfaces and other pharmacologic activity elements. the phase is generally followed by an in silico design phase where computational methods perform, refine and evaluate peptide design ideas. some recently developed computational methods for targetbased peptide design are reviewed below. recently, an increase in the number of protein-peptide d structures deposited in the pdb has assisted to search the molecular mechanism and structural basis of peptide recognition and binding. information of crystal structures of protein-peptide complexes can improve our knowledge of the farhadi and hashemian chemical forces involved in the binding and special modes of binding. dynamic data of the complexes can be partially extracted from the solution nmr structures deposited in the pdb. to record the structures and functions of various protein-peptide complexes, the experimentally resolved structure data were gathered, annotated and analyzed, and several distinctive databases such as pepx, pepbind and peptiddb were generated. the pepx database, derived from the pdb, comprises unique protein-peptide interface collections. the pepbind database contains , proteinpeptide complex structures from the pdb. peptiddb is a curated database of protein-peptide complexes. the abundance of the structural information specifically on monomeric proteins could be gathered to design proteinpeptide interactions with no requirement for their sequence homology. protein-peptide docking precise docking of a highly flexible peptide is a major challenge. traditional docking protocols, such as autodock, vina , and moe-dock, developed for docking of small molecules, were also used to dock a peptide to a protein receptor. however, comparative studies revealed that these techniques would face failure if the docked peptides were . residues long. therefore, development of peptide-focused docking protocols is very important. other protein-protein docking tools such as z-dock and hex have been used for the computational peptide design in some studies. below, details of recently developed peptide-focused docking approaches are discussed. first, heuristic evolution procedures were applied to search the large conformational space of linear peptides before the binding. however, these procedures were not efficient and their use was limited. then, a scheme based on conformational sampling became common in the peptide docking. besides, several illustrative approaches were proposed to balance between the accuracy and efficacy of the flexible peptide docking. in this aspect, docscheme, dynadock and pepspec were integrated to online userfriendly interfaces and introduced. recently, pepcrawler and flexpepdock were developed as the peptide docking tools. it is reported that flexpepdock has sub-angstrom accuracy in reproducing the crystal structures of protein-peptide complexes. all of the flexpepdock-based methods assume previous information about the peptide-binding site. anchordock, a recently described algorithm, allows powerful blind docking calculations through relaxing the constraint. the program predicts anchoring origins on a protein surface. following recognition of the anchoring origins, an assumed peptide conformation is refined using an anchor-constrained molecular dynamic process. haddock, a well-known protein-protein docking tool, has been recently expanded to run the flexible peptide-protein docking. to handle a docking procedure, haddock uses ambiguous interaction restraints based on the experimental information about intermolecular interactions. this rigid body peptide docking is followed through a flexiblesimulated annealing process. the novel haddock strategy initiates docking computations from an ensemble of three dissimilar peptide conformations (eg, α-helix, extended and polyproline-ii) that are high informative inputs. cabs-dock is a recently introduced protein-peptide docking tool and runs a primary docking procedure whose outcomes can be refined by other tools such as flexpepdock. in the primary phase of the procedure, random conformations of a peptide are predicted and located around the protein target of interest. the process is followed by replica exchange monte carlo dynamics. subsequently, models are selected for the last optimization using the modeller tool to gain accurate scoring and ranking poses. , galaxypepdock was developed to use experimentally resolved protein-peptide structures for running the template-based docking pooled by flexible energy-based optimization. atomistic simulation atomistic monte carlo and molecular dynamics simulations are accurate, but they are meticulous techniques to investigate peptide-protein binding interactions. these techniques can also detect the thermodynamic profile and trajectory included in protein-peptide identification. these methods predict the association among conformations of a peptide in solution or protein. in a study, in order to describe the binding of a decapeptide to the cognate sh receptor, a long-term molecular dynamic simulation was used and a two-state model was built. in the first step, a relatively quick diffusion phase, nonspecific encounter complexes were generated and stabilized by using electrostatic energy. the secondary step was a slow modification phase, in which the water molecules were emptied out from the space between the peptide ligand and the receptor. in another report, by using monte carlo method, the mentioned two-state model was verified to trace some oligopeptide routes for binding to various pdz (post synaptic density protein, drosophila disc large tumor suppressor, and zonula occludens- protein) domains. drug design, development and therapy : submit your manuscript | www.dovepress.com computer-aided design of amino acid-based therapeutics the affinity of bh peptides to bcl- protein was investigated, and results showed the higher affinity of bound peptides occurred when the corresponding peptides were in a lower degree of disorder in unbound states and vice versa. these results showed that the highly structured peptides could increase their affinity through reducing the entropic loss associated with the binding. overall, in addition to the electrostatic and hydrophobic forces, protein-peptide interactions can be affected by the entropic effect and conformational flexibility that could be willingly examined with atomistic simulations. very recently, using a fast molecular dynamics simulation, the energetic and dynamic features of protein-peptide interactions were studied. in most cases, the native binding sites and native-like postures of protein-peptide complexes were recapitulated. additional investigation showed that insertion of motility and flexibility in the simulation could meaningfully advance the correctness of protein-peptide binding prediction. peptide affinity prediction most features of computational peptide design are based on the accuracy and efficacy of affinity prediction. hence, the fast and reliable prediction of peptide-protein affinity is significant for rational peptide design. in this aspect, two categories of prediction algorithms including sequence-and structure-based approaches were developed. the sequencebased method uses the information derived from primary polypeptide sequences to approximate and evaluate the standards of the binding affinity. the structure-based process takes the information derived from d structures of proteinpeptide complexes to predict the binding affinity. at the sequence level, the quantitative structure-activity relationships (qsars) have been widely utilized to forecast the binding affinity of peptides and conclude the biologic function. to model the statistical correlation between sequence patterns and biologic activities of experimentally assessed peptides, machine-learning methods such as partial least squares (pls), artificial neural networks (ann) and support vector machine (svm) have been used. the obtained correlations have been used to infer experimentally undetermined peptides. the relationship between the biologic activity and molecular structure is an important issue in biology and biochemistry. qsar is a well-established method employed in pharmaceutical chemistry and has become a standard tool for drug discovery. however, the predictive capacity of qsar techniques is generally weaker than statistics-based approaches. therefore, a combination of the qsar method with a statistic-based technique may bring out the best in each other and can be a trend in future developments of drug discovery. at the structural level, numerous reports on affinity prediction have addressed the mhc-binding peptides. plentiful mhc-peptide complex structure records have been deposited in the pdb. the significance of domain-peptide recognition has been recently illustrated in the metabolic pathway and cell signaling. to predict the protein-peptide binding potency, a number of strict theories were suggested based on the potential free energy perturbation. the theories computed the alteration of free energies upon the interaction between phosphor-tyrosine-tetra-peptide (pyeei) and human lck sh domain. furthermore, to obtain a deep insight into the structural and energetic aspects of peptide recognition by the sh domain, a number of molecular modeling experiments such as homology modeling, molecular docking and mechanism dynamics were used. peptide array strategies confirmed that some peptide candidates may be potent binders of the abl sh domain. very recently, an approach including quantum mechanics/molecular mechanics, semiempirical poisson-boltzmann/surface area and empirical conformational free energy analysis was developed to quantitatively illustrate the energetic contributions involved in the affinity losing of pdz domain and oppa protein to their peptide ligands. , de novo peptide design recently, in order to de novo target-based peptide design, two remarkable methodologies including the vital method and an approach developed by bhattacherjee and wallin were introduced. the vital method pools verterbi algorithm with autodock to design peptides for the binding sites of a target. the "bhattacherjee and wallin" approach explores both peptide sequence and conformational space around a protein target at the same time. this approach was tested on three dissimilar peptide-protein domains to assess its ability. a brief list of the existing computational resources employed in peptide design is presented in table . in recent years, some computational methods have been proposed to design peptidomimetics. these methods can be classified based on their specificity to translate peptides to peptidomimetics. to select the best method, drug design, development and therapy : submit your manuscript | www.dovepress.com farhadi and hashemian awareness about the structure of peptide-protein complexes is important. , herein, recently introduced methods for computer-aided design of peptidomimetics are presented. growmol is a combinatorial algorithm employed in the peptidomimetics design. growmol searches a variety of probable ligands for the binding sites of a target protein and produces molecules with the chemical and steric complementarity for the d structure of binding sites. this method was used to generate peptidomimetic inhibitors of thermolysin, hiv protease and pepsin. by using the x-ray crystal structures of pepstatin-pepsin complexes, growmol predicted therapeutic peptidomimetics against the aspartic proteases. the algorithm created some cyclic inhibitors bridging the side chains of cysteine residues in the pl and p inhibitor subsites. the binding modes were checked using x-ray crystallography. , ludi is another interesting software referring to the de novo methodology. by using natural and non-natural amino acids as building blocks, the software designed peptidomimetics against renin, thermolysin and elastase. conformational flexibility of each novel peptidomimetic was searched through sampling the multiple conformers of each amino acid. peptide-driven pharmacophoric hypothesis is the most perceptive computational technique discovered in the peptidomimetics design. the method is especially useful when the x-ray structures of protein-protein complexes exist. the main idea is to adapt the hot spot concept into the associated pharmacophoric feature concept. with a pharmacophorebased virtual screening process, this strategy can determine novel type mimetics. in fact, the side chains of each amino acid can be simply categorized based on the conventional pharmacophoric characteristics, such as hydrogen bond donors and acceptors, aromatic ring and charged and hydrophobic centers. for example, in a report, pharmacophore model directed synthesis of the non-peptide analogs of a cationic antimicrobial peptide identified an anti-staphylococcal activity. to make a pharmacophore hypothesis, a model of rna iii-inhibiting peptide (rip), a well-known heptapeptide inhibitor of the staphylococcal pathogenesis, was utilized. through the virtual screening of , commercially available small molecules based on the rip-based pharmacophore, hamamelitannin was discovered as a non-peptide mimetic of rip. hamamelitannin is a tannin derivate extracted from hamamelis virginiana. , in another study, two rounds of in silico screening were performed to discover potential peptidomimetics able to mimic a cyclic peptide (cyclo- [cpfvktqlc] ) that is known to bind the anb integrin receptor. at the end of the process, the most potent representatives were at least , times better than the original cyclopeptide (around mm). in a prosperous instance, virtual screening was done by using multi-conformational forms of a large commercial library. a target-based pharmacophoric model mapped the cd -binding site on hiv- gp . the pharmacophore hypothesis was made based on a homology model of the protein cavity. in a cell-based assay, two of the top scoring molecules were detected as micromolar inhibitors of hiv- replication. computer-aided design of amino acid-based therapeutics the pharmacophore-based screening was used to find the novel alzheimer's therapeutics as mimetics of neurotrophins. the therapeutic utilization of neurotrophins might be restricted because of several deficiencies such as its reduced central nervous system penetration, decreased stability and potency to enhance neuronal death through interaction with the p ntr receptor. the mimetism of particular nerve growth factor domains could inhibit neuronal death. peptidomimetics of the loop and loop domains of nerve growth factor can prevent neuronal death induced by p ntr-dependent and trk-related signaling. in another study, a full-computational pharmacophorebased approach assessed the fda-approved drugs as valuable candidates to inhibit protein-protein interactions. peptide structures were designated in terms of pharmacophores and searched against the fda-approved drugs to detect same molecules. the top ranking drug matches contained several nuclear receptor ligands and matched allosterically to the binding site on the target protein. the top ranking drug matches were docked to the peptide-binding site. the majority of the top-ranking matches presented a negative free energy change upon binding that was comparable to the standard peptide. geometry similarity method geometry similarity methods create a geometric similarity between non-peptide templates and peptide patches. in a study, the supermimic tool was developed to recognize peptide mimetics. in the program, a complex library of peptidomimetics composed of several protein structure libraries has been deposited. moreover, supermimic includes the d-peptides, synthetic components (reported as betaturn or gamma-turn mimetics) and peptidomimetic ligands obtained from the pdb. in the program, the searching process allows scanning a library of small molecules that mimic the tertiary structure of a query peptide followed by scanning of a protein library where a query for small molecule can adopt into the backbone. , sequence-based method recently, a method has been developed to rank peptide compound matches that are limited to short linear motifs in proteins and compounds with amino acid substituents. the algorithm allows mapping the side chain-like substituents on every compound of a large chemical library. the complete molecule can be signified by a short sequence, and each fragment in the molecule can be represented as a distinct letter abbreviation. a cross-search between the pubchem database (about . million molecules) and a non-redundant collection of , peptides obtained from pdb demonstrated that the algorithm can be useful for high-throughput measurements. to recognize a true positive, the method explored identified protein motifs against the national cancer institute developmental therapeutic program compound database. in another study, the similarity of amino acid motifs to compounds web server was developed to ease screening of identified motif structures against bioactive compound databases. the methodology was reported to be efficient since the compound databases were preprocessed to maximize the accessible data, and the necessary input data was minimal. in similarity of amino acid motifs to compounds, motif matching can be full or partial that may decrease or enhance the number of potential mimetics, respectively. using a novel search algorithm, the web service can perform a fast screening of known or putative motifs against ready compound libraries. the classified results can be examined by linking to appropriate databases. , fragment-based method replacement with partial ligand alternatives through computational enrichment is a fragment-based approach. by using structures of peptide-bound proteins as design anchors, the program can computationally find a non-peptide mimetic for specific determinants of known peptide ligands. hybrid peptide-driven shape and pharmacophoric method development and application of strategies for pharmacophore modeling indicate that the medicinal chemistry community has broadly accepted the intuitive nature of the pharmacophore concept. besides, shape complementarity has been identified as a significant element in the molecular identification between ligands and their targets. in virtual screening efforts, using the pharmacophore-and shape-based techniques distinctly may increase the rate of false-positive results. therefore, incorporating both pharmacophore-and shape-matching techniques into one program can potentially diminish the rate of false positives. recently, to discover novel peptidomimetics, a weboriented virtual screening tool named pepmmsmimic was developed to pool the conventional pharmacophore matching with shape complementarity. a library of million conformers were extracted from . million commercially available chemicals and gathered in the mmsinc database. the database was used as a skeleton to develop farhadi and hashemian pepmmsmimic. in the pepmmsmimic interface, the d structure of a protein-bound peptide is used as an input. then, chemical structures able to mimic the pharmacophore and shape similarity of the original peptide are proposed to involve in the protein-protein recognition. a list of in silico methods used to design potential peptidomimetics along with their strengths and weaknesses is presented in table . overall, design and development of therapeutics are tedious, expensive and time-consuming procedures. therefore, using modern approaches including computer-aided design methods can lessen the examination phase, price and failure of therapeutics discovery. computational methods used to design amino acid-based therapeutics can increase the range of available biotherapeutics. benefiting from the dramatic advance in bioinformatics, computational tools can be used to find and develop therapeutic proteins, peptides and peptidomimetics. , moreover, using the computational tools decrease the cost of therapeutics development, from concept to market, by up to %. however, in the computational protein designing, there are some challenges such as our inadequate knowledge of folding and physical forces that stabilize protein structures. moreover, sequences and local structures have many degrees of freedom that can complicate the sequence search. therefore, there is a requirement for effective methods to find sequences related to a particular structure and measure essential protein folding criteria. overall, in silico design of amino acid-based therapeutics includes many challenges that should be removed to improve the overall performance of the design processes. for example, although structure determination of all disease-related proteins through crystallography and nmr is a laborious task, it is necessary to gather much structural information of peptide-protein interactions. besides, development of vigorous algorithms to calculate protein-protein binding energies is essential. the estimation of binding constant between two macromolecules with an appropriate speedaccuracy tradeoff needs millisecond scale molecular dynamics. moreover, understanding of both protein-protein and protein-peptidomimetics recognition processes in a molecular level can be improved using higher accurate force fields such as quantum mechanical polarizable force. in recent years, there are growing examples on the approval of monoclonal antibodies (therapeutic antibodies) by the fda for treatment of various diseases. this important area of amino acid-based therapeutics has been covered in more depth elsewhere. , for more explanation about the theorems and details of antibody informatics for drug discovery as well as the computer-aided antibody design, readers are referred to study previous reports. , the authors report no conflicts of interest in this work. computer-aided design of amino acid-based therapeutics what is the future of targeted therapy in rheumatology: biologics or small molecules protein therapeutics: a summary and pharmacological classification in silico methods for design of biological therapeutics constructing novel chimeric dna vaccine against salmonella enterica based on sopb and groel proteins: an in silico approach in silico phylogenetic analysis of vibrio cholera isolates based on three housekeeping genes designing of complex multi-epitope peptide vaccine based on omps of klebsiella pneumoniae: an in silico approach theoretical and computational protein design annu evaluation of in silico protein secondary structure prediction methods by employing statistical techniques inhibition of mycobacterial cyp enzyme by sesamin and β-sitosterol: an in silico and in vitro study theory of protein folding: the energy landscape perspective toward an outline of the topography of a realistic protein-folding funnel artificial diiron enzymes with a de novo designed four-helix bundle structure computer-enabled peptide drug design: principles, methods, applications and future directions docking small peptides remains a great challenge: an assessment using autodock vina empirical estimation of local dielectric constants: toward atomistic design of collagen mimetic peptides recent work in the development and application of protein-peptide docking rational design of peptide drugs: avoiding aggregation computational peptidology: a new and promising approach to therapeutic peptide design strategies employed in the design and optimization of synthetic antimicrobial peptide amphiphiles with enhanced therapeutic potentials multifaceted roles of disulfide bonds. peptides as therapeutics peptidomimetics, a synthetic tool of drug discovery an in silico pipeline for the design of peptidomimetic proteinprotein interaction inhibitors (order no. ) natural products as sources of new drugs over the last years diversity-oriented synthesis of macrocyclic peptidomimetics structure-based design, synthesis, and biological evaluation of peptidomimetic sars-cov clpro inhibitors advances in amino acid mimetics and peptidomimetics a hierarchical approach to peptidomimetic design mimicking peptides… in silico peptidomimetic design rational design for peptide drugs peptidomimetics as a cutting edge tool for advanced healthcare an unusual functional group interaction and its potential to reproduce steric and electrostatic features of the transition states of peptidolysis low molecular weight, non-peptide fibrinogen receptor antagonists neurotrophin small molecule mimetics: candidate therapeutic agents for neurological disorders design of peptides, proteins, and peptidomimetics in chi space molecular technology. designing proteins and peptides molecular engineering: an approach to the development of general capabilities for molecular manipulation x-ray versus nmr structures as templates for computational protein design high-resolution protein design with backbone freedom prediction of protein-protein interface sequence diversity using flexible backbone computational protein design backbone flexibility in computational protein design de novo computational design of retro-aldol enzymes one fold with many functions: the evolutionary relationships between tim barrel families based on their sequences, structures and functions search and sampling in structural bioinformatics the dead-end elimination theorem and its use in protein side-chain positioning application of a self-consistent mean field theory to predict protein sidechains conformation and estimate their conformational entropy design of protein-interaction specificity gives selective bzip-binding peptides computational methods for protein design and protein sequence variability: biased monte carlo and replica exchange exploring the conformational space of protein side chains using dead-end elimination and the a* algorithm side-chain and backbone flexibility in protein core design dead-end elimination with a polarizable force field repacks pcna structures improved prediction of protein side-chain conformations with scwrl trading accuracy for speed: a quantitative comparison of search algorithms in protein sequence design using self-consistent fields to bias monte carlo methods with applications to designing and sampling protein sequences computational design and characterization of a monomeric helical dinuclear metalloprotein statistical theory of combinatorial libraries of folding proteins: energetic discrimination of a target structure achieving stability and conformational specificity in designed proteins via binary patterning photophysics of a fluorescent non-natural amino acid: p-cyanophenylalanine an expanded eukaryotic genetic code a "solvated rotamer" approach to modeling watermediated hydrogen bonds at protein-protein interfaces rotamer libraries in the st century improved side-chain prediction accuracy using an ab initio potential energy function and a very large rotamer library potential energy functions for protein design de novo design of foldable proteins with smooth folding funnel: automated negative design and experimental verification assembly of protein tertiary structures from fragments with similar local sequences using simulated annealing and bayesian scoring functions structure by design: from single proteins and their building blocks to nanostructures computational de novo design and characterization of a four-helix bundle protein that selectively binds a nonbiological cofactor using α-helical coiled coils to design nanostructured metalloporphyrin arrays kemp elimination catalysts by computational enzyme design de novo design of a βαβ motif high-resolution structural and thermodynamic analysis of extreme stabilization of human procarboxypeptidase by computational protein design design of a novel globular protein fold with atomic-level accuracy novel peptide-specific quantitative structure activity relationship (qsar) analysis applied to collagen iv peptides with antiangiogenic activity development of an informatics platform for therapeutic protein and peptide analytics two-level qsar network ( l-qsar) for peptide inhibitor design based on amino acid properties and sequence positions recent development of peptide drugs and advance on theory and methodology of peptide inhibitor design predicting the affinity of epitope-peptides with class i mhc molecule hla-a* : an application of amino acid-based peptide prediction a brief overview of antimicrobial peptides containing unnatural amino acids and ligand-based approaches for peptide ligands machine learning assisted design of highly active peptides for drug discovery pep-fold: an updated de novo structure prediction server for both linear and disulfide bonded cyclic peptides in silico predictions of d structures of linear and cyclic peptides with natural and nonproteinogenic residues long-timescale molecular dynamics simulations of protein structure and function how fastfolding proteins fold bond distances in polypeptide backbones depend on the local conformation identification of tetrapeptides from a mixture based positional scanning library that can restore nm full agonist function of the l p, i t, i s, a v, c y, and c r human melanocortin- polymorphic receptors (hmc rs) the protein data bank protein design with fragment databases pepbind: a comprehensive database and computational tool for analysis of protein-peptide interactions the structural basis of peptide-protein binding strategies protein-peptide interactions adopt the same structural motifs as monomeric protein folds highly flexible protein-peptide docking using cabs-dock computer-aided design of amino acid-based therapeutics virtual screening for potential inhibitors of ctx-m- protein of klebsiella pneumoniae in silico panning for a non-competitive peptide inhibitor comparative evaluation of eight docking tools for docking and virtual screening accuracy in silico designing of peptide inhibitors against pregnane x receptor: the novel candidates to control drug metabolism computation of the binding of fully flexible peptides to proteins with flexible side chains a flexible docking procedure for the exploration of peptide binding selectivity to known structures and homology models of pdz domains dynadock: a new molecular dynamics-based algorithm for protein-peptide docking including receptor flexibility structure-based prediction of proteinpeptide specificity in rosetta pepcrawler: a fast rrt-based algorithm for high-resolution refinement and binding-affinity estimation of peptide inhibitors rosetta flexpepdock ab-initio: simultaneous folding, docking and refinement of peptides onto their receptors sub-angstrom modeling of complexes between flexible peptides and globular proteins anchordock: blind and flexible anchordriven peptide docking a unified conformational selection and induced fit approach to proteinpeptide docking cabs-dock web server for the flexible docking of peptides to proteins without prior knowledge of the binding site galaxypepdock: a proteinpeptide docking tool based on interaction similarity and energy optimization predicting peptide structures in native proteins from physical simulations of fragments mechanism of fast peptide recognition by sh domains binding free energy landscape of domain peptide interactions molecular dynamics simulations of pro-apoptotic bh peptide helices in aqueous medium: relationship between helix stability and their binding affinities to the anti-apoptotic protein bcl-xl structural and dynamic determinants of protein-peptide recognition quantitative sequenceactivity model (qsam): applying qsar strategy to model and predict bioactivity and function of peptides, proteins and nucleic acids recent advances in qsar and their applications in predicting the activities of chemical molecules, peptides and proteins for drug design comprehensive comparison of eight statistical modelling methods used in quantitative structure retention relationship studies for liquid chromatographic retention times of peptides generated by protease digestion of the escherichia coli proteome prediction of mhc-peptide binding: a systematic and comprehensive overview domain mediated protein interaction prediction: from genome to network calculation of absolute protein-ligand binding free energy from computer simulations prediction of binding affinities between the human amphiphysin- sh domain and its peptide ligands using homology modeling, molecular dynamics and molecular field analysis characterization of domain-peptide interaction interface: a generic structure-based model to decipher the binding specificity of sh domains why oppa protein can bind sequence-independent peptides? a combination of qm/mm, pb/sa, and structure-based qsar analyses characterization of pdz domain-peptide interactions using an integrated protocol of qm/mm, pb/sa, and cfea analyses computational design of peptide ligands exploring protein-peptide binding specificity through computational peptide screening multiple highly diverse structures complementary to enzyme binding sites: results of extensive application of a de novo design method incorporating combinatorial growth transformation of peptides into non-peptides. synthesis of computer-generated enzyme inhibitors towards the automatic design of synthetically accessible protein ligands: peptides, amides and peptidomimetics structure-based pharmacophores for virtual screening antimicrobial activity of small β-peptidomimetics based on the pharmacophore model of short cationic antimicrobial peptides small molecule inhibitors of hantavirus infection a dynamic target-based pharmacophoric model mapping the cd binding site on hiv- gp to identify new inhibitors of gp -cd protein-protein interactions alzheimer's therapeutics approved drug mimics of short peptide ligands from protein interaction motifs supermimic-fitting peptide mimetics into protein structures identification of potential small molecule peptidomimetics similar to motifs in proteins drug design, development and therapy web server to identify similarity of amino acid motifs to compounds (saamco) replace: a strategy for iterative design of cyclin-binding groove inhibitors swimming into peptidomimetic chemical space using pepmmsmimic mmsinc: a large-scale chemoinformatics database computational drug discovery developability assessment as an early de-risking tool for biopharmaceutical development antibody informatics for drug discovery computer-aided antibody design tumorhope: a database of tumor homing peptides drug-permeability and transporter assays in caco- and mdck cell lines pepx: a structural database of non-redundant protein-peptide complexes rosetta flexpepdock web server -high resolution modeling of peptide-protein interactions pdock: a new technique for rapid and accurate docking of peptide ligands to major histocompatibility complexes predicting peptide binding sites on protein surfaces by clustering chemical interactions protein-peptide complex prediction through fragment interaction patterns pep-sitefinder: a tool for the blind identification of peptide binding sites on protein surfaces vital: viterbi algorithm for de novo peptide design drug design, development and therapy : submit your manuscript | www.dovepress.com submit your manuscript here: http://www.dovepress.com/drug-design-development-and-therapy-journal drug design, development and therapy is an international, peerreviewed open-access journal that spans the spectrum of drug design and development through to clinical applications. clinical outcomes, patient safety, and programs for the development and effective, safe, and sustained use of medicines are the features of the journal, which has also been accepted for indexing on pubmed central. the manuscript management system is completely online and includes a very quick and fair peer-review system, which is all easy to use. visit http://www.dovepress.com/testimonials.php to read real quotes from published authors. key: cord- - goljir authors: yuan, meng; song, zi-han; ying, mei-dan; zhu, hong; he, qiao-jun; yang, bo; cao, ji title: n-myristoylation: from cell biology to translational medicine date: - - journal: acta pharmacol sin doi: . /s - - - sha: doc_id: cord_uid: goljir various lipids and lipid metabolites are bound to and modify the proteins in eukaryotic cells, which are known as ‘protein lipidation’. there are four major types of the protein lipidation, i.e. myristoylation, palmitoylation, prenylation, and glycosylphosphatidylinositol anchor. n-myristoylation refers to the attachment of -carbon fatty acid myristates to the n-terminal glycine of proteins by n-myristoyltransferases (nmt) and affects their physiology such as plasma targeting, subcellular tracking and localization, thereby influencing the function of proteins. with more novel pathogenic n-myristoylated proteins are identified, the n-myristoylation will attract great attentions in various human diseases including infectious diseases, parasitic diseases, and cancers. in this review, we summarize the current understanding of n-myristoylation in physiological processes and discuss the hitherto implication of crosstalk between n-myristoylation and other protein modification. furthermore, we mention several well-studied nmt inhibitors mainly in infectious diseases and cancers and generalize the relation of nmt and cancer progression by browsing the clinic database. this review also aims to highlight the further investigation into the dynamic crosstalk of n-myristoylation in physiological processes as well as the potential application of protein n-myristoylation in translational medicine. lipids are one of the principal components that structure the cell membrane and provide the barrier required for cells to survive. in addition, various lipids and lipid metabolites are also generated for modifying proteins in eukaryotic cells in a process known as 'protein lipidation'. generally, there are four major types of protein lipidation: myristoylation, palmitoylation, prenylation, and glycosylphosphatidylinositol (gpi) anchoring. these lipid modifications have been defined by different functional properties that are classified according to the characteristics of lipid attachment, the covalent bond, the specific sequence on the protein and the enzymes involved [ , ] . these lipidation distinctions impact the charge, hydrophobicity, and other aspects of targeted-protein chemistry, resulting in marked differences in the physiology of the targeted protein, such as its conformation, trafficking, localization, and binding affinity for cofactors. therefore, due to the deregulated lipid metabolism that occurs, protein lipidation may contribute to various diseases. this review primarily summarizes the current knowledge of nmyristoylation with updated study results and discusses the strategy of using n-myristoylation in the treatment of diseases. n-myristoylation consists of the addition of the -carbon fatty acid, myristate, to the n-terminal glycine residue of a protein via a covalent amide bond. in rare cases, including those of ras gtpases and tnf [ , ] , myristic acid is attached to a lysine residue [ ] through an amide bound, a process named lysine myristoylation. previously, it was observed that the myristate attaches to the nascent peptide in the first min of translation during protein biosynthesis. therefore, n-myristoylation is considered a cotranslational modification with the most accurate step occurring after the removal of the methionine initiator by methionine aminopeptidase (metap) (fig. a, b) . furthermore, it is recognized that nmyristoylation can also occur posttranslationally on an internal glycine exposed by caspase cleavage during apoptosis (fig. c) . for many protein signaling systems, the n-myristoyl moiety represents an essential feature and contributes to numerous effects, including changing protein stability, influencing protein-protein interactions, enhancing subcellular targeting to organelles or the plasma membrane, and so on [ ] [ ] [ ] [ ] [ ] . two nmt isozymes during the multiple enzymatic steps of n-myristoylation, the process of myristate transfer is completed by nmt, which is classified as a member of the gcn -related n-acetyltransferase (gnat) superfamily. some well-studied structural and biochemical investigations of yeast and human nmts revealed an ordered bi bi kinase mechanism (fig. d ) that involves a structural transformation of nmt during nmt catalysis. in fact, nmt is ubiquitous in eukaryotes while no protozoans possessed it. typically, lower eukaryotes (e.g., s. cerevisiae and drosophila) have only one isoform of nmt. however, for some mammalian species, including humans and mice, two isozymes have been identified, referred to as nmt and nmt . these isozymes are encoded by distinct genes but share approximately % peptide sequence identity with unique substrate specificities in the n-terminus, suggesting a distinct physiologic role of each isozyme in mammals. additionally, each isozyme has a conserved sequence in the catalytic domain of divergent species, implicating an essential role for each gene family throughout evolution. in humans, there are four isoforms of nmt and two isoforms of nmt , which are translated from splice variants of mrna with different reading frames. the main differences among the nmt isoforms are found in the n-termini. although the n-terminal structure of these nmts does not contribute to the construct of the kinase pocket, some investigations have characterized an nterminal truncation that increases kinase activity without affecting enzyme stability [ ] . in addition, the isoforms of these nmts may have specific effects on their intracellular localization and substrate selectivity [ , ] , suggesting that nmts are involved in diverse physiological processes. physiological roles of nmts unequivocally, nmts play essential roles in the survival and cell proliferation of diverse species [ , ] . some evidence suggests a principal role of nmt in the embryonic development of mice. in normal mice, the expression level of nmt is similar to that of nmt in a wide variety of tissues, but it is higher than the nmt levels in embryos. knocking out nmt severely impaired the differentiation ability of embryonic stem cells. homozygous nmt −/− -knockout mice were not born alive, and induction of nmt activity alone was unable to elicit the survival of heterozygous nmt +/− mice. in addition, it was reported that the subcellular localization and catalytic activity of both nmt and nmt were altered during apoptosis. nmt was transported to the cytosol from ribosomes and membranes following caspase- -and caspase- -mediated nmt cleavage, and % of the nmt activity was eliminated h after the induction of apoptosis. however, the relocalization of the cytosolic fraction to the membrane and reduced activity of nmt were also found under the same conditions. in addition, the depletion of nmt caused a . -fold increase in the apoptosis rate compared to the apoptosis rate upon depletion of nmt [ ] . this evidence led to the speculation that nmt may be responsible for ribosome-based cotranslational n-myristoylation, while nmt may be the major contributor to apoptosis-related posttranslational n-myristoylation. in summary, the widespread and conservative presence of nmt in different species emphasizes its importance in basic physiological processes, such as embryonic development, while nmt appears only in higher-level organisms, suggesting that it is necessary for sophisticated physiological processes. in the future, in-depth biochemical and pharmacological research is expected to improve the understanding of the unique roles of nmt and nmt as they apply to the clinic. protein demyristoylation some reports have revealed the demyristoylation ability of some proteins. human sirtuin (sirt ), which is a member of the lysine deacetylase sirtuin protein family, was found to exhibit more efficient demyristoylase activity than deacetylase activity [ ] . the crystal structure implied that, in complex with a thiomyristoyl peptide, sirt has a dominant hydrophobic pocket that can adopt a myristoyl group. the hydrophobic acyl pocket of sirt resembles that of sirt , which has been previously demonstrated to possess efficient fatty acid deacylase activity [ ] . in addition, the two pockets are different in certain aspects, suggesting differential adoption of fatty acid chains. moreover, the other homologs, sirt and sirt , each have a hydrophobic acyl pocket very similar to that of sirt , hinting at analogous demyristoylation activities among them. the shigella virulence factor ipaj was identified as an irreversible demyristoylase [ ] that cleaves the peptide bond between nmyristoylated gly- and asn- in some n-myristoylated proteins such as human adp-ribosylation factor p (arf ) and c-src. this irreversible demyristoylation mechanism provides a new approach to exploring the functional effects of n-myristoylated proteins in human health and diseases. in most cases, n-myristoylation on a protein is irreversible, indicating that the myristoyl motif may orient the protein toward a specific destiny, as if it is pressing a button that will irrevocably affect the dynamics of the protein and its subsequent pathway. therefore, it is reasonable to study the interactions among the factors of n-myristoylation and those of biological signaling pathways to understand the significant role of n-myristoylation. although n-myristoylation is irreversible, it cannot shield the myristoylated protein from cross talk. in contrast, cross talk is regarded as a means of interfering with n-myristoylation functions. it has been proposed that one protein modification might initiate the signaling that leads to the addition or removal of a second protein modification or the binding of another protein, suggesting that cross talk between protein modification components may serve as an important bypass of regulating protein functions. for example, both methylation and phosphorylation are able to trigger acetylation of histones [ ] . here, while introducing the physiological functions of nmyristoylation, we also delineate the cross talk of nmyristoylation components with signaling constituents in light of well-established studies to explore the robust role of nmyristoylation in cell biology. dynamic structural changes in membrane anchoring and intracellular trafficking one of the major functions of n-myristoylation is to facilitate protein binding in membranes. in fact, peitzsch and mclaughlin established a tenet stating that the myristoyl motif is insufficient for the stable anchorage of a protein to a lipid bilayer [ ] . therefore, a second signal, comprising a group of hydrophobic residues, positively charged amino acids or another lipid moiety, is required for stable membrane attachment. in one scenario called the 'ligand-dependent switch' (fig. a) , the conformation of a protein is changed upon ligand binding, exposing the myristoyl motif that attaches to a component in the lipid bilayer. for example, the gtp-myristoyl switch facilitates the membrane interaction of arf [ , ] . the exposed myristoyl motif and the basic hydrophobic residues in the n-terminus facilitate the interaction of arf -gtp with the membrane. the second scenario refers to a cluster of positively charged amino acids that are associated with a cofactor, such as calcium (fig. b) , or are phosphorylated (fig. c) ; the former cluster accumulates a positive charge to strengthen membrane binding, while the latter attenuates the positive charge to weaken membrane binding and cause membrane dissociation. the binding of two calcium ions to ef-hand motifs in the recoverin protein facilitates the exposure of a myristoyl group from a hydrophobic cavity to solvent (fig. b ) [ ] . another example is the myristoylated alanine-rich c kinase substrate (marcks) protein. the phosphorylation of serine residues contributes to its membrane dissociation, since the phosphate moiety reduces the positive charge ( fig. c ) [ ] . ece c. gaffarogullari et al. [ ] proposed a novel myristoyl/phosphorylation switch in a pka-c model of membrane attachment. pka-c maintains conformational equilibrium between a myr-in state and a myr-out state, which refer to the myristoyl group tucked into the hydrophobic pocket of the pka-c or extruded from the hydrophobic pocket, respectively. in the myr-out sate, the exposed myristoyl group inserts into the lipid bilayer and facilitates pka-c binding to the membrane. therefore, it was proposed that a large population of pka-c maintains the myr-in state distal to the membrane and that pka-c shifts to the myr-out state in the proximity of the plasma membrane or when phosphorylated at ser independent of the regulatory subunit. the palmitoyl moiety, which consists of a -carbon fatty chain, also induces signal transduction (fig. d) . it was observed that h-ras requires both a myristoyl group and palmitoyl group to target the membrane and stimulate kinase activity, whereas only n-myristoylated h-ras acts a substrate for palmitoyltransferases and therefore is palmitoylated [ , ] . the n-myristoylationnegative mutants of cdpk [ ] and fibroblast growth factor receptor substrate α (frs α) [ ] do not incorporate palmitate and therefore do not attach to the plasma membrane. moreover, proteins with myristoyl moieties show a tendency for inclusion in the membrane fraction. for example, it was reported that both n-myristoylation and palmitoylation appear to have opposing roles and different membrane lipid microdomain preferences for the g protein-membrane interactions i (gαi ) monomer, which are likely due to the conformational differences in the presence of different fatty acids [ ] . the gαi protein that is n-myristoylated but not palmitoylated preferentially anchors to lamellar-prone membrane domains with a net negative charge. in contrast, the gαi protein that is both n-myristoylated and palmitoylated preferentially localizes to raft-like lamellar membranes without negative charges. n-myristoylated akt (myrakt ) exhibits a distinct substrate preference that is not exhibited by the cytosolic or the membrane fractions that do not include lipid rafts, indicating that akt for which oncogenicity is conferred by nmyristoylation is enriched in lipid rafts [ ] . in addition, the evidence that simvastatin or cholesterol depletion preferentially ablates the phosphorylation of myrakt in lipid rafts suggests that myrakt is modified in a cholesterol-sensitive manner (fig. e ). it is well known that the most organelles in cells, such as the endoplasmic reticulum (er), mitochondrion and endosome, are bound to the plasma membrane and that these membrane interactions produce different effects. in addition, some proteins must undergo n-myristoylation for subcellular trafficking and localization. some mitochondria-related proteins, such as tomm , samm [ ] and, clpabp [ ] , were demonstrated to require the function of myristoylated proteins to bind to the mitochondrial outer membrane. in one mechanism, the hydrophobic myristoyl group motif increases dependence of the protein to reduce cytoplasmic shuttling. for example, in mice, binding domain-containing protein (stbd ) was observed to be a transmembrane resident protein, and nonmyristoylated stbd was shuttled with ease between the er and mitochondria [ ] . ring finger protein (rnf ) colocalizes with both early and recycling endosomes. while rnf requires n-myristoylation and spalmitoylation for membrane binding, n-myristoylation plays a greater role [ ] . the removal of myristoylate results in protein diffusion. in another mechanism, the hydrophobicity of the myristoyl moiety increases protein flexibility, facilitating its shuttling through hydrophobic areas. in a yeast model [ ] , the proteasome subunit rpt is cotranslationally n-myristoylated. loss of rpt n-myristoylation causes abnormal dynamic nucleuscytoplasm translocation of proteasomes and the aberrant accumulation of ubiquitinated protein levels (fig. f) . requirements for protein assembly and protein-protein interaction many proteins require assembly for maturity and function, and some evidence indicates that n-myristoylation drives the aggregation of proteins in some cases [ ] . spassov et al. reported that src dimerization is mediated by the myristoylated n-terminal region of one partner interacting with the hydrophobic kinase domain of its counterpart [ , ] . both y autophosphorylation and dimerization are codependent and activate src kinases (fig. a) . the functions of src kinases are disrupted by interfering dimerization, emphasizing the importance of n-myristoylation to src activity. in addition, the myristoyl-group-driven aggregation in lipid bilayers is also observed for h-ras [ ] . some protein-protein interactions require a myristoyl group. the brain-specific protein cap- /nap binds calmodulin (cam) with high affinity even though it lacks any canonical cam-binding domain. a crystal structure of ca + -cam-cap- /nap -cam complex showed that the myristoyl group of cap- /nap passes through a hydrophobic tunnel created by two hydrophobic components exclusively in the pockets of the terminus of cam, implying the direct involvement of the myristoyl group in this interaction. further, the interaction of calmodulin induces the phosphorylation of cap- /nap- [ ] . in hiv infections, both nef and gag are myristoylated by the host nmt cell to execute proper functions. in the assembly of hiv, nef and gag proteins are essential for infection. nef has many virulence factors that attenuate the immune system recognition of infected cells and enhance infectivity and viral replication [ ] , and gag is a precursor protein for structural components of the viral capsid. because this virus lacks nmt proteins, both nef and gag are myristoylated by the host nmt cell, and the gag-gag interaction triggers an entropic switch toward a myristoyl-exposed conformation, providing the impetus for protein assembly (fig. b) . in contrast, in beta-retroviruses, such as mouse mammary tumor virus (mmtv), the oligomerization of the matrix (ma) domain, which contains the n-terminal residue in gag, adopts a myristoylsequestering conformation [ , ] . moreover, the results from screens of the interaction of the host factors with the hiv- ma domain showed that heme oxygenase (ho- ) specifically binds the myristoyl moiety of gag via a hydrophobic channel adjacent to its heme-binding pocket, which inhibits virus production. in addition, ho- binds n-myristoylated tram, an adaptor protein for toll-like receptor (tlr ) [ ] , which inhibits the tramdependent lipopolysaccharide(lps)-tlr pathway. these findings suggest ho- is a novel cellular n-myristoylated protein that negatively regulates both virus replication and host inflammatory responses [ ] . recent studies have revealed that a glycine positioned in the nterminus can act as a potent degron, indicating that nmyristoylation may contribute to the removal of proteolytic cleavage products. richard t. timms et al. [ ] identified two cul cullin-ring e ubiquitin ligase complexes called cul zyg b and cul zer , both of which target n-myristoylated proteins for proteasomal degradation by recognizing n-terminal glycine degrons, which presumably play important roles in the quality control of protein n-myristoylation. nonmyristoylated c-src has enhanced stability compared to that of soluble n-myristoylated c-src [ ] . borja belda-palazon et al. [ ] found that rglg , an e ligase in arabidopsis, is n-myristoylated by nmt . n-myristoylation facilitates the attachment of rglg to the plasma membrane. the phytohormone abscisic acid (aba) induces the degradation of pp ca, which is predominantly localized in the nucleus, through rglg / e ligases. this degradation mechanism was found by aba downregulation of nmt , which hindered the nmyristoylation of rglg and promoted its translocation to the nucleus, where it interacted with pp ca, increasing pp ca degradation. in addition, in aspergillus nidulans [ ] , the swof gene was found to encode an nmt. the enhanced activity of the s proteasome and the accumulation of ubiquitinated substrates in the n-myristoylate-deficient swof mutant, compared with the activity and accumulation in wild-type strain, resulted in impaired cell morphogenesis. alterations to signaling pathways n-myristoylation influences downstream kinase activity directly or indirectly, usually by the mechanisms described above, through ras and src. c-abl is a member of the src family of tyrosine kinases. a 'myristoyl/phosphotyrosine' switch has been identified in the regulation of the kinase activity of c-abl [ ] . nmyristoylation locks the protein into an autoinhibitory conformation when the sh domain docks to the kinase domain. in contrast, myristoylation leads to an unpredicted function: c-src is induced into a conformation optimal for kinase activity. a series of studies have unequivocally demonstrated that n-myristoylation of the β-subunit is a prerequisite for the initiation of ampk signaling in response to amp [ , , ] . therefore, in the case of nmt deficiency or upon cell treatment with an nmt inhibitor, suppressed n-myristoylation diminished the extent of α-thr phosphorylation of amp and abolished its activation, thereby causing multiple morbid physiological outcomes. moreover, the myristoyl switch regulated by amp may affect the selectivity of the substrates and serve as a gatekeeper for transducing signals of metabolic stress [ ] . recently, it was suggested that pka-c n-myristoylation may account for the enriched kinase activity at the membrane and the preferential phosphorylation of membrane substrates. these findings indicate an important role for n-myristoylation in synaptic function normality and plasticity during normal pka regulation [ , ] . in another pattern of results from recent studies, several proteins were found to phosphorylate nmt, resulting in the activation of nmt. rajala et al. [ ] verified that lyn and fyn are nmyristoylated by nmt before they phosphorylate nmt. in addition, it has been hypothesized that the interaction between nmt and the lyn tyrosine kinase proceeds in a phosphorylationdependent manner [ ] . manipulation of substrates during apoptosis apoptosis, or programmed cell death, is the physiologically and tightly controlled process for eliminating unnecessary or redundant cells in eukaryotic organisms. in addition, dysfunctional apoptosis is one of the characteristics of malignant cells. apoptosis is orchestrated by a signaling cascade of proteases called caspases (cysteine-containing aspases), which leads to the exposure of novel n-termini susceptible to various posttranslational modifications. since the initial discovery of myristoylated bid, it has been revealed that multiple proteins are modified posttranslationally by n-myristoylation upon caspase cleavage. using a myristoylated protein-labeling method in jurkat t cells and mcf- cells, at least and posttranslationally nmyristoylated proteins were discovered, respectively, during apoptosis [ ] . further characterization of these proteins revealed that caspase truncated (ct)-gelsolin and ct-pkc have antiapoptotic roles while ct-bid and ct-pak have proapoptotic roles upon posttranslational n-myristoylation, suggesting that posttranslational n-myristoylation plays an important role in maintaining a balance between cell death and survival. moreover, it has been found that nmt and nmt are substrates of different caspases. furthermore, n-myristoylation may be involved in sphingolipid biosynthesis pathway, which includes ceramide synthesis and is involved in the induction of apoptosis. in the last step of ceramide biosynthesis, a trans △ -double bond in the carbon chain of dihydroceramide is specifically catalyzed by dihydroceramide △ desaturase (des ), which is n-myristoylated by nmt [ ] . myristic acid upregulates novel des activity, which differs from that of saturated fatty acids, possibly because of the accumulation of n-myristoylated des . n-myristoylation causes a proportion of des to be translocated from the er to the mitochondrial outer membrane, leading to an increase in ceramide levels. since the production of ceramide can induce apoptosis by inducing mitochondrial dysfunction [ ] , the myristic acid-associated increase in des activity can enhance apoptosis induction in cells, which shows the importance of n-myristoylation in the process of apoptosis. n-myristoylated lancl , which belongs to the eukaryotic lanc-like protein family, is known as the testisspecific adriamycin sensitivity protein (tasp). one biochemical study identified a phosphatidylinositol phosphate (pip)-binding site in the n-terminus of lancl . in addition, both nmyristoylation and pip binding are crucial for lanc binding to the membrane. it is possible that the overexpression of lancl increases cell sensitivity to adriamycin, which is dependent on both n-myristoylation and membrane association [ ] . in summary, we can conclude that a myristoyl group as a hydrophobic motif always stably binds to substrates to change the conformation and increase the hydrophobicity of the protein. further, it affects protein localization and the ease with which a protein binds to substrates. however, insufficient hydrophobicity of the -carbon myristoyl group leads to it being influenced by its surroundings. the hydrophilic group can ease protein binding, and the hydrophobic group can stabilize the state of the n-myristoylated protein. it seems that n-myristoylation triggers consequential functions of protein necessarily but insufficiently. accumulating evidence has demonstrated that n-myristoylation is an evolutionarily conserved lipidation that is essential for cell viability in different organisms. currently, interventions of protein n-myristoylation are principally achieved through nmt inhibitors. therefore, nmt inhibitors are very suitable for use against diseases caused by proliferating cells or pathogens, such as infectious diseases caused by various pathogens and malignancies. in fact, broadly classified inhibitors for each process in nmt-induced disease are designated as antifungal and antiviral agents. typically, these inhibitors can be divided into three categories [ ] : ( ) myristoyl-coa and myristate derivatives. although the myristoyl-coa binding sites in human nmt and fungal nmt are highly conserved, their peptide-binding sites are divergent, which provides an explanation for the selectivity of inhibitors that do not induce adverse toxicity in humans. ( ) histidine analogs. in most conserved regions in the catalytic domain eeveh ( - ), histidine is critical for the myristoyl-coa transfer [ ] . ( ) myristoyl peptide derivatives. as previously mentioned, lipid metabolism disorders can affect protein lipidation [ ] . various saturated and unsaturated fatty acids have been evaluated as potent inhibitors of human nmt . indeed, the development of nmt inhibitors is not limited to proteins involved in infectious diseases because they are great prospects as immunodeficiency and cancer treatments. as mentioned previously, nmt has been characterized as the principal enzyme for the early development of embryogenesis. further investigation was focused on the role of nmt in myelopoiesis through which blood cell types are matured. bone marrow-derived macrophages (bmdms) from nmt +/− -deficient mice have defective morphology compared with that of mature bmdms in wild-type mice. during bmdm maturation, the nmt activity increased during the initial period and then decreased for the remaining time due to differential nmt expression. although the nmt activity in the bmdms of nmt +/− -deficient mice followed a trend similar to that of the bmdms in the wild-type mice, the nmt expression levels were reduced to approximately one-half in the mutant mice. this report addressed an essential role of nmt during monocytic/macrophage differentiation. of course, studies are continuously uncovering valuable evidence not only for nmt but also for nmt in both normal and malignant hematopoiesis. a bioinformatic database of gene expression in different cancer cell lines revealed decreased nmt expression and preserved nmt expression in some hematological cancer cell lines, such as burkitt lymphoma, diffuse large b-cell lymphoma and acute myeloid leukemia (aml) cell lines. a study performed by ryan stubbins et al. suggested a correlation between nmt and aml. nmt and nmt protein levels in the marrow aspirates or peripheral blood of aml patients were assayed by fluorescence-activated cell counting with intracytoplasmic staining, expressed as the mean fluorescent intensity (mfi). the evidence showed that the nmt mfi was higher in the lymphocytes and lower in the monocytes, suggesting that the regulation of nmt protein levels may influence early lymphoid/myeloid lineage commitment. further, the overall trend revealed by a survival analysis showed higher nmt mfi values, portending a worse prognosis for aml patients and suggesting a role for nmt as a novel prognostic biomarker for intermediate-risk aml [ ] . in terms of adaptive immunity, it has been reported that nmyristoylation is indispensable for t cell development [ , ] . a comparative analysis of thymuses showed that mice with deficient nmt or nmt levels had reduced medullary volume, which was % and % lower, respectively, than it was in wild-type mice, and mice with double nmt and nmt deficiency had a much greater medullary volume reduction ( %). these findings suggest nonredundant roles for both nmts in maintaining the development of thymocytes, since the thymus has high nmt activity levels [ ] . in agreement with these results, it was demonstrated that n-myristoylation and its potential applications m yuan et al. t-cell receptor (tcr) signaling is disrupted in mice with t-cell lineages characterized by specific nmt and nmt activity deficiencies and results in retarded thymocyte development. these outcomes may be attributable to the mislocalization of nmyristoylation-deficient src family tyrosine kinases, such as lck or fyn [ ] . for example, n-myristoylation contributes to the cytomembrane targeting of fyn and facilitates its binding with the z chain of tcr. non-n-myristoylated lck is in the cytoplasm and unable to facilitate tcr signaling. however, the regulation of n-myristoylation or nmt activity in these processes during adaptive immunity has not been well established to date. to gain in-depth understanding of the regulation of immune responses, further investigation into the specific roles of nmyristoylation, which may involve nmt as a potential target of immune modulation, is warranted. parasitic and other infectious diseases it is known that some n-myristoylated proteins in small rna viruses and retroviruses are essential for virus assembly during viral replication or production of infectious viral particles, suggesting that n-myristoylation may be related to the survival and propagation of pathogens. in addition, some pathogens need to utilize host cellular machinery to replicate within host cells due to deficiency in viral nmts. in immunosuppressed patients, cryptococcus neoformans can easily cause chronic meningitis; however, it is unable to survive at °c if it harbors mutant nmt with reduced activity. some disease-causing parasitic protozoa such as falciparum (malaria), leishmania donovani (leishmaniasis), and trypanosoma brucei (african sleeping sickness) retain the nmt necessary for their survival. given that the peptide-binding pocket of nmt is not strictly conserved across species, the search for species-specific nmt inhibitors focused on the binding pocket is worthwhile [ ] . notwithstanding, current anti-infectious agents have some drawbacks, such as drug resistance, poor oral bioavailability, and renal toxicity. a group of studies have suggested nmt as a novel target for use in anti-infective drugs. theoretically, according to the respective skeletal structures, nmt inhibitors are classified into four categories (table ) . among these inhibitors, benzofuran and, benzothiazole have high species selectivity [ ] . anti-infective drugs targeting nmt have certain advantages. first, myristoyl-coa is found at very low accumulation levels, at approximately nm, in mammalian cells. in addition, the strict hydrophobic structure of the fatty carbon chains reduces the likelihood that incompatible fatty acid groups will be recognized by the nmt, even in cases where the palmitic acid concentration is higher than the myristic acid in vitro [ ] . these physiological phenomena suggest that a small amount of inhibitor can have a good inhibitory effect. second, nmt inhibitors can be designed for high selectivity because of the significant differences in substrate specificity in the human and parasitic organisms. moreover, many fungal and parasites must use the nmt of the host to synthesize essential proteins for their own reproduction. potential targets of cancer treatments given that altered nmt expression is observed in many types of cancer tissues and because many n-myristoylated proteins are involved in signaling processes that regulate cell proliferation, growth and death, it has been proposed that n-myristoylation or nmts can be considered as therapeutic targets for cancer. the premise for their use is based on a thorough understanding of the abnormal regulation of n-myristoylation in carcinogenesis. for instance, given that n-myristoylation can facilitate src-mediated prostate tumorigenesis, the myristoyl-coa analog b and its derivative lcl [ , ] have been identified as inhibitors of nmt enzymatic activity and blockers of src n-myristoylation; their actions are based on competing for myristoyl-coa binding site, which provides a promising approach to inhibit src family kinase-mediated oncogenic activity, and offers preclinical support for the use of protein n-myristoylation inhibitors in treating cancer [ ] . an organopalladium compound, tris(dibenzylideneacetone) dipalladium (tris dba) was identified as a novel human nmt inhibitor that not only blocks the kinase activity of nmt but also reduces its expression. it showed potent antiproliferative activity against melanoma cells by inhibiting several proliferation-related signaling pathway proteins including mapk, akt, and stat- [ ] . the most successful examples of myristoylation-related anticancer inhibitors are allosteric abl tkis such as abl [ ] [ ] [ ] , gnf and gnf [ ] (table ) . these allosteric inhibitors selectively target the myristoyl-binding pocket in the c-lobe of the abl kinase domain and not the atp-binding pocket. moreover, these inhibitors increase the sensitivity of a bcr-abl (t i) mutant to atp-competitive tkis. a series of phase i clinical trials of abl and tkis (nilotinib, imatinib, and dasatinib) therapy have been ongoing for cml and ph+all patients (https://clinicaltrials.gov/ ct /show/nct ). some studies have shown the potential targets of nmt in cancer cells. the cancer genome atlas (tcga) reports a group of genomic alterations induced by nmt and nmt in several cancers. although the occurrence of somatic mutations is more common than the occurrence of genomic amplification in nmt and nmt , the roles of these mutations in regulating pathological mechanisms remain to be determined (https://www.cbioportal. org/). moreover, patients with diseases such as liver cancer, cervical cancer, and lung cancer and high expression levels of nmt or nmt are more likely to have a poor prognosis, as indicated consistently in some reports. in addition, it is noteworthy that a cohort with renal cell carcinoma with either high expression of nmt or low expression of nmt was found to have worse overall survival, suggesting the unique roles of these proteins in kidney cancer. however, no evidence showed that their catalytic activity is linked to tumor progression in kidney cancer (http:// kmplot.com/analysis/). in addition, several nmt activators that enhance nmt enzyme activity have been identified. l-histidine and d-histidine can activate human nmt activity in a concentration-dependent manner. this finding suggests that two analogs may be involved in myristoyl-coa transfer by interacting with his- of nmt. naf was identified as an nmt activator factor in bovine brain [ ] . the reconstitution of naf and nmt likely resulted in an open conformation where the active site is expanded. a novel stratagem of nmt activation is enhanced nmt specificity for specific myristoyl-coa substrates. on the basis that nmt can transfer abundant palmitoyl-coa but can use only the rare myristoyl-coa for acylation of a substrate protein, eric soupene et al. [ ] determined that the acyl-coa-binding protein domain (acbd ) stimulates the activity of nmt . acbd can block the access of palmitoyl-coa to the nmt -binding site and enhance its catalytic activity, which requires interaction between nmt and acbd . a mutant acbd unable to interact with nmt or with deficient ligand binding at its own n-terminus does not stimulate nmt activity. in this review article, we outlined advanced studies of nmyristoylation, focusing on the role of protein n-myristoylation in physiology and pathology. the important role of nmyristoylation underlies the early stages of protein maturation. after the protein is folded in the golgi apparatus or the er, cotranslationally n-myristoylation is likely to influence the transport and localization of the protein, which can affect the biofunction of protein. the insufficient nmt kinase activity in pathogens and the variations in different species emphasize the selective lethality of nmt inhibitors for infectious diseases for which either no valid drug is qualified or for which available drugs induce drug resistance. the breakthroughs of nmt inhibitors will likely be as tumor treatments. abl is being pioneered continuously for cml and ph+all patients in phase i trials, which suggests a strategy focused on the n-myristoylation of oncoproteins. furthermore, posttranslational n-myristoylation in the apoptotic process suggests the participation of nmts, specifically nmt , in cell death. the function of the n-myristoylated protein in the apoptotic process, whether pathogenically or physiologically normal, can further indicate the orientation of the treatment strategy for targeting nmt. the knowledge of other protein modifications, such as ubiquitination or palmitoylation, which involves a multimember kinase family, can inform many specific enzyme-substrate studies and the design of selective inhibitors. in contrast, scientists need more precise approaches for analyses of nmts and nmyristoylation. nevertheless, traditional methods based on radioactively labeled probes to detect n-myristoylation are insensitive and time consuming, and the technical difficulties in obtaining three-dimensional structures of myristate-attached proteins need to be overcome. further, bioinformatics on n-myristoylation and nmts is still in its infancy; however, since databases are integrated, these data provide many clues linked to other biological fields. regardless of the technical difficulties, it is worth exploiting novel nmt inhibitors as single agents and exploring the potential drug synergies that might improve multiple clinical applications and enhance therapeutic efficacy, reverse drug resistance, or extend the therapeutic index for drugs already used in the clinic. it is likely that two approaches [ ] can be used to reveal the prospects of nmt inhibitors for oncology therapy in the future: ( ) identify currently unknown sensitivities of certain cancer types by widely screening cancer cell line panels and ( ) discover the essential sensitivity or resistance mechanisms in resistant cell lines and wild-type cell lines by proteomic analyses of nmt substrate profiles and proteomic changes. as our knowledge of the biochemistry and cell biology of n-myristoylation continues to grow, more substrate proteins will be identified, and scientists will continue to deduce the effects of this lipid attachment on protein structure and function. protein lipidation in cell signaling and diseases: function, regulation, and therapeutic opportunities protein lipidation sirt and lysine fatty acylation regulate the transforming activity of k-ras a hdac regulates type i interferon signaling through defatty-acylation of shmt protein lysine acylation and cysteine succination by intermediates of energy metabolism protein myristoylation in health and disease cholesterol sensitivity of endogenous and myristoylated akt myristoylated naked antagonizes wnt-beta-catenin activity by degrading dishevelled- at the plasma membrane myristoylation of the dual-specificity phosphatase c-jun n-terminal kinase (jnk) stimulatory phosphatase is necessary for its activation of jnk signaling and apoptosis inhibition of enterovirus vp myristoylation is a potential antiviral strategy for hand, foot and mouth disease n-terminal region of the catalytic domain of human nmyristoyltransferase acts as an inhibitory module hiv- production is specifically associated with human nmt long form in human human n-myristoyltransferase amino-terminal domain involved in targeting the enzyme to the ribosomal subcellular fraction fatty acids regulate germline sex determination through acs- -dependent myristoylation comparison of myristoyl-coa: protein n-myristoyltransferases from three pathogenic fungi: cryptococcus neoformans, histoplasma capsulatum, and candida albicans two n-myristoyltransferase isozymes play unique roles in protein myristoylation, proliferation, and apoptosis efficient demyristoylase activity of sirt revealed by kinetic and structural studies sirt regulates ras-related protein r-ras by lysine defatty-acylation nmyristoyltransferase is essential in early mouse development protein kinase c coordinates histone h phosphorylation and acetylation molecular determinants of the myristoyl-electrostatic switch of marcks structural basis for activation of arf gtpase: mechanisms of guanine nucleotide exchange and gtp-myristoyl switching structure and membrane interaction of myristoylated arf sequestration of the membranetargeting myristoyl group of recoverin in the calcium-free state myristoylation-dependent n-terminal cleavage of the myristoylated alanine-rich c kinase substrate (marcks) by cellular extracts a myristoyl/phosphoserine switch controls camp-dependent protein kinase association to membranes c-terminal kda fragment of cytoskeletal actin is posttranslationally n-myristoylated upon caspase-mediated cleavage and targeted to mitochondria n-terminally myristoylated ras proteins require palmitoylation or a polybasic domain for plasmamembrane localization membrane localization of a rice calcium-dependent protein kinase (cdpk) is mediated by myristoylation and palmitoylation myristoylationdependent palmitoylation of the receptor tyrosine kinase adaptor frs alpha g proteinmembrane interactions i: galphai myristoyl and palmitoyl modifications in protein-lipid interactions and its implications in membrane microdomain localization identification and characterization of protein n-myristoylation occurring on four human mitochondrial proteins, samm , tomm , mic , and mic role of n-myristoylation in stability and subcellular localization of the clpabp protein mouse stbd is n-myristoylated and affects er-mitochondria association and mitochondrial morphology multiple modification and protein interaction signals drive the ring finger protein (rnf ) e ligase to the endosomal compartment n-myristoylation of the rpt subunit of the yeast s proteasome is implicated in the subcellular compartment-specific protein quality control system aggregation of lipid-anchored full-length h-ras in lipid bilayers: simulations with the martini force field a dimerization function in the intrinsically disordered n-terminal region of src a myristoyl-binding site in the sh domain modulates c-src membrane anchoring crystal structure of a myristoylated cap- /nap- n-terminal domain complexed with ca +/calmodulin a myristoyl switch regulates membrane binding of hiv- gag myristoylation drives dimerization of matrix protein from mouse mammary tumor virus the myristoylation of trif-related adaptor molecule is essential for toll-like receptor signal transduction heme oxygenase binds myristate to regulate retrovirus assembly and tlr signaling a glycine-specific ndegron pathway mediates the quality control of protein n-myristoylation myristoylation and membrane binding regulate c-src stability and kinase activity aba inhibits myristoylation and induces shuttling of the rglg e ligase to promote nuclear degradation of pp ca a novel interaction between n-myristoylation and the s proteasome during cell morphogenesis a myristoyl/phosphotyrosine switch regulates c-abl beta-subunit myristoylation is the gatekeeper for initiating metabolic stress sensing by ampactivated protein kinase (ampk) n-myristoyltransferase deficiency impairs activation of kinase ampk and promotes synovial tissue inflammation exploring protein myristoylation in toxoplasma gondii liberated pka catalytic subunits associate with the membrane via myristoylation to preferentially phosphorylate membrane substrates myristoylated alanine-rich c-kinase substrate effector domain phosphorylation regulates the growth and radiation sensitization of glioblastoma phosphorylation of human n-myristoyltransferase by n-myristoylated src family tyrosine kinase members potential role of n-myristoyltransferase in cancer rapid detection, discovery, and identification of post-translationally myristoylated proteins during apoptosis using a bio-orthogonal azidomyristate analog myristic acid increases the activity of dihydroceramide delta -desaturase through its nterminal myristoylation monounsaturated fatty acid modification of wnt protein: its role in wnt secretion myristoylation of human lanc-like protein (lancl ) is essential for the interaction with the plasma membrane and the increase in cellular sensitivity to adriamycin effects of l-histidine and its structural analogues on human n-myristoyltransferase activity and importance of eeveh amino acid sequence for enzyme activity expression and activity of n-myristoyltransferase in lung inflammation of cattle and its role in neutrophil apoptosis deficiency of n-myristoylation reveals calcineurin activity as regulator of ifn-gammaproducing gamma delta t cells myristoylation: an important protein modification in the immune response n-myristoylation of ampk controls t cell inflammatory function the residue at position of the n-terminal region of src and fyn modulates their myristoylation, palmitoylation, and membrane interactions vhy, a novel myristoylated testis-restricted dual specificity protein phosphatase related to vhx novel analogs of d-e-mapp and b . part : signature effects on bioactive sphingolipids role of long-chain fatty acyl-coa esters in the regulation of metabolism and in cell signalling novel analogs of de-mapp and b . part : synthesis and evaluation as potential anticancer agents blocking myristoylation of src inhibits its kinase activity and suppresses prostate cancer progression tris (dibenzylideneacetone) dipalladium, a n-myristoyltransferase- inhibitor, is effective against melanoma growth in vitro and in vivo discovery of asciminib (abl ), an allosteric inhibitor of the tyrosine kinase activity of bcr-abl allosteric inhibitors of bcr-abl: towards novel myristate-pocket binders the allosteric inhibitor abl enables dual targeting of bcr-abl inhibitors of the abl kinase directed at either the atp-or myristate-binding site differential activation of bovine brain n-myristoyltransferase(s) by a cytosolic activator acbd protein controls acyl chain availability and specificity of the n-myristoylation modification of proteins n-myristoyltransferase inhibition induces er-stress, cell cycle arrest, and apoptosis in cancer cells this work was supported by grants from the national natural science foundation of china ( to j.c.), zhejiang provincial natural science foundation (y h to j.c.), and the talent project of zhejiang association for science and technology (no. ycgc to j.c.). my and jc conceived and designed the review article. my, zhs, and jc collected the related research articles contributed to the paper. my, mdy, hz, by, qjh, and jc made amendments to the paper. competing interests: the authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as potential competing interests. key: cord- -iuo cw authors: lippé, roger title: deciphering novel host–herpesvirus interactions by virion proteomics date: - - journal: front microbiol doi: . /fmicb. . sha: doc_id: cord_uid: iuo cw over the years, a vast array of information concerning the interactions of viruses with their hosts has been collected. however, recent advances in proteomics and other system biology techniques suggest these interactions are far more complex than anticipated. one particularly interesting and novel aspect is the analysis of cellular proteins incorporated into mature virions. though sometimes considered purification contaminants in the past, their repeated detection by different laboratories suggests that a number of these proteins are bona fide viral components, some of which likely contribute to the viral life cycles. the present mini review focuses on cellular proteins detected in herpesviruses. it highlights the common cellular functions of these proteins, their potential implications for host–pathogen interactions, discusses technical limitations, the need for complementing methods and probes potential future research avenues. over the last decades, many host-pathogen interactions have been characterized using genetics, biochemical, and microscopy approaches. these discoveries relied on mutants, pharmacological reagents, immunoprecipitations, immunofluorescence, electron microscopy, cell fractionation, and western blotting to name a few of the methods employed. these approaches provided much precious information but, given the typical focus of these approaches on individual molecules, likely only revealed a small portion of the proteins involved. other methods such as high throughput two-hybrid and genetic screens, nucleic acid arrays, rna interference, and proteomics are now proving essential tools to tackle the complexity of these interactions. the main advantages of mass spectrometry, for instance, are that it is a fast, sensitive and potentially a quantitative approach to identify putative novel players, particularly when coupled to efficient purification schemes. already, proteomics revealed how viruses modulate the expression of host proteins (rassmann et al., ; sun et al., ; tong et al., ; antrobus et al., ; pastorino et al., ; thanthrige-don et al., ; zandi et al., ; zhang et al., zhang et al., , coombs et al., ; emmott et al., ; lu et al., lu et al., , munday et al., ; bartel et al., ; lietzen et al., ; ramirez-boo et al., ; chou et al., ) . a relatively new and interesting field is the characterization of host-pathogen interactions within mature purified virions. as reviewed on several occasions, several studies reported the presence of individual cellular proteins in viral particles (bernhard et al., ; maxwell and frappier, ; viswanathan and fruh, ; friedel and haas, ; zheng et al., ) . this includes vaccinia virus (krauss et al., ) , influenza virus (shaw et al., ) , hiv (gurer et al., ; cantin et al., ; ott, ) , vesicular stomatitis virus (moerdyk-schauwecker et al., ) , and several herpesviruses (see below). though these cellular components have often been considered purification contaminants, the presence of similar proteins in both related and unrelated viruses suggests that some of them may be biologically relevant. the identification of virion-associated host proteins could thus lead to the discovery of novel therapeutic tools against viruses. the present review focuses on their identification and putative roles with respect to the proteomics of herpesviruses. thus far, the protein composition of eight different herpesvirions has been studied by mass spectrometry. these studies include the alphaherpesvirinae herpes simplex virus type (hsv- ) and pseudorabies virus (prv; loret et al., ; kramer et al., ) , the betaherpesvirinae human and murine cytomegaloviruses (hcmv and mcmv, respectively; kattenhorn et al., ; varnum et al., ) and the gammaherpesvirinae kaposi sarcoma herpesvirus (kshv), gamma herpesvirus (γhv ), epstein-barr virus (ebv), and alcelaphine (bortz et al., ; johannsen et al., ; bechtel et al., ; zhu et al., ; dry et al., ) . interestingly, host proteins were detected in all herpesvirions analyzed so far, as summarized in table . for instance, our laboratory previously reported the protein composition of mature extracellular hsv- viral particles and identified as many as cellular proteins (loret et al., ) . similarly, studies focusing on prv and ebv reported up to and cellular proteins, respectively (johannsen et al., ; kramer et al., ) . meanwhile, varnum et al. ( ) found as many as different host proteins in extracellular hcmv virions. while fewer cellular proteins were reported for other viral particles, it is clear that herpesviruses can potentially incorporate many proteins from its host. moreover, of the different proteins detected in herpesvirions, nine protein groups are present in at least four distinct herpesvirions. this includes - - , actin, annexins, cofilin, translation factors, gapdh, heat shock proteins, pyruvate kinase m , and various rab gtpases. these results indicate that, first of all, it is common for herpesviruses to incorporate cellular proteins into their viral particles and, secondly, that www.frontiersin.org ? different viruses share similar host proteins. most excitingly, it also suggests that these host proteins may play common roles throughout the herpesviral family. this defines an interesting and novel set of host-pathogen interactions taking place within the virus itself, rather than the cell. it is tempting to speculate that some viruses might have a higher capacity to steal cellular proteins because of their size and symmetry. herpesviruses are indeed large viruses containing a layer called the tegument between their capsids and envelopes that could accommodate non-viral proteins. though some host proteins may randomly be incorporated into virions, others may rather be selected to insure the optimal replication of the viruses that carry them. bioinformatics databases such as the kegg, gene ontology, or david are useful tools to get an overview of the functional interplay of proteins (ashburner et al., ; huang da et al., ; kanehisa et al., ) . as pointed out by friedel and haas ( ) , complex statistical tools are available to quantitatively evaluate the implication of proteins in various processes but these are beyond the scope of the present review. here an analysis of the proteins identified in herpesvirions was instead performed with the ingenuity pathways analysis database (ingenuity ® systems), which contains all the known physical and functional links among cellular proteins and defines their most significant functions. that analysis indicates that many of the cellular proteins found in herpesvirions normally modulate trafficking, cell proliferation, cell death, cell migration, cell metabolism, or the cytoskeleton (figure , upper pie chart). though subtle differences between family members are noticeable when looking at individual viruses, similar functions are found (figure , other charts) . immune-related molecules are also important constituents for several viruses, including hsv- , kshv, γhv , alcelaphine, and mcmv. altogether, this provides an overall picture whereby herpesviruses, not surprisingly, modulate all of the important aspects of the cell but where each virus might deploy its energies slightly differently. the main surprise is that so many cellular proteins are detected within assembled viral particles, which raises an important question as to their biological significance and mode of action. the overall picture that several important cellular functions might be modulated by the host proteins incorporated into viral particles is intriguing. this clever strategy is consistent with the parasitic nature of all viruses, including herpesviruses, which would presumably gain some replication advantage from stealing cellular modulators rather than coding for them in their own genomes. the most critical question is the benefit for the viruses to incorporate these cellular proteins in their assembled particles, particularly since these proteins also exist in the cells. while this is open to discussion, one possibility is that some of the incorporated cellular proteins may be remnants of the final capsid envelopment process. alternatively, this may allow the prompt action of some of these proteins immediately upon viral entry. this could jumpstart the expression and/or duplication of the viral genome, as it is the case for the herpesviral vhs, vp , icp , and icp proteins that are present in virions (lam et al., ; everett, ; halford and schaffer, ; ellison et al., ; hancock et al., ; loret et al., ; sarma et al., ; loret and lippe, ) . other early potential sites of action are the process of viral entry itself, intracellular capsid transport, import of the viral genome through the nuclear pore or immune modulation, all common steps among herpesviruses. whatever the case might be, the question remains as to why the cellular pool of these proteins would not suffice. several options may be considered. first, it may be that the virions incorporate specific isoforms, splice variants or post-translationally modified proteins that could have properties or functions distinct than their cellular counterparts. second, the incorporation of a host protein from one cell type might permit the infection of a different cell type that does not express such protein. for example, alpha herpesviruses initially infect mucosal cells and could acquire host proteins that are beneficial to infect dormant neuronal cells. finally, the host proteins might be in complex with viral proteins and it is those complexes that are active to promote the infection. these possibilities are of course speculative at this point and need to be explored. one aspect where the incorporation of host proteins in mature virions might be beneficial is molecules involved in intracellular trafficking. work by numerous laboratories demonstrated that the transport machinery used to move cellular proteins is also employed by viruses (simons and warren, ; lodish et al., ; sollner, ; greber and way, ; mercer et al., ) . this is essential for their proteins and particles to reach their final destination, for example, the site of viral replication, assembly, and/or envelopment. along with snares proteins, rab and arf gtpases are master regulators of molecular trafficking throughout the cell (sollner and rothman, ; zerial and mcbride, ; mizuno-yamasaki et al., ) . so far,vamp , a snare, was identified in prv virions (kramer et al., ) but it may only be a matter of time until other snares are discovered in other members of the herpes family. this is relevant as another snare was reported to facilitate the envelopment of mcmv capsids (cepeda and fraile-ramos, ) . in contrast, a great number of rab proteins have been identified in herpesvirions, particularly hsv- and prv (table ) . one stimulating option is that these proteins regulate the displacement of viral capsids in the cell, which could justify their incorporation in the viral particles. as rab and arf proteins collectively modulate several intracellular transport steps within the cell, it is anticipated they may be involved in various stages of the infection. for instance, rab , which is present in hsv- extracellular virions (loret et al., ) , and rab were recently demonstrated to modulate the final envelopment of the virus (zenner et al., ) . similarly, rab , found in hsv- and prv (loret et al., ; kramer et al., ) , is also necessary for the efficient assembly of the related hcmv (indran and britt, ) . it will now be of interest to determine if the virion-associated pool of these gtpases actively participates in the viral life cycle. interestingly, several rab proteins have been implicated in autophagosome formation and maturation (chua et al., ) . while it is difficult to consider how virion-incorporated rab proteins play a role at that stage, they might rather be incorporated into the virions as mcmv figure | the proteins from table were analyzed with the ingenuity database to define their putative functions in the context of an infection. to this end, the protein accession numbers (or gi numbers) were queried from the ingenuity database. for the purpose of this figure, all known functions associated with these proteins were exported to microsoft excel and regrouped. in the top pie chart, the cellular proteins found in all the herpesvirions were analyzed collectively, while the other pie charts depict the host proteins incorporated into each virus. since each protein can be associated with multiples functions in the database, the results of those analyses are expressed as relative values instead of raw numbers, which consequently exceeds the original number of proteins analyzed. the percentages therefore represent the number of proteins falling into a given category with the total of each pie chart being %. a graphical legend of the categories is provided at the bottom right corner of the figure. a consequence of their involvement in autophagosome formation and concomitant viral envelopment. given the vast impact of rab proteins on the cell, it will be a major challenge to decipher all their roles in the life cycle of herpesviruses, particularly for the pool present in mature virions. molecular trafficking is not only dependent on snares, rab, and arf proteins, it is also intimately linked to the cytoskeleton. it is thus not surprising that herpesviruses devote some of their resources toward regulating this central cellular machinery. for instance, herpesviruses significantly reorganize both cellular and nuclear actin as well as microtubules (norrild et al., ; avitabile et al., ; sharma-walia et al., ; simpson-holley et al., ; de regge et al., ; saksena et al., ) . they also travel along microtubules during both entry and egress and interact with several cellular molecular motors (sodeik et al., ; smith et al., ; dohner et al., ; marozin et al., ; lee et al., ; wolfstein et al., ; radtke et al., ) as well as cortical and nuclear actin filaments (forest et al., ; feierbach et al., ; roberts and baines, ) . furthermore, some members incorporate in their viral particles tubulin or actin-related components ( table ; wong and chen, ; grunewald et al., ) . actin has been reported to compensate the loss of various viral tegument proteins in prv (del rio et al., ; michael et al., ) and may thus act as an abundant filling agent, so its significance in herpesviral particles remains enigmatic. similarly, the relevance of intermediate filament components vimentin and keratins in some herpes virions ( table ) is difficult to assess given these filaments are not as well characterized as other cytoskeletal elements. it may nevertheless be important for herpesviruses, particularly since they are not all associated with the common skin or hair contaminants often detected in mass spectrometry (hertel, ) . viruses tend to monopolize for their own purpose their host expression apparatus, including protein translation (bushell and sarnow, ) . for example, the prototypic hsv- icp viral protein regulates all aspect of mrnas including transcription, splicing, nuclear export, and translation for the benefit of the virus (rice and knipe, ; sekulovich et al., ; sandri-goldin and mendoza, ; smith et al., ; hardwicke and sandri-goldin, ; hardy and sandri-goldin, ; brown et al., ; soliman et al., ; chen et al., ; lindberg and kreivi, ; ellison et al., ; larralde et al., ; fontaine-rodriguez and knipe, ) . as these cellular functions are highly regulated, the inclusion of ddx x, a multifunctional rna helicase that also regulates transcription, nuclear export, and translation that is used by several viruses (schroder, (schroder, , ) may be relevant. its incorporation into mature virions could thus accelerate viral gene expression in the early stages of the infection. similarly, the presence of translation initiation or elongation factors in virions (table ) may also jumpstart gene expression in favor of the viruses. interestingly, hsv- does not require cells to be in the s-phase and even arrests the cell cycle at the g /s transition step (shadan et al., ; song et al., ) , which partly explains why it can grow in non-dividing neurons. while the precise mechanism of this arrest is unclear, it is known that the viral icp protein and the vp cellular partner hcf modulate the cell cycle (hobbs and deluca, ; lomonte and everett, ; piluso et al., ) . moreover, icp interacts with the host cyclin d (kawaguchi et al., ) . however, it was recently reported that stress, rather than the cell cycle per se, may be a critical feature (bringhurst and schaffer, ) . clearly, the interaction of herpesviruses with the cell proliferation apparatus is complex and likely involves several host and viral proteins. identifying novel players that might be incorporated into mature virions may thus be very useful to clarify this process. an interesting scenario is the possible regulation of apoptosis by host proteins loaded onto viral particles. apoptosis is regulated both negatively and positively by several viruses (teodoro and branton, ; goodkin et al., ) , presumably to insure their survival at the early stages of the infection but their efficient release later on. conceptually, the presence of anti-apoptotic proteins in herpes particles might thus provide a mean to quickly evade death upon entry, while the presence of pro-apoptotic proteins on newly assembled/enveloped viral particles may trigger or stimulate their extracellular release. only further work will resolve this open question. several factors generally contribute to variation among proteomic studies. hence, the preparation of the samples (e.g., in-gel trypsin digestion versus liquid digestion and chromatography) may lead to the detection of different populations of tryptic peptides. moreover, the sensitivity of the mass spectrometers and the abundance of the proteins in the samples also impact peptide detection. the relative abundance of a peptide is itself influenced by the complexity of the samples, where some proteins may evade identification. finally, each protein differs in its properties (ionization, resolution in sds-page gels), which will be reflected in their detection. this includes snares, which are transmembrane proteins resistant to sds extraction (yang et al., ; kubista et al., ) . it is thus likely that some of the proteins in table are present in more viral particles than reported and that additional proteins are indeed incorporated in herpesvirions. more specific aspects regarding herpesviruses includes the purification schemes employed to enrich the viral particles, which will directly influence the purity of the samples and hence the potential detection of contaminants. one important caveat is that some host proteins may simply stick to the large viral particles. another one is common contaminants such as some hair/skin associated keratins or as mentioned above actin, which may simply fill the virions. however, even potential contaminants cannot simply be discarded since actin and even some keratins may indeed participate in viral life cycles. moreover, the relative abundance of all the cellular proteins within the cell is unknown, so it is not possible to rule out potential contaminants on the sole basis of abundance. it is thus critical to orthogonally validate all proteomics hits. various tools are available to define the biological relevance of host proteins identified in viral particles, including western blotting, immuno-electron microscopy or functional screens. one powerful method is rna interference. however, given the dual presence of the host proteins within the viral particles and the cell itself, this becomes a challenging task. rna interference also has its own caveats (false positives and negatives). another common step is the expression of dominant positive or negative mutants. in all cases, one major difficulty is that the host proteins may be essential for the cells and their depletion may lead to cytotoxicity, thus proper controls are needed. in addition, the host proteins might be essential for the virus within the cells but only accessory within the virions. consequently, depletion of a protein may have limited impact on the virus since complemented by the other pool of that protein in the virus or the cell. small reduction or stimulation in viral yields may thus result. it such cases, it may be necessary to produce the virus on cells that lack these proteins to see if this makes a difference. one should also consider animal models since tissue culture based screens may miss important players, for instance modulators of the immune system or virulence factors. clearly, multiple experimental strategies are needed to ultimately insure the biological significance of the host proteins found in viral particles. the identification and functions of host proteins in viral particles is an important step toward the elucidation of novel host-pathogen interactions. in the case of herpesvirions, this is well under way with eight different family members analyzed so far. one main aspect is to sort biologically relevant cellular proteins from sticky contaminants. the orthogonal validation of the host proteins found in herpesvirions using biologically relevant assays is thus critical. as pointed out above, it will necessary to analyze all these proteins in the background of two pools, one cellular and one virion-associated, which are likely to complement one another. an interesting possibility is that some isoforms or specific posttranslationally modified host proteins may be loaded into the capsids. thus a detailed analysis of the host proteins present in viral particles will be important and a potential way to distinguish them from their cell-associated counterparts. another issue is the expected variation among cell types. in that respect, it would be most interesting to examine the cellular protein content of hsv- produced in neurons in opposition to the virions produced on other cell types. finally, the mechanisms by which all these host proteins are recruited to the viral particles will also need to be explored. thus the proteomics of viral particles is only the beginning of the adventure, which should prove most exciting yet challenging. i am indebted to the canadian institutes of health research (grant # mop ) for funding our proteomics research. i also wish to thank kerstin radtke for excellent suggestions and daniel henaff for critical reading of the manuscript. proteomic analysis of cells in the early stages of herpes simplex virus type- infection reveals widespread changes in the host cell proteome gene ontology: tool for the unification of biology redistribution of microtubules and golgi apparatus in herpes simplex virus-infected cells and their role in viral exocytosis proteome analysis of vaccinia virus ihd-w-infected hek cells with -dimensional gel electrophoresis and maldi-psd-tof ms of on solid phase support n-terminally sulfonated peptides host and viral proteins in the virion of kaposi's sarcomaassociated herpesvirus new insights into viral structure and virus-cell interactions through proteomics identification of proteins associated with murine gammaherpesvirus virions cellular stress rather than stage of the cell cycle enhances the replication and plating efficiencies of herpes simplex virus type icp -viruses herpes simplex virus trans-regulatory protein icp stabilizes and binds to ends of labile mrna hijacking the translation apparatus by rna viruses plunder and stowaways: incorporation of cellular proteins by enveloped viruses a role for the snare protein syntaxin in human cytomegalovirus morphogenesis icp interacts with the rna export factor aly/ref to direct herpes simplex virus type intronless mrnas to the tap export pathway an overview of the vaccinia virus infectome: a survey of the proteins of the poxvirus-infected cell involvement of members of the rab family and related small gtpases in autophagosome formation and maturation quantitative proteomic analyses of influenza virus-infected cultured human lung cells alpha-herpesvirus glycoprotein d interaction with sensory neurons triggers formation of varicosities that serve as virus exit sites actin is a component of the compensation mechanism in pseudorabies virus virions lacking the major tegument protein vp function of dynein and dynactin in herpes simplex virus capsid transport proteomic analysis of pathogenic and attenuated alcelaphine herpesvirus control of vp translation by the herpes simplex virus type immediateearly protein icp quantitative proteomics using stable isotope labeling with amino acids in cell culture reveals changes in the cytoplasmic, nuclear, and nucleolar proteomes in vero cells infected with the coronavirus infectious bronchitis virus icp , a regulator of herpes simplex virus during lytic and latent infection alpha-herpesvirus infection induces the formation of nuclear actin filaments herpes simplex virus icp increases translation of a subset of viral late mrnas active intranuclear movement of herpesvirus capsids virus-host interactomes and global models of virus-infected cells herpes simplex virus infection and apoptosis a superhighway to virus infection three-dimensional structure of herpes simplex virus from cryoelectron tomography specific incorporation of heat shock protein family members into primate lentiviral virions icp is required for efficient reactivation of herpes simplex virus type from neuronal latency herpes simplex virus regulatory proteins vp and icp counteract an innate intranuclear barrier to viral gene expression the herpes simplex virus regulatory protein icp contributes to the decrease in cellular mrna levels during infection herpes simplex virus inhibits host cell splicing, and regulatory protein icp is required for this effect herpesviruses and intermediate filaments: close encounters with the third type perturbation of cell cycle progression and cellular gene expression as a function of herpes simplex virus icp systematic and integrative analysis of large gene lists using david bioinformatics resources a role for the small gtpase rab in assembly of human cytomegalovirus proteins of purified epstein-barr virus kegg for representation and analysis of molecular networks involving diseases and drugs identification of proteins associated with murine cytomegalovirus virions herpes simplex virus alpha regulatory protein icp interacts with and stabilizes the cell cycle regulator cyclin d proteomic characterization of pseudorabies virus extracellular virions an investigation of incorporation of cellular antigens into vaccinia virus particles evidence for structural and functional diversity among sds-resistant snare complexes in neuroendocrine cells herpes simplex virus vp rescues viral mrna from destruction by the virion host shutoff function direct stimulation of translation by the multifunctional herpesvirus icp protein reconstitution of herpes simplex virus microtubule-dependent trafficking in vitro quantitative subcellular proteome and secretome profiling of influenza a virus-infected human primary macrophages splicing inhibition at the level of spliceosome assembly in the presence of herpes simplex virus protein icp viruses: structure, function, and uses herpes simplex virus type immediate-early protein vmw inhibits progression of cells through mitosis and from g( ) into s phase of the cell cycle comprehensive characterization of extracellular herpes simplex virus type virions biochemical analysis of icp , icp , ul and ul incorporated into extracellular herpes simplex type virions two-dimensional liquid chromatography-tandem mass spectrometry coupled with isobaric tags for relative and absolute quantification (itraq) labeling approach revealed first proteome profiles of pulmonary alveolar macrophages infected with porcine reproductive and respiratory syndrome virus proteomic analysis of the host response in the bursa of fabricius of chickens infected with marek's disease virus herpes simplex virus type infection of polarized epithelial cells requires microtubules and access to receptors present at cell-cell contact sites virus entry by endocytosis composition of pseudorabies virus particles lacking tegument protein us , ul , or ul or envelope glycoprotein e gtpase networks in membrane traffic analysis of virion associated host proteins in vesicular stomatitis virus using a proteomics approach quantitative proteomic analysis of a cells infected with human respiratory syncytial virus subgroup b using silac coupled to lc-ms/ms organization of cytoskeleton elements during herpes simplex virus type infection of human fibroblasts: an immunofluorescence study cellular proteins detected in hiv- isolation and preliminary characterization of herpes simplex virus primary enveloped virions from the perinuclear space host cell factor- interacts with and antagonizes transactivation by the cell cycle regulatory factor miz- plus-and minus-end directed microtubule motors bind simultaneously to herpes simplex virus capsids using different inner tegument structures quantitative proteomics by -de, o/ o labelling and linear ion trap mass spectrometry analysis of lymph nodes from piglets inoculated by porcine circovirus type proteome alterations in human host cells infected with coxsackievirus b gene-specific transactivation by herpes simplex virus type alpha protein icp actin in herpesvirus infection herpes simplex virus type accumulation, envelopment, and exit in growth cones and varicosities in mid-distal regions of axons a herpesvirus regulatory protein appears to act posttranscriptionally by affecting mrna processing small interfering rnas that deplete the cellular translation factor eif h impede mrna degradation by the virion host shutoff protein of herpes simplex virus human deadbox protein has multiple functions in gene regulation and cell cycle control and is a prime target for viral manipulation viruses and the human dead-box helicase ddx : inhibition or exploitation? the herpes simplex virus type alpha protein icp can act as a trans-repressor or a trans-activator in combination with icp and icp n-butyrate, a cell cycle blocker, inhibits the replication of polyomaviruses and papillomaviruses but not that of adenoviruses and herpesviruses kaposi's sarcomaassociated herpesvirus/human herpesvirus envelope glycoprotein gb induces the integrin-dependent focal adhesion kinase-srcphosphatidylinositol -kinase-rho gtpase signal pathways and cytoskeletal rearrangements cellular proteins in influenza virus particles semliki forest virus: a probe for membrane traffic in the animal cell identification and functional evaluation of cellular and viral factors involved in the alteration of nuclear architecture during herpes simplex virus infection herpesviruses use bidirectional fast-axonal transport to spread in sensory neurons evidence that the herpes simplex virus immediate early protein icp acts post-transcriptionally during infection to regulate gene expression microtubulemediated transport of incoming herpes simplex virus capsids to the nucleus shuttling of the herpes simplex virus type regulatory protein icp between the nucleus and cytoplasm mediates the expression of late proteins intracellular and viral membrane fusion: a uniting mechanism molecular machinery mediating vesicle budding, docking and fusion herpes simplex virus infection blocks events in the g phase of the cell cycle proteomic alteration of pk- cells after infection by classical swine fever virus regulation of apoptosis by viral gene products analyses of the spleen proteome of chickens infected with marek's disease virus proteomic analysis of cellular protein alterations using a hepatitis b virus-producing cellular model identification of proteins in human cytomegalovirus (hcmv) particles: the hcmv proteome viral proteomics: global evaluation of viruses and their interaction with the host the inner tegument promotes herpes simplex virus capsid motility along microtubules in vitro evidence for the internal location of actin in the pseudorabies virion snare interactions are not selective. implications for membrane fusion specificity proteomics analysis of bhk- cells infected with a fixed strain of rabies virus analysis of rab gtpase-activating proteins indicates that rab a/b and rab are important for herpes simplex virus secondary envelopment rab proteins as membrane organizers proteomic analysis of pbmcs: characterization of potential hiv-associated proteins differential proteome analysis of host cells infected with porcine circovirus type mass spectrometry based proteomic studies on viruses and hosts -a review virion proteins of kaposi's sarcoma-associated herpesvirus the author declares that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest. key: cord- - bw jbzq authors: borlak, jürgen; singh, prashant; gazzana, giuseppe title: proteome mapping of epidermal growth factor induced hepatocellular carcinomas identifies novel cell metabolism targets and mitogen activated protein kinase signalling events date: - - journal: bmc genomics doi: . /s - - -z sha: doc_id: cord_uid: bw jbzq background: hepatocellular carcinoma (hcc) is on the rise and the sixth most common cancer worldwide. to combat hcc effectively research is directed towards its early detection and the development of targeted therapies. given the fact that epidermal growth factor (egf) is an important mitogen for hepatocytes we searched for disease regulated proteins to improve an understanding of the molecular pathogenesis of egf induced hcc. disease regulated proteins were studied by de maldi-tof/tof and a transcriptomic approach, by immunohistochemistry and advanced bioinformatics. results: mapping of egf induced liver cancer in a transgenic mouse model identified n = (p < . ) significantly regulated proteins of which n = were tumour-specific. to unravel molecular circuits linked to aberrant egfr signalling diverse computational approaches were employed and this defined n = key nodes using n = disease regulated proteins for network construction. string analysis revealed protein-protein interactions of > % disease regulated proteins with individual proteins being validated by immunohistochemistry. the disease regulated network proteins were mapped to distinct pathways and bioinformatics provided novel insight into molecular circuits associated with significant changes in either glycolysis and gluconeogenesis, argine and proline metabolism, protein processing in endoplasmic reticulum, hif- and mapk signalling, lipoprotein metabolism, platelet activation and hemostatic control as a result of aberrant egf signalling. the biological significance of the findings was corroborated with gene expression data derived from tumour tissues to evntually define a rationale by which tumours embark on intriguing changes in metabolism that is of utility for an understanding of tumour growth. moreover, among the egf tumour specific proteins n = were likewise uniquely expressed in human hcc and for n = proteins regulation in human hcc was confirmed using the publically available human protein atlas depository, therefore demonstrating clinical significance. conclusion: novel insight into the molecular pathogenesis of egf induced liver cancer was obtained and among the newly identified proteins several are likely candidates for the development of molecularly targeted therapies and include the nucleoside diphosphate kinase a, bifunctional atp-dependent dihydroyacetone kinase and phosphatidylethanolamine-binding protein , the latter being an inhibitor of the raf- kinase. electronic supplementary material: the online version of this article (doi: . /s - - -z) contains supplementary material, which is available to authorized users. liver malignancies are common cancers worldwide and are responsible for approximately one million deaths each year with most hcc patients having poor prognosis as a result of rapid disease progression. the relative -year survival rate is about % and can be attributed to an advanced stage of disease at the time of diagnosis, the occurrence of cirrhosis and of other co-morbidites [ ] . detection of early stages of disease is essential for an improved prognosis and overall survival. however, apart from alpha-fetoprotein (afp) only a few serological markers are available in clinical practice (such as glypican- , mir- , fucosylated gp , α-fucosidase) with afp diagnostics remaining unsatisfactory because of its low sensitivity and the non-specific correlation between the clinical behavior of hcc and afp blood levels. for this reason, new biomarkers are in strong demand [ , ] and more selective markers, such as soluble interleukin- receptor levels, are evaluated [ ] . importantly, research into the molecular pathogenesis of hcc identified several signalling pathways as deregulated. this inspired the development of molecularly targeted therapies such as the multikinase inhibitor sorafenib that inhibits signalling of c-raf- , mek, erk, vegfr, pdgfr and other kinases, effectively [ ] . given that epidermal growth factor is an important mitogen for hepatocytes we were particularly interested in an understanding of the consequences of its targeted overexpression in liver. in our initial study we reported the oncogenomics and pathology of egf induced hepatocarcinogenesis and an identification of molecular circuitries linked to exaggerated egfr signalling [ ] . furthermore, we investigated the serum proteome of egf-tumourbearing mice to obtain information on disease-regulated proteins and to search for novel biomarkers at different stages of disease [ ] . note, a regulatory loop was proposed whereby egf induces transcriptional activation of hdac by ck Α/akt activation in liver cancer cells [ ] . additionally, inhibition of egfr in different animal models by erlotinib was shown to attenuate liver fibrosis and the development of hepatocellular carcinoma [ ] thus suggesting new therapeutic intervention strategies in the prevention of hcc. indeed, the egf receptor tyrosin kinase plays a much wider role in the immortalization of different cell types as originally anticipated [ ] , and is highly expressed in a number of solid tumours and egfr over-expression correlates well with tumour progression, resistance to chemotherapy and poor prognosis. the present study aimed at an identification of disease regulated proteins to facilitate an improved understanding of its complex signalling networks and to search for cross-talk amongst other pathways while an identification of disease regulated proteins would aid the development of molecularly targeted therapies. for this purpose, a two-dimensional electrophoresis and maldi-tof/tof ms strategy was employed to identify disease regulated proteins in an egf transgenic mouse model of hcc. this resulted in an identification of statistically significant regulated proteins of which are uniquely expressed in liver cancer. importantly, out of mouse tumour specific proteins were likewise uniquely expressed in human hcc and disease regulated proteins identified in egf induced liver cancer were similarly regulated in human hcc, as determined by immunhistochemistry using different antibodies and the information given in the publically available human protein atlas depository. clinical significance of the identified proteins could be demonstrated and a total of so far unkown proteins could now be related to egf induced liver cancer, several of which are likely candidates for the development of molecularly targeted therapies. this includes the nucleoside diphosphate kinase a, bifunctional atp-dependent dihydroxyacetone kinase, phosphatidylethanolamine binding protein , i.e. an inhibitor of the raf- kinase as well as aldo-keto reductase family proteins, members c and c , interleukin and the v-crk sarcoma virus ct oncogene homolog. finally, to gain insight into the molecular circuitries of egfr induced hepatocarcinogenesis diverse computational approaches were employed. this revealed master regulatory proteins and permitted network constructions of disease regulated proteins with protein-protein interactions being confirmed for > % of regulated proteins in string analysis. their regulations were also studied by immunohistochemistry in egf transgenic hcc. we also compared the serum and liver proteomes of hcc bearing mice and found proteins to be similarly regulated, thus evidencing leakage of tumour proteins that can be detected in serum. obviously, these are highly interesting biomarker candidates, of which were also regulated in human hcc as determined by immunohistochemistry. all animal work followed strictly the public health service (phs) policy on humane care and use of laboratory animals of the national institutes of health, usa. formal approval to carry out animal studies was granted by the animal welfare ethics committee of the state of lower saxony, germany ('lower saxony state office for consumer protection and food safety' (laves)). the approval id is az: . - - - / . a total of n = c /bl non-transgenic and n = egf transgenic mice (aged - months), weighing - g, were housed in makrolon® type iii cages. water and food (v - , ssniff, the netherlands) was given ad libitum. the temperature and relative humidity was set to ± °c and - %, respectively and a -h day and night cycle was used. tris, urea, thiourea, chaps, dithiothreitol, bromophenol blue, glycerin, sodium dodecyl sulfate, glycine, temed, ammoniumperoxodisulfate, ammonium sulfate, ammonium bicarbonate, colloidal coomassie blue, and acrylamide were purchased from roth (karlsruhe, germany). iodacetamide was obtained from serva (heidelberg, germany) and benzonase was purchased from novagen (darmstadt, germany). ampholytes (biolyte - ) were purchased from bio-rad laboratories (münchen, germany) and destreak was obtained from amersham bioscience (freiburg, germany). mice were anesthetized with ketamine % μl/ g and xylazine % μl/ g, and after surgical removal the liver was perfused and rinsed with ice cold ringer solution until free of blood. approximately . g of the liver sample was ground in a mortar under liquid nitrogen flow. then, the samples were processed with . ml of a buffer containing mm tris base, m urea, % chaps, mm dtt, and . % (v/v) biolyte - first (lb ). the suspensions were homogenized by sonication ( × s) and after addition of μl of benzonase (endonuclease that degrades dna and rna) were left at room temperature for min. the samples were then centrifuged at , g for min. the pellets were washed and sonicated for min with a further . ml of lb and centrifuged at , g for another min, and the resulting two fractions of supernatant were collected (extract a). finally, the pellets were redissolved with . ml of buffer containing mm tris base, m urea, m thiourea, % chaps, mm dtt, and . % (v/v) biolyte - (lb ), sonicated, and centrifuged at , g for min. the pellet was collected, and the supernatant was marked as extract b. from the same animals, a further . -g portion was ground in a mortar, but was now treated with . ml of lb . the suspensions were sonicated, incubated with benzonase, and centrifuged. the pellets were then washed with another . ml of lb , sonicated and centrifuged, and the supernatants were collected (extract c). proteome mapping was done under a variety of conditions, e.g. extraction with lysis buffers and . in addition, proteins were separated at two different ph ranges [ ] [ ] [ ] [ ] [ ] [ ] . a total of independent experiments were carried out, and duplicate measurements were run for each experiment. the protein concentration of all extracts was determined using the bradford assay. liquid-phase ief pre-fractionation was performed in the rotofor cell system (bio-rad) following the supplier's instructions. ion exchange membranes were equilibrated overnight in the appropriate electrolyte (anion exchange membranes in naoh . m and cation exchange membranes in h po . m). after four runs ion exchange membranes were always discarded and new membranes were replaced for the other samples. for each run, the electrode chambers were filled with appropriate fresh electrolytes ( ml) . initially, the cell was filled with pure water and run for min at watts constant power to remove residual ionic contaminants from the membrane core and ion exchange membranes. approximately ml of lb were used to fill the cell. a total of mg of total proteins in approximately ml of lb were added to the cell to reach the maximum loadable volume ( ml) . focusing started at watts constant power. after approximately hours the voltage increased to v and the wattage decreased to w. the focused proteins were harvested in ~ . ml fractions, and ph values were checked. fractions having ph values between and . were collected and denoted "a-a" (acid). fractions having ph values > . were collected and denoted "a-b" (basic). again, the protein concentration was determined for both fractions (a-a and a-b) by the bradford method. approximately mg of protein were recovered at the end of the liquid-phase ief pre-fractionation from an initial -mg load. the losses are accounted for by the multi-step pre-fractionation procedure, but are not the result of a precipitate that could not be dissolved in our lysis buffer. after each run the membrane core was cleaned with naoh . m overnight and sonicated for min in water before the new focusing. isoelectric focusing (ief) -first dimension ief was performed using precast linear ipg strips. the -cm ipg strips - and - were loaded with . mg of proteins by active rehydration ( h, v) . samples destined to be separated by ipg strips - received an excess of hydroxyethyldisulphide (hed) (destreak™) prior to the focusing run. focusing began at v for min in rapid mode, , v for h in linear mode and , v for , vh in rapid mode (for the ipg strips - ). ief for the strips - was carried out at v for min in rapid mode, , v for h in linear mode and , v for , vh in rapid mode. each sample was analysed in duplicate. control and hcc samples were run always at the same time ( control and hcc samples). after ief, the ipg strips were either stored at − °c or transferred to ml of equilibration buffer ( m urea, % w/v glycerin, % w/v sds, mm tris-hcl ph . ) with % w/v dtt and . % v/v bromophenol blue solution ( . % w/v bromophenol blue, . m tris-hcl ph . , . % w/v sds) and incubated for min at room temperature. strips were removed and incubated in equilibration buffer with % w/v iodoacetamide and . % v/v bromophenol blue solution for further min at room temperature. finally, the strips and μl sds-page molecular weight standard on filter paper were placed on top of the cm x . cm % second-dimension gel ( % v/ v acrylamide/bis solution, mm tris, ph . , . % v/v sds, / temed, . % v/v aps). both were fixed in place with a . % w/v agarose overlay. gels were run in protean plus dodeca cell (bio-rad) at v for approximately h, followed by v until the bromophenol blue dye reached the bottom of the gel. the running buffer ( mm tris, . m glycin, . % sds) was cooled externally to °c. gels/proteins were fixed overnight in % ethanol, % phosphoric acid, and washed x min with % phosphoric acid. the gels were equilibrated with % ammoniumsulfate, % ethanol, % phosphoric acid for min and finally stained with colloidal coomassie blue for h. after staining, gels were washed min with pure water and scanned on a molecular fx scanner (bio-rad) at μm resolution. protein spots were imaged first automatically and then manually and analysed using the pdquest™ software (bio-rad). the normalization was carried out in total density in gel mode according to the manufacturer's recommendation. gels were excised using the spot cutter of bio-rad and placed into -well microtiter plates. excised gel spots were washed manually with μl of water for min and destained twice, first with μl ammonium bicarbonate mm for min and then with μl % ammonium bicarbonate mm - % acetonitrile for min. finally, the gel particles were covered by acetonitrile until gel pieces shrunk and left to dry for min. all gels/proteins were digested manually in situ with μl of ammonium bicarbonate mm containing ng trypsin (sequencing grade modified trypsin, promega, germany). after min each gel piece was re-swelled with μl of ammonium bicarbonate mm and incubated for h at °c. after h the reaction was stopped by adding μl of trifluoroacetic acid % containing . % (w/v) n-octyl-beta-d-glucopyranoside (ogp) (applichem). for the application of the samples, μl of peptide solution were loaded onto an mtp anchor chip target / (bruker daltonics) previously prepared with a saturated solution of matrix, alpha-cyano- -hydroxy-cinnamic acid (alpha-hcca) (bruker daltonics, bremen, germany). maldi-ms was performed on an ultraflex ii maldi-tof/tof (bruker daltonics) mass spectrometer equipped with a smartbeam™ laser and a lift-ms/ms facility. the instrument was operated in positive ion reflectron mode and an acceleration voltage of kev for the peptide mass fingerprint (pmf) mode. typically, spectra, acquired at hz, were summed and externally calibrated. in the case of ms/ms-cid the lift device was used for selection and fragmentation of the ions; the acceleration voltage in the ion source kv, the timed ion selector was set to . % (relative to parent mass), and argon was used as collision gas (~ - × - mbar). resulting fragments were further accelerated in a second source by kv and analysed by a two-stage gridless reflectron. typically, shots were accumulated for the parent ion signal and shots for the fragments. flexcontrol™ . , and flexanalysis™ . were used as instrument control and processing software (bruker daltonics, bremen, germany). a calibration standard was used for the external calibration of spectra (peptide calibration standard for mass spectrometry, which covered the mass range~ - da internal calibration was achieved using trypsin autolysis products (m/z's . , . and . ) resulting in a mass accuracy of ≤ ppm. spectra were collected by the flexcontrol software without smoothing or baseline subtraction and a peak resolution higher than or a.u. in case of dhb and chca matrixsample preparation, respectively. the spectra were sent to the flexanalysis software which labeled the peaks for protein identification by proteinscape . or biotools . (bruker daltonics). trypsin autolysis products, tryptic peptides of human keratin and matrix ions were automatically discarded by proteinscape (mass control list). proteinscape score booster feature was used to improve database search results by automatic iterative recalibrations and background eliminations. protein scores greater than were considered significant (p < . , mascot) and an annotation as mouse protein as the top candidates was requested in the search when no restriction was applied to the species of origin. identified proteins were checked individually for further considerations. for pmf peak picking the snap peak detection algorithm, a signal to noise threshold of , maximal number of peaks , a quality factor threshold and baseline subtraction tophat was applied. peptide masses were searched against the swiss-prot database (download - sequences, residues) employing the mascot server (in-house mascot-server, matrix sciences ltd., http://www.matrixscience.com/, revision . . ), taking into account carbamidomethyl of cysteines -carbamidomethyl (c)-as fixed modification and possible oxidation of methionine -oxidation (m)-as a variable modification but allowing one missed cleavage. based on initial data, ion precursors were selected by proteinscape for tandem ms data acquisition (by lift-tof/tof, bruker daltonics, bremen, germany). in the mascot ms/ms ions search, the restriction mammalia was applied with peptide tolerance of ( ppm and ms/ms tolerance of ( . da (fixed and variable modifications as pmf). the acceptance criteria for pmf-based identification were an individual ions score > , at least five matching peptides and % peptide coverage of the theoretical sequences. livers, dissected from egf-overexpressing mice aged between - months, were fixed in % buffered paraformaldehyd and embedded in paraffin. μm thick sections were deparaffinized and rehydrated through a descending alcohol series followed by a min washing step in destilled h o. subsequently, antigen retrieval was performed in citrate buffer (ph ) by autoclaving the sections min at °c. the envision kit (dako, hamburg, germany) was used for immunohistochemistry. the slides were rinsed with destilled h o and after a min incubation step in tris-buffered saline (washing buffer), endogenous peroxidase activity was blocked with dako peroxidase blocking reagent for min followed by a second washing step. thereafter, the sections were blocked for min with protein-block serum free (dako) and incubated with primary antibodies for min. details of antibody dilutions with washing buffer are given in additional file : table s . in the case of goat primary antibodies a rabbit-anti-goat bridging antibody (dako) was employed. specifically, the bound primary antibodies or bridging antibodies were detected by use of labelled polymer hrp anti-rabbit secondary antibody (envision kit; dako) and the immunoreactivity was visualized by dako liquid dab substrate chromogen system in a min incubation. finally, the sections were counterstained with harris haematoxylin for min, dehydrated in an ascending alcohol series, coverslipped and examined under a light microscope (leica, jülich, germany). a total of n = disease regulated proteins were filtered for statistical significance at p < . (table ). this yielded n = statistically significantly regulated proteins two of which had identical accession number, i.e. aah = atp synthase h+ transporting mitochondrial f complex, subunit d and bac = apoa but differed in their spot ids as a result of posttranslational modifications. the statistically significantly regulated proteins were grouped into four different categories to yield tumour specific (to), up-regulated (ur), down-regulated (dr) and proteins only expressed in healthy non-transgenic control livers (co). categorization of tumour regulated proteins based on ontology terms non-redundant tumour proteins covering to, ur and dr categories were considered and analysed for ontologies using the genexplain software (v. . . ), the biological pathways tools reactome (http://www.reactome.org) and kegg (http://www.genome.jp/kegg) and wikipathways (http://wikipathways.org). the tumour regulated proteins (to + ur + dr) were subjected to functional classification based on ontology terms and a p-value of < . was considered to be significant. moreover, disease regulated proteins were analysed with the cytoscape software version . . using the function go-tree levels and number or % of proteins for a given term (see additional file : table s ). master regulatory proteins were searched based on the designated workflow of the genexplain software. it is designed to find master regulatory molecules upstream of an input list of regulated tumour proteins. after annotation of the input datasets the tool for master regulator finding over geneways network (http://www.genexplain.com) was applied. specifically, the geneways software is used to automatically extract, analyse, visualise and integrate molecular pathway data from the published peer reviewed literature. it is based on document sorting, term identification, term meaning disambiguation, information extraction, ontology, visualization and system integration [ ] . the following filtering threshold was used, i.e. score cutoff ( . ), search collection (geneways hub), maximum radius [ , ] , fdr cutoff ( . ), z-score cutoff ( . ), penalty ( . ) and decay factor ( . ) (additional file : table s ). protein network for disease regulated proteins were also constructed using the string software (http:// string-db.org/). the underlying database informs on known and predicted protein-protein interaction and the constructed networks are based on active prediction methods of neighborhood, gene fusion, co-occurrence, co-expression, databases and textmining. eventually, confidence scores were calculated for each interaction pair and only those above default cutoff scores ( . ) were selected. finally, mapping of pathways information from reac-tome, kegg and wikipathways have been implemented the proteins are sorted in alphabetical order, and the ncbi annotation is given in the accession number column. molecular weight, pi, and mascot scores are also given. the column "gels", "c" (c = control) and "t" (t = tumour) indicate the frequency of positive identification of proteins in a total of independent gels, whereas "lb " and "lb " (lb = lysis buffer) refers to the different lysis buffers employed. furthermore, references are given for those proteins which have already been described as hcc-associated whereas those marked with a star (*) are so far unknown as egfr disease regulated in hepatocellular carcinoma. over protein networks using information of known pathways and sustained proteins connecting these pathways in a given network. the histopathology and oncogenomics of egf induced liver cancers was previously reported [ ] and an important finding of the study was the % incidence of malignant tumour formation in less than one year after birth. notably, a sequence of events was observed that initially consisted of diffuse large cell dysplasia followed by multiple dysplastic foci and nodules and growth of hcc. figure a and b depict the histopathology of healthy non-transgenic control liver and egf induced tumours, respectively. after protein extraction de was performed. subsequently, the gels were scanned on a bio-rad molecular fx scanner at a μm resolution. image analysis was done with the pdquesttm software and spots were detected automatically. a total of proteins differed in expression or were de novo expressed when de gels of non-transgenic controls and hcc mice were compared (see table for detailed information on the proteins identified and figure e -g depicting examples of zoomin-gels of some regulated proteins). among them are statistically significantly regulated proteins (p ≤ . ) of which were significantly up-regulated (ratio hcc/control ≥ ) and included fibrinogen and subunits of it, vimentin, cu/zn superoxide dismutase, and apolipoprotein e ( figure f (i-iv), while proteins were repressed in expression (ratio hcc/control ≤ . ) and included arginase , dhdh protein, glutathione peroxidase and agmatine ureohydrolase ( figure g (i-iv) and table ). a reference -de map of mouse liver and serum proteins was constructed that consists of more than n = proteins [ , ] . note, in our previous efforts we identified n = serum proteins as regulated in the egf transgenic disease model. among them were alpha-fetoprotein, clusterin, fibrinogen-α and -γ, serum amyloid component p and several apolipoproteins all of which were significantly up-regulated. based on the combined use of de and maldi-ms a total of n = differentially expressed proteins were identified (table ) and included isoforms as well as post translational modifications of albumin ( up-regulated spots), alpha enolase ( down-regulated spots), apoliproptein a-i ( up-regulated spots), atp synthase h+ transporting mitochondrial ( down-regulated spots), fibrinogen beta ( up-regulated spots), glycine n-methyltransferase ( spots, in controls only), hsp ( down-regulated spots), nit protein ( down-regulated spots), peroxiredoxin ( up-regulated spot and down-regulated spot), and c rik ( up-regulated spots) (see table ). importantly, a total of n = so far unknown disease regulated proteins were identified that can now be related to egf induced liver cancer. these are marked with an asterisk in table . furthermore, a comparison of serum and liver proteoms revealed n = proteins to be regulated in common, thus evidencing leakage of tumour proteins into systemic circulation (table ) . among them was serum afp; it's up-regulation and that of others was confirmed by western blot analysis (figure a-e) . likewise, apolipoprotein e was up-regulated both in serum and tumour samples, the ratio hcc/control being . and . , respectively. in a previous study on human hcc increased expression of apoe was observed in % of study cases; however, gene apoe expression and serum levels were unchanged to suggest its accumulation and impaired secretion [ ] . two isoforms of alpha- -macroglobulin were up-regulated in serum of hcc-bearing mice (spot : ratio hcc/control = . ; spot : ratio hcc/control = . ). its expression was exclusively associated with tumours. finally, serum amyloid component p was up to -fold up-regulated in serum and its tissue expression was tumour specific ( table ) . to further evidence disease regulated proteins and to provide information on their subcellular localization a total of n = proteins were studied by immunohistochemistry. five of them were selected for their novelty (see table ) while amphiregulin and epiregulin were chosen for their importance in the egf-signalling pathway. furthermore, hnf α was studied for its pivotal role in liver cancer [ ] . depicted in figure are immunohistochemistry stainings performed with egf transgenic livers to confirm regulation and predominant cytoplasmic expression of arginase ii. note, arg is only expressed in hcc and recent evidence suggest modulation of arginine levels in the extracellular milieu to be part of an immune escape mechanism whereby lack of local arginine weakens tumour-infiltrating lymphocytes as t cells require adequate argine levels [ ] . likewise, the tumour specific and cytoplasmic expression of the f-actin capping protein α subunit (capza ) and the predominant nuclear expression of tubulin β that was particularly visible beneath the liver capsule may possible promote microtubule stability and interactions of microtubules with endogenous proteins. furthermore, the induced and predominat cytoplasmic expression of the gdp dissociation inhibitor (gdi ) protein is part of the control of vesicular trafficking. this protein is known to regulate gdp-gtp exchange amongst members of the rab family of proteins. the tumour specific and cytoplasmic expression of amphiregulin supports the notion of a switch in autocrine signalling and it has been reported that amphiregulin is a prognostic marker for poor outcome of a variety of malignancies including colorectal liver metastasis [ ] . finally, the repressed nuclear expression of hnf a was not unexpected and confirms earlier findings [ ] . based on the information given in table the human protein atlas depository (www.proteinatlas.org, version ) was interrogated. as shown in additional file : table s out of mouse liver cancer regulated proteins were likewise regulated in human hcc. it should be noted that for some proteins several antibodies were used to study their expression; only representative data were considered. importantly, out of the proteins uniquely expressed in mouse liver tumours n = were likewise uniquely expressed in human hcc thus evidencing clinical significance of our findings. we compared our previously published transcriptomic data of egf induced liver cancers with the proteomic data obtained in the present study. such comparisons revealed n = genes to be significantly regulated of which n = are in common regulated whereas for n = genes transcript expression was opposite to that of the coded proteins (see additional file : table s ). of the significantly regulated proteins were mapped to different biological processes (see figure a ) of which prominent examples are regulation of arginine metabolism and amino acid import, regulation of cdc protein signal transduction', cellular response to oxidative stress, hydrogen peroxide and superoxide, glycolysis and gluconeogenesis, regulation of cholesterol transport, protein-lipid complex and plasma lipoprotein particle remodeling, positive regulation of steroid metabolic process, negative regulation of calcium ion transmembrane transporter activity and release of sequestered calcium ion into cytosol by sarcoplasmic reticulum, (see additional file : table s ). in figure a -c the go biological process, cellular components and molecular functions are depicted. note, some of the ontology terms could be grouped, i.e. chaperone-mediated protein complex assembly and folding, endoplasmic reticulum unfolded protein response, er-nucleus signalling pathway and response to er oxidative stress as well as hypoxia, blood coagulation, developmental growth and regulation of programmed cell death. as depicted in figure b significantly regulated proteins were mapped to cellular components (see additional file : table s ), i.e. mitochondrial crista, matrix and inner membrane, endoplasmic reticulum lumen, early endosome and cytoplasmic membrane-bounded vesicle, chylomicron, very-low and high density lipoprotein particle, proteasome accessory complex, peroxisome, extracellular vesicular exosome and extracellular membrane-bounded organelle. furthermore, significantly regulated proteins were mapped to molecular functions (see figure c ) and included arginase activity, fructose-bisphosphate aldolase activity, hydrolase and oxidoreductase activity, acting on carbon-nitrogen (but not peptide) bonds, acting on aldehyde, ch-oh group or oxo group of donors, nad or nadp as acceptor as well as steroid dehydrogenase activity. in addition, phosphatidylcholine-sterol o-acyltransferase activator activity, cholesterol transporter activity, sterol transporter, antioxidant and lipid transporter activity as well as electron carrier and serine-type endopeptidase inhibitor activity were prominent functions. finally, proteins functioning in metal ion and purine ribonuleoside triphosphate binding, lipoprotein particle receptor binding, chaperone and oxygen binding, binding of magnesium ion and nad, protease and single-stranded dna binding were observed as disease regulated (additional file : table s ). in all, significantly regulated proteins were classified by the reactome, kegg and wikipathway databases, respectively. the different databases provided similar information with the majority of tumour proteins acting in major metabolic pathways (see figure and information derived from cluego and cluepedia). for example, the proteins aldoa, aldoc, fbp and pkm function in glycolysis and gluconeogenesis whereas akr c , aldoa, aldoc and fbp are part of the fructose and mannose metabolic pathway. likewise, atp h and ndufv are part of the oxidative phosphorylation pathway and mdh and pkm contribute to pyruvate metabolism. similarly, the proteins akr c , akr c , akr c , alb, apoa , apoa , apoe, fdps, gpx , hacl and plg take part in the metabolism of lipids, arachidonic acid and lipoproteins whereas the proteins agmat, arg , arg , bckdha, ckb, cps , haao and phgdh are specified for arginine and proline metabolism. in the same manner the proteins gpx , itpa and nme contribute to the metabolism of nucleotides and related to this are the proteins itpa, pkm and psmc which are part of the purine metabolic pathway. apart from these pathways a highly significant regulation of the blood coagulation cascade, figure western blotting of serum proteins in control and egf transgenic mice. for the commonly regulated proteins in serum and tumours their regulation in liver tissue was confirmed by de and maldi-tof/ms (see table ). depicted are western blots for serum proteins. note, with the exception of egf the regulated serum proteins were already reported in our earlier publication [ ] . platelet activation and fibrinolysis was observed as defined by the proteins crk, fga, fgb, fgg, plg and sod all of which were highly significantly regulated. furthermore, trna aminoacylation (aars, gars and sars), advanced glycosylation endproduct receptor signalling (alb, capza and lgals ), peroxisome (ech , hacl and sod ), protein processing in endoplasmic reticulum (ganab, hyou and pdia ), proteasome (psmc and psmd ) and activation of chaperone genes by xbp (s) and 'unfolded protein response' (hyou and lmna) are pathways significantly perturbed in liver cancer induced by egf (see additional file : table s and additional file : table s ). using the designated workflow of the genexplain platform (see methods section) we searched for master regulatory proteins. the software is designed to identify molecules upstream of regulated tumour proteins to assist in the construction of molecular circuitries. after annotation of the input datasets the tool "find master regulators in networks (geneways)" was used to identify key nodes amongst proteins exclusively expressed in tumours (to). this revealed upstream regulatory molecules. among them five were selected for their link to the egfr signalling pathway, i.e. plaur, fgfr , ptbp and agtrap while the protein s a was chosen for its importance in the plaur/egfr network, (see additional file : figure s a -e). in additional file : table s and additional file : table s , the tumour regulated proteins distributed amongst the selected master regulatory molecules are summarized. in support of its biological significance the constructed networks were enriched with gene expression data from transgenic non-tumour and tumour tissues. thus, the gene and protein data were merged and hybrid networks for each master regulatory protein were constructed. subsequently, these were merged into one (see figure ) and the integrated hybrid network consisted of n = network proteins of which n = were tumour specific. in support, the genes coding for lmna, i.e. a component of the nuclear lamina that is frequently up-regulated in cancers and mvp that codes for multidrug resistance were up-regulated (ur-t) whereas nme, a suppressor of metastasis was repressed in expression (dr-t). likewise, the genes coding for igals , i.e. a beta-galactoside-binding protein frequently overexpressed in cancers and pcbp that is involved in transcription and functions as an inhibitor of invasion [ ] were up-regulated in transgenic nontumour livers (ur-tr-nt) whereas transcript expression of aars, a member of trna synthases and anaxa , a calcium-dependent, phospholipid-binding protein with important roles in the tumour microenvironment and metastasis were repressed (dr-tr-nt). finally, the entire network was enriched with expression data of and genes, respectively that were significantly regulated in tumour and non-tumour transgenic livers. next, we searched for master regulatory molecules by considering regulated tumour proteins obtained from the comparison tumour specific or up-and down regulated as compared to healthy non-transgenic controls (to + ur + dr). this revealed filtered (threshold radius of ) upstream regulators. among these were selected as candidates because of their regulation in liver tumours and their link to egfr signalling. notably, in the constructed network all master regulators were significantly up-regulated and included pdia , apeh, pebp and apoe while the protein expression of arg , fbp and haao was repressed (see additional file : figure s a-g) . note, in the case of arg transcript expression was equally repressed. in additional file : table s and additional file : table s the tumour regulated proteins distributed amongst the selected master regulatory molecules are summarized. in support of its biological significance the fused hybrid network was enriched for gene expression data derived from transgenic non-tumour and tumour tissues. thus, the integrated hybrid network consisted of out of regulated proteins and gene expression calls evidenced of the up-regulated tumour (to + ur) proteins to be regulated at the transcript level as well whereas among the down-regulated tumour proteins (dr) the gene arg was repressed in expression. likewise, gene expression data from non-tumour transgenic livers evidenced genes out of networks partners to be increased in expression (ur-tr-nt) and among the down-regulated networks proteins the gene phb was repressed (dr-tr-nt). thus, when the tumour gene expression data of the entire network was considered a total of genes were regulated, of which were up-regulated and were repressed in expression, (see figure ) . based on the information of the hybrid master regulatory network and in addition to other disease regulated proteins summarized in table (note, some of the proteins were not part of the networks) a total of n = disease regulated proteins were considered for network construction. after filtering for non-connected proteins the string database informed on n = protein-interactions of which n = were disease regulated as identified in the present study. among these , and were either up-, down-or not statistically significantly regulated. furthermore, gene expression calls for up-regulated proteins were supported by up-and down-regulated genes identified in tumours and up-and down-regulated genes in transgenic non-tumour livers. likewise, gene expression calls for down-regulated proteins were supported by and downregulated genes in tumours and transgenic non-tumour livers, respectively. therefore, the entire network was supported by induced and repressed tumour specific gene expression changes and up-regulated and downregulated genes observed in transgenic non-tumour livers. as depicted in figure the proteins of the fusion network displayed functional associations via the egf/egfr network and included out of ( %) significantly regulated tumour proteins with out of master regulators being connected to egfr through the network's proteins (see additional file : table s for possible protein-protein interactions and related scores). of the network proteins could be mapped to distinct pathways. after removal of non-relevant terms such as alzheimer disease a total of proteins were mapped to pathways with meaningful associations (see figure ) and consisted of 'platelet activation, signalling and aggregation (platelet degranulation)' , 'lipoprotein metabolism' , 'mapk signalling pathway' , 'glycolysis and gluconeogenesis', 'metabolism of amino acids and derivatives (arginine and proline metabolism)', 'apoptosis' and 'egfr signalling pathway'. additionally, a total of and tumour regulated proteins were mapped to the hif- signalling and protein processing in endoplasmic reticulum pathways, respectively. the pathway mapping was also supported by gene expression data with up-and down-regulated genes in tumours and up-and downregulated genes in transgenic non-tumour livers. note, two of the significantly regulated tumour proteins, i.e. crk and pebp are members of the egfr signalling pathway with pebp also functioning as a master regulator while the other regulated proteins are connected to egfr signalling through cross-talk among the pathways (see additional file : table s ). figure integrated master regulatory network for proteins uniquely expressed in tumours. based on network information obtained for the different master regulators an integrated hybrid network was constructed. the network contained proteins including with connectivity to egfr signalling (yellow coloured inner node). the master regulator, the connecting proteins (network elements) and regulated proteins are given as red, green and blue coloured inner node, respectively. furthermore, each node is partitioned into four segments whereas the first segment seen from left refers to tumour specific proteins and is red-coloured. the second, third and fourth segments refer to either up-and down-regulated proteins, tumour specific gene expression changes and gene regulations in transgenic non-tumour liver tissue, respectively. increased expression of either proteins or genes is given in red, whereas the blue colour denotes repressed expression. recent research into the molecular pathogenesis of hcc evidenced significant alterations in signalling pathways. given the fact that the epidermal growth factor is an important mitogen for hepatocytes we were particularly interested in investigating the consequences of its targeted overexpression in the liver. in our previous study we employed chromatin immunoprecipitation followed by cloning and sequencing of dna to search for tumour associated gene regulations targeted by novel hnf alpha p and p promoter-driven isoforms. this identified egf-receptor substrate (eps r) and eps as regulated by the p promoter-driven hnf alpha splice variant in mouse and human hcc. a molecular circuitry was proposed whereby eps and eps r mediate internalization of activated egfr to stimulate receptor recycling, therefore responding to mitogenic signalling of egf [ ] . in the present study disease proteomics was performed to further investigate the role of egf in liver cancer. this identified regulated proteins of which are novel and have not been reported so far. a total of proteins were significantly up-regulated (table ) . among these were extra-cellular or secreted proteins and included albumin and isoforms of it, apolipoproteins (apoe, apoa and apoai), α-, β-, γfibrinogen, plasminogen as well as interleukin receptor antagonist (il- ra). note, an isoform of apoa was already proposed as serum marker of hcc [ ] and based on ihc staining il- ra expression was confirmed in about % of mouse liver adenoma and carcinoma cases; however preneoplastic foci as well as normal hepatocytes surrounding the lesions were negative. furthermore, rt-pcr analysis confirmed mouse hepatic tumours to contain both secreted and intracellular forms of il- ra [ ] and figure integrated master regulatory network for hcc regulated proteins. based on network information obtained for different master regulators an integrated hybrid network was constructed. the network contained proteins including with connectivity to egf/egfr signalling (yellow coloured inner node). the master regulator, the connecting proteins (network elements) and regulated proteins are given as red, green and blue coloured inner node, respectively. furthermore, each node is partitioned into four segments whereas the first segment seen from left refers to tumour specific proteins and is red-coloured. the second, third and fourth segments refer to either up-and down-regulated proteins, tumour specific gene expression changes and gene regulations in transgenic non-tumour liver tissue, respectively. increased expression of either proteins or genes is given in red, whereas the blue colour denotes repressed expression. serum levels of il- ra were monitored to assess therapeutic efficacy of radiofrequency ablation in hcc patients [ ] . an important finding of the present study is the statistically significant regulation of mitochondria associated proteins of which were repressed while were upregulated. similar results were reported by chignard and wei sun with mitochondrial proteins being the second largest proportion of regulated proteins in human viral hcc [ , ] . among the repressed proteins were nadh dehydrogenase (ubiquinone) alpha subcomplex and prohibitin, a mitochondrial chaperone. this protein, when deleted (prohibitin ko mice) induced fibrosis, bile duct metaplasia, liver dysplasia and eventually multifocal hcc. however, its overexpression in tumour cell lines inhibited cell proliferation to demonstrate tumour suppressor function [ ] . likewise, glutathione peroxidase (response to oxidative stress) and argininosuccinate synthetase (ass, urea cycle) were repressed. note, ass is the first of two enzymes to convert citrulline to arginine and this pathway allows cells to synthesize arginine from citrulline to function in no production, ammonia detoxification and synthesis of polyamines. several reports suggest ass deficiency to be common in tumour cell lines [ ] [ ] [ ] [ ] [ ] [ ] , and the present study confirms ass expression to be confined to healthy non-transgenic control liver, but ass was absent in tumour tissue extracts (see table ). ablation of ass in diverse tumours suggests a tumour suppressor function and the fact that forced expression of ass in osteosarcoma cell lines suppresses growth adds weight to this notion [ ] . another example of tumour specific ablation of proteins refers to glycine n-methyltransferase (gnmt). the enzyme catalyzes the transfer of a methyl group from s-adenosylmethionine (sam) to glycine thereby generating s-adenosylhomocysteine and n-methylglycine. this protein was completely downregulated in liver tumours. gnmt is known to play a role in the maintenance of genetic stability [ , ] , and a novel tumour suppressor function was recently reported that is independent of its catalytic activity but does require its nuclear localization [ ] . several of the proteins listed in table were already reported for their tumour specific regulation while proteins so far unknown for their regulations in hcc, are marked with an asterisk (table ) . these function in diverse biological processes including metabolism, translation and signalling. specifically, changes in carbohydrate metabolism are commonly observed in tumours where energy production relies on glycolysis rather than mitochondrial oxidative phosphorylation. in the present study induced expression of several glycolytic enzymes was observed, most notable [ ] pyruvate kinase that catalyzes the transfer of a phosphate group from phosphoenolpyruvate to adp and was shown to be a target of mi-rna in hcc [ , ] aldolase, an enzyme that converts fructose , -bisphosphate into dihydroxyacetone phosphate (dhap) and glyceraldehyde -phosphate and was reported to be a sensitive marker for benign and malignant liver disease [ ] and [ ] alpha glucosidase , a hydrolase that cleaves glycosidic bonds with the release of alpha glucose from carbohydrates. further important findings include the tumour specific expression of alanyl-, glycyl-and seryl-trna synthetases which catalyze the transfer of specific amino acids to trna, as well as regulation of eukaryotic translation elongation factor and poly(rc) protein that binds to oligo dc. note, knowledge on the role of aminoacyl-trna synthetases in cancer is just emerging [ ] and through the use of a lentiviral mediated shrna vector, a link between aminoacyl-trna synthetases [aars]-interacting multifunctional protein (aimp ) and repressed egfr signalling was established that resulted in repressed glucose uptake [ ] . we also observed induced expression of heterogeneous ribonucleoprotein (hnrnp) that takes on diverse functions in the processing of mrna. its expression was reported to be increased in serum of hcc patients . in contrast, proteins involved in the synthesis and degradation of cholesterol, lipids, steroids and fatty acid were in part oppositely regulated and included induced expression of the aldo-keto reductase family . regulation of this protein has been reported for lung and pancreatic cancers [ ] , and gene silencing of aldo-keto reductase family b resulted in growth inhibition of colorectal cancer cells that might be of therapeutic utility [ ] . the repressed expression of figure pathways mapping of fussed network proteins. cytoscape . . with plugins (see methods section) are used to generate functionally grouped network of pathways. grouping of significant pathway terms (p ≤ . ) were based on kappa score threshold of . , initial group size of and sharing group percentage of . the pathway network consisted of significantly and non-significantly regulated proteins involved in distinct pathways which are colour-coded. note, the two individual terms are grey-coloured. up and down-regulated proteins are coded as orange and green small discs, respectively. up-and down-regulated as well as non-significantly regulated proteins and connecting proteins of the network are given as orange, green, yellow and blue coloured discs, respectively. the network depicts protein-protein interactions in liver tumours of egfr transgenic mice and their relation to various pathways under the influence of egfr signalling. egfr is highlighted as blue triangle in this network. certain proteins may also be considered as an adaptive response and includes the enzyme enoyl coenzyme a hydratase . its activity was shown to contribute to lymphatic spread of liver tumours as was evidenced in gene silencing studies [ ] . likewise, we observed repressed expression of dihydrodiol dehydrogenase in tumours. this enzyme plays an important role in the metabolism of steroids that leads to inactivation of circulating androgens, progestins and glucocorticoids and was repeatedly reported to be overexpressed in non-small cell lung cancer. amongst patients with high dhd expression the incidence of early tumour recurrence and distant metastasis is significantly higher and patients are highly resistant to chemo and radiotherapy [ ] . intriguingly, complete ablation of mitochondrial butyryl coenzyme a synthetase , a gtp-dependent lipoateactivating enzyme was observed in tumours of egf transgenic mice. little is known about the possible link between butyrate metabolism and liver cancer. however, butyrate is well known to inhibit proliferation of human colon carcinoma cells in an epigenetic manner that involves histone acetylation [ ] . note, it was recently reported that due to the warburg effect butyrate-mediated histone acetylation and cell proliferation is dictated [ ] . several lines of evidence therefore suggest butyrate to act as a cytosolic sensor for histone acteylation and when transformed to intermediates by butyryl coenzyme a synthetase is unable to escape the mitochondria. moreover, we observed a highly significant repression of -hydroxyphytanoyl-coa-lyase. this peroxisomal thiamine pyrophosphate-dependent enzyme is rate limiting in the breakdown of -hydroxy fatty acids. the biological role of -hydroxy fatty acids has only recently become apparent [ ] and cumulative evidence suggests intermediates of energy metabolism to specifically activate g-protein coupled receptors which are now classified as hydroxy carboxylic acid receptors (hca - ). the hca receptor is involved in a complex negative feed-back loop whereby ketone bodies derived from fatty acid oxidation are sensed by hca via the activity of -hydroxybutyrate that leads to inhibition of lipolysis and to restriction of further fatty acid supply. in this way triglyceride use is diverted and energy demands for tumour growth are met more efficiently. specifically, during rapid tumour growth and the herewith associated ischemia the yield of high energy bonds (atp) from glucose oxidation is about twice that of fatty acid oxidation. our observation that proteins involved in the ß-oxidation of fatty acids were either repressed or unchanged agrees well with this principle (see also discussion below). the reduced expression of lysophosphopholipase signifies an adaptive response; it catalyses the production of lysophosphatidic acid, i.e. a second messenger known to contribute to tumour cell motility, survival and proliferation [ ] . additionally, the repressed expression of mitochondrial acyl-coa thioesterase in liver tumours which hydrolyzes acyl-coas to free fatty acids and coenzyme a, will influence the supply of ligands for nuclear receptors and the regulation of fatty acid oxidation in mitochondria and peroxisomes. equally, the regulation of farnesyl diphosphate synthetase, i.e. a key enzyme in the isoprenoid biosynthetic pathway is highly interesting and this enzyme is explored as a drug target of bisphosphonates to treat tumour growth [ ] . it's up-regulation in colon cancers was reported [ ] . in the present study repressed expression of the ribosom-compononent rps and enzymes of amino acid metabolism like branched chain ketoacid dehydrogenase e as well as dimethyl glycine dehydrogease was observed. conversely, expression of the proteasome s atpase subunit (p /sug) and its non-atpase regulatory subunit (psmd ) was confined to tumour tissues (see table ); the latter subunit is known to display high activity in embryonic stem cells. this multicomplex molecular machinery degrades intracellular proteins marked up by ubiquitin chains. psmd was reported to be up-regulated in breast cancer cells [ ] . enhanced expression of cytoskeletal proteins such as tubulin β and capza was also confirmed by ihc staining (see figure ). differences in the localization of these proteins were obvious with tubulin ß expression being primarily associated with cells proximal to the liver capsule, whereas expression of capping protein zline α (capza ) was strongly associated with tumour foci and this protein is known to play a pivotal role in cytoskeletal networks to support cell mobility, invasion and metastasis. additionally, gdi , a protein functioning in the cycling of rab gtpases and arginase ii, i.e. a non-liver isoform of the urea cycle were up-regulated in tumours of egf transgenic mice (see figure ). regulation of arginase ii was observed in various malignancies including lung cancer [ ] . besides, the actin-binding protein lasp was uniquely expressed in tumours and is also up-regulated in breast cancer [ ] to possibly support migration of cancer cells [ ] . furthermore, pdia , a disulfide bond isomerase and master regulator of the constructed networks (see below) was up-regulated as was kininogen that is part of the blood coagulation system and functions as a precursor of kinin. conversely, the serinproteinase inhibitor serpinb a was repressed in expression to possible limited immunological responses in tumour growth and to influence inflammatory cytokine production by infiltrating monocytes [ ] . the significant regulation of the calcium binding protein sorcin and nucleobindin are further highly interesting results. sorcin is associated with multidrug-resistance in human leukemia cells [ ] and nucleobindin is evaluated as a biomarker of colon cancer [ ] . in egf induced liver tumours transthyretin was also up-regulated. this protein is involved in the transport of thyroid hormones and was reported to be aberrantly regulated in thyroid cancer [ ] . among the newly identified proteins is v-crk sarcoma virus ct . this oncoprotein interacts with several tyrosinephosphorylated proteins and is part of the intracellular signalling cascades notably the phosphoinositide -kinase (pi k)/akt pathway [ ] . likewise, regulation of the kda glucose-regulated protein grp is of great importance. this lumenal endoplasmic reticulum plays a role in immunoglobulin folding as was confirmed by coimmunoprecipitation in four different b cell hybridoma cell lines [ ] . in our previous study several immunoglobulins were found to be either repressed or absent in serum of egf tumour bearing mice and this was particularly obvious for the ig k and l classes [ ] . it remains to be determined whether repression of immunoglobulins can be attributed to aberrant grp activity. a summary of the biological functions in addition to their previous reported tumour association is given in additional file : table s while the regulation of genes coding for newly identified proteins and of genes coding for commonly regulated proteins in liver tumours and serum of egf b-transgenic mice is given in additional file : table s and additional file : table s . initially the network construction was based on proteins exclusively expressed in tumours and by selecting master regulatory proteins linked to egfr signalling. thereafter, a fused hybrid network was developed in which tumour specific proteins were part of it. subsequently, the search was extended to all significantly regulated proteins (table ) . this revealed master regulatory proteins and its associated networks and encompassed proteins of which were disease regulated. eventually a fused network was developed; however not all disease regulated proteins are part of it. the performed pathway mapping over fused networks (see string analysis) defined protein interactions and grouped disease regulated proteins into distinct pathways of which platelet activation, signalling and aggregation is a major one (see figure ) . specifically, the glycoprotein fibrinogen is a multimeric protein and consists of α, ß and y subunits. it is synthesized by hepatocytes and an essential blood coagulation factor with all polypeptide chains being highly regulated in tumours of egf transgenic mice. note, an association between coagulation factors and malignancies was established whereby fibrinogen functions as an extracellular matrix protein to interact with integrin receptors in the control of cell proliferation and cell migration [ ] . accordingly, induced gene expression of the integrin receptors itgb , itga and itgav was observed in egf induced liver tumours. in cancer progression a regulatory loop between fibrinogen, platelets and tumour cells has been determined that is activated by platelet cytosolic ca +. this second messenger induces integrin receptor complex formation through an association of platelet glycoprotein chains iib and iiia (cd /cd ) thereby creating an active binding site for fibrinogen. an association of tumour regulated proteins with the regulatory loop was confirmed in string analysis ( figure ) and fibrinogen was reported to be an important determinant for metastasis of circulating tumour cells [ ] . it is therefore of no surprise that elevated blood fibrinogen is a poor prognostic factor. haemostatic complications are commonly observed in cancer patients and future therapeutic strategies may focus on the hemostatic system by targeting tumour stroma. in this regard the tumour specific induction of plasminogen is of great importance. this zymogen [ ] is converted to plasmin by urokinase (upa), a serine protease which itself was unchanged; however, gene expression of its receptor was significantly up-regulated in transgenic non-tumour livers. one report suggests the urokinase receptor to prime cells for proliferation in response to egf by promoting tyr phosphorylation and stat b activation; nonetheless, this depended on intracellular c-src levels [ ] . further studies established a link between induced expression of plasminogen activator, upa receptor and plasminogen activator inhibitor type- (pai- ) and invasiveness and metastasis of hcc [ , ] . indeed, a fine balance exists between the plasminogen activating system and its inhibition by pai- and pai- . based on transcriptomic data a highly significant induction of pai-i (up to -fold) in large tumours of egf transgenic mice was observed [ ] ; consequently, the regulation of components of the plasminogen activating system may be considered as part of a strategy to degrade extracellular matrix thereby facilitating invasion and metastasis [ , ] . to meet energy demands efficiently different sources are utilized and the induction of the proteins aldoa, aldoc, eno , pkm and fbp is testimony to an altered glycolytic and pentose phosphate pathway. however, with the exception of acyl-coa thioesterase that was below the limit of detection and functions in the hydrolysis of myristoyl-palmitoyl-, stearoyl-and arachidoyl-coa esters the regulation of enzymes linked to fatty acid metabolism in mitochondria and peroxisomes was hardly observed. in pursue of tumour growth and to sustain organelle and membrane biogenesis lipids are de novo synthesized and mobilized from stores and while the complex interaction of hepatic lipid and glucose metabolism in liver disease is the subject of intense research [ ] the present study evidences significant regulation of several apolipoproteins, i.e. apoe, apoa , apoa and isoforms of albumin. apart from lipid transport apolipoproteins play a wider role in cancers and are known to interact with diverse receptors to elicit cellular events as demonstrated for apoe to cause sustained proliferation and survival of cancer cells [ ] . a further group of highly regulated proteins are aldoketo reductases. their quantitative evaluation in different hepatocellular carcinoma (hcc) cell lines was recently reported [ ] . this superfamily of proteins comprises nad (p)(h)-dependent enzymes which catalyze oxidoreduction of a variety of prostaglandins, steroids and toxic aldehydes. their involvement in tumorigenesis is supported by several studies and they are explored as drug targets to overcome chemoresistance. in the present study the aldo-keto reductases akr c , akr c and akr c were uniquely expressed in tumours, however glutathione peroxidase was repressed to % of healthy control livers to possibly support hif- signalling. indeed, the redox state and therefore glutathione participates in the hypoxic induction of hif- [ ] , and two proteins of the glycolytic pathway, i.e. aldoa and eno , which respond to hif- signalling, were regulated. moreover, glutathione peroxidase was shifted in the gel as shown in figure panel g iii as a result of post translational modifications that most likely involved c-abl and arg kinase activity at tyr of gpx [ ] . likewise, the genes coding for aldo and eno were significantly up-regulated in egf induced liver tumours. a complex interaction exists between egfr and rage signalling. this receptor for advanced glycation end-products is a member of the immunoglobulin family of cell surface molecules and was reported to significantly influence hepatic tumour growth in murine models of colorectal carcinoma [ ] . there is strong evidence for rage to promote cancer growth upon ligand dependent activation and several proteins of the s family bind to the extracellular domain of rage [ , ] . it is of considerable importance that gene expression of s a and s a was up to -fold induced in tumours of egf transgenic mice, however expression of s a was repressed. likewise the tumour specific expression of the rage binding proteins lectin, galactoside-binding, soluble, and capza in tumours of egf transgenic mice is highly suggestive for a sustained crosstalk between rage and egfr [ ] . although the precise mechanism by which s proteins stimulate egfr signalling remains to be elucidated binding of s a to egf and to other egfr ligands was reported to possibly facilitate interaction with the receptor [ ] . similarly, the binding of s a /a to rage was shown to promote migration and invasion of human breast cancer cells through actin polymerization and epithelial-mesenchymal transition [ ] . conversely, advanced glycation endproduct (age) receptor suppressed oxidant stress-dependent signalling via the egfr and shc/grb /ras pathway [ ] . as depicted in figure the amino acid metabolism was another distinct pathway to which several of the regulated proteins could be mapped to. note, the tumour specific regulations of arginine and as well as the regulation of subunits of the proteasome s atpase (psmc and psmd ) were already discussed (see above). in the following additional proteins regulated in this pathway are briefly summarized. specifically, -hydroxyanthranilate- , -dioxygenase (haao) catalyzes oxidation of -hydroxyanthranilate to quinolinate and this intermediate functions as a precursor in nad and pyridine biosynthetic pathways. expression of haao was significantly repressed in tumours of egf transgenic mice and hypermethylation of the coding gene was observed in ovarian cancer [ ] . due to the fact that haao is significantly repressed at the gene and protein level in at least two different tumour entities (ovarian and liver cancer) the protein may function as a tumour suppressor that appears to be repressed by an epigenetic mechanism. a significant finding is the tumour specific expression of -phosphoglycerate-dehydrogenase which catalyses the production of -phosphoglycerate. this intermediate of glycolysis is an essential precursor of the serine biosynthetic pathway. importantly, a recent metabolomic study evidenced -phosphoglycerate to be diverted into serine and glycine metabolism and repressed expression of -phosphoglyceratedehydrogenase resulted in impaired tumour cell proliferation [ ] . in support of tumour growth the diversion of intermediate of glycolysis affects protein, membrane lipid and nucleotide synthesis. moreover, the observed induction of creatine kinase in tumours of egf transgenic mice creates a circuitry for cellular energy homeostasis in conditions of high metabolic demands [ ] . the enzyme catalyses the reversible transfer of phosphate from phosphocreatine to adp to yield atp and creatine. its induction has been observed in many cancers including liver cancer cell lines [ , ] and a further study suggested a possible interplay between p mutations, hcc, ck expression with growth-inhibitory effects of cyclocreatine in hcc [ ] . while the rationale of tumour cells in embarking on abnormal metabolism had already been discussed (see above) the finding that agmatine ureohydrolase was strongly repressed in egf induced liver tumours to about % of non-transgenic healthy livers is of great importance. this enzyme hydrolyzes agmatine (= decarboxylated arginine) to form putrescine and urea and repression of the enzyme will significantly increase agmatine tissue concentration to influence diverse cellular control mechanisms. importantly, in the study of battaglia and coworkers [ ] mm agmatine induced large amounts of superoxide production in rat liver mitochondria; however, it did not affect mitochondrial respiration or redox levels of thiols and glutathione. furthermore, atp synthesis remained normal and prevented ca( +)-induced mitochondrial permeability transition in the presence of phosphate to suggest an intriguing regulatory loop whereby h o induces hypoxia signalling that is linked to abberant metabolism, nonetheless by selecting interconnected physiological pathways tumour cells are equipped to avoid programmed cell death [ , ] . thus, arginine deprivation is evaluated for its utility in cancer therapy [ ] . a further enzyme repressed to % of healthy nontransgenic liver is carbamoyl phosphate synthetase (cps ), i.e. a liver specific ligase to function in ammonia detoxification. it is perplexing that tumour cells disable such an important pathway of the urea cycle. however, a recent study demonstrated dna hypermethylation as a key mechanism of silencing cps gene expression in human hcc. note, forced expression of cps induced cell proliferation and the observed repression in human hcc may simply be the result of genomic instability as was observed in tumour cells [ ] . the present study identified novel disease regulated proteins induced by overexpression of egf to provide new insight into the complex signalling events in hcc. six major pathways perturbed by egfr hyperactivity were identified and several of the regulated proteins are interesting drug target candidates and this includes tumour specific expression of kinases as well as proteins involved in aberrant metabolism. an identification of commonly regulated proteins in tumour and sera will be of great utility in the development of biomarkers to monitor disease progression and responses to therapy. the following additional data are available with the online version of this paper. aminoacyl-trna synthetases interacting multifunctional protein ; alpha-hcca: alpha-cyano- -hydroxycinnamic acid; aps: ammonium persulfate; ass: argininosuccinate synthetase ; atp: adenosine triphosphate; ck: creatine kinase; ck Α: casein kinase ii subunit alpha; co: control specific protein dr: down-regulated protein; dr-t: down-regulated tumour; dr-tr-nt: down-regulated transgenic non-tumour; egf: epidermal growth factor egfr: epidermal growth factor receptor; eps : epidermal growth factor receptor substrate ; eps r: epidermal growth factor receptor substrate r; er: endoplasmic reticulum; erk: extracellular signalling regulated kinase; fdr: false discovery rate; fgfr : fibroblast growth factor receptor go: gene ontology; grp : kda glucose-regulated protein; gtp: guanosine triphosphate h po : orthophosphoric acid; hca: hydroxy carboxylic acid; hcc: hepatocellular carcinoma; hdac : histone deacetylase ; hed: bis( -hydroxyethyl) disulfide; hrp: horseradish peroxidase ihc: immunohistochemistry; il- ra: interleukin- receptor antagonist ko: knockout mouse mek: mitogen-activated protein kinase kinase; nad: nicotinamide adenine dinucleotide; nad(p)(h): nicotinamide adenine dinucleotide phosphate nadp: nicotinamide adenine dinucleotide phosphate; no: nitric oxide; ogp: n-octyl β-d-glucopyranoside pai- : plasminogen activator inhibitor type- pdgfr: platelet-derived growth factor receptors; plaur: plasminogen activator, urokinase receptor; pmf: peptide mass fingerprinting; ptbp : polypyrimidine tract binding protein ; rage: receptor for advanced glycation endproduct s a : s calcium binding protein a ; sam: s-adenosylmethionine short hairpin rna; stat b: signal transducer and activator of transcription b; temed: tetramethylethylenediamine; to: tumour specific protein; trna: transfer rna; tyr : tyrosine residue ; upa: urokinase-type plasminogen activator; ur: up-regulated protein up-regulated transgenic non-tumour; vegfr: vascular endothelial growth factor receptor; xbp (s): x-box binding protein isoform surgery and ablative therapy for hepatocellular carcinoma improved method for proteome mapping of the liver by -de maldi-tof ms identification of specific protein markers in microdissected hepatocellular carcinoma soluble interleukin- receptor levels in hepatocellular cancer: a more sensitive marker than alfa fetoprotein discovery and development of sorafenib: a multikinase inhibitor for treating cancer epidermal growth factor-induced hepatocellular carcinoma: gene expression profiles in precursor lesions, early stage and solitary tumours mapping of the serum proteome of hepatocellular carcinoma induced by targeted overexpression of epidermal growth factor to liver cells of transgenic mice oncogenic potential of ck alpha and its regulatory role in egf-induced hdac expression in human liver cancer epidermal growth factor receptor inhibition attenuates liver fibrosis and development of hepatocellular carcinoma distribution and function of egfr in human tissue and the effect of egfr tyrosine kinase inhibition the -kda glucose-regulated stress protein is an endoplasmic reticulum protein that binds immunoglobulin the hsp and grp stress proteins: newly recognized relatives of the hsp s proteome database of hepatocellular carcinoma proteomic analysis of differentially expressed proteins in hepatocellular carcinoma developed in patients with chronic viral hepatitis c proteome analysis of hepatocellular carcinoma by two-dimensional difference gel electrophoresis: novel protein markers in hepatocellular carcinoma tissues identification of human hepatocellular carcinoma-related proteins by proteomic approaches expressed proteome analysis of human hepatocellular carcinoma in nude mice (lci-d ) with high metastasis potential overexpression of alpha enolase in hepatitis c virus-related hepatocellular carcinoma: association with tumor progression as determined by proteomic analysis profilin obtained by proteomic analysis in all-trans retinoic acid-treated hepatocarcinoma cell lines is involved in inhibition of cell proliferation and migration serum biomarkers of hepatitis b virus infected liver inflammation: a proteomic study protein level of apolipoprotein e increased in human hepatocellular carcinoma proteome analysis of hepatocellular carcinoma by laser capture microdissection proteomic profiling of proteins decreased in hepatocellular carcinoma from patients infected with hepatitis c virus current progress in proteomic study of hepatitis c virus-related human hepatocellular carcinoma incidence and distribution of argininosuccinate synthetase deficiency in human cancers: a method for identifying cancers sensitive to arginine deprivation pegylated recombinant human arginase (rharg-peg , mw) inhibits the in vitro and in vivo proliferation of human hepatocellular carcinoma through arginine depletion renal cell carcinoma does not express argininosuccinate synthetase and is highly sensitive to arginine deprivation via arginine deiminase identification and preliminary validation of novel biomarkers of acute hepatic ischaemia/ reperfusion injury using dual-platform proteomic/degradomic approaches pegylated arginine deiminase: a novel anticancer enzyme agent arginine deprivation, growth inhibition and tumour cell death: . deficient utilisation of citrulline by malignant cells accurate qualitative and quantitative proteomic analysis of clinical hepatocellular carcinoma using laser capture microdissection coupled with isotope-coded affinity tag and two-dimensional liquid chromatography mass spectrometry oxidative phosphorylation and f(o)f( ) atp synthase activity of human hepatocellular carcinoma underexpression of mrna in human hepatocellular carcinoma focusing on eight loci insight into hepatocellular carcinogenesis at transcriptome level by comparing gene expression profiles of hepatocellular carcinoma with those of corresponding noncancerous liver proteome analysis of hepatocellular carcinoma hypermethylation of nad(p)h: quinone oxidoreductase (nqo ) gene in human hepatocellular carcinoma use of proteomic methods to identify serum biomarkers associated with rat liver toxicity or hypertrophy identification of hepatocellularcarcinoma-associated antigens and autoantibodies by serological proteome analysis combined with protein microarray higher farnesyl diphosphate synthase activity in human colorectal cancer inhibition of cellular apoptosis proteome analysis of human hepatocellular carcinoma tissues by two-dimensional difference gel electrophoresis and mass spectrometry selenoprotein deficiency and high levels of selenium compounds can effectively inhibit hepatocarcinogenesis in transgenic mice contrasting patterns of regulation of the antioxidant selenoproteins, thioredoxin reductase, and glutathione peroxidase, in cancer cells selenium regulation of glutathione peroxidase in human hepatoma cell line hep b genotypic and phenotypic characterization of a putative tumor susceptibility gene, gnmt, in liver cancer characterization of reduced expression of glycine n-methyltransferase in cancerous hepatic tissues using two newly developed monoclonal antibodies strategic shotgun proteomics approach for efficient construction of an expression map of targeted protein families in hepatoma cell lines the alteration of histidase catalytic activity and the expression of the enzyme protein in rat primary hepatomas identification of differential expression of genes in hepatocellular carcinoma by suppression subtractive hybridization combined cdna microarray inhibition of the acute-phase response in a human hepatoma cell line il- receptor antagonist regulation of acute phase protein synthesis in human hepatoma cells expression of an il- receptor antagonist during mouse hepatocarcinogenesis demonstrated by differential display analysis major urinary protein as a negative tumor marker in mouse hepatocarcinogenesis hepatocellular carcinoma: from bedside to proteomics plasminogen fragment k - improves survival in a murine hepatocellular carcinoma model expression of angiostatin cdna in human hepatocellular carcinoma cell line smmc- and its effect on implanted carcinoma in nude mice the role of angiostatin, vascular endothelial growth factor, matrix metalloproteinase and in the angiogenesis of hepatocellular carcinoma using proteomic approach to identify tumor-associated antigens as markers in hepatocellular carcinoma a gene expression biomarker provides early prediction and mechanistic assessment of hepatic tumor induction by nongenotoxic chemicals positional expression profiling indicates candidate genes in deletion hotspots of hepatocellular carcinoma hormonal regulation of vitamin d-binding protein production by a human hepatoma cell line geneways: a system for extracting, analyzing, visualizing, and integrating molecular pathway data progression of hcc in mice is associated with a downregulation in the expression of hepatocyte nuclear factors boosting antitumor responses of t lymphocytes infiltrating human prostate cancers amphiregulin is a promising prognostic marker for liver metastases of colorectal cancer pcbp- regulates alternative splicing of the cd gene and inhibits invasion in human hepatoma cell line hepg cells tasp , and prpf are novel disease candidate genes targeted by hnf alpha splice variants in hepatocellular carcinomas a strategy for the comparative analysis of serum proteomes for the discovery of biomarkers for hepatocellular carcinoma proteomic analysis of sera from hepatocellular carcinoma patients after radiofrequency ablation treatment proteomics for hepatocellular carcinoma marker discovery liver-specific deletion of prohibitin results in spontaneous liver injury, fibrosis, and hepatocellular carcinoma in mice reduced argininosuccinate synthetase is a predictive biomarker for the development of pulmonary metastasis in patients with osteosarcoma methyl transfer in glycine n-methyltransferase. a theoretical study a novel tumor suppressor function of glycine n-methyltransferase is independent of its catalytic activity but requires nuclear localization mir- targets pyruvate kinase m and affects metabolism of hepatocellular carcinoma serum aldolase isoenzymes in benign and malignant liver disease aminoacyl trna synthetases and their connections to disease lentiviral vector-mediated shrna against aimp -dx suppresses lung cancer cell growth through blocking glucose uptake overexpression and oncogenic function of aldo-keto reductase family b (akr b ) in pancreatic carcinoma aldo-keto reductase family b gene silencing results in growth inhibition of colorectal cancer cells: implication for cancer intervention enoyl coenzyme a hydratase is an important factor in the lymphatic metastasis of tumors expression of dihydrodiol dehydrogenase and resistance to chemotherapy and radiotherapy in adenocarcinoma cells of lung butyrate metabolism in human colon carcinoma cells: implications concerning its growth-inhibitory effect the warburg effect dictates the mechanism of butyrate-mediated histone acetylation and cell proliferation biological roles and therapeutic potential of hydroxy-carboxylic acid receptors autotaxin has lysophospholipase d activity leading to tumor cell growth and motility by lysophosphatidic acid production farnesyl pyrophosphate synthase: a key enzyme in isoprenoid biosynthetic pathway and potential molecular target for drug development over-expression of genes and proteins of ubiquitin specific peptidases (usps) and proteasome subunits (pss) in breast cancer tissue observed by the methods of rfdd-pcr and proteomics arginase is expressed by human lung cancer, but it neither induces immune suppression, nor affects disease progression nuclear localization and cytosolic overexpression of lasp- correlates with tumor size and nodal-positivity of human breast carcinoma regulation of cell migration and survival by focal adhesion targeting of lasp- serpinb regulates homeostatic expansion of il- + gammadelta and cd + th cells sorcin, an important gene associated with multidrug-resistance in human leukemia cells autoantibodies to ca + binding protein calnuc is a potential marker in colon cancer detection fine-needle aspiration of thyroid nodules: proteomic analysis to identify cancer biomarkers v-crk activates the phosphoinositide -kinase/akt pathway by utilizing focal adhesion kinase and h-ras tumors and fibrinogen. the role of fibrinogen as an extracellular matrix protein fibrinogen is an important determinant of the metastatic potential of circulating tumor cells plasminogen receptors and their role in the pathogenesis of inflammatory, autoimmune and malignant disease urokinase receptor primes cells to proliferate in response to epidermal growth factor invasion and metastasis of hepatocellular carcinoma in relation to urokinase-type plasminogen activator, its receptor and inhibitor urokinase-type plasminogen activator receptor transcriptionally controlled adenoviruses eradicate pancreatic tumors and liver metastasis in mouse models plasminogen activator system and its clinical significance in patients with a malignant disease the urokinase plasminogen activator system: role in malignancy the interaction of hepatic lipid and glucose metabolism in liver diseases apolipoprotein e is required for cell proliferation and survival in ovarian cancer quantitative evaluation of aldo-keto reductase expression in hepatocellular carcinoma (hcc) cell lines the redox state of glutathione regulates the hypoxic induction of hif- controlled elimination of intracellular h( )o( ): regulation of peroxiredoxin, catalase, and glutathione peroxidase via post-translational modification rage signaling significantly impacts tumorigenesis and hepatic tumor growth in murine models of colorectal carcinoma binding of s proteins to rage: an update rage and rage ligands in cancer identification of galectin- as a high-affinity binding protein for advanced glycation end products (age): a new member of the age-receptor complex epidermal growth factor receptor ligands as new extracellular targets for the metastasis-promoting s a protein rage-binding s a /a promotes the migration and invasion of human breast cancer cells through actin polymerization and epithelial-mesenchymal transition advanced glycation end product (age) receptor suppresses cell oxidant stress and activation signaling via egf receptor identification of candidate epigenetic biomarkers for ovarian cancer detection phosphoglycerate dehydrogenase diverts glycolytic flux and contributes to oncogenesis intracellular compartmentation, structure and function of creatine kinase isoenzymes in tissues with high and fluctuating energy demands: the'phosphocreatine circuit' for cellular energy homeostasis creatine and creatinine metabolism mitochondrial creatine kinase as a tumor-associated marker elevated creatine kinase activity in primary hepatocellular carcinoma different behavior of agmatine in liver mitochondria: inducer of oxidative stress or scavenger of reactive oxygen species? hypoxia signaling pathways in cancer metabolism: the importance of co-selecting interconnected physiological pathways arginine deprivation as a targeted therapy for cancer submit your next manuscript to biomed central and take full advantage of: • convenient online submission • thorough peer review • no space constraints or color figure charges • immediate publication on acceptance • inclusion in pubmed, cas, scopus and google scholar • research which is freely available for redistribution we gratefully acknowledge support from the virtual liver network (grant ) of the german federal ministry of education and research (bmbf) to jb. additional file : table s . antibodies and dilutions used to study disease regulation by immunohistochemistry.additional file : table s . statistics and selection criteria for functional grouping of pathways terms of disease regulated proteins. a hypergeometric test followed by bonferroni correction was used with p-value ≤ . . for grouping of pathway terms, the kappa score threshold was set to . . additional file : table s . different master regulators and associated network for proteins expressed in tumours only (to) or significantly regulated when compared to healthy liver of non-transgenic control animals (t + ur + dr).additional file : table s . comparison of disease regulated proteins in mouse and human hcc. data were taken from 'the human protein atlas' database.additional file : table s . comparison of transcriptomic and proteomic data. this comparison revealed significantly regulated genes of which are regulated in common whereas for genes transcript expression was opposite to that of the coded proteins.additional file : table s . biological processes ontology for significantly regulated proteins. additional file : table s . cellular component ontology for significantly regulated proteins.additional file : table s . molecular function ontology for significantly regulated proteins.additional file : table s . biological pathways and their cluster. cytoscape . . with plugins (cluego and cluepedia) were used to generate functionally grouped network of pathways based on reactome, kegg and wikipathways databases. the grouping of significant pathway terms (p ≤ . ) is based on a kappa score of . , initial group size of and sharing group percentage of .additional file : table s . biological pathways information based each significantly regulated protein (to + ur + dr). this table depicts pathway terms of nearly % of regulated proteins (to + ur + dr) and are taken from reactome and kegg databases.additional file : figure s . master regulatory networks for proteins uniquely expressed in tumours with link to egfr signalling: (a) the plaur network consists of proteins of which are tumour-specifically regulated, (b) the fgfr consists of proteins of which are tumour-specifically regulated, (c) the ptbp network consists of proteins of which are tumour-specifically regulated, (d) the agtrap network consists of proteins of which are tumour-specifically regulated and (e) the s a network consists of protein of which are tumour-specifically regulated. note all networks display connectivity to egfr signalling (yellow coloured inner node) and in the case of the s a master regulatory protein egfr s ignalling is via the plaur/egfr network.the master regulator, the connecting proteins (network elements) and regulated proteins are given as red, green and blue coloured inner node, respectively. furthermore, each node is partioned into four segments whereas the first segment seen from left refers to tumour specific proteins and is red-coloured. the second, third and fourth segments refer to either up-and down-regulated proteins, tumour specific gene expression changes and gene regulations in transgenic non-tumour liver tissue, respectively. increased expression of either proteins or genes is given in red, whereas the blue colour denotes repressed expression.additional file : table s . integrated hybrid network with master regulator information for proteins expressed in tumours only (to) and significantly regulated proteins when compared to heathy liver of non-transgenic control animals (t + ur + dr).additional file : figure s . master regulatory networks for regulated proteins with link to egfr signalling: (a) the pdia network consists of proteins including significantly regulated proteins, (b) the apeh network consists of proteins including significantly regulated proteins, (c) the pebp network consists of proteins including significantly regulated proteins, (d) the apoe network consists of proteins including significantly regulated proteins, (e) the arg network consists of proteins including significantly regulated proteins, (f) the fbp network consists of proteins including significantly regulated proteins, (g) the haao network consists of proteins including significantly regulated proteins. note, all networks display connectivity to egf protein (yellow coloured inner node). the master regulator, the connecting proteins (network elements) and regulated proteins are given as red, green and blue coloured inner node, respectively. furthermore, each node is partioned into four segments whereas the first segment seen from left refers to tumour specific proteins and is red-coloured. the second, third and fourth segments refer to either up-and down-regulated proteins, tumour specific gene expression changes and gene regulations in transgenic non-tumour liver tissue, respectively. increased expression of either proteins or genes is given in red, whereas the blue colour denotes repressed expression.additional file : table s . protein interaction information of the fused network. given are interacting proteins with association score from different prediction methods, i.e. neighborhood, gene fusion, co-occurrence, co-expression, databases and textmining.additional file : table s . biological pathways and their cluster for fused network proteins. reactome, kegg and wikipathways database information was used as input data file for cytoscape . . . the table shows grouping of significant pathway terms (p ≤ . ) and is based on a kappa score of . , initial group size of and sharing group percentage of .additional file : table s . biological function of newly identified proteins and their previously reported tumour association.additional file : table s . regulation of genes coding for newly identified proteins in egf b-transgenic liver tumours.additional file : table s . regulation of genes coding for common proteins in tumour tissue and serum of egf transgenic mice. the authors declare that they have no competing interests.authors' contributions jb conceived the study and contributed the reagents, gg performed the experiments, ps performed the bioinformatics analysis. jb, gg and ps analysed the data, jb wrote the manuscript and all authors read and approved the final manuscript. key: cord- - nasrbs authors: nan title: gene chip for viral discovery date: - - journal: plos biol doi: . /journal.pbio. sha: doc_id: cord_uid: nasrbs nan comparing the genomes of related organisms, researchers can see what parts of the genomes are conserved-highly conserved genes tend to be important-and then focus on these regions to track down genes and determine how they function. to construct a draft sequence of the c. briggsae genome, the researchers merged genomic data from three sources-one derived from whole-genome shotgun sequencing, another from physical genome mapping, and the third from regions of a previously "finished'' sequence. for the shotgun sequence, the researchers extracted dna from worms, randomly cut it into short pieces, sequenced them, and then assembled overlapping sequences to create thousands of stretches of contiguous dna sequence. to help fill in the gaps between these "contigs,'' stein and colleagues developed a "fingerprint'' map of the genome as a guide for aligning the shorter fragments. the map also helped them identify inconsistencies and misalignments in the genome assembly. finally, they integrated the previously finished sequence to improve the draft genome sequence. using these massive datasets, the authors produced a high-quality genome sequence; although it does not quite meet the gold standard of a "finished'' sequence, it covers % of the genome and has an accuracy of . %. after confirming the accuracy of the draft, the researchers turned to the substance of the genome. examining two species side by side, scientists can quickly spot genes and flag interesting regions for further investigation. analyzing the organization of the two genomes, stein et al. not only found strong evidence for roughly , new c. elegans genes, but also indications that certain regions could be "footprints of unknown functional elements.'' while both worms have roughly the same number of genes (about , ), the c. briggsae genome has more repeated sequences, making its genome slightly larger. because the worms set out on separate evolutionary paths about the same time mice and humans parted ways-about million years ago, compared to million years ago-the authors could compare how the two worm genomes have diverged with the divergence between mice and humans. the worms' genomes, it seems, are evolving faster than their mammalian counterparts, based on the change in the size of the protein families (c. elegans has more chemosensory proteins than c. briggsae, for example), the rate of chromosomal rearrangements, and the rate at which silent mutations (dna changes with no functional effect) accumulate in the genome. this would be expected, the researchers point out, because generations per year are a better measure of evolutionary rate than years themselves. (generations in worms are about three days; in mice, about three months.) what is surprising, they say, is that despite these genomic differences, the worms look nearly identical and occupy similar ecological niches; this is obviously not the case with humans and mice, which nevertheless have remarkably similar genomes. both worm pairs-as well as mouse and human-also share similar developmental pathways, suggesting that these pathways may be controlled by a relatively small number of genes and that these genes and pathways have been conserved, not just between the worms, but also between the nematodes and mammals. this question, along with many others, can now be explored by searching the two species' genomes and comparing those elements that have been conserved with those that have changed. with the nearly complete c. briggsae genome in hand, worm biologists have a powerful new research tool. by comparing the genetic makeup of the two species, c. elegans researchers can refine their knowledge of this tiny human stand-in, fill in gaps about gene identity and function, as well as illuminate those functional elements that are harder to find, and study the nature and path of genome evolution. some , people live with partial or nearly total permanent paralysis in the united states, with spinal cord injuries adding , new cases each year. most research aimed at recovering motor function has focused on repairing damaged nerve fibers, which has succeeded in restoring limited movement in animal experiments. but regenerating nerves and restoring complex motor behavior in humans are far more difficult, prompting researchers to explore alternatives to spinal cord rehabilitation. one promising approach involves circumventing neuronal damage by establishing connections between healthy areas of the brain and virtual devices, called brain-machine interfaces (bmis), programmed to transform neural impulses into signals that can control a robotic device. while experiments have shown that animals using these artificial actuators can learn to adjust their brain activity to move robot arms, many issues remain unresolved, including what type of brain signal would provide the most appropriate inputs to program these machines. as they report in this paper, miguel nicolelis and colleagues have helped clarify some of the fundamental issues surrounding the programming and use of bmis. presenting results from a series of long-term studies in monkeys, they demonstrate that the same set of brain cells can control two distinct movements, the reaching and grasping of a robotic arm. this finding has important practical implications for spinal-cord patients-if different cells can perform the same functions, then surgeons have far more flexibility in how and where they can introduce electrodes or other functional enhancements into the brain. the researchers also show how monkeys learn to manipulate a robotic arm using a bmi. and they suggest how to compensate for delays and other limitations inherent in robotic devices to improve performance. while other studies have focused on discrete areas of the brain-the primary motor cortex in one case and the parietal cortex in another-nicolelis et al. targeted multiple areas in both regions to operate robotic devices, based on evidence indicating that neurons involved in motor control are found in many areas of the brain. the researchers gathered data on both brain signals and motor coordinates-such as hand position, velocity, and gripping force-to create multiple models for the bmi. they used retraining the brain to recover movement if you have ever spent an evening hoisting brews with your pals at the corner pub, chances are you never stopped to think-gee, how do i lift my glass now that it's only half full? it seems like a simple task-you raise that glass reflexively, whether it is empty or full-yet the neural calculations that determine the force needed to lift your arm smoothly to your lips in each case are anything but simple. the brain, it seems, operates like a computer to process variable cues-such as the weight of a glass and the position of your arm-to generate an appropriate response: lifting the glass. neuroscientists believe the brain builds a kind of internal software program based on past experience to transform such variable cues into motor commands. the brain's software, or internal model, depends on specialized sets of instructions, or "computational elements,'' in the brain. but exactly how the brain organizes these elements to process sensory variables that affect arm movements is far from clear. eun jung hwang and colleagues predict that these computational elements are based on a multiplicative mechanism, called a gain field, through time after time in biology, revelations about structure lead to insights about corresponding functional mechanisms. while evolution throws in the occasional spandrel, more often organizational structure serves a practical purpose. so naturally, neuroscientists wonder, does the architectural organization of the motor system reveal an underlying functional organization? progress on this question has been complicated by the fact that there appears to be no clear correspondence between the development of motor neurons centrally and their target muscles in the periphery. in the visual system, for example, retinal ganglion cells send axons in an ordered manner into the brain, where they form connections with neurons of the primary visual center in the brain responsible for detecting visual targets. the arrangement of these connections mirrors the neighboring relationships of the neurons in the retina, and so the neural map of connections in the brain is an "anatomical correlate'' of the arrangements in the retina. the origin of these anatomical relationships can be traced through the process of development, allowing scientists to link the assembly of this sensory system with the function of the neurons involved. matthias landgraf and colleagues now report that in the fruitfly drosophila the arrangement of motor neurons corresponds to the distribution of their target muscles. thus, anatomical correlates also exist in the motor system, in the form of a "myotopic map,'' where the arrangement of motor neuron dendritic branches in the central nervous system reflects the distribution of their target body wall muscles in the periphery. starting with the larger question of how the neural networks governing locomotion are specified and assembled during development, the researchers decided to see if they could identify an elementary principle of motor system organization. working in drosophila, they examined motor neurons and the body wall muscles they innervate. with an eye toward understanding the mechanisms directing the assembly of the motor system, the researchers concentrated on the early stages of development, when the motor neurons first establish their characteristic dendritic territories. they found that the dendrites of motor neurons innervating internal muscles and that those innervating external muscles do in fact project into distinct regions, corresponding to the distinct mapping of the muscles themselves. surprisingly, the arrangement of the dendrites in the myotopic map forms independently of the muscles they innervate. it may be, the researchers suggest, that the initial signals charting the location of the dendrites are set very early in development, when the coordinates for other structural elements are established. but that question requires further investigation. the researchers are among the first to reveal such an orderly underlying principles of motor system organization revealed which sensory signals to the brain are amplified by signals from the eye, head, or limbs. in this way, the brain can rely on past experience of one kind of sensory cues to predict how to respond to new but similar situations. while previous studies had established that some visual cues are combined through a gain field, this study shows that motor commands may also be processed via gain fields. this finding, the researchers demonstrate, accounts for a range of behaviors. based on previous studies showing that when people reach to various directions in a small space, they can extrapolate what they learn about the forces in one starting position to a significantly different position, it has been proposed that the way the brain computes movement is not terribly sensitive to limb position. citing other research with seemingly contrary conclusions-that the brain can be highly sensitive to limb position in calculating force and movement-hwang et al. set out to investigate whether-and how-the brain creates a template to translate sensory variables (limb position and velocity) into motor commands (force). they created a computer model to mimic the reaching behaviors observed by people in their experiments and found that the most accurate model used computational elements that are indeed sensitive to both limb position and velocity. if the brain processes these two independent variables through a gain field, it can use the relationship of the two variables-that is, the strength of the gain field-to adapt information about the force needed to move or lift something in one situation to accomplish a wide range of similar movements. when the researchers compared their model to previously published results, they found their model accounted for seemingly disparate findings. they explain that the brain's sensitivity to limb position can be either low or high after a task has been learned because the gain field itself is adjustable. the authors note that neurophysiological experiments suggest that the motor cortex may be one of the crucial components of the brain's internal models of limb dynamics. the next step will be to track the motor cortex neurons to see whether their activity supports this model. hwang et al. predict they will. novel "checkpoint" mechanism mediates dna damage responses connection between patterns of motor neuron dendrites and patterns of muscles. this organization, in the form of the myotopic map, may be mirrored by the patterning of processes of higher-order neurons, which form connections with the motor neuron dendrites themselves. in vertebrates, studies have shown that motor neurons are grouped into "pools'' and "columns'' that correlate with the muscles they innervate. but because these pools and columns represent the location of the cell bodies and not the areas of the spinal cord where the neurons receive most of their inputs, that is, their dendritic branches, scientists could not say whether the pools and columns are simply spandrels-an incidental result of the way motor neurons are generated-or mirror a functional organization of the motor system. this novel fi nding in drosophila will pave the way for future studies on the relationship between anatomy and physiology during development. it will be particularly interesting to discover whether such myotopic arrangements of motor neuron dendrites are unique to insects or whether this organizational principle occurs in other motor systems, including vertebrates. of all the tasks a cell must accomplish day in and day out, protecting its genome may be the most important. genomes confront all manner of potential assaults, from the strand-splitting action of gamma-radiation to the simple copying mistakes sometimes made when dna replicates before a cell divides. though some mutations are harmless, others can disrupt gene action, leading to cancer and other diseases. to guard against such events, healthy cells maintain quality-control "checkpoints'' that sense and respond to dna injuries, as well as to defects in dna replication, and that prevent cell division until the dna can be repaired. if the damage is beyond repair, apoptosis pathways set about the business of destroying the affl icted cell. many of the genes and protein complexes involved in these checkpoint responses have been identifi ed, but the biochemical mechanisms that in some cases trigger cell cycle arrest are not fully understood. experiments by philip hanawalt and his student david pettijohn at stanford university in suggested that the molecular machinery of dna replication and repair-which they discovered at sites of damage-are quite similar and closely linked. while many studies have since supported that link, viola ellison and bruce stillman, the director of the cold spring harbor laboratory, have found new evidence that the two processes may indeed coincide by showing that protein complexes regulating a cellular checkpoint in dna repair operate much like similar complexes involved in dna replication. the molecular pathways governing the replication of dna before cell division are well known. as the double-stranded dna molecule unwinds, different protein complexes step in to ensure that each strand is faithfully reproduced. two protein complexes required for this process are replication factor c (rfc) and proliferating cell nuclear antigen (pcna). in the s, stillman's laboratory isolated pcna and rfc and showed that they function together to "load'' pcna onto a structure in dna that is created after dna synthesis begins. pcna forms a clamp around the dna strand and regulates the dna polymerases that duplicate the dna double helix. studies in yeast had identifi ed a series of proteins required for the dna synthesis phase of the cell cycle and the dna damage checkpoint pathways; mutations in these proteins' genes make cells very sensitive to radiation (hence the name rad genes). a subset of these proteins, which are conserved in human cells, form two protein complexes-rsr and rhr-that function like rfc and pcna, respectively, with rsr loading the rhr clamp onto dna. ellison and stillman demonstrate that both pairs of "clamp-loading'' complexes follow similar biochemical steps, but, signifi cantly, rfc and rsr favor different dna structures for clamp loading. while it was known that the rsr/rhr complexes exist in human cells, it had not been established that the two types of clamps prefer different dna targets. the researchers also show that the rsr/rhr biochemistry depends on rpa, a protein known to be involved in the dna damage-response pathway. the discovery that rsr loads its rhr clamp onto a different dna structure was unexpected; it suggests not only that the two clamp loaders have distinct replication and repair functions, but also how the checkpoint machinery might work to prevent dna damage from being passed on to future generations. by establishing the chemical requirements of rsr/rhr interactions as well as the preferred dna-binding substrate, the researchers have charted the way for determining the different functions of these cell cycle checkpoint complexes and how the complexes' different subunits affect these functions. the researchers propose that the role of this checkpoint machinery is not as an initial sensor of dna damage, but rather as a facilitator of dna repair, stepping in after preliminary repairs to dna lesions have been made. ellison and stillman's work helps establish a biochemical model for studying how both of these checkpoint complexes function to coordinate replication and repair-and promise to help scientists understand how cancer develops when the checkpoint repair mechanisms fail. during animal development, cells gradually grow, multiply, and specialize to create the tissues and organs that shape and sustain multicellular organisms. the progression from a single cell to a thousand-, million-, or trillion-celled animal follows an exacting schedule and plan involving an elaborate network of genes and proteins. one of the primary mechanisms coordinating this process is cell-to-cell communication. cellular signaling regulates two crucial development mechanisms, apoptosis (programmed cell death) and cell proliferation, which work like chisel and clay to sculpt multiplying masses of cells into, say, a fly wing or a human finger. controlled by multiple signals operating at fixed intervals, the entwined pathways can be steered off-course by a single defect in the communication network, resulting in the death of a healthy cell, for example, or the survival of a damaged cell. such disruptions can lead to physical abnormalities, such as webbed hands and feet, when cells that should die remain alive; degenerative nerve disease, when healthy cells are killed; and cancer, when damaged cells survive and evade normal growth limitations. researchers have uncovered some of the mechanisms underlying these processes by studying genes involved in fruitfly (drosophila) development. following that tradition, stephen cohen and david hipfner have identified a gene critical to drosophila development that juggles cell growth and survival signals to help promote cell growth and prevent inappropriate apoptosis. they searched for genes associated with changes in tissue growth in fruitfly wings and identified some that can cause tissue "overgrowth''-abnormally large masses resulting either from cells growing faster than they divide or from cells escaping proliferation controls when they are overexpressed. among these is a gene that encodes a newly divide, and differentiate, most respond to the defect by killing themselves, even under conditions that normally promote survival. thus, cells without slik appear to have an intrinsic survival defect, suggesting that slik prevents apoptosis. when slik is overexpressed, cell proliferation increases, but so does apoptosis. only when apoptosis was blocked did the cells form tumor-like growths. this coupling of cell growth and cell death is characteristic of oncogenes (cancercausing genes), and slik also seems to function in both pathways. the authors point out that the signal to proliferate may inherently sensitize cells to apoptosis, as has been shown previously for some cancer cells. this may keep an individual cell under the control of its neighbors, who collectively monitor the needs of the organism. for a cell to respond to a signal by dividing rather than dying, it must get the appropriate signs from its comrades. slik, the authors demonstrate, is a key factor in determining whether a cell lives or dies. whether its mammalian counterparts play a similar role is yet to be determined. identified kinase that contributes to the regulation of cell proliferation and survival (or death, depending on the circumstance) during drosophila development. cohen and hipfner called the gene slik based on its similarity to two human kinase-coding genes (slk and lok). little is known about these human proteins, though previous studies suggest they may affect cytoskeletal dynamics and cell adhesion. in this paper, the authors report preliminary evidence supporting the notion that slik may regulate the cytoskeleton, the "backbone'' of the cell that confers structure and motility. interestingly, disturbances to cell adhesion and cytoskeletal structure are known triggers of apoptosis and are being explored as potential anticancer agents. kinases make up one of the largest families of proteins and are important regulators of cell signaling. to investigate the function of slik in drosophila, the researchers removed the gene and then studied the physical and cellular effects. they found striking delays in growth and developmental timing and showed that these effects result largely from the demise of the slik-deficient cells. while cells deprived of slik can grow, one might not expect that yeast lead terribly eventful lives, yet the singlecelled fungus must struggle to survive just like everyone else. and for yeastlike everyone else-survival means being able to detect and coordinate a rapid response to changes in its environment. though survival for humans is a bit more complicated, our cells use the same regulatory networks, which maintain cell growth and health when they work and contribute to diseases, from asthma to cancer, when they break down. given the variety of conditions even the lowly yeast is likely to encounter during its life, one might expect to find a multitude of molecules mobilizing a response. but yeast cells, it turns out, are fairly resourceful. as erin o'shea and colleagues report, just one protein in yeast activates different groups of genes in response to different amounts of an environmental stimulus. the researchers focused on how yeast responds to various levels of phosphate, an essential nutrient for all cells. one way that cells regulate responses to environmental stimuli is through the transcription (activation) of genes. these transcriptional responses are often controlled by a multistep process that shuttles gene-activating proteins into the nucleus, where they can generate the appropriate response for a given stimulus, or confines them to the cytoplasm if their gene products are not needed. during this process, called phosphorylation, the addition of a phosphate group to a protein-such as a receptor or transcription factor-acts as a mechanism for controlling gene expression. o'shea's team demonstrated that phosphorylation of a transcription factor called pho controls gene expression by controlling where that protein resides in the cell-in the cytoplasm or in the nucleus. as is the case with many proteins, pho can accept phosphate groups at multiple sites. to see whether the location of phosphorylation affects the action of pho , o'shea's team exposed yeast to different levels of phosphate and tracked the cellular response. they found that when yeast is deprived of phosphate, pho has no phosphate groups at any of its binding sites and enters the nucleus, where it binds to dna and activates a set of genes whose products can scavenge for phosphate or otherwise compensate for the scarcity. when yeast has ample supplies of phosphate, pho is phosphorylated and remains in the cytoplasm-unable to influence transcription-suggesting that the cells can absorb plenty of nutrients from their environs without having to engage a specialized foraging team. when the researchers exposed the yeast to intermediate amounts of phosphate, the results were surprising. middling concentrations of phosphate produced different forms of phosphorylated pho , which varied in their ability to activate genes, and so added to the number of possible responses. pho partially phosphorylated at one site, for example, could still enter the nucleus, but activated only one type of phosphate-recovery gene and not others. while it is not unexpected that differential phosphorylation could have different functional outcomes, the authors say, it is surprising that one enzyme acting on one transcription factor can create different phosphorylation patterns-and therefore different gene-expression patterns-in response to different amounts of a single stimulus. their results show that cells rely on a highly regulated series of interactions that induce subtle changes in gene expression to fine-tune their response to small environmental changes. and they do this in a remarkably efficient manner, relying on a small cast of characters to orchestrate the responses essential for survival. extraction, amplification, and decoding of viral sequences rapidly identify known viruses and classify new ones based on their genetic makeup. this was validated in march when the viral chip contributed to the identification of the cause for severe acute respiratory syndrome (sars) as a novel coronavirus. in the article published in this issue, the researchers describe the chip (or microarray), how it was used in the classification of the sars virus, and how it provides direct access to viral genomic sequence. microarray technology works by taking advantage of the structural properties of dna. dna molecules normally exist as double helices, two complementary strands of nucleotides wrapped around each other. the microarray consists of a large number of single dna strands attached to a solid base. these probes (which in case of the viral chip represent sequences from all fully sequenced reference viruses) can be used to interrogate unknown sequences: if a solution containing such sequences is passed over the chip, similar sequences will "hybridize,'' or bond in a signature double helix. known viruses hybridize in a characteristic pattern and can be identified quickly. because bonding occurs even when the match between probe and sample sequence is not perfect, new relatives of known viruses can be identified as belonging to a particular family (such as coronaviruses, in the case of sars). to quickly obtain more information on a novel virus, it is then possible to "syphon off'' those viral sequences that stuck to their respective counterparts on the chip and to use the material to determine part of the genomic sequence. such sequence information provides more detail on how the new virus relates to known ones, which might provide clues about its origin and possible treatment strategies. faced with all manner of potential threats in the form of billions of different viral, bacterial, and chemical pathogens, the mammalian immune system relies on a "safety in diversity'' strategy for protection. with two distinct subsystems-one innate, the other adaptive-the immune system can recognize some trillion antigens. the innate system deploys cells programmed to quickly recognize microbes with a particular set of conserved molecular structures. the adaptive system relies on billions of uniquely outfitted lymphocytes (white blood cells) to identify just as many pathogens through their protein fragments, or antigens. a human being grinds out billions of these cells every day. in the absence of threats, the immune system maintains a quiescent state and many of these cells are discarded. but for the immune system, doing nothing takes a concerted effort. lymphocytes originate in the bone marrow, though not all differentiate there. one class of lymphocytes, called t cells, develops in the thymus, where every t cell acquires a one-of-a-kind receptor, called a t cell receptor (tcr), designed to recognize a different antigen. when an antigen gets bound by a tcr (a bound molecule is called a ligand), the antigen triggers a signaling cascade that tells the t cell either to attack the infected cell or to alert other immune cells of the infiltrator. but as jeroen roose, arthur weiss, and colleagues report, signaling pathways activated by bound tcrs appear to influence gene expression even in the absence of antigen or other receptor ligands, a process called ligand-independent signaling. these findings lend support to the notion that cellular signaling pathways regulated by surface receptors, like tcrs, exhibit a continuous low-level signaling (known as basal signaling) in the absence of a stimulus and that this continuous signaling, by influencing gene expression, has significant influence on cellular differentiation. roose, weiss, et al. focused on the tcr signaling pathway that regulates the expression of a group of genes, including rag- and rag- , that are activated in two distinct waves during t cell development. rag genes play a crucial role in t cell development, a highly complex, multistage process that involves a reshuffling, or recombination, of tcr genes and the activation of different proteins and genes at different stages. rag genes regulate the genetic recombination and ultimate cell surface expression of tcrs. using chemical inhibitors and mutant human t cell lines deficient in critical signaling components involved in antigen receptor-dependent pathways, the researchers found that the loss of specific functions or specific proteins affected an unexpected set of target genes. notably, when downstream components (the protein kinases erk and abl) were disabled in the basal signaling pathway, the researchers saw a resurgence of rag gene expression. while erk was already known to play a prominent role in signaling pathways downstream of the tcr, it now appears that abl may also be regulated in tcr pathways. most importantly, these findings suggest that signaling pathways thought to be triggered only by ligated receptors can influence gene expression on their own. and it may be through this type of signaling that tcr pathways help regulate t cell development by repressing rag gene activity. these basal signals, the researchers postulate, may in effect save the rag expression machinery until recombination is called for. if rag genes were expressed at the wrong time, they could cause inappropriate genetic recombination and create t cells that either lack function or attack healthy cells, as happens in immunodeficiency and autoimmune diseases. elucidating the mechanisms and components of this basal pathway will contribute important insights into the development and function of the immune system. but these studies also establish a model for investigating other signaling systems, to determine whether biologically functional basal signaling is a rare phenomenon or whether it is a fundamental cell process needed to control the profile of gene expression in the quiescent state. a multicellular organism can have more than different types of cells and as many as trillion altogether. during the process of development, an organism enlists the service of hundreds of signaling molecules and thousands of receptors to direct cell growth, differentiation, and morphological destiny. any given cell has no use for most of these signals and gets by with just a limited repertoire of receptors on its surface. once a signal reaches a receptor, it triggers a series of biochemical reactions as different molecules transform the external signal into a biological response, in a process called signal transduction. one cell type controls all of its cellular functions-both universal and specialized-with just a few dozen receptors; each receptor elicits a wide range of responses by triggering a small number of interacting pathways. exactly how a receptor produces the right response at the right time is a fundamental question in biology. of particular interest is a class of receptors-called receptor tyrosine kinases (rtks)that regulate cell proliferation, differentiation, and survival and play an important role in embryonic development and disease. growth factor receptors are an important subset t he protozoan parasite plasmodium falciparum causes falciparum malaria, a fatal parasitic disease in humans, and is transmitted by anopheles mosquito vectors (predominantly the anopheles gambiae complex and an. funestus in africa). there are about million malaria cases and - million deaths annually, the brunt of which are borne mostly in africa by children under years of age and by pregnant women. in many african countries, malaria poses a formidable challenge to an overburdened and underfunded public health system. the current malarial control strategies consist of chemotherapy directed against the malaria parasite and prevention of mosquito vector/human contact using insecticide-impregnated bednets and, to a lesser extent, indoor residual insecticide spraying and environmental control for reducing mosquito breeding sites. there are still no malaria vaccines in clinical practice. chemotherapy (the use of drugs to target disease) is used for both treatment and prevention. drug resistance is increasingly becoming a problem. some of the antimalarial drugs in current use include quinolines, artemisinins, antifolates, atovaquone/ proguanil, and antibiotics. chloroquine of rtks. the platelet-derived growth factor receptor (pdgfr) family activates downstream signaling enzymes that stimulate the growth and motility of connective tissue cells, such as vascular smooth muscle cells (vsmcs), oligodendrocytes (cells of the tissue encasing nerve fibers), and chondrocytes (cartilage cells). the pdgf beta receptor is essential for directing the differentiation of vsmcs. while studies of signal transduction of this growth factor have established a model of how receptor tyrosine kinases function, the role of individual downstream signaling components in a living organism is still unclear. using mouse molecular genetics, michelle tallquist and colleagues set out to determine the function of individual components in the pdgfr beta pathway. they discovered a quantitative correlation between the overall amount of signal produced by the receptor and the end product of the signal, formation of vsmcs. receptor responses, they report, are controlled in two ways: signaling was influenced both by the amount of receptors expressed and by the number of specific pathways engaged downstream of the receptor. surface receptors have "tails'' that project into a cell's interior. when a surface receptor is activated, a number of potential binding sites-modified amino acid residues-are exposed on its intracellular tail. ten of these sites can bind to proteins with a specific amino acid sequence, called an sh domain; proteins with these domains can then initiate a signal transduction pathway. by introducing mutations in the sh domain-binding sites in mice, the researchers could evaluate how the loss of a particular binding site-and therefore pathway-affected the function of the receptor. they had previously investigated the functions of two other downstream signaling proteins in similar experiments. surprisingly, tallquist et al. found that losing some of the individual components did not produce a significant negative physiological effect. only when multiple downstream signaling pathways were disrupted did the researchers see a significant effect on the population of the cells. reductions in the numbers of both activated receptors and activated signal transduction pathways produced reductions in the population of vsmcs. these results have not been seen in tissue culture before, suggesting that signal transduction is more complex in vivo and that future studies would benefit from incorporating a global approach, rather than targeting a single signaling component. the next step will be to investigate exactly how the individual pathways contribute to this result. it is also unclear whether these results apply only to these growth factor receptors or explain how rtks operate in general. such questions have significant clinical relevance. overexpression of the pdgfr beta pathway has been linked to a variety of serious diseases, including atherosclerosis and cancer. understanding how cells control the action of this growth factor is an important step in developing targeted therapies. since many of these conditions result from a growth factor stuck in the "on'' position, inhibiting overactive receptors promises to be an effective clinical intervention. (cq) is a cheap and widely used aminoquinoline, but cq-resistant parasites have become ubiquitous in endemic countries and other drugs are now used much more frequently (ridley ) . fansidar, a combination of sulphadoxine and pyrimethamine (sp), is a first-line treatment in several african countries, but resistance to sp is spreading rapidly. targeting the mosquito vector with pyrethroidimpregnated bednets, in addition to chemotherapy, is an effective method of controlling malaria transmission. however, pyrethroid resistance has been reported in an. gambiae s.s. in west africa, and there is concern about its emergence in east africa (chandre et al. ) . thus, the public health problem due to malaria is exacerbated by the emergence of drug-resistant parasites and insecticide-resistant mosquitoes. the clinical application of efficacious intervention tools is therefore an urgent imperative for malarial control. this brings into sharp focus the importance of genomics research for drugs, vaccines, diagnostics, and insecticides. the unraveling of the genomes of humans, p. falciparum, and an. gambiae has ushered in a new era of hope that genomics research will result in the development of new and better tools for malaria control. the p. falciparum genome of . megabases (mbp) distributed among chromosomes consists of , protein-coding genes (gardner et al. ) . p. falciparum possesses a relict plastid, the apicoplast, homologous to the chloroplasts of plants and algae. the apicoplast is essential for parasite survival and functions in the anabolic synthesis of fatty acids, isoprenoids, and heme (seeber ). these essential metabolic pathways are not present in humans and are therefore ideal targets for the development of safe antimalarial drugs. inhibitors of type ii fatty acid biosynthesis (triclosan and thiolactomycin) and mevalonateindependent isoprenoid biosynthesis (fosmidomycin and fr ) with potent antimalarial activities have been identified by computational mining of the genome data. the fact that fosmidomycin has rapidly entered into clinical trials underscores the great utility of genomics research in the control of malaria (lell et al. ) . about , proteins ( %) in p. falciparum have no known functions (gardner et al. ) . the greatest challenge of malarial functional genomics (the elucidation of the functions of genes encoded by an organism's genome) is to assign functions to these proteins, thus comprehensively identifying the proteins that function at various lifecycle stages and that function together to carry out particular cellular processes, e.g., red blood cell invasion, signal transduction, growth, vesicular trafficking, etc. the application of functional genomics approaches allows the properties of many genes and proteins to be assessed in parallel on a large scale. these approaches are being used to address specific questions about the biology of p. falciparum. gene profiling (determining which genes are expressed) by microarray technology allows a rapid, parallel analysis of genome-wide changes in gene expression over a variety of experimental conditions (e.g., chloroquine versus saline control), tissues, and cell types; these genes can be clustered (ordered by expression pattern) to identify those that function in the same process. one of the most promising applications of microarrays is the study of differential gene expression during the complex p. falciparum lifecycle, specifically the formidable and challenging task of determining which subset of the , genes is represented in the transcriptome of each stage (bozdech et al. ; le roch et al. ) . these approaches are beginning to yield invaluable insights about new vaccine candidates, novel drug targets, and the molecular basis of drug resistance. proteomics is the study of all the proteins expressed in an organism. global protein analysis offers a unique means of determining not only protein expression, but also interacting partners, subcellular localizations, and post-translational modifications of proteins of whole proteomes. analyses of the proteomes of parasites that have been exposed to distinct environmental stimuli (e.g., chloroquine versus saline control) or that manifest distinct phenotypes (drug resistant versus drug sensitive) might also facilitate the identification of biochemical drug targets and of the specific proteins involved in drug resistance. comparative genomics (the comparison of genomes of related species), on the other hand, will yield invaluable insights about the biology of and the pathogenesis of disease associated with different parasites, i.e., p. falciparum doi: . /journal.pbio. .g on the one hand and p. vivax on the other. the biology and pathology of the two parasites are quite distinct, e.g., the preference for reticulocytes (p. vivax) versus mature red blood cells (p. falciparum), the ability to cause severe (p. falciparum) versus mild (p. vivax) disease, and the implication of amino acid substitutions in pfcrt in cq resistance in one (p. falciparum) but not in the other (p. vivax). the mbp sequence of the nuclear genome of the pest strain of an. gambiae s.s. has been published in draft form and is considerably larger than the mbp assembled sequence of the fruitfly drosophila melanogaster (holt ) . the an. gambiae genome includes a treasure trove of odorant receptor genes and about genes that encode glutathione-s-tranferases, cytochrome p s, and carboxylesterases. these and possibly other genes probably play a critical role in human host finding and detoxification of insecticides, respectively, and could be exploited, using gene profiling, proteomics, and comparative genomics, for the development of novel mosquito repellants or traps and insecticides. the ability to introduce foreign genes into anopheles vectors is an exciting advance that might facilitate the development of transgenic mosquitoes that do not transmit malaria parasites (moreira et al. ) . however, the future implementation of this control strategy, if current technical hurdles can be overcome, must take into consideration concerns about the environmental impact of releasing genetically altered mosquitoes. scientists in endemic countries must be active participants in malaria genomics research and not just conduits for field materials for northern partners. however, the reality is that there is an increasing technological gap between endemic-and developedcountry researchers in the field. this needs to be urgently addressed. the world health organization special programme for research and training in tropical diseases have initiated a series of training workshops in bioinformatics in endemic countries; the howard hughes medical institute has supported one such workshop. the training must extend to other aspects of genomics and include infrastructure development. there is considerable optimism that genomics research will result in new drugs, vaccines, diagnostics, and tools for malarial vector control. strong linkages between genomics research and national malarial control programs will facilitate the translation of research findings into intervention tools. as it is for all new technologies, it might also be important for the communities in endemic countries to have a greater awareness and understanding of genomics research. this will enhance acceptance of the products and improve informed consent. there is therefore a unique opportunity for collaborations between social-economic scientists and genomics researchers. the challenges of searching the scientific literature t he standard "front end" for biomedical literature search is medline and its entrez query system. huge, well-managed, and nearly exhaustive, medline and its million references provide incredible ease and facility for anyone who can type a boolean query. though not quite a parallel for google-which runs a kind of popularity contest for web links in real time-the entrez search has opened up the literature to anyone with a web browser. to those who grew up chasing citations and papers through the aisles of a scientific library, entrez is a dream come true. and yet. suspend disbelief and imagine for a moment a kind of literature search dream-tool. "find me all references citing my gene of interest," you could ask. but why stop there? "find me all references citing some or all of my four genes of interest with expression or in vitro data." and then, "bring up the text of the paragraph in which these citations occurred so i can view them in context. and do it in real time." tools that can perform such searches would go beyond google because they avoid the repetitiveness involved in multiple searches. and they would go beyond entrez because they would search the entire medical literature in full-text format and not, as medline does, just the abstracts. furthermore, they would go beyond both types of searches in that they would be at least somewhat intelligent. such text-mining efforts are the next frontier for both academic and commercial groups that have sprung up from pasadena to boston to tel aviv. but how realistic is this venture? text-mining and its more universal relative "information retrieval" are still in their infancy. the first paper on text-mining for biology was published only in . furthermore, because biological text-mining comes so close to the challenge of comprehending human language-arguably the most complex invention in the history of the planet-it is what computer scientists call a "hard problem." so even here, at the embryonic and fun stage in this technology's history, the outcome and especially the timing of improvement are impossible to predict. language-processing software tools have been successfully applied in text-mining of nonscientific sources, especially to newswire content. computer programs can already perform all three levels of text-mining ( figure ) effectively: retrieving documents relevant to a given subject; extracting lists of entities or relationships among entities; and answering questions about the material, delivering specific facts in response to natural-language queries. information retrieval and extraction can be performed on news data at success rates of %- %, says lynette hirschman, a structural linguist. question-answering has been reported in the literature at % accuracy, she notes, which is "amazingly good." the question is, how soon can these levels be achieved for biology? good thing for biologists that hirschman has turned her energies in their direction. hirschman works in massachusetts at mitre corporation, a government-funded institution that pursues projects in the national interest, be they in defense and intelligence or, as in the case of textmining, "anywhere we can move an entire field forward," says hirschman. the good news from news-mining is that improvement seems to arrive in direct proportion to the time and energy expended by the research community. similar improvement has occurred in speech recognition by computers, she adds ( figure ). when people took successively harder problems and worked on them for four or five years, she explains, it caused error rates to drop, as a rule, by a factor of two every two years. one might think tackling the biomedical literature would be relatively easy, remarks hirschman: biology jargon has a lot of prefixes and suffixes, which can be parsed more easily than verbs and adverbs; it is highly regular, with greek-letter addons to gene or protein names signifying relatives or subtypes of the original proteins; and there are many resources available, such as databases and ontologies linking different biological terms. but whereas extraction of person and place names from news text routinely reaches %, results in biology remain mired in the %- % range. "it's a little depressing," warns hirschman. "even something as simple as a slash may imply two different entities or a single compound." a chorus of assent greets her observation. programmers eager to codify the rules of biology have been stymied by what one bioinformaticist calls "a sea of exceptions." moreover, there is a chronic lack of data that have been "marked up" by software or humans to indicate the roles played by some of the key words. this marking-up process, however it is done, is crucial for machine-learning tasks. getting these data is both hard and expensive, says hirschman. to move biology text-mining forward, she believes, requires organizing different academic and commercial groups so that they are at least working on the same problem. only then can standards emerge that will allow progress in the field even to be measured. this type of shared problem-known as a "challenge evaluation"-has become something of a "religion" in the speech and language community since the s, says hirschman. by putting out a set of data to train on and then issuing a "challenge" for each group to extract the same information or answer the same questions, "you compare apples to apples. in the process you build a research community." last year, hirschman and others ran the very first challenge evaluation in biology, the kdd cup (officially called the knowledge discovery and data-mining challenge cup). six weeks in advance, the organizers gave participants a training set of journal articles already included in the model organism database flybase, along with associated lists of genes and gene products, as well as relevant data fields from flybase. after building their software tools, the entrants were then asked to take a test set of articles and pretend they were curators: the tools were supposed to determine whether the articles were appropriate for curation, based on whether they contained experimental evidence for gene expression products, including both rna transcripts and proteins. eighteen participants took a shot at the kdd cup and their results speak of the infant state of the field. on average, they could assign only % of the papers correctly and could determine whether relevant gene products were present only % of the time. the winning entrant, a joint group from the israeli company clearforest ("see the forest and the trees") and marylandbased celera genomics, did better. doi: . /journal.pbio. .g the four levels of information retrieval: google and medline both use keywords to direct a searcher to documents. but the next level has been tough to crack. improved software would allow biologists to jump from the web or medline to specifics with a single query. (adapted with permission from the mitre corporation.) information retrieval and extraction can be performed on news data at success rates of %- %. the question is, how soon can these levels be achieved for biology? their entry made the right decision to curate % of the time and the right call on the presence of gene products % of the time. the winning group did so well by using a clever "trick," says hirschman admiringly. their program searched for figure captions and then applied multiple techniques to find those gene products they were looking for. the techniques applied by clearforest and others fall into two broad categories, statistical and heuristic. statistical techniques are the next step up from keyword searches. they count words such as genes or gene products appearing close to one another, but apply no linguistic insights, such as whether an adjective modifies a noun. by contrast, heuristic approaches use hand-crafted rules designed for specific datasets: e.g., january, february, march, etc., are months; the word following "mr." is a name; and so forth. this approach is labor-intensive but especially useful when there is only a limited amount of data-as is the case with single scientific papers or small groups of papers. some statistical approaches have been labeled with the nickname "bag of words" because they fail to account for grammatical relationships; e.g., "man bites dog" and "dog bites man" would drop the same three words in the bag. a key observation at the kdd cup was that the most basic statistical approach, which counts word occurrences at the document level, is not sufficient unless it takes into account at least some higher-level context, such as the part of the paper from which the search terms are extracted. furthermore, the more hand-crafted rules there were, the better. many of the top teams included biologists who applied their expertise to help create empirical rules that became part of the program instructions. this points to a general theme in machine learning: the greater the degree of human intervention, the better. the best programs are covered with fingerprints. although the march toward better text-mining systems is building momentum, there are two issues that could stop it in its tracks. the first is access. experts in text-searching uniformly cite access as a key obstacle for developing better search tools. "access is a bigger problem than algorithms" is how one machine-learning expert puts it, and a half-dozen others agreed. the present "balkanized" situation for text-processing is filled with "dead ends" and "short circuits" in information flow among biologists, says david lipman, head of the united states' national center for biotechnology information, which runs pubmed, the medline database, as well as the national library of medicine and other critical resources in biology and bioinformatics. it is as if readers are marine biologists on a coastline whose beaches are % private. at best, asking permission to view every article slows down the work. at worst, there are some important tools one can never build owing to the missing context. medline itself would be much more powerful if it were based on full text, experts say. owing to lack of access, says hirschman, "we miss a great deal by not having large corpora of full-text articles" included in the design of both the kdd cup and the next challenge evaluation, called biocreative, being held later this year. many of the relevant biological data are found outside abstracts, but getting access to full text is complicated at best. for manual searching, researchers traditionally fall back on portalhopping: jumping from one full-text subscription (to nature, science, or cell, for example) to another, or from one portal (highwire, web of science) to another. that way, many scientists routinely obtain access to as many as % of the journals they need. the rest they can usually request via interlibrary loan or order as photocopies online. however, this approach fails for most automated search programs. just sorting out the permissions and keeping up with changes in the portals dramatically increase the headaches for anyone trying to build a search tool. the second threat to text-searching programs ever becoming widely useful has more of the ring of linguistics jargon. the so-called " ontology problem" threatens successful searching based on the very specific nature of biological terminology. the issue here is not only that scientists are truly terrible about sticking to established terminologies. "scientists would rather share each other's underwear than use each other's nomenclature," as biochemist keith yamamoto is fond of saying. consequently, the scientific literature is a hodgepodge of identical or overlapping terms. a naïve text-parsing program does not know whether "cat" refers to the catalase gene, the chloramphenicol transferase gene, or a household animal. the challenge is to build an ontology describing all the important relationships so your computer program can navigate among them without asking you what to do. consequently, an ontology would prescribe rules for understanding the interactions among genes based on the appearance of certain verbs ("inhibit," "express"), nouns ("agonist"), or phrases. although within each narrow scientific subdiscipline it may be possible to build exquisitely useful textmining tools, as soon as programmers broach the borders of the narrowest subfields, they will run into a kind of heisenberg uncertainty principle of linguistics and science. every toolmaker doi: . /journal.pbio. .g is faced with the ontology problem in one respect or another, especially when the tool is meant to be a general one. david gilmour, chief executive officer of tacit inc., a knowledge management company in palo alto, california, is an industry veteran of exactly this war "and i have scars all over my body to prove it," he says. the issue in a nutshell, he explains, is that "ontologies scale poorly, and by the time they are useful," that is, large enough to capture most of the possible relationships among words, "they are unmaintainable." hirschman acknowledges that keeping up with the literature and new terminologies is challenging. adapting tools to new domains has traditionally been one of the "critical stumbling blocks" for text-processing technology, she says. the dynamic growth of biological terminology does not help. there are - alterations every week to the nomenclature section of mouse genome database web page. staying within one's narrow domain, then, could be a recipe for success, as long as the vocabulary and user questions remain tightly constrained, especially if there is a way to tiptoe around the access problem. that is apparently the case at wormbase, though the newly available tool there, called textpresso, is still being built. the motivation for textpresso was simple, says hans-michael mueller, a postdoctoral fellow in the lab of paul sternberg at caltech in pasadena, california, where wormbase-the genetic database for the nematode worm caenorhabditis elegans-is curated. "we want the user to be able to avoid going to the library to read all those papers [on genes and proteins] that your favorite gene interacts with. that is very tedious." the other goal is equally recognizable in the biology community: no mere mortal can hope to keep up with the burgeoning literature, even in the relatively narrow field of worm biology. mueller, a nuclear physicist by background, called textpresso "a search engine for full-text searches of abstracts and articles" that can help find answers to more challenging queries than simple keyword searches. mueller and his team use human "taggers" to mark up the corpus of text to indicate categories like "biological processes" ("late larval activation"), "genes" (let- ), and "molecular functions." then, like the clearforest-celera program, textpresso searches for combinations of categories in the same or neighboring sentences. the ontology relating the expressions and categories to one another is based both on scientific and common sense as well as linguistic components. in less than two years of work, mueller and his team have already marked up . million terms in , abstracts and , full-text papers. a typical search asks a question such as "what can be found out about the negative regulatory aspects of a genetic network in the pharynx?" answers emerge in the form of citations, abstracts, and, if available, a paragraph or so from the text of the relevant paper. textpresso went up-unpublicized-on the web in february this year and already receives a couple of hundred hits a day, a big number in a field of about , researchers. mueller estimates that textpresso is % accurate and that about % of the relevant papers have been included. textpresso needs full-text access to be as good as it is, says mueller. "we noticed" that drawing on full text "greatly increased the chances of a true hit," not a false positive. he managed to avoid the access issue by claiming a kind of "curator's privilege." only the curators see the full text. once the data are on the web, users can only get at most a paragraph, which falls within fair use, said mueller. if a user happens to subscribe to the journal in question, it is possible for him to click through, the publisher's portal and see the paper. whereas textpresso works exclusively on worm genetic data and commercial players like clearforest are just beginning to hunt for biological applications, a handful of companies have begun to market text-searching products to academic biomedical scientists. one such product is called quosa, for query, organize, share, and analyze. the software had its commercial launch in late . put simply, the program-available on an institution-wide basis and already installed for hundreds of researchers at massachusetts general hospital and the dana-farber cancer institute in boston-allows a search across one's own documents. a front end for the literature that cooperates with medline, quosa pulls in and prioritizes full-text papers. the program first allows the user to search for the relevant files and download them in full-text format to the extent permitted by her library's subscription agreements and licenses. once it becomes second nature to users, they rave about it. like the best of the firstgeneration software, quosa allows users to make connections they would not have otherwise made. like so many other early software products, its longterm success will hinge on demand as well as improvements made in the upgrades. because of the ontology problem, improvements in searching in the next couple of years are likely to result from the application of ever-better techniques within existing domains. collaborations among wormbase, flybase, and other model-organism database groups will help improve all their search tools. medline itself may benefit from more advanced search techniques, though these will be restricted to abstract searches. the big unknown for predicting further development of text-search tools is the path publishers will take. if each publisher or portal such as reed-elsevier or highwire were to license or develop its own tool for searching its own content, the result might be better than the status quo, but would still be unsatisfying. running the same search three times on three different subsets of content might be better than running it times-but wouldn't it be easier to run it just once?  the transcriptome of the intraerythrocytic developmental cycle of plasmodium falciparum we are grateful to drs. lisa ranford-cartwright (university of glasgow, glasgow, united kingdom), ayoade oduola and yeya toure (world health organization, geneva, switzerland), and wilfred mbacham (university of yaounde, yaounde, cameroon) for critical comments on the manuscript. key: cord- -mkwpuav authors: moreira, rebeca; balseiro, pablo; planas, josep v.; fuste, berta; beltran, sergi; novoa, beatriz; figueras, antonio title: transcriptomics of in vitro immune-stimulated hemocytes from the manila clam ruditapes philippinarum using high-throughput sequencing date: - - journal: plos one doi: . /journal.pone. sha: doc_id: cord_uid: mkwpuav background: the manila clam (ruditapes philippinarum) is a worldwide cultured bivalve species with important commercial value. diseases affecting this species can result in large economic losses. because knowledge of the molecular mechanisms of the immune response in bivalves, especially clams, is scarce and fragmentary, we sequenced rna from immune-stimulated r. philippinarum hemocytes by -pyrosequencing to identify genes involved in their immune defense against infectious diseases. methodology and principal findings: high-throughput deep sequencing of r. philippinarum using pyrosequencing technology yielded , high-quality reads with an average read length of bp. the reads were assembled into , contigs and the . % of the translated nucleotide sequences into protein were annotated successfully. the most frequently found contigs included a large number of immune-related genes, and a more detailed analysis showed the presence of putative members of several immune pathways and processes like the apoptosis, the toll like signaling pathway and the complement cascade. we have found sequences from molecules never described in bivalves before, especially in the complement pathway where almost all the components are present. conclusions: this study represents the first transcriptome analysis using -pyrosequencing conducted on r. philippinarum focused on its immune system. our results will provide a rich source of data to discover and identify new genes, which will serve as a basis for microarray construction and the study of gene expression as well as for the identification of genetic markers. the discovery of new immune sequences was very productive and resulted in a large variety of contigs that may play a role in the defense mechanisms of ruditapes philippinarum. the manila clam (ruditapes philippinarum) is a cultured bivalve species with important commercial value in europe and asia, and its culture has expanded in recent years. nevertheless, diseases produced by a wide range of microorganisms, from viruses to metazoan parasites, can result in large economical losses. among clam diseases, the majority of pathologies are associated with the vibrio and perkinsus genera [ ] [ ] [ ] . although molluscs lack a specific immune system, the innate response involving circulating hemocytes and a large variety of molecular effectors seems to be an efficient defense method to respond to external aggressions by detecting the molecular signatures of infection [ ] [ ] [ ] [ ] [ ] ; however, not many immune pathways have been identified in these animals. although knowledge of bivalve immune-related genes has increased in the last few years, the available information is still scarce and fragmentary. most of the data concern mussels and eastern and pacific oysters [ ] [ ] [ ] [ ] [ ] [ ] , and very limited information is available on the expressed immune genes of r. philippinarum. recently, the expression of immune-related genes of ruditapes philippinarum and ruditapes decussatus were characterized in response to a vibrio alginolyticus challenge [ ] . also, a recent pyrosequencing study was carried out by milan et al. [ ] , who sequenced two normalized cdna libraries representing a mixture of adult tissues and larvae from r. philippinarum. even more recently ghiselli et al. [ ] , have de novo assembled the r. philippinarum gonad transcriptome with the illumina technology. moreover, a few transcripts encoded by genes putatively involved in the clam immune response against perkinsus olseni have been reported by cdna library sequencing [ ] . currently ( / / ) , there are , ests belonging to r. philippinarum in the genbank database. the european marine genomics network has increased the number of ests for marine mollusc species particularly for ecologically and commercially important groups that are less studied, such as mussels and clams [ ] . unfortunately, most of the available resources are not annotated or well described, limiting the identification of important genes and genetic markers for future aquaculture applications. the use of -pyrosequencing is a fast and efficient approach for gene discovery and enrichment of transcriptomes in non-model organisms [ ] . this relatively low-cost technology facilitates the rapid production of a large volume of data, which is its main advantage over conventional sequencing methods [ ] . in the present work, we undertook an important effort to significantly increase the number of r. philippinarum ests in the public databases. specially, the aim of this work was to discover new immune-related genes using pyrosequencing on the gs flx (roche- life sciences) platform with the titanium reagents. to achieve this goal, we sequenced the transcriptome of r. philippinarum hemocytes previously stimulated with different pathogen-associated molecular patterns (pamps) to obtain the greatest number of immune-related transcripts as possible. the raw data are accessible in the ncbi short read archive (accession number: sra . ). the r. philippinarum normalized cdna library was sequenced with gs flx technology as shown in figure . sequencing and assembly statistics are summarized in table . briefly, a total of , raw nucleotide reads averaging . bp in length were obtained. of these, , exceeded our minimum quality standards and were used in the mira assembly. a total of , quality reads were assembled into , contigs, corresponding to . megabases (mb). the length of the contigs varied from to bp, with an average length of . bp and an average coverage of . reads. singletons were discarded, resulting in , contigs formed by at least ests, and , of these contigs were longer than bp. clustering the contigs resulted in , clusters with more than one contig. the distribution of contig length and the number of ests per contig, as well as the contig distribution by cluster are all shown in figure . even though the knowledge of expressed genes in bivalves has increased in the last few years, it is still limited. indeed, only , nucleotide sequences, , ests, , proteins and genes from the class bivalvia have been deposited in the genbank public database ( / / ) , and the top entries are for the mytilus and crassostrea genera. for ruditapes philippinarum, these numbers are reduced to , ests, proteins and genes. this evidences the lack of information which prompted the recent efforts to increase the number of annotated sequences of bivalves in the databases. for non-model species, functional and comparative genomics is possible after obtaining good est databases. these studies seem to be the best resource for deciphering the putative function of novel genes, which would otherwise remain ''unknown''. ncbi swissprot, ncbi metazoan refseq, the ncbi nonredundant and the uniprotkb/trembl protein databases were chosen to annotate the contigs that were at least bp long ( , ) . the percentage of contigs annotated with a cut off evalue of e- was . %. contig sequences and annotations are included in table s . of these contigs, . % matched sequences from bivalve species and the remaining matched to non-bivalvia mollusc classes ( . %), other animals ( . %), plants ( . %), fungi ( . %), protozoa ( . %), bacteria ( . %), archaea ( . %), viruses ( . %) and undefined sequences ( . %). as shown in figure a , the species with the most sequence matches was homo sapiens with , occurrences. the first mollusc in the top list was lymnaea stagnalis at position . the first bivalve, meretrix lusoria, appeared at position . r. philippinarum was at position with occurrences. notably, a high percentage of the sequences had homology with chordates, arthropods and gastropods ( figure b and c), and only contigs matched with sequences from the veneroida order ( figure d ). these values can be explained by the higher representation of those groups in the databases as compared to bivalves and the quality of the annotation in the databases, which has been reported in another bivalve transcriptomic study [ ] . the data shown highlight, once again, the necessity of enriching the databases with bivalve sequences. a detailed classification of predicted protein function is shown for the top blastx hits ( figure a ). the list is headed by actin with occurrences, followed by ferritin, an angiopoietin-like protein and lysozyme. an abundance of proteins directly involved in the immune response was predicted for this run; ferritin, lysozyme, c q domain containing protein, galectin- and hemagglutinin/amebocyte aggregation factor precursor are immune-related proteins present on the top list. ferritin has an important role in the immune response. it captures circulating iron to overcome an infection and also functions as a proinflammatory cytokine via the iron-independent nuclear factor kappa b (nf-kb) pathway [ ] . lysozyme is a key protein in the innate immune responses of invertebrates against gram-negative bacterial infections and could also have antifungal properties. in addition, it provides nutrition through its digestive properties as it is a hydrolytic protein that can break the glycosidic union of the peptidoglycans of the bacteria cell wall [ ] . the c q domain containing proteins are a family of proteins that form part of the complement system. the c q superfamily members have been found to be involved in pathogen recognition, inflammation, apoptosis, autoimmunity and cell differentiation. in fact, c q can be produced in response to infection and it can promote cell survival through the nf-kb pathway [ ] . galectin- is a central regulator of acute and chronic inflammatory responses through its effects on cell activation, cell migration, and the regulation of apoptosis in immune cells [ ] . the hemagglutinin/amebocyte aggregation factor is a single chain polypeptide involved in blood coagulation and adhesion processes such as self-nonself recognition, agglutination and aggregation processes. the hemagglutinin/ amebocyte aggregation factor and lectins play important roles in defense, specifically in the recognition and destruction of invading microorganisms [ ] . other proteins that are not specifically related to the immune response but could play a role in defense mechanisms include the following: angiopoietin-like proteins, apolipoprotein d and the integral membrane protein b. in other animals, angiopoietin-like proteins (angptl) potently regulate angiogenesis, but a subset also function in energy metabolism. specifically, angptl , the most represented angptl, promotes vascular inflammation rather than angiogenesis in skin and adipose tissues. inflammation occurs via the a b integrin/rac /nf-kb pathway, which is evidenced by an increase in leukocyte infiltration, blood vessel permeability and the expression of inflammatory cytokines (tumor necrosis factor-a, interleukin- and interleukin- b) [ ] . apolipoprotein d (apod) has been associated with inflammation. pathological and stressful situations involving inflammation or growth arrest have the capacity to increase its expression. this effect seems to be triggered by lps, interleukin- , interleukin- and glucocorticoids and is likely mediated by the nf-kb pathway, as there are several conserved nf-kb binding sites in the apod promoter (apre- and ap- binding sites are also present). the highest affinity ligand for apod is arachidonic acid, which apod traps when it is released from the cellular membrane after inflammatory stimuli and, thus, prevents its subsequent conversion in pro-inflammatory eicosanoids. within the cell, apod could modulate signal transduction pathways and nuclear processes such as transcription activation, cell cycling and apoptosis. in summary, apod induction is specific to ongoing cellular stress and could be part of the protective components of mild inflammation [ ] [ ] [ ] . finally, the short form of the integral membrane protein b (itm bs) can induce apoptosis via a caspase-dependent mitochondrial pathway [ ] . to avoid redundancy, the longest contig of each cluster was used for gene ontology terms assignment. a total of . % of the representative clusters matched with at least one go term. concerning cellular components ( figure b ), the highest percentage of go terms were in the groups of cell and cell part with . % in each; organelle and organelle part represented . % and . %, respectively. within the molecular function classification ( figure c ), the most represented group was binding with . % of the terms, which was followed by catalytic activity ( . %) and structural molecular activity ( . %). with regard to biological process ( figure d ), cellular and metabolic processes were the highest represented groups with . % and . % of the terms, respectively, which was followed by biological regulation ( . %). similarities between the r. philippinarum transcriptome and another four bivalve species sequences were analyzed by comparative genomics (crassostrea gigas of the family ostreidae, bathymodiolus azoricus and mytilus galloprovincialis of the family mytilidae and laternula elliptica of the family laternulidae). this analysis could identify specific transcripts that are conserved in these five species. a venn diagram was constructed using unique sequences from these databases according to the gene identifier (gi id number) of each sequence in its respective database: , from c. gigas, , from b. azoricus, , from m. galloprovincialis and , , from l. elliptica. c. gigas was chosen because is the most represented bivalve species in the public databases. the other three species are bivalves that have been studied in transcriptomic assays. figure shows that of the total , clusters, % were found exclusively in the r. philippinarum group, while only . % shared significant similarity with all five species. the number of coincidences among other groups was very low ( . % to . % of sequences), suggesting that , new sequences were discovered within the bivalve group. the percentage of new sequences is very high compared to previous transcriptomic studies [ ] [ ] , in which the fraction of new transcripts was approximately %. one possible explanation for this discrepancy is the low number of nucleotide and est sequences currently available in public databases for r. philippinarum, but these transcripts could also be regions in which homology is not reached, such as and untranslated regions or genes with a high mutation rate. on the other hand, a comparison between our results and the milan et al. [ ] transcriptome using a blastn approach is summarized in table immune-related sequences r. philippinarum hemocytes were subjected to immune stimulation using several different pamps to enrich the est collection with immune-related sequences. the objective was to obtain a more complete view of clam responses to pathogens. a keyword list and go immune-related terms were used to find proteins putatively involved in the immune system. after this selection step, we found that more than % of the proteins predicted from the contig sequences had a possible immune function. some sequences were found to be clustered in common, well-recognized immune pathways, such as the complement, apoptosis and toll-like receptors pathways, indicating conserved ancient mechanisms in bivalves ( figures , , ). the complement system is composed of over plasma proteins that collaborate to distinguish and eliminate pathogens. c is the central component in this system. in vertebrates, it is proteolytically activated by a c convertase through both the classic, lectininduced and alternative routes [ ] . although the complement pathway has not been extensively described in bivalves, there is evidence that supports the presence of this defense mechanism. ests with homology to the c q domain have been detected in the american oyster, c. virginica [ ] , the tropical clam codakia orbicularis [ ] , the zhikong scallop chlamys farreri [ ] and the mussel m. galloprovincialis [ ] [ ] . more recently, a novel c q adiponectin-like, a c and a factor b-like proteins have been identified in the carpet shell clam r. decussatus [ ] [ ] . these data support the putative presence of the complement system in bivalves. our pyrosequencing results, using the blastx similarity approach, showed that the complement pathway in r. philippinarum was almost complete as compared to the kegg reference pathway ( figure ). only the complement components c r, c s, c , c and c were not detected. i. lectins. lectins are a family of carbohydrate-recognition proteins that play crucial self-and non-self-recognition roles in innate immunity and can be found in soluble or membraneassociated forms. they may initiate effector mechanisms against pathogens, such as agglutination, immobilization and complement -mediated opsonization and lysis [ ] . several types of lectins have been cloned or purified from the manila clam, r. philippinarum [ ] [ ] [ ] , and their function and expression were also studied [ , ] . also, a manila clam tandemrepeat galectin, which is induced upon infection with perkinsus olseni, has been characterized [ ] . lectin sequences have been found in the stimulated hemocytes studied in our work: of the contigs are homologous to c-type lectins (calcium-dependent carbohydrate-binding lectins that have characteristic carbohydrate-recognition domains), are homologous to galectins (characterized by a conserved sequence motif in their carbohydrate recognition domain and a specific affinity for bgalactosides), contigs have homology with ficolin a and b (a group of oligomeric lectins with subunits consisting of both collagen-like and fibrinogen-like domains) and contigs have homology with other groups of lectins such as lactose-, mannoseor sialic acid-binding lectins. ii. b-glucan recognition proteins. b-glucan recognition proteins are involved in the recognition of invading fungal organisms. they bind specifically to b- , -glucan stimulating short-term immune responses. although these receptors have been partially sequenced in several bivalves, there is only one complete description of them in the scallop chlamys farreri [ ] . two contigs with homology to the beta- , -glucan-binding protein were found in our study. iii. peptidoglycan recognition proteins. peptidoglycan recognition proteins (pgrps) specifically bind peptidoglycans, which is a major component of the bacterial cell wall. this family of proteins influences host-pathogen interactions through their pro-and anti-inflammatory properties that are independent of their hydrolytic and antibacterial activities. in bivalves, they were first identified in the scallops c. farreri and a. irradians [ , ] and the pacific oyster c. gigas, and from the latter four different types of pgrps were identified [ ] . peptidoglycan-recognition proteins and a peptidoglycan-binding domain containing protein have been found for the first time in r. philippinarum in our results and were present and times, respectively. iv. toll-like receptors. toll-like receptors (tlrs) are an ancient family of pattern recognition receptors that play key roles in detecting non-self substances and activating the immune system. the unique bivalve tlr was identified and characterized in the zhikong scallop, c. farreri [ ] . tlr , and were present among the pyrosequencing results. tlr and tlr form a heterodimer, which senses and recognizes various components from bacteria, mycoplasma, fungi and viruses [ ] . tlr is a novel and poorly characterized member of the toll-like receptor family. although the exact role of tlr is currently unknown, phylogenic analysis indicates that tlr is a member of the tlr subfamily [ ] suggesting that it could recognize urinary pathogenic e. coli [ ] . it has been demonstrated that tlr colocalizes and interacts with unc b , a molecule located in the endoplasmic reticulum, which strongly suggests that tlr might be found inside cells and might play a role in recognizing viral infections [ ] . figure summarizes the tlr signaling pathway with the corresponding molecules found in the r. philippinarum transcriptome. pathogen proteases are important virulence factors that facilitate infection, diminish the activity of lysozymes and quench the agglutination capacity of hemocytes. because protease inhibitors play important roles in invertebrate immunity by protecting hosts through the direct inactivation of pathogen proteases, many bivalves have developed protease inhibitors to regulate the activities of pathogen proteases [ ] . some genes encoding protease inhibitors were identified in c. gigas [ ] , a. irradians [ ] , c. farreri [ ] and c. virginica; in the latter a novel family of serine protease inhibitors was also characterized [ ] [ ] [ ] . a total of contigs with homology to serine, cystein, kunitzand kazal-type protease inhibitors and metalloprotease inhibitors were found among our results. lysozyme was one of the most represented groups of immune genes in this transcriptome study with contigs present. it is an antibacterial molecule present in numerous animals including bivalves. although lysozyme activity was first reported in molluscs over years ago, complete sequences were published only recently including those of r. philippinarum [ ] . antimicrobial peptides (amps) are small, gene-encoded, cationic peptides that constitute important innate immune effectors from organisms spanning most of the phylogenetic spectrum. amps alter the permeability of the pathogen membrane and cause cellular lysis [ ] . in bivalves, they were first purified from mussel hemocyte granules [ , ] . in mussels, the amp myticin c was found to have a high polymorphic variability as well as chemotactic and immunoregulatory roles [ , ] . in clams, two amps with similarity to mussel myticin and mytilin [ ] and a big defensin [ ] are known. we were able to detect contigs with homology to different defensins: defensin- (american oyster defensin), defensin mgd- (mediterranean mussel defensin) and the big defensin previously mentioned. four contigs were similar to an unpublished defensin sequence from venerupis ( = ruditapes) philippinarum. the primary role of heat shock proteins (hsps) is to function as molecular chaperones. their up-regulation also represents an important mechanism in the stress response [ ] , and their activity is closely linked to the innate immune system. hsps mediate the mitochondrial apoptosis pathway and affect the regulation of nf-kb [ ] . hsps are well studied in bivalves. for r. philippinarum, several assays have been developed to better understand the hsps profile in response to heavy metals and pathogen stresses [ ] [ ] [ ] . the most important and well-studied groups of hsps were present in our r. philippinarum transcriptome (hsp , hsp / dnaj, hsp and hsp ), but other, less common hsps were also represented (hsp , hsp , hsp and some members from the hsp family). recently, several genes related to the inflammatory response against lps stimulation have been detected in bivalves. such is the case of the lps-induced tnf-a factor (litaf), which is a novel transcription factor that critically regulates the expression of tnfa and various inflammatory cytokines in response to lps stimulation. it has been described in three bivalve species: pinctada fucata [ ] , c. gigas [ ] and c. farreri [ ] . other tnf-related genes have been identified in the zhikong scallop, such as a tnfr homologue [ ] and a tumor necrosis factor receptor-associated factor (traf ), which is a key signaling adaptor molecule common to the tnfr superfamily and to the il- r/tlr family [ ] . figure shows that several components of the tlr signaling pathway that are present in our transcriptomic sequences (myd , irak , traf- and - , tram, btk, rac- , pi k, akt, btk and tank). a total of , contigs, . % of those annotated, had homology with the main groups of putatively pathogenic organisms such as viruses ( hits), bacteria ( , hits), protozoa ( hits) and fungi ( hits). figure displays the taxonomic classification of these sequences and table summarizes a list of the known bivalve pathogens found in our results. bacteria constitute the main group found among the sequences not belonging to the clam. as filter-feeding animals, bivalves can concentrate a large amount of bacteria and it could be one of their sources of food [ ] . because vibrio spp. are ubiquitous in aquatic ecosystems, it was expected that the vibrionales order, with hits, would be the most predominant. several species of the vibrio genus are among the main causes of disease in bivalves specifically causing bacillary necrosis in larval stages [ ] . is noticeable that sequences belonging to the causative agent of brown ring disease in adults of manila clam, vibrio tapetis, have not been found. perkinsus marinus, with matches, is the only bivalve pathogen found within the protozoa (alveolata) group. perkinsosis is produced by species from the genus perkinsus. both p. marinus and p. olseni have been associated with mortalities in populations of various groups of molluscs around the world and are catalogued as notifiable pathogens by the oie. viruses were the least represented among pathogens. the baculoviridae family was the most predominant with matches, but the corresponding sequences were inhibitors of apoptosis (iaps) [ ] that could also be part of the clam's transcriptome. five viral families were found in our transcriptome study: iridoviridae, herpesviridae, malacoherpesviridae, picornaviridae and retroviridae. a well-known bivalve pathogen was also identified, the ostreid herpesvirus , which has been previously been found to infect clams [ ] . fungi had matches in our results. it is known that bivalves are sensitive to fungal diseases, which can degrade the shell or affect the larval bivalve stages [ , ] . this study represents the first r. philippinarum transcriptome analysis focused on its immune system using a -pyrosequencing approach and complements the recent pyrosequencing assay carried out by milan et al. [ ] . the discovery of new immune sequences was effective, resulting in an enormous variety of contigs corresponding to molecules that could play a role in the defense mechanisms. more than % of our results had relationship with immunity. this new resource is now gathered in the ncbi short read archive with the accession number: sra . . our results will provide a rich source of data to discover and identify new genes, which will serve as a basis for microarray construction and gene expression studies as well as for the identification of genetic markers for various applications including the selection of families in the aquaculture sector. we have found sequences from molecules never described in bivalves before like c , c , c , c , aif, bax, akt, tlr and tlr , among others. as a part of this work, three immune pathways in r. philippinarum have been characterized, the apoptosis, the toll like signaling pathway and the complement cascade, which could help us to better understand the resistance mechanisms of this economically important aquaculture clam species. animal sampling and in vitro stimulation of hemocytes r. philippinarum clams were obtained from a commercial shellfish farm (vigo, galicia, spain). clams were maintained in open circuit filtered sea water tanks at uc with aeration and were fed a total of clams were notched in the shell in the area adjacent to the anterior adductor muscle. a sample of ul of hemolymph was withdrawn from the adductor muscle of each clam with an insulin syringe, pooled and then distributed in -well plates, ml per well, in a total of wells, one for each treatment. hemocytes were allowed to settle to the base of the wells for min at uc in the darkness. then, the hemocytes were stimulated with mg/ml of polyinosinic:polycytidylic acid (poly i:c), peptidoglycans, ß-glucan, vibrio anguillarum dna (cpg), lipopolysaccharide (lps), lipoteichoic acid (lta) or ufc/ml of heat-inactivated vibrio anguillarum (one stimulus per well) for h at uc. all stimuli were purchased from sigma. pyrosequencing. after stimulation, hemolymph was centrifuged at g at uc for minutes, the pellet was resuspended in ml of trizol (invitrogen) and rna was extracted following the manufacturer's protocol. after rna extraction, samples were treated with turbo dnase free (ambion) to eliminate dna. next, the concentration and purity of the rna samples were measured using a nanodrop nd spectrophotometer. the rna quality was assessed in a bioanalyzer (agilent technologies). from each sample, mg of rna was pooled and used for the production of normalized cdna for sequencing in the unitat de genòmica (sct-ub, barcelona, spain). full-length-enriched double stranded cdna was synthesized from , mg of pooled total rna using mint cdna synthesis kit (evrogen, moscow, russia) according to manufacturer's protocol, and was subsequently purified using the qiaquick pcr purification kit (qiagen usa, valencia, ca). the amplified cdna was normalized using trimmer kit (evrogen, moscow, russia) to minimize differences in representation of transcripts. the method involves denaturation-reassociation of cdna, followed by a digestion with a duplex-specific nuclease (dsn) enzyme [ , ] . the enzymatic degradation occurs primarily on the highly abundant cdna fraction. the single-stranded cdna fraction was then amplified twice by sequential pcr reactions according to the manufacturer's protocol. normalized cdna was purified using the qiaquick pcr purification kit (qiagen usa, valencia, ca). to generate the library, ng of normalized cdna were used. cdna was fractionated into small, -to -basepair fragments and the specific a and b adaptors were ligated to both the and ends of the fragments. the a and b adaptors were domain; pkc: protein kinase c; pten: phosphatidylinositol- , , -trisphosphate -phosphatase and dual-specificity protein phosphatase pten; raidd: caspase and rip adapter with death domain; tnf r : tumor necrosis factor receptor ; tnf-a: tumor necrosis factor alpha; tradd: tnf receptor type -associated death domain protein; traf : tnf receptor-associated factor ; trail: tnf-related apoptosis-inducing ligand; trail decoy: decoy trail receptor without death domain; trail-r: trail receptor. doi: . /journal.pone. .g used for purification, amplification, and sequencing steps. one sequencing run was performed on the gs-flx using titanium chemistry. sequencing is based on sequencing-by-synthesis, addition of one nucleotide, or more, complementary to the template strand results in a chemiluminescent signal recorded by the ccd camera within the instrument. the signal strength is proportional to the number of nucleotides incorporated in a single nucleotide flow. all reagents and protocols used were from roche life sciences, usa. pyrosequencing raw data, comprised of , reads, were processed with the roche quality control pipeline using the default settings. seqclean (http://compbio.dfci.harvard.edu/tgi/software/) software was used to screen for and remove normalization adaptor sequences, homopolymers and reads shorter than bp prior to assembly. a total of , quality reads were subjected to mira, version . . [ ] , to assemble the transcriptome. by default, mira takes into account only contigs with at least reads. the other reads go into debris, which might include singletons, repeats, low complexity sequences and sequences shorter than bp. ncbi blastclust was used to group similar contigs into clusters (groups of transcripts from the same gene). two sequences were grouped if at least % of the positions had at least % identity. the , contigs were grouped into a total of , clusters. an iterative blast workflow was used to annotate the r. philippinarum contigs with at least bp ( , contigs out of , ). then, blastx [ ] with a cut off value of e- , was used to compare the r. philippinarum contigs with the ncbi swissprot, the ncbi metazoan refseq, the ncbi nr and the uniprotkb/trembl protein databases. after annotation, blast go software [ ] was used to assign gene ontology terms [ ] to the largest contig of a representative cluster (minimum of bp). this strategy was used to avoid redundant results. default values in blast go were used to perform the analysis and ontology level was selected to construct the level pie charts. to make a comparison between r. philippinarum and other bivalve species, the nucleotide sequences and ests from c. gigas, m. galloprovincialis, l. elliptica and b. azoricus were obtained from genbank and from dedicated databases, when available. [ ] . unique sequences from these databases (based on gi number) were used from each of the databases. these sequences were compared by blastn against the longest contig from each of , r. philippinarum clusters with a cut off e-value of e- . hits to r. philippinarum sequences were represented in a venn diagram. the comparison between our results, the longest contig from each of , clusters, and the milan et al. [ ] transcriptome, contigs downloaded from ruphibase (http:// compgen.bio.unipd.it/ruphibase/query/), was made by blastn with a cut off e-value of e- . another analysis was carried out to compare just the longest contig from each of , clusters identified as immune-related and the milan et al. contigs as well. the results were summarized in a table ( table ). the percentage of coverage is the average % of query coverage by the best blast hit and the percentage of hits is the % of query with at least one hit in database, in parenthesis were added the total number of hits. identification of immune-related genes all the contig annotations were revised based on an immunity and inflammation-related keyword list (i.e. apoptosis, bactericidal, c , lectin, socs…) developed in our laboratory to select the candidate sequences putatively involved in immune response. the presence or absence of these words in the blastx hit descriptions was checked to identify putative immune-related contigs. the remaining non-selected contigs were revised using the go terms at level , and assigned to each sequence after the annotation step that had a direct relationship with immunity. selected contigs were checked again to eliminate non-immune ones and distributed into functional categories. immune-related genes were grouped in three reference immune pathways (complement cascade, tlr signaling pathway and apoptosis) to describe each route indicated by our pyrosequencing results. to identify and classify the groups of organisms that had high similarity with our clam sequences, the uniprot taxonomy [ ] was used except for the protozoa group. because protozoa are a highly complex group, a specific taxonomy [ ] was followed. briefly, after the blastx annotation step all the hit descriptions included the species name (i.e. homo sapiens) or a code (i.e. human) meaning that protein has been previously identified as belonging to that species. with such information sequences were classified in taxonomical groups and represented in pie charts. table s list of contigs (e-value, - ) of ruditapes philippinarum including sequence, length, description (hit description), accession number of description (hit acc), e-value obtained and database used for annotation (blast). study of diseases and the immune system of bivalves using molecular biology and genomics bacterial disease in marine bivalves, review of recent studies. trends and evolution perkinsosis in molluscs: a review bacteria-hemocyte interactions and phagocytosis in bivalves role of lectins (c-reactive protein) in defense of marine bivalves against bacteria modulation of the chemiluminescence response of mediterranean mussel (mytilus galloprovincialis) haemocytes immune parameters in carpet shell clams naturally infected with perkinsus atlanticus nitric oxide production by carpet shell clam (ruditapes decussatus) hemocytes generation and analysis of a , unique expressed sequence tags from the pacific oyster (crassostrea gigas) assembled into a publicly accessible database, the gigasdatabase immune gene discovery by expressed sequence tags generated from hemocytes of the bacteria-challenged oyster, crassostrea gigas sequence variability of myticins identified in haemocytes from mussels suggests ancient host-pathogen interactions mytibase, a knowledgebase of mussel (m. galloprovincialis) transcribed sequences insights into the innate immunity of the mediterranean mussel mytilus galloprovincialis development of expressed sequence tags from the pearl oyster, pinctada martensii dunker gene expression analysis of clams ruditapes philippinarum and ruditapes decussatus following bacterial infection yields molecular insights into pathogen resistance and immunity transcriptome sequencing and microarray development for the manila clam, ruditapes philippinarum: genomic tools for environmental monitoring de novo assembly of the manila clam ruditapes philippinarum transcriptome provides new insights into expression bias, mitochondrial doubly uniparental inheritance and sex determination analysis of est and lectin expression in hemocytes of manila clams (ruditapes phylippinarum) (bivalvia, mollusca) infected with perkinsus olseni increasing genomic information in bivalves through new est collections in four species, development of new genetic markers for environmental studies and genome evolution rapid transcriptome characterization for a nonmodel organism using pyrosequencing sequencing technologies -the next generation transcriptomic analysis of the clam meretrix meretrix on different larval stages ferritin functions as a proinflammatory cytokine via iron-independent protein kinase c zeta/nuclear factor kappab-regulated signaling in rat hepatic stellate cells cloning and characterization of an invertebrate type lysozyme from venerupis philippinarum c q and tumor necrosis factor superfamily: modularity and versatility the regulation of inflammation by galectin- isolation, cdna cloning, and characterization of an -kda hemagglutinin and amebocyte aggregation factor from limulus polyphemus angiopoietin-like proteins: emerging targets for treatment of obesity and related metabolic diseases modulation of apolipoprotein d expression and translocation under specific stress conditions neuroprotective effect of apolipoprotein d against human coronavirus oc -induced encephalitis in mice apolipoprotein d itm bs regulates apoptosis by inducing loss of mitochondrial membrane potential transcriptomic signatures of ash (fraxinus spp.) phloem transcriptomics of the bed bug (cimex lectularius) complement and its role in innate and adaptive immune responses potential indicators of stress response identified by expressed sequence tag analysis of hemocytes and embryos from the american oyster, crassostrea virginica analysis of a cdna-derived sequence of a novel mannose-binding lectin, codakine, from the tropical clam codakia orbicularis a novel c q-domaincontaining protein from zhikong scallop chlamys farreri with lipopolysaccharide binding activity the c q domain containing proteins of the mediterranean mussel mytilus galloprovincialis: a widespread and diverse family of immune-related molecules mgc q, a novel c q-domain-containing protein involved in the immune response of mytilus galloprovincialis differentially expressed genes of the carpet shell clam ruditapes decussatus against perkinsus olseni characterization of a c and a factor b-like in the carpet-shell clam, ruditapes decussatus structural and functional diversity of lectin repertoires in invertebrates, protochordates and ectothermic vertebrates purification and characterisation of a lectin isolated from the manila clam ruditapes philippinarum in korea characterization, tissue expression, and immunohistochemical localization of mcl , a c-type lectin produced by perkinsus olseni-infected manila clams (ruditapes philippinarum) noble tandem-repeat galectin of manila clam ruditapes philippinarum is induced upon infection with the protozoan parasite perkinsus olseni lectin from the manila clam ruditapes philippinarum is induced upon infection with the protozoan parasite perkinsus olseni cdna cloning and mrna expression of the lipopolysaccharide-and beta- , -glucan-binding protein gene from scallop chlamys farreri molecular cloning and characterization of a short type peptidoglycan recognition protein (cfpgrp-s ) cdna from zhikong scallop chlamys farreri molecular cloning and mrna expression of peptidoglycan recognition protein (pgrp) gene in bay scallop (argopecten irradians, lamarck ) distribution of multiple peptidoglycan recognition proteins in the tissues of pacific oyster, crassostrea gigas molecular cloning and expression of a toll receptor gene homologue from zhikong scallop, chlamys farreri pattern recognition receptors and inflammation the evolution of vertebrate toll-like receptors a tolllike receptor that prevents infection by uropathogenic bacteria unc b delivers nucleotide-sensing toll-like receptors to endolysosomes cg-timp, an inducible tissue inhibitor of metalloproteinase from the pacific oyster crassostrea gigas with a potential role in wound healing and defense mechanisms molecular cloning, characterization and expression of a novel serine proteinase inhibitor gene in bay scallops (argopecten irradians, lamarck ) molecular cloning and expression of a novel kazal-type serine proteinase inhibitor gene from zhikong scallop chlamys farreri, and the inhibitory activity of its recombinant domain a novel slowtight binding serine protease inhibitor from eastern oyster (crassostrea virginica) plasma inhibits perkinsin, the major extracellular protease of the oyster protozoan parasite perkinsus marinus evidence indicating the existence of a novel family of serine protease inhibitors that may be involved in marine invertebrate immunity serine protease inhibitor cvsi- potential role in the eastern oyster host defense against the protozoan parasite perkinsus marinus antimicrobial peptides: pore formers or metabolic inhibitors in bacteria? innate immunity. isolation of several cysteine-rich antimicrobial peptides from the blood of a mollusc, mytilus edulis a member of the arthropod defensin family from edible mediterranean mussels (mytilus galloprovincialis) evidence of high individual diversity on myticin c in mussel (mytilus galloprovincialis) mytilus galloprovincialis myticin c: a chemotactic molecule with antiviral activity and immunoregulatory properties analysis of differentially expressed genes in response to bacterial stimulation in hemocytes of the carpetshell clam ruditapes decussatus: identification of new antimicrobial peptides molecular characterization of a novel big defensin from clam venerupis philippinarum heat shock proteins: facts, thoughts, and dreams heat shock proteins, cellular chaperones that modulate mitochondrial cell death pathways djla, a membrane-anchored dnaj-like protein, is required for cytotoxicity of clam pathogen vibrio tapetis to hemocytes alternation of venerupis philippinarum hsp gene expression in response to pathogen challenge and heavy metal exposure identification of two small heat shock proteins with different response profile to cadmium and pathogen stresses in venerupis philippinarum molecular characterization and expression analysis of a putative lps-induced tnf-alpha factor (litaf) from pearl oyster pinctada fucata cloning, characterization and expression analysis of the gene for a putative lipopolysaccharide-induced tnf-alpha factor of the pacific oyster molecular cloning and characterization of a putative lipopolysaccharide-induced tnf-alpha factor (litaf) gene homologue from zhikong scallop chlamys farreri first molluscan tnfr homologue in zhikong scallop: molecular characterization and expression analysis identification and expression of traf (tnf receptor-associated factor ) gene in zhikong scallop chlamys farreri diversity and pathogenecity of vibrio species in cultured bivalve molluscs an apoptosis-inhibiting baculovirus gene with a zinc finger-like motif detection of ostreid herpesvirus dna by pcr in bivalve molluscs: a critical review synopsis of infectious diseases and parasites of commercially exploited shellfish a fungus disease in clam and oyster larvae a novel method for snp detection using a new duplex-specific nuclease from crab hepatopancreas simple cdna normalization using kamchatka crab duplex-specific nuclease using the miraest assembler for reliable and automated mrna transcript assembly and snp detection in sequenced ests basic local alignment search tool blast go, a universal tool for annotation, visualization and analysis in functional genomics research gene ontology, tool for the unification of biology. the gene ontology consortium pyrosequencing of mytilus galloprovincialis cdnas: tissue-specific expression patterns insights into shell deposition in the antarctic bivalve laternula elliptica: gene discovery in the mantle transcriptome using pyrosequencing highthroughput sequencing and analysis of the gill tissue transcriptome from the deep-sea hydrothermal vent mussel bathymodiolus azoricus newt, a new taxonomy portal the new higher level classification of eukaryotes with emphasis on the taxonomy of protists key: cord- -bkr ndl authors: seifi, morteza; walter, michael a. title: accurate prediction of functional, structural, and stability changes in pitx mutations using in silico bioinformatics algorithms date: - - journal: plos one doi: . /journal.pone. sha: doc_id: cord_uid: bkr ndl mutations in pitx have been implicated in several genetic disorders, particularly axenfeld-rieger syndrome. in order to determine the most reliable bioinformatics tools to assess the likely pathogenicity of pitx variants, the results of bioinformatics predictions were compared to the impact of variants on pitx structure and function. the mutpred, provean, and pmut bioinformatic tools were found to have the highest performance in predicting the pathogenicity effects of all characterized missense variants in pitx , all with sensitivity and specificity > %. applying these three programs to assess the likely pathogenicity of previously uncharacterized pitx missense variants predicted / variants as deleterious, except a v which was predicted as benign variant for all programs. molecular modeling of the pitx homoedomain predicts that of the known pitx variants, l q, f l, v f, v l, w c, w s, and r p alter pitx ’s structure. in contrast, the remaining variants are not predicted to change pitx ’s structure. the results of molecular modeling, performed on all the pitx missense mutations located in the homeodomain, were compared with the findings of eight protein stability programs. cupsat was found to be the most reliable in predicting the effect of missense mutations on pitx stability. our results showed that for pitx , and likely other members of this homeodomain transcription factor family, mutpred, provean, pmut, molecular modeling, and cupsat can reliably be used to predict pitx missense variants pathogenicity. paired-like homeodomain transcription factor (pitx , refseq nm . , mim# ) is located at q and is expressed in the developing eye, brain, pituitary, lungs, heart, and gut [ ] . mutations in human pitx or the forkhead box transcription factor c (foxc ; p , refseq nm . , mim# ) underlie the autosomal dominant disorder called axenfeld-rieger syndrome (ars; mim# ) [ ] [ ] [ ] [ ] . ars is a full penetrant, but clinically and genetically heterogeneous disorder characterized by developmental anomalies involving both ocular and non-ocular structures [ ] . to date, identified including deletions, insertions, splice-site mutations, and coding region frameshift, nonsense and missense mutations [ ] [ ] [ ] [ ] [ ] [ ] [ ] . identifying new disease-associated variants is becoming increasingly important for genetic testing and it is leading to a significant change in the scale and sensitivity of molecular genetic analysis [ ] . one of the most frequent approaches for detecting novel variants in target genes is using direct gene sequencing. however, due to increasing number of newly identified missense variants, it is often difficult to interpret the pathogenicity of these variants as not all the mutations alter protein function, and the ones that do may also have different functional impacts in disease [ , ] . thus, prior to detailed analyses, novel variants cannot be easily classified as either deleterious or neutral, because of their unknown functional and phenotypic consequences. therefore, further research should be conducted to validate the genetic diagnosis when a novel missense variant is discovered. preferably, in vitro characterization of novel variants should be undertaken; however, due to facility limitation, it is often not practicable to experimentally verify the impact of large number of mutations on protein function [ ] . another robust approach to substantiate the pathogenicity is using animal models by generating the homologous mutation that recapitulates the human phenotype; but, similar to in vitro studies, these are time-consuming, labor-intensive, difficult and expensive, making this approach unfeasible to experimentally determine the pathogenicity effects of all novel identified variants [ ] . to circumvent the above mentioned limitations and to provide fast and efficient methods for predicting the functional effect of nonsynonymous variants on protein stability, structure, and function, several computational tools have been developed [ ] [ ] [ ] . protein stability and structure are key factors affecting function, activity, and regulation of proteins. conformational changes are necessary for many proteins' function and disease-causing variants can impair protein folding and stability. missense variants are also capable of impairing protein structure, likely by affecting protein folding, protein-protein interaction, solubility or stability of protein molecules. the structural effect of mutational changes can be examined in silico on the basis of three-dimensional structure, multiple alignments of homologous sequences, and molecular dynamics [ ] [ ] [ ] . therefore, analysing sequence data in silico first and detecting a small number of predicted deleterious mutations for further experimental characterization is a key factor in today's genetic and genomic studies. in general, bioinformatics prediction methods obtain information on amino acid conservation through alignment with homologous and distantly related sequences. the most common criteria considered in many bioinformatics programs for predicting the functional effect of an amino acid substitution are amino acid sequence conservation across multiple species, physicochemical properties of the amino acids involved, database annotations, and potential protein structural changes [ , , ] . as mentioned above, resources for in vitro and in vivo functional analysis of novel variants are constrained in most clinical laboratories. therefore, identifying and reporting novel variants that are likely to be pathogenic often requires accurate prediction using computational tools. in a previous study, we examined the effect of foxc variants on protein structure and function by combining laboratory experiments and in silico techniques. our results showed that integration of different algorithms with in vitro functional characterization serves as a reliable means of prioritizing, and then functional analyzing, candidate foxc variants [ ] . unlike most previous studies that focused on using only polyphen and sift to predict the pathogenicity of missense mutations, here, we investigated the predictive value of sift, poly-phen and nine other prediction tools by comparing their predictions to in vitro functional data for pitx variants. the bioinformatics programs found to be most reliable were then used to predict the likely consequences of functionally-uncharacterized pitx variants. we also performed molecular modeling on all the pitx missense mutations located in the homeodomain and compared the results with the findings of protein stability algorithms to identify the most reliable tools in predicting the effect of missense mutations on pitx stability. to the best of our knowledge, this is the first study that incorporates the results of functional studies in conjunction with bioinformatics approaches for predicting the pathogenicity of mutations in pitx gene. lists of pitx missense variants were assembled from the previous literature and a search using the clinvar [ ] , human gene mutation database (hgmd) [ ], the genome aggregation database (gnomad), and the single nucleotide polymorphism database (dbsnp). this study found pitx missense variants; of which were described in the literature as being associated with ars or coronary artery disease (cad), while the remaining variants, were considered as benign variants (fig ) . eighteen of the variants were classified as pathogenic based on functional studies utilizing site-directed mutagenesis, expression studies, and other functional analysis ( table ) . thirteen of variants were described as associated with ars and cad in the absence of functional analyses on pitx structure or function. sixteen snps, with population allele frequencies > . were identified from the gnomad and the clin-var. based upon the allele frequency (approximately -fold greater than the disease frequency of ars) these have been considered benign polymorphisms. nucleotide numbering of the mutations herein indicates cdna numbering with + as the a of the atg translation initiation codon in the ncbi reference sequence nm_ . , while the amino positions are based on the corresponding ncbi reference sequence np_ . . this study is a retrospective case report that does not require ethics committee approval at our institution. all patients' mutations and phenotypes were obtained from previously published studies. these programs were used to analyse functionally characterised pitx missense variants plus additional, functionally uncharacterized pitx missense variants. sift program provides functional predictions for coding variants, based on the degree of conservation of amino acid residues in sequence alignments derived from closely related sequences, collected by psi-blast algorithm [ ]. the polyphen- (polymorphism phenotyping- ) server predicts possible effect of an amino acid change on the structure and function of a protein using several sources of information such as straightforward physical and comparative considerations [ ] . panther-psep is a new application that analyses the length of time a given amino acid has been conserved in the lineage leading to the protein of interest. there is a direct association between the conservation time and the likelihood of functional impact [ ] . mutpred is a free web-based application that utilizes a random forest algorithm with data based upon the probabilities of loss or gain of properties relating to many protein structures and dynamics, predicted functional properties, and amino acid sequence and evolutionary information [ ] . mutationtaster is a tool that combines information derived from various biomedical databases and uses established analysis programs. unlike sift or poly-phen- which work on dna level, mutationtaster processes substitutions of single amino table for more information on the prediction tools used in this study. the nmr structure of the homeodomain of pitx complexed with a taatcc dna binding site (pdb: lkx) were analyzed by the swiss-model server (http://www.expasy.org/spdbv/ ; provided in the public domain by the swiss institute of bioinformatics, geneva, switzerland). model structures of wild-type and mutants were created in swiss-pdb viewer and investigated using the anolea server (http://melolab.org/anolea). for structure predictions of pitx , sequence in fasta format was obtained from ncbi database (np_ . ). eight different protein stability programs (duet, sdm, mcsm i-mutant . , mupro, iptree-stab, cupsat, and istable) were used to predict the effects of missense mutations on the stability of pitx protein. duet is a web server that uses integrated computational approach to predict effect of missense mutations on protein stability [ ] . duet calculation is based on complementary data regarding the mutation including secondary structure [ ] and a pharmacophore vector [ ] . sdm, a computational method, has been demonstrated as the most appropriate method to use along with many other programs. sdm assesses the amino acid substitution occurring at specific structural environment that are tolerated within the family of homologous proteins of defined three dimensional structures and change them into substitution probability tables [ ] . mcsm relies on graph-based signature concept and predicts not only the effect of single-point mutations on protein stability, but also protein-protein and protein-nucleic acid binding [ ] . i-mutant . is a neural-network-based web server that predicts automatically protein stability changes upon single point protein mutations based on either protein sequence or protein structure. i-mutant . can predict the severity effect of a mutation on the stability of the folded protein [ ] . mupro is a set of machine learning programs that accurately calculates protein stability alterations based on primary sequence information particularly where the tertiary structure is unrevealed, overcoming a major restriction of previous methods which are based on the tertiary structure [ ] . iptree-stab is a web service and mainly provides two function modules of services including discriminating the stability of a protein upon single amino acid substitutions and predicting their numerical stability values [ ] . cupsat uses protein environment specific mean force potentials (through solvent accessibility and secondary structure specificity) to analyse and predict protein stability changes upon point mutations [ ] . istable, a combined predictor, was designed by using sequence information and prediction data from various element predictors. istable is available with two different input types: structural and sequential [ ] . please see table for more information on the stability predictors used in this study. previous analyses of missense variations in different human diseases predicted that the stability margin without any immediate effect on protein fitness is - kcal mol - [ ] [ ] [ ] . mutations that reduce the protein stability by > kcal mol - contribute to severe disease phenotypes [ , ] . therefore, in this study, all variations were classified as predicted to be neutral (- . < ΔΔg < . ), stabilizing (ΔΔg > . ) or destabilizing (ΔΔg < - . ). the protein sequence and/or protein structure with mutational position and amino acid residue of previously functionally characterized pathogenic pitx missense variants, plus snps with a population frequency of higher than . % (thus considered benign polymorphisms), were used to test the predictive value of eleven common bioinformatics prediction programs; sift, polyphen- , panther-psep, mutpred, mutationtaster, provean, pmut, fathmm, nssnpanalyzer, align gv-gd, and revel (table and table ). to evaluate the performances of the programs, seven measures (sensitivity, specificity, accuracy, precision, positive predictive value (ppv), negative predictive value (npv), and matthews correlation coefficient (mcc)) were calculated by comparing the results of all programs with previously generated functional data. for pitx , mutpred, provean, and pmut were the most reliable of the bioinformatics tools in predicting the pathogenicity effects of all functionally characterized missense variants in pitx , with sensitivity and specificity of > % (fig ) . then, revel tool showed high sensitivity and specificity, . % and . %, respectively. analysis of the sensitivity and specificity sift showed that this program had good sensitivity ( . %) but low specificity ( . %). although polyphen- , mutationtaster, panther-psep, fathmm, and align gv-gd exhibited over % sensitivity, they were unable to identify the benign polymorphisms, showing specificity of . %, . %, . %, . %, and . %, respectively. the predictive value of nssnpanalayzer was similar to that of sift program, with sensitivity and specificity of . % and . %, respectively. the most reliable programs found in this study's analyses (mutpred, provean, and pmut) were then used to predict the likely pathogenicity of pitx missense variants for which functional testing has not been performed (table ) . interestingly, the a v variant unanimously was predicted as benign by all three programs. the remaining pitx variants were predicted to be disease-associated mutations by all programs. molecular models for the homeodomain of wild-type and variant-containing pitx proteins were designed using threading algorithms to assess impairment of pitx structure by missense variants. three functionally characterised variants, n d, l v, and n t, were excluded from these molecular modeling analyses since they are not located in the homeodomain, which is the only portion of pitx with a known structure. wild-type amino acids were changed to variant residues to determine putative structural effects of the remaining functionally analysed pitx variants through anolea mean force potential calculations. the molecular modeling identified three mutations as high-risk (l q, v l, and r p) to change the structure of pitx , particularly in the h , h , and h subdomains (fig ) . the r p variant was predicted to grossly disrupt the non-local amino acid side chain contacts. similar, although less profound, effects were predicted when l and v were altered to glutamine and leucine, respectively. in contrast, the remaining twelve amino acid variants showed no predicted substantially altered pairwise interactions, indicating that these missense variants are predicted to have minor or no effects on pitx 's structure (s fig). molecular modeling was also performed on the nine functionally uncharacterised pitx missense mutations located in the homeodomain. four mutations (f l, v f, w c, w s) were predicted to change the structure of pitx (fig ) , while, the remaining five variants (r h, p l, p r, r c, and r p) were predicted to have minor or no impact on pitx 's structure (s fig). to assess the performance of eight different stability predictor programs (duet, sdm, mcsm, i-mutant . , mupro, iptree-stab, cupsat, and istable) in predicting the effect of missense mutations on pitx protein stability, the change in protein stability (ΔΔg) were computed for all pitx homeodomain variants ( functionally characterised and functionally uncharacterised mutations) (table ) . of these eight programs, cupsat was the most consistent with the results of our molecular modeling, by identifying of destabilizing mutations that were also predicted to be destabilizing by molecular modeling (v l, v f, w s, w c, and r p). computational analysis of pitx mutations sdm also showed high consistency with the results of our molecular modeling, by detecting of destabilizing mutations that were also predicted to be destabilizing by molecular modeling (l q, r p, w s, and w c). duet, mcsm, and i-mutant . identified and iptree-stab detected of destabilizing mutations detected by molecular modeling. mupro and istable were unable to identify any of the destabilizing mutations predicted by molecular modeling. although in silico programs are not a substitute for wet-lab experiments, they can provide a supportive role in the experimental validation of disease-associated alleles and can help further diagnostic strategies by prioritizing the most likely pathogenic novel variants. while many tools are available for assessing the functional significance of variants, determining the reliability of prediction results is challenging. in this context, the current study investigated the combination of experimental findings, molecular modeling, in silico mutation prediction programs, and stability prediction software to assess the pathogenicity of pitx missense variants. in silico methods that correctly identify deleterious variants do not always inevitably work well for benign predictions. the methods determined by this study to be preferred for analyses of pitx variants were those best able to distinguish both pathogenic and benign variants, thus yielding the highest accuracy. our results showed that mutpred, provean, and pmut tools were the most accurate in predicting pathogenicity of pitx missense variants (fig ) . the sensitivity and specificity of these three tools in recognizing pitx disease-causing variants were over %, indicating the strong performance of these programs in identifying as pathogenic only pitx variants with significant functional defects. after these three tools, revel showed highest sensitivity and specificity, . % and . %, respectively. sift showed good sensitivity ( . %) but low specificity ( . %). polyphen- , mutationtaster and panther-psep, fathmm, and align gv-gd demonstrated > % sensitivity, but, they were unable to identify the benign polymorphisms, showing the specificity of . %, . %, . %, . %, and . %, respectively. the predictive value of nssnpanalayzer was similar to that of sift program, with sensitivity and specificity of . % and . %, respectively. our results showed, therefore, that computational analysis of pitx mutations mutpred, provean, and pmut can be utilized with high confidence to test whether or not a pitx missense variant is likely to be deleterious. interestingly, mutpred was the only in silico program that ranked in the top three programs in identifying both pathogenic and benign pitx and foxc variants [ ] . a likely explanation for mutpred's high ranking is that it evaluates the most factors in making assessments. however, since the number of variants available for testing in this study were small, a larger dataset would confirm that our results are reproducible and generally applicable. the three programs that were found to be the most reliable (mutpred, provean, and pmut) were then used to assess the likely pathogenicity of thirteen pitx missense variants for which functional analyses have not been performed, but which have been associated with ars or cad (table ). our results showed that mutpred, provean, and pmut predicted as pathogenetic / of the variants. the a v variant was scored as non-pathogenetic/benign by all three programs. while it is possible that a v is an example of a false negative for all three programs, it is likely that this variant is instead benign. functional testing of the a v variant is needed to determine which of these possibilities is accurate. various intramolecular interactions are involve in stabilizing and folded state of protein, including hydrophobic, electrostatic, and hydrogen-bonding [ ] [ ] [ ] [ ] . the stability state of a protein is key factor in its proper functionality. in fact, up to % of mendelian disease-causing mutations in protein coding regions are predicted to be caused by altering protein stability [ ] . in recent years, due to the availability of high-throughput array-based genotyping methods [ ] and next generation sequencing platforms [ , ] , a large number of snps has been reported. however, the association of missense variants with protein stability has often been difficult to predict. fortunately, recent advances in computational prediction of protein stability offers potential insight into this question. we used two parallel prediction methods to investigate the possible effects on pitx protein structure and stability of missense variants. knowledge of a protein's d structure can be used to predict the functionality of protein and the possible impact of variants on protein conformation and structure. we thus first used molecular modelling analyses to assess and compared the total energy difference between native and mutated modeled structure of pitx proteins. the results predicted that while most pitx variants did not dramatically affect the protein tertiary structure, seven variants (l q, f l, v f, v l, w c, w s, and r p) altered the total energy level in comparison with the native structure, suggesting that these amino acid substitutions changed the structure of the pitx protein. molecular modeling of the pitx homeodomain predicted that these variants impair the required energy to maintain the proper folding of helix - and cause global destabilization of the structure of pitx . these seven amino acids are either invariant (e.g., w ) or highly conserved in the approximately homeobox proteins analyzed, consistent with a pivotal role of these residues in the homeodomain [ ] [ ] [ ] . these seven amino acids are tightly packed hydrophobic amino acids responsible for holding helices of the pitx homeodomain together, supporting our molecular modeling predicting that mutations of these amino acids disrupt pitx structure. for f l, v f, and v l, the native wild-type residues and the introduced mutant residues differ in size, probably causing loss of hydrophobic interactions in the core of the protein, particularly involving helix - . for l q, w c, w s, and r p, the wild-type residues and the mutant residues are different in both size and charge, likely disturb the local structure of protein thereby altering protein structure and function. residues v , w , and r are located within the third helix which is specifically responsible for binding with the major groove of the dna [ ] . thus, the prediction that these mutations impair the capacity of this helix to interact with dna is consistent with this knowledge and with previous functional characterizations that showed reduced dna-binding capacities of the v l and r p mutant pitx proteins [ , ] . consistent with bioinformatics predictions of deleterious affects of mutation of w , mutations of the neighboring amino acids (r w and k e) have been shown to decrease the ability of the mutant proteins to interact with dna [ , ] . residues l and f are located in helix of the homeodomain, responsible for contacting with the minor groove of the dna. molecular modeling of l q is consistent with the computational analysis of pitx mutations suggestion that mutations in these highly-conserved residues in helix of the homeodomain might disturb the dna-protein binding affinity. our prediction is supported by the fact that changing the leucine to a glutamine (l q) disrupts dna-protein complex, indicating the necessity of leucine at position for pitx binding ability [ ] . thus, consistent with our recent studies on foxc protein [ ] , the results of molecular modeling of pitx are strongly consistent with the functional characterization of pitx missense variants. the results from our molecular modeling analysis were also compared to the predictions of eight stability predictor methods (duet, sdm, mcsm, i-mutant . , mupro, iptree-stab, cupsat, and istable). based on our analyses, it appears that cupsat performs the best of the seven methods evaluated here in predicting the effect of missense mutations on pitx protein stability, with sdm, duet, mcsm, and i-mutant . , performing weaker, consistent with the results of previous studies [ , ] . our results indicate that further studies are required to improve ΔΔg predictions, especially for buried amino acids. in this study, for the first time, we evaluated the impact of missense variants on pitx stability, structure and function by integrating stability prediction algorithms, bioinformatics mutation prediction tools, and molecular modeling. our results showed that mutpred, provean, pmut, molecular modeling, and cupsat are reliable methods to assess pitx family missense variants in the absence of laboratory experiments. however, for our analyses, it must be noted that we used sixteen snps as non-pathogenetic control variants to investigate the performance of prediction programs. although we considered snps with a population frequency of > . % as benign, we cannot formally exclude that these snps might have un-documented pathogenic effects on pitx . in addition, while the prediction methods used in this study are not gene-specific, generalization of the performance of these programs to other human genes may be inappropriate without additional study. when assessing the pathogenicity of missense variants, it is necessary to be cautious on depending merely on in silico programs without wetlab experiments. according to standards and guidelines for the interpretation of sequence variants: a joint consensus recommendation of the american college of medical genetics and genomics (acmg) and the association for molecular pathology, in silico predictions only serve as one supporting factor, whereas functional tests are frequently needed to assess the pathogenicity of missense variants. in particular, as per clinical guidelines for the interpretation of single substitution variants, the output of computational tools should be interpreted in the light of functional studies results, population frequency data and segregation in affected families. evidence that rieger syndrome maps to q or q expanding the spectrum of foxc and pitx mutations and copy number changes in patients with anterior segment malformations. investigative ophthalmology & visual science a novel mutation in the pitx gene in a family with axenfeld-rieger syndrome novel mutations of foxc and pitx in patients with axenfeld-rieger malformations phenotypic variability and asymmetry of rieger syndrome associated with pitx mutations. investigative ophthalmology & visual science single nucleotide polymorphisms in clinical genetic testing: the characterization of the clinical significance of genetic variants and their application in clinical research for brca . mutation research/fundamental and molecular mechanisms of mutagenesis the role of functional data in interpreting the effects of genetic variation determining the pathogenicity of genetic variants associated with cardiac channelopathies. scientific reports structure based thermostability prediction models for protein single point mutations with machine learning tools using sift and polyphen to predict loss-of-function and gain-offunction mutations. genetic testing and molecular biomarkers stability effects of mutations and protein evolvability prediction of protein stability upon point mutations predicting the functional effect of amino acid substitutions and indels molecular mechanisms of disease-causing missense mutations integrating population variation and protein structural analysis to improve clinical interpretation of missense variation: application to the wd domain. human molecular genetics prediction of the phenotypic effects of non-synonymous single nucleotide polymorphisms using structural and evolutionary information iterative sequence/secondary structure search for protein homologs: comparison with amino acid sequence alignments and application to fold recognition in genome databases identification of related proteins with weak sequence identity using secondary structure information comparison of bioinformatics prediction, molecular modeling, and functional analyses of foxc mutations in patients with axenfeld-rieger syndrome clinvar: public archive of relationships among sequence variation and human phenotype mcsm: predicting the effects of mutations in proteins using graph-based signatures sdm-a server for predicting effects of mutations on protein stability and malfunction sdm: a server for predicting effects of mutations on protein stability mcsm: predicting the effects of mutations in proteins using graph-based signatures predicting the insurgence of human genetic diseases associated to single point protein mutations with support vector machines and evolutionary information prediction of protein stability changes for single-site mutations using support vector machines iptree-stab: interpretable decision tree based method for predicting protein stability changes upon mutations cupsat: prediction of protein stability upon point mutations istable: off-the-shelf predictor integration for predicting protein stability changes iptree-stab: interpretable decision tree based method for predicting protein stability changes upon mutations stability effects of mutations and protein evolvability correlation of levels of folded recombinant p in escherichia coli with thermodynamic stability in vitro investigating the effects of mutations on protein aggregation in the cell using model proteins to quantify the effects of pathogenic mutations in ig-like proteins systematically perturbed folding patterns of amyotrophic lateral sclerosis (als)-associated sod mutants de novo mutations in histone-modifying genes in congenital heart disease prevalence and spectrum of pitx c mutations associated with familial atrial fibrillation identification of four new pitx gene mutations in patients with axenfeld-rieger syndrome. molecular vision expanding the spectrum of foxc and pitx mutations and copy number changes in patients with anterior segment malformations. investigative ophthalmology & visual science mutation in pitx is associated with ring dermoid of the cornea novel mutations of foxc and pitx in patients with axenfeld-rieger malformations journal of oral pathology & medicine: official publication of the international association of oral pathologists and the american academy of oral pathology dental and craniofacial anomalies associated with axenfeld-rieger syndrome with pitx mutation. case reports in medicine a novel pitx mutation causing iris hypoplasia. human genome variation pitx and foxc spectrum of mutations in ocular syndromes a novel pitx mutation in a chinese family with axenfeld-rieger syndrome. molecular vision. emory university a novel pitx mutation and a polymorphism in a -generation family with axenfeld-rieger anomaly and coexisting fuchs' endothelial dystrophy mutation analysis of the genes associated with anterior segment dysgenesis, microcornea and microphthalmia in patients with glaucoma influence of hydrophobic and electrostatic residues on sars-coronavirus s protein stability: insights into mechanisms of general viral fusion and inhibitor design influence of hydrophobic and electrostatic residues on sars-coronavirus s protein stability: insights into mechanisms of general viral fusion and inhibitor design protein science: a publication of the protein society mechanisms of protein stabilization and prevention of protein aggregation by glycerol snps, protein structure, and disease whole-genome genotyping. methods in enzymology the complete genome of an individual by massively parallel dna sequencing sequencing technologies-the next generation the structure of the antennapedia homeodomain determined by nmr spectroscopy in solution: comparison with prokaryotic repressors understanding the homeodomain solution structure of the k class homeodomain pitx bound to dna and implications for mutations that cause rieger syndrome functional analyses of two newly identified pitx mutants reveal a novel molecular mechanism for axenfeld-rieger syndrome identification of a dominant negative homeodomain mutation in rieger syndrome the molecular basis of rieger syndrome. analysis of pitx homeodomain protein activities comparison of bioinformatics prediction, molecular modeling, and functional analyses of foxc mutations in patients with axenfeld-rieger syndrome screening of mutations affecting protein stability and dynamics of fgfr -a simulation analysis performance of protein stability predictors the authors would like to thank the members of the walter laboratory for critical reading of the manuscript and for helpful comments. key: cord- -ejv xln authors: crouch, erika c title: surfactant protein-d and pulmonary host defense date: - - journal: respir res doi: . /rr sha: doc_id: cord_uid: ejv xln surfactant protein-d (sp-d) participates in the innate response to inhaled microorganisms and organic antigens, and contributes to immune and inflammatory regulation within the lung. sp-d is synthesized and secreted by alveolar and bronchiolar epithelial cells, but is also expressed by epithelial cells lining various exocrine ducts and the mucosa of the gastrointestinal and genitourinary tracts. sp-d, a collagenous calcium-dependent lectin (or collectin), binds to surface glycoconjugates expressed by a wide variety of microorganisms, and to oligosaccharides associated with the surface of various complex organic antigens. sp-d also specifically interacts with glycoconjugates and other molecules expressed on the surface of macrophages, neutrophils, and lymphocytes. in addition, sp-d binds to specific surfactant-associated lipids and can influence the organization of lipid mixtures containing phosphatidylinositol in vitro. consistent with these diverse in vitro activities is the observation that sp-d-deficient transgenic mice show abnormal accumulations of surfactant lipids, and respond abnormally to challenge with respiratory viruses and bacterial lipopolysaccharides. the phenotype of macrophages isolated from the lungs of sp-d-deficient mice is altered, and there is circumstantial evidence that abnormal oxidant metabolism and/or increased metalloproteinase expression contributes to the development of emphysema. the expression of sp-d is increased in response to many forms of lung injury, and deficient accumulation of appropriately oligomerized sp-d might contribute to the pathogenesis of a variety of human lung diseases. surfactant protein-d (sp-d) is a member of the collagenous subfamily of calcium-dependent lectins (collectins) that includes pulmonary surfactant protein a (sp-a) and the serum mannose-binding lectin [ ] [ ] [ ] . collectins inter-act with a wide variety of microorganisms, lipids, and organic particulate antigens, and can modulate the function of immune effector cells and their responses to these ligands. this article reviews what is currently known about the sites of production, structure, function, and regulated expression of sp-d. emphasis will be placed on functional attributes, known ligand interactions, and structure-function relationships believed to be important for host defense. for additional information on sp-a and other members of the collectin family, the reader is referred to other recent reviews [ ] [ ] [ ] . sp-d is synthesized and secreted into the airspaces of the lung by the respiratory epithelium [ ] . at the alveolar level, sp-d is constitutively synthesized and secreted by alveolar type ii cells. more proximally in the lung, sp-d is secreted by a subset of bronchiolar epithelial cells, the non-ciliated clara cells. because sp-d is stored within the secretory granules of clara cells [ , ] , it seems likely that sp-d is subject to regulated secretion via granule exocy-tosis at this level of the respiratory tract. in some species, sp-d is also synthesized by epithelial cells and/or submucosal glands associated with the bronchi and trachea [ ] . although many alveolar macrophages show strong cytoplasmic and/or membrane labeling with antibody against sp-d, they do not contain detectable sp-d message. the lung seems to be the major site of sp-d production. however, there is increasing evidence for extrapulmonary sites of expression as assessed with monoclonal or affinity-purified antibodies, reverse-transcriptase-mediated pcr (rt-pcr), and/or hybridization assays of tissues from humans and other large mammals [ • , - ] (summarized in table ). it is difficult to entirely exclude crossreactions or amplification of related sequences; however, localization to many of these sites in human tissues was confirmed by using monoclonal antibodies in combination with rt-pcr with sequencing of the amplified products [ • ]. non-pulmonary expression seems to be largely restricted to cells lining epithelial surfaces or ducts and certain glandular epithelial cells that are in direct or indirect continuity with the environment. notable exceptions to this generalization might include heart, brain, pancreatic islets, and testicular leydig cells. sp-d has also been identified in amnionic epithelial cells by immunohistochemistry [ ] ; however, it is unclear whether this is synthesized locally or derived from the lung by way of the amniotic fluid. interestingly, in many of these sites sp-d microscopically co-localizes with gp- , an sp-d binding protein and putative sp-d receptor [ • ]. sites of extrapulmonary expression have also been described in small mammals. in the rat, sp-d message was identified in rna extracted from skin and blood vessel [ ] , and both protein and message were identified in gastric mucosa [ ] and mesentery [ ] . using rt-pcr, sp-d message has also been identified in mouse stomach, heart, and kidney [ ] . sp-d ( kda, reduced) consists of at least four discrete structural domains: a short, n-terminal domain; a relatively long collagenous domain, a short amphipathic connecting peptide or coiled-coil neck domain, and a c-terminal, ctype lectin carbohydrate recognition domain (crd). each molecule consists of trimeric subunits ( × kda), which associate at their n-termini (fig. ) . although most preparations of sp-d contain a predominance of dodecamers (that is, four trimeric subunits), the proportions of various oligomers vary between species. for example, rat lavage and recombinant rat sp-d are almost exclusively assembled as dodecamers (four trimers), whereas recombinant human sp-d is secreted as trimers, dodecamers and higher-order multimers [ ] . sp-d isolated from the lavage of some patients with alveolar proteinosis consists predominantly of higher-order multimers, which can contain up to (or more) trimeric subunits (fig. ). recent crystallographic and mutagenesis studies suggest that the structural determinants of saccharide binding are similar to those originally described for mannose-binding lectin [ , , • , • ]. at least two bound calcium ions and two intrachain disulfide crosslinks stabilize the required tertiary structure, and glu and asn within the crd participate in glucose/mannose type recognition. interactions with at least one glycolipid ligand, phosphatidylinositol (pi), require the participation of the c-terminal end of the protein [ , ] . a trimeric cluster of crds is necessary for high-affinity binding to carbohydrate ligands [ • , ]. the crystal structure of human sp-d suggests that the spatial distribution of crds within a trimeric subunit permits simultaneous and cooperative interactions with two or three glycoconjugates displayed on the surface of a particulate ligand [ • ]. furthermore, solid-phase binding studies have shown that monomeric crds have an approximately -fold lower binding affinity for multivalent ligands than trimeric crds. crystallographic studies of human sp-d further suggest that the spatial organization of crds within a trimer is stabilized by interactions of the c-terminal sequence with the trimeric neck domain [ • , ]. interestingly, the three crds show a deviation from threefold asymmetry, suggesting some flexibility of the crds in relation to the neck. thus, the dependence of the binding of pi on the c-terminal sequence could reflect conformational effects, rather than the direct participation of this sequence in ligand interactions. the collagen domain length of sp-d is highly conserved and lacks interruptions in the repeating gly-x-y sequence (in which x and y are different amino acids). as for other collagenous proteins, this domain is enriched in imino acids and contains hydroxyproline. unlike sp-a, sp-d also contains hydroxylysine. although the collagen domain of rat, human, bovine, and mouse sp-d lacks cysteine residues, cdna sequencing has identified a codon for cysteine within the collagen domain of pig sp-d [ • ]; this suggests the possibility of alternative patterns of chain association and oligomeric assembly for pig sp-d. the first translated exon of sp-d contains a highly conserved and unusually hydrophilic gly-x-y sequence that shows little homology with the remainder of the collagen sequence. the functional significance of this region is unknown. however, it has been suggested that this region contributes to oligomer assembly or mediates interactions with cellular receptors. the collagen domain determines the maximal spatial separation of trimeric, c-terminal lectin domains within sp-d molecules, but might also contribute to normal oligomeric assembly and secretion. for example, deletion of the entire collagen domain of rat sp-d results in the secretion of trimers rather than dodecamers [ ] . in addition, , -dipyridyl, an inhibitor of prolyl hydroxylation that interferes with the formation of a stable collagen helix, causes the intracellular accumulation of kda monomers and dimers [ ] . in any case, the complete conservation of the number of gly-x-y triplets suggests that the spatial separation of trimeric crds is critical for normal sp-d function. the n-terminal peptide of the mature protein contains two conserved cysteine residues at positions and . these residues participate in interchain disulfide crosslinks that stabilize the trimer, as well as the n-terminal association of four or more trimeric subunits. stable oligomerization of trimeric subunits permits cooperative or bridging interactions between spatially separated binding sites on the same surface or on different particles. the process of forming interchain disulfide bonds is complex, and appropriate crosslinking of the n-terminal domains might be rate limiting for secretion [ ] . subcellular fractionation studies suggest that interchain bonds form initially between the three chains of a trimeric subunit. subsequent rearrangements within the rough endoplasmic reticulum might allow the covalent crosslinking of a single chain from one subunit and two crosslinked chains of another, with the associated elimination of free thiol groups. mutant proteins that contain unpaired n-terminal cysteine residues are not secreted. however, it is unclear whether this results from abnormalities in disulfide bonding itself, or the failure to stabilize the required n-terminal conformation. the collagen domain contains hydroxylysyl-derived glycosides and a single n-linked oligosaccharide. in most species (human, rat, mouse, and cow) the site of n-linked glycosylation is located near the n-terminal end of the collagenous domain. recently, it was shown that pig sp-d has an additional potential site of n-linked glycosylation within the crd [ • ]. although rat and human lung lavage sp-d seem to be sialylated, as suggested by charge heterogeneity and cleavage with highly purified neuraminidase, preparations of human amniotic fluid and bovine lavage sp-d recovered from amniotic fluid showed predominantly complex type biantennary structures and no sialic acid [ ] . a variant form of sp-d ( kda) has been identified in lavage from a subset of human lavage samples; this protein shows o-linked glycosylation of threonyl residues within the n-terminal peptide domain [ • ]. at present, the functional significance of these sugars is not known. the presence of o-linked glycosylation within the n-terminal domain might be predicted to interfere with normal dodecamer assembly. in this regard, the o-glycosylated kda form of human sp-d is recovered as trimeric subunits or smaller species. as for many glycoproteins, the functional role of the attached carbohydrate is unknown. mutational analysis has shown that the n-linked sugar on rat sp-d is not required for secretion, for dodecamer formation, or for interactions with a variety of microorganisms [ , ]. consistent with its designation as a 'mannose-type' c-type lectin, sp-d preferentially binds to simple and complex saccharides containing mannose, glucose, or inositol [ , ] . sp-d also interacts with specific constituents of pulmonary surfactant including pi [ - ] and glucosylceramide [ ] . binding to glucosylceramide involves interactions of the carbohydrate-binding sequences of the crd with the glucosyl moiety. however, the interaction of sp-d with pi involves interactions with the lipid, as well as crd-dependent interactions with the inositol moiety [ , ] . microorganisms are surfaced with a diverse and complex array of polysaccharides and glycoconjugates, and most classes of microorganism contain one or more sugars recognized by sp-d. however, the outcome of this interaction depends on the specific organism and can be modified by the conditions of microbial growth. the potential consequences of this interaction include the following: varying degrees of lectin-dependent aggregation (namely, microbial agglutination), enhanced binding of microorganisms or microbial aggregates to their 'receptors' on host cells, phagocyte activation, and opsonic enhancement of phagocytosis and killing, potentially involving one or more cellular receptors for sp-d. binding to organisms in suspension is often -but not always -accompanied by some degree of aggregation. sp-d binds to purified lipopolysaccharide (lps) isolated from a variety of gram-negative organisms [ , ] . in addition, lps is the major cell wall component that is labeled on lectin blotting of outer membranes isolated from escherichia coli [ ] . the latter interactions involve the recognition of the core oligosaccharide domain, which contains glucose and heptose [ ] . sp-d interacts preferentially with purified lps molecules characterized by short or absent o-antigens and preferentially agglutinates bacterial strains expressing a predominance of rough (o-antigen-deficient) lps [ , ] . although the core oligosaccharide domain of lps constitutes the major ligand for sp-d on at least some gram-negative bacteria, the mechanism of interaction with this group of microorganisms is probably heterogeneous. sp-d binds to some smooth, unencapsulated strains of gram-negative bacteria by immunofluorescence. the mechanism is uncertain; the quantity or quality of binding differs from that observed for rough strains and does not necessarily result in agglutination. lps molecules on the surfaces of bacteria show heterogeneity in the extent of maturation, so it is possible that this interaction is mediated by a subpopulation of lps with deficient o-antigens and that the density of binding sites is too low for high-affinity binding. the recognition of the surface glycoconjugates on gramnegative bacteria by sp-d depends not only on the expression of lectin-specific residues by a given strain or species, but also on the accessibility of these residues [ , ] . for example, sp-d binds inefficiently to the core region of lps of encapsulated klebsiella, but efficiently agglutinates the corresponding unencapsulated phase variants. interactions of sp-d with the core oligosaccharides of gram-negative organisms are also influenced by the number of repeating saccharide units associated with the terminal o-antigen of the lps [ , ]. other potential ligands include the o-antigen domain of lps, certain capsular polysaccharides, and membraneassociated glycoproteins. in this regard, sp-d can bind to di-mannose containing o-antigens expressed by a subset of klebsiella serotypes (i ofek, h sahly and ec crouch, unpublished data). although other c-type lectins, specifically sp-a and the mannose receptor, can interact with specific capsular polysaccharides [ ], a specific interaction of sp-d with capsular glycoconjugates or exopolysaccharides has not been described. the mechanism of interaction with gram-positive organisms has not been elucidated. lipoteichoic acids, which are the major glycolipids associated with the gram-positive cell wall, do not detectably compete with lps for binding to sp-d (i ofek, a mesika, m kalina, y keisari, d chang, d mcgregor and ec crouch, manuscript submitted). in preliminary studies we observed that binding was competed only partly with maltose and/or edta, raising the possibility that binding might be more complex than for some gram-negative organisms. . however, similar effects were observed when the neutrophils were preincubated with sp-d, and there was only a slight enhancement of uptake when bacteria were incubated with human sp-d and washed before their addition to neutrophils. notably, the extent of binding and internalization was dependent on the extent of multimerization, with human sp-d multimers demonstrating the highest potency. differences in cell type, the extent of sp-d multimerization, or differences in size or organization of bacterial aggregates could account for some of the apparent inconsistencies. although lps mediates the binding of sp-d to at least some gram-negative bacteria, sp-d can also bind to spe- in the latter study the authors suggested that fungal aggregation inhibits phagocytosis. interestingly, sp-d binding directly inhibited fungal growth and decreased the outgrowth of pseudohyphae, the invasive form of the fungus, in the absence of phagocytic cells [ ] . it is possible that these effects are also secondary to agglutination, possibly as result of nutrient deprivation. purified rat and human sp-d inhibit the infectivity and hemagglutination activity of influenza sp-d can interact with host cells, both directly and indirectly. as indicated above, sp-d can enhance the phagocytosis and killing of certain microorganisms and enhance the oxidant response to microbial binding. however, at present there is only one study that suggests that the enhancement of phagocytosis by sp-d might involve the participation of an opsonic receptor. furthermore, the enhanced uptake of iav seems to be mediated by viral aggregation, with enhanced interactions of the virus with its natural receptors on the host cell. in any case, sp-d can interact directly with host cells, and in some cases can influence their behavior. sp-d is chemotactic and haptotactic for neutrophils and certain mononuclear phagocytes [ • , - ] and can elicit directional actin polymerization in alveolar macrophages [ ] . in this regard, sp-d is considerably more potent than sp-a. early studies with natural proteins isolated from silicotic animals reported directed effects on the oxidant metabolism of isolated alveolar macrophages [ ] . however, such effects can probably be attributed to endotoxin contamination and/or aggregation. purified dodecamers do not significantly increase the production of nitric oxide [ ] or of proinflammatory cytokines such as tumor necrosis factor-α (y kesari, h wang, a mesika, e crouch and i ofek, unpublished data). interestingly, purified sp-d has been reported to increase the production of several metalloproteinases in the absence of a significant effect on proinflammatory cytokine production [ ] . despite the ability of sp-d to modulate a variety of cellular functions, little is currently known about potential cellular receptors for this protein. compartments [ ] , but it is unclear whether the uptake is receptor dependent and whether sp-d is being internalized in association with specific ligands. there are at least two classes of binding to host cells: crd-dependent and crd-independent. some studies have demonstrated crd-dependent binding to phagocytes that can be inhibited with edta or competing saccharides, both in vitro and in vivo. as indicated above, the ability of sp-d to elicit the chemotaxis of neutrophilic and monocytic cells depends on the lectin activity of sp-d [ ] . in addition, kuan and coworkers reported that extracting formaldehyde-fixed alveolar macrophages with detergents largely eliminates the binding of purified sp-d, suggesting a membrane-associated ligand or glycolipid receptor [ ] . dong and wright have extended these findings and suggest that pi can contribute to sp-d binding by alveolar macrophages [ ] . it is of interest that sp-d can bind to recombinant scd through interactions with n-linked oligosaccharides [ • ]. given that the membrane-associated form of cd is widely expressed on host cells, it is possible that cd can serve as a binding site on macrophages and other cell types. the phagocytic uptake of certain bacteria by neutrophils is also inhibited by calcium chelation or competing sugars [ ]; however, this could result from the inhibition of microbial agglutination rather than lectin-dependent interactions with the phagocyte. wang et al suggested that sp-d can bind to lymphocytic cells by a lectin-dependent mechanism [ •• ] . in this regard, it is interesting to note that glucosylceramide, a ligand for sp-d in vitro, is one of the most abundant neutral glycolipids expressed by lymphoid cells. reid and co-workers were the first to present evidence for lectin-independent binding [ ] . these and other studies suggested that binding does not involve known c q or collectin receptors. the only putative receptor protein, gp- , is a widely expressed member of the scavenger receptor superfamily [ , • ]. it binds to the crd of sp-d in a calcium-dependent manner that does not require the lectin activity of sp-d. although the protein has been immunolocalized to alveolar macrophage membranes and distributes together with sp-d in many different human tissues [ • , ], it has not yet been shown to mediate the binding of sp-d to these cells or to participate in signal transduction events. the cdnas isolated from lung have not shown a membrane-spanning region [ ] , and the protein is abundant as a soluble component in bal. given that gp- is a highly multimerized protein that contains numerous potential ligand binding domains (fig. b) , it is possible that the protein cooperates with sp-d in the neu-tralization or clearance of certain ligands rather than specifically mediating the interactions of sp-d with host cells. wright and co-workers have demonstrated the binding of sp-d to isolated type ii pneumocytes. the mechanism seems distinct from the binding to macrophages [ • ]. the binding was dependent on concentration, time, and temperature and required calcium; it was not sensitive to protease treatment or to pi-phospholipase c. although the internalized sp-d was degraded or recycled to lamellar bodies, sp-d binding did not alter the uptake of surfactant lipids. sp-d has demonstrated comparatively few direct effects on the metabolism of host cells, at least in situations where self-aggregation and endotoxin contamination have been excluded. one possible explanation is that modulation of cellular function requires the prior interaction of sp-d with a ligand. this would have numerous potential physiological advantages, because the presence of 'active' protein might be restricted to sites of microbial or antigenic deposition. the binding of complex, multivalent, particulate antigens to two or more crds could markedly alter the conformation of sp-d molecules, with respect to the spatial orientation of the arms in relation to the n-terminal crosslinking domain and/or with respect to the spatial orientation of the crds within a given trimeric subunit. thus, the 'charging' of sp-d with a particulate ligand could lead to local or distant conformational changes that expose 'cryptic' binding sites for cellular receptors. there is some preliminary evidence consistent with the notion that the interaction of sp-d with a ligand alters its capacity to activate host cells. table and discussed below. sp-d can be isolated in different multimeric forms from proteinosis lavage [ • ] and are produced by chinese hamster ovary k cells transfected with human sp-d cdna [ ] . as described previously, the effects of sp-d on the neutrophil response to influenza virus are highly dependent on the ability of sp-d to agglutinate the viral particles, and the agglutination activity is directly correlated with the extent of multimerization. trimers can bind to the virus but have little capacity to modulate neutrophil interactions. by contrast, highly multimerized proteins show greater activity than dodecamers [ ] . given these observations, factors that favor enhanced oligomerization or lead to the accumulation of trimeric subunits promote might influence sp-d function. for example, the liberation of active trimers by a hypothetical microbial protease could lead to the accumulation of molecules that might inhibit the aggregation-dependent activities of sp-d. in contrast, recombinant trimeric crds can stimulate chemotaxis [ ] and decrease viral infectivity [ • ]. although higher-order oligomers of sp-d can self-aggregate and precipitate in the presence of calcium in vitro, the functional consequences are not known. the lectin activity of sp-a is decreased after the nitric oxide-dependent nitration of tyrosine residues [ ] , and nitration decreases the ability of sp-a to enhance the adherence of pneumocystis to alveolar macrophages [ ] . however, similar findings have not yet been reported for sp-d. conditions of mildly acidic ph, as might be found in endocytic compartments, are predicted to disrupt the lectin-dependent activities of sp-d [ ]. proteolytic degradation remains an important possibility. however, sp-d is highly resistant to degradation by a wide variety of neutral proteases in vitro, and degradation products have not yet been shown to accumulate under pathological conditions in vivo. glucose concentrations at levels encountered in diabetes can interfere with sp-d's ability to interact with specific strains of iav or other microorganisms in vitro [ • ]. many microorganisms release cell wall polysaccharides or glycoconjugates, which might interfere with the binding of collectins to the same or other organisms. in this regard, sp-d recovered from rats after the instillation of lps into the airway shows decreased lectin activity, which is attributed to occupancy of the crd with lps [ • ]. it seems reasonable to speculate that some organisms might compete with other organisms for binding to sp-d. such a situation could conceivably predispose to secondary infections. lastly, the potential inhibitory effects of competing saccharide ligands presents important methodological considerations for experiments using carbohydrate-containing cell culture medium or buffers. non c-type lectins (such as ficolins) it is difficult to predict the functions of sp-d within the airspace. other lectins with overlapping specificity are also present. although the levels of mannose-binding lectin are probably low in the absence of increased vascular permeability, sp-a and the macrophage mannose receptor could conceivably interact with the same ligands in the distal airways and alveoli. such interactions could lead to antagonistic or cooperative effects. furthermore, we have little knowledge regarding the microanatomic distribution of these molecules in specific circumstances in vivo. although most sp-a is probably associated with the insoluble phase of the alveolar lining material, and the macrophage mannose receptor is membrane-associated, the distribution might be altered in the setting of lung injury. models of sp-d deficiency show no detectable anatomical or physiological abnormalities at birth. however, the animals gradually develop a patchy, subpleural alveolar lipidosis with associated type ii cell hypertrophy, the accumulation of enlarged and foamy macrophages, and an apparent expansion of peribronchial lymphoid tissue [ • , • ]. interestingly, the mice eventually develop distal-acinar emphysema and areas of subpleural fibrosis, which could reflect a continuing inflammatory reaction associated with abnormal oxidant metabolism and metalloproteinase activity [ • ]. by contrast, sp-a-deficient mice (-/-) show essentially normal respiratory function and surfactant lipid metabolism [ , ] but numerous apparent host defense abnormalities [ ] . the capacity of sp-d to bind to specific strains of influenza a in vitro is highly correlated with the capacity of the virus to proliferate in mice in vivo [ ] . specifically, strains with more oligosaccharide attachments on the ha are preferentially neutralized by sp-d in vitro and show decreased proliferation in mice. because the administration of mannan together with the virus increased the replication of iav in the lung, the involvement of a mannose-type, c-type lectin was implicated. sp-d-sensitive iav strains also replicate to higher titers in the lungs of diabetic mice than in nondiabetic controls [ • ]. replication of the virus is positively correlated with blood glucose level, and decreases in response to insulin treatment. significantly, blood glucose levels comparable to those measured in the diabetic mice were sufficient to inhibit the interaction of sp-d with these viral strains in vitro. pr- , a strain that does not interact with sp-d but does interact with sp-a, replicated to the same extent in diabetic and control mice. sp-d levels increase in association with certain infections. for example, sp-d levels, but not the levels of serum mannose-binding lectin, increase markedly after iav infection [ ] . impressive increases in sp-d have also been observed in murine models of pneumocystis carinii [ ] and p. aeruginosa infection [ ] . sp-d-deficient mice have not yet been extensively characterized with respect to host defense function. however, they show decreased viral clearance and enhanced inflammation after challenge with respiratory syncytial virus [ ] and iav (am levine, personal communication). in addition, they show increased inflammation, increased oxidant production, and decreased macrophage phagocytosis in response to intratracheally instilled group b streptococcus and haemophilus influenzae (am levine, personal communication). although the overexpression of wild-type sp-d in type ii pneumocytes with the sp-d-deficient mice can prevent the lipidosis and inflammatory changes [ ] , the ability of overexpressed wild-type sp-d or exogenous sp-d to ameliorate these abnormalities has not yet been described. the coexisting pulmonary abnormalities also complicate the interpretation of challenge models. for example, macrophage activation might enhance killing and offset any decrease that results more directly from sp-d deficiency. sp-d deficiency modifies the host response to instilled lps with decreased lung injury and inflammatory cell recruitment [ ]. molecules that can bind to potential antigens and deliver them to macrophages and other antigen-presenting cells might contribute to the development of acquired immunity. in this regard, a few published observations suggest possible roles in the development of humoral and/or cellular immunity in response to microorganisms or complex organic antigens. for example, sp-d can decrease interleukin- dependent t-lymphocyte proliferation [ • ]. interestingly, single-arm mutants were at least as potent as intact dodecamers in mediating this effect. sp-d also binds to oligosaccharides associated with dust mite allergen [ • ], and can inhibit the binding of specific ige to these allergens, possibly through direct, crd-dependent binding to lymphocytes [ • ]. thus, alterations in the level of sp-d (or the state of oligomerization) might influence the development of immunological responses and contribute to the pathogenesis of asthma and other hypersensitivity disorders. there are other potential interplays between humoral immunity and collectins with regard to antimicrobial host defense. for example, increased glycosylation of iav coat proteins, an adaptation that is believed to help the virus to evade antibody-mediated neutralization, is associated with increased reactivity with sp-d and other collectins [ ]. thus, the relative potential importance of antibody and collectin-mediated host defenses might be influenced by subtle variations in the structure of the microbial surface. there is little recent information on the developmental regulation of sp-d expression. in general, sp-d increases rapidly late in gestation [ ] [ ] [ ] [ ] . the production of sp-d increases during the culture of fetal lung explants, and expression can be increased with glucocorticoids [ , , ] . the exposure of fetal rats to glucocorticoids in vivo leads to precocious expression with increased numbers of sp-d-expressing cells and increased cellular levels of sp-d message [ , , ] . although sp-d is produced constitutively within the lung, protein accumulation and gene expression are inducible and increases in sp-d expression have been observed in a number of disease states or models (tables and ). in general, the synthesis and secretion of sp-d increase in association with lung injury and activation of the respiratory epithelium [ ] . for example, levels of sp-d mrna and sp-d accumulation are increased within - h after intratracheal instillation of lps [ • ], and sp-d expression by alveolar and bronchiolar epithelial cells increases after exposure of rats to % o for h [ ] . keratinocyte growth factor (kgf) increases sp-d expression and protein production in association with pneumocyte hyperplasia and after injury caused by bleomycin [ ] . in addition, the levels of sp-d can increase markedly in response to the overexpression of certain cytokines, such as interleukin- , or in response to microbial challenge [ , ] . studies of the upstream regulatory region of the sp-d gene have demonstrated increased promoter activity in the presence of glucocorticoids, which is consistent with the findings in vivo and in lung organ culture [ ] . however, no functional glucocorticoid response elements have been identified, and the effects of dexamethasone seem to be secondary and involve the effects of other transregulatory molecules. the activity of the human sp-d promoter is dependent on a conserved activator protein- (ap- ) element (- ) that binds to members of the fos and jun families of transcriptional factors [ ] . in addition, the promoter contains multiple functional binding sites for ccaat-enhancer-binding protein (c/ebp) transcription factors. mutagenesis experiments suggest that these are required for basal and stimulated promoter activity, and promoter activity is markedly increased in h cells after co-transfection with c/ebpβ cdna (yc he and e crouch, unpublished data). the importance of the conserved ap- element and the presence of multiple binding sites for c/ebp transcription factors is consistent with the observed modulation of sp-d expression in the setting of tissue injury. sp-d promoter activity is not dependent on the binding of thyroid transcription factor (ttf- ) [ ] . however, promoter activity is dependent on two interacting forkhead binding sites, upstream and downstream of the ap- element; these sites bind to hepatic nuclear factor- α and apparently other forkhead box proteins in h lung adenocarcinoma nuclear extracts [ ] . initial comparison of genomic and cdna sequence suggested the existence of genetic polymorphisms in the sp-d coding sequence, including one in the n-terminal propeptide domain (thr compared with met in the mature protein) and three additional differences within the collagen domain at positions , , and [ ] . the latter substitutions are conservative to the extent that they are not expected to disrupt the collagen helix. floros table increased sp-d accumulation or expression in animal models silicosis rat [ ] hyperoxia rat [ ] endotoxin (lps) rat [ ] challenge with p. aeruginosa mouse [ ] challenge with iav mouse [ ] challenge with pneumocystis carinii scid mouse [ ] rat [ ] overexpression of interleukin- mouse [ ] scid, severe combined immunodeficiency. and co-workers have recently confirmed the existence of polymorphisms at positions and of the mature protein [ ] . the potential biological significance, if any, is not known. interestingly, the kda variant of sp-d showed o-linked glycosylation of thr [ • ], suggesting that this polymorphism might be associated with altered glycosylation. interestingly, the kda variant was recovered as trimeric subunits, raising the possibility that differences in the glycosylation of residue , which is immediately n-terminal to cys , could influence multimerization and the capacity of sp-d to participate in bridging interactions. there is increasing evidence that sp-d interacts specifically with a wide variety of respiratory pathogens, modulates the leukocyte response to these organisms, and participates in aspects of pulmonary immune and inflammatory regulation (table ) . sp-d can influence the activity of phagocytes through crd-dependent and crd-independent interactions. at least some of the effects of sp-d result from aggregation with enhanced binding of the agglutinated ligand to their natural 'receptors'. although the lung is the major site of sp-d expression, it is likely that the protein has more generalized roles in host defense and the acute response to infection and tissue injury. collectins and pulmonary host defense immunomodulatory functions of surfactant lung surfactant proteins involved in innate immunity interactions of surfactant protein a with pathogens the role of collectins in host defense structural aspects of collectins and receptors for collectins immunocytochemical localization of surfactant protein d (sp-d) in type ii cells, clara cells, and alveolar macrophages of rat lung surfactant protein d: subcellular localization in nonciliated bronchiolar epithelial cells localization and developmental expression of surfactant proteins d and a in the respiratory tract of the mouse localiza-• tion of lung surfactant protein d (sp-d) on mucosal surfaces in human tissues this recent study was the first to systematically investigate the extrapulmonary expression of sp-d in human tissues immunolocalization of sp-d in human secretory tissues surfactant protein a and d expression in the porcine eustachian tube expression of hydrophilic surfactant proteins by mesentery cells in rat and man mouse surfactant protein-d. cdna cloning, characterization, and gene localization to chromosome surfactant proteins a (sp-a) and d (sp-d): levels in human amniotic fluid and localization in the fetal membranes antiviral activity of bovine collectins against rotaviruses recombinant sp-d carbohydrate recognition domain is a chemoattractant for human neutrophils interactions of pulmonary surfactant protein d (sp-d) with human blood leukocytes surfactant proteins a and d specifically stimulate directed actin-based responses in alveolar macrophages rat surfactant protein d enhances the production of oxygen radicals by rat alveolar macrophages effects of endotoxin on surfactant protein a and d stimulation of no production by alveolar macrophages induction of matrix metalloproteinase biosynthesis in human alveolar macrophages exposed to surfactant protein d (sp-d): possible roles in pulmonary host defense binding of surfactant protein d (sp-d) to membrane glycolipids on alveolar macrophages inhibitory effect of •• pulmonary surfactant proteins a and d on allergen-induced lymphocyte proliferation and histamine release in children with asthma surfactant protein d binding to alveolar macrophages cloning of gp- , a putative opsonin receptor for lung surfactant protein d isolation and characterization of a new member of the scavenger receptor superfamily, glycoprotein- (gp- ), as a lung surfactant protein-d binding molecule gp was the first protein shown to bind to sp-d in a crd-independent manner. although it can be found on the surface of alveolar macrophages, it remains uncertain whether it can function as a cellular receptor binding and uptake of surfactant • protein d by freshly isolated rat alveolar type ii cells this study describes the direct binding of sp-d to type ii pneumocytes. the mechanism appears distinct from that observed for alveolar macrophages this study describes specific ultrastructural alterations in the organization of phospholipid mixtures containing phosphatidylinositol. similar tubular structures are found in type ii cell cultures collectins and pulmonary innate immunity nitration of surfactant protein a results in decreased ability to aggregate lipids nitrated sp-a does not enhance adherence of pneumocystis carinii to alveolar macrophages increased suscepti-• bility of diabetic mice to influenza virus infection: compromise of collectin-mediated host defence of the lung by glucose? is uncontrolled diabetes mellitus associated with defective collectin function? surfactant protein-d regulates surfactant phospholipid home ostasis in vivo this paper and the following paper by botas and coworkers [ • ] describe the phenotype of sp-d deficient transgenic mice altered surfactant homeostasis and alveolar type ii cell morphology in mice lacking surfactant protein d increased metalloproteinase activity, oxidant production, and emphysema in surfactant protein d gene-inactivated mice altered surfactant function and structure in sp-a gene targeted mice surfactant metabolism in surfactant protein a-deficient mice surfactant protein a (sp-a) gene targeted mice p. carinii induces selective alterations in component expression and biophysical activity of lung surfactant interleukin- enhances pulmonary clearance of pseudomonas aeruginosa surfactant protein-d modulates lung inflammation with respiratory syncytial virus infection in vivo pulmonary-specific expression of sp-d corrects pulmonary lipid accumulation in sp-d gene-targeted mice • recombinant rat surfactant-associated protein d inhibits human t lymphocyte proliferation and il- production this paper was the first to describe direct, inhibitory effects of sp-d on the stimulated proliferation of blood lymphocytes interaction of • human lung surfactant proteins a and d with mite (dermatophagoides pteronyssinus) allergens this study was the first to suggest that interactions of sp-d with glycoconjugates on particulate allergens might influence the immune response developmental expression of pulmonary surfactant protein d (sp-d) modulation of surfactant protein d expression by glucocorticoids in fetal rat lung ontogeny of surfactant apoprotein d, sp-d, in the rat lung regulation of surfactant protein d in human fetal lung regulation of surfactant protein d expression by glucocorticoids in vitro and in vivo pre-and postnatal stimulation of pulmonary surfactant protein d by in vivo dexamethasone treatment of rats surfactant proteins a • and d increase in response to intratracheal lipopolysaccharide this important paper suggests that sp-a and sp-d participate in the acute response to lung injury brief exposure to % oxygen alters surfactant protein d and mrna in adult rat alveolar and bronchiolar epithelium kgf increases sp-a and sp-d mrna levels and secretion in cultured rat alveolar type ii cells characterization of the human surfactant protein d promoter: transcriptional regulation of sp-d gene expression by glucocorticoids proximal promotor of the surfactant protein d (sp-d) gene: regulatory role of ap- , forkhead box, and gt-box binding proteins genomic organization of human surfactant protein d (sp-d). sp-d is encoded on chromosome q . - . novel, non-radioactive, simple and multiplex pcr-crflp methods for genotyping human sp-a and sp-d marker alleles recognition of klebsiella pneumoniae by pulmonary c-type lectins deficient hydrophilic lung surfactant proteins a and d with normal surfactant phospholipid molecular species in cystic fibrosis serial changes in surfactant-associated proteins in lung and serum before and after onset of ards decreased contents of surfactant proteins a and d in bal fluids of healthy smokers surfactant proteins a and d in premature baboons with chronic lung injury (bronchopulmonary dysplasia). evidence for an inhibition of secretion deficiencies in lung surfactant proteins a and d are associated with lung infection in very premature neonatal baboons pulmonary surfactant protein d in sera and bronchoalveolar lavage fluids surfactant protein-a concentration in bronchoalveolar lavage fluids of patients with pulmonary alveolar proteinosis surfactant protein d. increased accumulation in silica-induced pulmonary lipoproteinosis surfactant protein-d mediates aggregation of pneumocystis carinii il- increases surfactant and regulates metabolism in vivo this paper was the first to definitively demonstrate a classical opsonic activity of sp-d. the studies further suggest the existence of an opsonic receptor on alveolar macrophages. key: cord- - l fdmi authors: marquet-blouin, e.; bouche, f.b.; steinmetz, a.; muller, c.p. title: neutralizing immunogenicity of transgenic carrot (daucus carota l.)-derived measles virus hemagglutinin date: journal: plant mol biol doi: . /a: sha: doc_id: cord_uid: l fdmi although edible vaccines seem to be feasible, antigens of human pathogens have mostly been expressed in plants that are not attractive for human consumption (such as potatoes) unless they are cooked. boiling may reduce the immunogenicity of many antigens. more recently, the technology to transform fruit and vegetable plants have become perfected. we transformed carrot plants with agrobacterium tumefaciens to generate plants (which can be eaten raw) transgenic for an immunodominant antigen of the measles virus, a major pathogen in man. the hemagglutinin (h) glycoprotein is the principle target of neutralizing and protective antibodies against measles. copy numbers of the h transgene were verified by southern blot and specific transcription was confirmed by rt-pcr. the h protein was detected by western blot in the membrane fraction of transformed carrot plants. the recombinant protein seemed to have a % lower molecular weight than the viral protein. although this suggests a different glycosylation pattern, proper folding of the transgenic protein was confirmed by conformational-dependent monoclonal antibodies. immunization of mice with leaf or root extracts induced high titres of igg and igg a antibodies that cross-reacted strongly with the measles virus and neutralized the virus in vitro. these results demonstrate that transgenic carrot plants can be used as an efficient expression system to produce highly immunogenic viral antigens. our study may pave the way towards an edible vaccine against measles which could be complementary to the current live-attenuated vaccine. the development of genetic transformation technology has allowed the expression of foreign genes in an increasing number of plant species. the use of plants for the production of foreign antigen proteins that could serve as experimental immunogens was first reported in the early s (cardineau and curtis, ; mason et al., ) . since then, a number of viral and bacterial antigens have been expressed in a variety of plant species (mcgarvey et al., ; thanavala et al., ; carrillo et al., ; gomez et al., ; modelska et al., ; tacket et al., ; wigdorovitz et al., ) . despite differences in post-translational processing viral and bacterial antigens preserved their immunogenic properties when produced in plants and induced cross-reactive and sometimes neutralizing and protective antibodies. plants could therefore be an inexpensive source of antigens that could be easily purified for parenteral inoculation (thanavala et al., ; gomez et al., ) . moreover, oral ingestion of plants expressing high levels of antigens bear the potential of edible vaccines (kong et al., ) . strategies based on potent mucosal adjuvants such as cholera toxin and heat-labile enterotoxin of escherichia coli (haq et al., ; arakawa et al., a, b) may pave the road for oral immunization. research on edible plant vaccines has been carried out mostly in plant species largely inappropriate for human consumption (e.g. tobacco, nicotiana tabacum, thanavala et al., ; mason et al., ; huang et al., ; nicotiana benthamiana, modelska et al., ; arabidopsis thaliana, carrillo et al., ; gomez et al., ) . for most proteins of human pathogens expressed in edible plants, potatoes were used, which are normally boiled before consumption (mason et al., ; arakawa et al., a; richter et al., ; kong et al., ) . it can be anticipated that many antigens would not resist cooking without being denatured and that cooked plant material is less immunogenic than raw plants (kong et al., ) . therefore, there is a need to develop other and more appropriate transgenic plant species that can serve as edible vaccines. the technology for creating other transgenic edible plants, including fruits and vegetables has been further perfected (schenk et al., ; brodzik et al. ) . we chose to transform transgenic carrots, which can be grown in most parts of the world, which can be eaten both raw and cooked, and are part of the early diet of infants. current life-attenuated measles vaccines are given routinely at to months of age. after a single injection, seroconversion rates are high, complications are rare and protection is long-lasting. these advantages are difficult to match by other experimental measles vaccine. however, after years % of vaccinees are thought to have lost protective levels of antibodies (mossong et al., ) . revaccination with an oral vaccine which would boost the residual immunity would be a preferred strategy since it can be selfadministered, requires less training of health workers and avoids the risks associated with needle injections. the potential of a parenteral/oral prime-booster schedule has been demonstrated for a number of pathogens (kong et al., ; mantis et al., ) . such a schedule is probably less liable to problems of weak and variable responses after oral vaccination. a strategy based on oral vaccination could be a particularly useful for large-scale booster immunization in developing countries where the need to deliver parenteral vaccines may hamper eradication efforts. measles is caused by a paramyxovirus (mv) which projects two glycoproteins, the hemagglutinin (h) and the fusion protein, from the outer viral envelope. the h protein is responsible for the attachment of the virus to the host cell (naniche et al., ) , whereas the fusion protein is directly involved in the fusion of viral and target cell membranes required for the penetration of the virus (wild et al., ) . virus-neutralizing and protective antibodies are mainly directed against the hemagglutinin and, to a lesser extent, the fusion protein (mcfalin et al., ; giraudon and wild, ) . the aim of this study was ( ) to explore the potential of carrots as an expression system for antigens that is suitable for human consumption, and ( ) to test whether the measles virus hemagglutinin glycoprotein would preserve its neutralizing immunogenicity in this system. although some work has been done with transgenic carrot callus cells (brodzik et al., ) , this is one of the first reports of the expression of a transgenic antigen in mature carrots, showing that high levels of virus-neutralizing antibodies can be induced with a glycoprotein produced in this plant. the coding sequence corresponding to the measles virus hemagglutinin (mv-h) protein (bouche et al., a) was subcloned into the expression cassette of the prtl vector at the ncoi-bamhi sites (restrepo et al., ) . in this vector, the mv-h sequence was under the control of the constitutively expressed cauliflower mosaic virus (camv) s promoter fused to the tobacco etch virus (tev) -untranslated region, a translational enhancer, and the camv s terminator (odell et al., ; pietrzak et al., ) . these control sequences were flanked with hindiii restriction sites that allowed their transfer, together with the h sequence, into the t-dna region presents in binary vector pbin (bevan, ) , creating the recombinant plasmid pbin -mvh. the t-dna region, delimited by the right and left border sequences, also contains the neomycin phosphotransferase ii gene (nptii) that provides neomycin and kanamycin resistance to transformed plants ( figure ). after subcloning of the expression cassette in pbin and transformation of xl -blue cells kanamycin-resistant colonies were picked and checked for the presence of the expression cassette by pcr and automated sequencing ( a model, perkin elmer, netherlands). plasmid dna was then isolated from a positive clone and introduced in agrobacterium tumefaciens strain lba by electroporation ( µf, v, ). transformed bacteria were selected on yeb-agar solid medium containing µg/ml kanamycin ( • c, h) and were used for subsequent carrot transformation. the protocols of a. tumefaciens-mediated transformation of hypocotyls were modified for the production of transgenic carrot plants (hardegger and sturm, ; tokuji and fukuda, ; brodzik et al., ) . sterilized carrot seeds (herrera-estrella and simpson, ; daucus carota cv. senkou-gosun, kindly given by dr tokuji, japan) were sown in . % agar in the dark at • c. after days hypocotyl segments were harvested in gamborg b liquid medium (b ) supplemented with % sucrose and with the phytohormone , -dichlorophenoxyacetic acid ( mg/l) for h at • c. after washing, the segments were placed for days on phytohormone-free b solid medium ( . % agar) containing % sucrose. the hypocotyl fragments were then transformed by immersion ( h) in the bacterial suspension of a. tumefaciens containing the recombinant pbin -mvh binary plasmid. the segments were further co-cultured in the dark at • c with the bacteria on b solid medium with % sucrose. five days later, the explants were washed with b medium containing cefotaxime ( mg/l) to kill remaining agrobacteria. explants were grown in darkness at • c on b solid medium containing cefotaxime ( mg/l) and geneticin ( mg/l) for the selective growth of transgenic cells. after weeks of selection, somatic embryos resistant to geneticin appeared on the hypocotyl segments. each hypocotyl usually gave rise to a few embryos. embryos were subcultured and rooted on selective medium at • c under light. plantlets with adequate roots were transferred to potting soil and grown in a greenhouse under normal light and humidity conditions. several lines of transgenic carrots were obtained and further studied. genomic dna was isolated from both untransformed and transformed plants by macerating frozen leaves (ca. g) in liquid nitrogen. the resulting powder extract was re-suspended in the extraction buffer ( mm tris-hcl, mm edta, . m nacl, mm -mercaptoethanol and % n-cetyl-n,n,n,trimethylammonium bromide, ph . ) and incubated at • c for min. after a chloroform extraction, nucleic acids were precipitated with . volume of isopropanol and the pellet obtained after centrifugation was resuspended in tris-hcl ( mm, ph . )/ edta ( mm) buffer. after rnase treatment the dna was stored at − • c until being used. a fragment of the mv-h expression cassette (promotor-mvh-terminator) was specifically amplified with a forward primer ( -gcaagacccttcctctatat- ) from the s promoter region and a reverse primer ( -atctgggaactactcacac- ) from the s terminator region. the presence of the nptii gene was detected by pcr amplification with a pair of specific primers within the nptii gene ( -tgctcctgccgagaaagtatc- and -tcctgtatcgcaaccgatgggc- ). genomic dna of untransformed plants was used as negative control. southern blotting with genomic dna was performed following conventional protocols. briefly, µg of extracted dna was digested overnight at • c with ecori which cuts the recombinant t-dna at a single position (between the s promoter and the tev leader). after agarose gel ( . %) electrophoresis, digestion products were transferred overnight onto a nylon membrane (hybond+, amersham-pharmacia biotech, uk). the membrane was equilibrated with × sspe prior to immobilization of dna by uvcross-linking (uv stratalinker, stratagene, netherlands). for hybridization, a p-labelled h-specific cdna probe, generated by nick translation according to feinberg and volgelstein ( ) , was incubated with the membrane for h at • c. the membrane was then washed for min at • c successively with . % sds in × ssc, × ssc and finally in . × ssc. hybridized complexes were detected by autoradiography. total rna from leaves ( g) of transformed plants was isolated as described by hughes and galau ( ) . rna from untransformed plants was used as a negative control template. reverse transcription (rt) was carried out for h at • c in µl final reaction volume containing µg rna, ng random hexamers and a mixture of dntps ( mm each), mm dtt, units of rnasin (promega, netherlands) and units of m-mlv reverse transcriptase (superscript ii, gibco life sciences, belgium). after adding . volume of mm atp, ligation was carried out by incubation for min at • c in the presence of units of t dna ligase. pcr amplification was performed with two h-specific primers including the first and the last nucleotides of the coding sequence. the pbin -mvh vector was used as positive control template. a pcr reaction was also performed directly on the total rna to confirm the absence of specific dna in the extract. plant tissues were homogenized on ice in pbs containing mm edta and protease inhibitors ( µg/ml aprotinin, . mg/ml iodoacetamide, µg/ml leupeptin, mm pmsf). the mixture was centrifuged at × g ( min, • c) to remove insoluble debris. the membrane fraction was sedimented by ultracentrifugation at × g ( min, • c), and re-suspended in pbs containing . % np- . protein concentration was determined with the dc protein assay kit (biorad, belgium). one gram of wet weight of carrot plant gave about µg of membrane protein. proteins of the membrane fraction were separated by % sds-page under reducing and denaturing conditions ( mm dtt, m urea, % sds). proteins were blotted onto nitrocellulose membrane in tris-glycine buffer for h at ma. the membrane was blocked for h at room temperature with % non-fat dehydrated milk in pbs containing . % tween- . for detection of the mv-h protein two specific monoclonal antibodies (mabs; bh and bh , : dilution) and goat anti-mouse iggconjugated horseradish peroxidase secondary antibodies ( : dilution) were used. bound antibodies were detected by enhanced chemiluminescence (ecl kit, amersham-pharmacia biotech). to characterize recombinant h protein in leaf and root extracts microtiter plates (maxisorb, nunc, denmark) were coated with increasing concentrations of plant extract (in pbs) and revealed with h-specific mabs (dilution : ). bh recognizes the sequential helix-forming epitope h - (fournier et al., ; deroo et al., ) ; bh and bh are conformation-dependent antibodies (unpublished); bh binds only denatured protein (ziegler et al., ) ; bh binds to the hemagglutinin noose epitope (hne) h - (ziegler et al., ) . mouse sera ( : to : ) were titrated against immobilized purified h protein (bhk-h) produced in bhk- cells ( ng/well; bouche et al., b) . the coated antigens were washed and blocked with % bsa in tris-buffered saline ( mm, ph . ). after addition of mabs or mouse serum, alkaline phosphatase-conjugated goat anti-mouse igg ( : ; southern biotechnology association, usa) and pnitrophenylphosphate (sigma, usa) were used for detection. optical density (od) was measured after min at nm. data were expressed as net od values after subtracting either the absorbance of the negative control antigen (e.g. extract of wild-type roots or leaves, or bhk- antigen; figure ) or the absorbance of the conjugate without mouse serum (< . od). h-specific antibody isotypes and subclasses were determined in mouse sera ( : ) by elisa with specific conjugates and substrate of a commercial kit (biorad). data were expressed as od at nm after min as recommended. groups of four spf balb/c mice were primed by intraperitoneal injection of with µg of leaf or root extracts from transgenic or wild-type (wt) plants or µg of bhk-h or bhk- antigens emulsified ( : ) in freund's complete adjuvant (sigma). by comparison with the elisa signal of mammalianexpressed purified h protein, the plant extract and the bhk extract was estimated to contain about and µg of the specific protein, respectively. mice were boosted on days , and with the same antigen preparation emulsified in freund's incomplete adjuvant (sigma). sera were drawn days after boosting and were tested as pooled sera (after the second boost) or as individual sera (after the third boost). the reactivity of immune sera ( : ) with native h protein or mv was tested by flow cytometry as described before using permanently h-transfected mel-juso cells expressing the recombinant protein at their surface (mel-juso/h, gift of r. de swart, rotterdam, netherands; de swart et al., ) or mvsuperinfected ebv-transformed human b cell line (mv-wmpt; gift of b. chain, london, uk; muller et al., ) . for both assays wt or uninfected cells (mel-juso/wt or wmpt) were used as negative control cells. cells were incubated on ice ( min) with diluted serum of individual mice. after washing, fitc-conjugated goat anti-mouse fc-specific antibody ( : , sigma) was used for detection. fitcconjugate alone, naïve serum on positive and negative cells or test serum on negative cells served as negative controls. bh ( : ; ziegler et al., ) served as an antibody positive control. dead cells were excluded by propidium iodide staining ( µg/ml). to test mv-neutralizing activity, mouse sera were heated for min at • c to inactivate complement. duplicates of two-fold serial dilutions ( : to : ) of heat-inactivated serum were mixed with an equal volume of medium containing plaqueforming units of edmonston strain mv. after . h of pre-incubation at • c, the mixture was added to a subconfluent vero cell culture in -well plates ( . × cells per well). after one hour unbound virus was removed and cells were covered with a carboxymethylcellulose ( %) overlay. after four days of incubation under tissue culture conditions, a . % neutral red solution was added. on day , the overlay was removed and cells were fixed with a % formaline solution. the % neutralization titre, defined as the reciprocal of the dilution that reduces the number of plaques by %, was calculated accord-ing the spearman-kärber method. titres < were considered to be negative. the transformation of carrot plants was mediated by recombinant agrobacterium infection using the pbin -mvh plasmid (figure ). the regenerated transgenic plants showed no morphological changes in comparison to wt carrot plants (data not shown). about plants resulting from independent transformation events were selected and grown in the greenhouse. ten of them were analysed further. the presence of the mv-h expression cassette in transgenic plants was confirmed by pcr followed by gel electrophoresis of the amplified fragments (figure a) . from all transformed plants tested a product of the expected size ( . kb) was amplified (lanes - ). the same size was obtained with pbin -mvh vector as a positive control template (lane ). this product was absent in dna of untransformed plants (lane ). similarly, the presence of the nptii gene was confirmed as a specific product of bp in all geneticin-resistant plants ( figure b, lanes - ) . this product was absent from untransformed plants (lane ). copy numbers of the transgene and the integration pattern were determined by southern blot by digesting genomic dna of the transformed plants ( figure c ) with ecori (which has a unique restriction site in the t-dna, see figure ). hybridization with a mv-h-specific probe showed in every transgenic plant a unique restriction pattern (lanes - ) indicating that insertions occurred at random sites throughout the genome. all plants appeared to have integrated a single copy of the transgene, as illustrated by the presence of a single hybridization band. the insert corresponding to the radioactive probe was used as positive control template (lane ). no signal appeared when genomic dna of wt plants was used as a template (lane ). the transcription of the mv-h gene was analysed by rt-pcr on total rna extracted from transgenic plants ( figure a ). untransformed plants served as a negative control (lane ). all tested plants produced a specific major transcript of the expected size lanes - ) . the minor unspecific, smaller band was also found when other transgenes were tested. no amplified dna was detected when the pcr was directly performed on the rna preparations, confirming the rna-specificity of the reaction (lane ). crude membrane preparations of leaves ( ng protein) and of bhk-h cells ( ng protein) gave specific bands of similar intensities in western blots ( figure b ). based on elisa with purified h protein the content of specific protein was estimated at about % and % of specific protein in membrane fraction. the h-specific monoclonal antibodies bh and bh that bind to two non-overlapping epitopes revealed a strong under reducing conditions with an estimated molecular mass of about kda (lane ). under the same conditions, h protein produced in mammalian cells (bhk-h) migrated with an apparent size of ca. kda as reported earlier (lane ; bouche et al., b) . this difference in mass may correspond to different levels of glycosylation of the monomer, but this was not further explored (vialard et al., ) . negative controls included an irrele- vant mab (lanes - ) as well as a crude membrane preparation of wt leaves (lanes ). the western blot suggested that there may be differences in glycosylation. glycosylation is well known to be important for the proper folding of the h protein (hu et al., ) . several monoclonal antibodies were used to investigate the conformational integrity of the transgenic protein ( figure a ). microtitre plates were coated with increasing concentrations ( . - ng/well) of membrane fractions from both transgenic leaves and roots. high net signals were obtained with two conformational-dependent mabs (bh and bh ), and bh which recognizes the sequential epitope h - (fournier et al., ) with a putative helical conformation (deroo et al., ) . in contrast, bh , which binds denatured protein only, showed essentially no reactivity above the background of wt plants. bh binds to the hemagglutinin noose epitope (hne, h - ) with its oxidized cysteine bridge (ziegler et al., ) . the antibody binding pattern was essentially the same in root and leaf extracts and resembled that of purified h protein of mammalian origin ( figure b ). in general, signals tended to be higher in younger than in older tissues (data not shown). since the transgenic h protein seemed to be antigenically conserved, its immunogenicity was tested in mice. after or boosts with transgenic plant extracts all animals showed reactivity with purified h protein produced in mammalian cells ( figure a ). after the third boost, average antibody levels obtained with leaf extracts were about times higher than after immunization with root extracts. similar levels were found after boost for leaves, whereas antibodies still increased between boost and with root extracts. sera of control mice immunized with wt plant extract showed no cross-reactivity with the h protein (net od < . ; data not shown). interestingly, antibodies generated with the transgenic leaves were of both the igg and igg a subclass, whereas the immune response against the h protein produced in bhk- cells was essentially restricted to igg , suggesting a difference in the th /th balance ( figure b ). no h-specific igg b, igg , iga and igm were detected. to exclude that the reactivity may be due to partially denatured recombinant protein, the sera were further tested for antibodies against the intact native protein expressed on mv-superinfected wmpt cells ( figure a ) and h-transfected mel-juso cells (figure b ). the flow cytometry data showed that all mice vaccinated with transgenic leaf or root extracts produced high levels of antibodies cross-reacting with the native protein independently whether virus-infected or h-protein transfected cells were used. antibody levels were similar to those of mice immunized with h protein produced in mammalian cells, while wt plants induced no cross-reactive antibodies. in addition to a strong virus cross-reactivity, all sera showed high levels of neutralizing antibodies in a standard plaque reduction neutralization assay. in the group immunized with leaf extracts, mean neutralizing titres of (range - ) were observed, while root extracts induced significantly lower titres ( ; range - ). all sera from mice given extracts of untransformed plants had titres < and were considered negative. in general, both cross-reactive and neutralizing titres were higher in leaves than in roots. this could reflect a difference in expression or in the yield of the extraction procedure. although edible vaccines seem to be feasible, very few antigens have been expressed in plants fit for human consumption (modelska et al., ; kapusta et al., ; sandhu et al., ) . for instance, potatoes have been used to express antigens of human pathogens (mason et al., ; arakawa et al., a, b; richter et al., ; kong et al., ) . however, raw potatoes are not very appealing, and cooking can drastically reduce the immunogenicity of the vaccine (kong et al., ) . this is the first report of the expression of an antigenic protein as a transgene in mature carrots (which can be eaten raw by human beings) and of the antigenic and immunogenic properties of the heterologous protein in mice. genomic integration of the mv-h gene under the camv s double promoter was obtained using agrobacterium tumefaciens. southern blot analysis showed that all regenerated transgenic plants analysed integrated a single copy of the transgene and that integration occurred randomly in the carrot genome. in all clones a specific transcript of the expected size could be amplified by rt-pcr. under the fluorescence microscope, a fusion protein of mv-h with the green fluorescence protein (gfp) expressed in tobacco protoplasts showed a predominant association with the plasma membrane (data not shown). the membrane fraction of carrot cells produced a strong signal in western blot correspond-ing to an estimated µg of specific protein per gram of wet weight. interestingly, the transgenic protein migrated faster in sds-page than the same protein produced in mammalian cells, suggesting a size difference of about kda. a similar observation was made when the rabies virus glycoprotein g was expressed in plants (mcgarvey et al., ) . the expression of the c-terminally fused gfp demonstrates that the h protein is fully translated and that the difference in apparent size is most probably due to post-translational modifications. in the virus, the edmonston strain h protein undergoes glycosylation at of the predicted n-glycosylation sites (hu et al., ) . it has been shown that glycosylation of mammalian proteins is efficient in plants, but normally different carbohydrate side chains are utilized (bardor et al., ) . the complex glycans of plants are often smaller than those of animals, partially because they lack sialic acid (faye et al., ) . in insect cells, which also lack sialic acid, the reduced size of the recombinant h protein was explained by a difference in glycosylation (vialard et al., ) . authentic post-translational modifications such as glycosylation and cystine-bridge formation are thought to be important for intracellular trafficking (hu and norrby, ) as well as the antigenic conformation of the h protein (hu et al., ; hu and norrby, ) . recently, huang et al. ( ) reported the expression of the h protein in tobacco leaves. in this system, the native h protein was undetectable. after adding a retention signal for the endoplasmic reticulum low levels of protein became detectable by elisa with polyclonal sera from rabbits and man; the reactivity with mabs was even weaker or negative suggesting that the conformation may have been less than optimal. the immunogenic integrity of the h protein is critical for the induction of antibodies that protect against virus infection and disease. conformational dependent mabs confirmed the proper antigenic structure of the transgenic protein produced in carrots; while mab bh which recognizes only denatured protein showed no reactivity. after immunization with extracts of transgenic carrots high levels of virus-neutralizing antibodies were found, using the who recommended plaque neutralization assay (miller et al., ) . these titres were comparable to those obtained with the mammalian cell-derived h protein extract. the antibody isotype subclasses suggested that in contrast to the h protein of mammalian origin, which produced primarily a th (or antibody-dominated) response, the plant protein induced a th /th -balanced response, indicative of both a humoral and cellular response. this may be important to safeguard against atypical measles when such a vaccine is used in unprimed individuals. in many countries carrots are components of the diet of both adults and children and a stable antigenic transgene in an edible plant that can be consumed raw without further processing could bring a vaccine within reach of the most destitute. although plants can be used as efficient bioreactors for producing antigens, the full potential of plant-based vaccines becomes apparent when immunogenicity can be demonstrated after oral ingestion. a frequent problem of plant-based vaccines is that the response after oral delivery is inconsistent and variable. sometimes even low levels of antigen can give an effective response after oral administration in mice (mason et al., ; wigdorovitz et al., ) and man (kapusta et al., ) . in other cases mucosal immunity was improved by mucosal adjuvants (arakawa et al., b; kong et al., ) . in the study by huang et al. ( ) , the immune response after oral administration of transgenic tobacco was very weak even when exceedingly high levels ( times mg) of ctb where co-administered. however, when experimental oral vaccines were given after parenteral priming, results were more consistent than after oral immunization alone (kong et al., ; mantis et al., ) . therefore, a prime booster schedule with an oral vaccine could be an attractive alternative when immunity to the current (injectable) live-attenuated measles vaccine wanes in adults. further studies are required to confirm the efficacy of the carrot plant expressed measles antigen in such a scenario. efficacy of a food plant based cholera toxin b subunit vaccine a plant-based cholera toxin b subunit-insulin fusion protein protects against the development of autoimmune diabetes analysis of the n-glycosylation of recombinant glycoproteins produced in transgenic plants binary agrobacterium vectors for plant transformation immunosorbent assay based on recombinant hemagglutinin protein produced in a high-efficiency mammalian expression system for surveillance of measles immunity a simplified immunoassay based on measles virus recombinant hemagglutinin protein for testing the immune status of vaccinees transgenic plants as a potential source of an oral vaccine against helicobacter pylori oral immunization by transgenic plants protective immune response to foot-and-mouth disease virus with vp expressed in transgenic plants measles virus fusion protein-and hemagglutinin-transfected cell lines are sensitive tool for the detection of specific antibodies by facs-measured immunofluorescence assay enhanced antigenicity of a four-contact-residue epitope of the measles virus hemagglutinin protein by phage display libraries: evidence of a helical structure in the putative active site detection, biosynthesis and some functions of glycans n-linked to plant secreted proteins a technique for radiolabeling dna restriction endonuclease fragments to high specific activity antibodies to a new linear site at the topographical or functional interface between the haemagglutinin and fusion proteins protect against measles encephalitis correlation between epitopes on hemagglutinin of measles virus and biological activities: passive protection by monoclonal antibodies is related to their hemagglutination inhibiting activity expression of immunogenic glycoprotein s polypeptides from transmissible gastroenteritis coronavirus in transgenic plants oral immunization with a recombinant bacterial antigen produced in transgenic plants transformation and regeneration of carrot foreign gene expression in plants role of individual cysteine residues in the processing and antigenicity of measles virus hemagglutinin protein role of n-linked oligosaccharide chains in the processing and the antigenicity of the measles virus haemagglutinin protein plant-derived measles virus hemagglutinin protein induces neutralizing antibodies in mice preparation of rna from cotton leaves and pollen a plant-derived edible vaccine against hepatitis b virus oral immunization with hepatitis b surface antigen expressed in transgenic plants immunization of mice with recombinant gp in a systemic prime/mucosal boost protocol induces hiv- -specific serum igg and secretary iga antibodies expression of hepatitis b surface antigen in transgenic plants expression of norwalk virus capsid protein in transgenic tobacco and potato and its oral immunogenicity in mice monospecific antibody to the haemagglutinin of measles virus expression of the rabies virus glycoprotein in transgenic tomatoes antibodies to measles, mumps, and rubella in uk children years after vaccination with different mmr vaccines immunization against rabies with plant-derived antigen modeling the impact of subclinical measles transmission in vaccinated populations with waning immunity cholera toxin b stimulates systemic neutralizing antibodies after intranasal co-immunization with measles virus human membrane cofactor protein (cd ) acts as a cellular receptor for measles virus identification of dna sequences required for activity of a plant promoter: the camv s promoter expression in plants of two bacterial antibiotic resistance genes after protoplast transformation with a new plant expression vector nuclear transport of plant potyviral proteins production of hepatitis b surface antigen in transgenic plants for oral immunization oral immunization in mice with transgenic tomato fruit expressing respiratory syncytial virus-f protein induces a systemic immune response promoters for pregenomic rna of banana streak badnavirus are active for transgene expression in monocot and dicot plants immunogenicity in humans of a recombinant bacterial antigen delivered in a transgenic potato human immune responses to a novel norwalk virus vaccine delivered in transgenic potatoes immunogenicity of transgenic plant-derived hepatitis b surface antigen a rapid method for transformation of carrot (daucus carota l.) by using direct somatic embryogenesis synthesis of the membrane fusion and hemagglutinin proteins of measles virus, using a novel baculovirus vector containing the β-galactosidase gene induction of a protective antibody response to foot and mouth disease virus in mice following oral or parental immunization with alfalfa transgenic plants expressing the viral structural protein vp measles virus: both hemagglutinin and fusion glycoproteins are requiered for fusion protection against measles virus encephalitis by monoclonal antibodies binding to a cystein loop domain of the h protein mimicked by peptides which are not recognized by maternal antibodies we are grateful to richard wagner and his team (ibmp) for taking care of the transgenic carrots. we also thank b. jérouville, s. willieme and w. ammerlaan for technical assistance. this research was supported by a grant from the ministère de l'education nationale et de la formation professionnelle du grand-duché de luxembourg, the european union th framework programme (project pl ) and the crp-santé, luxembourg. key: cord- -u eaaj authors: stolf, beatriz s.; smyrnias, ioannis; lopes, lucia r.; vendramin, alcione; goto, hiro; laurindo, francisco r. m.; shah, ajay m.; santos, celio x. c. title: protein disulfide isomerase and host-pathogen interaction date: - - journal: scientificworldjournal doi: . / / sha: doc_id: cord_uid: u eaaj reactive oxygen species (ros) production by immunological cells is known to cause damage to pathogens. increasing evidence accumulated in the last decade has shown, however, that ros (and redox signals) functionally regulate different cellular pathways in the host-pathogen interaction. these especially affect (i) pathogen entry through protein redox switches and redox modification (i.e., intra- and interdisulfide and cysteine oxidation) and (ii) phagocytic ros production via nox family nadph oxidase enzyme and the control of phagolysosome function with key implications for antigen processing. the protein disulfide isomerase (pdi) family of redox chaperones is closely involved in both processes and is also implicated in protein unfolding and trafficking across the endoplasmic reticulum (er) and towards the cytosol, a thiol-based redox locus for antigen processing. here, we summarise examples of the cellular association of host pdi with different pathogens and explore the possible roles of pathogen pdis in infection. a better understanding of these complex regulatory steps will provide insightful information on the redox role and coevolutional biological process, and assist the development of more specific therapeutic strategies in pathogen-mediated infections. host cells have the ability to cope with the progression and severity of infection in response to different types of pathogen. on the other hand, numerous mechanisms have evolved that support the use of the host cell machinery to facilitate pathogen survival and multiplication. such co-evolutionary processes are directly affected by different physicochemical factors within different cell compartments, both in the host and in pathogen. for instance, ph critically affects antigen stability of the influenza virus which modulates endosome acidity that attenuates its own infection [ ] . ros (and reactive nitrogen species) production and the redox state of different cell compartments are also critically involved in cellular hostparasite interaction. among the many redox sensitive proteins that are altered during the course of different infections, protein disulfide isomerase (pdi-) mediated redox switches have been associated with pathogen attachment-internalization, antigen processing in the er/phagosome, and the regulation of ros production by nox family enzymes. thus, pdi emerges as a ubiquitous redox protein that regulates different steps of diverse infection processes. several pathogens also have their own pdi that act as an important virulence factor (table ). other redox modifications directly mediated by ros and especially via nitric oxide (no) generated by inducible nitric oxide synthase (inos), which is abundant in phagocytic cells, have been reviewed elsewhere [ , ] and are not considered in this article. below, the main cellular redox aspects of host and pathogen pdi will be discussed. the ancient pdi is a ubiquitous redox chaperone belonging to the thioredoxin oxireductase super family and can reduce (reaction ), oxidize (reaction ), and catalyse dithiol-disulfide exchange reactions (i.e., isomerase activities, reaction , figure ). such broad range of activities overlaps with the chaperone role of pdi that overall performs a housekeeping function in helping to maintain proteins in a more stable conformation. there are around pdi homologues, and the detailed structure and function of eukaryotic pdis have been covered in recent excellent reviews [ , ] . the classic mammalian pdi ( kda) has several domains ordered as a-b-b -a -c with thioredoxin-like motifs (trp-cys-gly-his-cys) displayed in the a and a domain [ ] [ ] [ ] (figure ). pdi is abundant in the er (∼ . mm) where the relatively oxidizing conditions at basal level (i.e., gsh/gssg ratios ∼ - : ) favours pdi isomerase/oxidase activity, which is primarily involved in client protein redox folding (reaction - , figure ). the oxidizing equivalents for this process are driven mainly by the er thiol-containing oxidase, ero (endoplasmic reticulum oxidase- ), which binds fad and is in turn re-oxidized via electron transfer to oxygen, generating h o in the process [ ] [ ] [ ] [ ] . the h o destiny is elusive, but it can oxidize er-located peroxiredoxin iv (prxiv) that is further reduced by pdi that is oxidized in the process [ ] . this redox circuit is thought to increase total protein folding and thiol oxidation via ero [ ] . however, even in the absence of ero , protein folding still occurs, and it is suggested that other oxidases may compensate for redox demand in the er in some circumstances [ , ] . nevertheless, the pdi-ero -dependent oxidative activity is balanced to cytosolic glutathione levels suggesting a functional redox interplay between these compartments [ ] . pdi reductase activity has been primarily associated to more reducing compartments (i.e., gsh/gssg ratios ∼ - : ), such as those in the vicinity of the plasma membrane [ , ] . pdi redox versatility is mainly governed by the low pka of the proximal cysteine on the active n-terminal a domain. indeed, the lower pka of . renders pdi a much better oxidase than thioredoxin, which has a pka of . and is mainly a reductase in most neutral ph cell compartments. it should also be noted that pdi functions as a chaperone independently of its redox-active domains as especially required for its atpase and ca + activity, although pdi redox motifs still stabilize binding interaction [ , , ] . in the er, pdi is tightly associated with prolyl- hydroxylase (the rate-limiting enzyme for collagen biosynthesis), the sec translocon, and the mhc class i complex (see later). it can be also found as a heterodimer with microsomal triglyceride transfer protein [ , ] . pdi is a soluble homodimer and does not have a transmembrane domain and, similarly to other er chaperones, carries the kdel c-terminal sequence which binds to respective receptors in the cop vesicles that circulate in the er-golgi vicinity, recycling proteins back to er. pdi also undergoes intense intracellular trafficking and is found on the surface of diverse prokaryotic and eukaryotic cells [ ] [ ] [ ] . despite limited knowledge about this traffic, it is possible that pdi exits the er through the translocon sec pore and/or via secretory vesicles [ ] . pdi is thought to attach to lipids, glycans, and integral membrane proteins via electrostatic interactions at the cell plasma membrane [ , ] , where its reductive activity mediates the infection of different pathogens ( figure , discussed later). pdi along with erp have been found in the nuclei in association to dna and affecting transcriptional activity of nf-kb, ap- , and stat [ ] . these transcriptional regulators are key elements in many inflammatory processes, but their functional association to pdi and to different pathogen still elusive. in contrast to many other members of its family, such as thioredoxin itself and erp , pdi is not normally found in the cytosol, where it is likely cleaved by caspase- and - [ ] . protozoans and bacteria have their own pdis ( table ). the function of these pdis in protein folding is poorly understood; however, there are intriguing data correlating pdi expression and the pathogenicity of several parasites, especially obligatory intracellular protozoans. leishmania that leads to distinct types [ ] . if l. chagasi is a subgroup of l. infantum brought to america by european colonist or other specie in its own still a matter of controversy (see discussion in [ ] ). the infection cycle in the vertebrate host and is initiated when leishmania promastigote is injected into the skin by the insect vector. in the host, the promastigote is phagocytised especially by macrophages, and further it is converted into intracellular amastigote. amastigote replicates inside the phagosome within the cell and is liberated after the cell lyses, subsequently infecting other cells resulting in the progression to disease [ , ] . l. amazonensis has at least four pdis, and the use of specific pdi inhibitors substantially affected parasite growth [ ] . in l. major, the increased levels of leishmania pdi (lmpdi) expression and secretion at the parasite surface reflects optimal protein folding balanced to parasite multiplication. importantly, this is correlated to high virulence of the parasite strains [ ] . more recently, the use of lmpdi antigens to generate a vaccine for l. major partially protected balb/c animals and accelerated the cure of different strains of mice [ ] . similarly to leishmania species, other parasites of the trypanosomatid group such as trypanosoma contain several genes predicted to encode for pdis, that can execute n-glycosylation and protein folding in the er [ , ] . although pdi was considered essential for t. brucei survival, pdi activity was not essential for the growth of trypanosomes in vitro [ ] . pdi is also expressed in different species of plasmodium protozoans (table ) , the parasites that cause malaria [ , ] . p. falciparum expresses at least nine different pdis and the pfpdi- has great similarity to the prototype pdi and is expressed during all stages of parasite life cycle. this pdi facilitates the disulfide-dependent conformational folding of eba- protein, an emerging candidate for the development of malaria vaccines [ ] . this is intriguing given that malaria parasites express proteins with high content of cysteine, which are associated to parasite invasion and sequestration in the vertebrate host and transmission into mosquito host [ ] . finally, toxoplasma gondii pdi was identified in host tears, suggesting an extracellular location and adhesion to host cells during the initial phase of infection [ ] . antigen presentation occur through two distinct pathways. antigen presenting cells (apcs; especially macrophages and dendritic cells; dcs) are long-lived cells that capture antigens and subsequently process and present them at the cell surface, where they are recognized by t-lymphocytes. this process provides a long-term adaptive immune response to fungi, bacteria, and parasite. after internalization by the apc, antigens pass through phagosome/lysosome vesicles, where they form complexes with mhc class ii (figure ), which are recognized by helper cd + t lymphocytes (exogenous pathway). in contrast, self cell antigens and virus synthesized within cells (mostly non-apcs) are degraded by the proteasome in the cytosol and nucleus. in successive steps, the antigen is processed, folded, and incorporated into the mhc class i ( figure ) and the complex exposed on the cell surface, and recognized by cytotoxic cd + t lymphocytes (endogenous pathway). these two pathways overlap and some antigens are presented by both mhc class i and ii, in a process called cross-presentation. this has been described in dcs responding to viral infection, transplant rejection, and some autoimmune diseases and cancer. moreover, a wide range of pathogens passing or living in the phagosome such as mycobacterium tuberculosis, salmonella typhimurium, toxoplasma gondii, and especially leishmania spp and trypanosoma cruzi are all crosspresented in association to high levels of cd + t cells [ ] . pdi as part of the er protein folding machinery directly regulates antigen processing of the mhc class i complex [ ] [ ] [ ] [ ] . antigens that are degraded by peptidases and proteasome to shorter peptides in the cytosol and nucleus can be further transported to the er through the tap system, a transmembrane er type of atp-binding cassette (abc) peptide transporter family [ ] . er-located pdi interacts with the peptide-loading complex (pcl) that efficiently promotes peptide assembly with mhc class i molecules and supporting the exit of the peptide-antigen complex from the er [ ] [ ] [ ] . other pcl components include calreticulin, tapasin, erp (another pdi family member), and the tap transporter itself. cells lacking pdi present much less peptide loading to mhc class i and the disulfide bridge between the peptide and mhc groove remains in a reduced redox state [ ] . normally, this interaction is affected by the redox exchange between pdi (predominantly oxidized) and erp (predominantly reduced) [ ] , a condition in which pdi favours the release of peptide-mhc class i from the pcl and the antigen-mhci complex is exited from er [ ] . in fact, pdi-bound peptide facilitates the disassembly of the tapasin-erp complex while the pdi unbound to the complex is unable to interact with tapasin-erp , retaining mhc i molecules in the er [ ] . overall, pdi redox activity modulates the stability of the antigen peptide-mhc class i complex and further determines the transport of the complex to the plasma membrane [ ] [ ] [ ] [ ] . these redox effects may vary according to the type of antigen and some pathogens interfere with this pathway to escape antigen process and evading cd + t-cells recognition. this is the case for us protein from human cytomegalovirus, which enhances pdi degradation via the proteasome [ ] . pdi participation in immune response, however, goes beyond its role in the er protein folding machinery and it acts at other cellular steps of host-pathogen interaction. pdi in the er is also thought to play a role in parasite phagocytosis, and the pdi displayed on the cell surface can mediate the entry of some viral, bacterial, and protozoan. pdi is also implicated in protein unfolding and trafficking of some pathogenic antigens across the endoplasmic reticulum and towards the cytosol by the endoplasmic reticulum-associated degradation system (erad). this is the main pathway where proteins are retrotranslocated from the er to cytosol and further degradated by the proteasome. next, we discuss some examples of the cellular association between host pdi and different pathogens. phagocytosis is the main gate for large microbes to enter into apcs. after binding and attaching to the pathogen, these cells can internalize organisms and large particles even bigger then their own size, which are then phagocytosed in an active process that involves intense membrane remodelling [ ] . proteomics studies accumulated over the last decade revealed the presence of er chaperones in the isolated phagosome, uncovering a process called er-mediated phagocytosis [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] . er chaperones were detected in phagosomes of macrophages exposed to different particulate material and pathogens, including latexbeads opsonised or not with immunoglobulin g (igg) or mouse serum (to facilitate entry through the fcr or complement receptors), igg-opsonized erythrocytes, promastigotes of leishmania donovani derived from wild-type cells or cell-surface lpg knockout, among other parasites [ ] . a mix of er and endocytic vesicles in the formation of the parasitophorous vacuoles (pvs) during the uptake of different leishmania spp. was recently shown in macrophages overexpressing er-tagged green fluorescent protein [ ] . the presence of the er proteins sec , bip/grp , and pdi in the phagosome of apcs [ , ] support the idea that the er provides the necessary machinery for antigen translocation from the phagosome to the cytoplasm and thus, possibly converges mhc class i and class ii antigen cross-presentation [ ] (figure ). there are several other complementary hypotheses on how peptides cross from phagosomes to cytoplasm during the cross-presentation process [ , ] . neutrophils are short-lived cells (half-life of - h in the human circulation) and very active in the phagocytosis of large microbes such as bacteria, parasites, and fungus. contrary to apcs, neutrophils only contain restricted amounts of er machinery and are thought to lack the er-mediated phagocytosis process [ ] . whether er proteins functionally operate on phagocytosis-mediated infection has not been well characterised yet. an important work has shown that dictyostelium lacking both er calreticulin and calnexin present altered phagocytic cup formation and substantial decline in phagocytosis [ ] . these two proteins utilise ca, and their disruption per se affects actin filaments and plasma membrane remodelling during phagocytosis [ ] . we recently showed that pdi is critically involved in leishmania parasite infection in vitro [ ] . we showed that phagocytosis of promastigotes (but not amastigotes) of leishmania chagasi was significantly inhibited by macrophage incubation with the thiol/pdi inhibitors dtnb, bacitracin, phenylarsine oxide, and neutralizing pdi antibody in a parasite redox-dependent way [ ] . the phenylarsine response is of particular interest, since this arsenic compound may act similarly to antimonials, widely used in leishmaniasis chemotherapy [ , ] . pdi preferentially affects parasite internalization and the phagocytosis of the promastigote forms is increased when wild-type pdi is overexpressed in macrophages, an effect opposed by pdi knockdown. at later stages of infection (i.e., after h), pdi from promastigote-infected j macrophages was immunoprecipitated and subsequently blotted with an anti-leishmania antibody revealing a parasite band at ∼ kda [ , figure (b), lane ]. subsequent removal and analysis of this band by mass fingerprint spectrometry showed a % match with elongation factor (ef ) of l. major (q q ; data not shown). the incubation of purified bovine pdi (sigma, p ) and parasites did not yield any detectable protein complexes, suggesting that the macrophage milieu may be important to sustain pdi-ef association [ ] . interestingly, leishmania ef has important virulent features and acts as a soluble antigen in lymphocyte stimulation in vitro [ ] and in vivo [ ] . moreover, proteomics studies revealed that ef is secreted during promastigote differentiation into the amastigote stage with potential immunomodulatory proprieties in animal models [ ] . leishmania ef is therefore of particular interest for leishmania therapeutic interventions such as vaccines. although our studies did not address the role of the er in mediating phagocytosis, these data provide compelling evidence for a functional role of er-pdi in a host-parasite interaction. other mechanisms underlining pdi-mediated l. chagasi promastigote phagocytosis involves its association to ros production by phagocyte nadph oxidase and this is discussed next. the nadph oxidase (nox) family of enzymes uses nadph as an electron donor to convert oxygen to superoxide anion (o •− ), a precursor of h o and other powerful oxidants such as hydroxyl radical and peroxynitrite (in the presence of nitric oxide), collectively called ros [ , , ] . each of the seven oxidase family members is characterized by a distinct catalytic subunit (i.e., nox - and duox - ), and has differing requirements for additional protein subunits [ , ] . the prototypic member of the nox family, nox oxidase (or gp phox oxidase), is best known for its role in neutrophil and macrophage phagocytosis. genetic defects in the enzyme are related to chronic granulomatous disease, a condition in which affected children suffer from recurrent severe fungal and bacterial infections due to defective phagocyte function [ ] . each nox isoform forms heterodimers with a lower molecular weight p phox subunit and is predicted to be membrane-bound. nox is normally quiescent and acutely activated by agonists such as pma, lpg, and cytokines in a tightly regulated process in which cytosolic subunits (p phox , p phox , p phox , and rac in the case of macrophages and dendritic cells, or rac in neutrophil) associate with the nox -p phox heterodimer to initiate enzyme activity [ , ] . nox also has electrogenic features [ ] and in apc cells is linked to the regulation of phagosome/lysosome ph and antigen processing [ , ] . usually, phagosome acidity is maintained by a vacuolar atpase (v-atpase) that transports protons from the cytosol into the phagosome lumen, therefore regulating the function of lysosome proteases in the fused phagolysosomes. savina et al. [ ] [ ] [ ] have shown that nox -derived superoxide in the phagosomal vesicle promptly consumes protons maintaining a higher ph ambient in dendritic cells during particle internalization, which favours antigen processing and presentation [ ] [ ] [ ] . opposite results were found in macrophages [ ] [ ] [ ] . the selective role of nox in different phagocytic cells remains to be defined. the jury is out on whether the results shown in macrophages (association of nox to rab a; a member of rab family of gtpases) are related to vesicle traffic molecule assembly and quality, or rather associated to degradation processes [ ] . nox complex protein expression and function is greatly affected by redox compounds, and it is especially regulated by pdi with implications for cell signalling [ ] . the association of pdi to p phox and other nox isoforms in different cell types, especially in vascular cells, has been previously described [ ] [ ] [ ] . a functional and spatial/physical interaction between pdi and the p phox oxidase subunit was shown in macrophages [ ] and more recently between pdi and p phox in neutrophils [ ] . in macrophages, pdi-nox association was correlated to leishmania infection in vitro [ ] . it is well known that during phagocytosis of leishmania, nox is activated and parasite uptake is inhibited by antioxidants such as catalase [ ] . intriguingly, in the course of promastigote infection, some parasites evade that stressful condition and convert themselves into intracellular amastigotes, multiplying and resulting in progression to a disease process. overall, our studies support the view that parasite phagocytosis/infection by macrophages is a redox process mediated by pdi in at least two ways. initially, pdi-nadph oxidase increases ros production generating an oxidizing milieu, which seems to favour promastigote infection. the downstream role of ros generated by pdi-nadph oxidase remains unknown but can be related to the unfolded protein response signalling [ ] or, similar to pdi-ero , to protein folding in the macrophage er compartment with key implications for antigen processing. nevertheless, at later stages of infection, macrophage pdi physically associates with leishmania elongation factor- (as discussed earlier). some viruses envelop their genetic material within a protein-coated capsid in a further lipid membrane layout, for example, influenza virus, baculovirus, hepatitis-c, hiv, and herpes virus. these enveloped particles require successive steps to successfully entry and infect host cells. they usually first attach onto host receptors (and attachment factors), and their membranes fuse to interact with endosome vesicles that traffic the virus toward the endoplasmic reticulum, where it is uncoated. the proteins are finally transported to the cytosol and nucleus [ ] (figure ). there is convincing evidence showing that most viral infections are strongly influenced by changes in the redox environment and that host pdi mediates infection of enveloped viruses [ ] [ ] [ ] [ ] [ ] [ ] . in the course of hiv infection, the virus first binds to attachment factors, for example, mannose binding c-type lectin receptor and intracellular adhesion molecule (icam- ) on the surface of host cd + t cells. the glycoprotein (gp ) subunit of the virus envelope binds to immunoglobulin g of cd + and undergoes conformational changes, allowing the virus to interact with its coreceptors, cxcr or ccr . these interactions favour downstream conversions of gp envelope subunit to a competent fusion conformation. initial studies showed that membrane-impermeable pdi inhibitors and monoclonal antibodies against pdi prevent hiv- infection [ ] . it was then revealed that the domain d of the cd has redox-active disulfide bonds and is regulated by thioredoxin [ ] . using membrane-impermeable reducing agents (especially arsenical-derived compounds) and labelling thiol reagents, it was demonstrated that cd + reactive thiols critically drive hiv entry into cells [ ] . work from another group also revealed that pdi, on the surface of hiv- target cells, reduces disulfide bonds of the recombinant envelope glycoprotein gp (reaction , figure ), a reaction prevented by the usual pdi inhibitors [ ] . intriguingly, pdi silencing in u and hela cells had little impact on hiv infection itself as compared to the effect mediated by general thiol inhibitors [ ] . the reasons for this discrepancy remain to be elucidated and raise the question whether the reductive effect of pdi is coupled to other redox proteins (e.g., thioredoxin or nox's) that could amplify virus-cd redox association in some cells. it is noteworthy that in these later studies, pdi knockdown on the cell surface was not evident as compared to massive loss of most pdis within the er; an observation that supports the idea that pdi in the er has little impact in hiv-mediated infection [ ] . thiol inhibitors also affect viral fusion as that mediated by the fusion (f) protein from the paramyxovirus newcastle disease virus [ ] . the overexpression of pdi family members pfdi and erdj has also been shown to significantly catalyze the reduction of thiols in f protein, facilitating membrane fusion [ ] . there is evidence suggesting a possible association between pdi and infection mediated by some members of the of herpesviridae viruses family [ ] . pdi is also implicated in the attachment of some bacteria from different species of chlamydia [ ] [ ] [ ] . chlamydia is an obligatory intracellular pathogen that causes diverse diseases in humans. the most common species are chlamydia trachomatis, which is sexually transmitted and can cause blindness and infertility, and c. pneumoniae, which affects the respiratory tract. cho cells have impaired endogenous pdi expression due to a defect in truncated mrna processing, thus providing a valuable model to understand the effect of pdi-mediated cell-cell interaction and infection. these cells are very resistant to chlamydia infection showing impaired attachment, an effect restored by ectopic expression of pdi [ ] . similar to hiv infection, the molecular mechanisms most likely include the reductive activity of pdi (reaction , figure ) on the surface of cho cells [ ] . crossing the endoplasmic reticulum (er) membrane is an irreversible process for most proteins. in some cases, however, this flow is reversed and misfolded proteins retained in the er are retrotranslocated to the cytosol via erad to be degraded by the proteasome. this pathway is also exploited by small pathogens, especially non-enveloped viruses and some bacterial toxins, to gain access to the cytosol. in these cases, antigenic particles that reach the er by different means suffer molecular redox rearrangements and binding to pdi allowing them to be transported back to the cytosol or nucleus. well-known examples are infections mediated by polyomaviruses (py) and simian virus (sv ) extensively studied in the field of carcinogenesis. after sv interaction with the gm receptor on the cell surface, the particle enters the host cell through endocytosis and traffics via the caveosome (a particular caveolin containing endosome with neutral ph) towards the er compartment [ , ] . sv -coated pentamers are linked to each other by disulfide bonds between cysteine (c ). further isomerisation in the er is crucial for the viral uncoating process. in vitro cell screening shows that among all er-resident proteins, pdi and erp more specifically regulate sv infection [ ] . pdi silencing substantially decreases sv infection that is also dependent on some redox sensitive cysteines on the viral particle [ ] . pdi cooperation with erassociated erad proteins derlin- and sel l is ca dependent and facilitates sv traffic through erad [ ] . a similar pathway is used by some nonobligatory intracellular bacteria that exert their effect through production of potent endotoxins, such as diphtheria toxin (dt) and cholera toxin (ct). these proteins function similarly to some plant toxins, such as ricin and abrin. conversion into toxic proteins involves cleavage of their interchain disulfide bond, allowing them to traffic into the endocytic pathway within the host cell [ , ] . in humans, ct is derived from the bacterium vibrio cholerae that causes cholera disease and has subunits (a and a ). the protein first attaches to the host cell surface via gm and the subunit a , which contains a kdel sequence, and is transported back to the er (see earlier discussion). there, pdi reduces and unfolds a and a that exit the er via the sec channel into the cytosol. pdi in the reduced state (reaction , figure ) binds to the toxin and subsequent oxidation of pdi, probably via ero α, enables the release of ct toxin [ , ] . the active polypeptide a efficiently modifies a heterotrimeric g protein in the cytosol that leads to massive loss of chlorine and water secretion by intestinal epithelial cells in mammals, resulting in severe diarrhoea. in this article we have reviewed the main cellular aspects of pdi-mediated host pathogen interactions and the pathways that are involved in viral, bacterial (including bacterial toxins), and parasitic infections. a number of cellular mechanisms through which pdi modulates some specific cellular pathways in immune cells have been described, such as redox-sensitive attachment, antigen presentation in the er and exit from it, and association to phagosome and ros production by nadph oxidase (figure ). many of these responses are antigen-specific and the precise mechanisms of action remain to be fully elucidated, especially in the context of redox changes in cross-presentation phenomena. moreover, little is known about the role of pdi in infection per se, as well as how pdi signals to a more integrated cellular response to stress [ ] . pdi global knockout mice are only viable until birth, but partial gene-modified mice and also modified pathogens will help to reveal the significant redox role of pdi and its redox partners. overall, pdi is a key regulator that may propagate or limit the severity of the infection processes, depending on the infectious organism involved. a better understanding of these complex regulatory steps will provide insightful information on the redox role and coevolutional biological process, and assist the development of more specific therapeutic strategies in pathogen-mediated infections. endoplasmic reticulum mhc: major histocompatibility complex h o : hydrogen peroxide erp : a member of pdi family also know as glucose-regulated protein or -kd (grp ) nf-kb: factor nuclear kappa b ap- : activator protein stat- : signal transducer and activator of transcription . cell entry by enveloped viruses: redox considerations for hiv and sars-coronavirus role of nitric oxide synthesis in macrophage antimicrobial activity nitrogen dioxide and carbonate radical anion: two emerging radicals in biology the human pdi family: versatility packed into a single fold protein disulfide isomerase: a critical evaluation of its function in disulfide bond formation protein disulfide isomerase: the multifunctional redox chaperone of the endoplasmic reticulum generating disulfides enzymatically: reaction products and electron acceptors of the endoplasmic reticulum thiol oxidase ero p the fad-and o -dependent reaction cycle of ero -mediated oxidative protein folding in the endoplasmic reticulum the contributions of protein bisulfide isomerase and its homologues to oxidative protein folding in the yeast endoplasmic reticulum protein disulfide isomerase recycling of peroxiredoxin iv provides a novel pathway for disulphide formation in the endoplasmic reticulum oxidative protein folding by an endoplasmic reticulum-localized peroxiredoxin oxidative folding in the endoplasmic reticulum: towards a multiple oxidant hypothesis? secretion, surface localization, turnover, and steady state expression of protein disulfide isomerase in rat hepatocytes proteins of the pdi family: unpredicted non-er locations and functions identification of a disulfide isomerase protein of leishmania major as a putative virulence factor glutathione limits ero -dependent oxidation in the endoplasmic reticulum protein disulfide isomerase is cleaved by caspase- and - during apoptosis tegumentary leishmaniasis as a manifestation of immune reconstitution inflammatory syndrome in patients with aids the molecular basis of leishmania pathogenesis further thoughts on the use of the name leishmania (leishmania) infantum chagasi for the aetiological agent of american visceral leishmaniasis protein disulfide isomerase (pdi) associates with nadph oxidase and is required for phagocytosis of leishmania chagasi promastigotes by macrophages identification and enzymatic activities of four protein disulfide isomerase (pdi) isoforms of leishmania amazonensis comparative evaluation of two vaccine candidates against experimental leishmaniasis due to leishmania major infection in four inbred mouse strains a developmentally regulated gene of trypanosomes encodes a homologue of rat protein-disulfide isomerase and phosphoinositol-phospholipase c characterization of two protein disulfide isomerases from the endocytic pathway of bloodstream forms of trypanosoma brucei cloning of plasmodium falciparum protein disulfide isomerase homologue by affinity purification using the antiplasmodial inhibitor , -bis{ -[n-(cyclohexyl methyl)amino]propyl}piperazine protein disulfide isomerase assisted protein folding in malaria parasites protein disulfide isomerase of toxoplasma gondii is targeted by mucosal iga antibodies in humans oxidative folding and reductive activities of ehpdi, a protein disulfide isomerase from entamoeba histolytica leishmania antigens are presented to cd + t cells by a transporter associated with antigen processing-independent pathway in vitro and in vivo redox regulation facilitates optimal peptide selection by mhc class i during antigen processing molecular mechanisms of mhc class i-antigen processing: redox considerations redox-regulated export of the major histocompatibility complex class i-peptide complexes from the endoplasmic reticulum a role for protein disulfide isomerase in the early folding and assembly of mhc class i molecules peptide trafficking and translocation across membranes in cellular signaling and selfdefense strategies professional and non-professional phagocytes: an introduction the phagosome proteome: insight into phagosome functions er-mediated phagocytosis: a new membrane for new functions endoplasmic reticulum-mediated phagocytosis is a mechanism of entry into macrophages leishmania parasitophorous vacuoles interact continuously with the host cell's endoplasmic reticulum phagosomes are competent organelles for antigen cross-presentation the known unknowns of antigen processing and presentation quantitative and dynamic assessment of the contribution of the er to phagosome formation calreticulin and calnexin in the endoplasmic reticulum are important for phagocytosis molecular mechanisms of antimony resistance in leishmania dual action of antimonial drugs on thiol redox metabolism in the human pathogen leishmania donovani proteomic approach for identification and characterization of novel immunostimulatory proteins from soluble antigens of leishmania donovani promastigotes identification and characterization of t cell-stimulating antigens from leishmania by cd t cell expression cloning approaches for the identification of potential excreted/secreted proteins of leishmania major parasites nadph oxidases in lung biology and pathology: host defense enzymes, and more nox enzymes and the biology of reactive oxygen the superoxide-generating nadph oxidase of human neutrophils is electrogenic and associated with an h + channel the nadph oxidase of professional phagocytes-prototype of the nox electron transport chain systems phagocytosis and antigen presentation in dendritic cells nox controls phagosomal ph to regulate antigen processing during crosspresentation by dendritic cells rab a regulates phagosomal ph and nadph oxidase recruitment to dendritic cell phagosomes nadph oxidase controls phagosomal ph and antigen crosspresentation in human dendritic cells activation of antibacterial autophagy by nadph oxidases novel role of protein disulfide isomerase in the regulation of nadph oxidase activity: pathophysiological implications in vascular diseases regulation of nad(p)h oxidase by associated protein disulfide isomerase in vascular smooth muscle cells protein disulfide isomerase redox-dependent association with p phox: evidence for an organizer role in leukocyte nadph oxidase activation mechanisms and implications of reactive oxygen species generation during the unfolded protein response: roles of endoplasmic reticulum oxidoreductases, mitochondrial electron transport, and nadph oxidase how viruses enter animal cells inhibition of human immunodeficiency virus infection by agents that interfere with thiol-disulfide interchange upon virus-receptor interaction disulfide exchange in domain of cd is required for entry of hiv-i inhibitors of protein-disulfide isomerase prevent cleavage of disulfide bonds in receptor-bound glycoprotein and prevent hiv- entry role of protein disulfide isomerase and other thiol-reactive proteins in hiv- envelope protein-mediated fusion thiol/disulfide exchange is required for membrane fusion directed by the newcastle disease virus fusion protein overexpression of thiol/disulfide isomerases enhances membrane fusion directed by the newcastle disease virus fusion protein the human cytomegalovirus ul immediate-early regulatory protein is an integral membrane n-glycoprotein which traffics through the endoplasmic reticulum and golgi apparatus attachment and entry of chlamydia have distinct requirements for host protein disulfide isomerase chlamydia attachment to mammalian cells requires protein disulfide isomerase protein disulfide isomerase, a component of the estrogen receptor complex, is associated with chlamydia trachomatis serovar e attached to human endometrial epithelial cells simian virus depends on er protein folding and quality control factors for entry into host cells downregulation of protein disulfide isomerase inhibits infection by the mouse polyomavirus cleavage of disulfide bonds in endocytosed macromolecules. a processing not associated with lysosomes or endosomes cell surface sulfhydryls are required for the cytotoxicity of diphtheria toxin but not of ricin in chinese hamster ovary cells unfolded cholera toxin is transferred to the er membrane and released from protein disulfide isomerase upon oxidation by ero the ero α-pdi redox cycle regulates retro-translocation of cholera toxin protein disulfide isomerase and host-pathogen interaction the authors thank daniel c. pimenta for mass spectrometry experiments (centre for applied toxinology cat/cepid, butantan institute). they also thank dr. joão wosniak jr. from vascular biology laboratory, heart institute (incor) for helpful discussion. this paper was supported by auxílio fundação de amparò a pesquisa do estdado de são paulo (fapesp) and conselho nacional de pesquisa (cnpq), instituto do milênio redoxoma; fundação ej zerbini (incor). a. m. shah, s. ioannis and c. x. c. santos are also supported by the british heart foundation and a leducq fondation transatlantic network of excellence award. key: cord- -j vndxgg authors: tremblay, reynald; feng, mary; menassa, rima; huner, norman p. a.; jevnikar, anthony m.; ma, shengwu title: high-yield expression of recombinant soybean agglutinin in plants using transient and stable systems date: - - journal: transgenic res doi: . /s - - - sha: doc_id: cord_uid: j vndxgg soybean agglutinin (sba) is a specific n-acetylgalactosamine-binding plant lectin that can agglutinate a wide variety of cells. sba has great potential for medical and biotechnology-focused applications, including screening and treatment of breast cancer, isolation of fetal cells from maternal blood for genetic screening, the possibility as a carrier system for oral drug delivery, and utilization as an affinity tag for high-quality purification of tagged proteins. the success of these applications, to a large degree, critically depends on the development of a highly efficient expression system for a source of recombinant sba (rsba). here, we demonstrate the utility of transient and stable expression systems in nicotiana benthamiana and potato, respectively, for the production of rsba, with the transgenic protein accumulated to % of total soluble protein (tsp) in nicotiana benthamiana leaves and . % of tsp in potato tubers. furthermore, we show that both plant-derived rsbas retain their ability to induce the agglutination of red blood cells, are similarly glycosylated when compared with native sba, retained their binding specificity for n-acetylgalactosamine, and were highly resistant to degradation in simulated gastric and intestinal fluids. affinity column purification using n-acetylgalactosamine as a specific ligand resulted in high recovery and purity of rsba. this work is the first step toward use of rsba for various new applications, including the development of rsba as a novel affinity tag for simplified purification of tagged proteins and as a new carrier molecule for delivery of oral drugs. electronic supplementary material: the online version of this article (doi: . /s - - - ) contains supplementary material, which is available to authorized users. soybean agglutinin (sba) is a legume lectin glycoprotein that binds non-covalently to specific cell-surface carbohydrates, provoking agglutination of the bound cells when in solution. sba has been used to fractionate different cell types for use in clinical and biomedical applications. one such application is the separation of pluripotent stem cells from human bone marrow. cells fractionated by sba do not produce graft versus host disease (gvhd) and can be used in bone marrow transplantation across histocompatibility barriers (yura et al. ) . another application is the enrichment and isolation of fetal cells from the blood of pregnant women as a means of detecting fetal genetic and chromosomal abnormalities. it has been shown that the number of fetal erythroblasts recovered by a soybean agglutinin galactose-specific lectin method was approximately eightfold higher than the number obtained by a standardized magnetic cell-sorting (macs) procedure (babochkina et al. ). in addition, sba binds very effectively to some tumor cells and has been used to detect and treat several cancers including breast cancer (pusztai et al. ) . sba and other lectins, which are carbohydrate-binding proteins that bind sugar residues reversibly and specifically, have also been exploited as a ligand for targeted oral drug delivery, because their glycoprotein and glycolipid targets are integral parts of the enterocyte membrane (smart ) . procedures for high recovery and purity of sba have been developed, with extraction of sba from soybean flour resulting in high-quality purification with over % yield (percin et al. ), opening the door for the use of recombinant sba as a potential novel affinity tag for purification of genetically fused proteins. the potential production of a variety of recombinant proteins genetically fused to sba, however, necessitates the generation of an efficient recombinant production system. native sba is produced in the developing seeds of glycine max, with maximum production in seeds reaching over % tsp (lindstrom et al. ). it is relatively simple to obtain large amounts of purified native sba given that soybean seeds express high levels of native sba together with the establishment of efficient purification methods. however, many potential applications of sba in biotechnology and medicine could not be achieved or are difficult to achieve by use of unmodified native sba. for example, it is difficult to use native sba as an affinity tag for protein purification unless the target protein is genetically fused to sba and produced as a recombinant fusion protein. sba has previously been expressed in transgenic tobacco seeds in order to study the upstream and downstream cis-regulatory sequences mediating the expression of native sba (lindstrom et al. ). although a protein band with a molecular mass expected for the monomer of sba protein was seen on western blot, no analysis or characterization of the recombinant protein itself was performed; it was merely used as a marker for gene expression. others have successfully expressed sba in both e. coli and monkey bs-c- cells in an attempt to generate recombinant sba in order to analyze the relationship between glycosylation and tetramer assembly (adar et al. ) . they demonstrated that the recombinant sba retained its agglutination ability and specificity for n-acetylgalactosamine. the sba generated in bacteria, however, lacks glycosylation that, while not required for agglutination or assembly, is important to the stability of the bioactive tetramer (sinha and surolia ) . production in bs-c- cells resulted in a similarly glycosylated protein to native sba with an identical sugar binding profile but for some unknown reason had a lower agglutination ability. the minimum amount of bs-c- cell-derived sba required to induce hemagglutination is - lg/ml, which is - times larger than that of native sba. in addition, for production of a therapeutic protein, animal cells carry an inherent risk of pathogen contamination and have a high production cost due in large part to media costs. therefore, both bacteria and mammalian cells are not ideal bioreactors for sba production for therapeutic uses. plants are a suitable alternative expression system for recombinant sba production. as bioreactors, plants enable unlimited scalability, elimination of product contamination by mammalian pathogens, and reduced production costs compared with microbial or animal cell-based systems (boehm ; ma et al. ; tremblay et al. ). because of their eukaryotic nature, plants can perform the complex post-translational modification and processing required by many transgenic therapeutic proteins for biological and/or immunological function. furthermore, plant bioreactors have the short turn-around time needed to obtain gram quantities of a recombinant protein in a matter of weeks when expressed transiently. this is not only economically advantageous, but is also critical to meeting challenges related to quick access to life-saving biotechnology drugs and therapies. in addition, stable edible transgenic plant tissue might enable direct oral delivery of plant-derived therapeutic proteins and peptides, eliminating the need for expensive downstream protein purification and processing. our long-term objective is to generate recombinant sba for use as a carrier molecule for oral delivery of protein and peptide drugs and as an affinity tag for simplified and high-yield, high purity isolation of recombinant proteins. as a first step toward this objective, we report here the transient production of recombinant sba (rsba) in nicotiana benthamiana (nb), a close relative of tobacco (nicotiana tabacum), and stable production in solanum tuberosum (st) under the control of a ubiquitous camv s promoter. we demonstrate transient expression of sba in nb plants at levels as high as % of total soluble protein (tsp), whereas its stable expression in st tubers reaches . % tsp. furthermore, nbrsba and strsba are similarly glycosylated compared with native sba, retain their ability to induce hemagglutination, bind specifically to n-acetylgalactosamine, are stable in simulated gastric fluid (sgf) containing pepsin at acidic ph and are rapidly isolated in high purity from total soluble protein. the cdna of soybean agglutinin (sba) was cloned from glycine max cdna derived from to mm developing seeds. in brief, total messenger rna was extracted from the seeds using the rneasy plant mini kit (qiagen # ) and converted to cdna by following the superscript ii procedure (invitrogen # ). the primers f aatccatggctacttc aaagttgaaaacc and r tctagattaa tgatgatgatgatgatggatggcctcatgca acacaaaacttg (with addition of a xhis-tag to the c-terminus underlined, restriction sites in bold, and a silent mutation to remove an internal hindiii site italicized) were constructed using the previously published sequence for sba (genbank accession #k . ). the generated sba cdna was then inserted into puc- . after confirmation by sequencing, the sba cdna was inserted into prtl- via digestion with ncoi and xbai, replacing the gus gene, providing a s promoter, and utr (carrington and freed ) . the resulting expression cassette was digested with hindiii and inserted into pbi- to create pbi-rsba. tri-parental mating was used to transfer pbi-rsba to agrobacterium tumifaciens strain lba and confirmed via pcr using specific primers. transient expression of sba in n. benthamiana transient expression of pbi-rsba was accomplished by infection of - -week-old leaves of nicotiana benthamiana as described by sparkes et al. ( ) . briefly, overnight cultures of agrobacterium were grown until density reached . - . a , at which time the cells were centrifuged at g for min, rinsed four times with infiltration medium ( . % d-glucose, mm mes, mm na po , . m acetosyringone), and then resuspended to the desired cell density, between . to . a . the cells were then infiltrated into the abaxial side of the leaves using a -ml syringe and infected tissue was harvested each morning at day through postinfection. for co-infiltration with a second agrobacterium harboring the t-dna vector encoding the p suppressor gene of tomato bushy stunt virus (lakatos et al. ) , the two agrobacterial cultures were mixed at equal concentration before infiltration. stable expression of sba in s. tuberosum solanum tuberosum mini-tubers were generated as described previously (bourque et al. ) . overnight cultures of agrobacterium with pbi-rsba were used to transform tubers as previously described (ma et al. ) . regenerating plantlets were transferred to magenta boxes containing ms media supplemented with lg/l kanamycin. the presence of the transgene in transgenic plants was confirmed by pcr, and pcr-positive plants were then selected to produce mini tubers for protein expression analysis. for mini tuber induction, a stem section ( cm) with one resting auxiliary bud and one fully developed leaf was excised from a sterile magenta-grown transgenic plant. the leaf was removed and the stem was transferred to tuber-inducing medium as described above and placed in darkness at °c. after weeks, the tubers formed ( mm in diameter) were harvested and used for protein analysis. total soluble protein was extracted from infected n. benthamiana leaf tissue or potato tuber and then quantified as described previously . briefly, approximately mg plant material, either solanum tuberosum tuber or nicotiana benthamiana leaf, was ground in a . -ml tube with a plastic pestle and mixed with protein extraction buffer ( mm tris ph . , mm nacl, mm sucrose, mm phenylmethylsulfonyl fluoride, mm edta, mm b-mercaptoethanol, . % tween- , lg/ml leupeptin, lg/ml aprotinin). the mixture was incubated on ice for min and clarified by centrifugation at top speed for min at °c. the supernatant was transferred to a new . -ml tube and used for all subsequent assays. protein samples, boiled or unboiled, were loaded on to a . % sds-page gel and resolved. gels were then either stained with coomassie blue to visualize the protein profile or transferred to a pvdf membrane as described previously ). the blot was then blocked for h at room temperature with % w/v milk in tbs-t and washed times, each time for min, with tbs-t. the blots were incubated overnight at °c in : rabbit anti-sba (cedarlane al- - ) in / blocking buffer / wash buffer. the blots were washed times, each time for min, in wash buffer and incubated in : , goat anti-rabbit-hrp (g- , sigma) for h at room temperature. the blots were washed times, each time for min, in wash buffer and then incubated with supersignal west pico chemiluminescent substrate ( , pierce, rockford, il, usa). blots were exposed and then developed using a film processor. the amount of accumulated rsba in nicotiana benthamiana leaves or solanum tuberosum tubers was quantified by enzyme linked immunosorbent assay (elisa). briefly, commercial native sba standard (sigma l- ) and triplicate ntrsba or strsba samples were bound to well plates by incubation in phosphate buffer overnight at °c. the plates were then washed with pbs-t and blocked with % bsa in pbs-t for h at room temperature. the plates were washed times and then incubated overnight at °c in : , rabbit anti-sba in / blocking: / wash buffer. the plates were washed times and incubated for h at room temperature with : , goat anti-rabbit-hrp in / blocking: / wash buffer and then rinsed times. the plates were incubated with substrate reagent pack (dy , r&d systems) according to the manufacturer's instructions. the color was developed for min and then stopped by addition of an equal volume of m h so . the plate was then read using a thermomax microplate reader (molecular devices, usa). standard curves were calculated and used to determine protein concentrations of the individual samples. negative control was protein extract prepared from wild-type plant tissue. purification of rsba from nicotiana benthamiana or potato tubers was carried out according to the procedure for ge healthcare histrap hp column (# - - ) or using an n-acetylgalactosamineagarose column. for purification with the n-acetylgalactosamine-agarose column, agarose columns with pre-bound n-acetylgalactosamine (sigma a - ml) were washed and equilibrated with . m nacl. the tsp containing rsba was then applied to the column and rinsed with excess . m nacl. samples were taken throughout rinsing and total protein in rinse was determined by spectrophotometry at a . once protein levels were negligible, rsba was eluted with . m galactose/ . m nacl. the purified protein was confirmed via western blot and then desalted via dialysis against excess . pbs buffer. hemagglutination assay was performed using % rabbit red blood cells (rbc) suspended in saline buffer. commercial native sba standard and purified ntrsba and strsba were used for the assay. the proteins were diluted to different concentrations with saline in order to determine the effective unit for each sample, with one unit defined as the amount of sba required to induce agglutination (lin et al. ) . all agglutination assays were repeated in triplicate. rbc ( %, ll) was added to round-bottomed -well plates and then mixed with ll protein dilutions. the mixture was then allowed to settle for h and the results were recorded. competitive sugar-binding assay sugar-binding assay was carried out as described above for hemagglutination, with the following modifications. one unit of native sba or nbrsba or strsba was mixed with saline containing lm or lm concentrations of one of the sugars n-acetylgalactosamine, n-acetylglucosamine, arabinose, lactose, or raffinose. the mixture ( ll) was then added to ll % rbc and incubated for h. the results were then recorded as positive or negative for hemagglutination. deglycosylation of nbrsba and strsba was carried out with pngase f according to the manufacturer's procedure (sigma p- - set). the samples were then loaded on to an sds-page gel followed by western blot analysis using anti-sba antibody. n-linked glycan removal was confirmed via a band shift on the western blot. to confirm the presence of mannose-type glycans on plant-derived rsba, blots containing both pngase f-treated and untreated samples were incubated with lg/ml concanavalin a (c- , sigma) at room temperature for h. after several washes with . % tween- in pbs (ph . ), the blot was incubated with hrp-conjugated anti-cona antibodies (hal- - , e.y. laboratories) at , -fold dilution for h. after the same wash step, the blot was incubated with supersignal west pico chemiluminescent substrate. the signals were then developed using a film processor. analysis of in-vitro digestion of nbrsba was carried out using either simulated gastric fluid (sgf) or simulated intestinal fluid (sif). for sgf ( . g nacl, . g pepsin, ll hcl, in ml h o, final ph . ) and sif ( . g monobasic kh po , . ml . m naoh, . g pancreatin, in ml h o, final ph * . ), purified protein was incubated in a °c water bath with samples taken and mixed in neutralization buffer ( . g na co in ml h o) at time , s, s, min, min, min, and min. samples were boiled for min, separated on a . % sds-page gel and subjected to western blot analysis as described above. control native sba and a non-glycoprotein, human gad made in e. coli (plantigen), were also tested. isolation and cloning of cdna encoding sba to obtain a cdna clone encoding sba, rna was extracted from wild-type soybean seeds that were approximately - mm in size, then converted into cdna that was then used as a template for the pcr cloning of the sba coding sequence. the primers included the addition of a ncoi and xbai restriction sites to facilitate sub-cloning of the pcr products, and addition of the histidine tag to the c-terminus. the cloned sba gene was confirmed by sequencing. for the convenience of sub-cloning, the internal hindiii site within the native sba coding sequence was removed by converting a g to an a in the nd codon position encoding serine, resulting in no change of the amino acid. the complete sba expression cassette was released from the prtl-rsba via hindiii digestion and inserted into pbi . to form pbi-rsba (supplemental fig. ) . the plasmid pbi-rsba was then transferred into a. tumifaciens strain lba for plant transformations. six-week-old n. benthamiana plants were infiltrated with the a. tumefaciens clone harboring pbi-rsba or mixed with cultures of a. tumefaciens containing the p expression cassette, a viral inhibitor of the rna silencing pathway in plants that has proven effective in increasing the yield of transiently expressed proteins (lakatos et al. ) (fig. a) . leaf samples were collected from three independent plants on different days post-infiltration with different concentrations of a. tumefaciens cultures, with an equal amount of p added for each assay, in order to determine the peak of rsba expression in the agroinfiltrated n. benthamiana plants. the results of immunoblot analysis showed that the highest expression using the gene-silencing suppressor p was reached at day , with an optimum concentration of agrobacterium of . a (fig. b,c) . as calculated by elisa, the expression level of rsba at day reached % of tsp (fig. d) . examination of unboiled samples on sds-page gels followed by western blotting showed the presence of two bands, one corresponding in size to the monomer and the other equivalent to the size of the tetramer of sba (supplemental fig. ) , suggesting that n. benthamiana made rsba was assembled into a tetrameric protein complex, essential for its biological activity. stably transformed potato plants were also tested for expression of rsba. thirteen individual transgenic potato lines were generated after agrobacteriummediated transformation and transferred to minituber-inducing magenta boxes in order to generate consistently sized tubers for further analysis. immunoblot analysis of rsba expression in unboiled samples of tuber extract showed two major bands of kda and just over kda (supplemental fig. ) , expected for the monomer and tetramer of sba, respectively. immunoblot analysis of boiled samples of tuber extract showed only the small band of kda (data not shown). the level of strsba accumulation in potato mini-tubers was variable among individual transgenic lines, ranging from no detectable signal to . % tsp (supplemental fig. ) . soybean agglutinin (sba) specifically binds to n-acetylgalactosamine, enabling the purification of this protein via an n-acetylgalactosamine-agarose column to a high degree of purity. purification through this column also serves as confirmation of authentic sba production. figure a shows that sugar-specific purification of rsba from nicotiana benthamiana leaf extracts resulted in efficient protein retention during purification and elution. in addition, the sugar-purified sample was of high-quality when examined on sds-page gel and coomassie blue staining, with negligible contamination of non-specific proteins (fig. b) . sugar-specific purification of rsba from potato mini tubers also resulted in high-quality, purified protein (data not shown). as a control, the isolation of nicotiana benthamiana-derived rsba using a traditional histrap column was also performed (fig. b) . hemagglutinating activity of plant-derived rsba soybean agglutinin is known to agglutinate human and animal red blood cells (hemagglutination). rsba was therefore assessed for its ability to agglutinate rabbit erythrocytes. purified nbrsba was found to c accumulation of rsba at day with different a concentrations of agrobacterium. ? is native sba, nb is nicotiana benthamiana wild-type tsp. d elisa data for accumulation at days - after infection at optimized concentration. values are expressed as a percentage of total soluble protein (tsp), with each bar representing the mean value of the three collected samples repeated in triplicate from each day, with standard error. all western blots were loaded with approximately lg tsp loaded per lane, ng native sba control induce the agglutination (clumping) of rabbit erythrocytes within h of treatment (fig. a) . similar results were obtained with strsba (data not shown). the minimum amount of native sba required to induce agglutination in the assay was . lg, whereas nbrsba and strsba required approximately and . lg, respectively. as agglutination of erythrocytes by sba is because of its ability to bind to specific sugar residues on the cell surface, it is therefore speculated that the agglutinating activity of sba could be inhibited by addition of specific sugars. indeed, addition of n-acetylgalactosamine even at low concentrations was able to prevent the agglutinating ability of nbrsba, similarly to the native sba control (fig. b) . in contrast, the agglutinating ability of nbrsba or native sba was unaffected by high concentrations of non-specific sugars (supplemental table ; see also fig. b ). native sba is a glycosylated protein with a single high-mannose glycan (adar et al. ) . the presence of glycosylation, while not required for assembly or function, is important in maintaining the stability of sba (sinha and surolia ) . to determine the glycosylation status of plant-derived rsba, nbrsba, strsba, and sba standard were digested with pngase f and analyzed by immunoblotting following separation by sds-page gels. both sba standard and plant-derived rsba had a decrease in molecular size of approximately - kda, which is an expected size for a single glycan (fig. a) . to determine the type of glycans of plant-derived rsba, blots containing both pngase f-treated and untreated nbrsba, strsba, and sba standard were incubated with cona, which binds to n-linked mannose, and then with anti-cona antibody. figure b shows that binding of cona occurs only in the untreated samples, with no binding to any of the pngase f treated sba proteins, suggesting that plant-derived rsba contains high-mannose type glycans, similar to native sba. plant-derived rsba was further investigated in simulated gastric and simulated intestinal fluid (sgf and sif) to assess its stability in the digestive system. sgf ( . g/l pepsin, ph . ) has been used to mimic the acidic stomach environment in animals, and sif ( g/l pancreatin, ph . ) is used for proximal small bowel conditions. to this end, purified nbrsba and native sba were used in simulated digestion experiments. the effect of sgf and sif on the degradation of nbrsba, as evaluated by immunoblot analysis, is shown in fig. a ,b. as can be seen, nbrsba remains relatively intact after digestion even for min in either solution. native sba showed similar resistance to sgf and sif digestion (data not shown). to ensure proper activity of the simulated fluids, e. coli-derived recombinant gad (glutamic acid decarboxylase) previously produced in our laboratory was tested as a control. the protein lasts less than min in sgf and min in sif (fig c,d) . we have demonstrated the successful recombinant production of sba through both stable and transient expression in solanum tuberosum and nicotiana benthamiana, respectively. sba has served as an excellent system of choice to study the fundamentals of protein-carbohydrate interaction (loris et al. ; sharon and lis ) . with a vast increase in knowledge of sba and other lectins, it has become apparent that sba has many additional applications, especially in medical and biotechnology-focused industrial areas such as the utilization of sba to selectively remove cancerous cells from whole blood without removing normal t cells, to prevent graftversus-host disease following transplant, or to enrich hematopoietic stem cells (bakalova and ohba ; kernan et al. ; yura et al. ) . moreover, recombinant sba has the potential to serve as a carrier system for oral drug delivery or to be used as an affinity tag for isolation of high-purity fusion proteins. to make these applications feasible, however, a more affordable source of large quantities of high-quality rsba is required. previous recombinant production of sba in tobacco was incidental, because it was simply used as a marker for studying the promoter activity and, as such, no assays were performed to confirm that the protein made retained authentic sba activity (lindstrom et al. ) . similarly, recombinant production in both bacterial and mammalian cell culture was for the purpose of elucidating the functional role of glycosylation and the produced recombinant sba was either lacking glycosylation (e. coli-derived) or had reduced agglutinating ability (mammalian cell-derived) (adar et al. ) . our objective was to develop a robust expression system that can generate large quantities of low-cost authentic rsba, which retains its binding specificity to facilitate high-purity simplified protein isolation and maintains its stability in the gastrointestinal tract (gi) needed for future use as a carrier molecule for oral drug delivery. authenticity of both nbrsba and strsba was confirmed by several results. the foremost is the binding and inhibition studies using rabbit red blood cells. sba binds to different surface proteins bearing n-acetylgalactosamine, and, in the case of red blood cells, results in hemagglutination (gordon and marquard ) . both of the plant-derived rsbas were able to induce hemagglutination of rabbit red in addition, a similar minimum amount of sba, whether native or from n. benthamiana or s. tuberosum, was able to induce agglutination. this suggests that, unlike monkey cell-made sba, nb/strsba is functionally equivalent to native sba. many lectins can induce hemagglutination, however, so it is important to confirm the specificity of rsba for its ligand. for example, the chinese black soybean variety is able to cause agglutination of rabbit red blood cells but is inhibited by a different sugar, in that case melibiose (lin et al. ) . the inhibition of hemagglutination by addition of n-acetylgalactosamine but not other sugars confirms the rsba retains the authentic binding profile of native sba. in addition to this, the ability to specifically purify high-quality rsba via n-acetylgalactosamine bound to agarose further confirms the authenticity of the recombinant protein, as there was no detectable level of rsba in either the flow-through or wash steps, only in the elution step. the capture rate for this technique has been reported to be over % efficient, which is supported by the lack of rsba signal in the flowthrough steps seen in fig. b (percin et al. ). the ability to purify high-quality rsba also enables potential use as an affinity tag. through genetic fusion with a gene of interest (goi), it may be possible to first purify the fusion protein on the sugar columns, followed by subsequent cleavage of the desired goi via specific endoproteolytic cleavage, such as with tobacco etch virus (tev) protease (tubb et al. ) . a second possibility would be in combination with a linker intein, an intervening protein sequence that can be induced to undergo n-terminal, c-terminal or both termini cleavage under specific conditions (evans et al. ) . this could enable cleavage of the goi from rsba without any expensive proteases and result in high-purity protein. it should be mentioned that because of the requirement of the tetrameric formation for proper sba activity, addition of a fusion partner may affect the formation of a tetrameric sba. however, several groups, including our own, have previously demonstrated that other multimer forming proteins, for example the non-toxic b subunit of cholera toxin, retain their ability to form pentamers and bind to their target ligand when fused with other proteins (arakawa et al. ; li et al. ; ruhlman et al. ; tremblay et al. ). the production of rsba was accomplished using both transient and stable transformation of nicotiana benthamiana and potato, respectively. transient gene expression in plants has a number of advantages over stable genomic transformation in plants or other recombinant expression platforms. first and foremost is that new recombinant proteins can be generated in sufficient quantities for in-vitro analysis and in-vivo studies at a speed not possible with traditional bioreactors. this is best demonstrated in the generation of idiotpe-specific scfvs to treat non-hodgkins lymphoma patients. researchers were able to generate patient-specific scfvs by first cloning the antibody fragment sequences from the patient's biopsy and then generated sufficient quantities in plants to inject the plant-made scfvs back into the patient in less than weeks, something unachievable with any other biological system (mccormick et al. ) . transient expression can also be used to generate and validate new vaccines against future pandemics, for example h n or sars, for which large quantities of vaccine are needed in a short-period of time in order to meet world-wide demand. an additional benefit of transient expression in plants is the level of accumulation. in our work, we achieved almost % accumulation of rsba in less than a week. this is a tenfold increase over the accumulation seen in our stably transformed potato lines. based on our data, the transient system can generate an approximate yield of between . and g of rsba per kg fresh weight in week, meaning that a greenhouse setting capable of producing , kg of tissue per week could generate up to kg of purified rsba every week. the use of stable lines expressing rsba can likewise prove advantageous. the production of a stable expression platform provides long-term, lowcost therapeutic proteins, typically required where the therapeutic agent is commonly used in multiple individuals over a long period of time, for example in the potential prevention of type diabetes using glutamic acid decarboxylase (gad ) (ma et al. ) . we were able to accumulate rsba to over . % of tsp in transgenic potato tubers. a major benefit of tuber accumulation is the stability of recombinant proteins during long periods of storage. in the production of monoclonal antibodies, for example, the recombinant antibodies were stable for over months when stored in the dark at room temperature, with little to no loss in either quantity or activity (de wilde et al. ) . this makes potatoes an excellent system for recombinant protein production, because one of the major disadvantages of field/ greenhouse grown plants for recombinant production is losses in yield owing to protein degradation during transit and processing. an additional benefit of the use of tubers is that they can be used to orally deliver the target protein with minimal processing, especially given that potato tubers form a staple part of many cultures' diets and can be processed for consumption via freeze-drying that enables ingestion without cooking and thus could help minimize protein denaturation/loss during food preparation. in-vitro digestion of n. benthamiana-derived rsba under simulated gastric and intestinal conditions proved it to be stable. there was no significant loss or degradation of the protein after incubation of nbrsba in sgf for min, as seen in fig. a ,c. these results suggest that the plant-derived rsba may hold great promise for use as a carrier system for oral delivery of protein and peptide drugs. the oral route for drug delivery is the most preferred route and has considerable advantages: requiring neither sterile needles nor trained personnel, lower cost and increased quality of life, increased access to a large population, reduced side-effects often seen with systemic delivery, and greater patient compliance and acceptability. however, administration of therapeutic peptide or protein drugs by the oral route is a major challenge. orally administered peptide or protein drugs are readily degraded because of their exposure to the harsh environment of the gi (low ph and various proteinases and peptidases). therefore, development of suitable delivery systems is crucial to the success of oral administration of protein drugs. plant-derived rsba may provide an ideal vehicle for oral drug delivery. moreover, sba undergoes endocytosis through the epithelial lining of the intestine, adding to its potential as an adjuvant for therapeutic protein delivery (benjamin et al. ) . a potential concern with the use of rsba as a carrier system for targeted drug delivery is its antinutritional property. indeed, sba is one of the predominant anti-nutritional factors found in the raw soybean and accounts for approximately % of the growth inhibition in rats fed unheated soybean (liener ) . however, its anti-nutritional activity is strictly dose-dependent. a negative effect of sba on growth and immune function in rats was observed only when very high levels of this protein was given in diets ( mg/per rat daily or equivalent to . % of its body weight). in our previous work on oral tolerance induction in mice using plant-derived autoantigenic protein, we showed that only microgram quantities (or . % of the animal's body weight) of the plant-derived antigen is required to induce a response (ma et al. ). this suggests that when used as a carrier system, the minimum effective dose of rsba-antigen fusion protein required to be delivered is in the range of micrograms, not milligrams. delivery of microgram quantities of sba to animals is safe (buttle et al. ; tang et al. ) . however, as with all therapeutics destined for human use, thorough studies would be required to determine the overall bio-safety. in conclusion, our work has demonstrated the feasibility of high accumulation, high-purity recombinant production of sba. this is the first molecular characterization of rsba in plants. the availability of low-cost, high-quality rsba may make many important applications of this protein possible, especially in medicine. synthesis of soybean agglutinin in bacterial and mammalian cells a plant-based cholera toxin b subunitinsulin fusion protein protects against the development of autoimmune diabetes evaluation of a soybean lectin-based method for the enrichment of erythroblasts interaction of soybean agglutinin with leukemic t-cells and its use for their in vitro separation from normal lymphocytes by lectin-affinity chromatography inflammatory and anti-inflammatory effects of soybean agglutinin bioproduction of therapeutic proteins in the st century and the role of plants and plant cells as production platforms use of an invitro tuberization system to study tuber protein gene-expression the binding of soybean agglutinin (sba) to the intestinal epithelium of atlantic salmon, salmo salar and rainbow trout, oncorhynchus mykiss, fed high levels of soybean meal cap-independent enhancement of translation by a plant potyvirus nontranslated region expression of antibodies and fab fragments in transgenic potato plants: a case study for bulk production in crop plants protein splicing elements and plants: from transgene containment to protein purification factors affecting hemagglutination by concanavalin a and soybean agglutinin prevention of gvhd in hlaidentical marrow grafts by removal of t-cells with soybean agglutinin and srbcs molecular mechanism of rna silencing suppression mediated by p protein of tombusviruses expression of cholera toxin b subunit and the b chain of human insulin as a fusion protein in transgenic tobacco plants effects of processing on antinutritional factors in legumes: the soybean case purification of melibiose-binding lectins from two cultivars of chinese black soybeans expression of soybean lectin gene deletions in tobacco legume lectin structure induction of oral tolerance to prevent diabetes with transgenic plants requires glutamic acid decarboxylase (gad) and il- production of biologically active human interleukin- in transgenic tobacco and potato plant-based pharmaceuticals and its application in oral tolerance. in: pontell eb (ed) immune tolerance research development plant-produced idiotype vaccines for the treatment of non-hodgkin's lymphoma: safety and immunogenicity in a phase i clinical study n-acetyl-dgalactosamine-specific lectin isolation from soyflour with poly(hpma-gma) beads uses of plant lectins in bioscience and biomedicine expression of cholera toxin b-proinsulin fusion protein in lettuce and tobacco chloroplasts-oral administration protects against development of insulitis in nonobese diabetic mice carbohydrate-protein interactions oligomerization endows enormous stability to soybean agglutinin: a comparison of the stability of monomer and tetramer of soybean agglutinin attributes of glycosylation in the establishment of the unfolding pathway of soybean agglutinin lectin-mediated drug delivery in the oral cavity rapid, transient expression of fluorescent fusion proteins in tobacco plants and generation of stably transformed plants effects of purified soybean agglutinin on growth and immune function in rats expression of a fusion protein consisting of cholera toxin b subunit and anti-diabetic peptide (p ) from human heat shock protein in transgenic tobacco plants tobacco, a highly efficient green bioreactor for production of therapeutic proteins purification of recombinant apolipoproteins a-i and a-iv and efficient affinity tag cleavage by tobacco etch virus protease selection of hematopoietic stem cells with a combination of galactose-bound vinyl polymer and soybean agglutinin, a galactose-specific lectin key: cord- -ihh q f authors: nan title: posters p - p date: - - journal: eur biophys j doi: . /s - - -x sha: doc_id: cord_uid: ihh q f nan in order to understand mrna stability, we proceed to the studie of life time model system composed of the regb ribonuclease and the ribosomal protein s . the t bacteriophage life cycle is modulated by regb which mediate speci c mrna degradations. regb cleaved the mrna at the middle of the translation initiation region (shine-dalgarno sd). studies had shown that regb activity is enhanced with addition of s which is normaly required for the translation of mrna in case of unusual sd regions. s is considered as a key factor of translation's inititiation. composed of six similar domains (f -f ), it interacts with the ribosome by his f -f domains and with the mrna by f -f . we had shown that the f fragments got the same properties than the whole protein. many global architectures had been published (linear or globular conformation). then we try to understand what is the catalysis way of s in the regb activation and in rna recognition. we study the global conformation of f fragment. for that, we use nmr to determinate the domains interfaces. at rst we made the nmr backbone assignement of n/ c/ h labeled fragments (f -f ). then we analysed hsqc overlapping of each fragments in order to identify interfaces residues. the residues identi cation on s homologous model allowed us to determinate interaction region between domains. currently we study of s -rna interactions to determinate if the ligand conformation could change the interaction region and involve conformationals changes. translocation of amino acids from the membrane interface to the interior: theory and experimiment c. aisenbrey , e. goormaghtigh , j.-m. ruysshaert , b. bechinger ulp/cnrs, chemistry, rue blaise pascal, strasbourg, université libre, campus plaine cp / , brussels the interactions of a series of histidine-containing peptides with biological model membranes have been investigated by attenuated total re ection fourier transform infra red (atr-ftir) and oriented solid-state nmr spectroscopies. related peptides have previously been shown to exhibit antibiotic and dna transfection activities. the -residue lah x and lah x peptides were designed in such a manner to form amphipathic helical structures in membrane environments. four histidines and four/six variable amino acids x constitute one face of the helix whereas leucines and alanines characterize the opposite hydrophobic surface. the dichroic ratio of atr-ftir spectra, or the orientation-dependent n solid-state nmr chemical shift have been used to follow the ph-dependent transition from in-plane to transmembrane alignments upon increase in ph. a theoretical model of the topological modulations is presented and the experimental transition curves analysed in order to reveal the gibbs free energy of transition. the novel concept provides access to the free energy changes associated with the amino acids x incorporated into an extended -helix and in the context of phospholipid bilayers. for the peptides of the lah x series the gibbs free energies associated with the transition from the membrane interface to the bilayer interior follow the sequence of amino acids: l < a i < s f < t g < v w << y. here we present a novel solid-state nmr approach which allows for the accurate determination of the tilt and rotational pitch angles of peptides reconstituted into uniaxially oriented membranes. the method works with transmembrane or in-plane oriented peptides that have been labelled with , , - h -alanine and n-leucine at two selected sites. proton-decoupled n and h solid-state nmr spectroscopy at sample orientations of the membrane normal parallel to the magnetic eld direction have been used to characterize the tilt and rotational pitch angle of these peptides in considerable detail. the same samples when inserted into the magnetic eld at degrees tilted alignments provide valuable information on the rotational diffusion constants in membranes and thereby of the association and size of peptide complexes within the membrane environments. whereas monomeric transmembrane peptides exhibit spectral averaging and well-de ned resonances, larger complexes are characterized by broad spectral line shapes. in particular the deuterium line shape is sensitive to association of a few transmembrane helices. in contrast, the formation of much larger complexes affects the n chemical shift spectrum. aisenbrey ch. & bechinger, b. biochemistry , - ( ) a meccano set approach of joining trpzip a water soluble -hairpin peptide with a didehydrophenylalanine containing hydrophobic helical peptide p. chetal, v. chauhan, d. sahal international centre for genetic engineering & biotechnology,new delhi ,india a residues long, water soluble, monomeric -hairpin peptide "trpzip" [cochran et al ( ) pnas , - ], stabilized by tryptophan zipper has been linked via a tetraglycyl linker to a hydrophobic didehydrophenylalnine ( f) containing helical octapeptide. circular dichroism studies of this residues long peptide, "trpzipalpha" (ac-gewtwddatktwtwte-gggg-fal fal fa-nh ) in water have revealed the presence of both the -hairpin and the helical conformations. this is the rst instance where a f containing peptide has been found to display a helical fold in water. the uorescence emission wavelengths of tryptophan in ac-g-w-g-nh , trpzip and trpzipalpha were . , . and . nm respectively. the uorescence quantum yield of trpzip was . fold higher than trpzipalpha suggesting that proximal interactions between the -hairpin and the helix caused the quenching of tryptophan uorescence in the former by the fs in the latter. the molar ellipticity of the far uv couplet characteristic of trpzip was reduced in trpzipalpha and the cd based thermal melting temperatures at nm were c (trpzip) and c (trpzipalpha). a concentration dependent variable temperature cd study in water showed that in trpzipalpha, increasing temperature is detrimental to the -hairpin, but it augments the helical motif by intermolecular oligomerization. our results show that in water, trpzipalpha exhibits long-range interactions between two different secondary structures. structural-based differential stability in the yoeb-yefm toxin-antitoxin module i. cherny, e. gazit department of molecular microbiology and biotechnology, faculty of life sciences, tel aviv university, tel aviv , israel the speci c physiological role of natively unfolded proteins is only recently beginning to be explored. a notable case in which natively unfolded state appears to have physiological signi cance is the e. coli yoeb-yefm toxin-antitoxin (ta) module. a crucial element in proper functioning of ta systems requires physiological instability of the antitoxin in contrast to the stable pro le of its toxin partner. we have shown that yefm antitoxin is a natively unfolded protein, lacking secondary structure even at low temperatures. in contrast, its toxin partner has a well-folded conformation at physiological temperatures. we suggest that the structural-based differential thermodynamic stability between the two components is the cause for their differential physiological stability, since structural instability of the antitoxin exposes it to cellular quality-control machinery. we further revealed that yefm and yoeb interact and form tight complex and determined it stoichiometry. a potential use of ta systems is as novel antibacterial targets. indeed, we identi ed homologous yefm-yoeb systems in a large number of bacteria including major pathogens. we aim to design peptides capable of interfering with the yefm-yoeb interaction, thus releasing the toxin to execute its detrimental function. for this purpose, we identi ed a short linear determinant within yefm that is involved in yoeb interaction. this peptide motif will be optimized for development of antibacterial lead molecules. a. chatterjee, a. prabhu, a. ghosh-roy, r. v. hosur tata institute of fundamental research, mumbai, india- dynein light chain (ddlc ), a member of the cytoplasmic motor assembly exists as a monomer or a dimer (functional form) under different experimental conditions. here we report the unfolding characteristics of the monomeric ddlc at ph , due to urea and guanidine hydrochloride, by various biophysical techniques. it is observed that the unfolding pathways due to the two denaturants have many differences. urea unfolding seems to be two state, while guanidine unfolding is more complex. the nmr experiments carried out at low denaturant concentrations have enabled detailed characterization of the structure and dynamics of the near native excited states of the protein. these are similar to the native state in structure, except for the small extensions of the helices in the nterminal half of the protein. however, the local stabilities of the and -strands are perturbed and this occurs differently in the two denaturants. in the guanidine case the entire multi-stranded -sheet in the c-terminal half is destabilized. in either case the motional characteristics, seem to suggest the presence of a nite population of the dimer in the excited state ensemble. these states are suggested to be likely intermediates in the momoner-dimer transition, and their characterization here thus provides clues to the molecular mechanism of the transition. it is also envisaged that the near native excited states could play regulatory roles in the functioning of the protein. kinetic bottlenecks identi cation in different folding models f. cecconi , c. guardiani , r. livi infm -istituto di sistemi complessi -cnr, italy, centro interdipartimentale per lo studio delle dinamich complesse, università di firenze, italy, dipartimento di sica, università di firenze, italy the ww domains are a family of fast folding, compact, modular domains featuring a triple-stranded, antiparallel beta-sheet. the ww domain of the pin protein, due to the availability of a complete picture of the residues involved in thermodynamic stability and in the formation of the transition state, in particular, represents an excellent benchmark to test computational methods. the objective of the present work is to identify the kinetic bottlenecks in the folding process through md unfolding simulations at increasing temperatures. the kinetic bottlenecks are related to the establishment of contacts requiring the overcoming of a large entropy barrier and acting as a nucleus for the creation of further contacts. the key sites are therefore those involved in contacts showing a dramatic decrease in fractional occupation near the speci c heat peak. the technique was applied to the go model and to a model based on the knowledge of secondary structure, providing in both cases a picture of the folding process consistent with the experimental data. evidence is also shown that while the go model allows a more accurate prediction of the native structure, the folding pathway is better described by the other model. the protein talin plays a key role in coupling the integrin cell adhesion molecules to the actin cytoskeleton and in integrin activation. the globular head of talin, which binds -integrins, is linked to a rod containing an actin-binding site and binding sites for the protein vinculin, which regulates the dynamic properties of cell-matrix junctions. we have determined the structure of three domains which contain vinculin binding sites (vbss) and shown that each of these are made up of helical bundles. the structures of complexes between vinculin and peptides corresponding to the vbss show that the residues which interact with vinculin are buried in the hydrophobic core of the helical bundles of the talin domains. nmr studies of the interaction of one of these domains with vinculin shows that it involves a major structural change in the talin fragment, including unfolding of one of its four helices, to make the vbs accessible. while the observation of folding of unstructured regions of a protein on interaction with a 'partner' is quite common, this kind of major unfolding to permit a protein-protein interaction is much less common. ways in which it may be regulated will be discussed. conformational changes of eye lens proteins studied by combined saxs and high pressure s. finet , f. skouri-panet , a. tardieu european synchrotron radiation facility, rue horowitz, bp , grenoble france, institut de minéralogie et de physique des milieux condensés, rue de lourmel, paris france, protéines: biochimie structurale et fonctionnelle, case , quai st-bernard, paris france -, -and -crystallins are the main components of vertebrate eye lenses, with exceptional structural and associative properties. the crystallins are known to be exceptionally stable in vivo since they have to last the lifetime of the organism. they therefore represent an extreme case of stability versus unfolding and aggregation. these proteins are mainly beta strands. -crystallins are kda monomers (from to % sequence identity), and -crystallins are large hetero-oligomers of about kda. -crystallins are molecular chaperone; they belong to the ubiquitous superfamily of small heat shock proteins, shsps. here, the conformation and the stability of -and -crystallins were investigated by small angle x-ray scattering (saxs) and high pressure, depending upon temperature and ph. at room temperature, -crystallins have shown a partially reversible change in size from to kb, and this effect was enhanced by the combination of temperature and pressure. in the case of -crystallins and in the pressure range up to kb, the pressure was combined with temperature and ph. the results depend upon the different itself. structural studies on pore-forming peptides s. m. ennaceur , d. bown , j. m. sanderson chemistry department, university of durham, biology department, university of durham the resistance of pathological microorganisms to conventional antibiotic drugs has created a need for new antibacterial agents. biologically active antimicrobial peptides that act as primary defense agents in a large variety of species are thought to have the potential as precursors for a new range of drugs from antibiotics to cancer treatments. this study has attempted to analyse the structural properties of membrane peptides and proteins through the use of model systems that have been designed to mimic their natural counterparts: we have successfully synthesised model membrane peptides with a beta-sheet structural motif and have used a wide range of techniques to analyse their interactions with phospholipid (pc) membranes. the synthetic peptides were very hydrophobic and only soluble in uorinated alcohols such as hfip and to a lesser extent tfe. we found hfip to have a very strong af nity for pc membranes and carried out a series of experiments to investigate this af nity. binding of hfip to pc membranes was found to be reversible and we exploited this property in d crystal trials of our synthetic peptides. we over expressed the c-terminal domain of brka, a gram negative autotransporter protein, which forms a beta-barrel channel in the outer membrane (omp), for comparison with our model peptides. we performed d crystal trials on the omp and imaged the resulting protein arrays by stem and afm. a protein spontaneously folds into a unique native structure in physiological conditions. this process accompanies a huge loss of the conformational entropy (ce). our major concern is to specify the factor that can compete with the ce loss. the previous discussions concerning protein folding have been focused on contributions to the free energy of folding from the interaction potentials in a system. a view lacking in earlier studies is that the folding is critically in uenced by the translational movement of water molecules. when solute particles contact each other in a solvent, the excluded volumes for the solvent molecules overlap, and the total volume available to their translational movement increases. this leads to a gain in the translational entropy (te) of the solvent. this type of te effect should be much stronger in protein folding where the tight packing of the side chains occurs. an elaborate statistical-mechanical theory is employed to analyze the te of water in which a peptide or a protein molecule is immersed. it is shown that the te gain upon folding is large enough to compete with the ce loss. when water is replaced by another solvent whose molecular size is larger, the te gain decreases to a remarkable extent. we suggest that the entropic loss accompanying the self-assembly and the formation of ordered structures in a living system is compensated mostly by the te gain of water, highlighting an aspect of the crucial importance of water in sustaining life. domain ii of ribosome recycling factor is required for disassembly of the post-termination complex p. guo, l. zhang, y. feng, g. jing institute of biophysics, chinese academy of sciences, national laboratory of biomacromolecules, beijing, china ribosome recycling factor (rrf) consists of two domains and, in concert with elongation factor ef-g, triggers dissociation of the post-termination ribosomal complex. however, the exact function of the individual domains of rrf remains unclear. to clarify this, two rrf chimeras, ecodi/ttedii and ttedi/ecodii, were created by exchanging the rrf domains between the proteins from escherichia coli and thermoanaerobacter tengcongensis. the ribosome recycling activity of the rrf chimeras was compared with their wild-type rrfs by using in vivo and in vitro activity assays. the experiments show that like wild-type tterrf, the ecodi/ttedii chimera fails to complement the rrf ts phenotype of e.coli lj (frr ts ) strain and has no polysome breakdown activity. however, under the same conditions, the ttedi/ecodii chimera complements the rrf ts phenotype and has polysome breakdown activity equivalent to that of wild-type ecorrf. the results indicate that domain ii of rrf is the functional domain that is mainly responsible for the disassembly of the post-termination ribosomal complex, and the speci c interaction between rrf and ef-g on the ribosomes mainly depends on the interaction between domain ii of rrf and ef-g; while domain i of rrf is the main contributor for binding ribosomes and maintaining the stability of the rrf molecule. this study provides direct genetic and biochemical evidence for the assignment of individual functions of rrf domains. self-assembly of natural somatostatin into liquid crystalline nano brils w the natural neuropeptide somatostatin- is a cyclic tetradecapeptide hormone, with broad inhibitory effects on both endocrine and exocrine secretions. we report the self-assembly of somatostatin in solution, into stable liquid crystalline nano brils, based on the neuropeptide bioactive backbone conformation. the system was studied as a function of peptide concentration, milieu composition and time, using optical and electron microscopy, x-ray scattering, vibrational spectroscopy and sec/rp-hplc. in pure water, the formation of twisted nano brils (around nm wide and a few microns long) was characterized. their structure relies on the native somatostatin -hairpin and on intermolecular antiparallel -sheets networks. the nano brils were observed to laterally associate further with increased concentration and time, as well as to generate hexagonal phases. increase in ionic strength (sodium chloride, phosphate) was found to signi cantly favor the self-association process. the soft conditions of formation of the somatostatin nano brils support biological relevance, for instance to the biological mechanism of storage of the neuropeptide hormone. unraveling the physical origin of the structure of fully denatured ubiquitin s. golic grdadolnik, f. avbelj national institute of chemistry, hajdrihova , ljubljana, slovenia the structure and dynamics characterization of non-native states of proteins is crucial for understanding the mechanism of protein folding. recently many experimental studies have shown variations of conformational propensity and exibility along the backbone chain of fully denatured proteins. it has been supposed that areas of residual structure may serve as initiation sites of protein folding. however, the physical origin of these variations is still unclear. we analyze the structure of fully urea-denatured ubiquitin. the experimental veri cation of conformational propensities of protein backbone is obtained through structurally dependent nmr parameters. although the secondary structure of ubiquitin under strong denatured conditions is not detectable and no correlation with the native overall topology is found, the variations of nmr parameters along the backbone follow the secondary structure elements of its native state. we show that these variations are in accord with the recently developed electrostatic screening model of denatured proteins ( ). in this model, the backbone conformations of residues in unfolded protein are determined by local backbone electrostatic interactions and their screening by backbone solvation. many virulence factors from gram-negative bacteria are autotransporter proteins. the nal step of autotransporter secretion is c nterminal threading through the outer membrane (om), followed by folding. this process requires neither atp hydrolysis nor a proton gradient. pertactin, an autotransporter from bordetella pertussis and the largest -helix structure solved to date, folds much more slowly than expected based on size and native state topology, yet folding intermediates are not aggregation-prone. equilibrium denaturation results in the formation of a partially folded structure, a stable core comprising the c-terminal half of the protein. examination of the pertactin crystal structure does not reveal the origin of the enhanced c-terminal stability. yet sequence analysis reveals that, despite size and sequence diversity, all autotransporters are predicted to fold into parallel -helices, suggesting this structure may be important for secretion. for example, slow folding in vivo could prevent premature folding of in the periplasm prior to the assembly of the om porin. moreover, extra stability in the c-terminal rungs of the -helix may serve as a template for the formation of the native protein during secretion, and formation of the growing template may contribute to the energy-independent translocation mechanism. coupled with the sequence analysis, these results suggest a general mechanism for autotransporter secretion. thermal and functional properties of e.coli outer membrane protein-receptor fhua fhua is e.coli outer membrane protein, which transports iron into the bacterial cell and also serves as receptor for several phages. in order to get more deep information about structural properties of fhua, we've studied its thermal properties by means of calorimetry. we've also investigated the interaction between t , ira phages and directly fhua by means of viscometry. the calorimetric result of heat denaturation of membrane protein fhua and next deconvolution of the recorded calorimetric curve with two transitions has shown that in a chosen conditions the structure of fhua consists of domains. though both t and ira phages grow on the common bacterial strain (e.coli k ho ), expressing fhua the results of viscometric investigation show that under direct interaction of phages with fhua the receptor activity of protein revealed only for t phage. therefore, we conclude that other than fhua protein serves as receptor for ira phage. it should be mentioned that the phage dna ejection process induced by receptor was observed for the rst time by us in an incessant regime. electron spray ionization mass spectroscopy (esi-ms) is a powerful tool for the investigation of the protein folding or proteins non-covalent interactions in solution since charge state distributions (csds) in esi-ms are affected by the conformational state and mass relates on the association state. we used this tecnique to inquire at different ph and different conditions the dimerization process of the porcine and bovine -lactoglobulins that share a high sequences similarity and close d structures. dimerization oflactoglobulins is reversible and involves both electrostatic and hydrophobic interactions. it was possible to detect simultaneously both the monomeric and dimeric form of the proteins in solution, pointing out the different dimerization behaviour of the two isoforms. we assessed the maximal stability of the dimeric structure at ph for the porcine protein and ph for the bovine one. moreover we showed that bovine lactoglobulin has a stronger dissociation costant than the porcine protein. further we showed that it is possible to modulate the dimerization equilibrium of the bovine isoform at ph both increasing temperature and adding methanol without inducing denaturation of the protein. a possible novel method of protein structure prediction; a. ikehata division of structural bioinformatics science of biological spramolecular systems graduate school of integrated sciences mechanism of protein folding has been mysterious since an nsen's dogma was sugested.here i would like to propose a possible novel method of protein structure prediction; origami method. the method comes from a protein backbone property of uctuation and the residue hydrophobicity. l. marsagishvili , m. shpagina , z. podlubnaya institute of theoretical and experimental biophysics ras, pushchino state university amyloid brils are formed by proteins or their peptides in the result of a conformational transition from alpha helix into beta-sheet structure. despite the different nature of proteins-precursors their amyloids have common properties: beta-pleated sheet structure with individual beta-sheets oriented parallel to the main axis; insolubility in vivo; speci c binding to congo red and thyo avin-t. amyloid deposits are observed in different diseases such as myositis, myocarditis, cardiomyopathies and others. we showed that sarcomeric cytoskeletal proteins of titin family (x-, c-, h-proteins) of rabbit skeletal muscles are capable to form amyloid brils in vitro. these proteins already contain % of beta-sheet structure necessary for formation of amyloids. the amyloid nature of their brils was conrmed by electron, polarization and uorescence microscopy. as x-, c-, h-proteins form amyloid brils easily in vitro, there is a danger of fast growth their amyloid deposits in vivo. taking into account common properties of amyloids formed by different proteins, our results clear the ways for conducting by amyloidogenesis in human organs and tissues. work is supported by rfbr grants - - , "universities of russia" . . and program of the presidium ras "fundamental sciences for medicine". probing the folding capacity and residual structures in -and -residue fragments of staphylococcal nuclease d. liu, x. wang, y. feng, l. shan, j. wang national laboratory of biomacromolecules, institute of biophysics, chinese academy of sciences n-terminal fragments of staphylococcal nuclease (snase) with different chain lengths were used as a model system in the folding study. the detailed characterization of conformational states of - and - residues snase fragments (snase and snase ) and their v w and g w mutants can provide valuable information on the development of conformations in the folding of snase fragments of increasing chain lengths in vitro. in this study, the presence of retained capacity for folding and residual structures in snase and snase is detected by cd, uorescence, ftir, and nmr spectroscopy. snase is represented as an ensemble of interconverting conformations. the uctuating nascent helix-and âsheet-like structures, localized in regions of a -a and t -v , respectively are transiently populated in snase . the native-like tertiary conformations are obtained for g w and v w and for snase in the presence of . m tmao. analysis of the results of such studies indicate that folding of snase fragments is dominated by developing the local and non-local nucleation sites from native-like secondary structures and by intensifying the longrange interactions of residues at nucleation sites with residues further removed in sequence. thermal disruption of a spanning network of hydration water and conformational changes of elastin a. krukau , i. brovchenko , a. oleinikova , a. geiger international max-planck research school in chemical biology, otto-hahn str. , d- , germany, physical chemistry, university of dortmund, otto-hahn str. , d- , germany hydration water strongly in uences the structural and dynamical properties of biomolecules. the existence of a spanning hydrogenbonded network formed by the hydration water enables the function of biomolecules at low hydration levels. we can expect that the formation/disruption of the spanning network formed by the hydration water in solution also affects crucially the protein properties. we present the rst computer simulation study of the thermal disruption of a spanning network formed by the hydration water of a biomolecule (elastin-like peptide). this process obeys the laws of d percolation transition, similarly to the formation of a spanning water network with increasing hydration level [ ]. the spanning water network transforms into an ensemble of small water clusters with increasing temperature: it is still permanent at k and exists with probability % at k. in the same temperature interval, the conformation of the peptide changes noticeably: its radius of gyration increases sharply (by %) at about k. these two phenomena may be related to the "inverse temperature transition" at about k, where an elastin solution separates into two phases. in our simulations, the displacement of hydration water by the addition of a denaturant (urea) or by other peptide molecules causes an even stronger increase of the radius of gyration (up to % lipidic cubic phases formed by distinct water and lipid volumes provide bicontinuous d bilayer matrices that have speci c and controllable water channel sizes and large surface areas. these systems have proven to be also valuable as membrane mimetic structures, as promising matrices for controlled-release and delivery of proteins, vitamins and small drugs in pharmacological applications, and they offer a d lipid matrix for successful crystallization of membrane proteins which do not easily crystallize in bulk solution. the present study is directed towards a better understanding of the interplay between curved cubic lipid phases and the protein entrapped within their aqueous channel structures. as model systems, we have chosen a cubic ia d phase formed by an uncharged lipid, monoolein, and incorporated different proteins, such as cytochrome c, -chymotrypsin and insulin, in its narrow water channels. we show that the protein secondary structure and unfolding behaviour may be in uenced by the con nement and, vice versa, the topology of the lipid matrix may change as a function of protein size and concentration. in fact, even new cubic lipid structures may be formed that are not known in pure lipid systems. furthermore, we compare the aggregation scenario of insulin in bulk solution and in the narrow water channels of the cubic lipid matrix and discuss the differences found in terms of the geometrical limitations imposed by the con nement. after endocytosis, i.e. at acidic ph, the t domain inserts in the membrane of the target cell and helps the translocation of the catalytic domain into the cytoplasm. therefore, the t domain has a key role in the strategy of internalization of the toxin. the study of the interaction of the t domain with membranes and its ph dependence is important for a better understanding of the diphtheria toxin translocation mechanism. at least, two steps can be distinguished during the membrane insertion of the t domain. the rst step involves hydrophobic interactions with the membrane and is related to the ph-induced stabilization in a molten-globule state. in the second step, electrostatic interactions are preponderant and the ph-sensitivity comes from changes of the balance between repulsive and attractive electrostatic interactions. the role of the n-terminal part of the t domain in the second step has been investigated by studying peptides corresponding to the amphiphilic helices found in this part of the domain. the results are correlated with those obtained with a single trp mutant probing the n-terminal region of the whole domain. the translocation mechanism will be discussed in view of the physico-chemical properties of the peptides. in complex systems with many degrees of freedom such as peptides and proteins there exist a huge number of local-minimumenergy state. one way to overcome this multiple-minima problem is to perform a simulation in a generalized ensemble where each state is weighted by a non-boltzmann probability weight factor so that a random walk in potential energy space may be realized (for reviews, see refs. [ ] [ ] [ ] xas spectroscopy results show that there are two different structures of the metal binding site in the a peptide according to whether they are complexed with cu or zn ions. while the geometry around copper is suggestive of an intra-peptide binding with three histidine residues bound to the metal, the zinc site geometry is compatible with an inter-peptide aggregation mode. this result reinforces the hypothesis that assigns opposite physiological roles to the two metals, with zinc favouring and copper blocking peptides aggregation and consequent plaque formation. effect of pressure on the conformation of proteins. a molecular dynamics simulation of lysozyme a. n. mccarthy, r. grigera instituto de física de líquidos y sistemas biológicos (iflysib), conicet-unlp-cic, university of la plata, argentina the effect of pressure on the structure and dynamics of lisozyme was studied by md computer simulation at bar ( pa) and kbar using gromacs package. all-atoms (ff gmx) force eld were used for the minimization process and for all the md simulation and kept all protein bond lengths constrained (lincs algorithm). water molecules (spc/e model) were constrained using the settle algorithm. for the electrostatic forces we applied the reaction field method. lennard-jones interactions were calculated within a cut-off radius of . nm. the results have good agreement with the available experimental data, allowing the analysis of other features of the effect of pressure on the protein solution. the studies of mobility show that although the general mobility is restricted under pressure this is not true for some particular residues. from the analysis of secondary structures along the trajectories it is observed that the conformation under pressure is more stable, suggesting that pressure acts as a 'conformer selector' on the protein. the difference in solvent accessed surface (sas) with pressure shows a clear inversion of the hydrophilic/hydrophobic sas ratio, which consequently shows that the hydrophobic interaction is considerably weaker under high hydrostatic pressure conditions. direct observation of mini-protein folding using uorescence correlation spectroscopy h. neuweiler, s. doose, m. sauer applied laser physics & laser spectroscopy, university of bielefeld, bielefeld, germany the "trp-cage" motif represents the smallest and one of the fastest folding mini-proteins known to date. the globular fold is characterized by a hydrophobic core burying a single tryptophan (trp) residue. here, we report on the direct observation of trp-cage folding kinetics using uorescence correlation spectroscopy (fcs). our method is based on the selective uorescence quenching of oxazine dyes by trp which becomes ef cient only upon contact formation between the dye and the indole moiety of trp. by sitespeci cally labeling the dye to trp-cage, temporal uorescence uctuations of the dye-peptide conjugate, caused by intramolecular contact formation between dye and trp, directly report on conformational dynamics and folding transitions of the peptide chain. in order to measure uorescence uctuations directly in solution we used fcs on a confocal uorescence microscope setup. fcs allows us to reveal conformational dynamics with nanosecond timeresolution, under thermodynamic equilibrium conditions, and in highly dilute solutions (i.e. at nano-molar sample concentrations). our method con rms microsecond folding kinetics of the trp-cage motif, previously estimated with non-equilibrium temperature-jump techniques. we further investigated stability and folding rates under denaturing conditions and at various temperatures, giving further insight into structural transitions during the folding process. identi cation and mutagenesis of a region of tnt required for the stability of tnt-tni coiled-coil evolutionarily conserved heptad hydrophobic repeat (hr) domains present in troponin subunits tnt and tni are involved in alpha helical coiled-coil formation. using recombinant peptides from fast-skeletal tnt and tni, we examined the contributions of amino acid residues within these hr domains as well as anking these domains, to the stability of the coiled-coil interaction. a series of tnt fragments were tested for their ability to form coiledcoil with tni hr domain. we show that the tnt region - , although remains outside of the coiled-coil domain, is absolutely required for the stability of the coiled-coil. interestingly, the region tnt - contains few absolutely conserved residues that are potential candidate for ionic interaction, as predicted by molecular modeling. using single point mutants we show that among all the conserved residues, residue lysine is most important in stabilizing the coiled-coil interaction, whereas others play accessory role. we propose that the lysine initiates the stabilization of the coiled-coil interaction and then the other residues acts in a zipper like fashion. magnesium promotes conformational switching of ca sensor s. mukherjee, k. v. r. chary tata institute of fundamental research the importance of mg , one of the most abundant metal ion in the cell cytoplasm, relating to the calcium sensor mechanism is demonstrated. in this study it has been shown that the amino acid long calcium binding protein from entamoeba histolytica (ehcabp) can exist in three different forms namely, the calcium-free (apo) form, the magnesium bound (apo-mg) form and the calcium bound (holo) form. these three forms have been characterized using chromatographic, calorimetric as well as various spectroscopic techniques. there is a radical difference in the stability between the calcium free and ca bound forms. the calcium free form has molten globule like characteristics. mg stabilizes the closed conformation of the apo form, where the hydrophobic core remains buried. the presence of mg signi cantly alters the calcium binding cooperativity thereby increasing the cooperativity of the conformational switching between the open and closed conformation which is an important aspect of such regulatory proteins. a structural model for the molten-globular form of apo-ehcabp and its equilibrium folding towards completely folded holo state in presence and absence of mg will be presented. intermediate states of formin binding protein ww domain: explored by replica-exchange simulation y. mu school of biological science, nanyang technological university, singapore ww domain of formin-binding protein (fbp) is a model system for beta strand folding study. although it is small, only amino residues in total, the folding kinetics of fbp ww domain proved to be biphasic. an extensive molecular dynamics-monte carlo hybrid method, called replica-exchange simulation strategy is employed to study the folding/unfolding of the fbp ww domain in explicit water model. begin with randomly chosen conformations from high temperature unfolding trajectory distributed in replicas ( different temperatures covering from k to k), the simulation lasts nanoseconds in each replica. in the end an interesting distribution of conformations shapes up. we nd that there are three distinctive subgroups, one being the unfolded conformations with rmsd averaging around Å , one being native conformations with rmsd . Å, more interestingly, the third group having rmsd Å, an intermediate folding ensemble. by checking the intermediate ensemble in detail, we nd that it is quite heterogeneous and the heterogeneity mainly comes from the exibility of the c-terminal loop region. our ndings provide a microscopic picture of folding kinetics of the ww domain: the stable intermediate states with mis-registered hydrogen bonds on the c-terminal beta strand make this peptide folding as a three-state folding model rather than a usual two-state model. h. rezaei, f. eghiaian, j. perez, y. quenet, y. choiset, t. haertle, j. grosclaude institut national de la recherche agronomique, france in pathologies due to protein misassembly, low oligomeric states of the misfolded proteins rather than large aggregates play an important biological role. to get better insight into the molecular mechanisms of prpc/prpsc conversion, we studied the kinetic pathway of heat induced amyloidogenesis of the full length recombinant ovine prp (arq) at ph . . according to the size exclusion chromatography experiments, three sets of oligomers were generated from the partial unfolding of the monomer. the effect of concentration on the oligomerization kinetics was different for the three species obsreved suggesting that they are generated from distinct kinetic pathways. limited proteolysis and peptide analysis of the two best separated peaks showed a difference in the accessibility of the c-terminal domain of these two oligomers, and allowed the identi cation of regions undergoing a structural change during the conversion process. the analysis of correlations between oligomer populations as well as numerical kinetic modeling led us to propose a multi-step kinetic pathway describing the evolution of each species as a function of time. the existence of at least three distinct oligomerization pathways on one hand and the differences in the accessibility of the two puri ed oligomers on the other hand re ect the structural plasticity of the prp protein. studying the mechanism of retention of ovine prion protein in soils will tackle the environmental aspect of potential dissemination of scrapie infectious agent. the conformational transition from the monomeric cellular prion protein prp c in -helical structure into the aggregated -sheet-rich multimer prp sc is supposed to be responsible for the so-called prion diseases. it is commonly admitted that the recombinant prp could serve as model of conversion of the normal prion protein prp c . fundamentally, the interaction of proteins with surfaces either uid or solid involves both protein binding and unfolding. our goal in studying protein adsorption is to determine the nature and the amplitude of the structural changes occurring during non-speci c adsorption. the protein-clay interaction depends on several parameters such as protein hydration, net charge, charge distribution on the protein surface. ftir spectroscopy is well-suited to probe structural changes of proteins at a molecular level at aqueous/solid interfaces. the conformational states of the full-length ovine prp adsorbed on the electronegative clay surface are compared to its solvated state in deuterated buffer in the pd range . - , using ftir spectroscopy. during the intoxication of a cell, the diphtheria toxin binds to a cell surface receptor, is internalized and reaches the endosome. the translocation (t) domain from the toxin interacts with the membrane of the endosome in response to the acidic ph found in this compartment. this process drives then the passage of the catalytic domain of the toxin through the membrane into the cytoplasm. the interaction of the t domain with the membrane involves at least two steps. in the rst step occurring at ph , t adopts a molten globule conformation, which is able to bind super cially to the membrane through hydrophobic interactions by the c-terminal region (helices th and th ). in a second step occurring between ph and , penetration into the membrane involves electrostatic interactions. this step leads to a functional inserted state. trp was mutated to phe in order to use trp located in helix th as a probe of its behavior during the interaction with the membrane as a function of ph. we found that the second step is correlated with the reorganization of the n terminal region in the membrane and is controlled by electrostatic interactions. peptide conformational search using generalized simulated annealing method p. g. pascutti , f. p. agostini , c. osthoff , k. c. mundim , m. a. moret ibccf -ufrj -brazil, lncc -brazil, iq-unb -brazil, ceppev -fundação visconde de cairú / df-uefs -brazil the three-dimensional structure of proteins is mainly determined by the sequence of amino acids, making possible the development of ab initio methods for peptides and protein structure prediction. we proposed a stochastic method based on a classical force field and the generalized simulated annealing (gsa), which utilizes tsallis generalization of boltzmann-gibbs statistics. we have applied this method for peptide conformational search and as a complement for comparative modeling of proteins, searching the conformation for loops and mismatched sequence alignments. the gsa ef ciency depends on the right choice of the parameters involved in the conformational calculation: the qv parameter which de nes a function for visiting the molecular energy surface; and the qa parameter de ning an acceptance probability, both according with tsallis statistics. to avoid the conformational trapping in local energy minima we introduced a new parameter in this work, qt, to control the temperature decreasing. to investigate the qv, qa and qt best parameters set, we used the -alanine and -alanine peptides, which have a known alpha-helical structure in low dielectric environment. the global minimum energy occurs for the alpha-helix folded structure, and was found for qv ranging from . to . , qa from . to . , and qt from . to . . we observed also that convergence values for qv decrease while for qa and qt increase for folded structures. p. s. santiago, É. v. de almeida, m. tabak instituto de quimica de são carlos-usp cp são carlos sp brasil extracellular hbgp is a giant hemoglobin, similar to other annelid hemoglobins, having a molecular weigth of . mda. the effect of ctac in the oligomeric protein structure was assessed by optical absorption and emission spectroscopies. optical absorption spectra of hbgp . mg/ml as a function of ctac concentration from up to . mm evidentiate changes both in soret band at nm, and at nm associated to light scattering which appears at low ctac concentration. below mm of surfactant extensive light scattering occurs together with signi cant shifts of max of soret band from nm to around nm, which is probably due to oxidation of the original oxyhbgp. light scattering reaches a maximum value disappearing for higher ctac concentrations. fluorescence spectra show a signi cant increase in intensity ( fold) upon titration with ctac. this is consistent with the dissociation of the oligomer with signi cant reduction of intrinsic quenching of tryptophan uorescence due to the heme groups. similar data were obtained at protein concentration of mg/ml. in this case a signi cant increase of light scattering is observed with protein precipitation at a narrow ctac range followed by re-disolution at higher ctac concentration. differently from anionic sds surfactant, cationic one induces protein aggregation. support: cnpq and fapesp. in situ formation of silk protein nano-particles studied by small angle x-ray scattering m. w. roessle , c. riekel , d. i. svergun european molecular biology laboratory; outstation hamburg/germany, european synchrotron radiation facility; grenoble/france silk threads from the mulberry silkworm bombyx mori are used for the production of textiles since centuries. in modern applications silk proteins are chosen because of their good tensile strength, their high biocompatibility as well as of the resorbability for the human body. however, the processing of silk by the silkworm is barely understood. the silk proteins are stored as a solution in the glands of the silkworm and processed during the spinning into a co-block polymer like ber. in a rst step the random coil silk proteins are transformed into molecules with beta-sheet subdomains, which provide a protein-protein interface for the ber assembly. this transformation can be mimicked by a rheometer applying a shear force to the silk protein solution. applying combined small/wide angle x-ray scattering (saxs/waxs) transient formed silk nanoparticles upon increasing shear force were found. further details of these macromolecules were derived by xing the transient state with chemicals such as polyethylene oxide (peo). the resulting data can be analyzed in detail by saxs data evaluation software and low resolution models of the found nanoparticles were derived. moreover, the internal structure of the particles was explored as well as suggestions for the silk processing of the silkworm could be made. on the three-dimensional information of a protein sequence v. a. risso, l. g. gebhard, r. g. ferreyra, j. santos, m. noguera, m. r. ermacora universidad nacional de quilmes conicet in this work, lactamase of b. licheniformis (es l) was used as an experimental model to (a) study the conversion of sequential information into d structure and (b) to investigate the distribution of conformational information in the polypeptide chain. by a novel approach, over thirty connectivity variants of the polypeptide chain were prepared; witch were also nterminally truncated to a variable degree. the variants produced in e. clil were puri ed to homogeneity, refolded, and its structure content analyzed by circular dichroism, hydrodynamic behaviour and aggregation state. several variants were dimeric in solution, suggesting a possible general inespeci c stabilization mechanism. most variants were compact and had different degrees of secondary and tertiary structure. a strikingly large number of variants showed native like spectroscopic signatures and signi cant enzymatic activity, which means that the very elaborate active site of beta lactamase is formed, at least in fractions of the molecules, despite the absence of long stretches of sequence. these ndings are discussed in the light of the current knowledge of the protein folding process. var and lgg contributed equally to this work. on the three-dimensional information of a protein sequence v. a. risso, l. g. gebhard, r. g. ferreyra, j. santos, m. e. noguera, m. r. ermacora universidad nacional de quilmes conicet in this work, lactamase of b. licheniformis (es l) was used as an experimental model to (a) study the conversion of sequential information into d structure and (b) to investigate the distribution of conformational information in the polypeptide chain. by a novel approach, over thirty connectivity variants of the polypeptide chain were prepared; witch were also nterminally truncated to a variable degree. the variants produced in e. clil were puri ed to homogeneity, refolded, and its structure content analyzed by circular dichroism, hydrodynamic behaviour and aggregation state. several variants were dimeric in solution, suggesting a possible general inespeci c stabilization mechanism. most variants were compact and had different degrees of secondary and tertiary structure. a strikingly large number of variants showed native like spectroscopic signatures and signi cant enzymatic activity, which means that the very elaborate active site of beta lactamase is formed, at least in fractions of the molecules, despite the absence of long stretches of sequence. these ndings are discussed in the light of the current knowledge of the protein folding process. var and lgg contributed equally to this work. unfolding of the extrinsic proteins of photosystem ii ( kda, kda and kda) induced by pressure unfolding of the three extrinsic proteins of spinach photosystem ii induced by pressure has been systematically investigated. thermodynamic equilibrium studies indicated that these proteins are very sensitive to pressure. at o c all the proteins show a reversible unfolding transition by about mpa for kda, mpa for kda and mpa for kda. the ph and temperature dependence of pressure unfolding of these proteins were explored. the stabilization effect of reagents sucrose etc on the proteins was found to associate not only with the increase in the unfolding free energy, but also with the reduction of the absolute value of v u . pressure-jump studies of unfolding of kda protein revealed a negative activation volume for unfolding and a positive activation volume for refolding, indicating that in terms of system volume, the transition state lies between the folded and unfolded states. comparison of temperature dependence of v u # , v f # and v u indicated that the thermal expansivities of the transition state and the unfolded state are similar and larger than that of the folded state. aggregation-prone intermediate protein structures on the refolding pathway l. smeller , j. fidy , k. heremans semmelweis university dept. biophysics and radiation biology, budapest, hungary, department of chemistry, katholieke universiteit leuven, belgium the folding of the polypeptide chain into a native conformation can be studied by experimental systems, where the environmental parameters causing the denatured state can be easily and fast eliminated. one of such parameters is the high hydrostatic pressure. refolding of the unfolded protein can be studied after decompression. the refolding pathway can contain several intermediate states. on the other hand, the destabilization of the native proteins can populate conformations, where the polypeptide chain is not completely folded. these metastable conformations can easily aggregate. deposition of insoluble protein aggregates plays a crucial role in the conformational diseases (parkinson, alzheimer's disease, amyloidozis, etc.) stability and conformation of the above mentioned metastable conformations were investigated in case of myoglobin, apo-horseradish peroxidase, lipoxygenase. fluorescence and infrared spectroscopy and light scattering experiments were used to explore not only the structure but also the aggregation af nity of the intermediates in case of all the above mentioned proteins a well de ned temperature range could be determined, where the metastable not completely folded structures were populated considerably during the refolding process. these intermediate conformations were significantly more aggregation prone, than the native conformers existing in the same temperature range. aggregation processes in beta-amyloid peptides: effects of molecular chaperons a. sgarbossa, d. buselli, f. lenci cnr istituto bio sica, pisa, italy several neurodegenerative pathologies, like parkinson's, hungtington's and alzheimer's diseases, are related to the formation of small peptides aggregates, which amyloid brils originate from. understanding the molecular mechanisms responsible for these processes can, therefore, contribute to clarify the origin and, hopefully, to control the development of the afore mentioned diseases. here we report the results of an in vitro study aiming to affect the aggregation kinetic of - and - beta-amyloid peptides by means of an endogenous chaperone-like protein (alpha-crystallin) and an exogenous polycyclic aromatic pigment (hypericin) that can perturb the aggregation process through stacking interactions with the peptides aromatic residues. because of the well known problems in getting reproducible and reliable results, particular attention has been devoted to carefully check the preparation procedures of the samples. the effects of both alpha-crystallin and hypericin on the self-assembly process have been examined at different times of the aggregation kinetics. the results are discussed in relation with the involvement of different molecular structures in the amyloid brillation phenomenon. remodelling the folding of thioredoxin by removal of the c-terminal helix j. santos, j. m. del no iquifib and departamento de química biológica, facultad de farmacia y bioquímica, uba, junín , c aad buenos aires, argentina e. coli thioredoxin (trx) is a monomeric / protein of amino acids with a fold characterized by a central beta sheet surrounded by alpha helices. two subdomains are topologically noticeable, but it is unclear whether their folding occurs in a concerted fashion. subdomain trx - has been extensively studied as a model of a partially folded, with no tertiary or persistent secondary structure. this work describes the expression and characterization -by circular dichroism (cd), uorescence emission, size exclusion chromatography, chemical cross-linking and light scattering-of a novel engineered fragment (trx - ) lacking the last stretch of amino acids. after refolding from inclusion bodies, trx - shows a strong propensity to form soluble oligomers endowed with distinctive optical properties unlike those observed for the full protein. although trx - also shows signi cant changes in secondary structure, trp residues appear to occupy rigid and apolar environments. these ndings support the existence of an alternatively folded form for trx - . in addition, the secondary structure content of chemically synthesized peptide trx - and its ability to complement fragment trx - upon refolding were also evaluated by cd. taken together, the data herein presented shed light upon issues such as the distribution of information content relevant for folding along the polypeptide chain in regard to conformational stability. with grants from anpcyt, ubacyt and conicet. thermal aggregation of two "beta-protein" models at different ph values v. vetri, f. librizzi, v. militello, m. leone physical and astronomical sciences, univ. of palermo, italy & infm the structural stability of proteins strongly depends on the environment and the lost of this stability may trigger a partial unfolding, leading in turn to the formation of aggregates. such processes have been extensively studied also in view of their biotechnological and medical implications. in fact a large number of diseases is associated with protein misfolding and aggregation. conformational changes play a keyrole in the aggregation processes and have their onset under particular external conditions. the aggregation pathways and the topology of the obtained supramolecular structures sizeably depend on the details of the involved conformational changes, which are determined by the details of the external conditions. here we present an experimental study on thermal aggregation processes of two model proteins mainly composed from structures: -lactoglobulin and concanavalin a, at different ph values. the conformational changes of the proteins (whose association state depends by ph) and the aggregation pathways were monitored by intrinsic and suitable external dyes uorescence. at the same time, the growth of supramolecular structures was followed by measuring the rayleigh scattering of the excitation light. secondary structure changes were followed by circular dichroism measurements. the results show that at different ph values the aggregation processes of both proteins follow different pathways determined by the variations in the native structure and by the details of the involved conformational changes. a. verma , t. herges , w. wenzel iwr, forschungszentrum karlsruhe, po box , , karlsruhe, germany, int, forschungszentrum karlsruhe, po box , , karlsruhe, germany ab-initio protein structure prediction and the elucidation of the mechanism of the folding process are among the most important problems of biophysical chemistry. investigations of the protein landscape may offer insights into the folding funnel and help elucidate folding mechanism and kinetics. we investigate the landscape of the internal free-energy of the amino acid villin headpiece with a modi ed basin hopping method in the all atom force eld pff , which was previously used to fold several helical proteins with atomic resolution. we identify near native conformations of the protein as the global optimum of the force eld. more than half of the twenty best simulations started from random initial conditions converge to the folding funnel of the native conformation, but several competing low-energy metastable conformations were observed. from , independently generated conformations we derived a decoy tree which illustrates the topological structure of the entire low-energy part of the free energy landscape and characterizes the ensemble of metastable conformations. these emerge as similar in secondary structure content, but differ in the tertiary arrangement. exploring the free energy landscape of a folded protein by means of afm stretching experiments m. vassalli , b. tiribilli , l. casetti , a. torcini , a. pacini , a. toscano isc -cnr, florence -italy, csdc, univ. of florence -italy, anatomy dept, univ. florence -italy the aim of our work is to study the mechanically induced folding and refolding of single proteins by means of an atomic force microscope (afm). the resulting data will be analyzed with theoretical methods, both to determine the folding pathway and to gain information on the energy landscape of real systems. recently, experiments employing atomic force microscopy have shown that mechanical and thermal unfolding share several common features. we are using an afm to perform mechanical stretching experiments on single biomolecules. the experiments that can be performed are particularly well-suited to reconstruct the folding-unfolding pathways as well as the free energy landscape of the examined protein. in particular we are interested in the free energy pro le associated to titin and elastin, by considering a periodic loading of the afm cantilever, instead of the usual linear ramp, and measuring the force as a function of displacement. these experiments will be complemented by theoretical and numerical studies. molecular dynamics simulation of simple models but including the experimental geometry, will allow to examine in detail the effect of different experimental procedures (periodic loading versus linear ramp) proposed to reproduce equilibrium energy landscapes. moreover, we will investigate the limit of applicability of the jarzynski's equality which has been claimed to be able to be used to extract equilibrium results from non equilibrium measurements. association of subunits is a prerequisite for formation of the native structure of the dimeric ipmdh a. varga, É. gráczer, i. hajdú, p. závodszky, m. vas institute of enzymology, biological research center, hungarian academy of sciences, budapest, hungary to answer the question whether subunits are autonomous folding units, or their association at an early stage of folding is required for formation of the native protein structure, denaturation-renaturation experiments were carried out with the dimeric isopropylmalatedehydrogenase (ipmdh). denaturation was induced by guanidine hydrochloride, renaturation was initiated by dilution and followed by activity measurements and uorimetry. reactivation is a complex process with an initial lag phase, indicating the presence of an inactive intermediate. the kinetics of the process is independent of protein concentration, suggesting that association of the two polypeptide chains takes place much faster than the rate limiting rst order isomerisation step(s). restoration of protein uorescence during renaturation is also protein concentration independent, biphasic process, however the initial lag phase is replaced by an even faster burst increase of uorescence. the rst step leads to formation of an intermediate with a native-like uorescence spectrum. based on our experiments the following mechanism is proposed for refolding of ipmdh: d+d i i n , where d is denaturated monomer, i and i are inactive dimer intermediates, n is native dimer, that means initial association of the polypeptide chains during refolding is a prerequisite for formation of the native dimensional structure of ipmdh. we describe a new beamline for optical biophysic in construction on the synchrotron soleil. the high briliance of the synchrotron beam, associated with its tunability on a broad part of the electromagnetic spectrum make it an excitation source of choice for several biophysical optical techniques. the disco beamline we present will consist of three endstations : . the circular dichroism (cd) endstation will bene t from the inclusion of the energies accessible in the vuv wavelength range ( - nm) and from the natural polarization of the synchrotron beam. cd spectra of proteins covering a broad range of wavelenghts will enable better and ner structural analysis. moreover, new biological chromophores such as sugars which absorb in the deep uv will be accessible in cd. . the mass spectrometry endstation,will bene t from an ionisation beam with even greater energies (down to nm) comprising nebulisation at atmospheric pressure. photoionisation of macromolecular bio-structures without any solvent restriction will produce perfect analytes for mass spectrometry, . the multiparametric imaging endstation, build on a confocal microscope, will use the great tunability of the synchrotron radiation ( - nm) to excite samples at many wavelenghts simultaneously. the temporal component of the beam will allow natural lifetime imaging by phase modulation -demodulation. predictive all-atom protein folding with stochastic optimization methods the prediction of protein tertiary structure, in particular based on sequence information alone, remains one of the outstanding problems in biophysical chemistry. according to the thermodynamic hypothesis, the native conformation of a protein can be predicted as the global optimum of its free energy surface with stochastic optimization methods orders of magnitude faster than by direct simulation of the folding process. we have recently developed an all-atom free energy force eld (pff ) which implements a minimal thermodynamic model based on physical interactions and an implict solvent model. we demonstrated that pff stabilizes the native conformation of several helical proteins as the global optimum of its free energy surface. in addition we were able to reproducibly fold several helical proteins (the amino acid (aa) trp-cage protein, the villin headpiece ( aa), the conserved headpiece of the hiv accessory protein ( aa), the headpiece of protein a ( aa) and the -helix bacterial ribolsomal protein l ( aa), as well as several beta-sheet peptides. we used several stochastic optimization methods: the stochastic tunneling method, an adapted version of parallel tempering, basin hopping techniques and distributed evolutionary optimization strategies. we discuss advantages and limitations of this approach to de-novo allatom protein structure prediction. the independent thermal unfolding simulations of gb have been performed. physical property parameters of protein structure were chosen to construct a -dimension physical property space. then the -dimension property space was reduced to dimensions principle component property space. under the property space, the unfolding pathway ensemble of gb was obtained. the pathway ensemble likes a funnel that was gradually emanative from the native state ensemble to the unfolded state ensemble. the unfolding trajectories have the similar variable trend during the native state and the transition state ensemble. during the unfolded state, the unfolding trajectohries were divided into two types that one includes only one trajectory and the other include trajectories. the rst type of unfolded state was a discontinuity step distribution model, which is not random distribution. the second type of unfolded state was a near ellipsoid distribution model and a near random. there were substantial overlaps of unfolded state, indicating that thermal unfolded state consists of a con ned set of property values that makes the number of unfolded state of protein to be much smaller than that was believed before. the protein circular dichroism data bank (pcddb): a bioinformatics and spectroscopic resource b. a. wallace , l. whitmore , r. w. janes dept. of crystallography, birkbeck college, university of london, school of biological sciences, queen mary, university of london we describe the development and creation of the protein circular dichroism data bank (pcddb), a deposition data bank for validated circular dichroism spectra of biomacromolecules. its aim is to provide a resource for the biological and bioinformatics communities, by providing open access and archiving facilities for circular dichroism spectra. it is named by parallel with the protein data bank (pdb), a long-existing valuable reference data bank for protein crystal structures. it will permits spectral deposition via userfriendly web forms and will include automatic reading of a range of data formats and data mining from le headers to facilitate the process. it will be linked, in the case of proteins whose crystal structures and sequences are known, to the appropriate pdb and sequence data band les, respectively. a series of validation tools that will provide reports on data quality are included (and are accessible as stand-alone software). it is anticipated that this data bank will provide readily-accessible biophysical catalogue of information on folded proteins that may be of value in structural genomics programs, for quality assurance and archiving in industrial and academic labs, as a resource for programs developing spectroscopic structural analysis methods, and in bioinformatics studies. the relation of n-terminal residues and structural stability of l-chain apoferritin k. yoshizawa , k. iwahori , y. mishima , i. yamashita crest/jst, atrl-matsushita, nara institute of science and technology the denaturation of apoferritin by acidic solution was studied. ferritin, the ubiquitous iron storage protein, represents a well known polymeric assembly that is highly resistant to chemical and physical denaturants. it is a cage-shaped protein which is composed of subunits. natural vertebrate ferritins are copolymers of two different subunits, l-and h-chains. in the recombinant h-chain apoferritin (rhf), the structural stability is decreased by deletion of the n-terminal residues. we studied the effect of n-terminal residues of recombinant l-chain apoferritin (rlf) on the acidic ph denaturation and re-assembly. we constructed rlf and mutant rlfs which are lack of (fer ) or (fer ) amino acid residues from the n-terminus and investigated their stability by cd spectra. among three, fer has the least endurance against ph decrease. in the case of fer , the re-assembly of subunits into apoferritin can be performed by increasing solution ph without causing the by-product while huge aggregations are caused in fer and fer . the structural comparison of three mutants indicates that the hydrogen bonds of inter-and intra-subunits decrease by the loss of the n-terminal residues. therefore, it is elucidated that the hydrogen bonds of inter-and intra-subunits from n-terminal residues affect the molecule stability and re-assembly of l-chain apoferritin. pressure perturbation calorimetic studies of solvation, unfolding and aggregation of proteins r. winter university of dortmund, physical chemistry, otto-hahn-str. , d- dortmund pressure perturbation calorimetry (ppc) was used to study the solvation and volumetric properties of various proteins in their native and unfolded state. in ppc, the coef cient of thermal expansion of the partial speci c volume of the protein is deduced from the heat consumed or produced after small isothermal pressure jumps, which strongly depends on the interaction of the protein with the solvent or cosolvent at the protein-solvent interface. the effects of ph and various chaotropic and kosmotropic cosolvents (glycerol, sucrose, urea, guhcl, salts, etc.) on the solvation and unfolding behavior of the proteins was also investigated, and the observed volume and expansivity changes are correlated with further thermodynamic and spectroscopic properties of the systems. depending on the type of cosolvent and its concentration, speci c differences are found for the solvation properties of the proteins, and the volume change upon unfolding may even change sign. taken together, the data obtained lead to a deeper understanding of the solvation process of proteins in different cosolvents in their native and unfolded states. in addition, the effects of con nement and crowding on the solvational properties of the proteins were studied. finally, the use of ppc for studying intermolecular interactions and aggregation (amyloidogenesis) phenomena of proteins (e.g., insulin, prp) will be discussed. the in uence of semisynthetic derivatives of phenolic lipids on activity of yeast abc pumps phenolic lipids are the natural amphiphilic long-chain homologues of orcinol ( , -dihydroksy- -methylbenzene). they occur in numerous plants and microorganisms. resorcinolic lipids exhibit high af nity for lipid bilayer and biological membranes and are able to modify the activity of membrane enzymes (e.g. pla , ache). the in uence of semisynthetic derivatives of phenolic lipids on yeast pdr protein activity was studied by spectro uorimetric method using the potentiometric uorescence probe dis-c ( ). the probe is expelled from s. cerevisiae by abc pumps and can conveniently be used for studying their performance. two pump-competent s. cerevisiae strains and different pump-free mutant strains were used to experiment to check the effect of the semisynthetic derivatives of phenolic lipids on activity of abc transporters. two of these derivatives, named . and . , seem to affect the activity of pdr pumps. their in uence on activity of yeast plasma membrane multidrug resistance abc pumps is concentration-dependent. in uence of lipid membrane composition on pglycoprotein activity k. bucher, s. d. krämer, h. wunderli-allenspach department chemistry and applied biosciencies, eth zurich, switzerland p-glycoprotein (p-gp), a membrane atpase expelling many structurally unrelated compounds, is one of the major contributors to multidrug resistance. it is proposed that substrates bind to it within the membrane and are exported from there out of the cell. p-gp substrates are generally hydrophobic and their binding to the transporter is governed by their ability to partition into the membrane. the intimate association of both p-gp and its substrates with the membrane suggests that p-gp function may be regulated by the composition of the lipid bilayer. as detergents in uence the membrane properties and have been shown to affect p-gp atpase activity, we developed virtually detergent-free proteoliposomes to investigate the in uence of the membrane environment on the atpase activity of p-gp. the basal and substrate induced atpase activity was dependent on the cholesterol level of egg phosphatidylcholine (pc) membranes. the compound concentration at half maximal activation of p-gp (k m ) in proteoliposomes correlated with the af nity of the respective compound to liposomes consisting of the same lipids as the proteoliposomes tested. in conclusion, the basal and drug-induced atpase activity of p-gp is strongly dependent on the cholesterol content in detergent-free p-gp/egg pc/cholesterol proteoliposomes. m. berchel , j. jeftic , t. benvegnu , j.-y. thepot , d. plusquellec enscr umr cnrs , campus de beaulieu, rennes, université de rennes , umr cnrs , rennes bipolar lipids found in archaebacterial membranes, generally termed bolaamphiphiles, induce increased stability in membranes exposed to environments such as acidic conditions, high temperatures, high salt concentrations and/or absence of oxygen. we have synthesized a spin labeled unsymmetrical bolaamphiphile that selforganises in water solutions in multilamellar vesicles and shows slow ip-op phenomenon in comparison to conventional liposomes. generally, the ip-op from the exovesicular to the endovesicular membrane surface is a relatively slow process, which is due to the high energy barrier in transferring the polar amphiphilic heads through the lipophilic membrane. it can be involved in membrane transport mechanisms and in facilitating the transport, cells have evolved to use various supramolecular strategies. the half-life of the ip-op is estimated to more than twelve hours. we are now modulating the ip-op rate by incorporating chemical modi cations such as addition of cyclopentanes, double or triple bonds into the bridging chain of the molecule, in order to control the membrane transport via the ip-op mechanism. transport activity of the monocarboxylate transporter is increased by carbonic anhydrase h. m. becker, j. w. deitmer abt. allgem. zoologie, tu kaiserslautern, kaiserslautern the enzyme carbonic anhydrase (ca), which catalyses the conversion of co and h o to bicarbonate and protons, is present in nearly all animal cells, and is highly expressed in astrocytes. it is known that ca can bind to several membrane transporters, forming a transport metabolon and thereby enhancing the transport activity of the protein. in this project we have studied the functional interaction of the enzyme with the monocarboxylate transporter (mct ), which transports lactate and other monocarboxylates together with protons and is believed to play a pivotal role in the metabolite shuttling between astrocytes and neurons. therefore we expressed mct and then injected ca into xenopus oocytes. our results indicate a direct binding of ca to the mct , leading to a ca-induced increase in acid/base ux mediated by the transporter. interestingly, the effect was insensitive to the ca inhibitor ethoxyzolamide and to the nominal absence of co /hco , but disappeared when binding of ca to the mct was hindered. it seems, that ca, bound to the mct , mediates local buffer capacity by removing protons transported into the cell via the mct . this helps to stabilise the proton gradient close to the cell membrane, and thereby enhances the transport activity of the mct . these ndings suggest that ca can enhance metabolite-acid/base transport, by forming a transport metabolon with the mct . melibiose permease of e.coli (mel b) is a membrane bound ioncoupled sugar symporter that uses the favorable na , li , or h electrochemical potential gradient to drive cell accumulation ofor -galactosides. cysteine scanning mutagenesis, electrophysiological (ssm -solid supported membranes) and uorometric measurements, were used in order to better understand the role of speci c parts of the protein in the function of this symporter. the ssm technique combines a rapid solution exchange with the high sensitivity of planar lipid membranes. it employs a solid supported membrane as a capacitive electrode and allows the time resolved investigation of charge translocation during the catalytic cycle of such transporters as na /solute symporters. in order to obtain some more precise information about the function of mel b symporter, starting from the c-less melb, the mutant g c was constructed and from electrical, spectro uorimetric and fret measurements carried out on this mutant, in the absence and in the presence of speci c inhibitors, conclusions were drawn about the possible role of the helix iv in the function of the symporter. two-dimensional crystallization of co-reconstituted ca +-atpase, phospholamban and sarcolipin the sarcoplasmic reticulum ca -atpase and its regulators phospholamban (plb) and sarcolipin (sln) form a primary control mechanism in the recovery of resting state calcium levels in the myocardium. defects in the regulation of ca -atpase by plb and sln are central determinants in cardiac contractility and disease states such as cardiomyopathy. given the signi cance of these proteins, the structural details of their regulatory mechanisms remain an important future goal for the clinical improvement of heart disease. using co-reconstitution into proteoliposomes at low lipidto-protein ratios, we have examined the effects of mutation on the functional properties of plb and sln, revealing novel insights into calcium pump regulation. in addition, these same co-reconstituted proteoliposomes have been used for structural studies by electron cryo-microscopy. in an attempt to better de ne the structural interactions between plb, sln and ca -atpase, we have sought methods for the production of large two-dimensional crystals suitable for high resolution electron crystallography. we previously utilized the co-reconstituted proteoliposomes to produce long, tubular crystals suitable for helical reconstruction. our new procedure comprises three steps -co-reconstitution, membrane fusion, and crystallization -producing large two-dimensional crystals suitable for high resolution structural studies. herein, we will present our latest results characterizing the structural interaction between plb, sln and ca -atpase. t. v. demina , n. s. melik -nubarov , h. frey , e. e. pohl state university, chemistry department, moscow, russia, institute of organic chemistry, johannes-gutenberg-university, mainz, germany, humboldt university, charite, institute of celland neurobiology, berlin, germany multidrug resistance (mdr) of tumours is associated with overexpression of the p-glycoprotein responsible for an active drug ef ux from cells. block copolymers of ethylene oxide and propylene oxide (pluronics) are known to cause a pronounced chemosensitization of tumour cells. the effect may be due either to speci c polymer -protein interactions or to unspeci c lipid bilayer disturbance. we have shown recently that amphiphilic copolymers with various hydrophilic and hydrophobic blocks can disturb lipid bilayers. importantly, that block copolymers of propylene oxide and glycerol (ppo-pg) with hyperbranched "corona" induced larger effects then pluronics with linear polyethylene oxide chains. in the present work we have shown that ppo-pg copolymers increase dox cytotoxicity towards human erythroleukemia (k is , k /dox) and breast carcinoma (mcf /dox) resistant cell lines. using confocal and two-photon microscopy, we demonstrated that these copolymers accelerated dox penetration into resistant cells, inhibited efux and caused drug redistribution into nuclei. a clear correlation between the ability of the polymers to disturb lipid bilayers and favour drug accumulation in mdr cells was disclosed. this nding points to an unspeci c mode of the copolymers' chemosensitizing activity. voltage dependence of processes related to electrogenic membrane transporters electrogenic membrane transporters, such as the sodiumbicarbonate cotransporter (nbce ), may induce dependence on membrane potential upon processes which are innately voltageindependent. we tested this hypothesis by heterologously coexpressing electrogenic nbce from human kidney in oocytes of the frog xenopus laevis with the electroneutral rat monocarboxylate transporter mct . the apparent intracellular buffer capacity was increased by nbce expression and became voltage-dependent by mm/ mv membrane depolarisation. lactate transport via the mct not only became enhanced after co-expression with nbce , but also dependent upon membrane potential. injection of carbonic anhydrase caii from bovine erythrocytes into oocytes enhanced the ef cacy of nbce activity, identifying an additional, ca-sensitive, membrane current via nbce . our results show that nbce adds voltage-dependent buffer capacity to the cytosol; this is suggested to be the prime cause for enhancing acid/base-coupled transport and conferring membrane potential dependence on transporters which are stoichiometrically electroneutral. these interactions may have functional consequences for cells and tissues, where electrogenic and electroneutral processes interact, such as in brain, heart and muscle. . by using carboxy-snarf- as ph-sensitive uoroprobe and microspectro uorimetry, we now show that nhe activation is due to jun kinase (jnk) activation, resulting from reactive oxygen species (ros) produced during metabolism of b(a)p and might involve lipid raft. when analysing b(a)p-induced apoptosis, we have found that cariporide signi cantly reduces both nuclear fragmentation and caspase- like activity. we further show that nhe activation and/or alkalinization affects the mitochondrial ros production detected during the apoptotic cascade, likely via an effect on the complex iii of the electron transport chain. altogether, our results suggest that apoptotic xenobiotics, such as benzo[a]pyrene, induce an early activation of nhe that might play a signi cant role in the subsequent mitochondria-dependent apoptosis. conformational dynamics of a lactose proton symporter j. c. holyoake, m. s. sansom biochemistry department, university of oxford, south parks road, oxford, ox qu, u.k. transporter proteins are an essential class of proteins, catalysing the transfer of molecules across membranes. increasing numbers of transporter structures are becoming available, opening the way to study their dynamic properties using computational techniques. we present a study on the conformational dynamics of the lactose proton symporter lactose permease using molecular dynamics simulations. these simulations exhibit large scale conformational changes from the initial intracellularly open conformation to a more closed conformation that may be signi cant to the transport mechanism. the conformational change is analysed to identify the contributing motions. effects of nortriptyline and chlorpromazine on anthroylouabain-labeled na,k-atpase e. a. guevara , m. l. barriviera , a. hassón-voloch , s. r. louro departamento de física, puc-rio, rio de janeiro, brazil, instituto de biofísica carlos chagas filho, ufrj, rio de janeiro, brazil the effects of nortriptyline and chlorpromazine (cpz) on the uorescence properties of anthroylouabain (ao)-treated na ,k -atpase of electrocyte membranes from e. electricus are studied. na ,k -atpase oscillates between two major conformations e and e during ion transport cycle. the cardiotonic steroid ouabain speci cally inhibits this enzyme binding to the e conformation. the uorescent label ao presents increased uorescence when binding to the ouabain site of na ,k -atpase. tricyclic drugs such as the antipsychotic cpz and the antidepressant nortriptyline inhibit na ,k -atpase activity in the micromolar range. for the e enzyme, but not e , nortriptyline was found to increase the uorescence in a concentration dependent manner, suggesting a further stabilization of e . for both conformations, cpz induces negligible uorescence change up to µm. the uorescence of atpasebound ao, however, strongly increases upon ultraviolet exposure after cpz treatment at concentrations around µm. fluorescent products of cpz-photodegradation were studied in pure buffer and in the presence of membranes. the results suggest that cpz binds to na ,k -atpase and photolabels amino-acid residues near the ouabain binding site. how important is protein exibility for transport through ion channels? a. a. gray-weale university of sydney certain ion channels are selective for k+ over other ions, but the geometry of the pore does not explain selectivity because thermal uctuations are too large. i extend the usual treatment of ion channels with molecular dynamics simulation by calculating the static and dynamic pair correlations between monovalent ions and ion channels (gramicidin-a and kcsa), and also between certain small, complex cations and the gramicidin channel. this means not only the radial-distribution functions or the density pro les, but also correlations between the ion and the mass and charge densities of various regions of the protein. the advantage of this approach is that it systematically identi es the elements of the protein and modes of motion that contribute to selectivity, and illustrates the decay of correlations. recently, noskov et al. [ ] showed that thermal uctuations protect selectivity. my results on the interaction of ions with carbonyl groups agree with theirs, but take the analysis further to higher correlations. the key new element is the study of the time-correlation functions that describe the motion of the ions through the channel, borrowing methods originally developed for the study of dense or even supercooled liquids. the surface-enhanced infrared absorption spectroscopy (seiras) is used for the investigation of two membrane proteins, the cytochrome c oxidase (cco) and the bacteriorhodopsin (br). of central interest are the transport-mechanisms of electrons (cco) and ions (br). the main parts of the setup are an infrared light source, a hemispherical si-crystal, in which the beam is internal re ected and a plexiglas cell with buffer solution (see schematic scetch). the beam is totally re ected on the inner at surface of the crystal, but the evanescent wave excites surface plasmons in a chemically adsorbed gold-layer on the crystal (attenuated total re ection spectroscopy-atr). the protein to be analysed is attached at the gold surface and can absorb certain wavelengths. the gold is need for the surfaceenhancing effect. cco plays a major role in the respiratory chain, the retinal protein br is a photosynthetic protein. the cco is immobilized on the gold surface via the af nity of its histidine-tag to a nickel-chelating nitrilo-triacetic acid (nta) surface. for the br we incorporate the protein in a lipid membrane, which is attached on the gold surface by the sulfuric bindings of , -di-o-phytanyl-sn-glycerol- -tetraethylene glycol-d,l-lipoic acid ester lipid (dptl). the sensitivity of this method is further enhanced by modulation of an external parameter, like the electric potential. effects of copper ions on the escherichia coli growth and proton-potassium exchange copper ions are required for the function of many important enzymes in escherichia coli but can cause a number of toxic cellular effects also. it's interesting to reveal the in uence of copper ions on the growth of bacteria and proton-coupled membrane systems. upon transition of e. coli mc wild-type culture to stationary growth phase a decrease in redox potential (e h ) from the positive values ( + mv) to the negative ones (of - to - mv), resulting h production by formate hydrogenlyase (fhl) have been studied. copper increased a latent growth phase duration as well as delayed a logarithmic growth phase in concentration-dependent manner. during the anaerobic growth, the production of h was strongly inhibited in the presence of cucl ( mm). . mm cucl was inhibited h production under experimental conditions with glucose. the inhibitory effect of copper ions ( . mm) on n',n'dicyclohexylcarbodiimide (dcc)-sensitive h /k exchange was also observed: k uptake was decreased and the stoichiometry of dcc-inhibited ion uxes varied. interestingly, these effects on h and k uxes were absent for the mutant hd (hyc-operon for hydrogenase was deleted). we suggest that copper ions, inhibiting the activity of fhl, have an effect on h /k -exchanging mechanism which is the proton f f -atpase associated with k uptake trk system. this effect may be due to the relationship of fhl with the ion-exchanging mechanism above under fermentation at alkaline ph. lateral diffusion in tethered bilayer membranes m. jung, v. atanasov, w. knoll, i. koeper max planck institute for polymer research, mainz, germany tethered bilayer membranes (t'blm) provide a useful platform for the investigation of bilayer membranes as well as embedded membrane proteins. we have developed a modular system, which is suitable for surfaces providing gold and oxide surface coatings. these systems serve as a quasi natural environment for the study of membrane proteins, being functionally incorporated into a lipid bilayer, which is covalently bound (tethered) to the substrates. functionality could be shown using electrochemical methods. here we present a study of these systems, basically the membrane itself, using uorescence recovery after photobleaching (frap) studies in order to investigate lateral motion in the lipid bilayers. lateral mobility is essential for successful incorporation of large membrane protein complexes. we will present rst results of experiments that try to differentiate motions hindered due to the tethering from diffusion in free oating or suspended bilayers. the information gained in this study will serve for improvements in the chemical structure of the tethered molecules. we will develop the system as a basis for bio sensing applications, where embedded proteins will serve as actual sensing elements. a. d. ivetac, j. campbell, m. s. p. sansom biochemistry department, university of oxford, south parks road, oxford, ox qu, u.k. atp-binding cassette (abc) transporters form an important superfamily of membrane proteins which couple atp hydrolysis to the active transport of diverse compounds across the cell membrane. their biomedical relevance is highlighted in examples such as multidrug resistance to antibacterial and anticancer agents, and cystic brosis. the availability of crystal structures of three complete bacterial abc transporters provides an opportunity to study structurefunction relationships at the atomic level. in this work, we carry out multi-nanosecond molecular dynamics simulations of the vitamin b importer from e. coli (btucd), with both the complete multimeric transporter embedded in a phospholipid bilayer and the soluble subunits in a membrane-free environment, in an attempt to elucidate some of the conformational changes which arise during the transport event. atp-bound and atp-free structures are used to investigate the effect of nucleotide on the system. a range of analytical techniques have been applied to assess the dynamic behaviour of the protein during the simulations, which includes measurements of: conformational drift, residue exibility, transmembrane domain (tmd) movement, concerted protein motions, nucleotide-binding and translocation pathway changes. an in-vitro method was designed to measure transmembrane transport rates. liposomes were prepared by extrusion with dipalmitoyl phosphatidylcholine (dppc) and optionally cholesterol, and loaded with a peptide (zinc-insulin tagged with a uorescent group or bsa). after removal of the non-encapsulated peptide from the liposome solution by gel ltration, the release of the peptide from the liposomes was monitored by uorescence as a function of time at various temperatures. the transport was greatly accelerated by the presence of a speci c proprietary excipient molecule (cyclopentadecanolide -cpe ™), which effectively triggered the release of the peptide. a mathematical model was developed to quantify these results. a semi-empirical nonlinear equation involving four parameters ts the protein release pro les. then a neural network predictions model was used to correlate the different release condition parameters and the four semi-empirical tting parameters based on the experimental data sets. most release data t well with the mathematical model, further supporting our theory of a two step release mechanism. phenyltin the well known group of organotin compounds that exhibit toxic properties in relation to the biological systems are phenyltins. no studies have been performed as yet to establish directly whether organotins such as diphenyltin dichloride ( dpht) and triphenyltin chloride (tpht) cross the lipid bilayer. we have performed experiments that showed transfer of those compounds across the lipid bilayer using the stopped-ow technique and desorption of those compounds from a monolayer using the langmuire technique. obtained results demonstrate that dpht and tpht rst adsorb onto the lipid bilayer surface, in diffusion controlled manner, within a very short time ( . s), whereas the membrane passing was observed in a minute's time range. the long time kinetics show a complex dependence on the kind of compound, its concentration and the presence of cholesterol in the membrane. the desorption of both compounds from the monolayer to water subphase occurs in a minute's time range. these observations may explain the known fact, that the in uence of organic, amphiphilic tin (and also lead) compounds is more toxic than that of inorganic ones.the phenyltins much easier (compared with tin or lead ions ) penetrate e.g. blood -brain barrier. [ under the resting conditions ca concentration in agonistsensitive ca stores re ects a balance between active uptake mediated by a ca -atpase (serca) and passive ef ux of ca . this ca leak appears to be a common property of ca -storing organelles, but the nature of the leak in submandibular acinar cells remains unclear. we have studied the ca leak pathways in the endoplasmic reticulum (er) of acinar cells of rat submandibular salivary gland by directly measuring concentration ca in the er ([ca ] er ) in mag-fura /am preloaded cells while [ca ] i was clamped at a resting level with a egta/ca mixture. we have shown that thapsigargin (tg) or ca -free buffer treatment completely blocked ca uptake by serca after the rst minute of superfusion and caused a ca leak represented by continuous decline in [ca ] er . this ca leak from the er was not sensitive to tg, heparin and ruthenium red and therefore appears to be independent of the serca, the insp receptor and the ryanodine receptors. however, treatment with puromycin ( . - mm) to remove nascent polypeptides from er-ribosome translocon pores increased ca leak from the er by a mechanism independent of the serca, insp or ryanodine receptors. thus we conclude that basal ca leak from the er of submandibular acinar cells occurs through translocon pores in the er membrane. r. b. kishore, j. reiner, e. edgu-fry, a. jofre, k. helmerson physics laboratory, national institute of standards and technology we have developed a procedure to make lipid and polymer nanotubes of up to one cm long and nm in diameter, from the surface of giant liposomes and polymersomes, using micro uidics and optical tweezers. the liposomes and polymersomes were formed, using elcetroformation method, from phospholipids and amphiphilic diblock copolymers, respectively. the polymer tubes were made extremely robust by cross-linking them using chemical reactions. we are currently studying the transport of molecules in the crosslinked nanotubes for use in nano uidic networks. p-glycoprotein (p-gp) is an active membrane transporter capable of expelling out of the cell a large number of potentially cytotoxic amphiphilic molecules with unrelated chemical structures. as a consequence, p-gp may be responsible for multidrug resistance of tumors against chemotherapy (mdr); it also plays a key role in absorption, biodisposition and elimination of many pharmaceuticals. to understand the molecular mechanisms of the transmembrane drug transport mediated by pgp, it is highly desirable to design a convenient assay for measuring both p-gp atpase activity and p-gp transport function. to do so, we used inside-out native membrane vesicles, prepared from mdr cells and containing high amounts of p-gp. we took advantage of the speci c property of a uorescent dye, the carbocyanin jc- , known to be expelled out of mdr cells: above a critical concentration (the "cjc"), this dye forms j-aggregates which emit a uorescence at a wavelength very different from that emitted by the monomer. in the presence of mgatp, the p-gp-containing vesicles accumulated jc- , which exceeded locally the cjc and thus formed intraluminal j-aggregates; these aggregates allowed accumulated jc- both to be sequestered inside the vesicles, by dramatically slowing down its passive backdiffusion, and to be speci cally detected. kinetic characterization of this transport suggests that jc- is rst translocated to the exoplasmic lea et of the vesicle membrane before its internalization into the aqueous phase of the vesicle lumen. interaction between the energy metabolism and externally applied electric elds in yeast cells electric elds are often used for biophysical or biomedical treatment of biological cells, e.g. cell fusion or killing of cells. however, only a few data about the possible mechanisms of electrosensitivity of biological cells are available. since electrostimulation always induces depolarization of biomembranes, an impact of the energy metabolism is obvious due to the regeneration of electrochemical gradients by the expenditure of cellular energy. our aim is to investigate the interactions between externally applied electric elds and the aerobic/anaerobic energy metabolism of yeast cells. for this, we have constructed a new electrical interface for local stimulation of biological cells with variable duration and amplitude. when applying short lasting electrical pulses to yeast cells, we nd a direct response of the energy metabolism (measured by nadh-uorescence) to these pulses. a sudden and fast decrease in nadh is followed by a slower recovery of the uorescence signal. these nadh-signals are abolished in the presence of antimycin a or kcn, demonstrating the importance of mitochondrial energy production for this phenomenon. we attribute these changes to the immediate break down of atp as a consequence of the regeneration of the membrane potential (atpases) and the slower regeneration of atp by mitochondrial respiration. new insights of hypericin blood transport and its incorporation into the plasma membrane b. m. macri , g. stoian , m. l. flonta department of animal physiology and biophysics, faculty of biology, university of bucharest, department of biochemistry, faculty of biology, university of bucharest, bucharest, romania hypericin (hyp) is one of the active compounds from hypericum perforatum (an herb usually prescribed as antidepressant). the bioavailability of this hydrophobic molecule is a very important issue for medical applications. the goal of our work was the study of hyp blood transport mechanisms. techniques of absorbtion spectroscopy, electrophoresis and uorescence microscopy were used in order to de ne the properties of hyp-albumin and hyp-lipoproteins complexes and to explain the action of hyp at the plasma membrane level. hyp bind to several electrophoretic (sds-page) bands evidenced by mice plasma migration. both albumin and lipoproteins bind to hyp, forming complexes, during the blood transport process. different types of lipoproteins from males and females plasma mice were evidenced by gradient electrophoresis to bind hyp. hyp-albumin complex was also identi ed by absorbtion spectra, and the ratio a /a has a ph-dependence. hyp interaction with plasma membranes was also examined on cell culture by uorescence microscopy, and hyp plasma incorporation is a dose-and incubating time-dependent process. our results partially elucidate the plasma fractions that bind hyp, contributing to its blood transport. this study proposes a new mechanism of hyp cellular insertion, discussing its plasmatic membrane penetration due to its high hydrophobicity. the overexpression of p-glyoprotein (p-gp) is one of the major causes of multidrug resistance (mdr) in cancer chemotherapies. many p-gp inhibitors have been designed to reverse the mdr effect, but the structure-activity relationships of the p-gp inhibitors and substrates still remain largely unknown. until now, it is still very challenging to obtain the high-resolution p-gp structures and currently only low-resolution electron microscopy structure is available. this has caused the structure-based design of "p-gp-ignoring" therapeutic agents a remote goal. however, the recent determination of x-ray crystal structures of bacterial lipid a transporter, msba, has provided eligible structure templates for homology modeling of p-gp. we have therefore conducted explicit solvent molecular dynamics simulations of the fulllength ef ux pump, human p-gp, in an excessively hydrated popc bilayer to re ne the homology model. both free and atp-bound forms of p-gp have been simulated. the entire system consists of more than , atoms. our molecular dynamics simulations have shown that the overall architecture of p-gp remained very stable for tens of nanoseconds, while the observed membrane undulation was rather large. the simulation results have allowed us to investigate the conformational changes of p-gp upon atp binding in the ef ux process and to predict the possible binding site of various known substrates and inhibitors. the re ned structure models of p-gp by our simulations could be used as the basis for further drug design. electroporation (ep) is a phenomenon where increased permeability of cells exposed to an external electric eld is observed. the induced transmembrane voltage presumably leads to the formation of aqueous pores in the phospholipid bilayer, which increases permeability of the membrane for molecules and ions. ep is currently used in many biomedical applications including transfer of genes and electrochemotherapy of tumors. still, the molecular mechanisms of the process are not fully explained. recently it was proposed that ep could be monitored in real-time by measuring electric conductivity of tissue. so far the studies focused mostly on a single pulse, however in biomedical applications usually several pulses are used. in our study we used a train of electric pulses to analyse the relationship between electric conductivity and cell permeabilization. current-voltage measurements during and after pulse application were performed in dense suspension of cells. conductivity changes were analysed numerically using nite elements method and compared with the percentage of permeabilized cells. we obtained a transient increase in conductivity above a certain voltage with complete relaxation in < s. substantial changes in conductivity are also due to the diffusion of ions through membrane pores and osmotic swelling. we further show that relation between conductivity and permeabilization level is indirect. interaction of quinolones with bacterial porin ompf: uorescence quenching studies quinolones are widely used antibiotics witch develop their antibacterial action by inhibition of important bacterial enzymes. consequence of the internal location of their target of action, the translocation of this drugs trough the outer membrane is an essential step for their antibacterial action. in vivo studies have been showing that ompf is important for the entrance of some of these antibiotics, but the exact degree of involvement of this protein in the transport of the different members of this group of antibiotics, remains unknown. in this study, the quenching of the intrinsic tryptophan uorescence of ompf, in presence and in the absence of the drugs and by two distinct quenchers, was used as a rst approach, to elucidate ligandinduced structural changes and consequently prove the differential involvement of ompf in the entrance of these antibiotics in the bacterial cell. the results obtained reveal that the degree of interaction with the protein is related with the hydrophobicity of the different antibiotics. this kind of evidence suggests that the entry by the porin channel is not the only path used by these antibiotics and that it is more important for the latest generations of this group because of their increased hydrophilic characteristics. inhibition of multidrug resistance-associated protein mrp and kv channels by natural polyphenols k. michalak, a. teisseyre, b. ania-pietrzak department of biophysics, wroclaw medical university, poland resistance to cytotoxic agents remains a major obstacle to successful chemotherapy in cancer. best-characterized form of drug resistance is caused by the overexpression of genes encoding membrane drug pumps, like p-gp or mrp . in present study, activity of several plant polyphenols ( avonoids and stilbene) against mrp has been studied using functional assay based on ef ux of mrp uorescent substrate. very recently, a role of kv . potassium channels in proliferation of various cancer cells was suggested. in our study the effect of the plant polyphenols on voltage-gated potassium channels kv . was investigated by patch-clamp electrophysiological method. some of studied compounds were found to be active inhibitors of multidrug resistance-associated protein mrp and voltage-gated potassium channels, and their properties are promising for further research in the eld of anticancer activity of natural products. the transport mechanism of melibiose permease: a study using electrical measurements and uorescence techniques k. meyer-lipp , c. ganea , t. pourcher , g. leblanc , k. fendler max planck institut für biophysik, frankfurt, germany, c. davila medical university, bucharest, romania, cea, université de nice so a antipolis, nice, france the melibiose permease (melb) of escherichia coli is a membrane bound carrier that uses the favorable na+, li+, or h+ electrochemical potential gradient to drive cell accumulation of alfa-galactosides (melibiose, raf nose) or beta-galactosides (methyl- -thio-beta-dgalactopyranoside). electrophysiological techniques have proved to be extremely useful tools to investigate the mechanism of ion transfer across the membrane by ion-coupled transporters. using a solid supported membrane (ssm) as a capacitive electrode a rapid solution exchange can be combined with the high sensitivity of planar lipid membranes and allows time resolved investigation of the charge translocation during the catalytic cycle of na+/solute symporters. this technique has been combined with uorescence measurements, which report on structural changes during the substrate transport process of the carrier. we have used time resolved tryptophane uorescence, uorescence energy transfer with a uorescent sugar substrate and site speci c uorescence of a label attached to a cysteine residue on the protein. this allowed us to identify conformational transitions during the reaction cycle of the melibiose permease. we could assess their electrogenicity and determine rate constants. a kinetic model for na+ and melibiose binding and transport is presented. electrophysiological characterization of the vast number of annotated channel and transport proteins in the postgenomic era would be greatly facilitated by the introduction of rapid and robust methods for the functional incorporation of membrane proteins into dened lipid bilayers. we present an automated method for reconstitution of membrane proteins into lipid bilayer membranes, that substantially reduces both the reconstitution time and the amount of protein required. we have applied this well-de ned system to the characterization of a novel mitochondrial uncoupling protein, ucp and demonstrated that ucp exhibits protonophoric function exclusively in the presence of fatty acids, similar to that previously shown for its homologue ucp . the membrane conductance was proportional to the concentration of the reconstituted ucp in presence of oleic acid or eicosatrienoic acid, and was inhibited by atp. amphipols are amphipathic polymers designed to replace or supplement detergents in membrane protein solution studies. for the study of the ca -atpase from sarcoplasmic reticulum, previous experiments have revealed both advantages and disadvantages to the use of a polyacrylate-based amphipol, a - . these issues have been reinvestigated using four different amphipols. size exclusion chromatography showed that, although a - aggregates in the presence of millimolar concentrations of calcium -an effect that probably accounts for most of the aggregation of atpase/a - complexes observed in our previous work-, aggregation can be avoided by resorting to a sulfonated version of a - . we also found that all amphipols tested slowed down the rate of calcium dissociation from its binding sites and reduced atpase activity, while protecting the solubilized protein against denaturation. this suggests that association with the polymer may damp the protein's dynamics, perhaps due to the multipoint attachment of the polymer to its hydrophobic transmembrane surface. such a "gulliver" effect could contribute both to the protection of membrane proteins against denaturation and to the reversible inhibition of serca a. clc proteins are found from prokaryotes to mammals. they function as plasma membrane chloride channels or provide neutralizing anion currents for v-type h -atpases that acidify compartments of the endosomal/lysosomal pathway. vesicular clcs have been thought to be cl -channels, in particular because clc- and clc- mediate plasma membrane cl -currents upon heterologous expression. we have shown, however, that these two mainly endosomal clc proteins rather function as electrogenic cl /h exchangers, resembling the transport activity of the bacterial clc-e that has been crystallized. neutralization of a critical glutamate residue not only abolished the steep voltage-dependence of transport, but also eliminated the coupling of anion ux to proton counter-transport. clc- and clc- may still compensate the charge accumulation by endosomal proton pumps, but are expected to tightly couple vesicular ph-gradients to cl -gradients. calorimetry and mechanics of ca transporting systems in rat myocardial bigeminies a series of new n-oxides of tertiary amines (nta) was checked for its biological activity. individual compounds differed in the length of substituted alkyl chain. the primary goal was to nd if they can be used as effective antioxidants and, to what degree they modify used model (liposomes) and biological (erythrocytes, algae and cucumber) membranes. various methods were used in order to do that. a mechanism of the interaction between nta and membranes was studied by measuring their potency to hemolyse erythrocytes, to in uence a phase transition temperature in dppc liposomes, to change a membrane potential of algae cells. the measure of the interaction of nta with cucumber cells were potassium leakage, chlorophyll content and inhibition of growth of hypocotyls. antioxidative abilities of nta were determined by measuring their ef ciency to protect erythrocytes against membrane lipid oxidation induced by uv irradiation and by comparising their antioxidative ef ciencies with that of trolox (vitamin e analogue) in chromogen experiments. the mostly widely employed mechanism of drug extrusion in bacteria is via membrane transport proteins called ef ux pumps. in gram-negative bacteria, multidrug resistance is conferred by tripartite complexes, rather than by a single transport protein. through these systems, a wide range of substrates is expelled from the cytoplasm, through the periplasmic region, to the exterior of the cell. among these complexes, the acrab/tolc system in escherichia coli is formed by an inner membrane ef ux pump, acrb, an outer membrane protein, tolc, and a periplasmic protein known as an adaptor, acra. the components of this complex are studied, in order to provide insights into drug transport in bacteria. here we present a dynamics study on mexa, homologue of acra from pseudomonas aeruginosa. the protein has been studied by molecular dynamics simulations in bulk water. a structural adjustment by the periplasmic protein is required in order to engage both the bottom part of the om protein and the top region of im protein. the dynamics on mexa reveals a exible behaviour of the protein in water. the major concerted motions observed are the hinge-bending of the two domains, and the rotation of the -barrel domain. these can be related to the adaptation of mexa (and acra) to the om and im proteins during the process of assembly in forming the complex, and during the opening of the channels. electrophysiologic study of ap in chara corallina -indication of its biochemical nature the speci c conductance's of aqueous solution of electrolytes (viz.naf, nacl, nano , na so , kf, kcl, kno , k so , mgcl , cacl , fecl , mncl ,crcl ,cucl , cocl , )have been measured across peritoneum at temperatures between( - ) c.conductance attains a maximum limiting value at higher concentrations for each electrolyte due to a progressive accumulation of ionic species within the transmembrane region. the membrane becomes more and more conductive to incoming ions and attaining a limiting value due to the fact that an electrically neutral pore, which is speci c for a particular ion, is unlikely to contain more than one type of ion. consequently, at high electrolyte concentration, the pore saturates and the conductance's approaches a limiting value. the values of speci c conductance measured follow the sequence for anions; so > cl > no > f . whereas for the cations the sequence is k > na ; ca > mn > co > cu > mg ; cr > fe . the energy of activation for the cations as well as for the anions follows the sequence (for cations): the low temperature ( k) chlorophyll uorescence, photochemical activity, oxygen ash yield and oxygen burst decay of thylakoid membranes with different organization of the light-harvesting chlorophyll a/b complex of photosysytem ii (lhcii) were investigated after freeze-thaw cycle in criotoxic and cryoprotective medium. the increase of lhcii oligomerization, which is associate with signi cant reduction of the surface charge density of the thylakoid membrane, correlates with lower extent of freezing damage of the photosynthetic apparatus, when the procedure is carried out in cryotoxic medium (nacl). in the presence of the cryoprotective compound (sucrose) freezing damage is less pronounced and is not affected by the degree of the lhcii oligomerization. the mechanisms of damage and protection of photosynthetic apparatus in the process of freeze-thaw treatment are discussed. spectral and redox characterization of the novel heme ci in the cytochrome b f complex . this is an indication of different spin delocalization in the primary donor, for the mutant being typical of a monomeric oxidized bchl. considering the fact that the properties of both isolated and membrane-associated mutant rcs were similar, we conclude that missing bchl molecule from the mutant rc was the result of the introduced mutation but not of the protein puri cation procedure. authors acknowledge the support by the russian brf. we created two site-directed mutants, a s and l i in the d protein of photosystem ii in thermosynechococcus elongatus. both mutations are within the binding pocket of the primary quinone acceptor (q a ). we investigated the effects of the mutations in vivo and in isolated psii. while the l i mutant exhibits characteristics similar to the wild type, the a s mutation effects q a charge recombination measured by thermoluminescence and uorescencedecay. these results strongly indicate that the a s mutation induce a shift in the redox potential of q a . the a s accelerates the rate of photoinhibition, an effect consistent with the negative shift in the redox potential. epr was used to measure the temperature dependence of the electron transfer from q a to q b in the a s mutant. it was found to be indistinguishable from the wild type despite the difference in the midpoint potential of q a . this is taken as an indication as a gating mechanism on the acceptor side of psii similar to that in bacterial reaction centers. protochlorophyllide oxidoreductase takes an abnormal reaction pathway below the glass transition g. durin , d. j. heyes , c. n. hunter , d. bourgeois ibs and esrf, grenoble, france, krebs institute and r hill institute for photosynthesis, shef eld university, shef eld, uk motions through the energy landscape of proteins lead to biological function. at temperatures below a dynamical transition ( - k), the activity of some proteins cease. in this work, we describe an enzyme that, instead, engages into a non-productive pathway below k. protochlorophyllide oxidoreductase (por) catalyzes the reduction of protochlorophyllide (pchlide) into chlorophyllide (chlide), a key step in chlorophyll biosynthesis. por is one of the two enzymes known to require light for catalysis. when illuminated with gentle light at k, the complex of t. elongatus por with pchlide and nadph transforms into a nonuorescent intermediate. upon warming, several uorescent intermediates develop, and at k chlide is released. when illuminated at temperatures below k, por behaves differently. if gentle light is used, the reaction can not start. instead, if a blue laser source is used, the initial complex disappears, like at k. however, upon warming, a new intermediate develops that uoresces at nm and leads to a dead-end product. by using uorescence microspectrophotometry, we have measured the solvent glass transition temperature of the system to be k. the solvent glass transition, possibly controlling a por dynamical transition, may be the determinant that switches the enzyme reaction pathway from a non productive to a productive one. the nonproductive pathway results from a two-photons absorption mechanism, whereas the productive pathway is a one-photon mechanism. sensory rhodopsin ii from n. pharaonis (npsrii) forms a complex with its cognate transducer nphtrii in a : stoichiometry . light activation of npsrii leads to a movement of helix f which triggers a rotation of tm in nphtrii . the mechanism of signal transduction through the hamp region to the cytoplasmic domain of the transducer is still unknown. structural information exists for the transmembrane and cytoplasmic regions, however the hamp domain is not yet characterized. in order to obtain structural information on this domain, twenty-four residues in the membrane adjacent region ( - ), and six residues in the following region were spin labeled and investigated by cw and pulsed x-band epr. to analyze the overall architecture of the complex, doubly spin labeled variants between the transducer and the receptor were also engineered. depending on their function, the absorption spectra of rhodopsins can be tuned by the protein over a wide range. a major determinant for spectral shifts between different rhodopsins are electrostatic interactions between the chromophore retinal and the protein. we compute and compare the classical electrostatic potential at the retinal of three archaeal rhodopsins: bacteriorhodopsin (br), halorhodopsin (hr), and sensory rhodopsin ii (srii). these proteins are an excellent test case for understanding the spectral tuning of retinal. the absorption maxima of br and hr are very similar, while the spectrum of srii is considerably blue shifted. we nd that the electrostatic potential is similar in br and hr, but differs signi cantly in srii. a quantum mechanical model of a particle in a box with a step potential can qualitatively relate the differences between the electrostatic potentials of the proteins to the relative shifts of their absorption maxima. by decomposing the electrostatic potential into contributions of individual residues, we could identify six residues that are responsible for the differences in electrostatic potential between the proteins. three of these residues are close to the retinal, while the other three residues are more then angstroem away from the retinal. the counterion of the schiff base, which is frequently discussed to be involved in the spectral tuning, does not contribute to the dissimilarities between the electrostatic potentials. effect of uv-a radiation on thylakoid membranes with different organization p. ivanova, a. dobrikova, t. markova, s. taneva, e. apostolova institute of biophysics, bulgarian academy of sciences, so a, bulgaria the effect of uv-a ( - nm) radiation on the energy transfer and the photosynthetic oxygen evolution of thylakoid membranes from pea mutants was investigated. the membranes have different pigment composition, stoichiometry and organization of pigment-protein complexes. the aim of our work was to nd out whether uv-a induced damage is affected by the altered content and/or oligomerization of the main light-harvesting chlorophyllprotein complex (lhcii) in thylakoid membranes. the data for the effect of uv-a radiation on the oxygen evolution demonstrate that: (i) the inhibition of photosystem ii (psii)-mediated electron transport and ash-induced oxygen yields strongly depend on the amount of lhcii; (ii) the increase of the s o populations of psii centers in darkness is more pronounced in thylakoid membranes with smaller amount of lhcii; (iii) the inhibition of the oxygen evolution is related to the reduced number of the functionally active psii centres; (iv) the degree of impairing of active psii centres depend on the amount and oligomerization of lhcii. the results also show that the altered content and organization of lhcii in uence the uv-a light-induced changes in the energy transfer between psii and psi and within the supramolecular lhcii-psii complex. the effects of uv-a radiation on leaves and isolated thylakoid membranes are compared. sudden polarisation -a large change in the electric dipole moment between the excited and the ground state -is a well-known phenomenon for retinal chromophore. some early models of the energy transduction mechanism in bacteriorhodopsin (br) even attribute a primary functional role of that. however, it was apparently unrecognized that the maxwell theory intuitively predicts the appearance of an ultrafast transient electromagnetic radiation due to this dipole moment change. here we show that the existence of this type of radiation can be derived from semiclassical quantum electrodynamics as a second order phenomenon. in optical terms it corresponds to the previously unstudied resonant case of optical recti cation. recently we experimentally observed a major component in the fs coherent infrared emission of oriented purple membranes of br corresponding well to this effect (groma et. al, proc. natl. acad. sci. , , ). our theory predicts that such a signal holds detailed information on the dynamics of excited state polarization, opening a new branch of impulsive spectroscopy on asymmetric systems. beyond optical recti cation we found a complex phase a coherent oscillation living for a few ps, i.e. much longer than the excited state of br. fitting analysis resulted in at least seven vibrating modes in the - cm region, while windowed fourier transform indicated time-dependent frequency distribution. a. ghignoli, g. cercignani, s. lucia, g. colombetti istituto di bio sica, cnr -pisa, dipartimento di fisiologia e biochimica, università di pisa, italy the life cycle of ophryoglena ava, a histophagous ciliate dwelling in fresh waters, reportedly includes several stages that feature morphology changes and different phototactic responses. previous studies on the phototactic responses in o. ava during its phase of maximal positive phototaxis led to an action spectrum with two main peaks at and nm, and a minor peak at nm. starting from those results, we analyzed the phototactic response at various cell ages, using three broad-band interferential lters (fwhm = nm) centred respectively at , and nm, and constructed dose-effect curves for each band. a higher photosensitivity at nm, and lower photosensitivies with the other two lters ( and nm) have been observed at any cell age. however, the photosensitivities in the blue and orange regions show a different time course vs. cell age with respect to the photosensitivity in the green region. measures were also carried out on cells whose feeding cycle was altered by a -day starvation (a double time with respect to the standard protocol) before being fed at t = . the maximal photoresponse values reached by starved cells are lower than the highest values reached with standard cultures; in other words, a general reduction of the phototactic response is observed. these results suggest that, while feeding optimally induces cell division, it does not generally reset all cellular functions. a. quaranta , f. lachaud , y. pellegrin , p. dorlet , m.-f. charlot , s. un , a. aukauloo , w. leibl service de bioénergétique, cea-saclay, bât. , gif-sur-yvette cedex (france), laboratoire de chimie inorganique, bât. , université paris-sud, orsay (france) coordination complexes based on a photoactive rutheniumpolypyridyl moiety linked to simple, rigid ligands with binding sites for transition metals, are developed to mimic the light induced charge separation and water oxidation processes taking place in the photosynthetic apparatus. inspired by the structure around the donor side of photosystem ii a family of phenanthroline based ligands holding an imidazole, a phenol or an indole unit simulating the amino acids histidine, tyrosine and tryptophan in the oxygen evolving complex, were developed as models for proton-coupled electron transfer. in some of the molecules investigated the hydrogen bonding interaction present in the natural system is reproduced. combined data from photophysical, spectroelectrochemical studies and dft calculations evidenced the photogeneration of a phenoxyl or a tryptophan radical upon excitation of the chromophore in presence of an external electron acceptor, therefore mimicking the electron trade between p and tyrz-hist . finite element model to predict the electric potential distribution in ps i containing vesicles c. p. a. pennisi , e. chemineau , e. greenbaum , k. yoshida center for sensory motor interaction, aalborg university, denmark, chemical sciences division, oak ridge national laboratories, usa, facultad de ingeniería, uner, argentina photosynthetic reaction centers (rc) are integral membrane proteins and molecular photovoltaic structures. recently, it was suggested their use as triggers of voltage-gated ion channels in excitable cells, where a certain voltage threshold has to be reached to evoke a response (kuritz et al., ieee trans. nanobiosci. in press ). experimental studies with rc's reconstituted in lipid vesicles have shown different values of transmembrane voltage, depending on parameters like light intensity, rc concentration and membrane passive properties. ultimately, the purpose of this work is to have a tool to estimate the proximity, number and density of rc's required near a voltage-gated channel to activate an excitable cell. as a starting point, we aim to predict the spatial distribution of the membrane potential in vesicles. a nite element model was realized using a commercial package (femlab, comsol a/s). the three-dimensional distribution of the electrical potential near a single rc in the surface of a spherical vesicle was calculated. in terms of density, in conditions of saturating light, a minimum of , e rcs/cm is needed to develop a potential of mv, capable to activate voltage-gated sodium channels. microsecond time-resolved x-ray diffraction study of purple membrane t. oka , k. inoue , m. kataoka , n. yagi faculty of science and technology, keio university, japan, japan synchrotron radiation research institute (jasri), japan, nara institute of science and technology, japan the structural changes in the photoreaction cycle of bacteriorhodopsin, a light-driven proton pump, was investigated at a resolution of Å by time-resolved x-ray diffraction experiment utilizing synchrotron x-rays from an undulator of spring- . the xray diffraction measurement system, used in coupling with a pulsed yag laser, enabled to record diffraction pattern from purple membrane lm at a time-resolution of µsec over the time domain of µsec to msec. the low temperature ( k) chlorophyll uorescence, photochemical activity, oxygen ash yield and oxygen burst decay of thylakoid membranes with different organization of the light-harvesting chlorophyll a/b complex of photosystem ii (lhcii) were investigated after freeze-thaw cycle in cryotoxic and cryoprotective medium. the increase of lhcii oligomerization, which is associate with signi cant reduction of the surface charge density of the thylakoid membrane, correlates with lower extent of freezing damage of the photosynthetic apparatus, when the procedure is carried out in a cryotoxic medium (nacl). in the presence of a cryoprotective compound (sucrose) freezing damage is less pronounced and independent of the degree of the lhcii oligomerization. the mechanisms of damage and protection of photosynthetic apparatus during the freeze-thaw process are discussed. we have studied the effect of a cytokinin meta-topolin (mt, m) on senescence-induced changes in the photosynthetic apparatus of detached primary leaves of wheat (triticum aestivum l. cv. hereward). the senescing leaves were kept under continuous light conditions. mt signi cantly slowed down the senescenceinduced decrease in chlorophyll content and markedly stimulated violaxanthin zeaxanthin (z) conversion. the high z content was maintained even after an hour in darkness. mt treatment caused also the appearance of an emission band f peaking at - nm. this emission band is attributed to aggregates of lightharvesting chlorophyll a/b-binding proteins (lhc), the production of which is associated with a higher z content. the presence of lhc aggregates in mt treated leaves was documented also by electron microscopy imagines. besides the lhc aggregation, mt induced also a decrease in photosystem i content which was documented by electrophoresis and k-uorescent spectra. supported by grants frvs / and msm . recently high resolution images of bacterial photosynthetic membranes have revealed the organization of membrane proteins in these native membranes. the organization revealed is remarkable, and all the more so when we realize that these specialized, protein rich, membranes differentiate from the cytoplasmic membrane which has a more complex composition and is richer in lipids. analysis of the protein organization in these specialized membranes from several different bacteria suggest that the organization results from a phase separation of several different contiguous phases. in order to better understand our observations we have undertaken an examination of the different phase behaviors that are possible for membrane proteins considered as a two dimensional colloid. monte-carlo modeling of the phase diagram of this system shows the importance of interaction distance in the determination of system behavior. transcription of our observations on the model systems to the photosynthetic membranes suggests that electrostatic and elastic forces in the membrane are of particular importance in determining the high level order of membrane proteins. the recent crystal structure of photosystem i (psi) from synechococcus elongatus shows two quasi-symmetric branches of potential electron transfer cofactors including primary donor (dimer of chlorophylls p ), monomeric chlorophylls a and a and quinone a , bound to the psaa/psab heterodimer. so far, it is not clear if both potential electron transfer pathways are active in this process or only one of them. to solve this issue, we studied a set of mutants with methionine coordinating the primary electron acceptor, a , replaced by histidine, leucine, or serine in either of two branches. our results obtained with a technique of femtosecond transient absorption spectroscopy show that both branches are equally active in electron transfer. mutation in either branch slows the forward electron transfer between a and a from ps in wilde type psi to - ns in all these mutants. this strong effect is explained by signi cant change in the redox midpoint potential and change in the position of a by the mutations. i. s. zaharieva department of biophysics and radiobiology, faculty of biology, university of so a an approach to the investigation of structural and functional properties of the photosystem ii supramolecular complex in native photosynthesizing objects based on the registration of delayed chlorophyll a uorescence is developed. using a disc phosphoroscope, we register simultaneously: i) changes of the intensity of millisecond delayed uorescence (decayed in . - . ms time range) during the transition of the photosynthetic apparatus from dark to lightadapted state; ii) changes of the intensity of delayed uorescence decaying in different subintervals of the investigated time range; iii) dark relaxation curves at different moments of the transition; iv) changes of the intensity of prompt chlorophyll a uorescence. the analysis of these data allows the correlation of the delayed uorescence characteristics to particular processes occurring in the photosystem ii complex -proton or electrical gradient accumulation, changes in the redox state of quinone acceptors, changes in the pigment-protein complexes caused by different stress factors, for example temperature. three-dimensional structure of major light-harvesting antenna of photosystem ii from cucumber h. yan , z. liu , k. wang , t. kuang , j. zhang , l. gui , x. an , w. chang national lab of biomacromolecules, institute of biophysics, chinese academy of sciences(cas), datun road, chaoyang distr., beijing , china, lab of photosynthesis and environmental molecular physiology, institute of botany, cas, nanxincun, xiangshan, beijing , china the major light-harvesting antenna complex of photosystem ii (lhc-ii), the most abundant integral membrane protein, functions in light capture, energy transfer/distribution and photoprotection. lhc-ii from different species or conditions shows different spectral properties and variation in polypeptide and pigment components. this indicates some speci c function-related alterations in the organization of lhc-ii. here we report a . -Å crystal structure of cucumber homo-trimeric lhc-ii, organized in a perfect virus-like icosahedral particle. the electron-density map shows the reasonable existence of a chlorophyll (chl) a/b mixed binding site in the complex. the occurrence and locus of lactucaxanthin (lac) was seen directly for the rst time. based on the credible structure information, a mechanism of the energy transfer, regulation and excess excited energy dissipation under high light condition was proposed. coherent anti-stokes raman scattering microscopy (cars) is a new approach for chemical imaging of molecular systems within cells and tissues, with high sensitivity, high spatial resolution, and three dimensional sectioning capabilities, without using uorophores that are prone to photobleaching. this technique permits to map selectively molecular species, by using vibrational properties of their chemical bounds. the epi detected (e-cars) and forward detected (f-cars) intensities depends on the shape, the size of the sample, as well as the index of the solvent. in this presentation, after introducing the cars microscopy technique, we show the rst cars studies of the refractive effect of the sample, comparing the e-cars and f-cars signals for different diameters of polystyrene beads, in different refractive index solvents. we present several simulations, comparing forward-detected and backward-detected signals in different sized polystyrene beads, embedded in different index solvents, and we show that, the backwardre ected f-cars dominates the experimentally epi-detected signals. furthermore, we demonstrate experimentally and theoretically that the maxima of forward and epi-detected signals are generated at different positions along the z axis in the sample. we nally discuss how index mismatch in cells can alter cars images. the effects of static magnetic elds on humans have been the subject of continuous investigations. since one of the major static magnetic eld sources is nuclear magnetic resonance imaging (mri), the present study aimed to investigate the effects of . t magnetic eld that is produced by mri on humans. the study is carried out with voluntary and healthy young men from to years of age. the subjects informed about the purpose of the study at the beginning. the subjects exposed to minutes of . t static magnetic eld by means of putting the subjects into the magnetic resonance unit. ml blood was taken from each subject one minute before and one minute after exposure. t and t relaxation times and trace elements were measured in of pre and post exposure plasma of the subjects. the obtained post exposure values were compared with pre-exposure values of the subjects. pre and post exposure results were analyzed by means of student t-test. evaluation of tumor response of breast cancer patients by diffusion weighted mri k. a. danishad , v. seenu , u. sharma , p. k. julka , g. k. rath , n. r. jagannathan department of nmr, department of surgery, department of radiotherapy, all india institute of medical sciences, new delhi, india diffusion weighted mr imaging (dwi) measures the diffusion of water molecules in tissues and is quanti ed by apparent diffusion coef cient (adc). dwi can be used to differentiate tumors from normal tissue and also can be used to monitor the response of tumor to chemotherapy. thirteen healthy volunteers and twelve patients were recruited for the study. dw images were obtained prior to therapy (n= ) and after three cycles of therapy (n= ). the mean adc value of tumors ( . x . mm /s) was signi cantly less (p < . ) compared to the normal tissue ( . x . mm /s). decrease in adc in tumor is due to an increase in the cellularity which restricts the diffusion of water molecules. in patients receiving neo-adjuvant chemotherapy, the adc values were higher ( . x . mm /s) and were closer to that of the normal tissue (p < . ), indicating response of the tumor to chemotherapy. the post-therapy increase in adc is due to the cell damage caused by the therapeutic agents which increases the fractional volume of the interstitial space, causing an increase in the mobility of water. the study showed that dwi can be used non-invasively to assess the response of breast cancer patients to neo-adjuvant chemotherapy. quanti cation by optical imaging of gene electrotransfer in mouse muscle and knee optical imaging was evaluated for monitoring and quanti cation of the mouse knee joint and tibial cranial muscle electrotransfer (et) of a luciferase encoding plasmid. the substrate of luciferase (luciferin) was injected i.p or locally in the muscle or the knee joint. luminescence resulting from the luciferase-luciferin reaction was measured with a cooled ccd camera. luminescence of the knee joint and muscle were higher after local than after i.p injection of luciferin, but both measurements were highly correlated. local injection procedure was adopted. a signi cant correlation was observed between measurements in vivo and in vitro on the same muscle. reproducibility of individual luminescence measurements was also veri ed, and the luminescence levels were clearly dependant of the amount of plasmid injected. in vivo luciferase in the electrotransfered knee joint was detected for two weeks. intramuscular electrotransfer of . or µg of plasmid led to stable luciferase expression for days, whereas injecting µg plasmid resulted in a luminescence fall two weeks after electrotransfer. these decreases were, at least partly, related to the production of antibodies against luciferase. thus, optical imaging was shown to be a relevant technique to quantify variations of luciferase activity in vivo in one given tissue. furthermore, evaluating the effective amount of luciferase in tissues from in vivo luminescence levels requires calibration since it relies on conditions of the enzymatic reaction and light absorption. acute effect of corticosterone on nmda receptormediated ca + elevation in mouse hippocampal slices m. saito , s. sato , h. osanai , a. hirano , y. komatsuzaki , s. kawato department of physics and applied physics, college of humanities and sciences, nihon university, department of biophysics and life sciences, graduate school of arts and sciences, university of tokyo corticosterone (cort) is a principal glucocorticoid synthesized in the rodent adrenal cortex and secreted in response to stress. we examined the rapid effects of cort on n-methyl-d-aspartate (nmda) receptor-mediated ca signals in adult mouse hippocampal slices by using ca imaging technique. application of nmda caused a transient elevation of intracellular ca concentration followed by a decay to a plateau within sec. the min preincubation of cort induced a signi cant decrease of the peak amplitude of nmda-induced ca elevation in the ca region. the rapid effect of cort was induced at a stress-induced level ( . - µm). because the membrane non-permeable bovine serum albuminconjugated cort also induced a similar rapid effect, the rapid effect of cort might be induced via putative surface cort receptors. in contrast, cort induced no signi cant effects on nmdainduced ca elevation in the dentate gyrus. in the ca region, cort effects were not evaluated, because the marked elevation of nmda-induced ca signals was not observed there. in vivo subcellular structures recognized with phase k. nagayama , r. danev , n. usuda , y. kaneko , k. nitta , a. nakazawa , k. atsuzawa , m. tanaka , m. setou okazaki institute for integrative bioscience, okazaki, japan, fujita health university, school of medicine, toyoake, aichi, japan, saitama university, saitama, japan, tokyo metropolitan institute of gerontology, itabashi-ku, tokyo, japan phase contrast transmission electron microscopy has been developed to enable a high contrast and a high resolution observation for unstained ice-embedded samples. to enhance the image contrast, two methodologies have already been developed; i) scattering contrast for stained samples with small aperture diaphrams and ii) defocus contrast for unstained or stained samples with deep defocusing. the former prevails in histochemical sciences and the latter is popular in electron crystallography. both methods, however, have a common drawback that the contrast is only improved by impairing the image quality. this drawback can be removed with use of the phase contrast method using phase plates, which has traditionally been used in visible light microscopy. due to the severe obstacle of the charging of phase plates, however, the idea has not yet been materialized. we have solved the phase-plate charging problem. an experiment kv with tem for a whole cell from cyanobacterium unstained and ice-embed ful lled the expectation. only weak and vague contrast was obtained for the conventional image of the cell even with a very deep defocus. contralily a high-contrasted image has appeared for phase contrast images, where various ne structures are clearly recognized. this may be a rst example to observe nanometer scale structures in details in the intact cell. other examples including intact state intravesicular structures will be shown. j. lichtenberger, p. fromherz max-planck-institute for biochemistry we cultured bovine chromaf n cells on an array of electrolyteoxide-silicon eld-effect transistors (eos fet) and monitored granule secretion. by stimulation with barium chloride, vesicles are released into the narrow sheet of electrolyte between the chip surface and the plasma membrane. the interaction of released protons with the silicon dioxide surface of the chip alters the threshold voltage of the transistor and gives rise to a measurable signal. simultaneously performed measurements with a carbon bre showed a correlation of the transistor signals and amperometric current traces. we conclude that the transistors are able to monitor exocytosis on a single vesicle level. to elucidate the role of protons, we destroyed the proton gradient across the vesicle membrane by nigericin and valinomycin. as a result a massive reduction of the transistor signals was induced, whereas there was only little change of the amperometric records. we conclude that released protons are responsible for the detection of vesicles with transistors. the individual transistor records of vesicle exocytosis can be explained by combining the dynamics of the exocytotic event with the diffusion in the cell-chip junction. transistor recording of exocytosis does not depend on the electrochemistry of transmitters. as many kinds of exocytotic vesicles contain a large amount of buffered protons it can be applied to numerous kinds of exocytotic events, independent on the nature of the transmitter. we tried various solvents for the solubility of the uorescent product, and found that the product was insoluble in water and most organic solvents. a quite bright uorescence emitted by the particles was observed by uorescent microscope when emitted by uv nm. sem indicated that the size of the particles was µm µm, depending on the reaction time and phospholipid concentration in hexane solution. endothelial cells from human vein grew better on the surface prepared from the particles than the culture plate, implies a possible application as a new type of biomaterial as a coating material for medical devices, and as a uorescent tracer for human bodies. confocal microscopy of the phototactic ciliate f. salina. fabrea salina is a marine heterotrich ciliate, which dwells in salt ponds. in previous works we have described the phototaxis and the uorescence properties of a hypericin like endogenous pigment in an albino strain. we have recently obtained a heavily pigmented strain from the saline of torre colimena (taranto, italy). we have used confocal microscopy to characterize the uorescent properties of this strain and to compare them with those of the albino strain. the results obtained by one and two photon confocal microscopy show that, as in the albino strain, the uorescence intensity of the pigment is higher in dead cells than in the living cells. the excitation and emission spectra are quite similar in the two strains and this is also true for the uorescence lifetime, which is about ns. all together, these measurements indicate that the pigment of the new strain belongs to the family of hypericin-like chromophores. the analysis of different confocal planes shows that the pigment is localized not only in granules under the somatic membrane in the cellular body, as currently thought, but also in the cilia. some experiments of fotobleaching "in situ" con rm this result, that might have important implications in the understanding of the mechanisms of the photomotile responses of f. salina and probably of other heterotrich ciliates. there is no doubt that modern physician should have the knowledge of basic sciences as physics, chemistry and biology. furthermore, the biophysics is incorporated into the curriculum of most european medical schools. at medical school in zagreb, the course of physics and biophysics is positioned in rst and forth year of study. the students learn basic physics phenomena of structure of matter, mechanics, thermodynamics, electromagnetism, optics and acoustics applied on the human body. additionally, the interactions of the body with the surrounding are thoroughly discussed as the basis for different diagnostic methods. the arguing at our school is still going on where to include the content of this course. should it be the autonomous course or the part of physiology and radiology courses in the problem based learning approach?! so far at zagreb medical school the biophysical courses are autonomous structured according to the biophysics programs at other european universities. the highlighting is on seminar work and lab, encouraging the students for individual learning. the seminars are made more vivid and instructive for students by inclusion of different model devices constructed in our department learning on science is also learning how scienti c knowledge is produced. in this sense, issues related to the dynamics of science should be brought into focus in science education. the idea has support in the "declaration on science and the use of scienti c knowledge", particularly in the statement that "science curricula should include science ethics, as well as training in the history and philosophy of science and its cultural impact". thinking on rst year undergraduate students, we are interested in an integrated approach of that kind of themes. this presentation describes a practical teaching module included in the basic biophysics course for biochemistry majors. organized in case studies, it deals with stories of biophysics (and biochemistry), and addresses the role of the biophysical approach in the progress of the life sciences. the module follows the whole course, and consists in small exercises on the way a given understanding has been constructed explored within the practice trend of contemporary science studies. examples of the chosen stories -anchored in the subjects covered in the lectures of the course -relate to the search for the mechanism of energy production through atp-synthase, the development of radioactive labelling techniques, and the discovery of protein water channels. beyond their value as cultural legacy and as motivating tools, the insight they might provide is vast. axiomatic theory of biophysics q. zhao china rehabilitation research centre, beijing, china. until now, all approaches to interpret biology by applying principles of physics have been announced failure. the achievement of biophysics is limited in area to provide essential tools for biological research and biophysics is far from to be the basic theory of biology. to resolve it requires the fundamental research about the logic features of biophysical processes of biology. we promoted system logic and then protein thermodynamics structure theory. based upon it, the axiomatic theory of biophysics and biology could be developed. our result shows that the real understanding of biology or biophysics must be constructed based upon new thinking methods. a successful ligand-receptor docking methodology depends strongly in the ef ciency of the global optimization algorithm used to explore the ligand conformational space. in this work we have implemented and analyzed the performance of a new exible ligand-receptor docking methodology. this methodology uses as optimization method a multisolution version of the generalized simulated annealing algorithm adapted to problems with box constraints. a grid-based methodology, considering the receptor rigid, and the gromos classical force eld are used to evaluate the ligand-receptor scoring function. the methodology was tested in redocking (ligand within it own protein conformation) and cross-docking (ligand within another protein conformation) experiments for ve hiv protease-ligand complexes with known threedimensional structures. all ligands tested are highly exible, having to conformational degrees of freedom. the implemented docking methodology was able to redock successfully all exible ligands with a success ratio % and a mean rmsd lower than . Å with respect to the corresponding experimental structures. in the cross-docking experiments we observed a strong dependence of the mean success ratio with respect to the protein structure used as reference. in situations we observed a mean success ratio % and % in cases among the possible ones. s. aida-hyugaji , h. nakagawa , j. nomura , m. sakurai , d. tokushima , t. takada , u. nagashima , t. ishikawa tokai university, japan, tokyo institute of technology, japan, nec corporation, japan, national institute of advanced industrial science and technology, japan irinotecan is a widely-used antitumor drug that inhibits mammalian dna topoisomerase i. however, overexpression of abcg can confer cancer cells resistance to sn- , that is, the active form of cpt- . in the present study to develop a platform for the molecular modeling to circumvent drug resistance associated with abcg , we have characterized a total of fourteen new sn- analogues by some typical properties, which were evaluated by molecular orbital (mo) calculations and neural network (nn) analysis. the nn was rst applied to estimate hydrophobic properties (logp) of the analogues. thereafter, the electrostatic potential (esp) and the solvation free energy ( g) were evaluated by mo calculation. these indexes were found to be well correlated with the drug resistance ratio experimentally observed in abcg -overexpressing cells. it is suggested that hydrophilic analogues carrying oh-or nh -groups are good substrates for abcg and therefore exported from cancer cells. in contrast, sn- analogues with cl or br atom at those positions have similar logp values and high af nities toward the putative active site of abcg , however they were not substrates of abcg . from these results, it is strongly suggested that hydrogen bond formation with oh-or nh -groups are critically involved in the transport mechanism of abcg . a. agopian , j. depollier , e. gros , g. aldrian-herrada , p. clayette , n. bosquet , g. divita crbm-cnrs, route de mende, montpellier, france., spi-bio-cea fontenais aux roses, france reverse transcriptase (rt) plays an essential role in the replication of hiv and constitutes the main target for the development of aids therapies. the biologically active form of hiv rt is a heterodimer of two subunits, p and p , each consisting of distinct subdomains: the ngers, the palm, the thumb, the connection and the rnase h subdomain, the latter only present in p . we have demonstrated that formation of fully active rt is a two-step process involving rapid association of the two subunits (dimerization) followed by a conformational change (maturation). thanks to the crystal structure of rt we have identi ed a new class of inhibitors based on short peptide motifs derived from the dimer interface. we rst identi ed a short mer peptide (pep- ) derived from the tryptophanrich motif of the connection subdomain that blocks dimerization of rt and ef ciently abolishes hiv- replication. pep- interacts preferentially in a pocket involving residues trp and phe on p . we then designed mer peptides derived from the thumb domain which inhibit rt maturation as well as viral replication when delivered into cells. taking into account these results we propose that dimerization of rt constitutes a potential target for the design of more speci c new antiviral drugs. . the success of gene therapy largely relies on the availability of vectors that would deliver the genetic material ef ciently to the target cells with a minimal toxicity. in this context, our purpose was to evaluate as possible vectors a series of newly synthesized low molecular weight ( kda) chitosan derivatives grafted with dodecenoyl (ddc) groups at different percentages ( , , and %). in the absence of dna, the critical micellar concentration (cmc) of these derivatives in mm mes buffer ph . was found to be strongly dependent on the percentage of ddc but not on ph or salt concentrations. this indicates that the ddc groups confer to the chitosan derivatives the potency to self-assemble probably in micellar structures: a property that may dictate the formation and the structure of their complexes with dna. next, we investigated by quasielastic light scattering the size and the surface charge of complexes of plasmid dna with these derivatives at different ph, salt concentrations and n/p ratios (expressed in charged units of chitosan amines to dna phosphates). we found the smallest and more positively charged complexes were obtained at ph . and n/p= in the absence of salt: a condition where the chitosan derivatives were fully protonated and in excess over the dna phosphate groups. biophysical and biological examination of dna/lipids complexes particles of virus-like structure designed for in vivo gene transfer d. durand , m. schmutz , b. lebleu , a. r. thierry lure, centre universitaire paris sud, orsay, france, institut henri sadron, strasbourg, france, laboratoire des défenses antivirales et antitumorale, umr , montpellier, france the structure of complexes made from dna and suitable lipids (lipoplexes lx) was examined by cryo electron microscopy. we observed a distinct concentric ring-like pattern with striated shells when using plasmid dna. these spherical multilamellar particles have a mean diameter of nm with repetitive spacing of . nm with striation of . nm width. small angle x-ray scattering (saxs) con rmed cryoem data and revealed repetitive ordering of . nm, suggesting a lamellar structure containing at least a dozen layers. this concentric and lamellar structure with different packing regimes was also observed by cryoem with linear dsdna, ssdna and oligodeoxynucleotides. for the rst time, dna chains could be visualized in dna/lipid complexes. such speci c supramolecular organization is the result of thermodynamic forces, which cause compaction to occur through concentric winding of dna in a liquid crystalline phase. cryoem of t phage dna packed either in t capsides or in lipidic particles showed similar patterns. saxs suggested an hexagonal phase in lx-t dna. thus, both lamellar and hexagonal phases may coexist in the same lx preparation or particle and transition between both phases may depend upon equilibrium in uenced by type and length of the dna used. organization of such nucleotidic supramolecular assemblies is relevant for prebiotic chemistry. engineering self-assembly peptides for targeted delivery of therapeutics and imaging agents s. s. dhadwar, m. sung, k. kawamura, j. gariépy department of medical biophysics, university of toronto, canada peptide-mediated delivery systems have recently emerged as a means to substitute or augment conventional drug and gene delivery technologies. these approaches are versatile and easily designed to incorporate a number of speci c attributes required for ef cient delivery of therapeutic and imaging agents. in particular, self-associating peptide domains can be utilized to construct stable and structurally well-de ned protein-like assemblies displaying a series of cell-routing functions. more speci cally, a peptide-based self-assembling intercellular delivery vehicle was designed by incorporating the -residue long tetramerization domain of the human tumor suppressor protein p (hp tet). the resulting peptide tetramer displays termini within its structure that allows for the simultaneous presentation of distinct cell targeting signal or functional domains. the fusion of polycationic sequences to the hp tet domain promotes the cellular import of the resulting constructs into eukaryotic cells. this internalization event was dramatically enhanced for such multivalent peptides in relation to their monomeric counterparts. peptides containing a nuclear localization sequence along with a polycationic sequence were found to shuttle reporter plasmids ef ciently to the nucleus of cells. these results have important implications in the design and construction of novel targeted delivery vehicles. mechanisms of non-covalent peptide mediated cellular delivery of therapeutics: a biophysical study s. deshayes , m. c. morris , a. heitz , p. charnet , g. divita , f. heitz crbm -cnrs fre montpellier france, cbs -cnrs umr -inserm u montpellier france two different cell-penetrating peptides mpg and pep- were shown to promote non-endosomal intracellular delivery of non-covalent bound cargos, namely nucleic acids and proteins; respectively. in order to identify the peptide mediated internalization pathway, we undertook conformational investigations of both peptides with and without associated cargos and checked the conformational consequences of the presence of phospholipids. from the conformational point of view, pep- behaves differently from mpg. cd analysis revealed a transition from a non-structured to a helical conformation upon increase of the concentration while mpg remained nonstructured. determination of the structure by nmr showed that in water, it's a-helical domain extends from residue to . cd and ftir indicated that pep- adopts a helical conformation in the presence of phospholipids while mpg is in a -sheet form. adsorption measurements performed at the air-water interface were consistent with the helical form. pep- did not undergo conformational changes upon formation of a particle with a cargo peptide. in contrast, we observed a partial conformational transition when the complex encountered phospholipids. for mpg, interactions with nucleic acids generated a partial folding into -sheet which was more pronounced in the presence of lipids. electrophysiological measurements showed that both peptides, whether associated or not with their cargo, can induce transmembrane pore-like structures. self-assembly of hydrolysed alpha-lactalbumin into nanotubes j. f. graveland-bikker , k. g. de kruif nizo food reseach, ede, netherlands, van´t hoff laboratory, netherlands nanotubes are formed by self-assembly of partially hydrolysedlactalbumin, a kda milk protein. there are several promising applications of these -lactalbumin tubes, in food, pharmacy and nanotechnology. we studied the mechanism of self-assembly, the structure and the properties of the nanotubes. limited proteolysis of the -lactalbumin (by a serine protease) makes the molecule prone to self-assembly. in the presence of ca tubular structures are formed. other divalent ions like mn and zn can also induce tubular self-assembly, while mg leads to random aggregation. light scattering showed that the self-assembly is reversible, which is of relevance for controlled release applications. on the other hand, we could also make stable tubes by cross linking, which would be a requisite for several other applications. from afm and saxs measurements, we obtained values for the outer diameter: nm; and the inner diameter: nm. afm and cryo-em revealed the helical structure of the tube wall; it is a right-handed helix. by performing nano indentations with afm we determined mechanical properties of the tubes. the tubes were shown to be relatively resilient upon small deformations; the elastic modulus is of the order of . gpa. targeted delivery of photosensitizers into the cancer cell nuclei enhances their cytotoxic ef cacy the search for new pharmaceuticals has raised interest in locallyacting drugs which act over short distances within the cell, and for which different cell compartments have different sensitivities, e.g. photosensitizers used in anticancer therapy should be transported to the most sensitive subcellular compartments where their action is most pronounced. earlier we have produced a number of modular recombinant transporters for locally-acting drugs comprising several functional modules for cell-speci c targeting, internalization, escape from intracellular acidic vesicle, and targeting to the nuclei of melanoma cells overexpressing melanocortin receptors. here we describe new transporters on the basis of epidermal growth factor which are speci c for a wide variety of cancers. these transporters possess all necessary functional activities and deliver photosensitizers into the nuclei of human carcinoma cells to result in photocytotoxic effects almost orders of magnitude greater than those of nonmodi ed photosensitizers. characterization of mixtures of dna and nonionic polymeric agents for gene delivery in muscle j. m. gau , j. lal , l. auvray laboratoire mpi -lrp, umr cnrs , université d´Évry val d´essonne, Évry cedex, france, argonne national laboratory, ipns, south cass avenue, illinois , usa a strategy to cure muscle disease is to introduce genes (dna) into the muscle cell to correct or to add genes. nonionic polymeric agents have emerged as an ef cient vector to deliver dna in the muscle. these polymers protect dna from extracellular nuclease degradation by allowing the dna diffusion throughout the muscle tissue. there is at present no understanding about how nonionic polymers enhance transfection in the muscle. the kind of interactions between these nonionic agents and dna, dna-nonionic polymeric agent mixtures and cell membrane are currently unknown. also the structure of dna-nonionic polymeric agent mixtures is not yet well de ned. more information is needed to improve this delivery system. neutron scattering (contrast variation) and light scattering were used to investigate the interaction between: dna and nonionic polymers (pvp, di-and triblock copolymers). furthermore, electrical measurements with the same polymer complexes and black lipid membrane were also performed. depending on the polymer type there is either direct interaction with dna or in other cases polymers exhibit strong interaction with the lipid membrane. an explanation for transfection ef ciency of these nonionic agents in gene delivery to muscle will be given. high throughput in-silico screening against exible protein receptors b virtual screening of chemical databases to targets of known threedimensional structure is developing into an increasingly reliable method for nding new lead candidates in drug development. based on the stochastic tunneling method (stun) we have developed flexscreen, a novel strategy for high-throughput in-silico screening of large ligand databases. each ligand of the database is docked against the receptor using an all-atom representation of both ligand and receptor. in the docking process both ligand and receptor can change their conformation. the ligands with the best evaluated af nity are selected as lead candidates for drug development. using the thymidine kinase inhibitors as a prototypical example we documented the shortcomings of rigid receptor screens in a realistic system. we demonstrate a gain in both overall binding energy and overall rank of the known substrates when two screens with a rigid and exible (up to sidechain dihedral angles) receptor are compared. we note that the stun suffers only a comparatively small loss of ef ciency when an increasing number of receptor degrees of freedom is considered. flexscreen thus offers a viable compromise between docking exibility and computational ef ciency to perform fully automated database screens on hundreds of thousands of ligands. maturation and inhibitor design of sars-cov cl protease based on a product-bound crystal structure severe acute respiratory syndrome (sars) is an emerging infectious disease caused by a novel human coronavirus. here we report that the cl pro containing n-and/or c-terminal additional in-frame sequences underwent autoactivation to cleave the tags and yielded the mature protease in vitro. the -d structure of the c a mutant protease shows that the active site of one protomer of the dimeric protease is bound with the c-terminal six amino acids of the protomer in another asymmetric unit, suggesting a possible mechanism for maturation. the crystal structure of this product-bound form shows that the active site has a p pocket that binds the gln side chain speci cally. in addition, the p and p sites are clustered together to accommodate large hydrophobic side chains. the tagged c a mutant protein served as a substrate for the wildtype protease and the n-terminus was rst digested ( -fold faster) followed by the c-terminal cleavage as shown by the sds-page analysis. the analysis of t analytical ultracentrifuge experiments reveals the remarkably tighter dimer formation for the mature enzyme (k d = . nm) than for the mutant (c a) containing the n-terminal (k d = . nm) or the c-terminal extra amino acids (k d = . nm). taken together, the study here provides insights to the design of our new structure-based inhibitors. nevertheless, a signi cant proportion of patients do not respond to this therapy, and adverse effects are common. here we report the delivery and expression of recombinant mycobacterial dna vaccines in vivo and demonstrate the ability of multicomponent dna vaccines to enhance th -polarized immune responses. splenocytes from immunized groups of mice were re-stimulated in vitro and examined for cytotoxicity against bladder tumour cells. we used four combined recombinant bcg dna vaccines (multi-rbcg) for electroporative immunotherapy in vivo, and found that tumour growth was signi cantly inhibited and mouse survival was prolonged. increased immune cell in ltration and induction of apoptosis were noted after treatment with multi-rbcg alone, with the interleukin- (il- ) vaccine alone, and-most signi cantly-with their combinations. thus, electroporation immunogene therapy using multi-rbcg plus il- may be an attractive regimen for the treatment of bladder cancer. this approach presents new possibilities for the treatment of bladder cancer using recombinant bcg dna vaccines and il- dna vaccine. the cell-penetrating peptide (cpp) pep- is capable of introducing large proteins into different cell lines, maintaining their biological activity. two mechanisms have been proposed to explain the entrance of other cpps in cells, endosomal-dependent and independent. we evaluated the molecular mechanisms of pep- mediated cellular uptake of -galactosidase ( -gal) from e. coli, in large unilamellar vesicles (luv) and hela cells. fluorescence spectroscopy and immuno uorescence microscopy were used to study the translocation. internalization of -gal into luv and protein functionality in hela cells were detected by enzymatic activity. -gal translocated into luv in a transmembrane potentialdependent manner. likewise, -gal incorporation was extensively decreased in depolarized cells. furthermore, -gal uptake efciency and kinetics were temperature-independent and -gal did not co-localize with endosomes, lysosomes or caveosomes. therefore, -gal translocation was not associated with the endosomal pathway moreover transmembrane pores were not detected. these results indicated that the protein uptake in vitro and in vivo was mainly, if not solely, dependent on a physical mechanism governed by electrostatic interactions between pep- (positively-charged) and membranes (negatively-charged). peptide the dramatic acceleration in the identi cation of new nucleic acidbased therapeutic molecules has provided new perspectives in pharmaceutical research. however, the development of nucleic acidand peptide-based therapeutics is limited by their poor cellular uptake and traf cking. with the aim of addressing these issues, we have designed a family of short amphipatic peptides for delivery of nucleic acids (mpg) and of peptide/pna (pep). these carriers consist of a hydrophobic moiety and a nls-derived hydrophilic domain. they form stable non-covalent complexes with peptides, proteins, sirna or pna without any requirement for prior covalent cross-linking. both mpg and pep carriers enter cells rapidly, in a process involving membrane disorganization, independently of the endosomal pathway. mpg ef ciently delivers short odns and sirna into a wide variety of mammalian cell lines, without interfering with their biological function. pep signi cantly improves delivery of pna and peptides. both carriers were used for the delivery of sirna or antisense pna targeting the cell cycle regulatory protein cyclin b in an animal model and were found to block tumor growth upon intravenous injection. we believe that mpg and pepbased technologies will contribute signi cantly to the development of basic and therapeutic applications. probing the bound conformation of acetylcholinesterase (ache) inhibitor at the binding site c. g. kim, x. zhao, s. goodall, a. watts department of biochemistry, university of oxford, south parks road, oxford, ox qu, uk acetylcholinesterases are the enzymes which preferentially hydrolyze acetyl esters (such as ach or acetyl-â-methylcholine), containing amino acid residues in the eeache form and arranged as a -stranded â-sheet surrounded by á-helices. the protein is ellipsoidal in shape, with approximate dimensions of Å by Å by Å. inhibitors of acetylcholinesterase are of commercial and medical interest as pesticides and as therapeutics in the treatment of alzheimer's disease. an understanding of the conformation of inhibitors in the binding site enables the rational design of novel inhibitors with increased potency and speci city. interaction between the ligand, amino- -methyl- -( tri uoroacetylbenzyl-oxymethyl)quinoline (r ), and ache inhibitor has been studied by advanced solid-state nmr through double-quantum chemical shift and distance measurements. combining solid-state nmr data and docking simulations, conformation of the ache inhibitor at the active site has been predicted. in vivo, heat shock proteins (hsps) being stress-inducible chaperones can attenuate detrimental consequences of ischemic insults, inammation, neurodegenerative diseases, etc. also, intracellular accumulation and chaperone activities of some hsps may contribute to improved cell survival following uv or ionizing radiation. in models of pathological states and their treatment, we used special virusbased vectors for overexpression of hsp or hsp in cell cultures to confer cytoprotection under simulated ischemia/reperfusion. in parallel, similar cytoprotection was achieved after pretreatments of the cells with a pharmacological hsp inducer, geranylgeranylacetone. the cytoprotective effects were manifested in the lesser extent of oxidative modi cation and aggregation of cellular proteins, better preservation of the cytoskeleton, faster restoration of energy metabolism and the improved post-stress cell survival. in the other model, we treated normal and tumor cells with an inhibitor of the chaperone activity of hsp , geldanamycin. only the drug-treated tumor cells became more sensitive to gamma-irradiation; such results characterize this drug as a potentially selective radiosensitizer of tumors. taken together our data demonstrate promising approaches to clinically bene cial manipulating the levels of expression and/or chaperone activity of hsp(s) by means of gene therapy or pharmacotherapy. characterizaton of proteins from human pleural uid r. jain, s. kumar, n. singh, s. sharma, t. p. singh all india institute of medical sciences, new delhi ,india the samples of human pleural uid were obtained from both healthy subjects and patients infected by tuberculosis. after the preliminary processing these samples were run in independent lanes of sodium dodecyl sulphate-polyacrylamide gel electrophoresis (sds-page). the two lanes indicated variations in the intensities of a few bands and some new bands were also observed in the infected samples. these were characterized by determining their nterminal sequences. the new bands which had low density were carefully identi ed and cloned. some of the common bands that showed intensity variations were characterized. these were matrix metalloproteins, secretory phospholipasea , transferrin and ceruloplasmin. they were also studied with maldi-tof and their molecular weights have been determined. some of these proteins have been crystallized and their detailed crystal structure determinations are in progress. biophysical study of non-lethal stress response of cultured dc f cells stress factors may induce two kinds of responses in living cells: either cell death or adapting mechanisms. our aim was to search for non-lethal effects of various stress conditions on cultured hamster lung broblasts (dc f cells) as well as to assess the recovery time after stress removal. dc f cells were cultured in standard conditions and were submitted to stress, either by incubation with chemical substances (sodium arsenate, sodium nitroprusiate) and drugs (bleomicine and statins) either by irradiation (uv, he-ne laser). the doses and exposure times were chosen as to avoid cell death. after stress removal, cells were allowed to recover and the recovery time period was measured. structural and functional parameters were evaluated before and after stress, as well as during recovery. by now, experimental models for the in vitro study of non-lethal stress inducing factors have been set up. severe acute respiratory syndrome (sars) is an emerging infectious disease caused by a novel human coronavirus. the viral maturation requires a main protease ( cl pro ) to cleave the virus-encoded polyproteins. we report here that the cl pro containing n-and/or c-terminal additional in-frame sequences underwent autoactivation to cleave the tags and yielded the mature protease in vitro. the -d structure of the c a mutant protease shows that the active site of one protomer of the dimeric protease is bound with the cterminal six amino acids of the protomer in another asymmetric unit, suggesting a possible mechanism for maturation. the tagged c a mutant protein served as a substrate for the wild-type protease and the n-terminus was rst digested ( -fold faster) followed by the c-terminal cleavage as shown by the sds-page analysis. the analysis of the quaternary structures for the tagged and mature proteases by analytical ultracentrifuge experiments reveals the remarkably tighter dimer formation for the mature enzyme than for the mutant (c a) containing the n-terminal or the c-terminal extra amino acids. taken together, the study here provides insights to the design of our new structure-based inhibitors. characterisation of macromolecular transport in physiologically relevant mixed ecm based gels s. lelu, a. pluen school of pharmacy and pharmaceutical sciences, university of manchester (uk) the extracellular matrix (ecm) a complex gel made of hyaluronic acid, collagen and proteoglycans (pg) impedes the penetration of macromolecules especially in tumours, and may compromise the success of novel therapies. though recent in vivo investigations pointed out that, not only ha but brillar collagen its content and organisation and its interactions with pg were involved in macromolecular transport hindrance, transport mechanisms relating the macromolecular drug and these ecm components are unknown. in this study we seek to evaluate the determinants of passive transport mechanisms of biomacromolecules in complex gels made of ha, collagen using uorescence techniques (frap and confocal re ection microscopy (crm)) and rheology. focus was on conditions relevant to tumours and, initially, on low collagen and relatively high ha content. rheology experiments showed that mixed systems containing less than mg/ml of ha present higher elastic modulus ge than pure ha network or pure collagen gels. interestingly crm and frap studies revealed similarities for collagen and mixed gels: the organisation and spacing of the collagen bres did not change and the ratio of the diffusivities (d/d ) of dextran m and igg were not different but higher than those in ha networks. systems with higher collagen content are under investigation to complete the characterisation of transport. encapsulation of clone vector dna by cationic diblock copolymer vesicles for gene delivery a. v. korobko , j. r. van der maarel leiden university, the netherlands, national university of singapore, singapore we will discuss the design, control, and structural characterization of cationic copolymer vesicles loaded with dna. these vesicles serve as a model system for diverse applications such as gene delivery, micro-arraying techniques and packaging of dna in congested states. encapsulation of dna was achieved with a single emulsion technique. for this purpose, an aqueous puc or pegfp-n plasmid solution is emulsi ed in an organic solvent and stabilized by an amphiphilic diblock copolymer. the neutral block of the copolymer forms an interfacial brush, whereas the cationic block complexes with dna. a subsequent change of the quality of the organic solvent results in a collapse of the brush and the formation of a capsule. the capsules are subsequently dispersed in aqueous medium to form vesicles and stabilized with an osmotic agent in the external phase. inside the vesicles, the dna is compacted in a liquid-crystalline fashion as shown by the appearance of birefringent textures under crossed polarisers and the increase in uorescence of labeled dna. the compaction ef ciency and the size distribution of the vesicles were determined by light and electron microscopy, respectively, and the integrity of the dna after encapsulation and subsequent release was con rmed by gel electrophoresis. we demonstrate the gene transfer ability of this new carrier system by the transfection of encapsulated pegfp plasmid into hela cancer cells. cellular transduction of nucleotide kinases to improve the activation of nucleoside analog prodrugs m. konrad , c. monnerjahn , s. ort , a. lavie max-planck-institute for biophysical chemistry, goettingen, germany, university of illinois at chicago, chicago, usa the objective of our study is to improve therapeutic enzymeprodrug systems by generating catalytically superior nucleoside and nucleotide kinases that are essential for activation of nucleoside analogs. compounds, such as azt for the treatment of hiv infections, acv and gcv used against herpes virus, or the anticancer compounds arac and gemcitabine, can enter cells only in the unphosphorylated state (prodrug) and need to be transformed by different kinases to their pharmacologically active triphosphate state that interferes with dna replication. we have rst designed mutants of the human tmp kinase (htmpk) that phosphorylate aztmp up to -fold faster than wildtype. expression of this enzyme in human cells leads to -fold higher intracellular concentrations of azttp and to enhanced hiv inhibition. second, the prodrugs acv and gcv are not phosphorylated by human kinases, but are converted to their monophosphate forms by hsv -tk which is used in enzyme/prodrug-dependent cancer suicide gene therapy. we generated enzyme variants which show selective and ef cient phosphorylation of gcv. third, an engineered human dck variant catalyzes more ef ciently the activation of the prodrugs arac and gemcitabine. thus, the concept of a gene (or enzyme) therapeutic treatment involving expression (or direct intracellular transduction) of a catalytically improved human enzyme may pave the way to the development of novel strategies in nucleoside prodrug-dependent cancer chemotherapy. docking-molecular dynamics studies on the peroxidase site of prostaglandin endoperoxide h synthase prostaglandin endoperoxide h synthases- and (pghs- and ) catalyze the rst step in the biosynthesis of prostaglandins, prostacyclins and thromboxanes. arachidonic acid is transformed into prostaglandin g (pgg ) at the cyclooxygenase site of the enzyme and the -hydroperoxide oxygen-oxygen bond of pgg is subsequently cleaved by reaction with haem at the distinct peroxidase site (pox) to produce prostaglandin h (pgh ). herein we present a plausible productive conformation obtained by docking calculations for the binding of pgg to the pox site of pghs- . the enzyme-substrate complex stability was veri ed by a -ps molecular dynamics simulation. structural analysis unveils the requirements for enzyme-substrate recognition and binding: the pgg -hydroperoxide group is in the proximity of the haem iron and participates in a hydrogen bond network with the invariant his and gln and a water molecule, whereas the carboxylate group establishes salt bridges with the remote lysines and . the interaction of the peptide lah with anionic lipids during dna/rna delivery to eukaryotic cells a. j. mason , a. martinez , c. leborgne , a. kichler , b. bechinger faculté de chimie, université louis pasteur, strasbourg, france, généthon, evry, france the histidine rich amphipathic peptide lah has antibiotic and dna delivery capabilities. the peptide has a strong af nity for anionic lipids found in the outer membrane of bacterial membranes and has shown evidence of higher transfection activity against transformed over healthy tissue in culture. it has been proposed that anionic lipids can ip-op to reach the cytoplasmic monolayer. here they neutralise the cationic transfection complexes thereby causing release of oligonucleotides into the cytoplasm. we were, therefore, particularly interested to test for the role of the acidic lipid phosphatidylserine (ps) in mediating lah -mediated delivery of dna ef ciency. to understand the potential peptide-lipid interactions in more detail, solid-state nmr experiments on model membranes have been performed. p mas nmr on mixed phosphatidylcholine (pc)/ps and pc/phosphatidylglycerol (pg) membranes has been used to investigate speci c lah interactions with anionic lipids. by using deuterated lipids and wide-line h nmr when probing lipid chain order, it is demonstrated that lah preferentially interacts with ps over pc. lah thereby effectively disorders the anionic ps lipid fatty acyl chains. the lipid chain destabilising effect of lah and also lah analogues can then be compared with their transfection ef ciency for dna or sirna in cell culture to aid in rational peptide vector design. virtual screening is now widely accepted as a basis for drug discovery thanks to signi cant improvement and good hit rates [ ]. however, it is still highly cpu-consuming. at the same time, the number of protein-ligand complexes described at the atomic level is rising and the sequence similarity is used for structure and function predictions. new approaches are being developed to take advantage of the available structural data and the huge number of protein sequences in order to allow better tuned virtual screening. new web servers are being built to ease and to speed up the whole process (http://abcis.cbs.cnrs.fr/kindock/). integrating these servers into a pipeline dedicated to molecular modelling (http://abcis.cbs.cnrs.fr/atome/) shall allow both the re ned validation of modelled active sites as well as the oriented screening for the primary caracterization of potential ligands. an ideal drug delivery system should own the following characteristics, the rst is the targeting of therapeutic agent to the speci c site of their action. the second is the controlled delivery of a therapeutic molecule or protein in a pulsatile or staggered fashion. the third is the achieving sustained zero-order release of a therapeutic agent over a prolonged period of time. in this study, a new drug delivery system combined these characteristics was provided, which contains azobenzene derivatives (ab lipid) as an on-off switch incorporated into liposomes. the drastic release of calcein was observed on the rst uv irradiation of ab lipid to the cis isomer, while a suppressed release was observed when irradiated with the rst visible light. after that, the slope of release pro le became coincident. furthermore, calcein release was greatly increased after uv irradiation of ab lipid to the cis isomer and the drug release was greatly suppressed after vis irradiation of ab lipid to the trans isomer. we can control the release rate of calcein from ab lipid/egg pc mixed liposomes by uv or vis light irradiation. tryptophanase (trpase), a bacterial enzyme with no counterpart in eukaryotic cells, produces l-trp pyruvate ammonia and indole. it was suggested that indole is essential for bacteria multiplication and bio lm formation. bio lms destroy equipment and food and cause many illnesses. most synthesized quasi-substrates inhibit trpase at mm range. an optimal and speci c inhibitor of trpase may eliminate indole production and prevent bio lm formation. x-ray crystallography of the holo-wt e. coli trpase soaked with l-trp and the known mechanism of trpase activity should provide the information for the design and synthesis of active-sitespeci c quasi-analogs. utilizing the chromogenic substrate s-(onitrophenyl)-l-cysteine, the following michaelis-menten kinetics analyses determined the mode of trpase inhibition by trp and quinone based quasi-analogues and the corresponding ki values (in µm): dl- -alanyl- , anthraquinone, noncompetitively, ; trypthophan ethylester, competitively, ; acetyltryptophan, uncompetitively, . ; s-phenylbenzoquinone-l-tryptophan, uncompetitively, . plp-l-trp, inhibited irreversibly only the apo form of trpase and may serve for structure-determination purposes. further attempts are being made to synthesize improved trpase inhibitors, i.e., in the nm range. polyelectrolyte multilayer lms (pem) adsorbed on biomaterial surfaces are a new way to create a controlled release system. using biodegradable polymers, the lms can be degraded in vivo and release active molecules. in this work, we demonstrate the possibility of tuning the degradability of polysaccharide pem in vitro and in vivo. chitosan and hyaluronan pem (chi/ha) were either native or cross-linked (cl) using a water soluble carbodiimide (edc) at various concentrations in combination with nhydroxysulfosuccinimide. the in vitro degradation of the lms in contact with enzymes was followed by quartz crystal microbalance measurements and confocal laser scanning microscopy after lm labeling with chi fitc . whereas the native lms were subjected to degradation, the cl lms were more resistant to enzymatic degradation. films made of chitosan of medium molecular weight were indeed more resistant than lms made of chitosan-oligosaccharides. in addition, macrophages could degrade all types of lms and internalize the chitosan in vitro. the native lms implanted in vivo in mouse peritoneal cavity for a week showed an almost complete degradation whereas the cl lms were only partially degraded. these results suggests that the polysaccharides pem are of potential interest for in vivo applications as biodegradable coatings and that degradation can be tuned by controlling lm cross-linking. membrane electroporation -tool for therapeutic electrotransfer of drugs and gene dna e. neumann, s. kakorin physical and biophysical chemistry, faculty of chemistry, university of bielefeld, germany membrane electroporation (mep) is a new electrical high voltage scalpel, transiently opening the cell membranes of tissue for the penetration of foreign substances. due to the enormous complexity of cellular membranes, many fundamental problems of mep have to be studied at rst on model systems, such as the curved bilayer membranes of unilamellar lipid vesicles. electrooptical and conductometrical data of unilamellar liposomes indicate that electric eld pulses cause not only the formation of membrane electropores but also shape deformation of the liposomes, both processes mutually affecting each other. the primary eld effects of mep and cell deformation can trigger a cascade of numerous secondary phenomena, such as pore percolation and transport of small and large molecules across the electroporated membrane. the chemical mep theory represents a molecular physico-chemical approach to electrochemomechanical pore formation, yielding transport parameters, such as permeation coef cients, pore fractions and pore sizes. the pore concept is successfully applied to rationalize optimization strategies for biotechnological and medical applications of mep. in silico elucidation of xenobiotic processing loops k. nakata , y. tanaka , t. nakano , t. ishikawa , h. tanaka , t. kaminuma national institute of health sciences, tokyo, japan, tokyo medical and dental university, tokyo, japan, tokyo institute of technology, yokohama, japan, hiroshima university, hiroshima, japan one of the important challenges for drug designers is to predict and analyze how drugs are absorbed, distributed, metabolized and excreted (adme) in the body. these processes highly correlated with toxicity of drugs and are actively studied in pharmacology. two classes of proteins, the drug metabolizing enzymes such as cytochrome p s (cyps) and transporters, are the target of such adme/tox research. it was relatively recent that these two classes of proteins are synthesized by the genes that are the target genes of the nuclear receptors. nuclear receptors are ligand-activated transcription factors that form a superfamily. in case of humans there are nuclear receptors almost half of whose ligands are identied, leaving some as true orphans. thus it was now recognized that these nuclear receptors play the role of sensors of drugs and other xenobiotic substances including environmental chemical pollutants and nutritional ingredients, while the drug metabolizing enzymes and the transporters are the processors which carry the actual cleaning jobs. we have started to elucidate the feedback loops that are formed by the xenobiotic ligands, nuclear receptors, their target genes, their product proteins, and their feedback actions on the ligands. the work is being carried out on our background database on the ligands and their receptors called kibank, and search programs for target genes of nuclear receptors algorithmically. the most recent results will be presented at the presentation. the major route for drug entry into cells is permeation across lipid bilayers. due to methodological limitations there are only few studies on permeation of drug-like molecules across lipid bilayers. an assay developed in our lab allows the direct measurement of lipid bilayer permeation of aromatic carboxylic acids (acas). tb , which forms a uorescent complex with acas, is entrapped in liposomes and aca entry is determined from luminescence increase. lipid bilayer permeation was ph-dependent, following a henderson-hasselbalch function with a plateau for the neutral and the anionic species, respectively. in contrast to the expectations of the ph-partition hypothesis, permeation of the anionic species was only to magnitudes lower than that of the neutral species, leading to anion-controlled permeation at ph . , independently of bilayer state and lipid composition. permeation across bilayers with a biologically relevant lipid composition was signi cantly slower than across egg-phosphatidylcholine membranes. the in uence of single lipids, such as cholesterol, was dependent on the structure and ionization state of the permeant. permeation coef cients of the neutral species correlated better with the polar surface area (psa) than logp oct , therefore psa is a better predictor for bilayer permeation of the neutral species of small acas than logp oct . interplay between polymerized liposomes physicochemical properties and composition and citotoxicity this study was aimed at investigating whether there is an interplay between diacetylenic polymerized liposomes physicochemical properties and lipid composition affecting citotoxicity in vitro. unsaturated , -bis( , -tricosadiynoyl)-sn-glycero- -phosphocholine with saturated , -dimiristoyl-sn-glycero- phosphocholine in molar ratio : , were combined to give a chemically modi ed membrane by uv-polymerization. biophysical characterization was carried out determining the hydrophobic factor and hydrodynamic radius. citotoxicity was evaluated through haemolytic capacity on bovine red blood cells and indirectly by capacity of induction of lipid peroxidation on microsomes or mitochondrial membranes. the haemolysis percentage in presence of dc , pc/dmpc is less than that induce by polymers used in dentistry. the data obtain suggests that the polymerized lipids can not induce lipid peroxidation on natural membranes. the polymerized diacetylenic liposomes showed less interaction with serum proteins than non polymerized and lower citotoxicity as compared with natural lipids. also cell viability was determined in cell line nih t after exposure to lipids systems under study. the hydrophobic factor showed further augmentation for polymerized liposomes and is discussed in relation to in vitro stability. the above results suggest that polymerized and non-polymerized liposomes would serve as an effective delivery vehicle. s. sonar, s. d´souza, k. p. mishra radiation biology and health sciences division,bhabha atomic research centre,mumbai- , india liposomes offer new approaches for drug delivery through their encapsulation to alter pharmacodynamic properties of loaded drug leading to reduction in toxicities and/ or improved ef cacy. for prolonged systemic circulation, the liposomes size has been shown to be limited to nm or less. the ethanol injection method is an excellent technique for the formation of liposomes of < nm without the need of sonication or extrusion. the present study was aimed to produce liposomes encapsulating doxorubicin in minimum procedural steps. liposomes were prepared using distearoyl phosphatidylcholine and cholesterol, distearoyl phosphatidylcholine, cholesterol and oleic acid. the effects of different operational conditions for vesicle production and drug encapsulation were evaluated, with a view to achieve process cost to a minimum, suitable size and high encapsulation ef ciency. although high ef ciency of doxorubicin encapsulation was obtained by 'active' or 'remote' loading process in dspc/chol system, it was poor in one-step injection method. oleic acid was included to cut down the active loading by ph-gradient. dspc/chol/oa systems spontaneously loaded doxorubicin with encapsulation ef ciency of % and nal drug to lipid molar ratio upto . . the mean diameter of the vesicles was + nm. the method offers liposomes of small size with high loading. design of peptides with consecutive dehydro phenylalanine residues r in order to develop general rules for the design of peptide conformations with consecutive alpha, beta-dehydro phenylalanine-residues, peptides were synthesized, crystallized and crystal structures and molecular conformations were determined. following conclusions were drawn based on the structural data: -peptide unit sequences with two consecutive dehydro-phe residues at (i+ ) and (i+ ) positions adopt an unfolded s -shaped structure with dihedral angles phi-psi centred at , . -the peptides containing two consecutive dehydro-phe residues at (i+ ) and (i+ ) positions • form two overlapping type iii beta -turns (incipient -helix). • with branched beta -carbon residue only at (i+ ) position adopt a conformation with two overlapping types ii and iii beta -turns. • with branched beta-carbon residues such as val and ile at both (i+ ) and (i+ ) positions form two overlapping types ii and i beta -turns. the consistency in the formations of these conformations makes the design of peptides with alpha,beta ?-dehydro -residues a useful and highly predictable method for developing speci c ligands for various biological applications including drug design. binding of cationic porphyrin to isolated double-stranded dna and nucleoprotein complex k. zupán , l. herényi , k. tóth , z. majer , g. csík institute of biophysics and radiation biology, semmelweis univ., budapest, hungary" biophysics of macromolecules, german cancer research center, heidelberg, germany, department of organic chemistry, eötvös loránd univ., budapest hungary the complexation of tetrakis( -n-methylpyridyl)porphyrin (tmpyp) with free and encapsidated dna of t bacteriophage was investigated. to identify binding modes and relative concentrations of bound tmpyp forms, the porphyrin absorption spectra at various base pair/porphyrin ratios were analyzed. spectral decomposition, uorescent lifetime, and circular dichroism measurements proved the presence of two main binding types of tmpyp, e. g., external binding and intercalation both in free and in encapsidated dna. tmpyp binding does not in uence the protein structure and/or the protein -dna interaction. concentrations of tmpyp species were determined by comprehensive spectroscopic methods. our results facilitate a qualitative analysis of tmpyp binding process at various experimental conditions. we analyzed the effect of base pair composition of dna, the presence of protein capsid and the composition of buffer solution on the binding process. protein crystallization and structural study of upa protease domain with active site serine mutation the urokinase (upa) system is composed of upa, its receptor (upar), and inhibitor (pai). it plays important role in various physiologic processes, including brinolysis, cell adhesion, and signal transduction, and has been recognized as a target for intervention in tumor growth and tumor metastasis. we constructed an active site mutant of upa protease domain ( - ) with three mutations (c a, n q, and s a) and expressed it as secreted protein in pichia pastoria with ppicza vector. the secreted mutant was captured from culture medium by a cation exchange column and then further puri ed on a gel ltration column. the puri ed mutant was then crystallized by sitting drop vapor diffusion method with several precipitant conditions: ( ) . - . m ammonium sulphate, - % peg , mm sodium citrate ph . or mm sodium phosphate ph . , . % sodium azide. ( ) . m ammonium sulphate, . m lithium sulphate, mm sodium acetate at ph . , ( ) . - . m sodium formate, mm sodium acetate at ph . . ( ) . - . m sodium chloride, mm sodium acetate at ph . . the crystals were of varying quality but generally diffracted from . Å- . Å with inhouse x-ray source. the structure of this upa mutant and its complex with various inhibitors will provide a platform for rational upa inhibitor design. abstract in this work, the linear interaction energy (lie) method was used to calculate the binding free energies of hiv- integrase (in) and a series of dicaffeoyl -or digalloyl pyrroliding and furan derivatives inhibitors. the model of binding free energy prediction for homogeneous inhibitors of hiv- in has been obtained with a root-mean-square deviation (rmsd) of . kj/mol and estimated to be a precise model with good prediction capability. in addition, the probable binding mode of this series of inhibitors with hiv- in was proposed by using molecular docking and molecular dynamics (md) simulation methods. our results indicate that caffeoyl -or galloyl group of inhibitors have close interaction with a hiv- in conservative dde motif. a speci c non-competitive inhibitor of a small g protein/gef complex on the protective role of selenium and catechin in cadmium toxicity s. Özdemir , s. dursun , s. toplan , n. dariyerli , m. c. akyolcu istanbul university, cerrahpasa medical faculty, department of biophysics, turkey, istanbul university cerrahpasa medical faculty, department of physiology, turkey cadmium as heavy metal is toxic and carcinogenic for organisms. cadmium perform their effects on living organisms by accumulation in blood and various tissues. due to their accumulation in various tissues and in blood, tissue antioxidant enzyme systems are affected. the present study was planned to determine the possible protective roles of selenium and catechin against the toxic effects of considered heavy metals. the study has been performed in wistar albino type rats which divided into four groups as control and cadmium, cadmium+selenium, cadmium+catechin received groups. besides cadmium as heavy metal, selenium concentration determinations were performed in blood, liver and kidney tissues of each group of rats. in the same tissue samples besides lipid peroxidation measurements, glutathione, glutathione peroxidase and superokside dismutase enzyme activity determinations were also performed. the accumulation of heavy metals was determined in blood, liver and kidneys after cadmium administration during experimental period. in the tissue of experimental group animals there was an increased lipid peroxidation but decreased antioxidant enzyme activities were observed. while effects of selenium in decreased toxicity of cadmium have been detected, there was no statistically signicant effect of catechin observed. proposed that motors could dynamically cluster at the tip of tubes when they are individually attached to the membrane. we demonstrate, in a recently designed experimental system, the existence of an accumulation of motors allowing tube extraction. we determine the motor density along a tube by using uorescence intensity measurements. we also perform a theoretical analysis describing the dynamics of motors and tube growth. the only adjustable parameter is the motor binding rate onto microtubules, which we measure to be . +/- . s . in addition, we quantitatively determine, for a given membrane tension, the existence of a threshold in motor density on the vesicle above which nanotubes can be formed. we nd that the number of motors pulling a tube can range from four at threshold to a few tens away from it. the threshold in motor density (or in membrane tension at constant motor density) could be important for the understanding of membrane traf c regulation in cells. kinesin and dynein move a peroxisome in vivo: a tug-ofwar or coordinated movement? c. kural, p. r. selvin, k. hwajin, g. goshima, v. i. gelfand university of illinois at urbana-champaign, usa we have used fluorescence imaging with one nanometer accuracy (fiona) for analysis of organelle movement by conventional kinesin and cytoplasmic dynein in a cell. we can locate a green uorescence protein (gfp)-tagged peroxisome in cultured drosophila s cells to within . nanometer in . milliseconds, a -fold improvement in temporal resolution, suf cient to determine the average step size to be nanometers for both dynein and kinesin. furthermore, we nd that dynein and kinesin do not work against each other in vivo during peroxisome transport. rather, we nd that multiple kinesins or multiple dyneins work together, producing up to times the in vitro speed. engineering a bio-molecular walker h. jankevics, j. e. molloy division of physical biochemistry, mrc national institute of medical research, the ridgeway, mill hill nw aa, london, uk in this work we describe the design and development of a biomolecular walker based on the motile system found in certain ciliated protists. the motile system is driven by the binding of ca ions and in contrast to other commonly studied motor proteins is independent of atpase (amos et al., ) . the motor protein is a kda ca-binding protein called spasmin (maciejewski et al., ; itabashi et al., ) which belongs to the ef-hand family of calcium binding proteins called calmodulins. upon calcium binding, spasmin is thought to undergo a large conformational change as it binds its own target peptide and we wish to exploit this in order to create our own novel molecular walker.we have created a recombinant spasmin with sequence tags to enable speci c immobilization and various conjugate chemistries. cystein mutants have been introduced at speci c points in the protein to enable attachment of other small molecules, for example uorophores. we are now optimising protein expression and puri cation to maximise the yield of active protein on which we can perform the conjugate chemistries. we will characterize the structural changes using biophysical methods such as circular dichroism, analytical ultracentrifugation, electron microscopy, afm and total internal re ection uorescence spectroscopy on single molecules. the force generation in muscle arises from direct interaction of the two main protein components of the muscle, myosin and actin. the process is driven by the energy liberated from the hydrolysis of atp by myosin. the interaction is performed by cyclic interaction of myosin with atp and actin, and at least six intermediates are proposed for actomyosin atpase in solution. the powerful dsc technique allows the derivation of heat capacity of proteins as a function of temperature. from the deconvolution of the thermal unfolding patterns it is possible to characterize the structural domains of the motor protein. in this work we tried to approach the temperature-induced unfolding processes in different intermediate state of atp hydrolysis in striated muscle bres. we have extended the experiments to study the ber system prepared from psoas muscle of rabbit in rigor, strongly binding and weakly binding states of myosin to actin where the inorganic phosphate (p i ) was substituted by the phosphate analogue orthovanadate. the dsc transitions were analyzed in different buffer solutions (tris and mops) to get information about the temperature dependence of ph on the conformational changes. single kinesin motor proteins walking through the searchlight s. verbrugge, l. c. kapitein, e. j. peterman vrije universiteit, de boelelaan , hv, amsterdam, the netherlands the dimeric motor protein kinesin steps by a hand-over-hand mechanism. this means that the centre of mass moves with nm steps, while the two motor domains, the one after the other, move nm to the next binding site on the microtubule. the molecular details of what happens during a step are not fully understood, partly because of lack of time resolution in wide-eld, single-molecule uorescence experiments. we set out to develop an approach to study the motility of kinesin with a time resolution below a millisecond (a single step takes on the order of milliseconds). this approach allows us to look into the mechanochemistry and coupling of the two kinesin motor domains while they are stepping. our method is based on confocal microscopy and we study the uorescent properties of single labeled motors while they walk through the confocal laser spot. we present the experimental details of our approach and show our results on human kinesin constructs that are speci cally labeled in the tail. we show that our approach enables us to study the mechanism of kinesin with a much higher time resolution than what was achieved before with single-molecule uorescence experiments. movement of coupled single-headed kinesins analysed by a brownian-ratchet model . the analysis of this system is expected to provide insights into the mechanism underlying the motility of conventional double-headed kinesin, espetially the roles played by individual heads. we would like to clarify whether the experimentally observed behaviors that are supposed to be caused by a pair of single-headed kinesins can be explained by a simple brownianratchet model, which is successful in describing the motion of an unconventional single-headed kinesin kif a. our model consists of two brownian motors (ratchets) separated by a xed distance r. the velocity and other quantities of the coupled motors are calculated by solving the fokker-planck equation with various choices of r and other parameters. then, assuming a certain probability distribution of r associated with random attachment of kinesin heads on a bead in the experiment, the statistical properties of the motion of the coupled brownian motors are analysed. the force-velocity relation observed experimentally is found to be consistent with the present model with appropriate choices of the model parameters. the adequacy of the parameter choice needs to be con rmed by other experiments. optical trap with fast programmable feedback loop to study rotary molecular motors t. pilizota, f. bai, r. m. berry clarendon laboratory, univeristy of oxford an optical trap with back-focal plane detection and fast programmable feedback has been developed for the study of rotary molecular motors. a helium-neon laser ( nm) is used for position detection and a solid state bre laser ( nm, w cw) forms the trap. acousto-optic de ectors (aods) controlled by a digital signalling processing board are used to achive programmable feedback loops with exible control options and speeds up to khz. several modes of feedback are demonstrated, controlling both bead position (x,y) and angle (r, ). polystyrene beads or bead pairs can be held at set (x,y) or , and the set-point can be changed while the program is running. for example, feedback can be used to move a bead or a bead pair in a circle. results of using the system to study the bacterial agellar motor are presented. a dimeric -d lattice gas as model for molecular motors collective dynamics p. pierobon , t. franosch , e. frey ludwing-maximilian universitaet, münchen, germany, hahn meitner institut, berlin, germany, arnold sommerfeld center for theoretical physics, münchen, germany the transport of molecular motors along microtubules closely resemble the dynamics of a driven lattice gas of dimers without conservation of particles. the unidirectionality, asymmetry and stochasticity of the motion are encoded in the well studied totally asymmetric simple exclusion process (tasep). we extend the model to a more realistic one, including attachment and detachment kinetics and extended (dimeric) particles. we study the stationary phase diagram by means of monte carlo simulations combined with a continuum description (based on an extended mean eld theory). we also evaluate the domain wall theory nding out the effective potential con ning the phase interface into the bulk. a. e. wallin , j. lisal , r. tuma department of physical sciences, pobox , fin- , university of helsinki, finland, institute of biotechnology, university of helsinki, finland molecular motors often consist of two or more subunits that cooperate to convert chemical energy into mechanical motion. hexameric helicases and viral packaging atpases constitute a special class of molecular motors that translocate along nucleic acids. recent structural and spectroscopic characterization of these motors revealed that their enzymatic cooperativity does not result from cooperative binding [ - ]. in order to understand this new type of cooperativity we simulated the kinetics of a single hexameric motor by multiple coupled stochastic reactions using the gillespie algorithm [ ] . in contrast to analytical methods, our direct simulation allowed us to investigate the kinetics with an arbitrary model for cooperativity between the subunits. simulations on the kinetics of the hexameric rna packaging motor p from dsrna bacteriophage [ ] with different cooperativity mechanisms provided insight into the rnamediated cooperativity and yielded a sound theoretical basis for the interpretation of experimental results [ ]. the viral infectivity factor (vif) encoded by hiv- is a small basic protein that strongly modulates the viral replication and is required for pathogenicity. vif is packaged into hiv- particles through a strong interaction with genomic rna and is associated with viral nucleoprotein complexes. moreover vif acts during the early stages of the viral infection (capsid disassembly, reverse transcription) as well as during the late stages of virus replication (virus assembly and maturation of the virion). however the effect on early stages is probably a consequence of a defective assembly and / or virion maturation. understanding the rna-binding properties of vif would contribute to elucidate the role played by vif in the regulation of the genomic rna traf cking in the cytoplasm to unable ef cient packaging, and the prevention of cellular inhibitors from altering hiv- rna. in this context, we have characterised the interactions of recombinant vif with hiv- genomic rna by uorescence spectroscopy, and determined the af nity of the protein for synthetic rnas corresponding to various regions of hiv- genome. taken together our results demonstrate cooperative and speci c binding. in particular, we showed that vif has a high af nity for the 'untranslated region of hiv- genomic rna. s. bernacchi , e. ennifar , k. toth , p. walter , j. langowski , p. dumas cnrs upr -strasbourg (france), dkfz -heidelberg (germany) we have used the dimerization initiation site (dis) of hiv- genomic rna as a model to investigate hairpin-duplex interconversion by using a combination of uorescence, uv-melting, gel electrophoresis and x-ray crystallographic techniques. fluorescence studies with molecular beacons and crystallization experiments with -nucleotide dis fragments showed that the ratio of hairpin to duplex formed after annealing in water essentially depends on rna concentration, and not on cooling kinetics. with natural sequences able to form a loop-loop complex (or 'kissing complex'), concentrations as low as µm in strands are necessary to obtain a majority of the hairpin form. on the contrary, when kissing-complex formation was made impossible by mutation in the loop, a majority of hairpins was obtained even at µm in strands. this mutated sequence also showed that kissing-complex formation is not a prerequisite for an ef cient conversion to duplex in presence of salts. we proved that this happens through hairpins engaged in a cruciform intermediate, but not from free strands after hairpin melting. supporting this view, the very rst step of formation of such a cruciform intermediate could be trapped in a crystal structure. such a mechanism might be biologically signi cant beyond the strict eld of hiv- rna dynamics. generation of rna dimeric form of the human immunode ciency virus type (hiv- ) genome is important for the viral replication. the dimerization initiation site (dis) has been identi ed as a short sequence that can form a stem-loop structure with a selfcomplementary sequence in the loop and a bulge in the stem. a mer dis rna fragment, dis , spontaneously formed "loosedimer" and was converted into "tight-dimer" by supplement of nucleocapsid protein ncp . nmr chemical shift analysis for dis in the kissing-loop and extended-duplex dimers revealed that three dimensional structures of the stem-bulge-stem region were similar between the two types of dimers. therefore, we determined the solution structures of two shorter rna molecules corresponding to the loop-stem region and the stem-bulge-stem region of dis , and the solution structures of dis in the kissing-loop and extendedduplex dimers were determined by combining the parts of structures. the mechanism of conformational conversion will be discussed based on the solution structures and the molecular dynamics analysis. nmr and molecular modelling studies of an rna hairpin containing a g-rich hexaloop the mrna of the pgy /mdr gene encoding the transmembrane p-glycoprotein (p-gp) contains a hairpin that is the target of antisense oligonucleotides, suppressing the p-gp function of multidrug resistance. the solution conformation of this hairpin constituted by the '(gggaug) ' loop closed by a g-u mismatch containing stem is studied by nmr and molecular dynamics in explicit solvent. special attention is given on the sugar and the backbone conformations and the hexaloop intrinsic properties of these two components are carefully investigated. the stem structures obtained by molecular dynamics with and without nmr constraints converge to the same a-type double helix. the wobble g-u mismatch moderately perturbs the overall conformation, despite of c '-endo sugars and unusual backbone conformations located between the mismatch and the loop. in the hexaloop part, the sugar puckers are in majority in c '-endo conformations, probably to extend the strand with the help of unusual backbone angles conformations. the loop appears stabilized by one hydrogen bond and stacking interactions. thus, from the ' to the '-ends, the four purine bases ggga are stacked together, then a u-turn like is observed, and nally, u stacks on the last g that remains rather far from the stem. nmr and molecular modelling studies of an rna hairpin containing a g-rich hexaloop the mrna of the pgy /mdr gene encoding the transmembrane p-glycoprotein (p-gp) contains a hairpin that is the target of antisense oligonucleotides, suppressing the p-gp function of multidrug resistance. the solution conformation of this hairpin constituted by the '(gggaug) ' loop closed by a g-u mismatch containing stem is studied by nmr and molecular dynamics in explicit solvent. special attention is given on the sugar and the backbone conformations and the hexaloop intrinsic properties of these two components are carefully investigated. the stem structures obtained by molecular dynamics with and without nmr constraints converge to the same a-type double helix. the wobble g-u mismatch moderately perturbs the overall conformation, despite of c '-endo sugars and unusual backbone conformations located between the mismatch and the loop. in the hexaloop part, the sugar puckers are in majority in c '-endo conformations, probably to extend the strand with the help of unusual backbone angles conformations. the loop appears stabilized by one hydrogen bond and stacking interactions. thus, from the ' to the '-ends, the four purine bases ggga are stacked together, then a u-turn like is observed, and nally, u stacks on the last g that remains rather far from the stem. aminoglycoside binding to hiv- dis kissing-loop complex: from crystals to cells e. ennifar , j.-c. paillart , a. bodlenner , p. pale , r. marquet , p. dumas cnrs upr , strasbourg -france, cnrs/université louis pasteur strasbourg umr , strasbourg -france all retroviral genomes consist in two homologous single stranded rnas. dimerization is an essential step for viral replication. hiv- dimerization initiation site (dis) is a strongly conserved stem-loop in the ' leader region of the genomic rna. it was shown in vivo that alteration of the dis dramatically reduces viral infectivity. we have previously solved crystal structures of the dis kissing-loop complex. analysis of these structures revealed an unexpected resemblance between the dis kissing-loop and the s ribosomal aminoacyl-trna site (a-site), which is the target of aminoglycoside antibiotics. we have shown that some aminoglycosides specifically bind to the dis kissing-loop complex with an af nity and geometry similar to that observed in the a-site. in agreement with these previous results, we have now solved highresolution crystal structures of the dis kissing-loop complex bound to four aminoglycosides. these structures show that, as expected, two aminoglycosides are bound per kissing-loop complex. importantly, the binding is observed not only in vitro on large hiv- genomic rna fragments, but also on infected cells. moreover, we showed that some of these aminoglycosides stabilize the kissingloop rna dimer, which is consistent with the observation in crystal structures of numerous direct and water-mediated drug-rna contacts. these structures are currently used as starting points for designing potential new drugs targeted against the viral rna. modeling the long range entropy of rna: w. k. dawson , k. fujiwara , k. yamamoto , g. kawai chiba institute of technology, - - tsudanuma, narashino, chiba, japan, international medical center of japan, - - toyama, shinjuku-ku, tokyo, japan non-coding rna appears to make up a large part of the human genome. a reliable rna structure prediction program is needed to understand the structure of this non-coding rna. we recently developed a new way to model the long range entropy in rna and applied it to rna secondary structure prediction. in some of instances, the new approach is able to achieve far better predictions than the state of the art secondary structure programs even given exactly the same parameters. predictions using this method tend to show distributions that are funnel shaped. a new and important parameter in these calculations is the persistence length (a measure of the correlation and exibility of the rna). (url: http://www.rna.it-chiba.ac.jp/ vsfold/vsfold /) this new approach has now been extended to prediction of pseudoknots. the method is a heuristic wherein the hierarchical folding hypothesis is used to nd the pseudoknots as the rna secondary structure is folding, and corrections to that secondary structure are made to accommodate the pseudoknot. it is able to do these searches in roughly n^ time. the model is consistent with the hierarchical hypothesis and it is possible to estimate rna folding times that are of the correct order of magnitude using this model. with further adaptations to account for the size, shape and variablity of amino acid residues (hydrophobicity etc.), the model also appears to be transferable to protein folding problems. a. v. melkikh ural state technical university, ekaterinburg, russia a model of the genome as a gene network capable of receiving information about the environment and performing some operations on genes has been considered. the evolution rate of replicators for the mechanism of random mutations has been estimated. it was shown that the evolution rate under random actions is negligibly small for real dimensions of genomes of replicators [ ]. it was inferred that only a deterministic mechanism of the evolution can explain the known evolution rate of replicators. a deterministic model of the evolution has been proposed. the basic principles of this model include: ) information about the replicators evolution is encoded in the conformational states of proteins; ) the conformational language of proteins is translated into the language of nucleotide sequences during the evolution; ) the structure of genes is controlled such that the transition to a nearest free ecological niche takes a minimum time (at a preset restriction on the control). transfer rnas are synthesized as part of longer primary transcripts that require processing of both their ' and ' extremities in every living organism known. the ' side is matured by the quasiuniversally conserved endonucleolytic ribozyme, rnase p, while removal of the ' tails can be either exonucleolytic or endonucleolytic. the endonucleolytic pathway is catalysed by an enzyme known as rnase z. rnase z cleaves precursor trnas immediately after the discriminator base in most cases, yielding a trna primed for addition of the cca motif by nucleotidyl transferase. rnase z is found in the vast majority eukaryotes and archaea and in about half of the sequenced bacteria. it is often essential for growth and mutations in one of the two genes encoding rnase z (elac ) have been linked with prostate cancer in man. in this poster we present the crystal structure of bacillus subtilis rnase z at . Å resolution ( ) resolved by mad method and propose a model for trna recognition and cleavage. the structure explains the allosteric properties of the enzyme and also sheds light on the mechanisms of inhibition.it also highlights the extraordinary adaptability of the metallohydrolase domain of the b-lactamase family for the hydrolysis of covalent bonds. ( )i. most rnas undergo several steps of post-transcriptional modi cation before carrying out their assigned functions. one of the major modi cations is the splicing process, by which non-coding introns are removed from the coding exons. splicing can be performed by autocatalytic, self-splicing introns (e.g. group ii introns), i.e. catalytic rna or ribozymes. group ii introns, which occur in bacterial genomes and in organellar genes of plants, funghi and lower eukaryotes, consist of a conserved set of six domains. domain recognizes the '-exon through a - base pairing interaction formed by two regions within the intron (exon binding sites, ebs and ebs ) and the last - nucleotides of the '-exon (intron binding sites, ibs and ibs ). as the correct recognition of ibs by ebs is crucial for a successful splicing event we are investigating the structural and metal ion requirements of this part by various spectroscopic techniques, e.g. nmr. our data shows that the hairpin including ebs consists of a helical region followed by an unstructured single stranded part, which is ready for splice site recognition. the results of the structure analysis will be presented. financial support by boehringer ingelheim fonds (fellowship to d. k.) and the swiss national science foundation (snf-förderungsprofessur to r. k. o. s.) is gratefully acknowledged. structural basis for the antigene and antisense properties of modi ed dna:dna and rna:dna duplexes e. c. m. juan , t. kurihara , j. kondo , t. ito , y. ueno , a. matsuda , a. takenaka graduate school of bioscience and biotechnology, tokyo institute of technology, graduate school of pharmaceutical sciences, hokkaido university, faculty of engineering, gifu university oligonucleotides containing polyamines are currently being evaluated as potential antigene and antisense compounds. those with -(n-aminohexyl)carbamoyl- '-deoxyuridine ( n u) and its '-omethyl derivative ( n u m ) exhibit improved nuclease resistance and form stable duplexes with their dna and rna targets. x-ray structures of these duplexes have shown good correlation between the conformational changes and the observed chemotherapeutic properties. the amide groups of the modi ed uracil bases form six-membered rings through the intra-molecular nh-o hydrogen bonds, so that the aminohexyl chains protrude into the major grooves. some of the terminal ammonium groups are involved in intra-duplex interactions with phosphate oxygen anions, whereas the others interact with those of the adjacent duplex. such interactions contribute to the stability of duplex formation. the '-o-methyl modi cation in n u m shifts the ribose ring toward the c '-endo conformation and in uences duplex stability. observed changes in the dimensions of the minor grooves and in the hydration structures are also well correlated to nuclease resistance and duplex stability. group ii intron ribozymes catalyze selfsplicing in bacterial genomes as well as in organellar genes of lower eucaryotes. for correct structure and function these ribozymes need speci c concentrations of monovalent and divalent cations such as k and mg . most of these ions are used for charge screening, but some are also bound to distinct sites ful lling various speci c tasks. the conserved secondary structure of group ii intron ribozymes consists of six domains grouped around a central wheel. here the focus is set on domain (d ) of the yeast mitochondrial intron ai , a hairpin of nucleotides, which is crucial for catalytic activity. the -nucleotide bulge in d is known to be exible and acts as a metal ion binding platform. we have investigated the binding of different metal ions (mg , ca , mn , cd ) to this platform by uorescence spectroscopy. for this the bulge site adenosine in d was replaced by the uorescent nucleotide base analogue aminopurine ( ap). the binding data ts to an equation describing a binding to a single class of sites. the titration experiments not only reveal different dissociation constants for the tested metal ions but also indicate different effects on the bulge structure. financial support by the swiss national science foundation (snf-förderungsprofessur to r.k.o.s) is gratefully acknowledged. modi ed nucleosides and across the anticodon loop interactions in trna u. b. sonavane, k. d. sonawane, r. p. tiwari national chemical laboratory, pune , india in several interesting trna molecules, the ( th ) as well as the ( th ) nucleoside are hyper modi ed. as an example, unique hypermodi ed nucleosides mcm s u and ms t a are crucial in human trna lys , which acts as a primer in hiv replication. modi ed nucleosides may facilitate or hinder across the loop interactions. large substituents in th and th modi ed nucleosides if oriented suitably may also interact with each other. across the loop interactions may lead to unconventional anticodon loop structures also affecting exibility of the anticodon loop. this may restrict or enlarge synonymous codon choice and decoding during protein biosynthesis. except for trna asn (with interacting q and t a ), our studies show conventional 'open' loop structure -free of across the loop interactions, for a number of interesting trna anticodon loops with diverse hyper modi ed nucleosides at both of these locations. molecular dynamics simulations of hydrated anticodon arm of trna asn show persisting interaction involving the diol group of q and carbonyl group of ureido linkage in t a . additionally, the hoogsteen edge of th adenine base participates in hydrogen bonding with watson -crick edge of rd base and thus contributes to unique loop structure of trna asn . resulting suboptimal q:c base pairing leads to unbiased reading of u or c as the third codon letter. absence of queuosine modi cation, q happens to be also associated with uncontrolled rapid proliferation of cells and malignant growth. structural properties of ctg/cag repeats, and preliminary x-ray analysis of cug repeats y. sato, k. kimura, a. takénaka graduate school of bioscience and biotechnology, tokyo institute of technology, yokohama, japan. the human genome contains so many different types of repetitive sequences. some of them are tandem repeats of trinucleotides. their unusual expansions cause genetic diseases such as type myotonic dystrophy (dm ) and huntington's disease (hd), the unit sequences being ctg and cag, respectively. the numbers of repeats of the two complementary sequences change independently during dna replication or repair. the direct origin of dm is, however, the transcribed rna fragments with cug repeats, which forms a speci c structure and inhibits other protein syntheses. in the present study, structural versatilities of such dna and rna fragments has been examined. native pages of (cug) n show that the hairpin structure with even number is more stable than that with odd number. this difference might be ascribed to the structural difference at the hairpin head. the pages also show that duplex formation is dependent on coexisting cationic species and their concentration. crystal data of (cug) (a=b= . , c= . Å, and the space group r ) suggest that the asymmetric unit contains the rna fragment. an approximate crystal structure solved by molecular replacement techniques at . Å resolution shows that the rna fragments form a duplex similar to an a-form rna. context-dependent selection of promoter in a natural selection-type evolution reactor h. nagayasu, y. ageno, x. t. ma, y. husimi saitama university, saitama, japan using an isothermal ampli cation of hairpin dna/rna (developed by g.joyce), we drove a natural selection-type evolution reactor taking the speci c growth rate as the tness. we used hiv- rt and thermot rnap at c. starting from the random promoter pool, we selected the strongest promoter at c, which was separated by hamming distance from the strongest promoter at c. the latter was found to be identical to the natural t promoter. when we used a simple random pool, the selection process showed one-step convergence. when we used a random pool with a speci c short sequence at the upstream anking region of the random region, we observed an evolution process as convergencedivergence-convergence of the promoter sequence, driven by deletion of the speci c short sequence. this context-dependent selection was found to come through a neutral path, judged from the tness measurement. intronic sirna and mirna, and dna methylation gif sur yvette, france. abnormal activation of small g proteins is involved in several human diseases. small g proteins are activated by gdp/gtp exchange, which is stimulated by their guanine nucleotide exchange factors (gefs). thus, small g protein/gef complexes appear as emerging targets for interrupting signalling networks regulated by small g proteins in pathological contexts. the activation of small g proteins of the arf family is initiated by exchange factors which carry a sec catalytic domain stimulating the dissociation of the bound gdp nucleotide. the structure of the arf -gdp-arno reaction intermediate, trapped by a mutation of the catalytic glutamate, was recently solved by x-ray crystallography (pdb code: r s) acknowledgements: the work was supported by contract k- j. q. yin, f. chen, y. tan, t. gu institute of biophysics, chinese academic sciences, beijing, china sirna/mirna can ef ciently induce mrna cleavage or translational repression at the posttranscriptional level in a sequencespeci c manner. recently, it has been shown that these small rnas guide genome modi cation in mammalian cells. however, their ability to direct cognate dna methylation has been con rmed so far only in plants, and their biogenesis, functions, and modes of silencing genes are yet elusive. here, we report that small rnas derived from intron regions of some genes can target homologous dna sequence in promoter, 'utr or 'utr regions of genes in different human tumor cells. surprisingly, we also discovered that endogenous sirnas from introns of genes possessed a large number of target mrnas by using bioinformatics, and con rmed their existence in human cells with northern blot analysis. intronic small rnas generated by sliceosomes can form mature mirnas or sirnas through the processing of drosha and /or dicer. rt-pcr analysis indicated that vector-based small rna repressed expression of homologous genes at the transcriptional and/or translational levels. western blotting demonstrated that the expression of some proteins was greatly reduced or completely inhibited owing to promoter methylation. these ndings reveal that the expression of some genes can incredibly control cell activities at both protein and rna levels. our results also suggest that these small rnas may regulate gene expression in different modes of action. role of stacking in speci c recognition of capped rna by the cbc protein the stacking interaction involving -methylguanine moiety of mrna ' terminal cap (m g) and aromatic amino acid side chains is a common feature of all known cap-binding proteins. the crystal structures of the human cap binding complex (cbc) showed its induced folding upon m gpppg cap analogue binding. stabilization of the cbc-m gpppg complex by sandwich stacking of m g in between y and y is additionally enhanced by stacking of the second base of capped mrna with y . gibbs free energy of the association of various cbc mutants with the synthetic cap analogues, m gpppg, m gpppu, and m gtp, has been determined by uorimetric titration. preference of the wild type (wt) cbc for the dinucleotide analogues is also observed for the y a mutant, with the energy loss of . - . kcal/mol. however, all proteins with the mutated second stacking partner y prefer to bind m gtp compared with m gpppg. the binding of m gtp to the y a mutant is only . kcal/mol less favourable compared with the wt cbc. these divergences may be ascribed to smaller entropic costs of conformational rearrangement of cbc in the case of a smaller ligand, which can nd more favourable contacts when its second part is not ef ciently 'constrained' by stacking with y . supported by kbn p a annealing of the tar dna hairpin to a complementary tar rna hairpin, resulting in the formation of an extended duplex, is an essential step in the minus-strand transfer process of hiv- reverse transcription. in this work, we use gel-mobility-shift analysis to follow the kinetics of this reaction in the absence or presence of hiv- nc prepared by solid-phase peptide synthesis. to elucidate the reaction pathway, we use either the complete -nt tar hairpins or truncated -to -nt minihelices (mini-tar) derived from the top part (i.e. hairpin loop) of tar. assays were also carried out with mutant tar constructs. the annealing kinetics were studied systematically as a function of dna concentration and temperature. we show that the annealing initiates through a weak loop-loop kissing interaction, followed by a much slower conversion step, which results in formation of the extended duplex. nc facilitates both reaction steps, resulting in the overall -fold and -fold rate enhancement for mini-tar and tar annealing, respectively. we show that the kissing step is facilitated by the nc-induced nucleic acid aggregation, which is more pronounced for the longer tar hairpins. at the same time, the conversion steps in tar and mini-tar appear to be very similar and are similarly facilitated by nc - -fold. the later effect relays on the ability of nc to destabilize nucleic acid duplexes, and is equivalent to destabilization of a few base pairs required for the conversion initiation. linking of the n-terminus of a peptide to its encoding mrna s. ueno, h. arai, y. husimi department of functional materials science, saitama university, saitama, japanin evolutionary protein engineering, in vitro selection using a cellfree translation system has advantages of large library size and also of applicability to cytotoxic protein. although many in vitro protein selection techniques such as in vitro virus, ribosome display are developed, most of these are the techniques that link the genotype molecule to the peptide at its cterminus. we developed a method to link the genotype molecules to the nterminus of its encoding peptide. the mrna has dna-linker hybridize region, translation enhancer, initiation codon, single codon, amber stop codon, n-terminus sequence in gfp gene, his-tag and ochre stop codon. the mrna is also linked at '-terminus to sup trna via spacer and the speci c amino acid. the mrna is translated in the cell-free translation system. thus, c-terminus of the protein becomes free. moreover, in this system, the free protein of the full length is never generated. for this reason, the problem in the conventional technique, that is, the competition between the protein displayed on the genotype molecule and the free protein is eliminated. we succeeded to turn one round of the " life cycle " of the in vitro virus, judged by his-tag selection. key: cord- -ohdukhqt authors: patil, shital p.; goswami, ashutosh; kalia, kiran; kate, abhijeet s. title: plant-derived bioactive peptides: a treatment to cure diabetes date: - - journal: int j pept res ther doi: . /s - - -z sha: doc_id: cord_uid: ohdukhqt abstract: recent advances in analytical techniques have opened new opportunities for plant-based drug discovery in the field of peptide and proteins. enzymatic hydrolysis of plant parent proteins forms bioactive peptides which are explored in the treatment of various diseases. in this review, we will discuss the identified plant-based bioactive proteins and peptides and the in vitro, in vivo results for the treatment of diabetes. extraction, isolation, characterization and commercial utilization of plant proteins is a challenge for the pharmaceutical industry as plants contain several interfering secondary metabolites. the market of peptide drugs for the treatment of diabetes is growing at a fast rate. plant-based bioactive peptides might open up new opportunities to discover economic lead for the management of various diseases. graphic abstract: [image: see text] biomolecules play an important role in any ligand-target interaction and their recognition in any biological system starts the cascade of signalling steps, which is an important part of the normal body function. nucleic acids, carbohydrates, and lipids are the important ligands that bind and cause conformational changes in targets (receptors) but the messengers that release after binding and conformation changes are peptides. any defect in these signal transduction processes cause diseases (otvos and wade ; gautam ) . proteins, under the influence of proteolytic enzymes get fragmented at particular catalytic sites to form bioactive peptides. bioactive peptides are made up of - amino acids and lack protein-protein interaction because of their small size. they have properties like tissue affinity, specificity, and efficiency (moller et al. ) . the structure and the overall charge and hydrophobicity/hydrophilicity of the any bioactive peptide depends on the nature of amino acids, their sequences in the peptide backbone along with the n and c terminals. the relation between structure and biological activity of the bioactive peptides are yet not confirmed but the structure is an important arbitrator for their activity (li and yu ) . bioactive and nutraceutical peptides have a positive effect in the regulation of the health of an individual (kitts and weiler ) . they boost the quality of human health by preventing various ailments and by improving medical conditions related to lifestyle, metabolism, and immunity (vitetta et al. ) . peptide and protein drugs are available from the different sources for therapeutic uses, but as with many drugs, they have several advantages and disadvantages of their own. peptide drugs are effective and specific to their biological target but devoided of some of the essential qualities of a drug to withstand in the current market of therapeutics. proteins have a large chemical structure which makes their synthesis complex, tedious and expensive, the further low oral bioavailability of proteins limit their oral delivery hence they are administered through parenteral route (uhlig et al. ) . the worldwide peptide therapeutics market size valued at usd , . million in and assumes to grow at cagr of . % over the period to . ( ) . insulin is the first therapeutic peptide which has been used widely and shares good peptide market. it was first isolated by frederick banting and charles best from pancreatic islet extracts, further, j.b. collip developed the method for extraction and purification of insulin from other animal sources (quianzon and cheikh ) . eli lilly began producing insulin from the animal pancreas, later on, biosynthetic insulin was introduced in with the name humulin. novo nordisk, sanofi, and eli lilly together share . percent of the global insulin market. lantus (basal insulin) is a product of sanofi accounts for approximately . % of the total sales of top antidiabetic drug brands, in sale of lantus was usd . billion (dezzani ) . small molecules and peptide-based drug candidate belongs to the category of new chemical entities (nces) according to the food and drug administration, on the other hand, proteins used for the treatment of various diseases are classified under new biological entities (nbes). in recent years, therapeutic peptides are approved for the mitigation of various ailments mainly tumors, immunological disorders, blood disorders and metabolic disorders such as diabetes, obesity ( fig. ) (loganathan ) . the peptide drug, liraglutide (victoza) is a glucagon-like peptide- receptor (glp- r) agonist and used for the treatment of t dm has gained popularity as a top-selling drugs in recent past, which is marketed by novo nordisk (lund et al. ) . glatiramer acetate (copaxone), developed by teva is an immunomodulator and used to treat multiple sclerosis (duda et al. ) . leuprolide (lupron), developed by abbott is useful for the treatment of cancer and estrogendependent conditions that respond to hormone therapy (brito et al. ) . goserelin (zoladex) from astra zeneca is a gonadotropin-releasing hormone agonist (gnrh agonist) that suppresses the release of different sex hormones and is useful in the treatment of breast and prostate cancer (magon ) . other innovator companies include amgen, eli lilly, roche, and pfizer have peptide and protein-based drugs in different developmental stages will reach the market soon for the treatment of various complicated diseases (loganathan ) . till date, plant secondary metabolites, mainly small molecules are the established source of new drugs to treat various diseases (verpoorte ) , however, advancement in analytical technique, sophisticated purification methodology and in vitro assay system pointed out the researchers fig. uses of therapeutic peptides and proteins in various disease conditions (loganathan ) to look beyond the small molecules. identification of plant primary metabolites including proteins and peptides for the management of diseases have opened up a new horizon in the development of plant-based peptides as a drug candidate (otvos ) . this review summarizes various plant-based peptides reported for the treatment of diabetes, challenges with the development of peptide-based drugs and the future prospect of peptides as a drug. diabetes is a metabolic disorder and can hit anyone at different stages of life. though it is considered as one of the top health killers, its treatment is a huge challenge. diabetes can be characterized with symptoms of prolonged hyperglycemia, glucose intolerance, disturbance in the regulatory systems for storage and utilization of metabolic energy, including catabolism and anabolism of carbohydrate, lipids, and proteins in a diabetic individual due to lack of insulin production and impaired insulin receptor functioning or both (piero ; votey and peters ) . based on the causes and clinical survey, diabetes mellitus is categorized into four types as mentioned in table (deepthi et al. ) . ordinarily, the people below age suffer from the type diabetes, whereas type diabetes mainly occurs in adult and old age group. according to the world health organisation, approximately % of all diabetes cases are type , and the remaining % cases of diabetes worldwide are of type (baynes ) . diabetes is ruining the life of peoples worldwide. population suffering from diabetes is increasing and the imminence is expected to rise even more because of the current lifestyle issues. till the year , million people were affected by type diabetes and the number is expected to increase up to million people worldwide by (fig. ) . the data indicates that approximately % of diabetic patients will increase in the next two decades. the threat is every s a person dies with diabetes (roglic ) . from the statistics, it is clear that the population suffering from type diabetes is increasing and it requires more promising therapies. still, no promising drug is available which has the ability to completely cure type diabetes. generally, to treat type diabetes, oral hypoglycemic agents are used. the first line treatment is metformin, which is assisted with other hypoglycemic agents from different categories like thiazolidinediones, alpha-glucosidase inhibitors, sulphonylurea, dipeptidyl peptidase-iv (dpp-iv) inhibitors, sodium glucose cotransporter (sglt ) inhibitors. the risk associated with these therapies include hypoglycemia, weight gain, tiredness, diarrhea, and anemia risk, etc. over the period of time, these side effects lead to further complications for a visceral system like cardiovascular system and central nervous system (nathan et al. ). peptide drugs used for the treatment of diabetes are derived from different sources, like exendin- was initially isolated from the heloderma suspectum (gila monster) and its synthetic version had come to the market in the form of exenatide (aramadhaka et al. ) . liraglutide and semaglutide are structurally very close to glp- with improved half-life and resistant to dpp-iv mediated degradation (edmonds and price ) . nowadays, these peptides are being produced in large scale by using recombinant dna technology. the peptides which were obtained via synthesis or by using the recombinant technology are comparatively expensive than small molecules and are not affordable to most of the patients. the interest in health-promoting products from the natural origin and pharmaceutical formulations involving bioactive peptides are remarkably increasing (henninot et al. ) . numerous scientific reports have been published on bioactive peptides obtained from animal proteins. expedition by researchers in the field of bioactive peptides is of great interest as it opens the way to find economic lead from plants for the better care of diabetic patients (daliri et al. ). plentiful bioactive peptides have been reported from the animal and plant sources. most of the peptides that had bioactivity are being derived from animal products such as milk, eggs, meat, and fish. plant peptides are still under exploration and few bioactive peptides are reported from soy, wheat, etc (hartmann and meisel ) . in animals, proteins are associated with high-fat content and lead to diseases, including high blood pressure and heart diseases if consumed in the large amount. on the other hand, plant proteins neither have associated fat nor have any side effects (nehete et al. ) . previously, plant hormones were considered the only player through which cells communicate, but from the last decade as research was driven towards an understanding of plants signaling, secreted peptides, small rnas and transcription factors emerging as a new and well-defined player in the cell to the cell communication network (lindsey et al. ; van norman et al. ; murphy et al. ) . however, it is now clearly defined that signaling peptides of plant and animal origin are biosynthesized through the pathway that is evidently similar, although some aspects of the biosynthesis are still not identified. peptides secreted by plants have well recognized role in preliminary processes of development like the growth of meristem, organ shedding, cell elongation, cell multiplication, cell differentiation, geotropism and protection from the invader (ghorbani ) . the ubiquitous presences of proteins, peptides as mediator components convey to the proposals that these messenger peptides may have emerged evolutionary in microbes and were further integrated into the complicated mediator systems of complex organisms during evolution (roth et al. ) . existence of hormone-like peptides in plants including insulin-like peptide in spinacia oleracea l. (spinach) and lemna gibba g- , prolactin-like inhibitor in alfalfa, a substance with luteinizing hormone-releasing activity from leaves of avena sativa l. (oat) and somatostatin-like material in spinach support this assumption, it is speculated that the signaling peptides in plants existed from about billion years ago (collier et al. ; fukushima et al. ; leroith et al. ; morley et al. ) . plant proteins containing essential amino acids are important in the food chain to fulfill human physiological needs. the proteins from plants are derived from leaves, seeds, and fruits, out of which seeds are considered as an economical source of proteins. as compared to vegetables and fruits, seeds (legumes and cereals) contain a higher amount of protein (creighton ) . proteins in a mature seed represent - % in pisum sativum l. (pea), vicia faba l. (faba) bean and till - % in lupinus albus l. (lupin) and glycine max l. (soybean) (gueguen and cerletti ) . many peptides are reported from different plants for the treatment of diabetes as shown in tables and through various known targets such as (a) alpha-glucosidase inhibitors (b) alpha-amylase inhibitors (c) dipeptidyl peptidase-iv inhibitors (d) inhibitors of the glucose transporter system (e) insulin mimetics alpha-glucosidase inhibitors competitively inhibit small intestinal enzyme alpha-glucosidase, which converts nonabsorbable polysaccharides into absorbable monosaccharide, these effects together postpone and minimize the rise in postprandial plasma glucose level. inhibitors of this enzyme decrease the glucose toxicity (improves insulin sensitivity), decreases stress on beta cells (decreases post-meal hyperglycemia), increases glucagon like peptide- (glp- ) production hence increases insulin secretion (scheen ; mooradian and thurman ) . in general, normalization of postprandial hyperglycemia is difficult as compared to fasting hyperglycemia, inhibitors act specifically on the postprandial part of h blood glucose curve and causes a reduction in diabetes-associated hyperglycemia which is responsible for macrovascular complication in patients (mooradian and thurman ; ceriello et al. ). glycaemic control is achieved with these agents reduces glucotoxicity which increases insulin secretion from the beta cell (scheen ; salvatore and giugliano ) . all marketed alpha-glucosidase inhibitors are from natural origin, however, none of them belongs to peptide class. few research groups have reported peptides from plants having alpha-glucosidase inhibitory activity. peptides obtained from cannabis sativa l. (hemp) seeds contains hydrophobic amino acids (pro and leu), essential amino acids and branched chain amino acids in their structure and have reported to have enzyme inhibitory activity for this target (ren et al. ) . oat seed proteins hydrolysate obtained by protease (alcalase) digestion gives bioactive peptides which inhibits this enzyme. in vivo study also revealed that oat peptides, at a higher dose, showed the hypoglycaemic effect on stz-induced diabetic mice, reduces the food intake, stimulates insulin secretion, improves insulin sensitivity and elevate glycogenesis (zhang et al. ) . walnut hydrolyzed peptides (whps) from the fruit proteins of juglans mandshurica maxim. (walnut) showed anti-diabetic activity by inhibiting the enzyme. in vitro study of whps showed that peptides of molecular weight ( - kda) inhibit alpha-glucosidase with the inhibitory rate of ( . %) and they remarkably raise extracellular glucose consumption in insulin-resistant hepg cells. whereas, in vivo results showed that whps reduce . % of fasting blood glucose level by increasing insulin secretion, liver glucokinase, and glycogen level by . %, . %, and . % respectively (wang et al. ) . vigna angularis wild. (adzuki bean) proteins have reported enzyme inhibitory properties in mice model kk-ay of diabetes. the extract reduces the postprandial ( ) hook.f. in vitro and in vivo marella et al. ( ) cucurbita pepo l. in vitro (vaštag et al. ) lam. in vitro (gonzalez garza et al. ) l. in vitro arise ( ) theobroma cacao l. in vivo sarmadi et al. ( ) l. in vitro mojica et al. ( b) willd. in vitro nongonierma et al. ( ) l. in vivo and in vivo yibchok-anun et al. ( ) momordica charantia l. insulin mimetic in vivo (khanna et al. ) momordica charantia l. seed protein expressed in e.coli stimulated the phosphorylation of pdk and akt, enhanced expression of glut- , stimulated both the uptake of glucose in cells and the clearance of glucose in vitro and in vivo lo et al. ( ) hook.f. in vivo rajasekhar et al. ( ) hook.f. molina. duchesne. in vivo teugwa et al. ( ) zea mays l. enhance the secretion of glp- in vivo hira et al. ( ) glycine max l. increases the glucose uptake, enhance the expression of p-ir, p-irs , p-akt and membrane glut protein improve the insulin resistance hypoglycemic in vitro and in vivo lu et al. ( ) oryza sativa l. improve insulin resistance in vivo boonloh et al. ( ) roxb. in vivo poovitha et al. ( ) blood glucose in two sucrose challenge model i.e. normal and streptozocin-treated rats by . % and . % respectively. in vitro study of extruded adzuki bean proteins at mg/ml, concentration inhibits . % of rat intestinal alpha-glucosidase (yao et al. amylase (alpha- , -glucan- -glucanohydrolase) is an important enzyme that speeds up the hydrolysis of (alpha- , ) glycosidic linkage of carbohydrate and starch present in the diet and helps in absorption of non-absorbable polysaccharides after their conversion into absorbable monosaccharide (alagesan et al. ). inhibitors of amylase minimize the hydrolysis of (alpha- , ) glycosidic linkage and help in slow digestion of carbohydrate and thus delay the absorption of glucose and reduces the postprandial glucose level in blood (jayaraj et al. ). the pinto beans are a rich source of proteins and other nutrients but are not well explored for its therapeutic benefits. pinto bean protein fraction was digested enzymatically using protease (protamex) at ph . , and for the incubation period of -h with (e/s) ratio of : at °c, hydrolysed fraction was dialysed using molecular weight cutoffs and the fraction obtained with molecular weight < kda showed . ± . % of enzyme inhibition (ngoh and gan ) . similarly digestion with protease (bromelain) and dialysis with molecular weight cut-off < kda showed . ± . % of enzyme inhibition (oseguera-toledo et al. ) . the peptide obtained from cuminum cyminum l. (cumin) seeds, cumin seed peptide (csp ) had shown . % of alphaamylase inhibition (siow and gan, ) . in vitro study of triticum aestivum l. (wheat) albumin extract showed alphaamylase inhibition, wheat albumin restrained the peak of postprandial blood glucose levels in a dose-dependent manner. it showed the reduction in postprandial blood glucose by %, %, and % after administering . g, . g, and . g of wheat albumin respectively. in long-term administration study, . g of wheat albumin did not affect fasting blood glucose levels, while it reduces hemoglobin a c levels which reflects the metabolic control of individual suffering from diabetes (kodama et al. ) . a study revealed that different protein fractions isolated from the hordeum vulgare l. (barley) flour have been subjected to pancreatic hydrolysis and the obtained peptides have shown - % alpha-amylase inhibitory activity (alu'datt et al. ). cucurbitin protein was obtained from the cucurbita pepo l. (pumpkin) seeds, alcalase and pepsin hydrolysed fraction of it showed the enzyme inhibition (vaštag et al. ) . oligopeptides from the mulberry showed highest enzyme inhibition with ic . µg/ml. protein hydrolysates prepared from the seeds of moringa oleifera lam. by enzymes like trypsin, chymotrypsin and pepsin-trypsin for . and h. pepsin-trypsin digested fraction showed alpha-amylase inhibition and the ic was found to be . and . µg/ml for . h and h of hydrolysis respectively (gonzalez garza et al. ) . protein hydrolysate of citrullus lanatus l. (watermelon) seeds was prepared by using enzymes like pepsin, trypsin and alcalase where alcalase hydrolysate gives the highest enzyme inhibition and the ic was found to be . mg/ml (arise ) . autolysates of theobroma cacao l. fruits were prepared at ph . and . . autolysate at ph . showed highest inhibition of alpha-amylase as well as helps to release the insulin, in vivo study with autolysates showed decrease in the blood glucose level (sarmadi et al. ) . quinoa proteins were enzymatically hydrolyzed to obtain bioactive peptides, hydrolyzed fraction thus obtained showed . % inhibition of alpha-amylase at µm concentration (vilcacundo et al. ). two main gut-derived hormones glp- and glucosedependent insulinotropic polypeptide (gip), also known incretin hormones are responsible for the attainment of normoglycemia. they stimulate insulin secretion, decrease glucagon, inhibit gastric emptying, reduce food intake and appetite (drucker and nauck ) . in diabetic individuals, reduced secretion of incretin hormones have been observed. these gut hormones bind with their respective receptors and show their biological action (barnett ) . treatment of diabetes requires to restore either normal secretion or reduce the degradation of these hormones. dipeptidyl peptidase-iv (dpp-iv) is a protease enzyme that cleave the glp- hormone within minutes after its secretion and makes it inactive for biological function. thus, inhibition of dpp-iv has the potential to revert the hyperglycaemic condition. dpp-iv inhibitors block the rapid degradation of glp- and enhance the postprandial level of active glp- which further reduces the liver glucagon production and stimulate beta cells of the pancreas and increase insulin level (ahrén ) . dipeptides obtained from oryza sativa l. (rice) bran protein hydrolysates showed inhibition of dpp-iv enzyme and have ic . ± . mg/ml (hatanaka et al. ) . in another study, rice bran protein fractions have been defatted and hydrolyzed with umamizyme g and bioprase sp. the dipeptides obtained after digestion with umamizyme g showed inhibition of enzyme and the ic was found to be . ± . mg/ml (hatanaka et al. ) . the proteins extracted from amaranthus hypochondriacus l., have been subjected to simulated gastrointestinal digestion resulted into bioactive peptides. these peptides fraction have shown dose-dependent enzyme inhibition and have ic . mg/ml (velarde-salcedo et al. ) . glycine max l. (soybean) and lupinus albus l. (lupin) protein hydrolysate have been screened for the presence of bioactive peptides, and soy and lup were found to be efficient inhibitors of dpp-iv with ic values and μm respectively (lammi et al. ) . peptides obtained (cpgnk and ggglhk) from the protein digestion of common beans showed enzyme inhibitory activity and the ic (mg dw ml − ) for the pure peptides were found to be . ± . and . ± . respectively (mojica et al. b) . gastrointestinal digestion of proteins obtained from phalaris canariensis l. (canary) seed showed . % inhibition of dpp-iv (estrada-salas et al. ) . enzymatic digestion of quinoa proteins with papain showed enzyme inhibition and the ic for the hydrolysate-papain was found to be . ± . mg/ml (nongonierma et al. ) . study was performed on quinoa protein simulated duodenal digestion and the fraction (˃ kda) obtained after min of digestion showed highest dpp-iv inhibition and the ic (mg protein/ml) was found to be . ± . (vilcacundo et al. ). glucose, the metabolic fuel of any living cell, unable to cross lipid bilayer of the membrane due to its polar nature and membrane proteins (glucose transporter) associate with lipid bilayer helps glucose to travel across (abdul-ghani et al. ) . glut transporters facilitate the transport of glucose passively across the membrane along with its concentration gradient and do not require energy to operate, on the other hand sglt transporters actively transport glucose across the membrane against the concentration gradient thus these types of transporters require energy to operate. in order to receive energy for operation, sodium is transported along with its concentration or electrochemical gradient thus provide the needed amount of energy for the transportation of glucose across the membrane (musso et al. ; brown ) . in the human body, at least different glut transporters and sglt transporters exist which belongs to membrane class of proteins. excreted glucose is reabsorbed up to % in a convoluted segment of proximal tubule via high capacity, low-affinity sglt transporter while remaining % of glucose is reabsorbed in the distal segment of proximal tubule via high affinity, low capacity sglt transporter. in the case of diabetes, hyperglycemia causes the high load of glucose in order to compensate for this high load of glucose, an increase in expression of a glucose transporter gene in proximal tubule occurs (abdul-ghani et al. ) . plant peptides targeting glucose transporters are less explored. however, peptides having amino acid sequences aksplf, atnplf, feeln, and lsvsvl, isolated from the black beans have shown the ability to block the glucose transporters glut- and sglt- . the results of in vitro studies by using the caco- cell model have shown reduced glucose re-absorption by . % after h treatment of these bioactive peptides. the oral glucose tolerance test in rats has shown a . % decrease in postprandial glucose (p < . ) ( mg hydrolyzed protein isolate/kg bw) (mojica et al. a ). insulin hormone trigger several signaling events that control the metabolic fate of nutrients. insulin binds with alphasubunit of insulin receptor which causes the conformational changes in β-subunit of the insulin receptor, and these conformational changes in receptor activate the cytoplasmic tyrosine kinase domain. this activation further leads to the auto-phosphorylation which in turn trans-phosphorylate various intracellular substrate of the insulin receptor. these effects collectively control metabolic and mitogenic processes (nankar and doble, ) . tyrosine kinase domain activation by insulin mimetic, cause the auto-phosphorylation of receptor and trigger downstream signaling requires for metabolic activity of insulin. the other mechanism by which insulin mimetic (vanadate) works is by inhibiting the dephosphorylation of the insulin receptor via inhibition of tyrosine phosphatase (stapleton ) . the reports suggest that plant extracts of spinach and lemna gibba g- mimics like mammalian-insulin, which was confirmed by the radioimmunoassay. in an in vivo study on young rats indicated that the plant insulin-like material binds to insulin receptors on im- lymphocytes and stimulates glucose oxidation and lipogenesis in isolated adipocytes (collier et al. ) . vigna unguiculate l. (cowpea) containing bio-molecules have contributed in reducing the blood glucose level and enhancing the antioxidant status of patients. study on l rat skeletal muscle cells was performed where cells were treated with cowpea peptides at different doses ( . , , and ng) for h or insulin ( nm) for min. further, from the treated cells proteins were isolated for western blot analysis to check the phosphorylation of akt (a form of protein kinase b; pkb). the results of the study showed that the cowpea peptides have the ability to phosphorylated akt in the cell culture. this observation suggests that the treatment of cowpea peptides to the skeletal muscle cells can initiate the insulin signaling cascade. there is a possibility that cowpea peptides have the ability to mimic like insulin by inducing the same signaling cascade (uruakp, ) . linum usitatissimum l. (flax) seed peptides obtained from the protein hydrolysate ( kd) increases glucose uptake in l cells ). mc - - peptide fraction was obtained from momordica charantia l. showed hypoglycaemic effect in in vivo study. mc - - fraction decreases the blood glucose level by . % and . % in alloxan-induced diabetic mice with a time interval of and h respectively (yuan et al. ) . study on protein from the pulp of m. charantia l. showed increase glucose uptake in c c myocytes and t l adipocyte, the in vivo studies showed that the protein obtained from pulp helps in secretion of the insulin as well as act like insulin to lower the glucose level (yibchok-anun et al. ). polypeptide-p was isolated from the m. charantia l. acts like insulin mimetic (khanna et al. ) . another -residue insulin receptor (ir)-binding protein (mcirbp) was reported from the plant upon in vitro digestion yields mcirbp- , spanning residues - of mcirbp which increases the binding of insulin to its receptor, stimulate the phosphorylation of pdk and akt, induces the expression of glucose transporter , and stimulate both the uptake of glucose in cells and the clearance of glucose in diabetic mice (lo et al. ) . a novel protein called mcy having molecular weight kd was isolated from momordica cymbalaria l. fruit which showed hypoglycemic effect in in vivo study (rajasekhar et al. ) . the peptide fraction obtained from the soybean showed increase in glucose uptake in muscular cells, it is reported that obtained peptide fraction enhances the glucose uptake via amk activation (roblet et al. ) . protein extract of cucurbitaceae family containing seeds like telfairia occidentalis hook.f., citrullus lanatus l., lagenaria siceraria molina., and cucurbita moschata duchesne., showed a significant decrease in blood sugar level in in vivo study (teugwa et al. ) . peptide reported from the leaves of bauhinia variegata l, mimics like insulin which is evidently proved from the immunological reactivity and via in vivo study (azevedo et al. ) . zea mays l. (zein) seed protein hydrolysate increase the secretion of glp- directly in the ileum and indirectly in the duodenum in the rat and also confirmed with the help of glutag cell line study (hira et al. ). aglycine which is a bioactive peptide obtained from the soy bean increases the glucose uptake in c c cell, whereas in in vivo study on diabetic model it was observed that treatment with aglycine increases the expression of p-ir, p-irs , p-akt and membrane glut protein which results into hypoglycemia (lu et al. ) . soymorphin- also called µ-opioid obtained from soy bean decreases the blood glucose level via activation of adiponectin and pparα systems in diabetic kkay mice (yamada et al. ) . rice bran protein hydrolysate helps to improve insulin resistance and prevent metabolic diseases (boonloh et al. ) . protein extracted from the fruit pulp of momordica dioica roxb. decreases the blood glucose level in a diabetic model (poovitha et al. ) . plant scientist believe that there are more plants left unexplored which may contain insulin-like peptides or peptides which will mimic insulin (xavier-filho et al. ) . the success of any therapy and its acceptability depends on the ease of its delivery. the oral route of drug administration remains the most accepted route of drug delivery and most of the available drugs in the market are delivered through the oral route (morishita and peppas ) . an orally active drug should be stable enough to withstand the gastric microenvironment and also it should be permeable through the gastrointestinal membrane. small molecules are well tolerant to the gastric ph and permeated through the membrane whereas the large macromolecules are vulnerable to degradation in the gastrointestinal tract and are not permeable in their intact form to reach specific target successfully without being degraded in git (morishita and peppas ) . as the medical field advances, the peptide drugs are widely explored for their therapeutic potential in the treatment of various diseases. though these drugs are specific to their target, efficacious and potent, the oral bioavailability of peptide and peptide-based drugs always remains a challenge that limits their wide acceptance and market value. high cost of therapy is a major limitation associated with peptide drugs (ayoub and scheidegger ) . with the advancement of biotechnology, the recombinant synthesis of a peptide of interest opens a door for cheap and cost-effective peptides. also, the commercial interest for the production of bioactive peptides from the natural sources is elevated to reduce the cost but due to the lack of suitable technologies which helps to retain or increase the activity of bioactive peptides, the large-scale production gets hampered (craik et al. ) . another limitation associated with peptide-based drugs is their short half-life. when the peptides come in contact with intestinal mucosa, gastric acid in stomach and peptidases that are present in the blood convert peptides into amino acids and gets quickly eliminated from the body, thereby reducing its half-life. in order to achieve prolonged action and to reduce the frequent dosing, it is necessary to increase the half-life of the peptides. as peptide therapeutics advances, several non-degradable polymers and polymeric matrix are available to increase the half-life of peptides and proteins thus reducing the frequent dosing (brown ) . another strategy to increase the half-life of the peptide is to modify the amino acids which are susceptible to cleavage by enzymes (adessi and soto ) . oral delivery of the peptides is explored in past years but limits its use as peptides are degraded in the git tract (lau et al. ) . novo nordisk developed a semaglutide oral formulation which is currently in clinical trial phase . other non-invasive and suitable routes for the peptide and protein-based drugs are under investigation includes the nasal and pulmonary route which directly deliver the drugs to the blood component hence, reduces the overall drug degradation which is the main drawback associated with oral delivery of peptides (davies et al. ). different therapeutic peptides have made their way to the market in recent years, and it is assumed that the market demand will increase further as peptides are specific, selective and safe in comparison to small molecules (fosgerau and hoffmann ; loffet ) . traditional synthesis and drug design approaches are used by researchers and pharmaceutical companies for the development of the peptide-based drug candidate. the recombinant protein is another tool for the synthesis of the desired peptide in bacteria, yeast, and fungi. the market of peptides has now moved from the isolation of peptides from an animal source that can mimic the endogenous peptide to synthetic, semi-synthetic and recombinant peptides. further, development in this area is drug conjugated peptides and multifunctional peptides (fosgerau and hoffmann ) . peptides are susceptible to degradation when given orally, which is supposed to be the most convenient route of administration. the preferred route for the peptide delivery is intravenous, however, more research is directed for the delivery of peptides. recently, alternative routes are explored, which include oral, transdermal and intranasal route. as peptides are excellent biomarkers, they can also be utilized for diagnosing diseases. further, peptides are also found to be advantageous in vaccine development (brown ; sheridan ). plant peptides have been isolated and characterized in different ways as described in fig. . x-ray crystallography and nmr have contributed enormously in the field of structure elucidation of pure peptides (gomathi and subramanian fig. the general scheme of isolation, purification, and characterization of plant peptides (gomathi and subramanian ; krishnan and rupp ; sinz et al. ; wysocki et al. ) ; krishnan and rupp ) . other techniques like electron microscopy, fluorescence resonance energy transfer, chemical cross-linking emerged in recent year for the characterization of proteins and peptides (sinz et al. ) . mass spectrometry is frequently used technique in analyzing peptide mixtures and identifying their proposed structures. amino acid sequencing in mass spectrometry can be performed by two methods namely top-down sequencing and down-top sequencing. in top-down sequencing, the proteins analysis is performed without hydrolysis and the molecular weight can be detected by fragmentation pattern of the whole protein. a common approach is down-top sequencing in which proteolytic digestion is performed followed by ms analysis. software based on the algorithm of the amino acid sequence is available for protein identification (wysocki et al. ). plants remain a valuable source for developing a new drug candidate. they are still sharing their fair share of the new drug candidates and lead. as research interest is now shifted from small molecules to bio-molecules by virtue of several added benefits, plants are now under exploration for the presence of proteins and bioactive peptides for the disease management. though plant proteins and their functions have not been defined earlier, plant hormones were only considered through which cells communicate, as our understanding to these signalling pathways become clear, several secreted peptides and small rnas now known which regulates molecular recognition and thus, controls cell communication. from our diet we consume proteins which after gastrointestinal digestion get converted to bioactive peptides with more tissue affinity to specific receptor. several research publications in recent years claimed plant proteins and their bioactive peptides as a potential candidate in the treatment of various diseases specially in metabolic and life style borne diseases including cancer, diabetes and obesity. sophisticated instrument and advancement in technology leads easy isolation, purification and characterization of proteins and peptides in a short period of time, which further reduces the overall time of discovery. apparently, most of the bioactive peptides available for the treatment of diabetes have emerged from the synthetic route or derived from recombinant technology, which further adds in the overall cost of the treatment. to make peptide therapy economic there is a demand to explore plant-based biomolecules. this review discusses bioactive peptide isolated from various plant parts like leaves, fruits and seeds. the trend indicates that most of the bioactive peptides are obtained from seeds, although reports are available where bioactive peptides have been also isolated from leaves, whole fruits and pulps. it was observed in various in vitro and in vivo studies that protein hydrolysate obtained from aqueous extracts of plant are capable of inhibiting the enzymes and transporter systems responsible for the diabetes. expedition to find molecular targets and mechanism of action of plant based bioactive peptides is needed to use these bioactive peptides as a potential drug candidate. epidemiological data reveals that type diabetes requires promising therapy as the available synthetic drugs for the treatment have moderate to severe adverse effects. plant-derived bioactive peptides inhibit the enzymes like alpha-glucosidase, alpha-amylase, dipeptidyl peptidase-iv and glucose transporter systems involved in type diabetes. in vivo studies of a certain plant peptides fraction showed insulin-mimetic action in animal models. it is clear that plant-derived bioactive peptides are not explored to their full potential in comparison to other classes of natural products. the reason might be the lack of suitable instrumental techniques to identify, purify and characterize the peptides from plant source. however, the recent advancement in lcms, lc-nmr, and x-ray crystallography has bridged the gap and this would be the right time to embark on plant peptides. in future, systematic studies including dereplication for the early identification of known peptides, target identification followed by in vivo studies would help to speed up the plant peptides research. dipeptidyl peptidase- inhibitors: clinical data and clinical implications amylase inhibitors: potential source of anti-diabetic drug discovery from medicinal plants anti-oxidant, anti-diabetic, and anti-hypertensive effects of extracted phenolics and hydrolyzed peptides from barley protein fractions connectivity maps for biosimilar drug discovery in venoms: the case of gila monster venom and the anti-diabetes drug byetta® in vitro antioxidant and α-amylase inhibitory properties of watermelon seed protein hydrolysates peptide drugs, overcoming the challenges, a growing business isolation and intracellular localization of insulin-like proteins from leaves of bauhinia variegata dpp- inhibitors and their potential role in the management of type diabetes classification, pathophysiology, diagnosis and management of diabetes mellitus rice bran protein hydrolysates improve insulin resistance and decrease pro-inflammatory cytokine gene expression in rats fed a high carbohydrate-high fat diet a single luteinizing hormone determination hours after depot leuprolide is useful for therapy monitoring of gonadotropin-dependent precocious puberty in girls glucose transporters: structure, function and consequences of deficiency commercial challenges of protein drug delivery postprandial glucose regulation and diabetic complications coherent market insights partial purification and characterization of an insulin-like material from spinach and lemna gibba g the future of peptidebased drugs proteins: structures and molecular properties effect of oral semaglutide compared with placebo and subcutaneous semaglutide on glycemic control in patients with type diabetes: a randomized clinical trial a modern review of diabetes mellitus: an annihilatory metabolic disorder bloomberg markets top diabetes drugs anti-diabetic and antihypertensive activities of two flaxseed protein hydrolysate fractions revealed following their simultaneous separation by electrodialysis with ultrafiltration membranes the incretin system: glucagon-like peptide- receptor agonists and dipeptidyl peptidase- inhibitors in type diabetes glatiramer acetate (copaxone®) induces degenerate, th -polarized immune responses in patients with multiple sclerosis characterization of antidiabetic and antihypertensive properties of canary seed (phalaris canariensis l.) peptides peptide therapeutics: current status and future directions extraction and purification of a substance with luteinizing hormone releasing activity from the leaves of avena sativa defects in signal transduction proteins leading to disease. in: sitaramayya a (ed) introduction to cellular signal transduction signaling peptides in plants elucidation of secondary structures of peptides using high resolution nmr biofunctional properties of bioactive peptide fractions from protein isolates of moringa seed (moringa oleifera) proteins of some legume seeds: soybean, pea, fababean and lupin food-derived peptides with biological activity: from research to food applications production of dipeptidyl peptidase iv inhibitory peptides from defatted rice bran anti-oxidation activities of rice-derived peptides and their inhibitory effects on dipeptidylpeptidase-iv the current state of peptide drug discovery: back to the future glp- secretion is enhanced directly in the ileum but indirectly in the duodenum by a newly identified potent stimulator, zein hydrolysate, in rats amylase inhibitors and their biomedical applications in vitro antioxidant and antidiabetic activity of oligopeptides derived from different mulberry hypoglycemic activity of polypeptide-p from a plant source bioactive proteins and peptides from food sources. applications of bioprocesses used in isolation and recovery effects of single and long-term administration of wheat albumin on blood glucose control: randomized controlled clinical trials macromolecular structure determination: comparison of x-ray crystallography and nmr spectroscopy peptides derived from soy and lupin protein as dipeptidyl-peptidase iv inhibitors: in vitro biochemical screening and in silico molecular modeling study discovery of the once-weekly glucagonlike peptide- (glp- ) analogue semaglutide somatostatin-like material is present in flowering plants research progress in structure-activity relationship of bioactive peptides peptides: new signalling molecules in plants identification of the bioactive and consensus peptide motif from momordica charantia insulin receptor-binding protein peptides as drugs: is there a market? growth potential in peptide and protein therapeutics to enhance supplier capacities the soybean peptide aglycin regulates glucose homeostasis in type diabetic mice via ir/irs pathway glucagon-like peptide- receptor agonists for the treatment of type diabetes: differences and similarities gonadotropin releasing hormone agonists: expanding vistas mcy protein, a potential antidiabetic agent: evaluation of carbohydrate metabolic enzymes and antioxidant status optimization of enzymatic production of anti-diabetic peptides from black bean (phaseolus vulgaris l.) proteins, their characterization and biological potential evaluation of the hypoglycemic potential of a black bean hydrolyzed protein isolate and its pure peptides using in silico, in vitro and in vivo approaches characterization of peptides from common bean protein isolates and their potential to inhibit markers of type- diabetes, hypertension and oxidative stress bioactive peptides and proteins from foods: indication for health effects drug therapy of postprandial hyperglycaemia is the oral route possible for peptide and protein drug delivery? a prolactin inhibitory factor with immunocharacteristics similar to thyrotropin releasing factor (trh) is present in rat pituitary tumors (gh and w ), testicular tissue and a plant material, alfalfa small signaling peptides in arabidopsis development: how cells communicate over a short distance a novel approach to control hyperglycemia in type diabetes: sodium glucose cotransport (sglt) inhibitors: systematic review and meta-analysis of randomized trials non-peptidyl insulin mimetics as a potential antidiabetic agent medical management of hyperglycemia in type diabetes: a consensus algorithm for the initiation and adjustment of therapy: a consensus statement of the american diabetes association and the european association for the study of diabetes natural proteins: sources, isolation, characterization and applications enzyme-assisted extraction and identification of antioxidative and alpha-amylase inhibitory peptides from pinto beans (phaseolus vulgaris cv. pinto) quinoa (chenopodium quinoa willd.) protein hydrolysates with in vitro dipeptidyl peptidase iv (dpp-iv) inhibitory and antioxidant properties hard-to-cook bean (phaseolus vulgaris l.) proteins hydrolyzed by alcalase and bromelain produced bioactive peptide fractions that inhibit targets of type- diabetes and oxidative stress peptide-based drug design: here and now current challenges in peptide-based drug discovery diabetes mellitus-a devastating metabolic disorder protein extract from the fruit pulp of momordica dioica shows anti-diabetic, anti-lipidemic and antioxidant activity in diabetic rats history of insulin isolation and characterization of a novel antihyperglycemic protein from the fruits of momordica cymbalaria identification and characterization of two novel α-glucosidase inhibitory oligopeptides from hemp (cannabis sativa l.) seed protein enhancement of glucose uptake in muscular cell by soybean charged peptides isolated by electrodialysis with ultrafiltration membranes (eduf): activation of the ampk pathway world health organization the evolutionary origins of hormones, neurotransmitters, and other extracellular chemical messengers: implications for mammalian biology pharmacokinetic-pharmacodynamic relationships of acarbose hypoglycemic effects of cocoa (theobroma cacao l.) autolysates is there a role for α-glucosidase inhibitors in the prevention of type diabetes mellitus? proof of concept for next-generation nanoparticle drugs in humans chemical cross-linking and native mass spectrometry: a fruitful combination for structural biology extraction, identification, and structureactivity relationship of antioxidative and α-amylase inhibitory peptides from cumin seeds (cuminum cyminum) selenium: an insulin mimetic anti-hyperglycaemic globulins from selected cucurbitaceae seeds used as antidiabetic medicinal plants in africa the emergence of peptides in the pharmaceutical business: from exploration to exploitation influence of cowpea (vigna unguiculata) peptides on insulin resistance intercellular communication during plant development bioactivity evaluation of cucurbitin derived enzymatic hydrolysates in vitro inhibition of dipeptidyl peptidase iv by peptides derived from the hydrolysis of amaranth (amaranthus hypochondriacus l.) proteins exploration of nature's chemodiversity: the role of secondary metabolites as leads in drug development release of dipeptidyl peptidase iv, α-amylase and α-glucosidase inhibitory peptides from quinoa (chenopodium quinoa willd.) during in vitro simulated gastrointestinal digestion mind-body medicine: stress and its impact on overall health and longevity diabetes mellitus, type -a review evaluation of the antidiabetic activity of hydrolyzed peptides derived from juglans mandshurica maxim fruits in insulin-resistant hepg cells and type diabetic mice mass spectrometry of peptides and proteins plant insulin or glucokinin: a conflicting issue soymorphin- , a soy-derived μ-opioid peptide, decreases glucose and triglyceride levels through activating adiponectin and pparα systems in diabetic kkay mice alpha-glucosidase inhibitory activity of protein-rich extracts from extruded adzuki bean in diabetic kk-ay mice slow acting protein extract from fruit pulp of momordica charantia with insulin secretagogue and insulinomimetic activities purification and characterisation of a hypoglycemic peptide from momordica charantia l. var. abbreviata ser peptides derived from oats improve insulin sensitivity the authors want to thank niper ahmedabad, under the aegis of the ministry of chemicals and fertilizers, department of pharmaceuticals for providing facilities and research fellowships. key: cord- - friw ek authors: chumakov, s. p.; prassolov, v. s. title: organization and regulation of nucleocytoplasmic transport date: - - journal: mol biol doi: . /s sha: doc_id: cord_uid: friw ek separation of dna replication and transcription, which occur in the nucleus, from protein synthesis, which occurs in the cytoplasm, allows a more precise regulation of these processes. selective exchange of macromolecules between the two compartments is mediated by proteins of the nuclear pore complex (npc). receptor proteins of the karyopherin family interact with npc components and transfer their cargos between the nucleus and cytoplasm. nucleocytoplasmic transport pathways are regulated at multiple levels by modulating the expression or function of individual cargoes, transport receptors, or the transport channel. the regulatory levels have increasingly broad effects on the transport pathways and affect a wide range of processes from gene expression to development and differentiation. since eukaryotic genetic material is localized in a separate compartment (the nucleus), dna replica tion and transcription are spatially separated from protein synthesis, which occurs in the cytoplasm. the spatial separation provides the cell with additional opportunities to finely regulate the processes and, on the other hand, requires the highly selective exchange of many macromolecules between the nucleus and cytoplasm. exchange is due to a variety of receptor transport proteins, which interact with components of the nuclear pore complex (npc) to transport bound proteins between the two compartments. each of the receptors is involved in its specific transport pathway, transferring a certain set of substrates (cargoes). the state of a substrate also plays an important role in the transfer, since a cargo must have necessary signal sequences to properly bind to its receptor. the major ity of cell signaling pathways are involved in signal transduction between the nucleus and cytoplasm. thus, detailed knowledge of the organization and reg ulation of nucleocytoplasmic transport is essential for the understanding of cell metabolism and functional activity. macromolecular exchange between the nucleus and cytoplasm is mediated by nuclear pores. selective transport of molecules is due to the npc. nuclear pores were first identified in , when the nuclear membrane was examined by electron microscopy [ ] . further studies showed that holes in the nuclear mem brane are occupied by the npc, whose structure is evolutionarily conserved [ ] . the npc is a protein complex of - mda that has sevenfold symmetry and consists of more than proteins known as nucle oporins (nup). the spatial structure of the npc was established by tomographic reconstruction. the npc consists of a cylindrical central framework, eight microfilaments attached to it at the cytoplasmic side, and a nuclear basket of eight filaments, which are attached to the framework at the nuclear side and are distally bound with each other. the central part of the npc has a channel, which resembles a sand glass in shape and has a diameter of approximately nm in the narrowest region ( fig. ) [ ] . the cytoplasmic filaments of the npc are approx imately nm in length; the nuclear filaments, which form the nuclear basket, are nm. the length of the npc including the central framework of nm is approximately nm; its outer diameter is nm [ ] . npc components slightly vary in size among organisms but the general structure and proportion are the same [ ] . the npc number on the nuclear membrane broadly varies depending on the cell size and activity. one yeast cell has approximately npcs, while actively proliferating human cells have an npc den sity of - npcs/μm , corresponding to - npcs per nucleus. mature oocytes of the spur toed frog xenopus have npcs/μm , corresponding to × npcs per cell [ ] . the npc density increases as the cell progresses through the cell cycle [ ] . one npc transmits molecules per second on average; i.e., at least ten molecules simultaneously pass through one nuclear pore. proteins of up to nm can pass through the nuclear pore. proteins of less than - kda freely diffuse through the nuclear pore, while larger proteins ( - kda) must have a nuclear localization signal (nls) or a nuclear export signal (nes) to be transferred from the cytoplasm into the nucleus or backwards. the signals are bound by transport receptors, which are capable of passing through the npc both alone and in complex with another protein [ ] . highly efficient (more efficient than simple diffu sion) transfer of a protein of any size into the nucleus requires the npc mediated active transport mecha nisms [ ] . this transport depends on either affinity of the target protein for npc components or its com plexation with the receptor proteins that have such affinity. kariopherins, which are the largest class of transport receptors, include importins, exportins, and transportins, which are involved in both nuclear import and export. importin β (karyopherin in yeasts) is one of the best studied karyopherins. this protein binds with cargo molecules through the adap tor protein importin α (karyopherin in yeasts) and is capable of interacting with several nup proteins [ ] . according to a typical mechanism of interactions of karyopherins with a target protein and nup, a short motif (nls or nes) is recognized in the target pro tein and is bound by karyopherins or intermediate adaptor proteins, which then interact with karyo pherins. the resulting protein complex may interact with the nup proteins contained in the npc to enter the nucleus or leave it. proteins of the same size that lack a nls/nes are incapable of passing across the npc barrier. energy for transportation of complexes through the npc is generated by a high gradient of the ran gtpase between the nucleus and cytoplasm. nuclear rangtp is capable of releasing the target protein from an imported complex upon binding with karyo pherins. exportins occurring in the nucleus interact with their targets in the presence of rangtp, which is involved in transferring the complex through the npc into the cytoplasm, where rangtp is hydrolyzed to disassemble the transport complex and to replenish the cytoplasmic rangdp pool [ ] . proteins of the karyopherin family are involved in the majority of specific nucleocytoplasmic transport pathways. there are at least karyopherins in human cells and in yeast cells [ ] . target proteins that are transferred by karyopherins into or out of the nucleus usually have an nls or nes. the main, or classical, nls structure was established in studies of the nls of the sv large t antigen. this nls consists of one or two positively charged amino acid clusters, which are connected via a neutral linker [ ] . further studies revealed many nlss that lack this classical structure. functional activity of an nls may be regulated via posttranslational modification or signal masking as a result of conformational changes. the classical nls binds with the adaptor protein karyopherin α (importin α), which forms a hete rodimer with importin β. in turn, importin β ensures nuclear import [ ] . nls binding is due to a karyo pherin α region that contains ten arm repeats, each consisting of amino acid residues forming three α heli ces. the region assumes a superhelical structure, which has a shallow groove, which accommodates a nls. an nls is accommodated in the acceptor pocket of karyopherin α owing to its certain charge and hydrophobicity. binding with karyopherin α, one nls contacts several arm repeats [ , ] . most proteins of the karyopherin β family bind with a target protein directly. the primary structures of nlss recognizable by karyopherin β are far more diverse than the structures of nlss interacting with importin α. an nls often has a consensus sequence of several positively charged amino acid residues. such sequences were found in the nlss of histones, riboso mal proteins, and certain rna binding proteins. in some cases, a minimal functional nls is rather large. for instance, the m nls consists of amino acid residues, is enriched in glycine, and includes only a few positively charged residues. even longer nlss are known, indicating that a certain spatial structure of the karyopherin binding region plays an important role in complex formation [ ] . karyopherins that mediate nuclear export recognize ness [ , ] . minimal func tional ness have been identified for several proteins. a moderately conserved short motif with three or four hydrophobic amino acid residues (lpplerltl in the rev nes) has been studied in most detail to date. this nes is utilized in all eukaryotes and serves as a binding site for the karyopherin crm . the nes was found in at least proteins, including many tran scription factors, cell cycle regulators, and virus pro teins, such as rev and proteinase k inhibitor of the human immunodeficiency virus type (hiv ), where this signal was identified for the first time. like importin β , crm is capable of transferring com plexes through the npc, binding its target via an adaptor protein [ ] . certain proteins have nonhydrophobic ness. yeast msn p is the best known karyopherin that binds to such sites [ ] . this transportin is capable of medi ating both nuclear export and import of various pro teins [ ] . export of all known targets of msn p is induced by their phosphorylation, indicating that a phosphate group is contained in the nes or that phos phorylation indirectly affects the recognition of the nes by a receptor [ ] . some exportins utilize other karyopherins as tar gets. for instance, cas exports karyopherin α from the nucleus, thus, bringing it back into the cytoplasm [ ] . in addition to proteins, certain rnas are subject to karyopherin dependent export. at least two export ins, exportin t and exportin , are capable of directly binding rna. exportin t exports trna from the nucleus by binding to a trna structural element act ing as a signal. in higher eukaryotes, exportin trans fers microrna precursors from the nucleus upon recognizing their hairpin structure with a protruding ' end [ ] . the main properties that allow karyopherins to ensure efficient nuclear transport are their capabilities of binding with a target protein (directly or through an adaptor) and interacting with nup or rangtp [ ] . almost all karyopherins known in humans and yeasts are involved in either export or import. two proteins (human importin and yeast msn p) are exceptions, having both exporting and importing activities. known importins are more numerous than exportins. for instance, ten importing, two exporting, and one universal karyopherins were identified in yeast cells. karyopherin targets are far more numerous than karyopherins. a clear view of the situation is just emerging, since karyopherins recognize the localiza tion signals having low structural homology ( table ) . all karyopherins are similar in molecular weight ( - kda), charge, and domain structure. a ran binding domain is usually at the n end, a nup binding domain is in the center, and a target protein binding domain is at the c end. structural studies of four dif ferent karyopherins showed that a karyopherin mole cule contains approximately heat repeats, each consisting of amino acid residues that form two antiparallel α helices linked by a turn. the repeats stack together to form superhelical arches at both ends of the protein [ ] . one karyopherin may form differ ent complexes with different target proteins; i.e., the karyopherin molecule has many binding sites, each recognizing a certain motif. in addition, karyopherins may change their conformation depending on the tar get protein [ ] . this circumstance explains why one karyopherin transfers several proteins lacking homol ogous regions [ ] . transport both importins and exportins bind with rangtp. importins bind with rangtp with high affinity, and the target protein is consequently released from its complex with an importin. exportins, whose function requires the formation of a ternary complex with rangtp and a cargo, have low affinity for rangtp when not bound with a target protein [ , ] . a study of the atomic structure for the complexes of importin β and karyopherin β with rangtp showed that rangtp binds similarly with both receptors. rangtp interacts with a concave on the n terminal arch of importins, and its binding site has two spatially separate domains, one being closer to the n end of the arch, and the other closer to its c end [ ] . the c terminal domain that contacts rangtp is a negatively charged loop between two heat domains. the same loop was found to bind with proteins transported by karyo pherin β . this circumstance may explain why the target protein is released from its complex with impor tin once transferred into the nucleus (and in contact with rangtp). mutations of the c terminal rangtp binding domain result in a loss of the asso ciation between karyopherin binding with rangtp and the release of a cargo. thus, a contact of rangtp with this region may regulate the formation of the importin-cargo complex [ ] . structural studies of the exportin cse p in complex with rangtp and its cargo karyopherin α clearly demonstrate the difference in rangtp binding between import and export complexes. cse p forms a superhelix with n and c terminal arches, which clamp around rangtp on both sides. simulta neously, cse p binds with karyopherin α and fixes it in a conformation that prevents cargo binding. karyo pherin α also forms bonds with rangtp in this com plex. a feature of cse p-rangtp interaction is that the n terminal arch alone can form a weak bond with rangtp, but the complex becomes sufficiently stable only when rangtp binds with both n and c termi nal arches, which is conformationally possible only when cse p is bound with cargo. the resulting ternary complex resembles a tight spring, which stretches in the cytoplasm upon rangtp hydrolysis to release the cargo karyopherin α [ ] . the exportin crm is similar in several structural features to karyopherin β . like the c terminal rangtp binding site of karypherin β , crm has a flexible loop in the middle of the polypeptide chain. the domain is thought to shield the binding site for target proteins and, simultaneously, to prevent crm from forming a stable complex with rangtp. how ever, when crm binds with a target protein or rangtp, the loop changes its position and stimulates the formation of a stable ternary complex. it is possible that conformational changes of a flexible loop underlie a general mechanism of the formation and dissocia tion of transport complexes [ ] . at the same time, the exportin cse p lacks regions similar to the crm flexi ble loop, indicating that the mechanism is not univer sal [ ] . structural studies of karyopherin β in complex with the phenylalanine-glycine (fg) repeat contain ing fragment of yeast nup showed that the interactions between the two proteins are mostly hydrophobic and involve the phenylalanine residues of nup. the karyo pherin molecule forms two hydrophobic pockets: between heat repeats and and between repeats and . the pockets capture the amino groups of the phenylalanine residues contained in the fg repeats of nup [ ] . a similar nup binding region occurs at the c end of karyopherin β [ ] . there is evidence that importing karyopherins have higher affinity for the nup proteins located at the nuclear side of the npc, while exporting karyopherins have higher affinity for cytoplasmic nup proteins [ ] . a deletion analysis of all fg containing nup proteins (with a yeast model) made it possible to construct npcs having minimal sets of the fg domains. it was found by this means that asymmetrical fg domains (which occur exclusively at the cyptoplasmic or nuclear side of the npc) have no functional role. the role of asymmetrical nup proteins in nuclear transport is still unclear. the main role in nuclear transport is ascribed to the ran gtpase, which controls the formation and disso ciation of transport complexes. ran activity is strongly regulated by evolutionarily conserved proteins, which regulate both the intracellular distribution of ran and the transition between the gtp and gdp associated forms [ ] . although ran occurs predominantly in the nucleus, it is continuously delivered into the cyto plasm at a high rate ( molecules per second), mostly as a component of export complexes [ ] . re import of ran is due to ntf . ntf interacts with rangdp, which is the main cytoplasmic form of ran. the ntf -rangdp complex is transferred into the nucleus, which is due to the ability of ntf to interact with low affinity with fg containing nup proteins, as karyopherins do. the rangdp transfer is unidirec tional because rangdp is rapidly converted to rangtp on the nuclear surface of the npc. since ntf binds to the switchii region of rangdp and this region has another conformation in rangtp, the ntf -rangtp transport complex dissociates, and ntf is recycled into the cytoplasm [ ] (fig. ) . rangdp is converted to rangtp in the nucleus, where ran interacts with the ran guanine exchange factor (rangef). rangef is associated with chro matin, occurs in the nucleus at approximately one copy per nucleosome, and directly interacts with nucleosomal histones h a and h b. rangef stimu lates the exchange of gdp for gtp because a rangef loop (β wedge) penetrates into ran and releases gdp. binding with rangef fixes ran in the free form for a period long enough for ran to bind gtp contained in the nucleoplasm [ ] . although no preference for gtp is observed for nucleotide exchange in the presence of rangef in vitro, ran-gtp binding in vivo is more likely owing to a higher gtp : gdp ratio in the nucleoplasm. rangef does not always occur in a chromatin associated form. its free form is detectable in the cell both in the interphase and during mitosis [ ] . how ever, only chromatin associated rangef is capable of stimulating gtp-gdp exchange in complex with ran. the t n ran mutant, which is incapable of nucleotide exchange, leads to a fixation of free rangef on chromatin [ ] . it is most likely that gtp binding to ran is necessary for a detachment of the two proteins from chromatin. in addition, the associ ation of such exchange complexes with chromatin plays an important role in mitosis, since a high local concentration of rangtp is necessary for the proper assembly of microtubules in the vicinity of the chro mosome surface [ ] . high affinity interactions between rangtp and karyopherins leaving the nucleus take place both when cargoes are transported into the cytoplasm (with exportins) and when imported karyopherins are recyc led into the cytoplasm. in either case, a rangtpkaryopherin complex is moved onto the cytoplasmic surface of the npc and then interacts with rangap, which substantially (approximately by five orders of magnitude in vitro) increases the originally weak gtpase activity of ran. this interaction causes gtp hydrolysis to yield rangdp. rangdp in complex with a karyopherin is not fully accessible for rangap, and ranbp facilitates the rangtp-rangap bind ing via either generating a ranbp -rangtp-karyo pherin intermediate complex or releasing rangtp from karyopherin [ ] . the cytoplasmic localization of rangap and the nuclear localization of rangef underlie the mecha nism that maintains a high gradient of rangtp between the nucleus and cytoplasm [ ] . a fluores cence resonance energy transfer (fret) analysis showed that the difference between rangtp concen trations in the nucleus and cytoplasm is more than two orders of magnitude [ ] , which agrees well with mod eling results. rangtp plays an important role in nuclear trans port. a high rangtp concentration in the cytoplasm would distort nuclear import, causing premature dis sociation of import complexes. the cell has mecha nisms to ensure the proper location of the ran regula tors. rangef mediated exchange of gdp for gtp occurs exclusively in the nucleus owing to two import pathways [ ] . rangtp hydrolysis occurs in the cyto plasm, since rangap is too large to enter the nucleus through the npc. rangap can be sumoylated (covalently bound with a protein of the sumo family), and the resulting form has affinity for cyto plasmic nup [ ] . ranbp is small enough to enter the nucleus through the npc via diffusion. however, ranbp has a nes recognizable by the exportin crm , which ensures continuous active export of ranbp appearing in the nucleus via diffu sion into the cytoplasm. rangtp and karyopherins are the main means of nuclear transport. it should be noted, however, that the eukaryotic cell has several accessory factors that affect the transport efficiency. for instance, the effi ciency of karyopherin α dependent nuclear import is improved upon npap binding, which increases karyopherin α affinity for importin β . appearing in the nucleus, the npap -importin β -karyopherin α-nls cargo complex is affected by rangtp and cas, which acts as an exportin to recycle karyopherin α into the cytoplasm. the import complex is cleaved into two complexes, npap -importin β -rangtp and cas-karyopherin α-rangtp. a feature of the npap effect is that npap differently binds with importin β during import. it is thought that npap has a structure of a tri stable switch [ ] . ranbp , which is a npap analog involved in export, is struc turally similar to npap and binds with the exportin crm to increase its affinity for rangtp and nes containing proteins, thus improving the efficiency of crm dependent export [ ] . nucleoporins: their structure and functions the npc consists of more than different nup proteins, each occurring in at least eight copies [ ] . some nup proteins are predominantly included in the npc, while some others shuttle between the nucleus and cytoplasm and are only associated with the npc for a short while [ ] . the fg repeats, which are found in one third of all nup proteins, are particularly important in the func tioning of npc. there are approximately fg domains containing fg repeats in one npc. fg domains are not distinctly structured and are thought to line the inner wall of the nuclear channel. all known transport receptors bind with fg repeats, which are essential for nuclear transport [ ] . the interaction with a transport receptor involves mostly the phenyl alanine ring of the core fg region; the ring binds with hydrophobic amino acid residues on the surface of a transport receptor [ ] . hydrophilic regions, which occur between individual fg motifs and account for a major part of an fg domain, allow several fg regions of one domain to bind with a receptor [ ] and, pre sumably, are necessary for modulating the receptor binding (table ) . structural studies of the fg domains in yeast nup proteins by biophysical methods showed that the domains are unfolded and lack a distinct secondary structure in natural conditions [ ] . similar data were obtained for the fg domains of nup proteins of other organisms [ ] . as was revealed by electron micros copy, structures containing fg domains are flexible and are capable of moving along the npc. atomic force microscopy (afm) showed that the human nup fg domain ( amino acid residues) occurs as a long ( nm) unstructured sequence [ ] . the spatial structure of the npc changes dynami cally, adapting to nuclear transport requirements. two main types of npc conformations were identified by cryoelectron microscopy. conformations of one type are characterized by the cytoplasmic filaments extended towards the central channel of the npc. the filaments apparently interact with a protein complex transferred though the npc. conformations of the other type have stretched unfolded cytoplasmic fila ments [ ] . the arrangement of certain functional domains of nup changes in the course of nuclear transport. for instance, the c terminal end of the fg domain of nup moves onto the nuclear side of the npc when polyadenylated rna is introduced into the nucleus, while the n end remains associated with cytoplasmic fibrils [ ] . the fg domain of nup similarly moves along the npc as cargoes are transported [ ] . in addition to their role in nucleocytoplasmic transport, the fg domains may perform other func tions. for instance, the fg repeats of the rrm domain of mouse nup have a distinct secondary structure and do not interact with transport receptors. these domains interact with the ndc transmembrane protein and, possibly, are involved in forming the cen tral framework of the npc [ ] . nup proteins may play a role in transcriptional reg ulation by interacting with active genes. this assump tion is supported by the fact that npcs are not regu larly distributed through the nuclear membrane, but their distribution corresponds to the distribution of active chromatin within the nucleus. such a distribu tion is necessary for a correct rearrangement of the chromosomes and nuclear envelope during mitosis and for mrna export through specific npcs in the interphase [ ] . further studies revealed that yeast genes whose transcription is increasing move to the periphery of the nucleus, which is due to their binding with certain nup proteins [ ] . in addition, several nup proteins, such as nup p, bind to loci at the boundary between euchromatin and heterochromatin, thus preventing heterochromatin from spreading to transcriptionally active regions. as was revealed more recently, nup p binds to chromatin within a complex consisting of nup p, nup p (an element of the npc nuclear basket), prp (a yeast analog of rangef), and the htz histone protein [ ] . htz is responsible for the prevention of hetero chromatin spreading [ ] . nup p interacts with the promoters of functionally active genes, and this inter action depends on transcriptional activators and the tata box located ' of them [ ] . thus, the npc may serve as a multifunctional regulator of gene expression by distributing transcription activation signals and checking the quality of spliced mrna. nup proteins can act as a platform for the attach ment of various transport factors. for instance, nup , which is a component of the cytoplasmic fibrils of the npc, provides a platform for the cleavage of transport complex and a subsequent recycling of transport factors. the nup -ranbp complex has four ran binding sites and a binding site for the sumoylated ran activator (rangap ). to bind to this complex, rangap must be sumoylated, while ranbp has sumoylating activity. acting in complex with the e ligase ubc similarly to the e ligase, ranbp stabilizes rangap on the cytoplasmic fibrils [ ] . rangap is capable of stimulating hydrolysis of rangtp associated with export com plexes, which is essential for nuclear export. some nup proteins bind with proteins possessing desumoylating activity, such as senp (yeast ulp ) [ ] . there is evidence that ulp binds with nup p and mlps located on the nuclear side of the npc [ ] . other data indicate that ulp is anchored on the npc by npc associated karyopherins [ ] . this possibly facilitates desumoylation of hnrnps that were sumoylated on the cytoplasmic side of the npc and then transferred into the nucleus. yet this assumption disagrees with the data that ubc occurs on the nuclear side of the npc as well, possibly binding with nup p [ ] . apart from sumoylation, the dead box helicase dbp plays a substantial role in separating hnrnp from mrna. dbp also occurs on the cytoplasmic surface of the npc in complex with nup (yeast nup p). atpase activity of dbp depends on the binding of gle , an mrna export factor capable of binding to the npc, and is stimulated by inositol hexaphosphate (ip ) [ ] . in yeast cells, dbp is bound with mex , which is an analog of tap/nxf . the level of mex bound rna is elevated in cell lines with mutant dbp , implicating dbp in the mex recycling to the nucleus [ ] . thus, both karyopherin dependent export and mrna export depend on a protein stimulating ntpase activity (rangap in the case of ran gtpase and gle /ip in the case of atpase activity of dbp ). moreover, the interaction of these activators with a substrate is determined by specific binding sites of cytoplasmic nup proteins and is regulated by addi tional cofactors (sumoylation and ip ). hierarchic regulation of nuclear transport nuclear transport is regulated by several mecha nisms, which are organized hierarchically. a flow of proteins transferred between the nucleus and cyto plasm changes in response to various signals, such as hormones, cytokines, and growth factors, as well as signals regulating the cell cycle, differentiation, and the immune response, and in stress. modification of signal molecules via phosphorylation or dephosphory lation, which is the most clearly understood mecha nism regulating nuclear transport, may involve many kinases and phosphatases [ ] . kinases and phos phatases are regulated by many cell signals, directly linking external signaling factors and intracellular sig nals with changes in the nuclear import or export of signal molecules, such as cell cycle regulators, kinases, and transcription factors. many of these proteins have both nes and nls, which allow a fine regulation of their intracellular localization by changing the effi ciency of nuclear export or import [ ] . one of the key steps of nuclear transport is the interaction of importins and exportins with the nls or nes of a target protein. changes in nls or nes accessibility via inter or intramolecular masking are one of the most common mechanisms modulating the efficiency of nuclear transport of a particular protein. intramolecular masking occurs when the accessibility of the nls or nes decreases as a result of conforma tional changes in the protein containing the given site. an example of such regulation is provided by the nf κb p transcription factor. the factor occurs in the cyto plasm in the form of a p precursor, which has an nls inaccessible for binding with importin α. during the immune response, phosphorylation and degrada tion of the c terminal fragment of p unmasks the nls, which then binds with importin α to allow active transfer of nf κb into the nucleus [ ] . conformational changes that result from disulfide bonding between the amino groups of cysteine resi dues in one protein may also mask or unmask the nes or nls. for instance, a disulfide bond is formed in stress in the yeast transcription factor yap p between cys and cys , which belong to a cysteine rich region. an nes, which is in the same region, conse quently becomes inaccessible for xpo p [ ] . such masking takes place when binding with another protein makes the localization signals inac cessible for transportins. an example is provided by the nf at transcrip tion factor. at a higher ca + concentration, nf at binds with the ca + responsive phosphatase cal cineurin, which masks the crm binding nes of nf at . when the ca + concentration decreases, calcineurin dissociates from nf at , the nes becomes accessible for crm , and crm transfers nf at from the nucleus [ ] . the nuclear localization of the p tumor suppres sor is regulated by several mechanisms. one of these is p homotetramerization in the nucleus, which occurs in response to dna damage [ ] and masks the c terminal ness. it is essential for a nuclear export of p that the tetramer dissociate and the c terminal ness be unmasked [ ] . ligand binding may also mask the nls or nes of a receptor. for instance, the nes of the androgen receptor is close to its ligand binding domain. upon binding with the ligand, the nes becomes inaccessi ble for crm , and nuclear export is suppressed until the ligand dissociates from the receptor [ ] . an inter molecular masking of an nls or nes is possible upon dna or rna binding. hiv rev, which transfers nonspliced virus mrna from the nucleus into the cytoplasm, masks its own importin β dependent nls upon mrna binding [ ] . a release of mrna restores the ability of rev to bind with importin β , which then recycles rev into the nucleus. rev binds again with virus mrna, this binding facilitates rev dissociation from its complex with importin β , and a new round of mrna nuclear export starts. the yeast gal transcription factor and human sry chromatin remodeling factor have dna bind ing domains that overlap their nlss [ ] . binding to dna prevents their association with importin β and vice versa. it is possible that this mechanism is alterna tive to a rangtp mediated release of cargoes from transport complexes. the mechanism is effective when the local rangtp concentration is too low or ran activ ity is suppressed by high ca + concentrations [ ] . many various rnas are expressed in the nucleus. to be exported from the nucleus, rnas undergo post transcriptional modification, which is necessary for successful interactions with proteins of the transport complex. for instance, trnas acquire the capability of binding with exportin t at the last step of their maturation in the nucleus, and this capability ensures the export from the nucleus only for mature trnas. several cell mrnas are exported owing to cis regula tory elements. in mouse cells, certain retroviral tran scripts leave the nucleus owing to the constitutive transport element (cte), which is capable of binding directly to the mnxf export receptor [ ] . the mrnas of several genes involved in the cell cycle con trol have a β untranslated region (utr) that is recogni zed by the translation initiation factor eif e, which binds to the ' utr and mediates the mnxf depen dent export of these transcripts [ ] . in addition to an nes, mrna may contain an nls, which provides for an additional mechanism of gene expression regulation. the ' utr of the msf cytokinin gene contains a sequence that holds this mrna in the nucleus. further posttranscriptional modification in response to tgf β activates the export of the msf mrna and msf synthesis [ ] . signal elements may regulate the rna nuclear import as well. for instance, an nt sequence is necessary and sufficient for the nuclear import of the mir b microrna [ ] . the export of mature mrnas and ribosomal sub units results from a series of standard modifications essential for the binding with export receptors. during maturation, mrna interacts with various proteins to form mrnp. the composition of this complex changes in the course of mrna splicing, capping, and polyadenylation. mature mrna is capable of inter acting with the mnxf -nxt (yeast mex -mtr ) transport complex, which is necessary for cytoplasmic export [ ] . the mrnp transfer through the npc depends on ip , thus allowing phospholipase c to regulate the mrna nuclear export. similar modifica tions necessary for the export from the nucleus occur in rrna as well [ ] . phosphorylation of proteins at sites close to an nls or nes is capable of not only masking these sig nals, but also improving protein affinity for importins and exportins. for instance, ck kinase phosphoryla tion of the t antigen at ser / , which are adja cent to the nls, results in a fold increase in nls affinity for importin α, and this leads to a fold increase in the nuclear import of the t antigen [ ] . phosphorylation may affect the nuclear export as well. for instance, phosphorylation of pho at ser and ser improves the recognition of its nes by the msn p exportin [ ] . in addition to phosphorylation, target proteins may experience other types of posttranslational modifica tion, such as methylation and ubiquitination, which are also capable of regulating the intracellular location of the protein. for instance, the proper nuclear import of rna helicase a requires methylation of its nls, and pten phosphatase does not efficiently enters the nucleus until monoubiquitinated at certain lysine res idues [ ] . ubiquitinase ubcm appears in the nucleus only in an activated ubiquitinated form. this example sup ports the idea that the functional state of an enzyme may affect its intracellular localization [ ] . another mechanism regulating nuclear transport is a binding of nls or nes containing proteins with cytoplasmic or nuclear factors that anchor the bound proteins to retain them in the cytoplasm or nucleus. for instance, p is retained in the cytoplasm in the absence of stress signals owing to its binding with parc ubiquitin ligase. excess synthesis or inhibition of parc modulates the cytoplasmic localization of p [ ] . the hiv transactivator tat, the vascular growth factor angiogenin, and the interferon dependent tran scription factor ifi are retained in the nucleus via nls dependent binding with nuclear or nucleolar components. angiogenin is small enough to enter the nucleus via passive diffusion, and its nls does not interact with importins, but binds with nuclear pro teins. as a result, angiogenin is retained in the nucleus, and its backward diffusion into the cytoplasm is pre vented [ ] . such anchorage is due to nls phospho rylation in some cases. for instance, affinity of the ifi nls for nuclear proteins increases as a result of phosphorylation by the ck kinase. in contrast to these two proteins, tat has an nls that has affinity for both nuclear and cytoplasmic anchoring proteins, and its affinity depends on protein modification [ ] . different members of the importin family, espe cially importins α, have affinity for different groups of transport targets in higher eukaryotic cells. moreover, different transport complexes, for instance, β katenins or stat family proteins, pass through the npc via different pathways, binding exclusively with the fg repeats of a certain set of nup proteins [ ] . thus, the presence or absence of a particular nup or karyopherin can determine whether a certain protein is transferred through the nuclear pore. a difference in the tissue distribution of transport proteins affects the efficiency of nuclear transport of the same proteins. an example is provided by the drosophila melanogaster heat shock protein dhsf, which is transferred into the nucleus by importin α . importin α is absent in early embryo development, and, consequently, dhsf does not enter the nucleus [ ] . differences in the tissue distribution of importins α are especially clear in higher eukaryotes. for instance, importin α accounts for more than % of total pro tein in human striped muscle cells (i.e., its content is fold higher than that of importin α ) and is almost absent in heart, spleen, and kidney cells [ ] . in contrast, importin α is abundant in the heart, testis, skeletal muscle, and ovary, while importins α and α occur at a high content in the ovary and brain. the levels of mrna expression in one tissue greatly vary among different importins α; however, a high content of impor tin α is only characteristic of ovarian cells [ ] . competition between different karyopherins β for binding sites on the npc surface may also be subject to regulation. the contents and composition of karyo pherins β and their targets change during the cell life, affecting the efficiency of nucleocytoplasmic trans port. a higher content of a particular karyopherin β increases the transport efficiency of its targets [ ] , the saturation point differing between individual karyo pherins β. since different transport receptors are capable of interacting with the same sites on the npc surface, an excess of one karyopherin β may inhibit the transport of proteins associated with another karyo pherins. for instance, the import in cultured cells is approximately tenfold lower than in systems in vitro, possibly, because artificially reconstructed systems lack the majority of competing karyopherins β [ ] . the regulation of nucleocytoplasmic transport pathways changes in certain diseases. overproduction of karyopherin β and karyopherin α family proteins was observed in several colorectal, breast, and lung cancers. deregulation of karyopherin α , which is frequent in melanoma and breast cancer cells, corre lates with a low survival. it is still unclear which karyo pherin α dependent proteins are redistributed to determine the high malignant potential of these tumors. a possible cause is that karyopherin β is sequestered by overproduced truncated karyopherin α to alter the karyopherin α dependent import of p into the nucleus [ ] . the antiviral response is similarly altered in cells infected with the ebola virus or the avian influenza virus (sars cov) [ ] . hodgkin's lymphomas are characterized by excess phosphorylation and degradation of i κb, which results in an extremely high nuclear content of nf κb p because the intermolecular masking of its nls is deregulated [ ] . gametogenesis is well understood and is known to depend, to a substantial extent, on the nonuniform expression of various importin α genes. each of the three d. melanogaster importins α has its own expres sion pattern during spermatogenesis. importin α is syn thesized predominantly in the postmeiotic phase, reaching its maximum during spermatid elongation. the level of importin α remains low throughout the mitotic phase of spermatid development and is com pletely suppressed at spermatid elongation. importin α is actively synthesized in spermatogonia during the first four mitoses and during the two subsequent meioses. flies with a mutant importin α (imp α d ) have a low fertility. an elevated level of importins α and α in transgenic flies is capable of restoring the male fer tility, indicating that these importins may functionally substitute importin α to a certain extent during sper matogenesis. however, the fertility of imp α d mutant females is not restored at higher levels of other importins, suggesting a key role in oogenesis for importin α [ ] . Сaenorhabditis elegans importins are synthesized differentially in different cells. importins α and α occur mostly in germline cells, while importin α is found in somatic cells as well. suppression of importin α synthesis via rna interference blocks meiosis in pachytene. it seems that importins α and α are incapable of compensating for lack of importin α , indicating that importin α is essential for successful meiosis. inhibition of importin α leads to aneup loidy, an improper chromatin organization during cell division, and an incomplete restoration of the nuclear membrane after meiosis [ ] . in lower eukaryotes, different importins are critical at different stages of gametogenesis, indicating that the proper organization of nuclear transport is impor tant for meiosis. an essential developmental role of crm depen dent nuclear export was demonstrated in experiments with leptomycin b, which acts as a specific inhibitor of crm . when nondifferentiated gonad explants from female mouse embryos were treated with leptomycin b, the sox chromatin remodeling factor was redis tributed into the nucleus, as characteristic of nondif ferentiated male gonads [ ] . the sox content in the nucleus increased, which was apparently related to a role of crm in the export of sox into the cytoplasm. while crm and importins β and β occur in all cells in drosophila, exportin dcas, which is responsi ble for a recycling of importin α from the nucleus into the cytoplasm, is differentially expressed in different tissues. importin α synthesis is almost absent at the mid blastodermal stage. its level increases at subse quent stages, especially in embryonic nervous cells. mutations of dcas result in either a lethal phenotype or, in the case of hypomorphic dcas, developmental defects of the nervous system. such developmental defects are possibly caused by a lack of cytoplasmic importin α and a consequent decrease in the con centration of the notch regulated su (h) protein in the nucleus [ ] . nup proteins differ in affinity for importins and exportins, and, consequently, changes in the levels of certain nup proteins may affect the efficiency of nuclear transport. lack of nup b alters the nuclear import of all proteins possessing the classical nls, except for the spliceosomal factor u a [ ] . nup bs , which is functionally associated with the ranbp /nup complex, is synthesized exclu sively in spermatids [ ] . nup bs is capable of interacting with the af chromatin remodeling fac tor, which is contained in postmeiotic cells. factors similar to af may enter the nucleus via direct inter actions with certain nup proteins, as is the case with β catenins and smad family proteins. thus, nup bs possibly provides a specific binding site for af in haploid sperms [ ] . the npc, a dynamically changing macromolecu lar complex, can mediate both passive transport and active transport of specific complexes. some nup pro teins act as active components of a transport pathway. during mitosis, the npc presumably dissociates into several subcomplexes differing in nup composi tion. a functional npc is again reassembled from these subcomplexes in telophase [ ] . in lower eukaryotes, whose mitosis proceeds with out a disassembly of the nuclear membrane, certain nup proteins (gle /rae and nup in aspergillus) are separated from the npc to improve its permeabil ity [ ] . in saccharomyces cerevisiae, the npc is not disas sembled in mitosis even partly, but nup , which is bound with nup in the interphase, is phosphory lated and transferred onto nic . as a result, a high affinity binding site for karyopherin is opened in nup , and karypherin dependent nuclear trans port is suppressed [ ] . during mitosis, several nup proteins (nup - , nup , ranbp , and gle /rae -nup ) bind with the kinetochores and play a role in regulating the interactions between the kinetochores and microtu bules, which are also necessary for the npc formation after mitosis [ ] . as was revealed in experiments with xenopus, gle /rae induces the assembly of microtubules into a mitotic spindle. according to other data, gle /rae -nup inhibits the anaphase promoting complex (apc) to delay anaphase [ ] . dynamic changes in the npc composition affect the transmission of intracellular signals. the regula tion of nuclear transport is usually modulated by changes in the karyopherin interactions with target proteins. in particular, conformational changes in a protein or posttranslational modification of sites in the vicinity of or within the nls (or nes) may mask these signals. however, npcs are possibly capable of serving several independent transport pathways at the same time [ ] . if so, structural and functional changes in the npc may provide for a fine regulation of the transmission of certain signals. this hypothesis is supported by the results of studying the independent changes in nuclear import of various proteins (fig. ) . the npc contains approximately fg domains with several thousands of fg repeats [ ] . the fg repeats harbor various binding sites for transport pro teins. the multiplicity of fg associated transport posttranscriptional modification mrna splicing, rnp maturation, trna maturation nuclear retention due to the signal sequences of the ' utr maturation of ribosomal subunits intra and intermolecular interactions nls or nes masking via homo and heterodimerization n p c t r a n s p o r t r e c e p t o r s c a r g o fig. . nucleocytoplasmic transport may be regulated at the levels of npc components, transport receptors, or individual car goes. the regulations of higher hierarchic levels exert broader and less specific effects on transport processes. pathways is evident from the fact that only some of the nuclear import pathways are altered in yeast cells with functionally defective fg nup nsp s [ ] . transport pathways that involve different karyo pherins need different sets of fg domains [ ] . this finding well correlates with in vitro data. therefore, different pathways of transport through the npc have a common mechanism, but utilize different karyo pherin sites and may be regulated independently. experimental studies on amphibian oocyte nuclei: . investigation of the structure of the nuclear membrane by means of the electron microscope the ultrastructure of the nuclear envelope of amphibian oocytes: a reinves tigation: . the immature oocyte and dynamic aspects the nuclear pore com plex: nucleocytoplasmic transport and beyond snapshots of nuclear pore complexes in action captured by cryo electron tomography yeast nuclear pore complexes have a cytoplasmic ring and internal fila ments transport between the cell nucleus and the cytoplasm cell cycle dependent dynamics of nuclear pores: pore free islands and lamins the nuclear pore complex: oily spa ghetti or gummy bear? nucleocytoplasmic trans port: taking an inventory mechanisms of receptor mediated nuclear import and nuclear export regulating access to the genome: nucleocytoplasmic transport throughout the cell cycle karyo pherins: from nuclear transport mediators to nuclear function regulators importin alpha: a multipurpose nuclear transport receptor crystallographic analysis of the recognition of a nuclear localization signal by the nuclear import factor karyopherin alpha autoinhibition by an internal nuclear localization signal revealed by the crystal structure of mammalian importin alpha nuclear import and the evolution of a multi functional rna binding protein the hiv rev activation domain is a nuclear export signal that accesses an export path way used by specific cellular rnas nuclear export of ribosomal subunits the karyopherin kap p/msn p mediates nuclear import and nuclear export of different cargo proteins regulation of nuclear localization: a key to a door export of importin alpha from the nucleus is mediated by a specific nuclear transport factor microrna precursors in motion: exportin mediates their nuclear export karyopherins and nuclear import molecular basis for the recognition of a nonclas sical nuclear localization signal by importin beta conformational variability of nucleo cytoplasmic transport factors structure of the nuclear transport complex karyopherin beta ran × gpp nhp structural basis for the assembly of a nuclear export complex architecture of crm /exportin sug gests how cooperativity is achieved during formation of a nuclear export complex structural basis for the interaction between fxfg nucleoporin repeats and importin beta in nuclear trafficking importin beta contains a cooh terminal nucleoporin binding region important for nuclear transport gradient of increasing affinity of importin beta for nucleoporins along the pathway of nuclear import charac terization of ran driven cargo transport and the rangtpase system by kinetic measurements and computer simulation structural basis for the interaction between ntf and nucleoporin fxfg repeats structural basis for guanine nucleotide exchange on ran by the regulator of chromosome condensation (rcc ) a mechanism of coupling rcc mobility to rangtp production on the chromatin in vivo visualization of a ran gtp gradient in interphase and mitotic xenopus egg extracts co activation of rangtpase and inhibition of gtp dissociation by ran gtp binding protein ranbp the asymmetric distribution of the constituents of the ran system is essential for transport into and out of the nucleus nuclear import of the ran exchange factor, rcc , is mediated by at least two distinct mechanisms the nucleoporin ranbp has sumo e ligase activity npap /nup is a tri stable switch that stimulates importin alpha:beta mediated nuclear protein import ran binding protein is a cofactor for crm mediated nuclear protein export the yeast nuclear pore com plex: composition, architecture, and transport mechanism nup is a mobile nucleoporin with tran scription dependent dynamics association of nuclear pore fg repeat domains to ntf import and export complexes structural basis for the high affinity binding of nucleoporin nup p to the saccharomyces cerevisiae importin beta homologue disorder in the nuclear pore complex: the fg repeat regions of nucleoporins are natively unfolded rapid evolution exposes the boundaries of domain structure and func tion in natively unfolded fg nucleoporins from the trap to the basket: getting to the bottom of the nuclear pore complex nuclear pore complex structure and dynamics revealed by cryoelectron tomography nucle oporin domain topology is linked to the transport sta tus of the nuclear pore complex the crystal structure of mouse nup reveals atypical rnp motifs and novel homodimerization of the rrm domain pushing the envelope: structure, function, and dynamics of the nuclear periphery dynamic nuclear pore complexes: life on the edge the mobile nucleoporin nup p and chromatin bound prp p function in endogenous npc mediated transcrip tional control con served histone variant h a.z protects euchromatin from the ectopic spread of silent heterochromatin nup pi: the nucle opore promoter interaction of genes in yeast unconventional tethering of ulp to the transport channel of the nuclear pore complex by karyopherins enzymes of the sumo modification pathway localize to filaments of the nuclear pore complex the dead box protein dbp p is required to dissociate mex p from exported mrnps at the nuclear rim nuclear target ing signal recognition: a key control point in nuclear transport? regulation of nuclear transport: central role in development and transfor mation? pro cessing of the precursor of nf kappa b by the hiv protease during acute infection regulation of the yeast yap p nuclear export signal is mediated by redox sig nal induced reversible disulfide bond formation nf at activation requires suppression of crm dependent export by cal cineurin con struction of chimeric tumor suppressor p resistant to the dominant negative interaction with p mutants a leucine rich nuclear export signal in the p tetramerization domain: regulation of subcellular localization and p activity by nes masking identification and characterization of a ligand regulated nuclear export signal in androgen receptor inhibition of nuclear import mediated by the rev arginine rich motif by rna molecules the c ter minal nuclear localization signal of the sex determin ing region y (sry) high mobility group domain medi ates nuclear import through importin beta a sox defect of calm odulin dependent nuclear import in campomelic dys plasia/autosomal sex reversal eif e is a central node of an rna regulon that governs cellular proliferation the expression of migration stimulating factor, a potent oncofetal cytok ine, is uniquely controlled by ' untranslated region dependent nuclear sequestration of its precursor mes senger rna a hex anucleotide element directs microrna nuclear import exporting rna from the nucleus to the cytoplasm the protein kinase ck site (ser / ) enhances recognition of the simian virus large t antigen nuclear localiza tion sequence by importin roles of phosphoryla tion sites in regulating activity of the transcription fac tor pho ubiquitination regulates pten nuclear import and tumor suppression ubiquitin charging of human class iii ubiq uitin conjugating enzymes triggers their nuclear import parc: a cytoplasmic anchor for p novel properties of the nucleolar targeting signal of human angiogenin the hiv tat nuclear localization sequence confers novel nuclear import properties nucleocytoplasmic shuttling by nucleoporins nup and nup and crm dependent nuclear export control the subcellular dis tribution of latent stat develop mental regulation of the heat shock response by nuclear transport factor karyopherin alpha . develop ment cloning and characterization of hsrp gamma, a tissue specific nuclear transport factor evidence for distinct substrate specificities of importin alpha family members in nuclear protein import sim ple kinetic relationships and nonspecific competition govern nuclear import rates in vivo. in vivo truncated form of importin alpha identified in breast cancer cell inhibits nuclear import of p severe acute respiratory syndrome coronavirus orf antagonizes stat function by sequestering nuclear import factors on the rough endoplasmic reticulum/golgi membrane nuclear transport and cancer: from mechanism to intervention drosophila melanogaster importin alpha and alpha can replace importin alpha during spermatogenesis but not oogenesis a role for caenorhabditis elegans importin ima in germ line and embryonic mitosis a nuclear export signal within the high mobility group domain regulates the nucleocytoplasmic translocation of sox during sexual determination dcas is required for importin alpha nuclear export and mechano sensory organ cell fate specification in drosophila disruption of the fg nucleoporin nup causes selective changes in nuclear pore complex stoichiometry and function characterization and potential function of a novel testis specific nucle oporin bs expression pattern and cellular distribution of the murine homo logue of af partial nuclear pore complex disassembly during closed mitosis in aspergillus nidulans cell cycle regulated transport controlled by alterations in the nuclear pore complex the nuclear pore comes to the fore the rae nup complex prevents aneuploidy by inhibiting securin degradation nuclear trans port of single molecules: dwell times at the nuclear pore complex pores for thought: nuclear pore complex proteins analysis of nucleo cytoplasmic trans port in a thermosensitive mutant of nuclear pore pro tein nsp minimal nuclear pore complexes define fg repeat domains essential for transport key: cord- -yy bennu authors: penna, fabio; ballarò, riccardo; beltrá, marc; de lucia, serena; costelli, paola title: modulating metabolism to improve cancer-induced muscle wasting date: - - journal: oxid med cell longev doi: . / / sha: doc_id: cord_uid: yy bennu muscle wasting is one of the main features of cancer cachexia, a multifactorial syndrome frequently occurring in oncologic patients. the onset of cachexia is associated with reduced tolerance and response to antineoplastic treatments, eventually leading to clinical conditions that are not compatible with survival. among the mechanisms underlying cachexia, protein and energy dysmetabolism play a major role. in this regard, several potential treatments have been proposed, mainly on the basis of promising results obtained in preclinical models. however, at present, no treatment yet reached validation to be used in the clinical practice, although several drugs are currently tested in clinical trials for their ability to improve muscle metabolism in cancer patients. along this line, the results obtained in both experimental and clinical studies clearly show that cachexia can be effectively approached by a multidirectional strategy targeting nutrition, inflammation, catabolism, and inactivity at the same time. in the present study, approaches aimed to modulate muscle metabolism in cachexia will be reviewed. cancer-induced muscle wasting is one of the hallmarks of cachexia, a multifactorial syndrome that represents one of the most important comorbidities in oncologic patients. the occurrence of cachexia markedly complicates the management of cancer patients, negatively impinging on the tolerance and response to antineoplastic treatments, worsening the quality of life, and reducing survival. in particular, about % of deaths by cancer are due to cachexia, rather than to the tumor itself [ ] . few years ago, a classification of cachexia was proposed, defining three different stages: precachexia, cachexia, and refractory cachexia [ ] . prognosis progressively worsens going from patients with precachexia to those with refractory cachexia. in this regard, the earlier anticachexia treatments are set up, the better. for this reason, the research on cachexia is focused on two main goals: (i) to find out biomarkers useful to the early identification of a condition of still latent cachexia and (ii) to define treatment protocols useful to delay the progression from precachexia to refractory cachexia. skeletal muscle wasting in cancer patients has a good prognostic value, being predictor of reduced tolerance to chemotherapy and/or surgery, decreased ability to perform daily activities, and shortened survival. in addition, recent data report that loss of muscle mass negatively affects quality of life in cancer patients [ , ] ; such correlation might occur irrespectively of survival rates [ ] . being poor quality of life one of the most prominent and invalidating consequences of cancer cachexia, to investigate the mechanisms underlying cancer-induced muscle wasting is even more relevant to design therapeutic strategies that also take into account patient well-being. the possibility to underestimate the occurrence of muscle mass depletion exists, since the first approach to clinically evaluate a patient is to obtain information about body weight and body mass index (bmi). however, in face of no body weight loss and/or normal bmi, reduced muscle mass might well occur, being masked by fat or water content. another relevant point that frequently is poorly taken into consideration is that the loss of muscle mass is likely exacerbated by anticancer treatments. at present, different mechanisms have been proposed to lead to muscle wasting in cancer hosts, among which there are altered protein and energy metabolism and impaired myogenesis [ ] . several factors may contribute to these alterations, such as reduced calorie intake, hormonal unbalance, and systemic inflammation. cancer-driven production of proinflammatory cytokines plays a relevant role in tumor progression and markedly contributes to cachexia. indeed, in cancer patients, systemic inflammation correlates with increased resting energy expenditure and with reduced survival rate [ ] . along the same line, increased circulating levels of tumor necrosis factor α (tnfα), interleukin- (il- ), γ-interferon (inf), and, more recently, growth and differentiation factor (gdf ) have been reported in cachectic cancer patients [ , ] . the link existing between cytokines and cachexia has led to the inclusion of anti-inflammatory drugs in treatment protocols [ ] . the present review will focus on strategies able to modulate metabolism that may reveal useful to prevent/delay cancer-induced muscle wasting. altered protein turnover is a general feature of muscle wasting in cancer cachexia. in particular, protein breakdown rates are persistently increased, while protein synthesis rates can be reduced, unchanged, or even increased, depending on the model system [ ] . the different reaction kinetics that characterizes protein synthesis and degradation rates, zero and first order, respectively, implies that if degradation is higher than normal, the loss of total protein cannot be antagonized by simply increasing the rate of synthesis. taking this assumption for true, any anabolic approach should be associated with anticatabolic strategies in order to achieve a beneficial effect on muscle protein mass. . . protein degradation. several pieces of evidence suggest that the intracellular proteolytic systems in the skeletal muscle of cancer hosts are poised towards activation above the physiological levels ( figure ). particularly relevant in this regard are the pathways dependent on ubiquitinproteasome and autophagy. while the former mainly degrades short-lived and regulatory proteins, the latter gets rid of structural proteins and organelles [ ] . both experimental and human cancer cachexia were associated with increased activity of the ubiquitinproteasome pathway [ ] . of interest, alterations in molecular and biochemical markers of proteasome activation were observed in gastric cancer patients before any evidence of body weight loss, supporting the need to detect cachexia as early as possible [ ] . modulations of the ubiquitinproteasome proteolytic system, however, are not a general finding in cancer cachexia, as shown by studies reporting that it is not differently activated with respect to controls in the muscle of patients affected by non-small-cell lung cancer (nsclc; [ ] ) or esophageal cancer [ ] . the involvement of the autophagic-lysosomal proteolysis in muscle wasting was recognized just in the last fifteen years. two main reasons account for such delay: (i) autophagy was not considered as typically operated by the skeletal muscle as a response to stress conditions. such belief was definitely abandoned when autophagy was clearly demonstrated in fasted mice overexpressing green fluorescent protein- labeled lc [ ] . (ii) the study and detection of autophagy was not easy since the atg genes were not discovered [ ] . a number of studies reported that the autophagic system was overactivated, without reaching complete cargo degradation, in the muscle of both tumor-bearing animals and cancer patients [ , [ ] [ ] [ ] . in particular, despite autophagic flux was enhanced in mice bearing the c tumor, autophagosomes accumulated, likely due to exhaustion of the lysosomal compartment [ ] . a similar pattern could also be observed in cancer patients, as suggested by lc b-ii and p accumulation [ , ] . both proteasome and lysosomes, however, cannot directly degrade intact myofilaments. in this regard, a preliminary cleavage was proposed to be operated by other proteolytic systems, such as those dependent on caspases or calpains. these latter are ca + -dependent cysteine proteases, normally inactive and localized in the cytosolic compartment. when intracellular ca + concentrations increase, inactive calpains translocate to the cell membrane and become activated by autoproteolysis [ ] . the system also includes calpastatin, a physiological inhibitor, which is a substrate of active calpain itself. increased calpain expression was reported in the muscle of tumor-bearing animals [ ] , while rats transplanted with the yoshida ah- hepatoma showed a progressive reduction of calpastatin levels and increased in vitro cleavage of specific fluorogenic substrates [ ] . more recently, calpain activation was also demonstrated in mice bearing the c colon carcinoma [ ] . both increased or unchanged muscle calpain expression were reported in cancer patients [ , ] . several lines of evidence show that proinflammatory cytokines act as triggers, or at least as contributors, of cancer-induced protein hypercatabolism [ ] . briefly, data obtained in both experimental models and human pathology have demonstrated that cytokines such as tnfα and il- lead to reduced rates of protein synthesis paralleled by enhanced protein breakdown, both accounting for the loss of muscle mass [ ] . such effects depend, at least in part, on activation of the transcription factor nf-κb, as shown in both experimental and human cancer cachexia patients [ , ] . cancer-induced muscle wasting has also been associated with another proinflammatory cytokine, namely, tnf-like weak inducer of apoptosis (tweak) [ ] . the therapeutic approaches mainly pursued by researchers to counteract enhanced muscle protein breakdown have long been those specifically targeting the different proteolytic systems. the results, however, did not really clarify the issue. since the discovery of muscle-specific ubiquitin ligases, these were considered a good target to interfere with protein breakdown, being involved in determining both substratespecificity and proteasome degradation rate. among the enzymes belonging to this family, the first described were mafbx/atrogin- and murf /trim , respectively, involved in the degradation of structural proteins and of proteins contributing to cell proliferation, differentiation, and survival [ ] . subsequently, other members came out, such as trim and fbxo . more recently, fbxo / musa was shown to contribute to denervation-and fasting-mediated muscle loss [ ] , as well as to cancerinduced muscle wasting (unpublished data). genetic approaches specifically targeting these ubiquitin ligases proved effective in protecting the muscle against protein depletion [ ] ; however, at present, the use of these enzymes as therapeutic targets for muscle wasting is not validated yet. on the other side, the inhibition of proteasome activity by means of pharmacological inhibitors was effective just in few models of muscle atrophy but totally unable to protect tumor-bearing animals against muscle wasting [ ] . in contrast with these findings, few years ago, a study reported that inhibition of the ubiquitin-proteasome pathway by mg was able to improve experimental cancer cachexia [ ] . however, mg is a rather unspecific inhibitor, being able to block also calpains and autophagy [ , ] . finally, carfilzomib, an irreversible selective inhibitor of proteasome chymotrypsin-like activity, was shown to improve cachexia in tumor-bearing mice by inhibiting muscle protein breakdown [ ] . such improvement, however, was associated with reduced tumor burden, which could be the real mechanism underlying the beneficial effect of the treatment. several lines of evidence proposed that modulations of autophagy could be useful to improve cancer-associated muscle wasting. in this regard, muscle-specific gene strategies aimed at silencing beclin- , one of the proteins involved in autophagosome formation, showed that suppression of autophagy in the c hosts was unable to rescue myofiber diameter (unpublished data). in addition, pharmacological inhibition of autophagy in mice hosting the c tumor lead to death of the animals, suggesting that lysosomal degradation is mandatory to sustain the requirement of both energy and substrates in tumor hosts, at least when they are facing the terminal phase of cancer growth [ ] . the other way round, excessive stimulation of muscle autophagy, experimentally obtained by the overexpression of tp inp / dor, exacerbated muscle atrophy in tumor-bearing mice (unpublished data), while activation of autophagy by means of the mtor inhibitor rapamycin was shown to positively affect the skeletal muscle in the c hosts [ ] . such discrepancy might depend on the fact that while mtor inhibition affects stress-induced autophagy, tp inp hyperexpression targets basal autophagy. despite the literature report data supporting the involvement of ca + -dependent proteolysis in the pathogenesis of cancer-induced muscle wasting, protein hypercatabolism was not downregulated in preparations of isolated muscles obtained from tumor-bearing animals and incubated in the presence of calpain inhibitors [ , , ] . more recently, both pharmacological and genetic approaches aimed at inhibiting the ca + -dependent proteolytic system were not able to prevent or delay cancer-induced muscle wasting [ ] , although contrasting results were reported in this regard [ ] . while interfering with specific proteolytic systems does not seem to be an appropriate approach to prevent/delay cancer-induced muscle wasting, the modulation of bulk protein turnover appears more promising. in this regard, anti-inflammatory approaches revealed able to improve muscle protein turnover in tumor-bearing mice [ ] . more recently, administration of formoterol, a β -adrenergic agonist, to tumor-bearing animals revealed able to reverse muscle wasting [ ] . such an effect is mainly exerted by stimulating protein synthesis and inhibiting protein degradation rates. in particular, both the ubiquitin-proteasome and the autophagic-lysosomal proteolytic systems were downregulated in formoterol-receiving animals ( [ ] ; unpublished data). at present, only one study evaluated the effectiveness of formoterol, combined with megestrol acetate, in cachectic cancer patients [ ] . the results suggest that both muscle size and strength can be improved by the treatment, although more trials are needed to draw clear-cut evidence. . . protein synthesis. as reported above, depending on the situation, reduced, normal, or even increased muscle protein synthesis rates were shown in cancer cachexia. due to the rapid development of cachexia, tumor-bearing animals frequently show reduced protein synthesis rates, although this is not a general finding. indeed, rats bearing the ah- hepatoma, that usually die about days after tumor transplantation, showed muscle protein synthesis rates comparable to those of healthy animals [ ] . the situation is more complex when studying human pathology. reduced muscle protein synthesis was reported many years ago in patients affected by different types of tumors [ ] and more recently in prostate cancer patients [ ] . on the contrary, van dijk et al. [ ] reported that baseline protein synthesis rates were higher than control values in cachectic patients affected by pancreatic cancer. also, intermediate results are available in the literature: myofibrillar protein synthesis rates were analyzed in healthy people and weight-stable and weight-losing gastrointestinal cancer patients and found comparable among the different groups. similarly, no changes in whole body protein synthesis were reported in nsclc patients [ ] . the possibility to modulate protein synthesis in order to correct muscle atrophy or simply to provide an environment permissive for the maintenance of muscle mass was long studied. many approaches were tested, most of them consisting in nutritional strategies or in molecular modulations aimed at pushing muscle metabolism towards anabolism. most of these approaches revealed unsuccessful, giving rise to the idea that anabolic resistance occurs in cancer cachexia. just to provide few examples, conventional nutritional supplementation or the infusion with an amino acid cocktail did not stimulate muscle protein synthesis in advanced cancer patients [ ] . along this line, studies aimed at stimulating the anabolic pathway depending on igf- , by both pharmacological and genetic means, did not succeed in improving muscle wasting in tumor-bearing animals [ , ] . recently, however, patients not yet considered as refractory cachectic were proposed to display an anabolic window that could be exploited with nutritional interventions [ , , ] or with other anabolism-inducing strategies. as an example, patients with stage iii and iv nsclc showed a normal anabolic response to hyperaminoacidemia but not to isoaminoacidemia, suggesting that high substrate availability is relevant to induce anabolism in cancer hosts [ ] . consistently, muscle protein synthesis could be stimulated in advanced cancer patients by a high protein formula versus a conventional nutritional supplement [ , ] . these observations point out the possibility to overcome the anabolic resistance that occurs in cancer patients by providing specifically enriched nutritional supplements. stimulation of anabolism can be exerted by several means. particularly interesting in this regard is ghrelin, a mediator released by the stomach during fasting or caloric restriction. modulations of ghrelin levels exert remarkable effects on both energy and protein metabolism, such as the inhibition of autophagy in conditions characterized by systemic inflammation [ ] . the administration of ghrelin to tumor-bearing animals improved food intake, body weight, lean body mass, and chemotherapy-induced toxicity [ ] . both ghrelin and ghrelin analogues are currently being tested in clinical trials. among these, anamorelin was recently shown to improve lean body mass, total body mass, and hand grip strength in patients affected by nsclc [ ] . other studies, however, showed that anamorelin administration to cancer patients increased body weight and improved faact scores while did not enhance hand grip strength [ ] . acids. in addition to be necessary to synthesize proteins, free amino acids (faa) also act as regulators of protein metabolism. in particular, plasma faa, that even represent a small fraction of the total amino acid pool, are the main source of metabolically active nitrogen compounds. among faa, the essential amino acids were reported to stimulate protein synthesis and to inhibit protein degradation. such a role is mainly played by the three branched chain amino acids (bcaa), leucine in particular. alterations of amino acid metabolism are frequent features in cancer-induced muscle wasting. reduced amino acid uptake was generally observed in cancer patients, mainly due to the occurrence of anorexia, which also leads to decreased insulin secretion. both decreased amino acid availability and insulin levels inhibit the anabolic pathway dependent on mtor, resulting in downregulation of protein synthesis rates and stimulating protein degradation. inhibition of mtor signaling in cancer cachexia is enforced by proinflammatory cytokines [ ] . reduced amino acid uptake in the muscle was also reported to derive from altered amino acid transport. indeed, in the soleus muscle of tumor-bearing rats, the activity of system a was decreased, while no changes were observed for systems l and asc [ ] . of interest, tnfα was shown to impair amino acid transport in tumor-bearing rats [ ] . plasma glutamine levels were shown to be significantly reduced in tumor-bearing rats with respect to healthy animals [ ] . of interest, reduced glutamine availability could activate the metabolic sensor adenosine monophosphateactivated protein kinase (ampk, see below; [ ] ). finally, leucine oxidation was markedly increased in the muscle of rats bearing the yoshida ah- hepatoma [ ] . consistently, enhanced activity of the bcaa dehydrogenase was reported in rats bearing the walker carcinoma [ ] . several studies have proposed amino acid supplementation as a mean to improve cancer-induced muscle wasting. in experimental models of cancer cachexia, bcaa were shown to attenuate the loss of muscle mass. the underlying mechanisms of such effect are not clear, although downregulation of protein degradation and stimulation of protein synthesis were hypothesized [ ] . more recently, metabolomic alterations were proposed to be the basis of the positive effects exerted by a leucine-rich diet on cachexia in rats bearing the walker carcinoma, in the absence of effects on tumor mass [ ] . as for clinical studies, bcaa were proposed to improve anorexia [ ] , thus removing, partially at least, one of the mechanisms accounting for reduced amino acid uptake. other studies supported a beneficial role of bcaas on muscle protein metabolism, although this should be confirmed by larger randomized, blind, placebocontrolled trials [ ] . beta-hydroxy-beta-methylbutyrate (hmb), a metabolite of leucine, was shown to improve muscle wasting in experimental cancer cachexia, mainly by inhibiting protein degradation rather than stimulating protein synthesis [ ] . recently, hmb was proposed to be more effective than leucine in preventing body weight loss in tumor-bearing animals [ ] . such effect, however, could depend on the model system chosen, since hmb does not appear able to modulate muscle mass in mice bearing the c tumor (costelli et al., unpublished observations). the situation is even more confused in human cachexia. increase of both hemoglobin levels and fat-free mass were reported in advanced cancer patients administered a nutritional supplement containing hmb, arginine, and glutamine [ , ] . another study, however, was not able to demonstrate a beneficial effect of the same supplement in cancer patients [ ] , suggesting that the effectiveness of hmb in the clinical practice is still unclear and deserves further investigation. glutamine supplementation was reported to attenuate muscle protein wasting in cancer patients [ ] , as well as to improve the energy balance in rats bearing the walker tumor [ ] . finally, promising data are available about the possibility to treat cancer hosts with l-carnitine, an amino acid derivative that plays a role in fatty acid metabolism and energy production [ ] . a negative energy balance, generally resulting from both reduced production and increased expenditure, is a frequent occurrence in cancer patients. while resting energy expenditure (ree) frequently increases, likely due to enhanced thermogenesis, the occurrence of reduced physical activity, particularly in advanced cancer patients, paradoxically leads to a net decrease of total energy expenditure. the increased thermogenesis is consistent with the observation that in cachectic tumor-bearing animals, the expression of the brown adipose tissue-(bat-) specific uncoupling protein (ucp ) is higher than in controls, while ucp (ubiquitous) and ucp (expressed in bat and muscle) levels increase in the skeletal muscle only [ ] . similarly, muscle ucp mrna levels were higher in weight-losing than in non-weight-losing cancer patients or controls [ ] . the increase of ree in cancer cachexia is not a new observation; however, just recently, the underlying mechanisms start to be clarified. a central point in this regard is played by muscle mitochondria compartment, which is markedly affected in tumor hosts. indeed, both morphological [ , ] and functional alterations [ , ] were reported in experimental tumor-bearing animals. in particular, mitochondrial uncoupling and reduced oxidative capacity were associated with myofiber shift from oxidative to glycolytic fibers [ ] . impairment of the mitochondrial compartment results in decreased atp production, leading to an energy deficit that becomes even worse since it is coupled with steadily increased ree. consistently, reduced atp levels and increased activity of the energy sensor ampk were shown in the muscle of tumor-bearing animals [ , ] . the lack of an appropriate energy supply results in compromised cell function and reduced contractile force generation, leading to loss of muscle mass and strength. several factors can lead to mitochondrial alterations in the skeletal muscle. among these, proinflammatory cytokines play a major role. indeed, the activation of nf-κb induced by tnfα was reported to reduce both muscle oxidative capacity and the expression of factors regulating mitochondrial biogenesis. similar observations were reported when other inflammation-driven pathways such as the il- /stat or tgfβ/smad are activated above physiological levels [ ] . in addition to proinflammatory mediators, also oxidative stress, due to reactive oxygen and nitrogen species levels exceeding the compensative capacity of the intracellular antioxidant systems, contributes to mitochondrial function impairment. in this regard, there are several studies sustaining the involvement of oxidative stress in cancerinduced muscle wasting, although a clear-cut causative evidence is still lacking [ ] (ballarò et al., unpublished data; figure ). being mitochondria crucial to the maintenance of muscle oxidative metabolism, emergency routes can be activated in order to avoid mitochondrial dysfunction. in particular, mitochondrial biogenesis and dynamics as well as the disposal of damaged organelles, mainly by mitophagy, are promoted ( figure ). the other way round, impaired function of the emergency routes themselves could trigger the accumulation of altered mitochondria resulting in reduced muscle oxidative metabolism. in this regard, the expression of the peroxisome proliferator-activated receptor-γ (ppar-γ) coactivator- α (pgc- α), the master regulator of mitochondrial biogenesis and oxidative metabolism, was shown to be reduced in the skeletal muscle of tumorbearing mice [ ] , although this is not a constant finding [ , ] . mitochondria dynamics, representing the balance between fission and fusion processes, was shown to be altered in both experimental cancer cachexia and in cancer patients [ , ] . in this regard, impaired mitochondrial dynamics could drive the hyperactivation of muscle protein breakdown, likely through pathways depending on ampk and foxo, eventually leading to muscle wasting [ , ] . autophagy (mitophagy) is the main mechanism responsible for disposal of altered mitochondria. also, mitophagy was reported to be impaired in cancer cachexia, as shown by the observation that bnip l and parkin mrna increased in the muscle of cancer patients [ ] . similarly, bnip l protein levels were increased in the muscle of mice bearing the lewis lung carcinoma [ ] . on the whole, these observations suggest that in addition to mitochondria biogenesis and dynamics, also their disposal is perturbed in the skeletal muscle of tumor hosts, thus contributing to mitochondrial dysfunction and reduced muscle oxidative metabolism. several strategies were proposed to improve energy metabolism by acting on mitochondria. the first and perhaps simplest one, in theory at least, is exercise training, in particular a combination of both resistance and endurance exercise. these two types of training affect different but complementary targets, being able to improve force production and metabolic adaptations, respectively. of particular relevance is the observation that endurance training was reported to increase the number of mitochondria and to drive myofiber-type shift from glycolytic to oxidative, thus specifically targeting alterations that characteristically occur in the skeletal muscle of tumor hosts. however, these potentially favorable effects can be exploited just systematically practicing exercise, even if at a moderate level. this may not be an easy task in cancer patients that frequently present with chronic fatigue and comorbidities eventually leading to exercise intolerance. this point is supported by the observation that c -bearing mice did not benefit from exercise training, suggesting that the effort to exercise in already compromised animals was damaging rather than protective [ ] . consistently, excessive endurance exercise was associated with increased mitochondrial fission in the absence of mitophagy induction [ ] . in the last years, the possibility to mimic the effects of exercise by drugs has been gaining a growing consensus. the positive side is that this strategy will allow to overcome both poor patient compliance to exercise training and possible occurrence of exercise intolerance. the negative part is that generally exercise mimicking drugs do not totally recapitulate the effects of exercise itself. in this sense, these drugs do not properly hit the target; however, they could be a good compromise when exercise training cannot be proposed to the patient. at present, several options were investigated as exercisemimicking strategies. while most of them are pharmacological, also a genetic approach was proposed. this latter consists in manipulations able to increase the levels of pgc- α in the skeletal muscle. in this regard, improved exercise capacity and oxidative metabolism were reported in mice specifically overexpressing this factor in the muscle, resembling the phenotype induced by endurance training [ ] . pgc- α overexpression was shown to interfere with muscle atrophy induced by activation of the tweak-fn pathway [ ] and to improve cancer-induced muscle wasting in tumor-bearing mice [ , ] , although contrasting data were previously reported [ ] . several classes of drugs were proposed to modulate energy metabolism, among which are activators of ampk, sirtuin (sirt ), and trimetazidine (tmz). different compounds such as resveratrol, metformin, quercetin, and aicar can activate ampk [ ] . in this regard, aicar administration was shown to impinge on exercise capacity, oxygen consumption, and fatty acid oxidation [ ] . muscle atrophy induced by angiotensin ii was prevented by treatment with aicar. this drug also revealed able to activate autophagy and to improve muscle phenotype in both dystrophic mdx mice and animals bearing the c tumor [ , ] . metformin administration was shown to improve sarcopenia of aging and muscle wasting in severely burned patients [ , ] and was proposed to be useful to treat muscle wasting in cancer cachexia [ ] . ampk activation can also be induced by resveratrol, as demonstrated by the observation that ampkα -or ampkα -deficient mice were refractory to resveratrol-induced increase of both mitochondrial biogenesis and endurance exercise capacity [ ] . consistently, obese men receiving resveratrol showed improved inflammation, ampk activation, and increased expression of pgc- α and sirt protein levels [ ] . finally, an ampk-stabilizing peptide was reported to improve white adipose tissue wasting in tumor hosts [ ] . sirt belongs to a class of deacetylases deregulated in aging and in different chronic diseases, including cancer. sirt is also involved in the regulation of energy homeostasis; its expression is induced in response to caloric restriction [ ] and can be activated in the skeletal muscle by ampk [ ] . specific overexpression of sirt in the muscle resulted in a fast-to-slow myofiber type transition, producing an oxidative phenotype. consistently, muscle-specific sirt transgenic mice exposed to fasting or denervation showed a reduced expression of atrogenes in comparison with wildtype littermates [ ] . finally, improved muscle phenotype was reported in mdx/sirt double transgenic mice [ ] . in addition to ampk, resveratrol also activates sirt . in this regard, part of the above-described effects exerted by resveratrol derive from sirt -dependent modulations of pgc- α acetylation state [ ] . synthetic selective sirt activators such as srt are also available [ , ] . plasma lipid profile and insulin sensitivity were improved in healthy volunteers by administration of srt [ , ] . very few studies are actually available about srt action on both muscle mass and function. in this regard, srt appeared to reduce the depletion of muscle mass due to fasting or inactivity [ ] , at least in part by increasing pgc- α expression [ ] . tmz is a metabolic modulator that blocks fatty acid oxidation, shifting atp production to glucose oxidation and improving cell energy metabolism. indeed, atp synthesis through fatty acid β-oxidation requires more oxygen than glucose oxidation [ ] . along this line, the shift to glucose oxidation improves the use of the available oxygen, possibly increasing metabolic efficiency and skeletal muscle function. tmz was shown to increase the size of cultured myotubes [ ] and to improve both heart metabolism and exercise capacity in patients suffering from chronic stable angina [ ] . when administered to aged animals, tmz resulted in increased muscle strength [ ] . finally, treatment of c -bearing mice with tmz resembled some of the benefits triggered by exercise, among which fast-to-slow myofiber phenotype shift, pgc- α upregulation, oxidative metabolism enhancement, and grip strength increase (molinari et al., unpublished). the relevance of fatty acid oxidation to cachexia is also supported by a recent study showing that several tumor cell lines are able to release proinflammatory mediators, resulting in enhanced fatty acid oxidation and activation of p -dependent signaling in the skeletal muscle, well before tissue wasting occurs. in addition, the same study also showed that treatment of tumor-bearing animals with etomoxir, an inhibitor of fatty acid oxidation, rescued both muscle mass and body weight [ ] . a complex network of metabolic alterations sustained by hypercatabolism, energy deficit, and systemic inflammation is the milieu underlying cancer cachexia. while becoming overtly detectable in advanced cancer patients, such perturbations likely take place very early in the course of the disease, at least at the molecular level. protein and energy dysmetabolism in cachexia are quite well recognized; however, the available therapeutic strategies, although frequently promising from the preclinical point of view, have not yet reached validation to be used in the clinical practice. several drugs identified by experimental studies are currently tested in clinical trials for their ability to improve muscle metabolism in cancer patients. other emerging strategies are those aimed at interfering with the intestinal microbiota, previously reported to improve cachexia in a preclinical model [ ] . the available results of both experimental and clinical studies, however, have clearly indicated that single-targeted therapies will hardly be successful in the treatment of cachexia. in this regard, the view of a multidirectional approach, selectively tailored and, whenever possible, personalized, is gaining a growing consensus. such an approach should not just rely on nutritional counseling and pharmacologic treatment with anti-inflammatory and anticatabolic drugs but also include exercise training and/or exercisemimicking agents. in this regard, exercise mimetics could not only merely replace exercise training in depleted patients but also improve exercise tolerance and effectiveness in precachectic individuals, thus amplifying the beneficial action of exercise itself. last but not least, treatments aimed at preventing/correcting the metabolic alterations underlying cancer-induced muscle wasting might also impinge on tumor-targeted therapies improving their effectiveness and/ or enhancing patient tolerance to chemotherapy. in addition, metabolic modulators could also directly affect tumor growth. this is the case, for example, of exercise, that was shown to prevent or at least delay tumor growth [ ] . cancer cachexia: understanding the molecular basis definition and classification of cancer cachexia: an international consensus muscle mass and association to quality of life in non-small cell lung cancer patients sarcopenia is associated with quality of life and depression in patients with advanced cancer anamorelin for advanced non-small-cell lung cancer with cachexia: systematic review and meta-analysis definition of cancer cachexia: effect of weight loss, reduced food intake, and systemic inflammation on functional status and prognosis targeting obesity and cachexia: identification of the gfral receptor-mic- /gdf pathway novel targeted therapies for cancer cachexia coming back: autophagy in cachexia increased muscle proteasome activity correlates with disease severity in gastric cancer patients precachexia in patients with stages i-iii non-small cell lung cancer: systemic inflammation and functional impairment without activation of skeletal muscle ubiquitin proteasome system autophagic-lysosomal pathway is the main proteolytic system modified in the skeletal muscle of esophageal cancer patients in vivo analysis of autophagy in response to nutrient starvation using transgenic mice expressing a fluorescent autophagosome marker guidelines for the use and interpretation of assays for monitoring autophagy autophagic degradation contributes to muscle wasting in cancer cachexia aerobic exercise and pharmacological treatments counteract cachexia by modulating autophagy in colon cancer autophagy is induced in the skeletal muscle of cachectic cancer patients the calpain system muscle wasting associated with cancer cachexia is linked to an important activation of the atpdependent ubiquitin-mediated proteolysis anticytokine treatment prevents the increase in the activity of atpubiquitin-and ca + -dependent proteolytic systems in the muscle of tumour-bearing rats interference with ca + -dependent proteolysis does not alter the course of muscle wasting in experimental cancer cachexia calpain activity is increased in skeletal muscle from gastric cancer patients with no or minimal weight loss role of inflammation in muscle homeostasis and myogenesis anti-cytokine strategies for the treatment of cancer-related anorexia and cachexia nf-κb-mediated pax dysregulation in the muscle microenvironment promotes cancer cachexia nuclear transcription factor κb activation and protein turnover adaptations in skeletal muscle of patients with progressive stages of lung cancer cachexia targeting of fn prevents cancer-induced cachexia and prolongs survival bmp signaling controls muscle mass the role of e ubiquitin-ligases murf- and mafbx in loss of skeletal muscle mass effect of the specific proteasome inhibitor bortezomib on cancer-related muscle wasting mg -mediated inhibition of the ubiquitin-proteasome pathway ameliorates cancer cachexia severe acute respiratory syndrome coronavirus replication is severely impaired by mg due to proteasome-independent inhibition of m-calpain activation of the atp-ubiquitin-proteasome pathway in skeletal muscle of cachectic rats bearing a hepatoma increased atpubiquitin-dependent proteolysis in skeletal muscles of tumor-bearing rats calpain inhibitors ameliorate muscle wasting in a cachectic mouse model bearing ct colorectal adenocarcinoma anticachectic effects of formoterol phase i/ii trial of formoterol fumarate combined with megestrol acetate in cachectic patients with advanced malignancy tumor necrosis factor-alpha mediates changes in tissue protein turnover in a rat cancer cachexia model protein synthesis in muscle measured in vivo in cachectic patients with cancer attenuation of resting but not load-mediated protein synthesis in prostate cancer patients on androgen deprivation effects of oral meal feeding on whole body protein breakdown and protein synthesis in cachectic pancreatic cancer patients protein anabolic resistance in cancer igf- is downregulated in experimental cancer cachexia muscle atrophy in experimental cancer cachexia: is the igf- signaling pathway involved? cancer cachexia and diabetes: similarities in metabolic alterations and possible treatment central tenet of cancer cachexia therapy: do patients with advanced cancer have exploitable anabolic potential? normal protein anabolic response to hyperaminoacidemia in insulinresistant patients with lung cancer cachexia muscle protein synthesis in cancer patients can be stimulated with a specially formulated medical food high anabolic potential of essential amino acid mixtures in advanced nonsmall cell lung cancer current opinion in clinical nutrition and metabolic care anamorelin hydrochloride in the treatment of cancer anorexia-cachexia syndrome: design, development, and potential place in therapy anamorelin (ono- ) in japanese patients with non-small cell lung cancer and cachexia: results of a randomized phase trial anamorelin in patients with non-small-cell lung cancer and cachexia (romana and romana ): results from two randomised, double-blind, phase trials amino acid uptake in skeletal muscle of rats bearing the yoshida ah- ascites hepatoma tumour growth results in changes in placental amino acid transport in the rat: a tumour necrosis factor α-mediated effect cancer cachexia, malnutrition, and tissue protein turnover in experimental animals hypothesis: muscular glutamine deficiency in sepsis-a necessary step for a hibernation-like state? enhanced leucine oxidation in rats bearing an ascites hepatoma (yoshida ah- ) and its reversal by clenbuterol the energy state of tumor-bearing rats effect of branched-chain amino acids on muscle atrophy in cancer cachexia leucine-rich diet alters the h-nmr based metabolomic profile without changing the walker- tumour mass in rats branchedchain amino acids: the best compromise to achieve anabolism? cancer-induced muscle wasting: latest findings in prevention and treatment comparison of the anticatabolic effects of leucine and caβ-hydroxy-β-methylbutyrate in experimental models of cancer cachexia reversal of cancer-related wasting using oral supplementation with a combination of β-hydroxy-βmethylbutyrate, arginine, and glutamine supplementation with a combination of beta-hydroxy-beta-methylbutyrate (hmb), arginine, and glutamine is safe and could improve hematological parameters a randomized, doubleblind, placebo-controlled trial of a β-hydroxyl β-methyl butyrate, glutamine, and arginine mixture for the treatment of cancer cachexia (rtog ) glutamine supplementation in cancer patients l-glutamine supplementation promotes an improved energetic balance in walker- tumor-bearing rats lipid mobilization in cachexia: mechanisms and mediators muscle ucp- mrna levels are elevated in weight loss associated with gastrointestinal adenocarcinoma in humans mitochondrial and sarcoplasmic reticulum abnormalities in cancer cachexia: altered energetic efficiency? combination of exercise training and erythropoietin prevents cancer-induced muscle alterations cancer cachexia is associated with a decrease in skeletal muscle mitochondrial oxidative capacities without alteration of atp production efficiency skeletal muscle mitochondrial uncoupling in a murine cancer cachexia model disrupted skeletal muscle mitochondrial dynamics, mitophagy, and biogenesis during cancer cachexia: a role for inflammation antioxidant supplementation accelerates cachexia development by promoting tumor growth in c tumorbearing mice il- regulation on skeletal muscle mitochondrial remodeling during cancer cachexia in the apc min/+ mouse combined approach to counteract experimental cancer cachexia: eicosapentaenoic acid and training exercise altered mitochondrial quality control signaling in muscle of old gastric cancer patients with cachexia mitochondrial biogenesis and fragmentation as regulators of muscle protein degradation the opa -dependent mitochondrial cristae remodeling pathway controls atrophic, apoptotic, and ischemic tissue damage modulation of autophagy and ubiquitinproteasome pathways during ultra-endurance running skeletal muscle pgc- β signaling is sufficient to drive an endurance exercise phenotype and to counteract components of detraining in mice regulatory circuitry of tweak-fn system and pgc- α in skeletal muscle atrophy program a pgc- α isoform induced by resistance training regulates skeletal muscle hypertrophy increase in muscle mitochondrial biogenesis does not prevent muscle loss but increased tumor size in a mouse model of acute cancer-induced cachexia the therapeutic potential of skeletal muscle plasticity in duchenne muscular dystrophy: phenotypic modifiers as pharmacologic targets ampk and pparδ agonists are exercise mimetics ampk activation stimulates autophagy and ameliorates muscular dystrophy in the mdx mouse diaphragm effects of pharmacological interventions on muscle protein synthesis and breakdown in recovery from burns effects of the antidiabetic drugs on the age-related atrophy and sarcopenia associated with diabetes type ii metformin treatment modulates the tumour-induced wasting effects in muscle protein metabolism minimising the cachexia in tumour-bearing rats amp-activated protein kinase-deficient mice are resistant to the metabolic effects of resveratrol calorie restriction-like effects of days of resveratrol supplementation on energy metabolism and metabolic profile in obese humans an amp-activated protein kinase-stabilizing peptide ameliorates adipose tissue wasting in cancer cachexia in mice mammalian sirtuins: biological insights and disease relevance pgc- α, sirt and ampk, an energy sensing network that controls energy expenditure sirt protein, by blocking the activities of transcription factors foxo and foxo , inhibits muscle atrophy and promotes muscle growth the multifaceted functions of sirtuins in cancer metabolic control of muscle mitochondrial function and fatty acid oxidation through sirt /pgc- α a pilot randomized, placebo controlled, double blind phase i trial of the novel sirt activator srt in elderly volunteers a novel sirtuin (sirt ) inhibitor with p -dependent pro-apoptotic activity in non-small cell lung cancer cardiovascular effects of a novel sirt activator, srt , in otherwise healthy cigarette smokers srt extends survival of male mice on a standard diet and preserves bone and muscle mass the antiischemic effect of trimetazidine in patients with postprandial myocardial ischemia is unrelated to meal composition the metabolic modulator trimetazidine triggers autophagy and counteracts stress-induced atrophy in skeletal muscle myotubes trimetazidine improves exercise performance in patients with peripheral arterial disease improvement of skeletal muscle performance in ageing by the metabolic modulator trimetazidine excessive fatty acid oxidation induces muscle atrophy in cancer cachexia synbiotic approach restores intestinal homeostasis and prolongs survival in leukaemic mice with cachexia molecular mechanisms linking exercise to cancer prevention and treatment the authors declare that there is no conflict of interest regarding the publication of this paper. key: cord- -g ob b authors: xie, xiao-li; zheng, li-fei; yu, ying; liang, li-ping; guo, man-cai; song, john; yuan, zhi-fa title: protein sequence analysis based on hydropathy profile of amino acids date: - - journal: journal of zhejiang university science b doi: . /jzus.b sha: doc_id: cord_uid: g ob b biology sequence comparison is a fundamental task in computational biology. according to the hydropathy profile of amino acids, a protein sequence is taken as a string with three letters. three curves of the new protein sequence were defined to describe the protein sequence. a new method to analyze the similarity/dissimilarity of protein sequence was proposed based on the conditional probability of the protein sequence. finally, the protein sequences of nd (nadh dehydrogenase subunit ) protein of eight species were taken as an example to illustrate the new approach. the results demonstrated that the method is convenient and efficient. the comparative biological sequence is one of the issues in bioinformatics when analyzing similarities of function and properties of different sequences. similarly, evolutionary homology is analyzed by comparing dna and protein sequences. in general, there are two types of methodologies to conduct the comparison. one is an alignment-based method, and the other is an alignment-free method. sequence alignment is based on computeroriented and computer-intensive comparisons of sequences, and then a distance function or a score function is obtained. using the distance function, one can compare biological sequences. however, multiple sequence alignment of several hundred sequences always produces a bottleneck, firstly due to long computational time, and secondly due to possible bias of multiple sequence alignments for multiple occurrences of highly similar sequences (pham and zuegg, ) . therefore, the emergence of a study on alignment-free sequence analysis is obvious. until now, alignment-free sequence analysis is still in its early development. for most alignment-free methods, a biological sequence should be transformed into an object for which a linear algebra and statistical theory already has useful analytical tools. since , dna sequence has been represented in different dimension spaces (hamori and ruskin, ; hamori, ; nandy, ; ; nandy and basak, ; randić et al., ; randić, ; randić and balaban, ; zhang et al., ; liao and wang, ; liao et al., ; nandy et al., ; bai et al., ; feng and wang, ) . each nucleotide of a given dna sequence is a point in different dimension spaces, and these graphical representations can allow us to qualitatively analyze dna sequences, and provide a way of viewing, sorting and comparing various genomic sequences. based on the graphical representation, it is possible to numerically characterize dna sequence and further quantitatively measure similarity of different dna sequences. although protein sequence and dna sequence belong to symbolic sequences, compared with dna sequence, there are fewer methods for the graphical representation of protein sequence. this is mainly because extension of dna graphical representation to protein sequences would enormously increase the number of possible alternative assignments for the amino acids. the amino acid sequence is the key to understanding protein structure and function in the cell, so analysis of amino acid sequence is an important part of post-genomic studies. recently, several schemes have been proposed in protein graphical representation (randić and krilov, ; vinga and almeida, ; bai and wang, ; li j. et al., ; li c. et al., ; munteanu et al., ; yau et al., ; yao et al., ; wen and zhang, ) . in order to plot amino acid sequence, amino acids in protein sequences are divided into different types, including protein sequence regarded as a word with three, four, or five different letters. since ordering amino acids based on their physicochemical properties may offer better insights into comparative study of protein than representation of protein based on the random ordering of amino acid, randić ( ) and yao et al. ( ; outlined different d graphical representations of protein sequence based on different physicochemical properties. the graphical representation of protein sequence cannot only describe amino acid sequence, but also measure similarity/ dissimilarity of different protein sequences. however, the methods only consider the string's information of protein, and do not consider adjacent string's information of amino acid sequence. here, we choose conditional probability to measure adjacent string's information. in this paper, we converted a protein sequence into three-letter sequence based on hydropathy profile of amino acid and defined the three curves to represent different hydropathy features. we then selected conditional probability as a new invariant for the protein sequences. to illustrate the proposed method, we made a comparison of the sequences belonging to eight nd (nadh dehydrogenase subunit ) proteins from http://www.ncbi.nlm.nih.gov/: human , rat (ap_ ), and mouse (np_ ). according to the hydropathy profile of amino acids, the amino acids can be classified into three groups (nei and kumar, ; liu and wang, ) : internal group (f, i, l, m, v), external group (d, e, h, k, n, q, r), and ambivalent group (s, t, y, c, w, g, p, a). the amino acid of internal group tends to occur in the inner side of the protein's spatial structure, while the amino acid of external group tends to appear at the surface. in order to characterize the hydropathicity of a protein primary structure, we defined a primary protein sequence as a symbolic sequence including three letters according to the following rule: where s(i) is the letter in the ith position in the protein primary sequence, and f(s(i)) is the substitution for s(i). since the hydropathy profile can detect more evolutionary relationships, in the next section, we analyzed the new protein sequence containing three letters through different mathematical methods. given a protein primary sequence with length n, we transformed it into a new sequence according to the above definition. for example, for the protein sequence, s=mmyalfllsvglvmgfvgfs, then f(s)=iiaaiiiiaiaiiiaiiaia. to obtain more information, we defined three curves of the sequence. firstly, we let ie ea ia if ( ( )) i, otherwise, where i ranges from to n. then, let y n u and n are y axis and x axis, respectively, and then we can draw three different curves, which are named as ie, ia, and ea curves of the protein sequence. the three different curves can give us some information about the protein sequence. according to the ie curve, we can compare the numbers of the amino acids belonging to the internal group and the external group at different positions. the ia curve can then be used to compare the numbers of the amino acids belonging to the internal group and the ambivalent group at different positions. finally, the ea curve can compare the numbers of the amino acids of the external group and the ambivalent group at different positions. according to the above definitions of three different curves, we drew three curves of nd proteins for the eight species (fig. ). fig. shows that the amino acids of the internal group in nd protein sequences are more than the amino acids of the external group, and the amino acids of the ambivalent group are more than the amino acids of the external group. furthermore, it is evident that g. seal and h. seal have similar curves, rat and mouse's curves are almost identical, and the three curves of human, gorilla, and chimpanzee are similar, but wallaroo's curve is different from curves of other species. protein sequence is composed of three parts, internal group, external group and ambivalent group, so we regard the random numerical sequence to be composed of three parts (+ , , − ). we calculated the conditional probability, which was invariant to quantity protein sequences. for example, let x i ie represents the state of the ith (i= , , ..., n) moment, state space s={+ , , − }. there are nine conditional probabilities as follows: ( ), according to the above definition, we can obtain these conditional probabilities of a given protein sequence. the conditional probability of each of nd proteins is listed in table . given two protein sequences, we can obtain two nine-component vectors whose elements are conditional probabilities for each protein sequence. based on the vectors, we can compare different protein sequences. in general, similarities of the two vectors can be obtained by calculating euclidean distance. the smaller the euclidean distance of two vectors is, the more similar are the protein sequences. the euclidean distance of two vectors u and v is as follows: where u i and v i denote the components of vectors u and v, respectively. k is the dimension of vectors u and v. yao et al. ( ) proposed a new similarity measure of sequences, and coefficient of determination (r ), which is defined as: r can vary from to , and represents the percent of the data, which is the closest to the line of best fit. the larger the coefficient of determination of two vectors is, the more similar are the protein sequences. in tables and , we give the similarity/dissimilarity matrices for the eight nd sequences based on euclidean distance and coefficient of determination amongst nine-component vectors. as shown in tables and , it is obvious that nd proteins of human, gorilla, and chimpanzee are more similar to each other. in addition, nd proteins are more similar for (g. seal, h. seal) and (mouse, rat). however, nd protein of wallaroo is very dissimilar to others amongst the eight species. the results are consistent with the known fact of evolution (yao et al., ) . biology sequence analysis is a fundamental task in computational biology, whose aim is to detect similarity/dissimilarity relationships between molecular sequences. some alignment-free methods to analyze similarities/dissimilarities of dna sequences have been proposed. however, there are few alignmentfree methods to analyze protein sequences. the amino acid sequence of a protein is the key to understanding its structure and function in the cell, so we present a new method to analyze protein primary sequence in this paper. the method is based on the graphical representation and conditional probability taken as the numerical characterization of the protein sequence. the demonstrable significance of the new method is that it cannot only analyze similarity/dissimilarity of protein sequences, but also provide more biological information about the protein sequences. according to the ie curve, we can compare the numbers of amino acids of the internal and external groups at different positions. also the ia curve can be used to compare the numbers of amino acids of the internal and ambivalent groups at different positions. the ea curve can be used to compare the numbers of amino acids in the external and ambivalent groups at different positions. therefore the three curves show the distribution of the three types of amino acids. furthermore, the conditional probability reflected the distribution of the two adjacent amino acids. the new approach was applied to nd protein sequences of several species and results have shown that the introduction of hydropathy profile of amino acids into protein sequence is effectual and feasible. a -d graphical representation of protein sequences based on nucleotide triplet codons a representation of dna primary sequences by random walk a d graphical representation of rna secondary structures based on chaos game representation novel dna sequence representation h curves, a novel method of representation of nucleotide series especially suited for long dna sequences -d graphical representation of protein sequences and its application to coronavirus phylogeny simplification of protein sequence and alignment-free sequence analysis analysis of similarity of dna sequences based on d graphical representation application of d graphical representation of dna sequence protein-based phylogenetic analysis by using hydropathy profile of amino acids enzymes/non-enzymes classification model complexity based on composition, sequence, d and topological indices a new graphical representation and analysis of dna sequence structure: i. methodology and application to globin genes two-dimensional graphical representation of dna sequences and intron-exon discrimination in intronrich sequences simple numerical descriptor for quantifying effect of toxic substances on dna sequences mathematical descriptors of dna sequences: development and applications molecular evolution and phylogenetics a probabilistic measure for alignment-free sequence comparison condensed representation of dna primary sequences -d graphical representation of proteins based on physico-chemical properties of amino acids characterization of -d sequences of proteins on a four-dimensional representation of dna primary sequences on the characterization of dna primary sequences by triplet of nucleic acid bases alignment-free sequence comparison-a review a d graphical representation of protein sequence and its numerical characterization analysis of similarity/dissimilarity of protein sequences similarity/dissimilarity studies of protein sequences based on a new d graphical representation a protein map and its application the z curve database: a graphic representation of genome sequences we would like to thank dr. jian-gang wang (college of animal science and technology, northwest a&f university, china), jian-zhong luo (department of foreign languages, northwest a&f university, china), feng an and dr. jun-li du (college of sciences, northwest a&f university, china) for their helpful suggestions. key: cord- -exbs tz authors: pumchan, ansaya; krobthong, sucheewin; roytrakul, sittiruk; sawatdichaikul, orathai; kondo, hidehiro; hirono, ikuo; areechon, nontawith; unajak, sasimanas title: novel chimeric multiepitope vaccine for streptococcosis disease in nile tilapia (oreochromis niloticus linn.) date: - - journal: sci rep doi: . /s - - - sha: doc_id: cord_uid: exbs tz streptococcus agalactiae is a causative agent of streptococcosis disease in various fish species, including nile tilapia (oreochromis niloticus linn.). vaccination is an effective disease prevention and control method, but limitations remain for protecting against catastrophic mortality of fish infected with different strains of streptococci. immunoproteomics analysis of s. agalactiae was used to identify antigenic proteins and construct a chimeric multiepitope vaccine. epitopes from five antigenic proteins were shuffled in five helices of a flavodoxin backbone, and in silico analysis predicted a suitable rna and protein structure for protein expression. f and e were identified as the best candidates for a chimeric multiepitope vaccine. recombinant plasmids were constructed to produce a recombinant protein vaccine and dna vaccine system. overexpressed proteins were determined to be kda and kda in the e. coli and tk systems, respectively. the efficacy of the chimeric multiepitope construct as a recombinant protein vaccine and dna vaccine was evaluated in nile tilapia, followed by s. agalactiae challenge at × ( ) cfu/ml. relative percentage survival (rps) and cumulative mortality were recorded at approximately – % and – %, respectively. these chimeric multiepitope vaccines should be applied in streptococcosis disease control and developed into a multivalent vaccine to control multiple diseases. immunogenic protein characterization. proteins bound to a s. agalactiae antibody were eluted from protein a agarose and divided into two fractions. the first fraction was subjected to - % gradient sds-page to observe the protein features and compare the protein profile from serotypes ia and iii. the second fraction was subjected to lc-ms/ms mass spectrometry to identify the immunogenic proteins. the protein profile from the immunoprecipitation on d-sds-page demonstrated that the major protein (approximately kda) corresponded to rabbit immunoglobulin. however, several bacterial proteins could not be bound to rabbit immunoglobulin and were removed through the flow-through fraction (ft), whereas the protein that specifically bound to the anti-s. agalactiae antibody could be detected in the eluted fraction (fig. ) . comparative immunoproteomics analysis of s. agalactiae serotypes ia and iii was determined by lc-ms/ ms and assessed by a venn diagram ( supplementary fig. ). one hundred proteins were matched and identified between serotype ia and serotype iii via in-house protein databases, resulting in shared proteins between serotype ia and serotype iii. the protein expression levels of the common proteins were determined by hierarchical clustering (hcl). two groups of immunogenic proteins were demonstrated based on their abundance, and proteins were overexpressed in serotype iii, whereas there was a lower abundance of immunogenic proteins in serotype iii than in serotype ia (fig. ) . regarding specific antigen-antibody interactions, and proteins were uniquely identified in serotypes ia and iii, respectively (supplementary figs. , ) . linear β-cell epitope prediction and chimeric vaccine design. the epitopes of immunogenic proteins were predicted by the bcpreds server based on b cell epitopes to be used in chimeric multiepitope vaccine construction. in this study, not only immunogenic proteins from the immunoproteomics analysis were used but also other subunit vaccine candidates were subjected to epitope prediction and combined to produce a chimeric multiepitope vaccine. the amino acid sequences of the c-β protein (bac), surface protein rib (rib), lpxtg cell wall anchor domain-containing protein (spb ), surface immunogenic protein (sip), and cell surface protein heat map with hierarchical clustering (hcl) of normalized protein abundance reveals the differentially expressed immunogenic proteins. the expression value showed in the relative intensities ranges from the highest protein abundance (red) to the lowest protein abundance (green) expression value. alternatively throughout the backbone likely provides potential bioactivity . considering α/β fold structure, flavodoxin from escherichia coli [pdb accession code: chy] was utilized as a linker to combine the epitope fragments from five antigenic proteins. predicted epitopes were randomly displayed on the α-helix structure of flavodoxin, generating , designed models due to the variance of epitopes of bac and of epitopes of sip protein. after joining, protein conformation was examined by molecular modeling with , constructs. i-tasser and stereochemical qualitative allowance manifested from f and e showed appropriate potential tertiary structure with optimal c-scores between − and . f and e also demonstrated the highest score of the amino acid allowance region in the ramachandran plot. the f multiepitope model represented . %, . %, and . % of residues located in the most favored, allowed, and disallowed regions, respectively. meanwhile, the ramachandran plot regions for the e designed model comprised . %, . %, and . %, respectively, of the residues ( supplementary fig. ). the epitope arrangements in f and e were represented in a d structure of chimeric proteins, showing that all chosen epitopes were exposed to the protein surface. the five epitopes were displayed as α-helical layers surrounding chy linkers, which appeared as five-stranded parallel β-sheets at the structure's center, with the order (fig. ). codon optimization of chimeric multiepitope vaccines and plasmid construction. the ectopic expression of bacterial protein in the fish cells may not be achieved due to different codon utilization in the bacterial system. subsequently, codon optimization of the chimeric multiepitope vaccine was analyzed by geneart ™ 's gene optimization according to iso standards (registration no. ) to apply the codon bias of oreochromis niloticus. the region of an ideal gc content range-between % to %-was well optimized. moreover, negative cis-acting sites included internal tata-boxes, chi-sites and ribosomal sites; at-rich or gc-rich sequence stretches; rna instability motifs; repeat sequences; rna secondary structures; and splice donor and acceptor sites in higher eukaryotes, which were successfully removed from these chimeric multiepitope dna vaccine sequences. the best two predicted chimeric multiepitope vaccines were designated f and e . codon adaptation index (cai) presented f and e scores that matched in codon utilization with that of nile tilapia of . and . , respectively. the codon quality distribution index of f and e demonstrated that the codons within the dna sequence were distributed frequently in - positions at % and % ( supplementary fig. a-d) . the average gc content of both chimeric multiepitope vaccines was % ( supplementary fig. e ,f). single-stranded rna-folding prediction revealed the minimum free energy (mfe) secondary structure of f and e ( supplementary fig. the protparam server demonstrated a theoretical pi of . and a molecular mass of kda for f and e . the total number of negatively (asp and glu) and positively (arg and lys) charged amino acid residues of f was and residues, while for e , there were and residues, respectively. the estimated half-life of both chimeric multiepitope constructs was approximately h in mammalian reticulocytes (in vitro), more than h in yeast (in vivo), and over h in e. coli (in vitro). f showed aliphatic index and grand average of hydropathicity values of . and − . , respectively, whereas e showed values of . and − . , respectively. the f and e proteins were indicated to be stable proteins, as represented by instability indexes of . and . , respectively. antigenicity of the f and e chimeric multiepitope vaccines was predicted as . % and . % at a . % threshold for the bacterial model, consistent with antigenpro server prediction by representing . and . , respectively. these results indicate that both vaccine candidates have high potential antigenic www.nature.com/scientificreports www.nature.com/scientificreports/ properties. conformational b cell epitopes from the d protein structure computed by the discotope server demonstrated b cell epitope residues in both f and e at a − . threshold ( table ) . interestingly, the number of epitopes was reduced when computed at the − . and − . thresholds, with f showing and b cell epitope residue regions, respectively, while e contained only and b cell epitope residue regions, respectively (table ). recombinant plasmids harboring e and f were constructed, namely, pet a (+)_ e or _ f and pcdna . (+)_ e or _ f , which were used to determine the recombinant chimeric multiepitope vaccine expression (fig. ) . chimeric multiepitope protein expression was tested in a bacterial expression system and a fish cell (tk- ) culture expression system. these results demonstrated that both chimeric multiepitope proteins could be expressed in both systems, with the expression detectable within h in e. coli ( kda) and within days post-transfection in tk- cells ( kda) (fig. ). larger-sized chimeric multiepitope proteins in the e. coli expression system resulted from an additional tag at the n-terminus, which was contained in the pet expression vector. vaccine efficacy. after vaccination, fish were challenged with s. agalactiae, and infected fish showed clinical signs of streptococcosis disease, such as swirling swimming, opaque eye, exophthalmia and abscess. these moribund fish were collected, and bacteria were re-isolated, showing that they were infected with s. agalactiae serotype iii ( supplementary fig. s ) . www.nature.com/scientificreports www.nature.com/scientificreports/ dna vaccine efficacy testing showed that fish immunized with either f or e had cumulative mortality rates of . ± . % and . ± . %, respectively, which were not significantly different from those of the fkc-vaccinated fish (p > . ). however, in the control group [empty vector; pcdna . (+)], . ± . % mortality was observed at days post-challenge (fig. a ). the recombinant chimeric multiepitope protein vaccination showed that f and e produced cumulative mortality rates of . ± . % and . ± . %, respectively, which were significantly lower than those of the negative control group, at % (p < . ) (fig. a) . the f and e dna vaccines demonstrated similar patterns of rps, with . ± . % and . ± . %, respectively, which were not significantly different from those of the fkc-immunized fish ( . ± . %). however, they were significantly higher than those of the recombinant protein vaccines, which showed . ± . % and . ± . % for e and f , p < . , respectively (fig. b ). immune response. to determine the immune response, dot blot analysis of serum prepared from e -or f -vaccinated fish was used. it was demonstrated that the dna vaccine could gradually activate the production of fish antibodies from the st to the th week. the pattern of antibody response differed from that for the recombinant protein vaccine, with the highest activation of antibody production being significantly produced in the nd - rd week and suddenly dropping in the th week. the highest induction was observed in fkc-immunized fish (fig. ) . dot blot analysis of vaccinate fish sera against whole cell lysate of s. agalactiae serotype ia and iii demonstrated that fish vaccinated with recombinant protein vaccine e and f showed cross-reactivity to whole cell lysate of s. agalactiae serotype ia and iii ( supplementary fig. s ). for the reverse vaccinology approach, computational analysis using a variety of bioinformatics tools is robust and beneficial when identifying appropriate vaccine candidates . bacterial genomics and proteomics analysis indeed help researchers analyze proteins, short domains, and pathogenic epitopes that provide high immunogenicity and high antigenicity for multimeric vaccine development . therefore, immunoproteomics should be applied as a preliminary process to screen antigenic proteins and minimize potential candidates for vaccine development . several immunogenic proteins in this study were described previously, such as c a peptidase and laminin-binding surface protein (lmb), which are cell surface proteins that have an important function in www.nature.com/scientificreports www.nature.com/scientificreports/ chemoattractant activities and are proteins promoting invasion of group b streptococcus (gbs) , . however, the current immunoproteomics analysis from this study identified new immunoreactive proteins, such as bacteriocin transport accessory protein, dihydrofolate reductase, ssu ribosomal protein s p, transposase tnpa, , -alpha-glucan, cell wall surface anchor family protein, and the gtp-binding protein era. as expected, most of these are cell surface proteins, which are suggested to be associated with bacterial virulence , . subsequently, the identified immunoreactive proteins may be used in further vaccine development. multiepitope vaccines are an interesting issue since constructed vaccines designed by in silico analysis may elicit cellular immunity and provide effective responses , . it is known that immunodominant b cells could strongly induce both cellular and humoral immunity; thus, evaluation of b cell epitopes was performed to identify potent epitopes before integrating them to produce a multiepitope vaccine. moreover, this vaccine type is more efficient than whole antigens for controlling staphylococcus spp. infections , . from the present study, table . predicted conformational b-cell epitopes from d structure of designed chimeric multiepitope vaccines using discotope . server. www.nature.com/scientificreports www.nature.com/scientificreports/ linear b cell epitope prediction was assessed and identified potent epitopes from common immunogenic and virulence proteins that were present in serotypes ia and iii. the previous studies supported that one of the chosen proteins, sip, represented a highly conserved protein among gbs isolates and showed cross-protective immunity against gbs infections , , , . prediction of candidate antigenic proteins can be used to select the bacterial strains that carry antigenic genes, as well as to determine high expression levels in the target host and the accessibility of particular antigens in host organisms . therefore, these selected immunogenic proteins might be suitable for consideration in a rational vaccine design. rational chimeric multiepitope vaccine design was achieved by randomly combining epitopes from immunogenic proteins and conjugating with core structures of flavodoxin (β- - - chy) to produce a secondary structure with α/β folding. in addition to the α/β-type folding of flavodoxin, it was also useful to construct our chimeric multiepitope vaccine by forcing the chosen epitope segments to fit within α-helix loops and protrude out of the d-folded structure since that configuration benefits protein solubility by exposure to water molecules . additionally, this linker may promote the solubility of the constructed vaccine and help enhance the recognition of the vaccine by the host's immune system, which contributes to vaccine efficacy. f and e presented the most favored region of protein folding, with the stereochemical quality representing the disallowed region at only . %, which is acceptable since the minimum quality should be less than % . it is suggested that in silico analysis could design a chimeric multiepitope vaccine that could probably manifest effective properties , . to achieve a high level of protein expression in nile tilapia, codon optimization was conducted to improve the transcription and translation capability by removing all possible cis-acting sequence motifs, which may have a negative impact on protein expression. both proteins had a cai > . and a codon with frequent distribution (cfd) > %, which are acceptable for high expression in the target organism , . the gc content of f and e was optimized between - % and had a suitable thermodynamic ensemble free energy, which allowed rna folding and thermodynamic stability , . the overall points suggested that the modeled f chimeric multiepitope vaccine was clearly the best candidate vaccine. numerous effective single-serotype gbs vaccines have been reported, including vaccines for controlling streptococcosis in tilapia , , , , , , . however, it is known that single-serotype whole-cell inactivated vaccines have limitations during cross-prevention against different serotypes. for instance, a s. iniae vaccine (serotype i) could not protect atlantic salmon from infection by s. iniae (serotype ii) . meanwhile, mixed-serotype vaccines (serotypes iv and vii) could promote antiserum levels and enhance the survival rate of newborn pups against streptococcal infection . although formalin-killed vaccines generally provide highly protective effects compared with those of subunit vaccines and dna vaccines, the subunit and dna vaccines may replace the original formalin-killed vaccines or inactivated vaccines due to their promising efficacy, which are similar to those of inactivated vaccines, and longer shelf life , . evidence suggests that dna and subunit vaccines can efficiently trigger the immune system and promote protective efficacy, with an rps value greater than % , , , . nevertheless, these vaccines have limitations, such as their mass production costs, and they may require various optimizations to obtain the highest stable storage conditions , . regarding this idea, a chimeric multiepitope vaccine composed of different epitopes from different proteins common in both serotypes ia and iii was generated to achieve broad protection against different serotypes and increase their stability. interestingly, the designed chimeric multiepitope dna vaccine and protein vaccine exhibited effective prevention in nile tilapia against s. agalactiae, with efficacy similar to that of the whole-cell inactivated vaccine. this evidence supports the strategy of rational vaccine design through b cell recognition using in silico analysis. importantly, immunoproteomics analysis could assist the preliminary determination of suitable immunogenic proteins for vaccine development due to the distinct antigenic determinants that can mediate dissimilar immune responses. the criterion in immunogenic protein selection for vaccine development has focused on the ability of a particular protein to induce an immune response. among identified proteins shared in both serotypes, in addition to providing the highest bcpred scores (table ) , the proteins chosen were also reported as virulence proteins and used as vaccine candidates for streptococcosis disease prevention . for example, c-β protein (bac) can lead to antibody production through fc region binding of human iga . sip protein has been shown to mediate protection against streptococcal infection , , . additionally, the chosen immunogenic protein should be conserved among streptococcus spp., so it would be suitable for application in cross-reactive prevention among s. agalactiae serotypes , . moreover, it should be mentioned that peptide vaccines or epitopes with only amino acid residues may trigger immune responses through binding directly to mhc-i or mhc-ii molecules. these molecules localize to nonprofessional antigen-presenting cells. vaccines containing proteins with longer amino acid sequences can enhance the presentation of epitopes to dendritic cells due to t cell induction , , . herein, the comparative efficacy of both the f and e dna and recombinant protein vaccines indicated that the dna vaccine provided a higher efficacy than the recombinant protein vaccine. this result suggests that the dna vaccine can prolong the activation of the immune response by triggering both humoral and cellular immune responses , . moreover, the clearance rate of the recombinant protein vaccine in the host system may be faster than that of the dna vaccine. this difference implies that the dna vaccine can enter the host cell to produce chimeric multiepitope protein, with that protein existing in the host system for longer than the recombinant protein vaccine, thus enhancing its bioavailability. taken together, these data indicate that the antigen combination has shown promise for streptococcosis disease control in nile tilapia. this research demonstrated a novel platform for rational vaccine design based on chimeric vaccine development that used flavodoxin with a tim-barrel structure as a template. our chimeric protein backbone is suitable for presenting epitopes to be recognized by the host immune system. with epitopes, it could activate antibody production and demonstrated promising protection against bacterial disease similar to that of a whole-cell inactivated vaccine. this platform will promote the production of multivalent vaccines to control multiple diseases and for other applications in the future. experimental fish, bacterial strain and antibody. all male s. agalactiae-free nile tilapia (oreochromis niloticus linn.) were obtained from a commercial gap farm in thailand. the experiments were conducted in accordance with guidelines approved by the national research council of thailand. the experimental fish were anesthetized with clove oil to minimize stress during vaccination and challenge testing. s. agalactiae serotypes ia and iii were cultured as described previously . s. agalactiae serotype iii was used for polyclonal antibody (pab) production, which was kindly provided by prof. ikuo hirono, tumsat, japan. antibody against igm of nile tilapia was kindly provided by assist. prof. eakapol wangkahart. mahasarakham university, thailand. immunoproteomics analysis. s. agalactiae was grown in bhi broth at °c with agitation until reaching exponential phase. bacterial cells were collected by centrifugation, lysed in µl of lysis buffer [tris-buffered saline (tbs) with % tween- and . % lysozyme] and incubated at °c for min following sonication on ice. protein a agarose beads (cell signaling, usa) were added to the bacterial protein lysate, and nonspecific proteins were removed by min of centrifugation at , × g at °c. clarified supernatant was supplemented with % glycerol and then with a pab specific to s. agalactiae serotype iii ( : dilution). then, µl of protein a agarose beads were added to separate bound immunogenic proteins, and the bound proteins were separated by acetone precipitation [ : (v/v)]. precipitated proteins were solubilized in mm tris-hcl with . % sds, and a lowry assay was used to measure the protein concentration. the protein profile was assessed by fractionating µg of protein on a nupage - % bis-tris protein gel (thermofisher, usa). µg of immunogenic protein was mixed with a lysis buffer ( . % rapidgest sf in mm ammonium bicarbonate) and mm dtt in mm ammonium bicarbonate at °c for h. this step was followed by incubation with mm iodoacetamide (iaa) in mm ammonium bicarbonate at room temperature for min in the dark. the protein solution was cleaned up by a zeba spin desalting column before digestion with ng of sequencing-grade trypsin (promega, germany) at °c for h. tryptic peptides were dried at °c under a vacuum and then protonated with . % formic acid in lc water before injection into an lc-ms/ms. the tryptic peptides' immunoproteomics profiles were analyzed using an ultimate ™ nano/capillary lc system (dionex, uk) and hybrid quadrupole q-tof impact ii ™ (bruker daltonics gmbh, germany) equipped with a nano-captivespray ion source. first, nl of extracted peptide was subjected to a trapping column (thermo scientific, pepmap , c , μm i.d. × mm) through a full loop injection before being resolved in an analytical column (pepswift c nano column, μm × cm, i.d.) at °c. the linear gradient method was used to elute peptides with mobile phase a ( . % formic acid in water) and mobile phase b ( . % formic acid in % acetonitrile) at a . µl/min constant flow rate into the mass spectrometer. electrospray ionization was conducted at . kv using captivespray. mass spectra (ms) and ms/ms spectra were fully acquired in positive ion mode (compass . for otofseries software, bruker daltonics). mass accuracy was assessed using positive detection mode after internal calibration with sodium trifluoroacetate (na-tfa) within . ppm. raw lc-ms/ ms spectra were collected using compassxport version . . . (bruker daltonics gmbh, germany) to convert all spectra into the mzxml data format. the mzxml files of the lc-ms/ms datasets for label-free quantification of peptides were evaluated based on the ms profile by maxquant software. chimeric multiepitope vaccine design. the linear b cell epitope was predicted by bcpred . the scop and cath databases were used to design an appropriate chimeric multiepitope vaccine structure (an: wp_ ). a d structure was rendered by i-tasser (iterative threading assembly refinement) using the qualifying c-score value as a confidence score . to refine the tertiary structure, the derived i-tasser results in the pdb files were prepared using the galaxyrefine server, which performed a repeated structure perturbation, and the best structural relaxation candidates were chosen . moreover, to obtain the best chimeric multiepitope vaccine candidates, the residues were determined according to residue stereochemical quality for all the refined chimeric multiepitope models and validated by the procheck program v. . . to generate ramachandran plots . codon optimization. amino acid sequences were reverse-translated to nucleotide sequences using nile tilapia codon usage (oreochromis niloticus [gbvrt]: ). the codon adaptation index (cai) of the designed vaccine candidates' nucleotides was analyzed by an optimizer program (http://genomes.urv.es/optimizer/) and combined with geneart tm 's gene optimization process (thermo fisher scientific, usa). the secondary structure of the single-stranded rna folding and free energy of the thermodynamic ensemble were calculated by the rnafold web server . the optimized dna sequence was synthesized by geneart ® gene synthesis (thermo www.nature.com/scientificreports www.nature.com/scientificreports/ to verify the ectopic expression of the chimeric multiepitope dna vaccine, pcdna . (+)_ e or _ f was transfected into tk (tilapia kidney ) tilapia cells using effectene transfection reagent (qiagen, germany). the transfected fish cell cultures were maintained with leibovitz's l- media containing % fbs and penicillin-streptomycin at °c, and dna vaccine expression was determined after week. recombinant chimeric multiepitope protein was purified by ni-nta agarose beads (qiagen) with a gradient concentration buffer of imidazole ranging from mm to mm. subsequently, the gel filtration chromatography method was performed by fast protein liquid chromatography (fplc) incorporated with a hiprep / & / sephacryl s- high-resolution column (ge healthcare, usa) using a × pbs buffer with a ml/min flow rate. recombinant protein detection was confirmed by sds-page analysis and western blot analysis using an anti-his tag antibody (recombinant protein vaccine) or an anti-flag (rabbit igg) (dna vaccine) and anti-rabbit antibody conjugated to ap (alkaline phosphatase). vaccine efficacy analysis. to evaluate vaccine performance, nile tilapia (o. niloticus) were immunized with chimeric multiepitope vaccines (recombinant protein and dna vaccines), followed by bacterial challenge. a total of experimental groups, namely, ) the f recombinant protein vaccine, ) e recombinant protein vaccine, ) f dna vaccine, ) e dna vaccine, ) formalin-killed (fkc) s. agalactiae vaccine , and ) pcdna . (+) [empty vector], were conducted in triplicate. before vaccination, streptococcosis-free nile tilapia ( ± g) were transferred into glass aquarium tanks containing l of water for one week. after a week of acclimatization, fish were vaccinated according to above mentioned groups. all fish were maintained under running and aerated water at ± °c and fed with commercial pellet feed twice a day. for the chimeric multiepitope protein vaccination, purified f and e proteins were mixed with montanide isa (seppic, france) in a : ratio prior to intraperitoneal injection with µg of protein per fish. for the chimeric multiepitope dna vaccine, plasmid dna of f and e were purified by ultracentrifugation using a cscl gradient and dissolved in te buffer (ph . ) to obtain a concentration of . µg/µl. the dna vaccine was applied to the fish with µg of dna through intramuscular injection. fkc and pcdna . (+) were used as positive and negative controls, respectively. the schedule of vaccine efficacy analysis and immune response analysis was demonstrated in supplementary fig. . for the immune response analysis, blood was drawn from the caudal vein to separate serum for the immunoblotting assay, and those fish were transferred to another separate tank. the analysis was performed every week, using fish in each treatment from the st week to the th week. after one month of vaccination, vaccinated fish in each treatment group were taken from among the remaining fish for serum collection and anesthetized with eugenol before challenge with s. agalactiae (serotype iii) at × cfu/ml through ip administration. mortality and clinical signs of infected tilapia were recorded daily for weeks. the brain, head kidney, and liver were collected from moribund fish for bacterial isolation and identification . cumulative mortality and relative percentage survival (rps) were calculated . a one way analysis of variance (anova) was used for statistical analysis and p < . was considered significant. to detect the antibody response after immunization, antibody production was evaluated through dot blot analysis using the minifold ® i dot blot system (ge healthcare, germany). briefly, µl of purified e , f proteins, or a whole-cell lysate of s. agalactiae ( µg/ml) were spotted on a nitrocellulose membrane and blocked with blocking solution ( . % bsa in tbst) before adding µl of serum of the different treatment groups as above mentioned. then, the membrane was probed with a primary antibody (anti-igm at : , ) for . h, followed by washing times with tbst buffer and min of incubation with an anti-mouse igg hrp-linked ab ( : , ). subsequently, the signal was detected with a chemidoc ™ imaging system (bio-rad) after adding a substrate reagent (perkinelmer, usa). the integrated density of the dot blot was analyzed by imagej (version .x) . five different piscidins from nile tilapia, oreochromis niloticus: analysis of their expression and biological functions prevention and control of viral disease in aquaculture development of a quantitative pcr assay for monitoring streptococcus agalactiae colonization and tissue tropism in experimentally infected tilapia parasites and diseases streptococcus agalactiae serotype distribution and antimicrobial susceptibility in pregnant women in gabon microevolution of streptococcus agalactiae st- from australia indicates dissemination via imported tilapia and ongoing adaption to marine hosts or environment molecular serotyping, virulence gene profiling and pathogenicity of streptococcus agalactiae isolated from tilapia farms in thailand by multiplex pcr a microwave-irradiated s. agalactiae vaccine provides partial protection against experimental challenge in nile tilapia oreochromis niloticus efficacy of an experimentally inactivated s. agalactiae vaccine in nile tilapia (oreochromis niloticus) reared in brazil development of live attenuated streptococcus agalactiae vaccine for tilapia via continuous passage in vitro a recombinant truncated surface immunogenic protein (tsip) plus adjuvant fia confers active protection against group b streptococcus infection in tilapia development and efficacy of feed-based recombinant vaccine encoding the cell wall surface anchor family protein of s. agalactiae against streptococcosis in oreochromis sp safety and immunogenicity of an oral dna vaccine encoding sip of streptococcus agalactiae from nile tilapia oreochromis niloticus delivered by live attenuated salmonella typhimurium protective efficacy of cationic-plga microspheres loaded with dna vaccine encoding the sip gene of s. agalactiae in tilapia assembly and role of pili in group b streptococci dentification of universal group b streptococcus vaccine by multiple genome screen protection of nile tilapia (oreochromis niloticus l.) against streptococcus agalactiae following immunization with recombinant fbsa and alpha-enolase designing an efficient multi-epitope peptide vaccine against vibrio cholera via combined immunoinformatics and protein interaction based approaches immunoinformatics analysis and in silico designing of a novel multi-epitope peptide vaccine against staphylococcus aureus multiple b-cell epitope vaccine induces a staphylococcus enterotoxin b-specific lgg protective response against mrsa infection designing of complex multi-epitope peptide vaccine based on omps of klebsiella pneumonia: an in silico approach evaluation of β-amino acid replacements in protein loops: effects on conformational stability and structure crystal structure of escherichia coli chey refined at . -a resolution reliable b cell epitope predictions: impacts of method development and improved benchmarking a novel multi-epitope peptide vaccine against cancer: an in silico approach structure of the streptococcal cell wall c a peptidase enhanced expression of lmb gene encoding laminin-binding protein in streptococcus agalactiae strains harboring is in scpb-lmb intergenic region adhesion, invasion and evasion: the many functions of the surface proteins of staphylococcus aureus multi-epitope vaccines: a promising strategy against tumors and viral infections novel targeted immunotherapy approaches for staphylococcal infection identification of group b streptococcal sip protein, which elicits cross-protective immunity the i-tasser suite: protein structure and function prediction designing a novel multi-epitope peptide vaccine against pathogenic shigella spp. based immunoinformatics approaches novel immunoinformatics approaches to design multi-epitope subunit vaccine for malaria by investigating anopheles salivary protein exploring dengue genome to construct a multi-epitope based subunit vaccine by utilizing immunoinformatics approach to battle against dengue infection on the normalization of the minimum free energy of rnas by sequence length efficacy of feed-based adjuvant vaccine against streptococcus agalactiae in oreochromis spp growth, immune responses and protection of nile tilapia oreochromis niloticus immunized with formalinkilled streptococcus agalactiae serotype ia and iii. vaccines recovery of streptococcus iniae from diseased fish previously vaccinated with a streptococcus vaccine conjugate vaccines against group b streptococcus types iv and vii. the journal of infectious diseases development of streptococcus agalactiae vaccines for tilapia streptococcal β protein has separate binding sites for human factor h and iga-fc identification of group b streptococcal sip protein, which elicits cross-protective immunity immunotherapy of established (pre)malignant disease by synthetic long peptide vaccines a web resource for designing subunit vaccine against major pathogenic species of bacteria prospects for control of emergimg infectious dyseases with plasmid dna vaccines dna vaccines: ready for prime time? predicting linear b-cell epitopes using string kernels scope: structural classification of proteins extended, integrating scop and astral data and classification of new structures aqua and procheck-nmr: programs for checking the quality of protein structures solved by nmr viennarna package . . algorithms for molecular biology dna synthesis and biological security prediction of n-glycosylation sites in human proteins precision mapping of the human o-galnac glycoproteome through simplecell technology protein identification and analysis tools on the vaxijen: a server for prediction of protective antigens, tumor antigens and subunit vaccines high-throughput prediction of protein antigenicity using protein microarray data first report of streptococcus agalactiae isolated from oreochromis niloticus in piura, peru: molecular identification and histopathological lesions dna extraction from . µm sterivex filters and cesium chloride density gradient centrifugation potency and efficacy test of a vaccine in addition with adjuvant against koi herpesvirus in koi (cypronus carpio) nih image to imagej: years of image analysis the authors declare no competing interests. supplementary information is available for this paper at https://doi.org/ . /s - - - .correspondence and requests for materials should be addressed to s.u. publisher's note springer nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.open access this article is licensed under a creative commons attribution . international license, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the creative commons license, and indicate if changes were made. the images or other third party material in this article are included in the article's creative commons license, unless indicated otherwise in a credit line to the material. if material is not included in the article's creative commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. to view a copy of this license, visit http://creativecommons.org/licenses/by/ . /. key: cord- - oca mrm authors: shen, wen-jun; cui, wenjuan; chen, danze; zhang, jieming; xu, jianzhen title: rpirls: quantitative predictions of rna interacting with any protein of known sequence date: - - journal: molecules doi: . /molecules sha: doc_id: cord_uid: oca mrm rna-protein interactions (rpis) have critical roles in numerous fundamental biological processes, such as post-transcriptional gene regulation, viral assembly, cellular defence and protein synthesis. as the number of available rna-protein binding experimental data has increased rapidly due to high-throughput sequencing methods, it is now possible to measure and understand rna-protein interactions by computational methods. in this study, we integrate a sequence-based derived kernel with regularized least squares to perform prediction. the derived kernel exploits the contextual information around an amino acid or a nucleic acid as well as the repetitive conserved motif information. we propose a novel machine learning method, called rpirls to predict the interaction between any rna and protein of known sequences. for the rpirls classifier, each protein sequence comprises up to diverse amino acids but for the rpirls- g classifier, each protein sequence is represented by using -letter reduced alphabets based on their physiochemical properties. we evaluated both methods on a number of benchmark data sets and compared their performances with two newly developed and state-of-the-art methods, rpi-pred and ipminer. on the non-redundant benchmark test sets extracted from the pridb, the rpirls method outperformed rpi-pred and ipminer in terms of accuracy, specificity and sensitivity. further, rpirls achieved an accuracy of % on the prediction of lncrna-protein interactions. the proposed method can also be extended to construct rna-protein interaction networks. the rpirls web server is freely available at http://bmc.med.stu.edu.cn/rpirls. the interactions of proteins with other proteins, peptides, dnas and rnas govern most the essential molecular function. rna-protein interactions (rpis) have a critical influence on post-transcriptional gene regulation [ ] [ ] [ ] , viral assembly [ ] [ ] [ ] , cellular defence [ ] , protein synthesis [ , ] and various other fundamental biological processes [ , ] . a significant portion of transcripts is long non-coding rnas (lncrnas) which are not translated into proteins and are longer than nucleotides [ ] . lncrnas normally function with their interacting proteins [ ] . for instance, the lncrna hotair regulated the hoxd locus in trans by interacting with pcg proteins [ ] ; several lncrnas were shown to be able to interact with auf , a protein linked to aging and cancer [ ] ; lncrnas binding to jarid protein were essential for the recruitment of prc to the chromatin [ ] ; lncrna gas inhibited hepatitis c virus replication by decoying hcv ns protein [ ] . hence, the study of rpis is essential for understanding their functions. compared to those of protein-protein interactions and dna-protein interactions, current knowledge regarding rna-protein interactions, especially lncrna-protein interactions is still limited. in this study, we propose a novel machine learning method, which we call rna-protein interaction prediction based on regularized least squares (rpirls), to quantitatively predict the potential rna-protein interactions. the experimental determination of rpis remains expensive and time-consuming [ ] [ ] [ ] , but fortunately, the accumulated rpi experimental data facilitate the development of computational models for rpi prediction [ ] [ ] [ ] . in , pancaldi and b .. ahler [ ] introduced a computational approach for rbp (rna binding protein)-mrna interaction prediction. they employed support vector machines (svms) and random forests (rfs) based on more than physical and functional features of rpis, including gene ontology, chromosomal position, gene and protein physical properties, protein localization, experimental translation, mrna properties, predicted protein structure, utr properties and genetic interactions. bellucci et al. [ ] proposed a method called catrapid for the prediction of protein lncrna interaction. they evaluated the interaction propensities of protein-rna based on their physicochemical properties, including secondary structure, hydrogen bonding and van der waals. muppirala et al. [ ] developed a method called rpiseq, which predicted rpis solely based on primary sequences. the rpiseq method still employed svms and rfs but exploited different features. they represented each sequence of proteins and rnas as the normalized frequencies of the corresponding -mer and -mer, respectively. in , based on the same feature vectors presented in muppirala et al., wang et al. [ ] first reduced the dimensionality of feature vectors, and then performed the rpis prediction by using naive bayes classifier which assumed the independence of attributes and by using extended naive bayes classifier which considered the correlation between attributes. lu et al. [ ] integrated the information on the secondary structure, hydrogen bonding propensities and van der waals of lncrnas and proteins with fisher's linear discriminant model. in , suresh et al. [ ] developed a method called rpi-pred to predict rpis by considering the high-order d protein and rna structure information. in , pan et al. [ ] proposed a new method named ipminer that integrated deep learning with stacked ensembling to improve the prediction performance of ncrna-protein interactions. in this paper, we classified rna-protein pairs as interacting or non-interacting by integrating derived kernel with regularized least squares (rls) [ ] . the motivation is to relate the sequence information of proteins and rnas to their biological functions, i.e., interactions. our method attempted to extract discriminant subsequence features from amino acid sequences and nucleotide sequences. the derived kernel measures the similarity between two biological sequences by capturing nucleic acid or amino acid compositions and repetitive sequence patterns. we used regularized least squares in learning as the computations performed by rls algorithms can be expressed using just inner products, hence allowing efficient implementation of kernel-based learning, in addition the rls algorithms often perform comparable to the best batch classifiers [ ] . since the dimensionality of feature space increases exponentially with the template size, for computational sake, we set upper limit for template size. on the other hand, we categorized amino acids into several groups based on their physiochemical properties [ ] [ ] [ ] , the reduced alphabet representation of the protein sequence allows larger template size and also decreases the dimensionality of feature space. we considered the derived kernel with two-layer architecture, hence there were two template sets, denoted as t p and t r needed to be constructed for protein and rna, respectively. here we considered all possible substrings of the same length making up a template set. the template set t p for amino acid sequences was composed of substrings with k continuous amino acids, while the template set t r for nucleic acid sequences was composed of substrings with l continuous nucleic acids. in order to extract discriminant subsequence features from amino acid sequences and nucleotide sequences, we explored the effect on rpi prediction over a range of choices for the template sizes of protein and rna. the training set rpi was used to determine these parameters. in our case, we used different template sizes of protein and rna chosen from set { , , . . . , } ∪ { , , . . . , } for rpirls and { , , . . . , } ∪ { , , . . . , } for rpirls- g. with different combination of protein template size and rna template size, the combined kernelk dk was integrated with rls to predict rna-protein interactions. the ten-fold stratified cross-validation has been verified to be the best algorithm for model selection on a large scale experiments [ ] , therefore on the data set rpi , we tuned the parameter λ by ten-fold stratified cross-validation with the optional parameter set {λ = e n , n = − , · · · , }. the data set rpi was divided into ten mutually exclusive folds and the mean response of each fold was approximately equal. in each test we merged parts of the samples as the training set and left the other part as the test set. the parameter λ was chosen by leave-one-out cross-validation on the training set. for rpirls, in all the ten sets, λ = e − uniformly achieved the best performance in the training data. tables and showed the performance of the proposed method in terms of auc and accuracy with different combination of parameters k and l, respectively. the experiment results showed that when the protein template size k = and the rna template size l = , the model performs best with auc score of . and accuracy of . . the other measurements of specificity (sp) and sensitivity (se) were . and . , respectively. while for rpirls- g, λ = e − uniformly performed best in all the ten sets. table showed that the method achieved the best prediction accuracy of . when the protein template size k = and the rna template size l = . the other measurements (auc, sp and se) were observed as . , . and . , respectively. the computational results showed that the rpirls classifier outperformed the rpirls- g classifier in terms of various performance measurements, indicating that the diversity of amino acids at a sequence is important for the prediction of rpis. the performance of predicting rpis was evaluated by using -fold stratified cross-validation on the rpi data set. different combinations of parameters k and l were evaluated. remark on the symbols of template sizes: k stands for template size of amino acid sequences; l stands for template size of nucleic acid sequences. the best auc in the table is marked in bold. remark on the symbols of template sizes: k stands for template size of amino acid sequences; l stands for template size of nucleic acid sequences. the best accuracy in the table is marked in bold. in order to evaluate the reliability and robustness of rpirls and rpirls- g, we compared them with other two state-of-the-art methods rpi-pred and ipminer. the rpi and rpi data sets after removing overlapping rpis with the training data were evaluated. both the rpirls and rpirls- g classifiers were trained on the rpi data set, and tested on the rpi and rpi data sets, respectively. as shown in tables and , rpirls outperformed the rpirls- g, rpi-pred and ipminer methods on both data sets. for the rpi data set as shown in table , the performance of the rpirls method was . , . , . and . for predictive accuracy, auc, specificity and sensitivity, respectively. while the predictive accuracy of the rpi-pred and ipminer methods were just . and . , respectively which were much lower than rpirls's. the remaining measurements (specificity and sensitivity) were observed as . and . , respectively for rpirls, and . and . , respectively for ipminer. the rpirls method outperformed rpi-pred and ipminer in terms of accuracy, specificity and sensitivity on the rpi data set. table . predictive performance of, rpirls- g in terms of the accuracy on the, rpi training data set over varying template sizes. the performance of predicting rpis was evaluated by using -fold stratified cross-validation on the rpi data set. different combinations of parameters k and l were evaluated. remark on the symbols of template sizes: k stands for template size of amino acid sequences; l stands for template size of nucleic acid sequences. the best accuracy in the table is marked in bold. similar results were observed on the rpi data set in table . the specificity of the rpi-pred and ipminer methods was just . and . , respectively, indicating there was a positive bias in their predictions of performance. a low specificity increases the labor, cost, and time needed to perform the required experimental tests, but our rpirls method achieved both reasonable specificity and sensitivity. furthermore, we evaluated the rpirls classifier on large-scale rna-protein pairs in the currently available rpintdb data base. the rpirls method correctly predicted out of rpis, reaching the predictive accuracy of %. to explore the effectiveness of the proposed method on predicting ncrna-protein interactions, a large-scale ncrna-protein interaction data set (we called nrpi ) was retrieved from the npinter data base [ ] . we trained rpirls and rpirls- g on the rpi data set, and tested it on the nrpi . table showed the prediction results compared with the rpi-pred classifier on the nrpi data set. the ipminer method showed a significantly positive bias on predicting ncrna-protein interactions, thus here we didn't include it into the comparison. the predictive accuracy for different organisms were separately computed. for the six organisms, our method rpirls performed best for the homo sapiens and saccharomyces cerevisiae, rpi-pred performed best for drosophila melanogaster, escherichia coli and mus musculus, and both methods obtained the same predictive accuracy for the caenorhabditis elegans. rpirls outperformed rpirls- g over all six organisms. for ncrna-protein pairs, the rpirls method achieved an accuracy of % compared to % for rpirls- g and % for the rpi-pred method. we further tested rpirls and rpirls- g on the lnrpi data set which was a subset of the nrpi data set and consisted of only lncrna-protein interactions (lncrpis). our model achieved an overall accuracy of % compared to % for rpirls- g and % for the rpi-pred classifier as shown in table . the predictive performance of rpirls was improved in out organisms compared with its performance on the nrpi data set. the results indicated the effectiveness of our method to predict lncrna-protein interactions only by using primary sequences of proteins and rnas. predicting lncrna-protein interaction networks is useful to explore the molecular mechanisms that are regulated by lncrnas [ , ] . in this experiment, we evaluated the performance of rpirls on building lncrpi networks and further compared its performance with rpi-pred. on the basis of the data in npinter, we analyzed the results of four organisms, i.e., caenorhabditis elegans, drosophila melanogaster, escherichia coli and saccharomyces cerevisiae, consisting of , , and lncrpis, respectively. for caenorhabditis elegans, the rpirls method correctly identified all lncrpis (blue edges) while the rpi-pred method correctly identified out of . as shown in figure , rpi-pred made incorrect prediction for the pair of n -g ebf (red edges). in figure , rpi-pred correctly predicted all lncrpis, whereas rpirls missed out of lncrpis for drosophila melanogaster. these out of incorrect predictions which were observed between two proteins p and q vss (yellow rectangle) and three signal recognition particle (srp) rnas n , n and n (green ellipse), formed the srp rna-protein interactions. for escherichia coli, the rpirls classifier made much more errors than the rpi-pred method, with predictive accuracies of % vs. %. the performance of rpirls for escherichia coli was much poorer than that for the other five organisms. in order to analyze why rpirls had relative poor performance on escherichia coli, we estimated the amino acid composition of escherichia coli compared with that of the other five organisms. as shown in figure , we found that escherichia coli had relative higher observing frequencies of alanine and valine as well as much lower content of serine compared with that of the other five organisms. the amino acid composition bias in escherichia coli probably leaded to poor results. as shown in figure , among incorrect predicted pairs, rpis corresponded to protein hubs, e.g., p a h , p afz , p ag , p ce , p ce , p and p , each of which appearing as a yellow rectangle node was shown to interact with six transfer-messenger rnas (tmrnas), e.g., n , n , n , n , n and n (green ellipse). for saccharomyces cerevisiae, as showed in figure , among incorrect predicted pairs, rpis were involved in protein hubs (p , q and q ), in which each protein interacted with small nuclear rnas (snrnas: n , n , n and n ), and other rpis corresponded to protein hubs (p , p , p , q , q , q and q ), each of which interacted with three small nucleolar rnas (snornas: n , n and n ). the rpirls classifier correctly identified out of rpis, achieving a high accuracy of %, compared of % (correctly predicted out of pairs) for rpi-pred. in this work, we illustrated the effectiveness and reliability of rpirls in predicting rpis for eukaryotic organisms in networks which comprised a variety protein hubs and rna hubs. mammalian cells contain more than different proteins interacting with rna [ ] . normally, any individual rna can interact with multiple proteins [ , ] . conversely, most proteins are capable of interacting with multiple rnas [ ] . given the number of rnas and rna-binding proteins, the number of possible rpis is enormous. high-throughput sequencing methods have accumulated huge amount of rna-protein binding experimental data and opened new possibilities to measure and understand rna-protein interactions by computational methods. most of the previous computational works on rpis focus on the prediction of rna-binding proteins or rna-binding residues in a protein sequence [ , [ ] [ ] [ ] . to our knowledge, very limited works have been developed to predict the specific associations between rnas and proteins, which play a critical role in post-transcriptional gene regulatory networks. complex networks of rpis mediate post-transcriptional gene regulation and therefore prediction of rpis helps us to gain insight into regulatory networks [ , ] . the work presented here provided a computational method, called rpirls, to classify rna-protein pairs as interacting or non-interacting by integrating a sequence-based derived kernel with regularized least squares. the derived kernel exploited the contextual information around an amino acid or a nucleic acid as well as the repetitive conserved motif information. our results demonstrated that only the sequence structures of rnas and proteins provide sufficient information to accurately predict rna-protein interactions, especially long non-coding rna-protein interactions. specifically, the rpirls classifier considered each protein sequence comprising up to diverse amino acids, while the rpirls- g classifier encoded each protein sequence by using the -letter reduced alphabets according to amino acid physiochemical properties. the computational results showed that the rpirls classifier was superior to the rpirls- g classifier in reliability and effectiveness, indicating that the diversity of amino acids at a sequence has critical impact on the function of rna-binding proteins. on two non-redundant benchmark data sets extracted from the pridb, the rpirls method outperformed rpi-pred and ipminer in terms of accuracy, specificity and sensitivity. compared with rpi-pred and ipminer, the rpirls method obtained a reasonable sensitivity at a lower false positive rate. further, rpirls achieved an accuracy of % compared to % for rpi-pred on the prediction of lncrna-protein interactions. the rpirls method can be extended to construct rna-protein interaction networks and therefore helps us to gain insights into post-transcriptional gene regulation. the reason for the good performance of the proposed method may be due to several factors. firstly, the use of similarity scores is a significant conceptual change in protein/rna evaluation, quantifying the overall similarity between proteins, rnas and their interactions. combining kernels by tensor product for the set of rna-protein pairs allowing to share information across the rna-binding proteins considerably improved the prediction, especially in the case of rnas with few known binding proteins. secondly, we have found that contiguous k-mer frequencies alone captured rich statistical information on the repetitive conserved motif of rna-protein pairs and the diversity of amino acids at a sequence has also contributed to an observed improvement contrast to rpi-pred which just applied -letter frequency for both protein and rna. finally, a kernel works as a measure of similarity and supports the application of powerful machine learning algorithms such as regularized least squares which we used in this paper. the rls mehod enables us to efficiently search for an optimized parameter λ at essentially no additional cost [ ] . further, our model was trained on a large data set which contained rna-protein pairs, and yielded more robust results. in contrast, ipminer had much more model parameters to fit as combining deep learning with stacked ensembling, however, was trained on a small data set of just rna-protein pairs, and thus showed a significantly positive bias on predicting ncrna-protein interactions. the main disadvantage of the proposed method is that the method is purely data-driven, in the sense that it relies solely on information derived from amino acid sequences and nucleic acid sequences, and thus does not consider higher structural information of protein and rna. while this may be seen as an advantage, since it can predict any rna-protein pair of known sequences. the increase of the number of protein-rna complexes in protein data bank [ ] has opened possibilities for researchers to develop secondary data bases and to gain valuable insight into the structure and function of these complexes. the protein-rna interface data base (pridb) v . [ ] identifies interfacial residues in rna-protein complexes and also calculates atomic distances between interfacial residues. the rb and rb are two precalculated data sets in the pridb, which respectively consist of and rna-binding protein chains. after combining the rb and rb data sets, we obtained a total of experimentally validated non-redundant rna-protein pairs, which had at least two atoms respectively coming from rna and protein with distance no more than Å. next, we removed redundant rna-protein pairs, which are the same protein chains interacting with the same rna chains. further, we removed those rna-protein pairs with amino acid sequence length < or nucleic acid sequence length < . finally, we obtained a positive sample set which consisted of experimentally validated rna-protein pairs. so far there are no definite negative samples of rna-protein interactions that are available. to construct a balanced negative sample set ("rna-protein non-interacting pairs"), we made it by randomly permute the proteins in the positive sample set but kept the rna fixed. we repeated the permutation process until no negative pairs existed in the positive sample set. as a result, the training set, called rpi , was composed of rna-protein interacting pairs and rna-protein non-interacting pairs. several data sets were employed to evaluate the performance of the proposed methods. our rpirls method was first evaluated using two popular non-redundant data sets of rpis studied in [ ] . the rpi data set consisted of experimentally validated rna-protein pairs extracted from the pridb data base. while the rpi data set eliminated all rpi pairs with ribosomal proteins or ribosomal rnas from the rpi data set. to avoid overlapping between training and testing data sets, those rpis overlapping the training data were removed, leaving the rpi data set of rpis and the rpi data set of rpis. their corresponding negative pairs were generated by following the same steps as developing the training negative pairs. next, we tested the performance of the rpirls method on a large scale data set extracted from the rna-protein interaction data base (rpintdb) (http://pridb.gdcb.iastate.edu/rpiseq/download.php). this data set consisted of , experimentally validated rpis from several sources, including the rpidb, npinter data base and high-throughput experiments published in literature. the fourth data set were extracted from the npinter data base which we called nrpi . the nrpi data set consisted of , experimentally validated ncrna-protein pairs from six model organisms, i.e., caenorhabditis elegans, drosophila melanogaster, escherichia coli, homo sapiens, mus musculus and saccharomyces cerevisiae. we constructed the fifth data set called lnrpi by extracting only lncrna-protein pairs from the npinter data base. this data set contains , experimentally validated lncrna-protein pairs. in this paper, we proposed two classifiers for predicting rpis based on different representations of protein sequences. for the rpirls classifier, each amino acid sequence comprised up to different amino acids. while for the rpirls- g classifier, we adopted the same amino acid classification approach as [ , ] . the derived kernel was proposed by smale et al. [ ] on images inspired by neuroscience of visual cortex. in what follows, we briefly described the construction of derived kernel on sequences. suppose a is a finite set called the alphabet. in the work here a is the set of amino acids (for rpirls), alphabets (for, rpirls- g) or nucleic acids. let a = a and define a k+ = a k × a recursively for any k ∈ n. we say s is a string if s ∈ ∪ ∞ k= a k , and s = (s , . . . , s k ) is a k-mer (e.g., a sequence of length k) if s ∈ a k for some k ∈ n with s i ∈ a . the process of computing the derived kernel mainly includes three steps as below: step . set an initial kernelk at the first layer. here the initial kernel is defined as: where x, y ∈ a k . x = {x , . . . , x k } and y = {y , . . . , y k } are substrings of the same length k. x = y if and only if x i = y i for i = , . . . , k. step . let f = ( f , . . . , f n ), denote by | f | the length of f , so here | f | = n. then define the second layer neural response of f at t : where t is the template set at the first layer, here we consider all possible substrings of length k making up the template set t , so here t = a k . h is the transformation set at the first layer. step . compute the second layer derived kernel by normalizing the inner product of two neural responses: where ·, · l (t ) denotes the l inner product with respect to the uniform measure |t | ∑ t∈t δ t , where |t | is the cardinality of the template set t and δ t is the dirac measure; with correlation normalization: . this process can continue if appropriate higher level templates are defined. at each layer (local) derived kernels are built by recursively pooling over previously defined local kernels. here, for the -layer derived kernel, pooling is accomplished by taking an average over a set of transformations which calculating the frequency of a template that occurs in a sequence. in this paper we deal with inner product kernels k which satisfies the mercer condition, are known to be an instance of reproducing kernels. next with correlation normalization,k is also a reproducing kernel andk(x, x) = for all x ∈ x. the kernel function is symmetric (i.e., k( f , g) = k(g, f )), and non-negative (i.e., k( f , f ) ≥ ), therefore it can be interpreted as a measure of similarity. we first apply the kernel to the set r which contains nucleic acid sequences, and denote it byk r , and then apply the kernel to the set of amino acid sequences p, denote it byk p , and lastly combine two kernels in a natural way by tensor product for the set of rna-protein pairs . the reproducing kernel for two rna-protein pairs (r, p), (r , p ) ∈ r × p is defined by: k dk ((r, p), (r , p )) =k r (r, r )k p (p, p ). since bothk r (r, r ) andk p (p, p ) are positive definite kernels,k dk ((r, p), (r , p )) is obviously a positive definite kernel too [ ] . after combining the kernel with other kernel-based supervised learning algorithm, we can predict rpis to any rna-protein pairs with known primary sequences. the rls algorithm is one of the most widely used models for regression. let k be a kernel on a finite set x. write h k to denote the inner product space of functions on x defined by k. supposez = {(x i , y i )} m i= is a sample set (called the training set) with x i ∈ x and y i ∈ r for each i. the rls can be written as follows: we integrated rls with the combined kernel k =k dk , hence the main construction is to computē herein, we aim to develop a novel method to distinguish rna-protein interaction pairs from non-rna-protein interaction pairs. therefore, for the binary classification case with y i ∈ {− , } for each i, iff ≤ , the predicted class is − ( denotes non-interaction), otherwise it is (denotes interaction). one important step of rls is to find a "good" value of the regularization parameter λ > in equation ( ). they were selected from an optional set Λ by leave-one-out cross-validation [ ] on the training data. we never used testing data for parameter selection which is under the risk of over-fitting. the sensitivity (se) and specificity (sp) are used to measure the ability of identifying positive and negative instances, respectively. they are defined by the accuracy which is used to measure the prediction quality, is defined by accuracy = tp + tn tp + tn + fp + fn . the auc (area under the receiver operating characteristic curve) is further employed to measure the predictive performance, which is for perfect prediction and . for random prediction. rna regulons: coordination of post-transcriptional events rpicool: a tool for in silico rna-protein interaction detection using random forest a census of human rna-binding proteins sequence-specific interaction of r coat protein with its ribonucleic acid binding site rna-rna and rna-rotein interactions in coronavirus replication and transcription diverse roles of host rna binding proteins in rna virus replication rna-protein interactions in human health and disease the three-dimensional structure of the ribosome and its components ribosomal protein structures: insights into the architecture, machinery and evolution of the ribosome emerging roles of rna and rna-binding protein network in cancer cells rna processing and its regulation: global insights into biological networks long noncoding rnas in interaction with rna binding proteins in hepatocellular carcinoma long noncoding rnas: functional surprises from the rna world functional demarcation of active and silent chromatin domains in human, hox loci by noncoding rnas par-clip analysis uncovers, auf impact on target rna fate and genome integrity jarid is implicated in the initial xist-induced targeting of, prc to the inactive x chromosome long non-coding rna gas inhibited hepatitis c virus replication by binding viral ns protein rip-chip: the isolation and identification of mrnas, micrornas and protein components of ribonucleoprotein complexes from cell extracts hits-clip yields genome-wide insights into brain alternative rna processing transcriptome-wide identification of rna-binding protein and microrna target sites by par-clip protein-rna interactions: structural analysis and functional classes advances in rip-chip analysis: rna-binding protein immunoprecipitation-microarray profiling quantitative analysis of rna-protein interactions on a massively parallel array reveals biophysical and evolutionary landscapes in silico characterization and prediction of global protein-mrna interactions in yeast predicting protein associations with long noncoding rnas predicting rna-protein interactions using only sequence information de novo prediction of rna-protein interactions from sequence information computational prediction of associations between long non-coding rnas and proteins predicting ncrna-protein interaction using sequence and structural information ipminer: hidden ncrna-protein interaction sequential pattern mining with stacked autoencoder for accurate computational prediction applications of regularized least squares to pattern classification simulations of the dynamics at an rna-protein interface prediction of rna-binding proteins from primary sequence by a support vector machine approach prediction of rna binding sites in proteins from amino acid sequence a study of cross-validation and bootstrap for accuracy estimation and model selection the noncoding rnas and protein related biomacromolecules interaction database molecular mechanisms of long noncoding rnas function of lncrnas and approaches to lncrna-protein interactions principles and properties of eukaryotic mrnps transcriptome-wide analysis of protein-rna interactions using high-throughput sequencing a neural network method for identification of rna-interacting residues in protein a server for the computational prediction of rna-binding residues in protein sequences prediction of protein-rna binding sites by a random forest method with combined features dissecting the expression dynamics of rna-binding proteins in posttranscriptional regulatory networks deciphering the role of rna-binding proteins in the post-transcriptional control of gene expression the protein data bank pridb: a protein-rna interface database introduction to the peptide binding problem of computational immunology: new results. found generalized cross-validation as a method for choosing a good ridge parameter this article is an open access article distributed under the terms and conditions of the creative commons attribution (cc by) license author contributions: w.s. and j.x. conceived and designed the experiments; w.s., w.c. and j.z. performed the experiments; w.s. and w.c. analyzed the data; w.s. and d.c. contributed web tools; w.s. and j.x. wrote the paper. the authors declare no conflict of interest. key: cord- -ampip od authors: bagowski, christoph p; bruins, wouter; te velthuis, aartjan j.w title: the nature of protein domain evolution: shaping the interaction network date: - - journal: curr genomics doi: . / sha: doc_id: cord_uid: ampip od the proteomes that make up the collection of proteins in contemporary organisms evolved through recombination and duplication of a limited set of domains. these protein domains are essentially the main components of globular proteins and are the most principal level at which protein function and protein interactions can be understood. an important aspect of domain evolution is their atomic structure and biochemical function, which are both specified by the information in the amino acid sequence. changes in this information may bring about new folds, functions and protein architectures. with the present and still increasing wealth of sequences and annotation data brought about by genomics, new evolutionary relationships are constantly being revealed, unknown structures modeled and phylogenies inferred. such investigations not only help predict the function of newly discovered proteins, but also assist in mapping unforeseen pathways of evolution and reveal crucial, co-evolving inter- and intra-molecular interactions. in turn this will help us describe how protein domains shaped cellular interaction networks and the dynamics with which they are regulated in the cell. additionally, these studies can be used for the design of new and optimized protein domains for therapy. in this review, we aim to describe the basic concepts of protein domain evolution and illustrate recent developments in molecular evolution that have provided valuable new insights in the field of comparative genomics and protein interaction networks. the protein universe is the collection of proteins of all biological species that exist or have once existed on earth [ ] . our sampling and understanding of it began over half a century ago, when the first peptide and protein sequences were determined by sanger [ , ] and, subsequently, the sequencing of rna and dna [ ] [ ] [ ] . in the meantime, the genome projects of the last decade have uncovered an overwhelming amount of sequence data and researchers are now starting to address a series of fundamental questions that should shed light onto protein evolution processes [ ] [ ] [ ] [ ] . for instance, how many gene encoding sequences are present in one genome? how many sequences are repetitive and are these sequences similar in the various organisms on earth? which genes were involved in the large scale genome duplications that we see in animals? a comparison of sequences for evolutionary insight is best achieved by looking at the structural and functional (sub)units of proteins, the protein domains. by convention, domains are defined as conserved, functionally independent protein sequences, which bind or process ligands using a core structural motif [ ] [ ] [ ] . examples of domain modes of actions in signaling cascades for instance, are to connect different components into a larger complex or to bind signaling-molecules [ , ] . protein domains can usually fold independently, likely due to their relatively limited size, and are well known to behave as independent genetic elements within genomes [ , ] . the sum of these features makes protein domains readily identifiable from raw nucleotide and amino acid sequences and many protein family resources (e.g., superfamily and smart [see table ]) indeed fully rely on such sequence similarity and motif identifications [ , ] . the algorithms that are used for domain identification are built around a set of simple assumptions that describe the process of evolution. in general, evolution is believed to form and mold genomes largely via three mechanisms, namely i) chemical changes through the incorporation of base analogs, the effects of radiation or random enzymatic errors by polymerases, ii) cellular repair processes that counter mutations, and iii) selection pressures that manifest themselves as the positive or negative influence that determines whether the mutation will be present in subsequent generations [ , ] . by definition, each of these phenomena styles, reproductive strategies, or the lack of apparent polymerase-dependent proofreading such as in positivestranded rna viruses [ ] [ ] [ ] [ ] . consequently, substitution rates need therefore be calculated to correctly compare two or more sequences and hunt uncharted genomes for comparable domains. particularly this last strategy, using general rate matrices like blosom and pam, is an elegant example of how new protein functions can be discovered [ ] [ ] [ ] [ ] [ ] . fast algorithms for pair-wise alignments can be found in the basic local alignment search tool (blast), whereas multiple sequence alignments (msas, fig. a) in which multiple sequences are compared simultaneously are commonly created with for example clustalx and muscle (see table ) [ ] [ ] [ ] [ ] . close relatives, sharing an overall sequence identity above for example % and a set of functional properties, can also be grouped into families and subfamilies. in turn, these families share also evolutionary relationships with other domains and form together so-called domain superfamilies [ , ] . evolutionary distances between related domain sequences can easily be estimated from sequence alignments, provided that the correct rate assumptions are made. subsequently, these can be used to compute the phylogenies of the domain that share an evolutionary history. these, often tree-like graphs (fig. b) , depend heavily on rate variation models, such as molecular clocks or relaxed molecular clocks (e.g., maximum likelyhood and bayesian estimation), which are calibrated with additional evidence fig. ( a) . it was computed using bayesian estimation and presents the best-supported topology for the alignment. numbers indicate % support by the two methods used, while # indicates gene duplication events in the common ancestor and * marks a species-specific duplication event. for computational details, please see [ ] . such as fossils and may therefore also provide valuable information on aspects like divergence times and ancestral sequences [ ] [ ] [ ] . commonly used phylogenetic analysis strategies are listed in table . a limitation of all inferred phylogenetic data is that it is directly dependent on the alignment and less so on the programs used to build the phylogenetic tree [ ] . one of the shortcomings of automated alignments may thus derive from the fact that they commonly employ a scoring and penalty procedure to find the best possible alignment, since these parameters vary from species to species [ , ] , as mentioned above. careful inspection of alignments is therefore advisable, even though software has been developed that combines the alignment procedure and phylogenetic analysis iteratively in one single program [ ] . although sequence and phylogenetic analysis provide a relatively straightforward way for looking at domain divergence, comparison of solved protein structures has shown that protein tertiary organizations are much more conserved (> %) than their primary sequence (> %) [ ] . for this reason, protein structures and their models provide significantly more insight into the relations of protein domains and how domain families diverged [ ] . for example, the inactive guanylate kinase (gk) domain present in the maguk family was shown to originate from an active form of the gk domain residing in ca + channel beta-subunits (cacnbs) through both sequence and structural comparison [ ] . furthermore, identification of functionally or structurally related amino acid sites in a fold sheds light on the complex, co-evolutionary dynamics that took place during selection [ ] . as described above, the evolution of a protein domain is generally the result of a combination of a series of random mutations and a selection constraint imposed on function, i.e., the interaction with a ligand. the interaction between protein and ligand can be imagined as disturbances of the protein's energy landscape, which in turn bring about specific, three-dimensional changes in the protein structure [ , ] . binding energies however, need not be smoothly distributed over the protein's binding pocket as a limited number of amino acids may account for most of the free-energy change that occurs upon binding [ ] [ ] [ ] . in these cases, new binding specificities (including loss of binding) may therefore arise through mutations at these hot spots. an example is a recent study of the pdz domain in which it was shown that only a selected set of residues, and in particular the first residue of -helix ( b ), directly confers binding to a set of c-terminal peptides [ ] . the folding of a domain is essentially based on a complex network of sequential inter-molecular interactions in time [ ] . this has of course significant implications for domain integrity, particularly if one assumes that the core of a protein domain is and has to be largely structurally conserved. indeed, even single mutations that arise in this area may easily derail the folding process, either because their free energy contribution influences residues in the direct vicinity or disturbs connections higher up in the intermolecular network [ ] . it is therefore hypothesized that protein evolution took place at the periphery of the protein domain core, and that gradual changes via point mutations, insertions and deletions in surface loops brought about the evolutionary distance we see among proteins to date [ , [ ] [ ] [ ] . however, distant sites also contribute to the thermodynamics of catalytic residues. this is achieved through a mechanism called energetic coupling, which is shaped by a continuous pathway of van der waals interactions that ultimately influences residues at the binding site with similar efficiency as the thermodynamic hotspots [ , ] . indeed in such cases, evolutionary constraints are not placed on merely one amino acid in the binding pocket, but on two or more residues that can be shown to be statistically coupled in msas [ , ] . in addition to contributions to binding, these principles also explain why the core of a domain structure will remain largely conserved, while at functionally related places residues can (rapidly) co-evolve with an overall neutral effect [ ] . of course, these aspects of co-evolution are also of practical consequence for structure prediction and rational drug design [ ] . through selective mutation, protein domains have been the tools of evolution to create an enormous and diverse assembly of proteins from likely an initially relatively limited set of domains. the combined data in genbank and other databases now covers over . species with at least complete genomes and this greatly facilitates genome comparisons [ ] [ ] [ ] . following such extensive comparisons, currently > domain superfamilies are recognized in the recent release of the structural classification of proteins (scop) [ ] and it has become clear that many proteins consist of more than one domain [ , , ] . indeed, it has been estimated that at least % of the domains is duplicated in prokaryotes, whereas this number may even be higher in eukaryotes, likely reaching up to % [ ] . there are various mechanisms through which protein domain or whole proteins may have been duplicated. on the largest scale, whole genome duplication such as those seen in the vertebrate genomes duplicated whole gene families, including postsynaptic proteins, hormone receptors and muscle proteins, and thereby dramatically increased the domain content and expanded networks [ , , ] . on the other end of the scale, domains and proteins have been duplicated through genetic mechanisms like exon-shuffling, retrotranspositions, recombination and horizontal gene transfer [ ] [ ] [ ] . since the genetic forces, like exon-shuffling and genome duplication vary among species, the total number of domains and the types of domains present fluctuate per genome. interestingly, comparative analyses of genomes have shown that the number of unique domains encoded in organisms is generally proportional to its genome size [ , ] . within genomes, the number of domains per gene, the socalled modularity, is related to genome size via a power-law, which is essentially the relation between the frequency f and an occurrence x raised by a scaling constant k (i.e., f (x) x k ) [ , ] . a similar correlation is found when the multi-domain architecture is compared to the number of cell types that is present in an organism, i.e., the organism complexity or when the number of domains in a abundant superfamily is plotted against genome size (fig. ) [ , ] . given the amount of domain duplication and apparent selection for specific multi-domain encoding genes in, for example, vertebrates, it may come as little surprise that not all domains have had the same tendency to recombine and distribute themselves over the genomes [ , ] . in fact, some are highly abundant and can be found in many different multi-domain architectures, whereas others are abundant yet confined to a small sample of architectures or not abundant at all [ , ] . is there any significant correlation between the propensity to distribute and the functional roles domains have in cellular pathways? some of the most abundant domains can be found in association with cellular signaling cascades and have been shown to accumulate non-linearly in relation to the overall number of domains encoded or the genome size [ ] . additionally, the on-set of the exponential expansion of the number of abundant and highly recombining domains has been linked to the appearance of multicellularity [ ] . a reoccurring theme among these abundant domains is the function of protein-protein interaction and it appears that particularly these, usually globular domains, have been particularly selected for in more complex organisms [ ] . this positive relation is underlined by the association of these abundant domains with disease such as cancer and gene essentiality as the highly interacting proteins that they are part of have central places in cascades and need to orchestrate a high number of molecular connections [ , ] . their shape and coding regions, which usually lie within the boundaries of one or two exons, make them ideally suited for such a selection, since domains are most frequently gained through insertions at the n-or c-terminus and through exon shuffling [ ] [ ] [ ] . from a mutational point of view, protein-protein interaction domains are different from other domains as well and this appears to be particularly true for the group of small, relatively promiscuous domains like sh and pdz. these domains are promiscuous in the sense that they both tend to physically interact with a large number of ligands [ , ] and are prone to move through the genome to recombine with many other domains. it has been found that particularly these domains evolve more slowly than non-promiscuous domains [ ] . this likely stems from the fact that they are required to participate in many different interactions, which makes selection pressures more stringent and the appearance of the branches on phylogenetic trees relatively short and more difficult to assess when co-evolutionary data in terms of other domains in the same gene family or expression patterns is limited [ , ] . non-promiscuous domains on the other hand can quite easily evade the selection pressure by obtaining compensatory mutations either within themselves or their specific binding partner [ ] . the overall phenomenon that the number of protein domains and their modularity increases as the genome expands has not been linked to a conclusive biological explanation yet. a rationale for the increase in interactions and functional subunits, however, may derive from the paradoxical absence of correlation between the number of genes encoded and organism complexity, the so-called g-value paradox [ ] . there is indeed evidence that domains involved in the same functional pathway tend to converge in a single protein sequence, which would make pathways more controllable and reliable without the need for supplementary genes [ ] . additionally, the number of different arrangements found in higher eukaryotes is, given the vast scale of unique domains present, relatively limited. this in turn implies that evolutionary constraints have played an important role in selecting the right domain combinations and the right order from n-to c-terminus in multi-domain proteins [ , ] . in fact, the ordering and co-occurrence of domains was demonstrated to hold enough evolutionary information to construct a tree of life similar to those based on canonical sequence data [ ] . furthermore, the increased use of alternative splicing and exon skipping in higher eukaryotes likely supplied a novel way of proteome diversification by restricting gene duplication and stimulating the formation of multi-domain proteins [ , ] . in plants, however, the latter notion is not supported since both mono-and dicots show limited alternative splicing and a more extensive polyploidy [ ] [ ] [ ] . it is clear that some of the above characteristics are underappreciated in the phylogenetic analysis of linear amino acid sequences. moreover, the effects of evolution extend even further than these aspects and entail transcriptional and translational regulation, intramolecular domain-domain interactions, gene modifications and post-translational protein modifications [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] . new methods are thus being developed to take into account that when sequences evolve, their close and distant functional relationships evolve in parallel. correlations of mutations have already been found between residues of different proteins [ , ] and compensating mutational changes at an interaction interface were shown to recover the instability of a complex [ ] . these observations are evidence for the current evolutionary models for the protein-protein interaction (ppis) networks that are being constructed through large-scale screens [ ] [ ] [ ] . in these, a gene duplication or domain duplication (depending on the resolution of the network) implies the addition of a node, while the deletion of a gene or domain reduces the amount of links in the network (fig. ) . in the next step, extensive network rewiring may take place, driven by the effect of node addition or node loss in the network (i.e., the duplicability or essentiality of a domain/protein) and mutations in the domain-interaction interface [ , , [ ] [ ] [ ] . beyond mutations at the domain and protein level, regulation of protein expression provides another vital mechanism through which protein networks can evolve. microarray studies are now well under way to map genome-wide ex-pression levels of related and non-related genes under a variety of conditions [ , [ ] [ ] [ ] . for example, transcriptional comparisons have investigated aging [ ] and pathogenicity [ ] . unfortunately, given the highly variable nature of gene expression and the fact that different species may respond different to external stimuli, such comparisons can only be performed under strictly controlled research conditions. to date most studies have therefore focused on the embryogenesis, metamorphosis, sex-dependency and mutation rates of subspecies [ , [ ] [ ] [ ] [ ] . other studies have revealed valuable information on promoter types and duplication events [ ] [ ] [ ] [ ] . to overcome the limitations mentioned in the previous paragraph, the analysis of co-expression data has been developed to supplement the direct comparison of individual gene expression changes [ ] . in this procedure, a coexpression analysis of gene pairs within each species precedes the cross-comparison of the different organisms in the study. this approach thus primarily focuses on the similarity and differences of the orthologous genes within network, and is therefore ideally suited for the study of protein domain evolution and has already revealed that species-specific parts fig. ( ) . evolutionary models for protein-protein interactions. the evolution of protein networks is tightly coupled to the addition or deletion of nodes. additionally, events that introduce mutations in binding interfaces of proteins may result in the addition or loss of links in the network. node addition may take place through e.g., domain duplication or horizontal gene transfer, while rewiring of the network is mediated by point mutations, alternative splice variants and changes in gene expression patterns. of an expression network resulted via a merge of conserved and newly evolved modules [ , , ] . finding evolutionary relationships protein domains is mostly based on orthology and thus commonly performed on best sequence matches. identifying these and categorizing them depends largely on multiple sequence alignments and this will in most cases give good indications for function, fold and ultimately evolution. however, this approach usually discards apparent ambiguities that arise from speciesspecific variations (e.g., due to population size, metabolism or species-specific domain duplications or losses) and may therefore introduce significant biases [ ] . biases may also derive from the method of alignment, the rate variation model used to infer the phylogeny, and the sample size used to build the alignment [ , , ] . care should therefore be taken to not regard orthology as a one-to-one relationship, but as a family of homologous relations [ ] , to select for appropriate analysis methods [ , ] and extend comparative data to protein interactions and expression profiles [ ] . indeed, as our wealth of biological information expands, our systems perspective will improve and provide us with an opportunity to reveal protein domain evolution at the level network organization and dynamics. large-scale expression studies are beginning to show us evolutionary correlations between gene expression levels and timings [ , , , , ] , while others demonstrate spatial differences between paralogs or (partial) overlap between interaction partners [ ] [ ] [ ] [ ] . indeed, when we are able to map the spatiotemporal aspects of inter-and intra-molecular interactions we will begin to fully understand the versatile power of evolution that shaped the protein universe and life on earth [ ] . phylogenetic continuum indicates "galaxies" in the protein universe: perliminary results on the natural group structures of proteins the chemistry of amino acids and proteins some peptides from insulin nucleotide sequence from the coat protein cistron of r bacteriophage rna use of dna polymerase i primed by a synthetic oligonucleotide to determine a nucleotide sequenc of phage fl dna dna sequencing with chain-terminating inhibitors the genome sequence of drosophila melanogaster flybase: genomes by the dozen initial sequencing and comparative analysis of the mouse genome insights into social insects from the genome of the honeybee apis mellifera selectivity and promiscuity in the interaction network mediated by protein recognition modules modular peptide recognition domains in eukaryotic signaling the multiplicity of domains in proteins the modular nature of apoptotic signaling proteins regulatory potential, phyletic distribution and evolution of ancient, intracellular smallmolecule-binding domains protein families and their evolution: a structural perspective the folding and evolution of multidomain proteins the superfamily database in : families and functions smart: identification and annotation of domains from signalling and extracellular protein sequences comparative genomics: genome-wide analysis in metazoan eukaryotes distribution of indel lengths heterogeneity of nucleotide frequencies among evolutionary lineages and phylogenetic inference review of concepts, case studies and implications who do species vary in their rate of molecular evolution infidelity of sars-cov nsp -exonuclease mutant virus replication is revealed by complete genome sequencing who's your neighbor? new computational approaches for functional genomics protein function in the post-genomic era the role of pattern databases in sequence analysis gene ontology: tool for the unification of biology unique and conserved features of genome and proteome of sarscoronavirus, an early split-off from the coronavirus group lineage phylip version . . deptartment of genetics gapped blast and psi-blast: a new generation of protein database search programs the clustal_x windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools comparison of methods for searching protein sequence databases an insight into domain combinations evolutionary trees from dna sequences: a maximum likelihood approach mrbayes: bayesian inference of phylogenetic trees mammalian evolution and biomedicine: new views from phylogeny multiple sequence alignment: in pursuit of homologous dna positions bayesian coestimation of phylogeny and sequence alignment the relation between the divergence of sequence and structure in proteins molecular evolution of the maguk family in metazoan genomes why should we care about molecular coevolution the propagation of binding interactions to remote sites in proteins: analysis of the binding of the monoclonal antibody d . to lysozyme structural stability of binding sites: consequences for binding affinity and allosteric effects revealing the architecture of a k+ channel pore through mutant cycles with a peptide inhibitor structural plasticity in a remodeled protein-protein interface a specificity map for the pdz domain family the linkage between protein folding and functional cooperativity: two sides of the same coin? empirical and structural models for insertions and deletions in the divergent evolution of proteins analysis of insertions/deletions in protein structures structural similarity of loops in protein families: toward the understanding of protein evolution the effect of inhibitor binding on the structural stability and cooperativity of the hiv- protease evolutionary conserved pathways of energetic connectivity in protein families how frequent are correlated changes in families of protein sequences? an improved method for determining codon variability in a gene and its application to the rate of fixation of mutations in evolution evolution of vertebrate genes related to prion and shadoo proteins--clues from comparative genomic analysis evolutionarily conserved elements in vertebrate, insect, worm, and yeast genomes data growth and its impact on the scop database: new developments estimating the number of protein folds and families from complete genome data insights into the molecular evolution of the pdz-lim family and indentification of a novel conserved protein motif independent elaboration of steroid hormone signaling pathways in metazoans integration of horizontally transferred genes into regulatory interaction networks takes many million years prokaryotic evolution in light of gene transfer how the global structure of protein interaction networks evolves the impact of comparative genomics on our understanding of evolution modular genes with metazoan-specific domains have increased tissue specificity evolution of protein domain promiscuity in eukaryotes the structure of the protein universe and genome evolution modules, multidomain proteins and organismic complexity detecting protein function and protein-protein interaction from genome sequences lethality and centrality in protein networks comparative genomics of centrality and essentiality in three eukaryotic protein-interaction networks domain deletions and substitutions in the modular protein evolution genome evolution and the evolution of exon-shuffling-a review significant expansion of exon-bordering protein domains during animal proteome evolution thermodynamic basis for promiscuity and selectivity in protein-protein interactions: pdz domains, a case study promiscuous binding nature of sh domains to their target proteins expansion of genome coding regions by acquisition of new genes the geometry of domain combination in proteins different levels of alternative splicing among eukaryotes how did alternative splicing evolve? alternative splicing and gene duplication are inversely correlated evolutionary mechanisms polyploidy and genome evolution in plants comparative analysis indicates that alternative splicing in plants has a limited role in functional expansion of the proteome structural characterization of the intramolecular interaction between the sh and guanylate kinase domains of psd- identification of an intramolecular interaction between the sh and guanylate kinase domains of psd- interplay of pdz and protease domain of degp ensures efficient elimination of misfolded proteins comparative biology: beyond sequence analysis a genetic signature of interspecies variations in gene expression genome-wide scan reveals that genetic variation for transcriptional plasticity in yeast is biased towards multi-copy and dispensable genes identification of tightly regulated groups of genes during drosophila melanogaster embryogenesis a gene-coexpression network for global discovery of conserved genetic modules similarities and differences in genome-wide expression data of six organisms accurate prediction of proteinprotein interactions from sequence alignments using a bayesian method correlated mutations contain information about protein-protein interaction mutually compensatory mutations during evolution of the tetramerization domain of tumor suppressor p lead to impaired hetero-oligomerization functional organization of the yeast proteome by systematic analysis of protein complexes a human protein-protein interaction network: a resource for annotating the proteome protein function, connectivity, and duplicability in yeast evolution and topology in the yeast protein interaction network modularity and evolutionary constraint on proteins comparing genomic expression patterns across species identifies shared transcriptional profile in aging genome-wide functional analysis of pathogenicity genes in the rice blast fungus a mutation accumulation assay reveals a broad capacity for rapid evolution of gene expression evolution of gene expression in the drosophila melanogaster subgroup sexdependent gene expression and evolution of the drosophila transcriptome microarray analysis of drosophila development during metamorphosis conservation and coevolution in the scale-free human gene coexpression network conservation and evolution of gene coexpression networks in human and chimpanzee brains cross-species sequence comparisons: a review of methods and available resources impact of taxon sampling on the estimation of rates of evolution at sites comparative genomics beyond sequence-based alignments: rna structures in the encode regions comparative analysis of splice form-specific expression of lim kinases during zebrafish development towards cellular systems in d gene expression map of the arabidopsis shoot apical meristem stem cell niche a gene expression map of arabidopsis thaliana development key: cord- -plptulfb authors: tilocca, bruno; soggiu, alessio; greco, viviana; piras, cristian; arrigoni, norma; ricchi, matteo; britti, domenico; urbani, andrea; roncada, paola title: immunoinformatic-based prediction of candidate epitopes for the diagnosis and control of paratuberculosis (johne’s disease) date: - - journal: pathogens doi: . /pathogens sha: doc_id: cord_uid: plptulfb paratuberculosis is an infectious disease of ruminants caused by mycobacterium avium subsp. paratuberculosis (map). map is an intracellular pathogen with a possible zoonotic potential since it has been successfully isolated from the intestine and blood of crohn’s disease patients.since no cure is available, after the detection of the disease, animal culling is the sole applicable containment strategy. however, the difficult detection of the disease in its subclinical form, facilitates its spread raising the need for the development of effective diagnosis and vaccination strategies. the prompt identification and isolation of the infected animals in the subclinical stage would prevent the spread of the infection.in the present study, an immunoinformatic approach has been used to investigate the immunogenic properties of map proteins. these proteins were chosen according to a previously published immunoproteomics approach. for each previously-described immunoreactive protein, we predicted the epitopes capable of eliciting an immune response by binding both b-cells and/or class i mhc antigens. the retrieved peptide sequences were analyzed for their specificity and cross-reactivity. the final aim is to employ the discovered peptides sequences as a filtered library useful for early-stage diagnosis and/or to be used in novel multi-subunit or recombinant vaccine formulations. bovine paratuberculosis, also known as johne's disease (jd) is an infectious disease of ruminants caused by mycobacterium avium subsp. paratuberculosis (map). it is characterized by chronic and progressive granulomatous enteritis. the infected animals initially show normal appetite and food consumption, but the intestinal wall thickening and the impaired nutrient absorption cause a reduced feed-conversion rate and a progressive weight loss. milk yield is also negatively affected by the progression of the infection. nevertheless, clinical manifestations do not involve all infected animals [ ] [ ] [ ] ; the subclinical stageof infection can last from to years [ ] and, despite the absence of visible symptoms, animals in this stage can shed map and spread the disease [ , [ ] [ ] [ ] . for these aforementioned reasons, this pathology leads to significantly increased veterinary costs worldwide [ , , ] . the causal agent of jd is map. it is considered a zoonotic pathogen [ ] because of its possible link with crohn's disease. map infection affects animals and there is considerable evidence that might be a co-cause of human crohn's disease [ ] . map isolation from the intestine and blood of crohn's disease patients has extensively documented. more precisely, map presence was found to be seven times higher in crohn's disease patients than what has been found in patients with any other bowel inflammation [ , ] . map also infected animals and crohn's disease patients show similar alterations of the immune system response reinforcing the hypothesis about the analogy between the two [ ] [ ] [ ] [ ] . map is a slow-growing bacterium, commonly acquired via the fecal-oral route by both animals and humans [ ] . despite the pathogenetic mechanism of map, infection has not been fully understood, it has been demonstrated that its acid resistance enables it to survive in the gastric environment, allowing its entrance in the intestinal tract. at the ileal level, map invades the lymphatic system overlying peyer's patches. this stimulates the host's immune response that, besides activating the humoral response, promptly phagocytizes map into macrophages [ ] . as an intracellular pathogen, map can either survive into macrophagic cells or being killed and disassembled to present its antigens to t-lymphocytes [ ] . evidence from multibacillary jd revealed a massive humoral antibody response along with a tendency to suppress the cell-mediated immune response [ , , ] . whereas, a recent comparative study between two groups of cows, one in the sub-clinical and the other in the clinical stage, highlighted an increased t-cell activity in the first group of animals compared to the second one [ ] . studies on cattle at the early stage of map infection revealed an upregulation of class i mhc molecules, suggesting a pivotal role of these molecules in the very beginning of the infective process [ ] ; this is of great interest for both diagnosis and prophylaxis-oriented studies. figure provides an overview of the major immunological mechanisms triggered bymap infections. to date, jd diagnosis relies on both direct (map culture, pcr, microarrays etc.) and indirect (elisa) detection of map from feces, milk and necroscopy-derived tissues. however, all the available diagnostic methods suffer from sensitivity (especially in the sub-clinical phase) that strongly reduce their robustness and efficient applicability on large-scale control programs. the failure to detect the subclinical infection makes it difficult to timely apply the control measures necessary to block the spread of the infection within the same, and to other, herds. a thorough comprehension of the etiopathogenetic mechanisms of map infection and host response would be beneficial for diverse research scenarios, providing guidance for the design of map-specific diagnostic tools capable of jd diagnosis in the subclinical phase. from this perspective, a previous study from our research group [ ] employed an immunoproteomic approach to investigate and select map-specific immunoreactive proteins. here, incubation of map proteome with sera from infected bovines highlighted several possible candidate immunoreactive proteins. these candidates represent a good starting point for an immunoinformatic analysis of their sequences in order to find the best immunoreactive sub-sequences and epitopes. this would provide a library of peptides that might be useful for novel prophylactic strategies and/or for their potential application in the immune-based detection of map. the rapid development of the bioinformatics tools and databases provides the possibility to detect the antigenic/epitopic regions of given protein sequences. this innovative strategy for the in-silicoanalysis is time-and cost-effective compared to the "old-fashioned" laboratory-based approach. recently, carlos et al. [ ] and rana et al. [ ] applied immunoinformatics-based studies to detect class ii mhc epitopes possibly useful for the control of jd in a rapid and cost-effective manner. over the last decade, these computational approaches lead to the achievement of successful epitopes prediction in several research fields as virology [ , ] , bacteriology and cancer research [ ] . in this study, previously-selected immunogenic proteins [ ] were studied via several immunoinformatics approaches aiming at the detection of the most promising peptide sequences useful for diagnostic purposes. the parameters taken into account were affinity for both the humoral antibody binding and the class i mhc molecule binding. we predicted the most suitable peptide sequences and discuss their potential employment in the design of innovative control measures against jd, with a specific focus on the early diagnosis of jd and/or potential use in novel specific vaccine formulations. pathogens , , x for peer review of over the last decade, these computational approaches lead to the achievement of successful epitopes prediction in several research fields as virology [ , ] , bacteriology and cancer research [ ] . in this study, previously-selected immunogenic proteins [ ] were studied via several immunoinformatics approaches aiming at the detection of the most promising peptide sequences useful for diagnostic purposes. the parameters taken into account were affinity for both the humoral antibody binding and the class i mhc molecule binding. we predicted the most suitable peptide sequences and discuss their potential employment in the design of innovative control measures against jd, with a specific focus on the early diagnosis of jd and/or potential use in novel specific vaccine formulations. the peptide sequences of the previously-identified immunoreactive proteins [ ] were used to recall the novel protein identifiers in the ncbinr protein database. because of the continuous evolution of the data repositories and the increasing knowledge on their entries, some protein accession numbers were re-classified into other identifiers. table summarizes the blast-based alignments of the peptides performed to line up the selected proteins to the current identification system. all pblast alignments matched at % with the reference protein kept. the low e-value of each alignment supports the attribution of the immunoreactive proteins to the novel identifiers. the peptide sequences of the previously-identified immunoreactive proteins [ ] were used to recall the novel protein identifiers in the ncbinr protein database. because of the continuous evolution of the data repositories and the increasing knowledge on their entries, some protein accession numbers were re-classified into other identifiers. table summarizes the blast-based alignments of the peptides performed to line up the selected proteins to the current identification system. all pblast alignments matched at % with the reference protein kept. the low e-value of each alignment supports the attribution of the immunoreactive proteins to the novel identifiers. once the updated protein identifiers are inferred, the major immunogenic domains of the selected proteins were predicted through an immunoinformatic approach. prediction of the linear b-epitopes provided a list of epitopes capable of eliciting antibody production (supplementary table s ). all the selected proteins showed relevant epitopes from an immunogenic point of view. a large number of short epitope sequences is predicted for each immunogenic protein; whereas, an average of six candidate epitopes (min -max ) meeting the threshold of a minimal length of aminoacids is predicted for each of the selected immunogenic proteins. figure depicts the ten immunogenic proteins of map along with the relative distribution of the predicted b-epitopes. whole protein calculated immunogenic potential based on the type-b epitopes prediction indicates the "hypothetical protein map_ c" (aas ) as the most immunogenic one. this evidence is supported by its highest number of predicted epitopes and the highest average epitope length ( figure ). on the other hand, the fructose-bisphosphate aldolase (eta ), reported the lowest number of predicted epitopes along with the lowest epitope sequence length. regardless of the number of predictions, candidate epitopes are evenly mapped over the full sequence length of the immunogenic proteins, suggesting a good versatility of the predicted sequences ( figure ). pathogens , , x for peer review of prediction of binding affinity for the diverse class i bolas histocompatibility antigens predicted a high number of peptide sites. the full list of class i mhc epitope prediction is provided in the supplementary table s . epitope prediction from the previously selected immunogenic proteins yielded a total of peptides, each of which scoring a peculiar binding affinity. peptides scoring a binding affinity among the top . % are considered as strong binders (sb); whereas, peptides with a percentile rank comprised between . % and % were labelled as weak binders (wb). sorting all the entries using the "sole" wb and sb resulted in a total of candidate epitopes when considering all the mhc haplotypes for the ten immunogenic proteins (supplementary table s ) . for a better evaluation of the most suitable map epitopes, we focused our attention towards the sequences that are most commonly recognized by the immune system effectors (i.e., bola haplotypes and, in turn, cd + t-cells). figure lists, for each of the tested immunogenic protein, the shared epitopes among the mhc haplotypes. pathogens , , x for peer review of prediction of binding affinity for the diverse class i bolas histocompatibility antigens predicted a high number of peptide sites. the full list of class i mhc epitope prediction is provided in the supplementary table s . epitope prediction from the previously selected immunogenic proteins yielded a total of peptides, each of which scoring a peculiar binding affinity. peptides scoring a binding affinity among the top . % are considered as strong binders (sb); whereas, peptides with a percentile rank comprised between . % and % were labelled as weak binders (wb). sorting all the entries using the "sole" wb and sb resulted in a total of candidate epitopes when considering all the mhc haplotypes for the ten immunogenic proteins (supplementary table s ) . for a better evaluation of the most suitable map epitopes, we focused our attention towards the sequences that are most commonly recognized by the immune system effectors (i.e., bola haplotypes and, in turn, cd + t-cells). figure lists, for each of the tested immunogenic protein, the shared epitopes among the mhc haplotypes. summarizes bolas haplotypes predicted to bind each sequence as means of a color scale that depends on the binding affinity. peptides "amlqdmail" belongs to heat shock protein (caa . ); "aqldlsnal" and "hmlrhqqir" belong to hypothetical protein map_ c (aas . ); "arleppgpl" belong to hypothetical protein map c (aas . ); "gvfnleatl" and "neraveeal" belong to the protein fixa (aas . ); "rlleiepal", "aagqigysl" and "alegvvmel" belong to the malate dehydrogenase protein (p . ); "amqtlpqvl" and "rmaqtgspl" belong to phosphoglucosamine mutase (q s . ); "nqielhpll", "teravsaal", "amdaceasl" and "tqsytplal" belong to uncharacterized oxidoreductase map_ (q vk . ); "emfdlihqm" and "amrkwessm" belong to fructose-bisphosphate aldolase (eta . ); "glyefftpl", "gmigltqal" and "tqmtaaipl" belong to -ketoacyl-acp reductase (ajk . ); "fitwrgipl" and "nqsaiatyl" are peptides of the hypothetical protein ega _ (azp . ). the vast majority of the listed epitopes are classified as sb; while eight of them, belonging to the proteins malate dehydrogenase (p ), uncharacterized oxidoreductase map_ (q vk ) and hypothetical protein ega _ (azp ), are to be considered as wb on the basis of their affinity rank. regardless of the binding affinity, all these sequences are predicted to be commonly bound by a plurality of mhc haplotypes. an average of (min -max ) suitable epitopes are selected for each of the tested protein. such epitopes are predicted to be recognized by five diverse bolas out of the six mhc haplotypes used for the computer-based prediction; except for the immunogenic proteins fixa (aas ) whose epitopes can be bound by four bolas out of six. the bola-hd , bola-jsp. and bola-t c are able to recognize all the selected epitopes sequences . selected class i mhc binding peptides. the heat map displays the selected t-epitopes and summarizes bolas haplotypes predicted to bind each sequence as means of a color scale that depends on the binding affinity. peptides "amlqdmail" belongs to heat shock protein (caa . ); "aqldlsnal" and "hmlrhqqir" belong to hypothetical protein map_ c (aas . ); "arleppgpl" belong to hypothetical protein map c (aas . ); "gvfnleatl" and "neraveeal" belong to the protein fixa (aas . ); "rlleiepal", "aagqigysl" and "alegvvmel" belong to the malate dehydrogenase protein (p . ); "amqtlpqvl" and "rmaqtgspl" belong to phosphoglucosamine mutase (q s . ); "nqielhpll", "teravsaal", "amdaceasl" and "tqsytplal" belong to uncharacterized oxidoreductase map_ (q vk . ); "emfdlihqm" and "amrkwessm" belong to fructose-bisphosphate aldolase (eta . ); "glyefftpl", "gmigltqal" and "tqmtaaipl" belong to -ketoacyl-acp reductase (ajk . ); "fitwrgipl" and "nqsaiatyl" are peptides of the hypothetical protein ega _ (azp . ). the vast majority of the listed epitopes are classified as sb; while eight of them, belonging to the proteins malate dehydrogenase (p ), uncharacterized oxidoreductase map_ (q vk ) and hypothetical protein ega _ (azp ), are to be considered as wb on the basis of their affinity rank. regardless of the binding affinity, all these sequences are predicted to be commonly bound by a plurality of mhc haplotypes. an average of (min -max ) suitable epitopes are selected for each of the tested protein. such epitopes are predicted to be recognized by five diverse bolas out of the six mhc haplotypes used for the computer-based prediction; except for the immunogenic proteins fixa (aas ) whose epitopes can be bound by four bolas out of six. the bola-hd , bola-jsp. and bola-t c are able to recognize all the selected epitopes sequences among the immunogenic proteins. on the other hand, the bola-t a is not showing any binding affinity to the epitope sequences; while bola-d . and bola-t b fail to bind the epitopes of aas protein (figure ). the class i mhc epitopes as of figure are further aligned against both the mycobacteria and cow databases to assess the specificity of the predicted epitope sequences for map. complete list of alignments is available in the supplementary table s . sequence alignment highlighted that the peptide amdaceasl and amrkwessm respectively of the "uncharacterized oxidoreductase map_ " (q vk ) and "fructose-bisphosphate aldolase" (eta ) proteins are the most specific for map. specifically, amdaceasl scores % identity with the map and the mycobacterium aviumcomplex (mac); whereas, hits with other mycobacteria specimens are featured by a lower sequence identity (below %) and a far higher e-value when compared with map and mac hits ( . vs. . , supplementary table s ) . similarly, the peptide amrkwessm scores % sequence identity with map and mac and only less than % of sequence similarity is scored by the alignments with other mycobacteria. the e-value supports the robust alignment against the map and mac (e-value . ) in spite of the other alignments (e-value > ) further supporting the hypothesis on the specificity of this peptide sequence (supplementary table s ). concerning the alignment of the peptides against the cow database, both amdaceasl and amrkwessm did not score relevant matches with any of the cow proteins. several hits were matching with discontinuous sequences of the cow proteins database with high e-values (supplementary table s , topic better commented in the discussion section). the host's immune response to map infection is complex and heterogeneous. debates on the sequelae of immunological events following map infection are currently ongoing. nevertheless, it seems widely accepted that the early stage of the infection elicits an important cell-mediated response. once map is phagocytized, its antigen presentation is accomplished through the loading of the processed antigen onto mhc molecules. the bovine mhc genes complex (i.e., bovine leukocyte antigen, bola) is carried in the chromosome and represent a fundamental component of the bovine immune system that allows the recognition and presentation of a virtually infinite number of antigens [ ] (figure ). such a high versatility relies on diverse factors, including the polygenetic origin of the mhc genes, the codominance of the parental alleles, the polymorphism of the genetic variants and the peptides/proteins splicing [ ] . class i mhc molecules recognize, bind and present peptide antigens from intracellular pathogens to cd + t-lymphocytes [ ] . in this view, class i mhc molecules and cytotoxic t-lymphocytes (ctl) are likely to play a pivotal role in the early stage of the map infection [ ] . thus, of potential interest for early diagnosis-oriented studies and the design of efficient vaccine formulations. class i mhc peptide antigens are to be considered among the main triggers of the cell-mediated responseand their specific immunostimulation would lead to a more efficient prophylactic outcome. nevertheless, a study from rana etal. highlighted an important involvement also of the humoral response to map infection, other than the adaptive immunity mediated by the class ii mhc molecules [ ] . huge efforts have been made to optimize diagnostics for the efficient detection of map by means of both direct and indirect methods [ , , ] . the slow-growing rate of map along with the reduced sensitivity of the culture-based methods raised the need to develop alternative diagnostic strategies. pcr-based diagnosis targeting the multicopy insertion sequence is held the promise of fast detection of map in both environmental and clinical samples. however, the presence of is -like sequences in other bacterial specimens resulted in a reduction of the pcr specificity. this, along with the elevated costs of the reagents, equipment and procedures, precludes the pcr applicability in large-scale programs [ ] . among the indirect methods, elisa-based detection of anti-map antibodies enables faster diagnosis time but still suffer from drawbacks related to sensitivity and specificity. although great improvements have been made in optimizing elisa kits to reduce cross-reactions with environmental mycobacterium strains [ , ] . still, this method suffers from a lack of sensitivity. moreover, the high antigen similarity between map and mycobacterium bovis obstacles the discrimination between bovines infected with tuberculosis and inoculated with live or attenuated paratuberculosis vaccines [ ] . this promotes the seek of molecular target(s) capable of offering a more robust diagnosis. the present work describes a companion study that relies on previously-obtained datasets of our research group [ ] . employing an immunoproteomic approach, we experimentally validated the whole map proteome for its capability of being complexed by the antibodies naturally occurring in sera of infected bovines. ms-based identification of the immunogenic proteins enabled the detection of protein candidates whose protein sequences have been now further investigated for their immunogenic features. we employed an immunoinformatic approach for further focusing on the peptide sequences, potentially involved in the immunostimulation. a key point of the immunoinformatic approach is the prediction of the protein epitope sequences. epitopes prediction can be based on several features such as physical-chemical properties and structural folding of the primary protein sequence [ ] [ ] [ ] . the present investigation is mainly focused on linear epitopes because protein-antibody complexes were selected through two-dimensional electrophoresis ( -de) and western blotting; thus, on linearized proteins [ , ] . however, the application of other mass spectrometry technologies is quickly developing in the field on immunoproteomics [ ] there are still significant limitation to map on a large scale conformational epitopes. as expected from the previous experimental data, all the screened protein sequences showed the capability of being recognized by both b-cells and class i bolas. the comprehension of recognition and binding of map by the host immune cells is still controversial. some studies document a relevant humoral response to map infections. on the other hand, other pieces of evidence describe the importance of the cell-mediated response to control map growth [ ] [ ] [ ] [ ] [ ] . from our perspective and, according to the collected evidence, map-targeted antibodies could play a key role in the specific and sensitive detection of this pathogen in the subclinical stage of the infection. b-cell epitopes prediction highlighted the "hypothetical protein map_ c" (aas ) protein as the most active in stimulating antibody production. this finding is in agreement with our previous study [ ] , where this protein showed a high level of immunoreactivity exclusively against the serum of the map infected animals. to the best of our knowledge, this protein was not described before as an antigen and, according to our dataset on its functional domains, it is possible to hypothesize that it is part of a surface-associated dehydrogenase with oxidoreductase activity involved in pentose phosphate pathway [ , ] . the fructose-bisphosphate aldolase (eta ), instead, is described as less prone to elicit antibody production. this is consistent with its intracytoplasmic localization and with its major role in the central metabolism. despite its cellular localization, several moonlighting properties have been described as part of its multiple functions [ ] [ ] [ ] . interestingly, b-cell epitopes prediction highlighted a homogenous distribution of multiple peptide sequences throughout all the proteins primary sequences (figure ). this suggests the potential usefulness of the selected proteins for a variety of implications where two or more epitopes are needed in a single protein molecule (e.g., sandwich elisa, and other indirect diagnostic tests ensuring higher sensibility) [ , ] . prediction of the class i bolas binding peptides confirmed the immunogenicity of the previously studied proteins. similarly to hlas, bolas are highly polymorphic proteins; thus, including a plurality of bolas while computing the peptide binding affinity would benefit the robustness and reliability of the prediction [ , ] . among the class i mhc epitopes predicted in the present study, the hypothetical protein ega _ (azp ) differs by only one amino-acidic residue from the map membrane protein c (v kre ), whose immunogenic properties have been already demonstrated by both our previous investigation and other studies [ , , ] . it is, indeed, a surface-exposed protein involved in the mechanism of invasion of the epithelial cells [ , ] . its expression is upregulated when the map is exposed to the physicochemical conditions similar to the intestine environment and the specific block of this protein reduces the virulence up to % [ ] . interestingly, this protein is among the entries classified as wb suggesting that more immunogenic properties can also be exploited by the other wb protein besides the others predicted as being sbs. moreover, we specifically focused on the sole peptide sequences whose binding affinity is shared among multiple bolas. in this manner, the most suitable epitope sequences are likely to have a broad recognition in a higher portion of the bovine population [ , ] . epitopes identified with this approach are of potential interest for diverse purposes and studies, including the investigations aimed at elucidating the order of immunological events following the map infection, and shedding light on the controversial aspect of suppression, or not, of the cellular-mediate immune response following map infection [ ] . to prove selected epitopes as suitable candidates for the unbiased diagnosis of map infection, we aligned the peptides sequences against a database comprising the closest taxonomically-related bacteria. such alignment generated a steep reduction of the number of input sequences and returned two peptides suitable for a specific diagnosis of map. these two candidates were not overlapping with other mycobacteria other than mycobacterium avium complex (mac). the described approach resumes the pipeline of an in-silico method, therefore, empirical tests will be required for the definitive assessment of differential diagnosis capability of the selected sequences. the sequence alignment against the host-specific protein (i.e., the publicly available cow proteome) fails to identify significant sequence identities. the only alignment hits observed (supplementary table s ) were not continuously overlapping and showed a low percentage of identity and a high e-value. acknowledged the prediction of linear epitopes, the matching of our candidates with gapped sequences of cow is likely to be of a negligible relevance since regarding amino acid residues that are not laying in a concatenate order. thus, we speculate that the candidate epitopes suggested in this study are of potential value for the design of either multi-subunit or recombinant vaccines to confer protection against the first-time infection of the calves by map. nevertheless, confirmatory experimental trials are warmly encouraged especially to assess the specificity between map infected animals and bovine with tuberculosis [ , ] . although less significant, a certain level of identity hasbeen registered with other mycobacterium strains. however, the discrepancy observed in the sequence alignment might be used as the driving force for the differential diagnosis. at this purpose, application of optimized laboratory protocols expecting high stringency condition might be the key to improve the specificity of the diagnostic methods. finally, empirical evaluation of the synergistic effect of both b-cell epitopes and the class i mhc epitopes are desirable. this will aim at the evaluation of the successful diagnosis of map infection at the subclinical stage and at the potential in elicitation of protective immunity. to conclude, the present study describes an innovative pipeline based on the in-silico survey of selected immunoreactive proteins capable to uncover the immunogenic features of each protein. this pipeline was applied to the detection of a restricted number of peptides potentially useful for the diagnosis of jd at the early subclinical stage. obtained results are as well useful for the implementation of innovative vaccination strategies. the obtained results confirmed the immunogenicity observed experimentally through the immunoproteomic approach applied to the map proteome. this evidence demonstrates, once again, reciprocal support between immunoproteomics and immunoinformatics. nevertheless, empirical confirmations are warmly required to test the provided epitope sequences both in-vitro and ex-vivofor the possible detection of the subclinical phase of the infection and for the efficacy of the eventual vaccinal formulations. such experimental tests might also help with the comprehension of the controversial role of the host immune cell-response underlying behind jd. complementation of the linear epitopes array with other conformational ones is also of importance for befitting efficacy and safety of the deliverables empirical confirmation may serve as further proof of the robustness of the immunoinformatics approaches as key contributors in the study of diverse infectious diseases. this would provide reliable scientific support in a safe, rapid and cost-effective approach. the current study focuses on ten proteins whose immunoreactivity has been experimentally investigated by means of an immunoproteomic approach [ ] . brielfy, the map proteome was incubated with sera from infected animals to screen for proteins with immunoreactive potential. the most promising entries were then subjected to ms-based identification. identifiers of the candidate immunoreactive proteins were queried in the ncbi non-redundant (ncbinr) protein database to retrieve the whole protein sequences and export them as a fasta file. update of the protein accession numbers operated by the reference data repository (i.e., ncbi) required the run of a protein sequence alignment for the attribution of the novel protein identifiers (gi numbers). selected peptide sequences of the immunoreactive proteins were searched against the ncbinr database restricted to mycobacterium avium subsp. paratuberculosis (taxid ) and the best hit was used to transform the former protein accession numbers into the novel ncbi protein gis. the list of proteins employed in this study along with their current gi number is provided in table and supplementary table s . prediction of the protein sequences that are likely to elicit antibody production and/or bind class i mhc proteins was performed through two tools that are commonly employed for the epitope prediction [ ] , namely iedb (http://tools.iedb.org/bcell/) and netmhc (http://www.cbs.dtu.dk/ services/netmhc/), respectively for b-and class i mhc epitopes prediction. bepipred algorithm was chosen for the prediction of linear protein epitopes capable of binding b-cells. this employs a combination of a hidden markov model and a propensity scale method [ ] . each protein residue is scored for its epitope behavior and the sole aminoacid with a score greater than or equal to . was considered as a potential epitope. linear peptide epitopes of the least length of aminoacids were selected for this study. prediction of epitopes binding class i mhc molecules was performed through netmhc prediction tool, using the artificial neural network (ann) algorithm [ ] . the algorithm was set for the prediction of nine-aminoacids long peptides capable of binding the following bola alleles: bola-d . ; bola-hd ; bola-jsp. ; bola-t a; bola-t b; bola-t c. the binding affinity of the peptides wasscored, and a percentile rank wasprovided by computationally comparing the score of each queried peptide sequence against , natural peptides of the same length. peptides scoring a binding affinity up to . % were considered as strong binders (sb); whereas, peptides with a percentile rank comprised between . % and % were labelled as weak binders (wb). all other peptides were discarded [ , , ] . the resulting list of selected peptide epitopes was further quality-checked and filtered. for each of the selected proteins, the epitopes shared among the major number of bola haplotypes were kept (i.e., the most commonly recognized in the bovine population), resulting in a consensus list of epitopes to be further used in the study. a summary of the experimental worklow employed in this study is provided in figure . . schematic representation of the immunoinformatic approach. blue and orange arrows refer to the input and output of the data, respectively. data arising from our previous immunoproteomic study [ ] were used to retrieve the protein sequences of the immunogenic proteins, selected on the basis of their capability of being complexed by the immunoglobulins in the sera of the infected cows. the sequence of the immunogenic proteins is subjected to the epitope prediction through dedicated tools and algorithms. the b-epitope prediction was performed via the iedb prediction tool, that provides a list of candidate b-epitopes. the class i mhc epitopes waspredicted via netmhc. this makes use of the bola haplotypes from the data repository (ncbi) as references for computing the linear peptides capable of being recognized and presented by the diverse bolas. the list of potential t-epitopes is further refined by selecting the most commonly recognized epitopes with a relatively high binding affinity. the refined list of epitopes is further tested for sequence specificity and cross-reactivity by pblast alignment versus the mycobacteria and cow protein database which, in turn, arise from the publicly available data repository (ncbi). the list of epitope sequences was further analyzed through the basic local alignment search tool for protein sequences (pblast) [ ] . this tool implements the pam algorithm to compare protein sequences and calculates the robustness of matches as means of expected values (e-value). . schematic representation of the immunoinformatic approach. blue and orange arrows refer to the input and output of the data, respectively. data arising from our previous immunoproteomic study [ ] were used to retrieve the protein sequences of the immunogenic proteins, selected on the basis of their capability of being complexed by the immunoglobulins in the sera of the infected cows. the sequence of the immunogenic proteins is subjected to the epitope prediction through dedicated tools and algorithms. the b-epitope prediction was performed via the iedb prediction tool, that provides a list of candidate b-epitopes. the class i mhc epitopes waspredicted via netmhc. this makes use of the bola haplotypes from the data repository (ncbi) as references for computing the linear peptides capable of being recognized and presented by the diverse bolas. the list of potential t-epitopes is further refined by selecting the most commonly recognized epitopes with a relatively high binding affinity. the refined list of epitopes is further tested for sequence specificity and cross-reactivity by pblast alignment versus the mycobacteria and cow protein database which, in turn, arise from the publicly available data repository (ncbi). the list of epitope sequences was further analyzed through the basic local alignment search tool for protein sequences (pblast) [ ] . this tool implements the pam algorithm to compare protein sequences and calculates the robustness of matches as means of expected values (e-value). this value describes the statistic of matches occurring "by chance"; thus, it decreases exponentially as the score of the match increases. in the pblast, each epitope sequence has been aligned against both mycobacteria (ncbi taxid ) and cow (ncbi taxid ) protein repertoires to evaluate sequence specificity and cross-reactivity (figure ) , of importance while selecting candidate epitopes to be employed for the effective diagnosis and/or prophylaxis of map. supplementary materials: the following are available online at http://www.mdpi.com/ - / / / /s . table s . b-cell binding protein epitope prediction. the file summarizes the b-epitope prediction results. each sheet of the xls file is relative to one of the ten selected proteins. "position" column is relative to the position of each aminoachid along the protein sequence; "residue" indicates the type of aminoacid; "score" is relative to the epitope propensity attributed by the algorithm and "assignment" rank each aminoacid residue as epitope or not depending on the prefixed settings. table s . class i mhc binding protein epitope prediction. the xls file summarizes the results in a table reporting: the predicted affinity (nm); the percentile rank, and the predicted binding core for all the selected alleles. two additional columns summarize the predictions across alleles: harmonic mean of the %rank calculated over all specified alleles (h_avg_ranks); the number of alleles covered by a given peptide (n_binders). table s . selected peptide epitopes alignment against the mycobacteria and cow databases. the xls file provides a summary of the pblast alignment search. full description of the table columns and the alignment criteria is available at https://blast.ncbi.nlm.nih.gov/blast.cgi?page=proteins. table s . full list of the prototypic peptides alignment. the file summarizes the p-blast alignment of the two selected epitopes against both the mycobacteria and the cow database. full description of the table columns and the alignment criteria is available at https://blast.ncbi.nlm.nih.gov/blast.cgi?page=proteins. the authors declare no conflict of interest. defining resilience to mycobacterial disease: characteristics of survivors of ovine paratuberculosis experimental infection model for johne's disease using a lyophilised, pure culture, seedstock of mycobacterium avium subspecies paratuberculosis gene expression profiles during subclinical mycobacterium avium subspecies paratuberculosis infection in sheep can predict disease outcome control of paratuberculosis: who, why and how. a review of countries paratuberculosis in sheep and goats johne's disease) in cattle and other susceptible species temporal patterns and quantification of excretion of mycobacterium avium subsp paratuberculosis in sheep with johne's disease mycobacterium avium subsp. paratuberculosis: pathogen, pathogenesis and diagnosis peptidomics in veterinary science: focus on bovine paratuberculosis exploring the zoonotic potential of mycobacterium avium subspecies paratuberculosis through comparative genomics mycobacterium paratuberculosis as a cause of crohn's disease could mycobacterium avium subspecies paratuberculosis cause crohn's disease, ulcerative colitis· · · and colorectal cancer? facts, myths and hypotheses on the zoonotic nature of mycobacterium avium subspecies paratuberculosis mycobacterium avium subspecies paratuberculosis and crohn's disease: a systematic review and meta-analysis the immunopathogenesis of crohn's disease: a three-stage model mycobacterium avium ss. paratuberculosis zoonosis-the hundred year war-beyond crohn's disease mycobacteria in drinking water distribution systems: ecology and significance for human health identification of immunoreactive proteins of mycobacterium avium subsp. paratuberculosis differential cytokine gene expression profiles in the three pathological forms of sheep paratuberculosis longitudinal study of clinicopathological features of johne's disease in sheep naturally exposed to mycobacterium avium subspecies paratuberculosis divergent antigen-specific cellular immune responses during asymptomatic subclinical and clinical states of disease in cows naturally infected with mycobacterium avium subsp. paratuberculosis expression of genes associated with the antigen presentation and processing pathway are consistently regulated in early mycobacterium avium subsp. paratuberculosis infection in silico epitope analysis of unique and membrane associated proteins from mycobacterium avium subsp. paratuberculosis for immunogenicity and vaccine evaluation proteome-wide b and t cell epitope repertoires in outer membrane proteins of mycobacterium avium subsp. paratuberculosis have vaccine and diagnostic relevance: a holistic approach comparative computational analysis of sars-cov- nucleocapsid protein epitopes in taxonomically related coronaviruses. microbes infect molecular basis of covid- relationships in different species: a one health perspective. microbes infect epitope prediction by novel immunoinformatics approach: a state-of-the-art review the major histocompatibility complex in bovines: a an antigenic peptide produced by peptide splicing in the proteasome pathogenesis of bacterial infections in animals membrane and cytoplasmic proteins of mycobacterium avium subspecies paratuberculosis that bind to results of multiple diagnostic tests for mycobacterium avium subsp. paratuberculosis in patients with inflammatory bowel disease and in controls pathogenesis, molecular genetics, and genomics of mycobacterium avium subsp. paratuberculosis, the etiologic agent of johne's disease. front rapid expression of mycobacterium avium subsp. paratuberculosis recombinant proteins for antigen discovery immune reactions in cattle after immunization with a mycobacterium paratuberculosis vaccine and implications for the diagnosis of m. paratuberculosis and m. bovis infections immunoinformatics: a brief review immunoinformatics and epitope prediction in the age of genomic medicine a secondary antibody-detecting molecular weight marker with mouse and rabbit igg fc linear epitopes for western blot analysis applications of maldi-tof mass spectrometry in clinical proteomics model for immune responses to mycobacterium avium subspecies paratuberculosis in cattle susceptibility to and diagnosis of mycobacterium avium subspecies paratuberculosis infection in dairy calves: a review gene expression profiling of monocyte-derived macrophages following infection with mycobacterium avium subspecies avium and mycobacterium avium subspecies paratuberculosis analysis of the immune response to mycobacterium avium subsp. paratuberculosis in experimentally infected calves mycobacterium avium subsp. paratuberculosis antibody response, fecal shedding, and antibody cross-reactivity to mycobacterium bovis in m. avium subsp. paratuberculosis-infected cattle herds vaccinated against johne's disease cloning and expression of a gene cluster encoding three subunits of membrane-bound gluconate dehydrogenase from erwinia cypripedii atcc in escherichia coli fructose- , -bisphosphate and aldolase mediate glucose sensing by ampk fructose-bisphosphate aldolases: an evolutionary history the moonlighting protein fructose- , -bisphosphate aldolase of neisseria meningitidis: surface localization and role in host cell adhesion a short history, principles, and types of elisa, and our laboratory experience with peptide/protein analyses using elisa diagnostic performance of direct and indirect methods for assessing failure of transfer of passive immunity in dairy calves using latent class analysis improved prediction of bovine leucocyte antigens (bola) presented ligands by use of mass-spectrometry-determined ligand and in vitro binding data improved prediction of mhc ii antigen presentation through integration and motif deconvolution of mass spectrometry mhc eluted ligand data production and characterization of monoclonal antibodies against a major membrane protein of mycobacterium avium subsp. paratuberculosis. clin. vaccine immunol a mycobacterium avium subsp. paratuberculosis rela deletion mutant and a kda major membrane protein elicit development of cytotoxic t lymphocytes with ability to kill intracellular bacteria the mycobacterium avium subsp. paratuberculosis kda protein plays a role in invasion of bovine epithelial cells mhc peptidome deconvolution for accurate mhc binding motif characterization and improved t-cell epitope predictions pooshang bagheri, k. in silico rational design of a novel tetra-epitope tetanus vaccine with complete population coverage using developed immunoinformatics and surface epitope mapping approaches new latex bead agglutination assay for differential diagnosis of cattle infected with mycobacterium bovis and mycobacterium avium subsp. paratuberculosis duplex pcr for differential identification of mycobacterium bovis, m. avium, and m. avium subsp. paratuberculosis in formalin-fixed paraffin-embedded tissues from cattle improved method for predicting linear b-cell epitopes gapped sequence alignment using artificial neural networks: application to the mhc class i system in silico identification of epitopes in mycobacterium avium subsp. paratuberculosis proteins that were upregulated under stress conditions protein database searches using compositionally adjusted substitution matrices key: cord- -tm hn hz authors: mockett, a.p.adrian title: envelope proteins of avian infectious bronchitis virus: purification and biological properties date: - - journal: j virol methods doi: . / - ( ) - sha: doc_id: cord_uid: tm hn hz immunoadsorbents, made with monoclonal antibodies, were used to purify the spike and membrane proteins of infectious bronchitis virus (ibv). the purified proteins were inoculated into rabbits to produce antisera. the rabbit anti-spike sera neutralized the infectivity of the virus whereas the anti-membrane sera did not. ibv-infected chickens produced antibodies to both the spike and membrane proteins. both these antibodies were at their highest concentration about – days after inoculation, whereas neutralizing antibodies were present only at very low concentrations at that time. neutralizing antibodies were at their highest concentration days after inoculation. a second inoculation of virus at days induced an anamnestic antibody response to the spike and membrane proteins and also for the neutralizing antibodies. the neutralizing, anti-spike and anti-membrane antibodies all reached highest concentrations – days after this inoculation. the advantages of purifying viral proteins using affinity chromatography with monoclonal antibodies are discussed. avian infectious bronchitis virus (ibv) is a coronavirus whose principal site of replication is the ciliated epithelial cells of the respiratory tract mucosa of chickens. viral replication occurs in the cytoplasm of the cell and virions are formed by budding from the endoplasmic reticulum. there are three viral structural proteins: spike (s; peplomers), membrane (m) and nucleocapsid (n). s and m proteins are both glycosylated and parts of them are exposed at the surface of the virion. the spike protein consists of two glycopolypeptides, s and $ , which have molecular weights of kda and kda, respectively (cavanagh, ) . the membrane protein is present as a number of distinct species which have molecular weights ranging from kda to kda; the molecular weight differences are associated with the various degrees of glycosylation. n protein ( kda) is associated with the viral rna. the ibv spike protein, associated with the outer projections, plays an important part in the infection of cells. chicken antisera to this protein and spike-specific monoclonal antibodies (mockett et al.. ) can neutralize the infectivity of the virus. a similar function has been found for the spike protein of murine coronavirus mhv- (collins et al., ; fleming et al., ) and the porcine coronavirus tgev (garwes et al., ) . the spike protein of ibv also contains strain-specific determinants (mockett et al., ) . the membrane protein appears to be a more highly conserved antigen and it is possible that only a small amount (approx. kda) is exposed at the viral surface (boursnell et al., ) . the nucleocapsid protcin interacts with the viral rna to form a helical nucleocapsid. the objectives of this work were to produce immunoadsorbents using monoclonal antibodies which have been prepared previously (mockett et al., ) and to purify the virus-coded proteins of the viral envelope in a single step in relatively large amounts. this allowed hyperimmune rabbit antisera to the proteins to be produced and tested for neutralizing antibodies. in addition the sequential humoral antibody response of chickens after ibv infection has been studied using the purified viral proteins and whole virus in elisas and compared to the results using the neutralization test. the massachusetts m strain of ibv was grown in the allantoic cavities of i -day-old embryonated chicken eggs and purified on isopycnic sucrose gradients as described by cavanagh ( ) . purified virus was pelleted in a x ml rotor at , x g for h at °c and resuspended in phosphate-buffered saline (pbs). an equal volume of pbs containing % (wt./vol.) np was added, mixed using a dounce homogeniser and incubated for h at °c. the material was centrifuged for min in an eppendorf microcentrifuge and the resulting supernatant, containing soluble viral components, was used for the affinity chromatography purification. monoclonal antibodies (designated a and c! ) to the spike and membrane proteins respectively of ibv strain m were prepared (mockett et al., ) . the gammaglobulin fraction of ascitic fluids containing either anti-spike or anti-membrane monoclonal antibodies was isolated by salt precipitation using a final concentration of % (wt./vol.) na so . for the spike immunoadsorbent . mg of gammaglobulin was coupled to . mg of cnbr-sepharose b (pharmacia) according to the manufacturers' instructions and for the membranc immunoadsorbent . mg was coupled to the same amount of gel. unreactive groups on the gel were blocked using m ethanolamine, pit . , and any non-covalently bound proteins wcre removed by repeated washings with . m nahco~ buffer, ph . , containing . m naci and . m acetic acid buffer, ph . , containing . m naci. the immunoadsorbent was stored in pbs containing . % nan~ at °c until used. it was washed twice with m nh scn in pbs containing . % octylglucoside, four times with pbs and twice with pbs containing % np before use. all wash volumes were ml. the solubilised virus preparation was mixed with the immunoadsorbent for h at °c using a rotary stirrer. the gel was poured into a chromatography column and washed with pbs containing . % np ( ml) and pbs containing . % octylglucoside ( ml). m nh,scn in pbs containing . % octylglucoside was added and fractions of ml collected. the absorbance at nm of each of the fractions was read using a sp pyeunicam spectrophotometer. the fractions in the absorbance peak were dialysed against pbs. a sample of each fraction was then subjected to electrophoresis in a polyacrylamide gel. those fractions containing detectable viral protein were pooled and constituted the purified protein prepar~,tion. ten per cent polyacrylamide slab gels containing sds were used (laemmli, ) and after electrophoresis samples were stained first with coomassie brilliant blue r- and then silver (morrisey, ) . new zealand white rabbits were used. samples of purified proteins were mixed with an equal volume of freund's complete adjuvant and inoculated intramuscularly into the rabbits. a similar inoculation was given wk later. after wk the same antigen in incomplete freund's adjuvant was inoculated subcutaneously. five months later blood was collected from the ear vein and the resulting serum stored at - °c until used. the houghton poultry research station line of rhode island red chickens was used. ibv (m ) was inoculated intratracheally ( ciliostatic dose fifty (cd ); darbyshire et al., ) into chickens and sequential blood samples were taken from the wing vein (see fig. for times after inoculation). sera from the blood samples were stored at - °c until used. a serum pool for each time of sampling was made by mixing an equal volume of serum from each of the eight samples. three different antigens were used for the ei,isas: spike protein, membrane protein and ib virus. the purified spike and membrane proteins were used at a dilution of : in carbonate buffer whilst the purified ib virus was used at a : dilution. antigens were adsorbed for i h at °c. in the second step chicken sera were serially diluted in . m nacl containing . ~ c np (saline/np ) from an initial . iog dilution. any specifically bound antibody was detected using a rabbit anti-chicken igg serum ( : ) and a goat anti-rabbit igg alkaline phosphatase conjugate ( : , ) (sigma), both diluted in saline/np . the substrate was p-nitrophenyl phosphate ( mg/ml)in diethanolamine buffer and the reaction stopped using m naoh. each step was for min at °c, the reaction volumes pl and the plate was washed three times with pbs containing . c~ tween between each step. titres were calculated graphically (mockett and darbyshire, ) using an absorbance value of . as the cut off. the chicken antisera were tested for neutralizing antibody to ibv as described by darbyshire et al. ( ) . rabbit sera were precipitated with °~ (wt./vol.) nazso and the resulting gammaglobulin tested because rabbit sera have high concentrations of non-specific inhibitors of ib virus replication. the viral proteins purified by affinity chromatography are shown in fig. . the spike protein, which is composed of two polypeptides, was the only protein detected in the two fractions shown using the sensitive silver staining procedure. similarly the membrane protein was not contaminated with other proteins, although this protein did not stain as well as the spike. there were other stained bands present, but these are artifacts sometimes observed, even in the absence of protein, with this staining procedure. the purification was highly reproducible. the purified viral proteins were inoculated into rabbits and the gammaglobulin fraction of the antisera tested for neutralizing activity. only the anti-spike gammaglobulin neutralized the virus. however the membrane protein was a good immunogen because the rabbit anti-membrane sera tested in the elisa had high activity against the whole virus: in fact the titres were higher than those of the rabbit anti-spike sera (see table ). the results of testing sera from ibv-infected chickens for antibody to spike and membrane proteins showed that both anti-spike and anti-membrane antibodies were produced early after infection (see fig. ). peak titres were between and days after infection. the antibody response to the whole virus had a similar profile. however, the the second inoculation of virus induced an anamnestic antibody response. the elisa detected a similar increase in antibody titres using spike, membrane and whole virus -the peak was at - days after infection. the neutralizing antibody response was also similar and the peak titres were at days after infection which contrasted to the slow rise to the peak titres after the primary inoculation. this paper describes the application of affinity chromatography using monoclonal antibodies for the purification of the two viral structural proteins present at the surface of the ib virion -spike and membrane. a previous report has described procedures for the purification of these viral proteins and also nucleocapsid protein, the only other major structural protein (cavanagh, ) . ibv was solubilised in np detergent and centrifuged in a sucrose gradient containing this detergent in order to purify the nucleocapsid protein. the addition of m naci to the sucrose solutions was required for the purification of the spike and membrane proteins, as they co-migrated in gradients containing low salt concentrations. however, the nucleocapsid protein could not be purified in gradients containing high salt concentrations. the yield of material from these gradients was relatively low, due to the limited number of fractions which contained purified viral components. in other studies purified spike material contained some nucleocapsid protein and the membrane preparation contained other proteins which were thought to be of cellular origin. there are a number of advantages in using affinity chromatography. by making use of the specificity of the antibody pure material can be isolated, even from a crude mixture of proteins. the method is very quick and easy and the immunoadsorbent can be used several times. thus, relatively large amounts of purificd material can be obtained. the availability of spike and membrane proteins in a highly purified form will allow more biochemical, structural and immunological studies to be done. the conditions used to solubilise the virus did not dissociate the two spike polypeptides, therefore, both s i and $ were detected in thc material eluted from the anti-spike immunoadsorbent. the results of experiments using the rabbit antisera to the viral proteins confirmed the biological importance of the spike protein as only antibodies to this protein neutralized the infectivity of the virus. thc chicken, about days after an ibv infection, has antibodies to both the spike and membrane proteins in its serum but only very low concentrations of neutralizing antibodies. thc profile of neutralizing antibodies shown in this paper agrees with previous published findings (holmes, ; mockett and darbyshire, ; hawkes et al., ) . the results show that anti-spike antibodies produced early after infection are non-neutralizing, as assessed by our in vitro technique. this raises the question as to the function of these antibodies in thc chicken. previous evidence has shown only the spike protein to be capable ofeliciting neutralizing antibodies. there is a possibility that the anti-spike antibodies could be neutralizing in vivo and the function of the anti-membrane antibodies could be similar. the possible role of these antibodies in protection remains to be resolved. purified viral proteins can be used to determine which is responsible for protecting thc chicken from infection. protection tcsts such as thc ciliostasis (darbyshire, ) and the mixed infcction (escherichia coil and ibv) (smith et al., ) tcsts are available. it is only by using methods such as affinity chromatography that stflficientl~, large amounts of pure viral proteins can be made available to enable such tests to bc done. . \.'iru~ rc~ i . pov, cll virolog, , ~. darbyshire artwrlght, lg":'bl vet ph. i). i hcsis, i. tnl,ct~,it', oi ]'he author wishes to thank ms. ,i.k.a. ('ook for her help with the neutralilatton tests, ms. debra southee for her excellent technical assistance and i)r. i'.i).k. br()wn for useful discussions. key: cord- - su x ed authors: aiking, harry; de boer, joop; vereijken, johan m. title: transition feasibility and implications for stakeholders date: journal: sustainable protein production and consumption: pigs or peas? doi: . / - - - _ sha: doc_id: cord_uid: su x ed nan role of citizens who support more stringent environmental standards might be greater than their economic role as consumers when they make trade-offs between the prices of pork and npfs. third, the ecological approach argued that for developing a coherent set of indicators no framework turned out to be available, which ( ) links pressures to environmental consequences, and ( ) focuses on how each of the indicators can contribute to the problem solving logic required to answer questions of environmental sustainability. rather, the existing frameworks take a one-dimensional perspective. therefore, causal networks were recommended, describing how individual indicators are interrelated, thus providing a more complete, holistic picture of what is happening in the environment. in short, process knowledge provides insight into cause-andeffect networks and is put forward to guide the selection of appropriate indicator sets. together, irrespective of the approach, a clear consensus was provided ( ) that npfs are environmentally more sustainable than pork and ( ) that the most important impacts are due to tampering with (a) land use and the cycling of (b) water, (c) carbon and (d) nitrogen, leading to biodiversity loss, climate change, eutrophication and acidification (chapter ). given the convergent results concerning the relative advantages of npfs, the question emerges whether this information will be enough for policymakers who want to choose between alternative protein chains. as we shall see in greater detail later, policymakers should try to avoid suboptimal solutions and that makes it important for them to start with a comparative analysis of a large number of options. presently, the environmental assessment tool is limited to the agricultural production phase. after extension to the rest of the production and consumption chain the method should be generalised beyond comparing just protein food options. in view of the increasing relevance of linkages between transitions, it should be extended sufficiently to become a tool to assess the environmental sustainability of transitions in general. because a large part of the environmental impact of the pork chain takes place outside western europe, it is important to also look at the global dimension. reducing this environmental impact in developing countries of asia and south america by shifting to a more plant protein centred consumption pattern in western europe may have important economic repercussions in those developing countries. in turn, this may or may not cause negative environmental impacts there, depending on what alternative livelihood strategies will be developed to compensate reduced feed crop exports to western europe. so north-south issues clearly require further study. six projects aimed at assessing aspects of the technological feasibility of pea-derived npfs were performed in profetas. two of them were directed at aspects concerning primary production of peas and two at aspects concerning processing of peas to npfs. in addition, in one project a tool is being developed to optimise the npf chain. last but not least, in one project the options for the non-protein fractions were studied. in the breeding project it was shown that between different varieties there is a large variation in protein composition. this variability can be exploited in classical breeding programmes to obtain pea varieties that are optimised towards npf production. furthermore, a tool has been developed for genetic modification of pea and other legumes. its application will enable selective removal of certain storage proteins (which affect texture formation) and of certain enzymes (which affect flavour compounds) in order to improve the quality of the raw material. as an alternative, genetic modification may be used in r&d for screening the effects of alterations in protein and flavour composition. such an approach will speed up classical breeding while field crops are kept gm free. in the cultivation project, a crop growth model has been developed that predicts yield of peas and pea protein as a function of genotype and environmental variables (such as solar radiation, temperature and rainfall). the model is generic; it can be extended to any arable crop. the model can be used to optimise production (e.g. with respect to resource use efficiency such as water) or to define the characteristics a variety should have for a given environment. an important insight resulting from this project is that using varieties with a high protein content cannot increase the protein yield per hectare. to achieve the latter, other traits (e.g. plant architecture to sustain leaf area duration during the grain filling period) should also be changed. furthermore, attention was drawn to agronomic aspects, such as resistance to lodging and soil-borne diseases, which indicate that peas may not be the best choice to function as starting material for npf production. the texture formation studies yielded new insights into a process that is of prime importance in texture formation, i.e. heat-induced gelling behaviour of pea proteins. this behaviour was shown to be affected by the protein . . composition and by the rate of cooling during processing. this implies that a range of textures can be produced (a) by varying the protein composition by selecting appropriate pea varieties and (b) by varying the processing parameters. this provides basic tools to food technologists to direct the texture of pea-based npfs towards consumer demands. comparative studies using soy and pea proteins led to the insight that despite similarities between proteins on the molecular level, their gelling behaviour may be different. on the one hand, this could mean that a large range of textures can be made using a limited set of proteins. on the other hand, this observation likely sets limits to the interchangeability of proteins. protein-flavour interactions studies using model volatile flavour compounds provided information on the amounts of flavour compounds associated with the pea proteins under various conditions. they also showed that the interactions with flavour compounds differ per type of pea protein. so again, varying protein composition or process parameters offers possibilities to tune the flavour of npfs. furthermore, it was shown that saponins, not only present in peas but also in numerous other legumes (including soy) are responsible for a bitter taste. this bitterness may be reduced by heating, thus providing a technological tool to adjust taste. it was also found that in pea protein preparations "off-flavours" seem to be present that are derived from fat oxidation. similar "off-flavours" have been found in other plant protein preparations too; very well known examples are soy protein preparations. the project on chain design will not be completed before . tools developed in this project are expected to be useful in the design of food chains to optimise these chains either towards a single goal (such as product quality, costs, or environmental load), or even towards multiple goals. preliminary results are that the cost price of npfs could be less than or at least comparable to that of pork. comparison of the environmental load by using exergy analysis showed that an npf chain is more efficient than a pork chain only if the non-protein fractions of pea seeds (mainly starch) are put to use. options for the non-protein fractions were investigated in a project added later to study the feasibility of plant-derived npfs from a different perspective. a tool was developed to assess different protein crops on the potential uses of the non-protein fractions. only bulk fractions were distinguished and evaluated on their suitability to be used as food, feed, stock and biofuel. the conclusion was drawn that -at present -oil crops may be at an advantage, because the oil fractions have better application perspectives than other non-protein fractions. interestingly, a transition from intensive meat production to npf production from any crop would lead to a shortage of soy oil for food applications, rather than to a surplus of this non-protein fraction. if it is assumed that all intensive livestock farming disappears and only grass-fed and other extensive meat production would remain, then this shift -irrespective of the crop (pea or soy) -would release an enormous area (over million ha) of highly productive land for other uses. for example, if used to grow biomass, this would be sufficient to cover approximately % of the current world energy demand. the research on technological feasibility has focussed primarily on basic problems and on issues that needed to be solved before products could be designed. with respect to processing some important insights and tools are still missing. for instance, neither have texturisation processes been studied, nor have products (or product concepts) actually been made. such research is indispensable to allow a consumer-oriented development of npfs. at present it is still unknown whether more traditional texturing processes such as extrusion can deliver the range of textures desired by consumers. it may well be that novel techniques have to be deployed and further developed, such as techniques based on phase separation. furthermore, in actual npfs other components (e.g. hydrocolloids, fats) will be present. no information is available about the effects of such components on the texture and flavour of the npfs. with respect to flavour, stakeholders from industry have indicated that starting materials for npf production should preferably be free from "off-flavours". the prevention of formation of such compounds or ways to remove them should be investigated. a lot of knowledge on this subject is already available from research on soy proteins. missing insights with respect to the use of the newly developed protocol to genetically modify peas include, among others, its effectiveness and efficiency as well as its effects on cultivation. these effects not only include those relevant for npf processing (e.g. effects of protein composition on texture and flavour) but also those relevant for primary production. questions have to be answered regarding the effects of the modification on for example (a) germination power of the seeds, (b) viability of the modified peas in the field and (c) an eventual yield penalty for total protein content because of the modification. furthermore, the applicability of this new protocol to modify crops other than peas needs to be further explored. to exploit the interesting possibilities offered by the crop growth model, it has to be validated and to be extended to crops other than peas. then it can be a powerful tool in identifying differences among various crops in resource use efficiency for e.g. biomass and protein production, hence in contributing to a sustainable primary production of raw materials for npfs. in summary, quite a lot of research is still required to actually develop and produce npfs that meet consumer preferences. in addition to the technological issues discussed above, the project on chain design will most likely result not just in answering questions but also in raising others. furthermore, options for the non-protein fractions should be detailed by means of scenario studies, with particular attention for the projected shortage of soy oil for food applications. finally, the issue of the crops to be used for npf production is still open. as argued in section . , even in europe peas may not be the preferred crop for npf production. more research concerning crop choice is required. in addition to sustainability -which is an evident societal preferencesocietal desirability was studied in profetas with respect to ( ) the behaviour and preferences of consumers and ( ) the food-related political and economic developments of the next decades. the underlying notion of this work was that the desirability of a diet shift is in proportion to its fitting in with the behavioural patterns of current and future generations. in addition, it was argued that a lack of fit would create less serious problems if mitigating measures can be taken. based on a long-term view on behaviour, the potential for a diet shift in relation to socio-cultural changes was examined. a newly developed analytical framework sorts insights on influences on behaviour into a logical order. its cascade-like structure embodies the view that a long-term development will create opportunities for food choices that match its general direction, whereas it will put constraints on others. the analysis of long-term changes indicated that there is a favourable socio-cultural context for decisions that make consumers and producers less dependent on meat proteins. one of the most salient results of this work, however, is the contrast between, on the one hand, a series of impressive changes in dietary choices over the last few centuries and particularly over the last few decades and, on the other hand, the observation that an individual will not easily change his or her food preferences from one day to the next. this contrast underlines the value of the analytical framework in combination with smallscale consumer research. by using currently available meat substitutes as a model to analyse consumer behaviour and food choices, it was possible to develop several . social desirability tools that may guide npf product development. it appears that the currently available meat substitutes will not become popular without additional measures. analysis of consumers and consumption behaviour with respect to meat and meat substitutes provided insights such as (a) non-vegetarian consumers of meat substitutes are not impressed by environmental arguments, (b) only a small group of consumers is open to completely new products due to so-called "neophobia", (c) consumers would like more information on usage and preparation of meat substitutes and on ingredients used. consumer sensory preferences were analysed and their translation into product characteristics was attempted, yielding insights such as (a) attention should be paid to satiating properties of npfs, which are related to protein content, (b) most people want soft, smooth or crispy meat substitutes to have a seasoned, meat-like flavour and a brown colour, (c) meat substitutes should have the same place in the dish as meat. other than originally anticipated, the latter results clearly show that people want npfs to have meat-like characteristics. they confirm the notion that people will habitually look for what is familiar when they are trying to make sense of something, such as an invitation to try a product. the retrospective character of sense making can explain why non-vegetarian consumers keep relying on distinctions drawn in the past and why they evaluate meat substitutes by using meat-based criteria. the selection environment in which npfs have to survive is partly dependent on their positioning in the market, for example as cheap (bulk) proteins or as quality products (specialties). the prospects of the new products may also depend on linkages with other issues, such as increasing meat prices or health promotion. these prospects are not only dependent on consumer responses in a potential usage situation, but also on processes at the level of markets and public institutions. given the fact that technology development takes place in a changing and malleable world, the stakeholders of a new technology may opt to monitor and influence its selection environment. profetas applied some advanced tools such as policy analysis and econometric modelling, as well as a skilled examination of the rules of international institutions with regard to novel foods. in a political analysis it was investigated how politics and public policy affect the possibilities of a diet shift. as expected, it appeared that governmental policy does not have many direct influences on food choices (i.e. the proportion of meat vs. plant proteins in the nation's diet), but mostly indirect influences. these influences demonstrate that production and consumption are, at least partially, facilitated by a political "infrastructure", implicitly favouring certain products or production processes over others. to uncover indirect influences, it was analysed how current food practices have developed in the netherlands since (i.e. when the physician g.j. mulder brought up the issue of proteins and the societal effects of the lack of proteins in most people's diets). given the current sustainability-related issues, the political infrastructure still seems to favour the option of more protein production and consumption, although that option is not actively promoted by governmental action. from a sustainability perspective, however, it may be desirable to have a political infrastructure that favours a more divergent food system with different approaches to food. this would mean, for example, that the option of producing more plant protein does not necessarily get more emphasis than the option of simply "eating less meat". two complementary econometric analyses studied agricultural production and consumption and the patterns exhibited (by adaptations to the existing gemat model), and institutional arrangements affecting agricultural production and trade (adapting the gtap model), respectively. it was shown that a protein transition results in a decrease of environmental pressure on land under all scenarios (gtap) and that the potential is even larger (gemat). though crucially important to the feasibility of a transition, interestingly, the two models employed disagree on future meat price development. from gtap a price decrease may be inferred, but from gemat a price increase. the underlying cause of this disagreement is the implicit assumption, deeply embedded in the models, to what extent agricultural efficiency will continue to increase in the future. these results show the added value of employing two complementary models. a study of institutional aspects provided the insight that even though international institutions (such as the eu and the wto) may provide both incentives and barriers for the introduction, marketing and promotion of npfs, on balance the barriers exceed the incentives. since these barriers (primarily intended to resist protectionist practices in international trade) cannot be circumvented, a successful introduction and marketing of npfs should be taking them into proper account. in short, it is easier to start from already authorised foods and ingredients than to start from foods and ingredients that still need to be authorised, especially in europe if they are gm crop derived. for the promotion of npfs not too much should be expected from traditional government instruments such as taxes (on meat) and subsidies (on npfs). subsidies have already lost their appeal in most eu countries for purely domestic reasons, plus they are heavily restricted by eu regulations concerning state aid and the single market. if npfs are to become a success, it should primarily be through private, commercial means and action. international institutions can protect and support commercial interests, however, through the international protection of intellectual property rights. each of the tools has provided relevant information about opportunities and barriers for npfs, but, under the present conditions of uncertainty, none could lead to conclusive answers. for example, one of the drawbacks of consumer research is that the currently available meat substitutes are, in fact, sold in a niche market and that they are almost twice as expensive as the cheapest meats. in order to improve the relevance of consumer research (sensory research, in particular), actual pea-derived npf products -in the form of cheap protein products or as quality products -are indispensable. since such products are not available yet, they should be crafted for this purpose. these products could also be used in assessing the relation between sensory properties of npfs and their physical characteristics, a very difficult field of research, but of great importance for consumer-oriented product development. evidently, it is important for product developers to realize that "the average consumer" does not exist. subsequently, it is important to realize that different npfs will have to be developed to fulfil the needs of different consumer groups. all of the tools may seriously be influenced by the use of pre-conceptions in thinking about the future. the role of pre-conceptions is particularly great when questions have to be answered about the social desirability of policy options. it was shown that pre-conceptions lead to hidden assumptions that blur the arguments for or against an option. for instance, one of the preconceptions defined a diet shift primarily as an opportunity to develop products with a larger profit margin than meat. in contrast, it may be more sensible to develop plant protein ingredients that can serve as a low price alternative to meat protein ingredients. another example is the implicit assumption mentioned above that meat prices will continue to decrease in the future, in consequence of continuing agricultural efficiency. probably, policymakers may show wisdom by expecting that the meat prices will not decrease but increase as a result of changes in the world market and agroecological constraints on production. in fact, it seems inevitable that the present growth of spending power in china will put the world market prices of meat and feed under pressure. in addition, it should be emphasized that not too much should be expected of traditional government instruments such as taxes and subsidies. although the latter are effective tools, their application is fraught with political difficulties. although the various tools to analyse the social desirability of a diet shift have not identified strong arguments against it, a critical reflection on the results may lead to the conclusion that the corresponding profetas hypothesis deserves some more stringent tests than the arguments that have been described. in addition to the limitations mentioned above, it should be . . noted that the present analyses might have overstated the importance of agriculture to the political and economic processes in modern society. other linkages, in particular to non-food issues, may have been overlooked or underestimated. it is clear that health issues -which have not been directly studied in profetas -require further attention. for example, it has emerged recently that intensive production of poultry and pigs in close proximity with people may play an important role in the adaptation of originally poultry-specific viruses via pigs to human beings as suitable hosts (pilcher, ; chen et al., ) . under such conditions -primarily extant in south east asia -new viruses are frequently spawned. it seems more and more likely that recent incidents such as with sars and avian influenza are correlated with the intensive meat production there, and that the frequency of such incidents will continue to rise in parallel in the future. apart from animal welfare issues, this health aspect, in itself, seems a good reason to reduce intensive meat production in general, and that of poultry and pigs in particular, at least under conditions such as in south east asia, which is globally an important producing and exporting area. from profetas the conclusion emerges that there are several sound reasons that support the triple hypothesis. that is, a shift in the western diet from meat proteins to plant-derived protein products appears to be ( ) environmentally more sustainable than present trends, ( ) technologically feasible, and ( ) socially desirable. interestingly, the citizen generally seems to consider sustainability to be socially desirable, but the consumer does not like the taste of present meat substitutes. nevertheless, the main evidence is in support of a transition to make food production and consumption more sustainable. given the aims of profetas, the programme has not included specialized transitions research. nevertheless, many insights have been generated on the protein transition and on its linkages with the energy and freshwater transitions. these insights are described below. first, it has become clear that the protein transition is a necessity at the global level. without it, food production and sustainability are on a collision course. from time to time, there will be signals, alarming reports or dramatic events, which may contribute to the opening of a policy window. if the window opens, however, the market system will not simply guarantee that . transition feasibility . . the most sustainable alternative technology will win. market failures, but also government failures, may result in the selection of a suboptimal solution. second, study of the technological feasibility has revealed a number of options, but also some gaps in the required knowledge, such as the preferred crop for npf production. although many criteria for such a crop have been established (such as low fertiliser requirement, high protein yield, and preferably already part of the established food system), a more conclusive choice cannot be made yet, for criteria in other areas (such as concerning the non-protein fraction) have not yet fully materialised. presently, the top choice in europe may be rapeseed or pea yet, but in asia it will probably be soy. third is the insight that the protein, energy and freshwater transitions are inextricably intertwined and that all parts of the crop or seed -protein and non-protein -should be considered one combined chain. although the former, in particular, is a novel insight, both the former and the latter views fit in nicely with the present trend towards a biobased economy, aiming to derive food and feed, chemicals and other non-food materials, and energy from plant crops in the most efficient way. fourth, there is support from various actors, although this depends on linkages with other issues. feedback from dutch governmental actors and ngos revealed enthusiasm for a protein transition, however, they are not inclined to initiate one themselves. in contrast, they do feel committed towards an energy transition. for most western consumers, sustainability or the environment is not an incentive for food choice, however, health is. due to a number of meat crises and other food scares, health and animal welfare are valued as increasingly important issues by western consumers. in fact, sales of meat substitutes are increasing every year and they are being bought and eaten by non-vegetarian consumers. in conclusion, several trends in different areas (protein, biofuel, water saving), on different geographic levels (local to global; western to developing countries) and concerning different actors (consumer, government, industry, ngos) have been identified by profetas, whichtaken together -may lead towards a protein transition. a major step is raising the awareness that all these trends are linked up and cannot be seen in isolation. so a major achievement of profetas is the insight to propose bringing together all these different actors, all with their own agendas and multiform aims. this is, we believe, the true definition of a societal transition. for pragmatic reasons, in profetas many boundary conditions were assumed, such as the focus on the western consumer and the focus on peas. at this stage, it seems timely to start focusing the attention on how -and where -a protein transition may be achieved in a dynamic, multiform society. alternative options such as deriving proteins from algae should be investigated, as well as the feasibility of reducing the present protein overconsumption in western countries (the "eating less protein" option). more direct research on transitions seems in order. the latter requires better insight into the role of explicit as well as implicit pre-conceptions in thinking about the future. in this respect, the use of multi-method strategies appears to be indispensable. multi-method strategies may not only validate each other or specify complementary aspects of a complex phenomenon, but they may also show interesting divergences. profetas has shown several divergences that are extremely important for further research into transitions. some examples are: the divergence between pre-conceptions that see a diet shift primarily as an opportunity to develop products with a larger profit margin than meat and pre-conceptions about developing plant based products that can serve as a low price alternative to meat protein ingredients. the contrast in the studies of consumer behaviour between, on the one hand, a series of impressive changes in dietary choices over the last few centuries and decades and, on the other hand, the observation that an individual will not easily change his or her food preferences from one day to the next. the assumption that meat prices will continue to decrease in the future, in consequence of continuing agricultural efficiency, versus the assumption that meat prices will increase as a result of changes in the world market and agro-ecological constraints on production. in spite of historical trends, it seems inevitable that the strong increase in meat consumption in countries such as china will put the world market prices of meat and feed under pressure. further research may be necessary into historical transitions, as well as into future general trends in global society. with respect to the latter, developing a number of contrasting visions of a potential global future is often considered helpful. in order to arrive at a robust transition strategy, these visions can be confronted with desirable options for protein food development. such an approach should not be too general, for example, by focussing on aggregated parameters such as the growth rate of the world economy. more specific factors, such as global public health, should be an explicit part of the picture. so does taking geographic and cultural . . inhomogeneities into account, which clearly requires international cooperation. for the purpose of dutch and european policy making, it can be argued that protein sustainability is a medium to long-term issue on a global scale. in the not too distant future, intensive meat production worldwide should be discontinued, or at least strongly reduced. in practice, the notion of an approaching collision between current food production and sustainability means that imminent disturbances of the protein production chain will let themselves be known to policymakers through all kinds of signals. obviously, some of the weak signals could be a gradual rise of meat and feed prices. stronger signals may include human and animal health incidents, such as with sars and avian influenza, which are probably correlated with intensive meat production. against the background of irregular signals that vary in strength, governmental policymakers may have to manage at least three emerging goals: the first goal is to detect and interpret the signals correctly and to attribute them without delay to the relevant disturbances of the protein production chain. the bse case has shown that a delay might have disastrous consequences on the controllability of the whole policy making process. the second goal is to minimize any negative effects that the disturbances may have on society. for instance, it may be wise to be prepared for a large-scale vaccination campaign. the third goal is to prevent the opportunity to take action getting lost as a result of counteracting processes that favour suboptimal solutions. that is, solutions that are suboptimal from the perspective of long-term sustainability objectives, not aimed at the root of the problem, such as temporary bans on certain types of meat. the prevention of suboptimal solutions may require that governmental policymakers take action to correct market failures, for example, where private actors do not take full responsibility for the societal consequences of their activities. however, suboptimal solutions may also result from government failures, such as subsidies given to the wrong group or at the wrong moment. moreover, the various linkages between the protein transition and other transitions on different levels of scale will seriously complicate any attempt to apply straightforward forms of strategic planning. the least a robust policy should entail is an open mind to problem solving. for governmental policymakers it would be wise to avoid fixation on one particular technological option. in contrast, this might involve a decision to actively stimulate a range of divergent potential solutions to be developed, for example, by supporting research in the pre-competitive stage of technology development. this may especially be necessary where private parties have difficulties in making sense of the linkages between transitions, i.e. protein, energy and water. this approach would entail: slighting the mental barriers between thinking about food, energy and water policy, or about natural resources in general. trying to identify links with specific other policies, such as with the fight against obesity, the aims for renewable energy (such as the eu biomass action plan) and the aims to promote organic farming. considering alternatives such as npfs as sustainable successors to animal products (such as pork) with regard to export of products and know-how and considering incentives to r&d in that area. in combination with an open-minded approach, it is extremely important that governmental policymakers pay attention to the selection environment in which newly developed npfs have to survive. governmental policymakers can do a lot to prevent suboptimal solutions by monitoring and influencing the selection environment npfs are facing. some points in need of attention are the following: initiating removal of national, eu and international (wto) institutional barriers to the introduction of npfs on the market. increasing the transparency of the food chain in order to enable citizens and consumers to make sound choices. continuing and internationally promoting "green" thinking by acknowledging the role of plants for improving sustainability. in fact, the presently emerging trend towards a "biobased economy" is a first step toward the latter. in addition to deriving materials and energy from plant crops in combination, by the same token, food can be added to the list because crops are the ultimate renewables, degradable and all, and particularly sustainable if little fertiliser is required (such as with nitrogenfixing protein crops). european (eu) policymakers are in much the same boat as their dutch counterparts, where they have to deal with global environmental changes. the goals for biofuels and organic farming are easily linked to the protein transition. in addition, recovering the presently lacking self-sufficiency in protein rich feed crops may be an incentive for eu policy towards striving for sustainability by means of a protein transition. this may be even more so in view of the expected increase in demands (and consequently, competition) for protein rich feed crops by industrialising countries such as china. an additional incentive may be provided by the potential savings on freshwater use by agriculture, given the rising need expressed by the world water forum. for industrial policymakers an important question will be whether or not they want to be among the first movers in the market of new protein products. a company's decision on this topic will be governed by strategic and political circumstances at the time the options are contemplated, such as the ripeness of an issue. these circumstances depend on its own capabilities, its position in the industry, the economic situation of this industry and the industry's public image. the ripeness of an issue will in particular be influenced by the technological state-of-the-art (including innovative power), by the durability of the issue (a trend or a hype), by public opinions (including campaigns that emphasize the "ills" of an industry), and by linkages that improve the market success of a particular product segment. in view of the many potential linkages between a protein transition and other transitions, it should be emphasized that developing alternatives to an established technology may require many innovations and that these will not occur automatically as the outcome of a linear process. the currently predominating policy of multinational companies seems to be rather riskaversive by doing r&d themselves only on a moderate scale and, when deemed necessary, strategically buying emerging small companies with innovative products. the progress of small companies may be highly dependent on the relational context in which an innovation is located, such as the other innovations on which it builds, the status of the actors that own competing innovations, and the underlying community of experts involved in its elaboration. in addition, many chance factors may affect the content and the timing of innovation decisions. accordingly, the prospects of npfs may depend on many initiatives. as a start, they may entail: implementing "green" thinking in policies towards stepwise innovation and stepwise improving sustainability. slighting the mental barriers between thinking about raw materials, energy and water. integral thinking is a cornerstone of the emerging "biobased economy". considering plant-derived alternatives as sustainable successors to animal products with regard to export of products and of know-how, and increasing r&d in that area. this way of thinking coincides very well with parts of the mission statements of many companies: to create innovative products that promote a sustainable future. next to contributing to a sustainable future, npfs could also meet another important issue for industry: making profit. preliminary profetas calculations indicate that cost price of npfs could be lower or at least comparable to that of pork. furthermore, meat prices are expected to increase. this increase will be higher than the increase in the cost of plant proteins because of the low conversion factor of plant protein to meat protein. hence the gap between the costs of npfs and meat will become bigger, thereby increasing the profit margin. furthermore, npfs are directed towards a growing market. among others, the increase will evidently depend on the extent to which the products meet consumer demands and preferences, not only with respect to sensory properties but also to other issues such as health and animal welfare. this will be discussed in section . . those industrial policymakers who want to be among the first movers on the npf market may start thinking about questions such as: which markets are the most interesting options, specialty or bulk, which products and how to position the products on the market? what are the preferences of the different consumer groups and which ones can be translated to product characteristics? what techniques are required to produce the desired products? which proteins or, even more basic, which crops seem feasible where? what chains should be developed and optimised, and with whom? this sounds partly like revisiting technological projects of profetas and indeed it is, but now on the level of actual and competitive products, rather than on the level profetas was directed at, i.e. the development of a toolkit on a basic, pre-competitive level. however, to answer the questions asked above in a science-based way and to provide a sound base for future npf development, further pre-competitive research on technological issues is still required. a number of subjects for such research have been discussed for example in section . . in addition to technological issues, questions should be addressed such as: cooperation with what other parties is useful (e.g. for optimal use of the crop)? how may a protein transition spread across the world (e.g. which countries are best suited for introducing and testing npfs)? what may be the side effects of a protein transition (e.g. with respect to the north-south divide)? what trends in other areas (transport, technologies, lifestyles) may be relevant? for smaller companies it may pay to closely watch the emerging trend towards a "biobased economy" and start thinking about strategic alliances. as long as innovation is not "directed" by larger companies (cf. transition "management" as proposed by the dutch government), niches for innovative products will open, be it plant-derived protein foods or products made from the non-protein fraction of crops, products for saving water and/or energy, or others. the consequences of a protein transition will affect many stakeholders, such as consumers, retailers, farmers and ngos. without addressing each of these groups individually, the present section intends to sketch the most pertinent implications. what consumers should ideally do in the context of a protein transition may boil down to (a) eating one third less protein (the average dutch over-consumption), (b) replacing one third by plant-derived proteins, and (c) replacing the remaining third by extensively produced meat (such as most beef and lamb). although in the netherlands intensively produced pork converts much food industry waste in a sustainable way, globally this is not a representative example. theoretically, the proposed threefold option has both environmental and health benefits. in practice, few consumers will be convinced immediately, but they may do so in the long run. at present, the environmental benefits may not appeal to many consumers. the well-known activist storyline of "global nature" under threat and in need of protection from a global community has become too simplistic. modern westerners do not tend to think in terms of one big environment that is the same for everyone. they want credible solutions that give them the feeling that they are "doing the right thing." in contrast, the fact that many people are no longer aware of the animal origin of meat indicates that there is an increasing indifference toward the origins of proteins. at first sight, this seems to open possibilities for npfs. some producers might be tempted to change the protein chain in a way that does not have to be noticed by the people who are eating it, which may create a substantial shift from meat to plant protein foods without much consumer involvement. on second thought, however, a low-involvement approach may not be the optimal strategy to pursue more sustainable food choices. if people are no longer aware of meat's animal origin, they will also be less inclined to transition feasibility and implications for stakeholders . pay attention to animal welfare. this may have negative consequences for attempts to stimulate sustainable agriculture by promoting high quality meat from well-treated animals or by simply eating less meat. an additional reason to not opt for a low-involvement approach refers to the societal value conflicts that are expected in many technology-related areas. in europe, the recent example of genetically modified food has shown that a lowinvolvement approach may backfire if people get the impression that they are part of a "hidden" transition. therefore, it is essential that all the people concerned are mindful of any transitions of food production methods. one of the ways to involve people is a discussion on personal health aspects, which have many links to the protein transition. in contrast to plantderived diets, meaty diets generally contain more saturated fats -associated with heart and coronary disease -and are sometimes associated with overconsumption of calories -leading to obesity. npfs may not only exert a beneficial effect on health indirectly, via these relationships, but maybe also directly. that plant proteins may provide such an effect is indicated, for example, by the claim approved by the american food and drug administration: "diets low in saturated fat and cholesterol that include grams of soy protein a day may reduce risk of heart disease" (fda claim cfr . , october ). proteins from crops other than soy seem to exert the same protective effect, according to the fda, though this protective effect does not go unchallenged. last but not least, npfs are complex foods for which the amount of calories can be set via the choice of ingredients, which may be an important tool in view of the increase in obesity. admittedly, certain plant-derived foods may generate more allergies than meat, but this affects a minority of people compared to the stifling incidence of heart and coronary disease, let alone the downright epidemic incidence of obesity ( million people afflicted worldwide). public health aspects, such as food safety, add to personal health aspects perceived by consumers. due to recent meat crises, both with chemical contaminants (such as dioxins, antibiotics, growth hormones) and pathogenic microbes (such as foot-and-mouth disease, avian influenza) european consumers are rather keen on food safety. other aspects -such as the proposed relationship between intensive pork and poultry production in south east asia and the increasing outbreaks of avian influenza -may not be topics consumers think about when they are shopping for food, but reminders of these issues may gradually induce "green" thinking by acknowledging the role of plants for improving their quality of life. the preceding chapters and sections have once again made it crystalclear how large the environmental burden of meat production is. conservatively estimated, it requires a - fold larger agricultural area and energy input and produces - fold more eutrophication than the production of plant protein. not only does meat production bring about over fold more acidification, but it also appropriates - fold more of the dwindling freshwater resources. in addition, it produces a lot more pollution by pesticides, heavy metals and antibiotics. as an entirely novel finding, profetas has clearly demonstrated that the protein transition is coupled inseparably to two other societal transitions, namely those towards sustainable energy production and towards sustainable water use (section . ). the freshwater link is evident from the resource difference indicated above. the energy production link primarily regards the release of agricultural area, which is freed for biomass production. furthermore, the considerable proportion ( - %) of non-protein biomass released as a byproduct during npf production may be utilised very efficiently for energy production. an evident case of win-win-win, with a triple environmental gain due to combined savings on protein, energy and water (section . ). concerning technological feasibility many questions have been answered, both in primary production, processing and chain development. generic as well as dedicated tools have been developed or are under development. among the generic tools is the crop growth model (which still has to be validated) and the model for chain design (which is still under development). applied knowledge concerning flavour-binding properties of pea proteins should be rated among the specialised tools. as might be expected, in addition to answers, many new questions have been raised as well (see chapter ). these concern, among others, the way in which process and ingredient selection may fulfil consumer wishes for npfs within certain target groups (sections . , . and . ). this puts forward three major challenges: how to obtain a wide range of textures by using plant proteins, how these textures are affected by ingredients other than proteins and how to obtain the desired flavour? next to questions regarding processing, also questions with respect to primary production remain to be answered. these questions concern among others the extent to which protein composition may be manipulated without affecting the viability of the plant in the field, and which crops are the most suitable to yield raw materials with the desired specifications (section . ). another major issue is the extent to which new transition feasibility and implications for stakeholders . conclusions developments in molecular biology and breeding may affect the suitability of crops for npf production. from the sustainability perspective, the societal desirability has been established beyond doubt, particularly concerning resource and pollution aspects (land, water and energy uses, and their implications for biodiversity loss and climate change). furthermore, increased availability of plant protein based foods will undoubtedly make an important contribution to food protein security, which is presently under increasing pressure from world-wide increasing meat consumption. doubtlessly, such a transition will have a significant impact on north-south relationships and the poverty issue in the world (section . ). concerning societal desirability it has also become apparent that different actors can hold different interests and opinions here (section . ). it has been clearly established that the consumer is the player who holds the key to a short-term protein transition and that western consumers currently rate health above sustainability. for consumer-oriented product development, much more interaction is necessary between product developers and various user groups, such as consumers leaning towards health, convenience or culinary traditions. although profetas was originally designed from the western perspective, it has become clear that in developing countries the incentives to achieve a protein transition are not just different (section . ), but generally much stronger (section . ). in china, for example, meat production generates pressure by its inability to meet the national demand on top of the pressure generated by severe local pollution. crops tailored to climatic (section . ) and cultural (section . ) characteristics are available. it can be concluded that, although the developed countries are primarily responsible for the unbalanced meat consumption referred to earlier, it is primarily the developing countries that are confronted with the effects (see section . ). the latter, therefore, may experience stronger and more direct incentives to strive for a protein transition. however, the required technological expertise may be available in developed countries mainly. at any rate, if mineral oil and meat prices continue to rise, particularly in developing countries there will be opportunities for the onset of a combined protein plus biomass transition. since the majority of the developing countries are particularly short of freshwater resources, the additional implicit water conservation will be considered an important bonus. in profetas, neither transitions, nor their feasibility have been studied directly. rather, insights and tools have been developed to facilitate a potential transition from meat protein to plant-derived protein products in the near future. these have been summarised in the present chapter. for the reasons outlined above we feel that a protein transition -reducing intensive livestock farming and cultivation of feed crops -is not just beneficial to the environment, but also more sustainable, most certainly socially desirable, and in the long run inevitable. it is yet unclear whether npfs will be able to replace meat, to what extent, on what scale internationally, and how rapidly. when the window of opportunity opens, however, it will be crucial to have the technology available for more sustainable alternatives. facilitating a transition does not seem easy. however, it is interesting to note that -while exclusively working on an approach to make protein production and consumption more sustainable -the profetas research community has yielded an integrated solution for various global problems far beyond its original scope. in addition to promising much more sustainable protein production -directly contributing to biodiversity and resource conservation and probably indirectly to animal welfare and human health -profetas simultaneously indicated realistic options to produce a significant amount of biomass for sustainable energy production, and to save an immense volume of freshwater. this suggests that integral thinking combined with an even further extension of the disciplines involved (including health aspects, in particular) may be a promising avenue to meet the foreseeable challenges of the next few decades. the evolution of h n influenza viruses in ducks in southern china increasing virulence of bird flu threatens mammals key: cord- -v lpw r authors: viktorovskaya, olga v.; greco, todd m.; cristea, ileana m.; thompson, sunnie r. title: identification of rna binding proteins associated with dengue virus rna in infected cells reveals temporally distinct host factor requirements date: - - journal: plos negl trop dis doi: . /journal.pntd. sha: doc_id: cord_uid: v lpw r background: there are currently no vaccines or antivirals available for dengue virus infection, which can cause dengue hemorrhagic fever and death. a better understanding of the host pathogen interaction is required to develop effective therapies to treat denv. in particular, very little is known about how cellular rna binding proteins interact with viral rnas. rnas within cells are not naked; rather they are coated with proteins that affect localization, stability, translation and (for viruses) replication. methodology/principal findings: seventy-nine novel rna binding proteins for dengue virus (denv) were identified by cross-linking proteins to dengue viral rna during a live infection in human cells. these cellular proteins were specific and distinct from those previously identified for poliovirus, suggesting a specialized role for these factors in denv amplification. knockdown of these proteins demonstrated their function as viral host factors, with evidence for some factors acting early, while others late in infection. their requirement by denv for efficient amplification is likely specific, since protein knockdown did not impair the cell fitness for viral amplification of an unrelated virus. the protein abundances of these host factors were not significantly altered during denv infection, suggesting their interaction with denv rna was due to specific recruitment mechanisms. however, at the global proteome level, denv altered the abundances of proteins in particular classes, including transporter proteins, which were down regulated, and proteins in the ubiquitin proteasome pathway, which were up regulated. conclusions/significance: the method for identification of host factors described here is robust and broadly applicable to all rna viruses, providing an avenue to determine the conserved or distinct mechanisms through which diverse viruses manage the viral rna within cells. this study significantly increases the number of cellular factors known to interact with denv and reveals how denv modulates and usurps cellular proteins for efficient amplification. dengue is a mosquito-borne viral disease that infects - million people annually, resulting in dengue fever that is either asymptomatic or flu-like. however, tens-of-thousands of people develop the more severe, and sometimes fatal, dengue hemorrhagic fever/shock syndrome (dhf/dss) [ ] . denv is found in most tropical and many subtropical areas with more than countries being endemic for denv [ ] . there is no approved vaccine or antiviral therapeutic available for this life-threatening disease. given the seriousness of infection, the expanding geographical range of the denv, and the limitations in the existing measures of control and prevention, there is a pressing need to better understand the biology and pathogenesis of denv. denv is a single-stranded positive-sense rna virus that belongs to the flaviviridae family. it has a ' cap, no poly(a) tail, highly structured '-and '-untranslated regions (utr), and a single open reading frame (orf) [reviewed in [ ] ]. following virus entry, the viral rna is released into the cytoplasm. viral translation and replication occur in membranous assembly "factories" localized in the perinuclear region of endoplasmic reticulum (er) [ ] . the positivestranded rna molecules are encapsidated; virions are further processed as they are transported through the secretory pathway to the cell surface and released extracellularly [reviewed in [ ] ]. in addition to the viral proteins, cellular proteins, termed host factors, participate in most, if not all, steps of the denv life cycle, including entry, translation, replication, virion assembly, and release [ ] . since viruses require host factors for efficient amplification, targeting host factors can provide an effective antiviral target for which the virus has no genetic control over. therefore, it may be more difficult for viruses to evolve escape mutants that can replicate efficiently in the absence of the host factor [ , ] . several cellular proteins are known to impact denv infection. for example, the polypyrimidine-tract-binding protein (ptbp ) is relocalized from the nucleus to the cytoplasm following denv infection where it enhances denv amplification by binding to the denv 'utr and to ns a, a viral protein required for the formation of the viral replication complex [ ] [ ] [ ] [ ] . ptbp may also stimulate denv translation [ ] , although this is still controversial [ , ] . while most of the known denv rna binding proteins enhance viral amplification, several reduce denv titers [ ] [ ] [ ] . one such factor, ybx , inhibits viral translation [ ] . although previous studies have laid a foundation for establishing critical interactions between viral rna and cellular proteins [ [ ] and reviewed in [ ] ], the host factors identified thus far likely represent only a fraction of the total network of denv host factors. previously, we have described a high-throughput mass spectrometry method termed tux-ms (thiouracil cross-linking mass spectrometry) to identify host factors that interact with viral rna during a live infection in cell culture [ ] . importantly, tux-ms allows for identification of proteins that are bound directly to the viral rna in living cells. briefly, during a viral infection in cell culture, thiouridine is biosynthetically incorporated into the viral rna to serve as a zero-distance cross-linker upon exposure to ultraviolet (uv) light. thus, proteins that are bound directly to the viral rna during a live infection are cross-linked to the rna prior to disruption of cellular compartmentalization. this is particularly valuable for the identification of denv host proteins, since denv amplification is tightly associated with cellular membrane structures [ ] . following cross-linking, the viral rna together with cross-linked proteins are isolated under denaturing conditions and identified by mass spectrometry-based proteomics. using tux-ms, we reported previously the successful identification and validation of host factors for poliovirus, pointing to a low false discovery rate of < % [ ] . here, we expanded the tux-ms methodology for use with other types of rna viruses, and investigated rna-protein interactions during denv infection. we modified the method to use virus-specific dna oligos to capture the viral rna and cross-linked proteins. furthermore, we used metabolic labelling with stable isotopes to accurately quantify relative protein levels. this quantitative thiouridine cross-linking mass spectrometry (qtux-ms) analysis identified novel host proteins, which were not previously shown to be involved in denv infection. we placed these findings in the context of whole proteome changes upon denv infection, and further validated and functionally analysed a subset of the novel denv host factor candidates. overall, validation of the qtux-ms identified factors using secondary assays indicates a low rate of false positives ( %), suggesting that the majority of the other identified qtux-ms factors may also play significant roles in denv viral amplification. hela uprt cells expressing uracil phosphoribosyltransferase (uprt) were generated previously by transduction of hela cells (atcc, ccl- ) with uprt-gene containing lentivirus [ ] . huh . uprt cells were generated by transduction of huh . cells (a kind gift from charles m. rice, rockefeller university) with the same lentiviral construct as in [ ] . hela uprt and huh . uprt cells were cultured at °c and % co in complete dulbecco's modified minimum essential medium (dmem) supplemented with % fbs (fetal bovine serum; atlanta biologicals) and penicillin-streptomycin and grown with mg/ml g (sigma) to select for the uprt gene; huh . uprt cells were additionally supplemented with non-essential amino acids (cellgro). for silac labelling huh . uprt cells were passed : at least twice in silac dmem (thermo scientific) with % dialyzed heat-inactivated fbs (thermo), l-proline ( mg/l) and penicillin-streptomycin, and either mg/l 'heavy' ( c l-lysine and c - n l-arginine; cambridge isotope laboratories, inc.) or mg/l 'light' l-lysine and l-arginine amino acids [ ] . dengue virus serotype (denv ), strain (genebank accession number u ) was kindly provided by dr. robert striker (university of wisconsin-madison). denv was propagated in the mosquito c / cells at °c and % co in advanced dmem supplemented with % fbs, penicillin-streptomycin (cellgro), l-glutamate and % tryptose phosphate broth ( g/l tryptose; g/l glucose; g/l sodium chloride and . g/l disodium hydrogen phosphate). titers for denv were determined using plaque assays in bhk cells. for infections, cells were incubated with virus containing media for hours, washed twice with the dmem media after removal of the virus and incubated in the serum-free dmem for the indicated time. infections and titer determination of adenovirus (ad ) were performed exactly as in [ ] . the denv antisense biotin-labelled dna fragments were generated using pcr and primers listed in (s table) from the denv complementary dna (cdna) and correspond to positions - and - of denv genome. pcr was followed by removal of the unlabelled dna strand according to the nanolink streptavidin magnetic beads (solulink) manual. the mixture of two biotinylated single-stranded dna fragments of base pairs (bp) and bp long were bound to nanolink streptavidin magnetic beads magnetic beads according to the manufacturer's protocol. - x human hepatoma huh . uprt cells labelled with either 'light' or 'heavy' silac media were infected with denv (moi = ) or mock-treated, respectively. then, virus was replaced with silac media with mm -thiouracil and % fbs. at hours post-infection (hpi) the cells were washed with pbs and irradiated at nm uv light for min, collected, cell pellets were frozen on dry ice and stored at - °c. cell pellets were lysed in the lysis buffer ( mm tris-hcl ph , mm magnesium chloride, mm sodium chloride, . % tween- , mm dithiothreitol, recombinant rnasin ribonuclease inhibitor [ units/ml; promega], × complete protease inhibitor cocktail [roche]), with a half volume of - μm glass beads (sigma) by shaking at frequency of hz for min using a retsch mm mixer mill. an aliquot of 'light' and 'heavy' cell lysates were removed and the remaining lysates were incubated with the streptavidin magnetic beads labelled with denv antisense dna fragments for to min allowing for viral rna to bind. the beads were washed twice with wash buffer i ( mm tris-hcl ph , mm potassium chloride, . % tween- ), once with wash buffer ii ( mm tris-hcl ph , mm sodium chloride, . % sodium deoxycholate) and once with mm tris ph . . the samples were eluted at °c for min in μl of mm tris ph . . the eluted 'light' and 'heavy' samples were mixed at a : ratio, rna was degraded with . ng/μl bovine rnase a (fisher scientific) at °c for hrs and subjected to quantitative mass spectrometry-based proteomic analysis. quantitative mass spectrometry-based proteomic analysis of rnainteracting proteins (qtux-ms) protein eluates and their respective mixed light/heavy input lysates were subjected to in-solution enzymatic digestion using a filter-aided sample preparation approach [ ] , then analyzed by nlc-ms/ms, as described in s methods and in [ ] . light (denv ) and heavy-labelled (mock) cell pellets were lysed in mm abc containing % sodium deoxycholate at °c to ensure denaturation and virus inactivation. the protein concentrations were determined by the bca assay and mixed in equal protein amounts ( μg total). proteins were subjected to in-solution digestion with trypsin, then fractionated and analyzed by nlc-ms/ms as described in the s methods. proteomic data analysis qtux-ms, its respective mixed input lysate, and the whole cell silac instrument raw data were separately processed using the maxquant software (ver. . . . ), configured with default settings, except for experiment-specific parameters, which are described in the s methods. the mass spectrometry proteomics data have been deposited to the proteomexchange consortium via the pride [ ] partner repository with the dataset identifier pxd . using the filtered list of protein identifications, unique gene symbols were used for downstream functional ontology analyses. the gene ontology annotations from uniprot were used to generate and assign the denv rna interacting candidates into broader functional categories. for the whole cell silac protein expression study, genes and their associated ratios were analyzed by panther gene enrichment (panther database ver . , - - ) [ ] using the panther pathway and protein class ontologies. significant differential protein abundance was determined as a function of ontological classification versus the overall population (bonferroni-corrected p-value < . ). for specific functional protein ontologies that were differentially regulated, a subset were selected for analysis by the reactome functional interaction (fi) network cytoscape plugin [ ] . the reactome fi plugin was used to visualize candidate host factors identified by qtux-ms. sirna transfections x hela uprt cells were transfected with pmol of silencer select negative control (ambion) or the specified sirnas (s table) using the xtreamgene sirna transfection reagent system (roche). hrs later, x cells/well were plated for infection. hours post transfection cells were infected (moi = . ) with denv or ad (n = ). at either (ad ) or (denv ) hpi, virus titers were determined by plaque assays using or bhk cells, respectively. knockdown efficiency was measured hrs post transfection by rt-qpcr. experiments were performed in two biological replicates for each host factor. statistical analysis was performed using student's t-test. cell viability and proliferation assay hrs post sirna transfection equal numbers of cells (either x or x ) were seeded in -well plates. cell viability was measured hours after sirna-mediated knockdown of individual host factors using the vybrant mtt [ -( , -dimethyl- -thiazolyl)- , -diphenyl- h-tetrazolium bromide] assay kit (invitrogen) according to the manufacturer's protocol and reported relative to the negative-control sirna (set to %; n = ). statistical analysis was performed using student's t-test. cdna was generated from μg trizol (ambion) purified total rna using moloney murine leukemia virus (mmlv) reverse transcriptase (promega) as described by the manufacturer using random primers (invitrogen). qpcr was performed using iq sybr green supermix (bio-rad) with the primers listed in the s table. the amplification efficiency for each primer set was ± % as determined using a standard curve. development of a quantitative thiouridine cross-linking mass spectrometry (qtux-ms) method for identification of proteins associated with the denv rna tux-ms can be used to identify host factors by incorporating -thiouridine ( su), a zero-distance cross-linker, into the viral rna (vrna) to enable cross-linking of proteins bound to vrna during a live infection in cell culture [ ] . cross-linking is carried out under physiological conditions prior to cell lysis to ensure specificity and reduce false-positives from non-specific rna protein interactions that occur upon loss of compartmentalization. vrna is isolated under denaturing conditions and cross-linked proteins are identified using liquid chromatography-tandem mass spectrometry (lc-ms/ms). to improve quantification of the tux-ms identified host proteins (qtux-ms), a silac (stable isotope-labelled amino acids in cell culture) approach [ ] was used to label the uninfected (mock) and infected cells with either 'heavy' or 'light' amino acids, respectively ( fig a) . when -thiouracil ( tu) is present in the medium, huh . uprt human hepatoma cell lines stably expressing uprt (uracil phosphoribosyltransferase) convert tu to ump. then, the ump is converted to thiouridine triphosphate ( sutp) by cellular kinases [ ] . both cellular and viral rna polymerases use sutp as a substrate during rna synthesis, which serves as a zero-distance cross-linker, covalently binding proteins to rnas upon exposure to long wave uv-light. importantly, protein-protein crosslinking is very inefficient at long uv wavelengths, ensuring that only proteins in direct contact with the reactive thiol group of the su-containing rna will be cross-linked [ ] . we have shown previously that immunoisolation of candidate vrna-binding proteins identified by tux-ms (and confirmed by western) could be specifically co-isolated with viral rna [ ] . together, this study established that tux-ms can identify bona fide interactions between host proteins and viral rna. the tux-ms method was originally developed to capture polyadenylated rna using oligo(dt) beads [ ] . however, as denv rna is not polyadenylated, we modified the method to use sequence specific capture of the vrna using magnetic beads. following crosslinking in huh . uprt cells infected with denv at hpi and affinity capture of the vrna, the ribonucleoprotein complexes were eluted from the beads, 'heavy' and 'light' eluates were mixed, and rnase a was used to degrade the rna. the proteins were digested insolution with trypsin and subjected to quantitative ms-based proteomics ( fig b) . the median 'light' to 'heavy' peptide and protein ratios were calculated, reflecting the specificity of vrnaprotein capture. we identified several classes of proteins, including denv proteins, known denv host factors, and putative rna-interacting host proteins, but most of the qtux-ms identified factors have not been previously identified through interactions with denv (s table) . consistent with previous knowledge of flaviviral rna [ ] [ ] [ ] [ ] , our qtux-ms analysis identified several viral proteins-c, e, ns , ns a and ns -as associated with vrna. in addition, qtux-ms identified six known denv host factors: polypyrimidine tract-binding protein (ptbp ), interleukin enhancer-binding factor (ilf ), calreticulin (calr), calnexin and heterogeneous nuclear ribonucleoproteins hnrnp h and hnrnp k, as well as a known denv anti-viral protein-eukaryotic initiation factor a (eif a)-and other proteins previously shown to associate with denv rna or proteins (s table) . altogether, since several known host factors were identified using qtux-ms, this suggests that the adapted method is effective at identifying host factors for denv. for identification of novel host factors, cellular proteins with a denv/mock silac ratio of . -fold (n quantified peptides) were considered putative denv vrna interactions. this threshold was selected when considering the median variance in the silac ratio (for (blue) amino acids are infected with denv or treated with mock, respectively, in the presence of tu. su is incorporated into cellular and denv rnas and proteins are uv cross-linked to the contacting thio-containing rna (represented as either balls to indicate native conformation or curved lines to indicate denatured proteins) in living cells at hpi prior to cell lysis under denaturing conditions. viral ribonucleoprotein complexes were isolated using dna molecules complementary to denv rna bound to magnetic beads, the rna was degraded with rnase a and the proteins were identified by mass spectrometry. (b) workflow for quantitative proteomic analysis of rna-bound host factors isolated in (a). isolated proteins were mixed between mock and virus-infected samples, digested into peptides, and analysed by mass spectrometry. relative 'light' and 'heavy' peptide abundances were quantified to determine the specificity of interaction. host factor candidates were identified and subjected to functional validation. doi: . /journal.pntd. .g proteins with > quantified peptides), which was approximately %. therefore, we opted for a conservative cut-off at %, representing twice this median value or . -fold. common environmental and non-human cell culture contaminants were excluded, since they existed in only the 'light' silac state. in addition, our qtux-ms samples also contained histones: h , h , h a, h b, h . and macroh a. . in a previous study, histones were shown to play roles in dengue infection [ ] . however, their functions were mediated through an interaction with a viral capsid protein and were shown to be independent of rna. in addition, histones are primarily nuclear and highly abundant proteins commonly detected (> %) in control affinity purifications compiled across diverse protein-protein interaction studies [ ] . for these reasons, histones likely represent non-specific associations rather than denv rna binding factors, and thus were excluded from further analysis. in total, cellular proteins passed these selection criteria, of which have not been previously shown to associate with denv (s table, s dataset). several of the known denv host factors were enriched but did not meet the stringent inclusion criteria (s table) , suggesting that there may be additional host factors below our enrichment threshold (see s dataset). importantly, the subset of host factors represents a significant potential expansion in the number of known denv host factors, providing a valuable resource to test for pro-viral or antiviral activities during denv infection. it is well recognized that viral infections can induce significant changes in cellular proteomes [ ] [ ] [ ] and an increase in protein levels during denv infection may contribute to the increased protein capture measured by qtux-ms. to address this, we used silac-ms to quantify proteome (i.e., total protein abundance) changes following denv infection. comparison of qtux-ms and proteome silac ratios showed that the protein abundances for the qtux-ms-identified vrna-binding factors remained largely unchanged (fig a, s fig and s dataset). on average, for these proteins, the denv-induced changes in whole cell abundance were ± . -fold, suggesting that their identification by qtux-ms was not due to an increase in their abundance in the cell following denv infection. noteworthy, a retrospective qualitative comparison of the qtux-ms identified factors for denv with those identified in the tux-ms analysis on poliovirus revealed less than % were identified for both viruses [ ] ( fig b) . since the identified proteins are largely denv specific, qtux-ms is not biased towards identifying a sub-set of cellular rna binding proteins. taken together, our results indicate that the enrichment ratios measured by qtux-ms is predominantly due to their specific association with the denv rna. while proteins that bound denv rna did not show significant changes in abundance upon infection, we performed bioinformatics analysis on the complete proteome dataset of whole cell abundance to determine the global proteome effects of denv infection under these conditions. in total, , host proteins were quantified by silac ms in biological duplicates (s dataset). the abundance ratios between biological duplicates were reproducible, with only~ % of the ratios varying by > % (fig a) . from these duplicates, an average abundance ratio was calculated and the respective proteins were analyzed by panther gene enrichment analysis (s fig) [ ] . this analysis found systematic up regulation of proteins in the ubiquitin proteasome pathway (upp), comprising members of the s proteasome as well as various ubiquitinconjugating enzymes (s fig). many of these enzymes are linked to ubiquitin-dependent proteasome degradation, consistent with the current knowledge that the upp is important for production of infectious virions [ , , ]. yet, other enzymes, such ube n and ube v , catalyze polyubiquitination at lys- , which does not lead to proteasome degradation but rather participates in transcriptional activation of target genes and may promote innate immune signaling [ , ] . in contrast, proteins in the transporter protein class were on average down regulated (fig b and s fig). assembly of the annotated proteins into reactome protein networks identified several subnetworks with various transporter activities (s fig). while the abundances of mitochondrial transporters and nucleoporins were the most consistently decreased, not all transporters were down regulated; for example, the lipoprotein (apo) transporters were increased in expression (fig c) . interestingly, the rna binding protein class was significantly down regulated (fig b and s fig) , though the effect was not as pronounced as the transporter class (fig b) . the overall down-regulation of rna binding proteins appears to be driven by changes in cytoplasmic and mitochondrial ribosomal subunits, and proteins involved in rna degradation and processing (s fig). nevertheless, the relative protein abundance for the set of (known and putative) denv binding factors identified by qtux-ms was largely unchanged, despite being enriched in rna processing and translation factors (fig a) . overall, the quantitative proteome analysis suggests that denv selectively alters the abundance of proteins, and reveals several pathways that could be directly or indirectly modulated in the host response to denv infection. to gain insight into potential molecular mechanisms and biological processes of the qtux-ms identified factors, we performed a functional network-based analysis using the curated human pathway relationships from the reactome database. this analysis revealed a high degree of connectivity, with proteins forming a large interconnected network (fig ) . the densest network connectivity included proteins involved in rna processing/translation (orange nodes) and dna binding/transcription (blue nodes). several additional proteins with rna and/or translational activities were also identified, but lacked annotation in reactome (orange single nodes). overall, our bioinformatic evaluation further supports the ability of qtux-ms to capture vrna-bound host factors and points to their possible function in denv amplification. since most of the factors were associated with rna processing in the reactome analysis (fig ) , we focused on these factors for functional analysis of their roles in dengue infection. we have randomly selected six qtux-ms identified host proteins with functions in rna processing/translation, which were enhanced in the denv sample ranging from . -to . -fold (denv/mock). thus, by validating factors that are only modestly enhanced in the qtux-ms analysis this will indicate if the qtux-ms identified factors that are near the cut-off of . -fold are bona fide host factors or not. we assessed the effect of sirna knockdown of these factors (fig a) on viral production. hela uprt cells were used for sirna silencing experiments due to higher sirna transfection efficiency compared with huh . uprt cells. knockdown of five out of six qtux-ms identified candidates: non-pou domain-containing octamer-binding protein (nono), embryonic stem cell-specific -hydroxymethylcytosine-binding protein (hmces), rbmx (rna-binding motif protein, x chromosome), hnrnp m and hnrnp f significantly decreased denv production, while knockdown of hnrnp l had no effect on denv titer ( fig b; s fig) . knockdown of ptbp , a positive control [ , ] , also resulted in decreased viral titers (fig ) . for negative controls, we selected two rna binding proteins (ddx and hnrnp a ) that were not identified by qtux-ms. denv titers were not altered following silencing of these two proteins, suggesting that only specific rna binding proteins are used by denv. altogether, our data suggests that rbmx, nono, hmces, hnrnp m, hnrnp f are required for viral production. these results demonstrate that qtux-ms is a robust method with a low false discovery rate for high-throughput identification of viral host factors. reduced viral amplification could be due to compromised cell fitness rather than a specific requirement of the virus for a particular host factor. using an mtt assay, we confirmed that knockdown of these factors did not impact cell viability (fig c) . as a positive control, knockdown of g bp did reduce cell fitness, as previously shown (s fig) [ ] . to more rigorously rule out any potential effects of host factor knockdown on cell fitness that would affect viral amplification, an unrelated virus (adenovirus ), was amplified following knockdowns of the candidate factors. adenovirus replication was not significantly decreased by rbmx, nono, hnrnp m, hnrnp f or hmces sirna knockdown (fig b and s c fig) demonstrating that knockdown of these factors does not affect cell fitness for viral amplification. given that these proteins bind directly to viral rna and are required for viral amplification, rbmx, nono, hnrnp m, hnrnp f and hmces are novel denv host factors. to determine if the host factor is required for a step prior or subsequent to viral replication, vrna was quantified by qrt-pcr in the denv infected cells knocked down for rbmx, nono, hnrnp m, hnrnp f or hmces (fig ) . as a positive control, knockdown of ptbp , which is required for denv replication [ ] , reduced denv rna levels. similarly, the intracellular vrna levels were reduced in cells knocked down for either hnrnp f, rbmx or hmces. the decrease in dengue rna levels (fig ) is consistent with the decrease in viral titers ( fig b) . thus, hnrnp f, rbmx or hmces are required for the early steps in the viral replication cycle, such as translation, replication or rna stability. in contrast, knockdown of hnrnp m and nono did not change intracellular viral rna levels despite the dramatic decrease in viral titers, suggesting they may play a role downstream of replication. altogether, we have identified and validated five novel host factors for denv, demonstrating that qtux-ms can identify factors that function at different stages of the virus life cycle. an estimated % of the world's population is at risk from dengue for which vaccines or antivirals are not yet available. since diagnostic tests can detect denv infection at early stages, administration of antivirals could significantly improve survival rates as viral load is correlated with symptom severity [ ] . using antivirals that target host factors may limit the appearance of drug-resistant viruses and may be effective for all denv serotypes and possibly for multiple flaviviruses [ , , , ] . the qtux-ms analysis identified novel cellular proteins, for which the majority are distinct from those previously identified for poliovirus using a similar approach [ ] . this suggests that unrelated rna viruses have evolved to utilize distinct host rna binding proteins. interestingly, ptbp and nono, which were identified in both the poliovirus and denv tux-ms analyses, were shown to be required for production of both viruses (this study, figs and ) [ , , ] . further analysis of virus-specific and shared host factors will reveal whether unrelated viruses utilize similar or diverse mechanisms to control viral rna replication, processing and packaging within cells. the host factors that enhance amplification of both poliovirus and dengue, (fig b) could serve as attractive targets for the development of broad-spectrum antivirals. the novel denv rna interactions identified in our study reveal a large network of cellular proteins which belong to different functional classes primarily associated with the nucleic acid metabolism, including numerous components of splicing, rna processing and translation machineries. these factors likely play direct roles in denv translation, replication or packaging. in addition, we have detected multiple components of cell signaling and stress response, such as several members of - - adapter proteins, heat shock proteins and β-catenin. these factors are known to regulate diverse pathways, including host innate immune and cellular homeostasis [ ] [ ] [ ] suggesting their possible role in host antiviral response or viral strategies to subvert the innate immune response. we demonstrated that the majority of the qtux-ms factors that we selected for validation were required for efficient denv amplification (fig ) . specifically, we found that hnrnp f, hmces and rbmx are required for the early steps in the viral life cycle. in contrast, hnrnp m and nono appear to act downstream of viral rna replication (fig ) , which may be significant given that both have been shown to be in a complex together [ ] . nono and its binding partners are predominantly nuclear, bind rna, and are involved in pre-mrna processing, splicing, and rna transport, as well as in transcriptional activation and repression [ ] [ ] [ ] . interestingly, other nono binding partners: psf/sfpq (polypyrimidine tract-binding protein (ptb)-associated splicing factor) and matrin were identified by qtux-ms as well. many of the qtux-ms identified cellular proteins are hnrnps, which encompass a large class of rna binding proteins that either localize to the nucleus or shuttle between the nucleus and the cytoplasm in order to perform multiple functions in rna metabolism, from transcription to rna turnover [ ] [ ] [ ] . importantly, the vast majority of these factors have established roles in viral infections or in modulating the antiviral host response to various viruses [ ] [ ] [ ] [ ] , including denv (s table and references within). our study establishes that rbmx (hnrnp g), hnrnp f and hnrnp m are required for efficient denv amplification (fig b) . since several hnrnps, such as ptbp (hnrnp i), hnrnp a and hnrnp k, were previously shown to re-localize from the nucleus to the viral replication sites during denv infection [ , , ] , rbmx, hnrnp f, hnrnp m and possibly other nuclear qtux-ms identified factors are also likely to be either actively recruited to the viral replication sites or retained in the cytoplasm upon denv infection. interestingly, the qtux-ms identified hnrnps affect different steps in the denv life cycle (fig ) , suggesting that they have distinct functions during infection. this is consistent with studies that have suggested that denv rna structures are dynamic during the viral life cycle [ ] and may suggest that host factors play an important role in these structural changes. furthermore, we showed that knockdown of some hnrnps (hnrnp a and hnrnp l) did not affect dengue viral titers significantly demonstrating that only certain hnrnps are required for denv amplification. since some of hnrnps are known to modulate cellular gene expression in response to dengue infection [ , ] , we can not rule out that some of the observed effects on virus titers derive from their roles in regulating host mrnas. among the numerous qtux-ms identified factors of interest, our study is the first to demonstrate the involvement of hmces (or c orf ) in viral infection. while the cellular role of human hmces is currently unknown, the mouse homologue was suggested to be an rnabinding protein and predicted to contain a putative peptidase domain [ , ] . interestingly, it is possible that the nucleic acid binding domain enhances the protease activity or visa versa as has been shown for other such proteins [ ] [ ] [ ] . for example, adenovirus uses a nucleic acid binding protease to localize the protease activity to the viral substrates [ ] . it has been suggested that the protease is recruited to the empty capsid as an inactive protease, then it becomes fully activated once bound to the viral dna inside the virion. using the dna as a guide wire, it moves along the nucleic acid, searching for capsid and core proteins to cleave, which is required to render the viral particle infectious [ , , ] . however, since we observed that knockdown of hmces resulted in a decrease in viral rna, it seems more likely that it might participate in translation, replication or the switch from translation to replication as has been shown for other nucleic acid binding proteases [ , ] . only one of the qtux-ms identified factors was increased at the protein level in whole cells following denv infection, suggesting that qtux-ms identifications derived from the specific associations to vrna rather than changes in protein abundances. our analysis of the host cell proteome upon infection also revealed interesting changes, including up regulation of proteins in the ubiquitin pathway and down regulation of transporter proteins. the ubiquitin-proteasome pathway is one of two major cellular pathways used to degrade to % of proteins. previous studies on denv infected cell lines and patient samples showed that the ubiquitin pathway was upregulated [ , , ] . many groups have consistently shown that the ubiquitin proteasome pathway is critical for amplification of a number of flaviruses, including denv and west nile virus [ , , , ] . however, it remains controversial as to which step in viral amplification is affected by ubiquitination, but it appears to be early during internalization or viral genome release [ , , ] . further studies will be required to understand how denv up-regulates the pathway and the mechanism that ubiquitination has in denv amplification. altogether, our study has significantly increased the number of cellular proteins known to interact with the denv rna during a live infection in cells. we have also placed these interactions in the context of proteome abundance changes in the infected cells. a recent study by phillips and colleagues [ ] exploited a cross-linking label-free ms approach to identify denv rna associating proteins in cell culture by cross-linking the rna to the proteins using short wavelength uv light and isolating denv rna bound proteins by anti-sense dna affinity capture [ ] . while their method identified several denv host factors [ , [ ] [ ] [ ] , the qtux-ms method reported here resulted in improved identification of known dengue host factors and putative denv rna interacting proteins. there could be several reasons for these results, such as, the qtux-ms approach achieves greater cross-linking efficiency by using long wave uv light to form crosslinks to -thio-uridines compared to short-wavelength uv light, which is inherently inefficient [ ] . moreover, since thio-uridine is a zero distance cross-linker for rna-bound proteins at long uv wavelengths and protein-protein cross-links are not formed at long uv wavelengths, qtux-ms may also have achieved improved specificity, as only proteins in direct contact with the viral rna would be captured [ ] . additionally, qtux-ms used an ms-based silac approach to determine which host proteins were specifically enriched in the vrna isolations versus mock. though isotope-labelling is not applicable in all model systems, it does afford greater quantitative accuracy compared to label-free ms strategies [ ] . overall, the qtux-ms method identified cellular proteins that bind to denv rna, which include previously known or putative interactions. importantly, five out of the six qtux-ms identified novel factors that were tested were shown to be bona fide host factors. we used robust assays to show that the identified host factors were specifically required for denv amplification and did not merely result in a decrease in cell fitness for viral amplification. future studies will reveal whether the identified factors may also be required for other flavivirus infections that cause life-threatening illnesses, such as yellow fever, west nile, zika, japanese and tick-borne encephalitis. therefore, our data demonstrates that qtux-ms is an effective technique for identifying novel virus host factors that can be used for a broad spectrum of rna viruses by simply designing antisense dna oligonucleotides to allow for efficient sequence-specific isolation of the vrna. supporting information s table containing all proteins quantified by qtux-ms, including proteins enriched in denv infection (red highlighted rows) and those that did not meet the specificity threshold. for each protein group, the following are provided (from left to right), uniprot accession number, gene name, protein name, the linear and log denv/ mock qtux-ms enrichment ratios (columns d and e), the qtux-ms ratio variability, the number of qtux-ms quantified peptides, the average log denv/mock whole cell relative abundance silac ratio, whether this ratio was up (u) or down (d) regulated by more than ± . -fold, the average silac ratio variability, the average number of quantified peptides, the number of razor+unique peptides and sequence coverage for qtux-ms, the protein's molecular weight and sequence length, and the complete fasta header entry for the primary protein group member. columns h-k are cross-referenced from the respective whole cell data (b). nd = not detected. (b) table containing all proteins quantified in whole cell lysates by silac. for each protein group, the following are provided (from left to right), uniprot accession number, gene name, protein name, the log denv/mock relative abundance ratios for replicates , , and the average (columns d and e), whether this ratio was up (u) or down (d) regulated by more than ± . -fold. for each replicate, the following are provide: the silac ratio variability, the number of quantified peptides, the number of razor+unique peptides, and the number of unique peptides, total protein intensity for light (denv) and heavy (mock) conditions, and the overall sequence coverage. the complete fasta header entry is listed for the primary protein group accession number. (xlsx) s (a) hela uprt cells were transfected with either control or specific sirnas. hours post transfection cells were counted and seeded ( x cells/well) in -well plates in triplicates. hours post transfection cells were infected with denv at moi . and virus released to the media collected hours post infection. denv titers were measured using plaque assays. the bars represent average values from triplicate, a standard error is reported. the representative data from one of at least two independent experiments is shown. (b) to verify sensitivity of the mtt assay, we performed sirna knockdown of g bp , which is known to bind denv and was previously shown to affect cell viability [ , ] . relative viability of non-infected cells was measured using an mtt assay (invitrogen) and represented exactly as described in fig c. cell viability or proliferation is decreased by knockdown of g bp compared with control sirna (p< . ). sirna transfection was performed as described in the methods section using previously published sirna sequence [ ] . in cells treated with g bp -specific sirna but not control sirna, g bp protein was knocked down to the levels undetectable by western analysis using antibodies against g bp (abcam, ab ). (c). hela uprt cells were seeded . x cells in mm plates and transfected with either control or specific sirnas. hours post transfection cells were infected with ad at moi . and collected hours post infection. sirna knockdown efficiency was determined by measuring respective mrna levels in comparison to β-actin mrna abundance as described in the experimental section and reported relative to control sirna transfection (upper panel). ad titers were determined using plaque assays on cells. the bars represent average values from triplicate, a standard error is reported. the representative data from one of at least three independent experiments is shown. (tif) the global distribution and burden of dengue perspectives for the treatment of infections with flaviviridae dengue virus life cycle: viral and host factors modulating infectivity. cellular and molecular life sciences: cmls composition and three-dimensional architecture of the dengue virus replication and assembly sites targeting a host process as an antiviral approach against dengue virus targeting host factors to treat west nile and dengue viral infections the polypyrimidine tract-binding protein is required for efficient dengue virus propagation and associates with the viral replication machinery polypyrimidine tract-binding protein is relocated to the cytoplasm and is required during dengue virus infection in vero cells polypyrimidine tract-binding protein influences negative strand rna synthesis of dengue virus translation elongation factor- alpha, la, and ptb interact with the ' untranslated region of dengue virus rna eukaryotic initiation factor ai interacts with ns a of dengue virus and plays an antiviral role. biochemical and biophysical research communications y box-binding protein- binds to the dengue virus '-untranslated region and mediates antiviral effects identification of proteins bound to dengue viral rna in vivo reveals new host proteins important for virus replication revisiting dengue virus-host cell interaction: new insights into molecular and cellular virology thiouracil cross-linking mass spectrometry: a cell-based method to identify host factors involved in viral amplification stable isotope labeling by amino acids in cell culture, silac, as a simple and accurate approach to expression proteomics universal sample preparation method for proteome analysis the functional interactome landscape of the human histone deacetylase family. molecular systems biology update of the pride database and its related tools panther version : expanded protein families and functions, and analysis tools reactomefiviz: a cytoscape app for pathway and network-based data analysis biosynthetic labeling of rna with uracil phosphoribosyltransferase allows cell-specific microarray analysis of mrna synthesis and decay photo-leucine and photo-methionine allow identification of protein-protein interactions in living cells subcellular localization and some biochemical properties of the flavivirus kunjin nonstructural proteins ns a and ns a japanese encephalitis virus nonstructural protein ns has rna binding and atpase activities rna sequences and structures required for the recruitment and activity of the dengue virus polymerase structural analysis of viral nucleocapsids by subtraction of partial projections dengue virus capsid protein binds core histones and inhibits nucleosome formation in human liver cells the crapome: a contaminant repository for affinity purification-mass spectrometry data high-throughput quantitative proteomic analysis of dengue virus type infected a cells the ubiquitin-proteasome pathway is important for dengue virus infection in primary human endothelial cells proteomic analysis of host responses in hepg cells during dengue virus infection dengue virus genome uncoating requires ubiquitination. mbio the ubiquitin-conjugating system: multiple roles in viral replication and infection activation of the ikappab kinase complex by traf requires a dimeric ubiquitin-conjugating enzyme complex and a unique polyubiquitin chain trim is an innate immune sensor for the retrovirus capsid lattice gap targets and downregulates g bp to suppress cell growth and potentiate cisplaitin-mediated cytotoxicity to colon carcinoma hct cells dengue viremia titer, antibody response pattern, and virus serotype correlate with disease severity. the journal of infectious diseases rna interference screen for human genes associated with west nile virus infection dengue virus evolution under a host-targeted antiviral a cytoplasmic -kda protein that is required for translation of picornavirus rna by internal ribosomal entry is identical to the nuclear pyrimidine tract-binding protein translation of polioviral mrna is inhibited by cleavage of polypyrimidine tract-binding proteins executed by polioviral c(pro) genome-wide rnai screen reveals a new role of a wnt/ctnnb signaling pathway as negative regulator of virus-induced innate immune responses the role of the - - protein family in health, disease, and drug development. drug discovery today virus-heat shock protein interaction and a novel axis for innate antiviral immunity hnrnp m interacts with psf and p (nrb) and co-localizes within defined nuclear structures psf and p (nrb)/nono-multi-functional nuclear proteins kinesin transports rna: isolation and characterization of an rnatransporting granule the multifunctional protein p nrb/psf recruits the exonuclease xrn to facilitate pre-mrna ' processing and transcription termination advances in experimental medicine and biology hnrnp complexes: composition, structure, and function. current opinion in cell biology hnrnp proteins and the biogenesis of mrna. annual review of biochemistry hnrnps relocalize to the cytoplasm following infection with vesicular stomatitis virus rna-rna and rna-protein interactions in coronavirus replication and transcription hnrnp l and nf interact with hepatitis c virus '-terminal untranslated rna and promote efficient replication high-affinity interaction of hnrnp a with conserved rna structural elements is required for translation and replication of enterovirus nf binds the dengue virus rna ' terminus and is a positive regulator of dengue virus replication the heterogeneous nuclear ribonucleoprotein k (hnrnp k) is a host factor required for dengue virus and junin virus multiplication dynamic rna structures in the dengue virus genome the heterogeneous nuclear ribonucleoprotein k (hnrnp k) interacts with dengue virus core protein dengue virus infection induces upregulation of hn rnp-h and pdia for its multiplication in the host cell novel autoproteolytic and dna-damage sensing components in the bacterial sos response and oxidized methylcytosine-induced eukaryotic dna demethylation systems the rna-binding protein repertoire of embryonic stem cells human adenovirus proteinase: dna binding and stimulation of proteinase activity by dna rna binding by the ns protease of the hepatitis c virus dna and rna binding by the mitochondrial lon protease is regulated by nucleotide and protein substrate the active adenovirus protease is the intact l k protein processing of the l / k protein by the adenovirus protease: a new substrate and new insights into virion maturation cellular protein modification by poliovirus: the two faces of poly(rc)-binding protein host gene expression profiling of dengue virus infection in cell lines and patients west nile virus and dengue virus capsid protein negates the antiviral activity of human sec protein through the proteasome pathway west nile virus genome amplification requires the functional activities of the proteasome appraising the roles of cbll and the ubiquitin/proteasome system for flavivirus entry and replication antisense-mediated affinity purification of dengue virus ribonucleoprotein complexes from infected cells quantitative mass spectrometry of denv- rna-interacting proteins reveals that the dead-box rna helicase ddx binds the db and db ' utr structures a)-binding protein binds to the non-polyadenylated ' untranslated region of dengue virus and modulates translation efficiency. the journal of general virology role of human heterogeneous nuclear ribonucleoprotein c /c in dengue virus replication detecting rna-protein interactions by photocross-linking using rna molecules containing uridine analogs we would like to thank those who generously supplied reagents: lncx uprt-myc, tat, and vsvg plasmids (edward mocarski), denv (robert striker), and huh . cells (charlie rice). we would like to thank quynh-mai trinh for technical assistance. conceptualization: ovv tmg imc srt. key: cord- -oetrdm g authors: kozak, marilyn title: regulation of protein synthesis in virus-infected animal cells date: - - journal: adv virus res doi: . /s - ( ) - sha: doc_id: cord_uid: oetrdm g this chapter summarizes the structural features that govern the translation of viral mrnas: where the synthesis of a protein starts and ends, how many proteins can be produced from one mrna, and how efficiently. it focuses on the interplay between viral and cellular mrnas and the translational machinery. that interplay, together with the intrinsic structure of viral mrnas, determines the patterns of translation in infected cells. it also points out some possibilities for translational regulation that can only be glimpsed at present, but are likely to come into focus in the future. the mechanism of selecting the initiation site for protein synthesis appears to follow a single formula. the translational machinery displays a certain flexibility that is exploited more frequently by viral than by cellular mrnas. although some of the parameters that determine efficiency have been identified, how efficiently a given mrna will be translated cannot be predicted by summing the known parameters. the translation of viral mrnas: where the synthesis of a protein starts and ends, how many proteins can be produced from one mrna, and how efficiently. the next section focuses on the interplay between viral and cellular mrnas and the translational machinery. that interplay, together with the intrinsic structure of viral mrnas, determines the patterns of translation in infected cells. the final section points out some possibilities for translational regulation that can only be glimpsed at present, but are likely to come into focus in the future. to keep the project manageable, i have concentrated on animal viruses. plant viruses are mentioned, however, when they provide the best (or sometimes the unique) example of a given mechanism. the structural requirements for mrna function have been determined by inspection'of natural eukaryotic mrnas, followed by manipulation of features that looked suspicious. the general structural characteristics of eukaryotic mrnas have been reviewed previously (kozak, a) and will not be elaborated here. the discovery of the m g cap on a wide variety of viral and cellular mrnas (shatkin, ) was a provocative clue that the mechanism of initiation in eukaryotes differs from prokaryotes. although the list of plant virus mrnas that are translated without a cap has grown in recent years, picornaviruses and caliciviruses are still the only animal viruses known to be translated without a cap (nomoto et al., ; ehresmann and schaffer, ) . indeed, the near-indispensibility of the m g cap may be inferred from the fact that animal viruses that replicate in the cytoplasm routinely encode their own capping and methylating enzymes. this is true not only for poxviruses (moss et d., ) , where the vast coding capacity of the genome allows room for frills, but also for reovirus (furuichi et al., , vesicular stomatitis virus (vsv) (abraham et al., , and alphaviruses (cross, ) ) in which the small size of the genome limits the encoded proteins to the barest essentials. the m g cap enhances both the stability and translatability of mrnas. transcripts that are capped but not methylated are stable, but nonetheless untranslatable (furuichi et al., ; horikami et al., ) . much of the discussion that follows assumes that a scanning mechanism underlies the initiation process. the scanning model postulates that a s ribosomal subunit binds initially at the ' end of the mrna viral translation and migrates until it reaches the first aug triplet. if the first aug codon occurs in the optimal context (accaugg-see kozak, a kozak, , a kozak, , a all s subunits stop there, and that aug serves as the unique site of initiation. if the first aug triplet occurs in a suboptimal context, only some s subunits will initiate there; some will migrate beyond that site and initiate at an aug codon farther downstream. the scanning hypothesis is not universally accepted, but it is supported by extensive evidence from many laboratories (reviewed by kozak, kozak, , b kozak, , a . two alternative models have been suggested from time to time. one is that ribosomes bind directly to the sequence around the aug codon, but experiments designed to distinguish between scanning and direct binding do not support the latter (kozak, a (kozak, , b . a hybrid mechanism in which % of the ribosomes scan from the ' end, while % of the binding occurs directly at the aug start site, is difficult to rule out, however. another suggestion is that secondary structure might guide the choice of aug codons (this idea is evaluated a few paragraphs hence). one consequence of the scanning mechanism is that deleting the "ribosome binding site" (i.e., the normal initiator codon and flanking sequences) will not abolish translation; ribosomes will simply use the next aug codon downstream, which, in some cases, has been shown to direct the synthesis of a biologically active, truncated protein (downey et al., ; halpern and smiley, ; katinka and yaniv, ) . conversely, introducing spurious upstream aug codons will reduce initiation from the authentic start site-a prediction that has been verified many times with laboratory constructs (bandyopadhyay and temin, ; lomedico and mcandrew, ; smith et al., ; zitomer et al., ) as well as with naturally occurring variant forms of mrna from the early and late regions of simian virus (sv ) (barkan and mertz, ) . when the context around an upstream aug codon conforms closely to the accaugg consensus sequence, initiation from the downstream site is suppressed almost completely (kozak, , b; liu et al., ; m. scott and h. varmus, personal communication) . when the context around the upstream aug codon is less ideal, initiation from the downstream site is reduced but not abolished (kozak, a) . stated in a more positive way, when the '-proximal aug codon occurs in a suboptimal context, ribosomes are able to initiate at the first and the second aug codons. this "leaky" scanning process is further explained and documented in section i ,c. the scanning mechanism predicts that translation should be downregulated by any ploy that interferes with the linear movement of s ribosomal subunits from the cap to the aug codon: binding of a protein to the '-noncoding sequence; introducing spurious out-of-frame aug codons, as mentioned above; annealing cdna fragments that are complementary to the '-untranslated sequence (haarr et al., ; perdue et al., ; privalsky and bishop, ; willis et al., ) ; or creating a stable hairpin anywhere upstream from the aug codon, as described in the next section. on the other hand, the simplicity of the scanning mechanism suggests few possibilities for enhancing translation. although we know what features should be absent from the leader for a message to be efficient, the only features known to contribute in a positive way are the m g cap and the sequence directly flanking the initiator codon. a promising place to look for other positive effectors is the tripartite leader on late adenovirus mrnas. transposition of the -nucleotide tripartite leader sequence to heterologous mrnas stimulates their translation -fold (berkner and sharp, ; logan and shenk, ) , but the feature responsible for the stimulation has not been pinpointed, and could turn out disappointingly to be a long sequence that simply lacks all of the negative effectors cited above. the impression that the leader sequences on most viral mrnas do not contain unidentified translational "enhancers" is reinforced by the ease with which '-noncoding sequences can be deleted without deleterious effects (bendig et al., ; spindler and berk, a; villarreal et al., ) . if our intuition is correct that "extra" 'aoncoding sequences are more likely to inhibit than to help, the trend toward short '-noncoding sequences on many viral mrnas becomes significant (reviewed by kozak, b ; see also rose and moss, ) . indeed, the -nucleotide leader sequence on the mrna that encodes adenovirus polypeptide ix seems to mediate translation more efficiently than the long tripartite leader that has received so much attention (lawrence and jackson, ) . the synthesis of polyoma virus t antigen was significantly reduced in only one of the mutants studied by bendig et al. ( ) -a mutant in which the deletion extended to within two nucleotides of the aug codon. this fits with evidence from other sources that (only) the nucleotides immediately preceding the aug codon are part of the ribosome recognition sequence. secondary structure in viral mrnas might have various effects on translation. . one might expect secondary structure to inhibit more when it occurs near the cap, which is the presumptive entry site for ribosomes, than when a hairpin occurs farther downstream, because s ribosomal subunits once bound must be able to melt secondary structure to some extent. (one knows for sure that s ribosomes melt secondary structure during the elongation phase of protein synthesis; the triplet code could not be read linearly otherwise.) the prediction that s ribosomal subunits can melt their way through secondary structure within the interior of the leader sequence has been verified: introducing a -base-pair hairpin (ag - kcal/mol) nucleotides downstream from the cap did not impair the translation of preproinsulin mrna in uzuo (kozak, b) . the effects of secondary structure close to the cap have not yet been tested systematically, but it has been noted that the ' end of alfalfa mosaic virus rna- is unfolded (gehrke et al., ) and rna- is a notoriously efficient message. godefroy-colburn et al., ( b) claim more generally that the degree of cap accessibility of the four alfalfa mosaic virus mrnas correlates with their translational efficiency, but the correlation appears weak. the cap was indeed least accessible on rna- , which ranks lowest in translational efficiency, but the cap was equally accessible on rnas , , and , which differ fold in competitive efficiency (godefroy-colburn, a) . . although we expect ribosomes to melt secondary structure to some extent, there must be a limit to that ability. whereas a hairpin of - kcalimol at the midpoint of the leader sequence (involving neither the cap nor the aug codon) did not reduce the synthesis of preproinsulin under normal culture conditions, a hairpin of - kcal/mol nearly abolished translation (kozak, ) . because the hairpin did not encroach on the aug codon, the observed inhibition seems incompatible with the direct-binding hypothesis, but is consistent with the scanning hypothesis. pelletier and sonenberg ( ) have also shown that translational efficiency decreases as secondary structure in the 'noncoding region increases. . there is no experimental support for the idea that secondary structure orients the cap and the aug codon, thus determining which aug will initiate translation. were that true, denaturation should impair translation; in fact, denaturation often enhances (payvar and schimke, ) . nor is there support for the idea that downstream cistrons are silent due to conformational constraints: attempts to activate internal initiation sites by denaturing viral mrnas invariably fail (collins et al., ; monckton and westaway, ) . a popular idea is that when secondary structure sequesters the '-proximal aug triplet, it might be skipped by ribosomes in favor of the next exposed aug codon (darlix et al., ; ghosh et al., ; hay and aloni, ; nomoto et al., ) . the results of a direct test contradict that notion, however, when the primary sequence around the '-proximal aug codon in a chimeric preproinsulin mrna was favorable for initiation, no translation from a downstream site could be detected irrespective of whether the first aug codon was single stranded or base paired (kozak, b) . thus, s ribosomal subunits appear to scan linearly, melting the secondary structure (ag - kcal/mol) t o reach each aug codon in turn. if a hairpin is too stable to be melted (ag - kcal/mol), the s subunit apparently stalls, but it does not "jump over" the barrier. . in some viral mrnas, sequences at the ' end are complementary, to a limited extent, to those at the ' end (antczak et al., ; dasgupta et az., ) . that arrangement might be expected to inhibit translation-an expectation that has been confirmed recently using mrnas with artificially constructed terminal complementary sequences (spena et al., ) . some viruses seem to take measures to preclude such inhibition. whereas the genomic rnas of influenza (robertson, ) and bunyaviruses (eshita and bishop, ) have complementary 'and ' erminal sequences, that potentially deleterious structure is not copied into mrna, inasmuch as the ' terminus of each mrna stops short of the ' end of the template strand (bouloy et al., ; eshita et al., ; hay et al., ) . arenaviruses also produce mrnas that lack the complementary sequences present at the termini of genomic rna (auperin et al., ) . . incubation in hypertonic culture medium has been used often to study protein synthesis in virus-infected cells (see yates and nuss, , and references therein) . hypertonic shock results in the rapid and reversible inhibition of protein synthesis at the level of initiation (saborio et al., ) . an intermediate concentration of salt or sucrose permits a residual low level of translation, under which circumstance viral protein synthesis nearly always predominates over cellular protein synthesis (cherney and wilhelm, ; nuss et al., ; oppermann and koch, ) . it is difficult to deduce the mechanism of this differential response from inspection of natural forms of viral and cellular mrnas. however, a cloned preproinsulin gene has been experimentally converted from hypertonic resistant to hypertonic sensitive by inserting into the '-noncoding sequence the oligonucleotide agcttgggccgtggtgg, thereby creating a base-pair hairpin around the aug initiator codon (mutant b hp in kozak, b) . a reasonable interpretation is that the hairpin structure (ag - kcal/mol), which does not inhibit translation under nor-ma culture conditions, is stabilized under hypertonic conditions to the point where it becomes inhibitory. an alternative explanation, currently under investigation, is that the primary sequence of the oligonucleotide insert underlies the enhanced sensitivity of mutant b hp to hypertonic stress. if the first explanation turns out to be correct, one might suggest by extrapolation that most viral mrnas are less structured near the ' end than are most cellular mrnas, and for that reason viral mrnas are more resistant to hypertonic stress. herpes simplex virus mrnas are a notable exception: they are unusually sensitive to hypertonic inhibition (stevely and mcgrath, , perhaps because their high g + c content generates extensive secondary structure. . the mechanism of action of interferon is too complex to discuss here, except to mention that double-stranded regions of rna, either free or incorporated into the mrna structure (debenedetti and baglioni, ; knight et al., , are critical in activating and targeting the interferon-induced enzymes. the deleterious effects of interferon on the stability and translation of viral mrnas have been reviewed by lengyel ( ) . the monocistronic rule means more than simply producing one protein from one mrna. a number of viral mrnas encode two or more proteins in nonoverlapping reading frames; with few exceptions, however, (see section ii,c), it is exclusively the '-proximal cistron that gets translated (shih and kaesberg, ; reviewed by kozak, ; . to cope with the usual inability of eukaryotic ribosomes to initiate at internal sites in mrna, the genomes of animal viruses are punctuated at one of four levels, as described below. the structures of plant virus rna genomes and their patterns of expression have been reviewed by davies and hull ( , and they are not exceptional. the mode of expression of cauliflower mosaic virus, which has a circular dna genome, is exceptional indeed, and is discussed in section i ,c. the following descriptions are generalized; additional details and references have been published elsewhere (kozak, b) . each virus is classified according to its major mode of punctuation, which is often not the exclusive mode. . the genome itself is segmented. each segment typically consists of one gene, which is transcribed end to end, or nearly so. there is usually a simple correspondence between the size of the mrna and the size of the mature protein derived therefrom. reoviruses, influenza viruses, and bunyaviruses fit this description. arenaviruses and nodaviruses (e.g., black beetle virus) have segmented rna genomes but rely also on other mechanisms. . the viral genes are linked, but internal start and stop sites for transcription generate a separate mrna for each protein. punctuation is accomplished for the most part at the level of transcription rather than by posttranscriptional processing. again, the size of the mrna usually corresponds to the size of the mature p r~t e i n .~ this group includes poxviruses, herpesviruses, rhabdoviruses (vsv), and paramyxoviruses. here the genome lacks internal transcriptional and translational stoplstart sites. the genome-sized mrna is translated end to end to produce a "polyprotein," more than amino acids in length, which is cleaved to generate the mature viral proteins. the extreme situation in which all viral proteins are derived from a single precursor is characteristic of picornaviruses and flaviviruses (castle et al., ; c . m. . [rice et al. ( ) present a lucid explanation of some older data that had suggested a different translational strategy for flaviviruses.] posttranslational cleavage supplements other modes of punctuation in many animal virus systems, and is especially important in the maturation of retrovirus and alphavirus proteins. . the fourth, rather heterogeneous group of viruses characteristically produce big transcripts that cannot be translated completely: ribosomes bind at the ' end and translate only up to the first stop codon, and the downstream cistrons in these polycistronic mrnas are usually silent. the downstream cistrons become translatable when they are moved closer to the ' end, which is accomplished by producing truncated or subgenomic mrnas. various mechanisms generate these shortened transcripts. conventional splicing of nuclear transcripts is used by retroviruses, papovaviruses, and parvoviruses. adenoviruses also use splicing, on a rather grand scale (nevins, ; ziff, ) . coronaviruses use a novel cytoplasmic fusion mechanism to transfer a common leader sequence to each of six, progressively shorter, subgenomic mrnas (budzilowicz et al., ; lai et al., ; spaan et al., ) . in the case of alphaviruses and parvoviruses, initiation at a n whereas the molecular weight correlation between mrnas and proteins holds for most early vaccinia virus genes (cooper and moss, ; hruby and ball, ) , late vaccinia mrnas are notoriously heterogeneous in size, apparently because transcription does not terminate discretely (mahr and roberts, ; rose and moss, ) . the '-proximal portions of such transcripts are assumed to be translationally silent. in the case of herpes simplex virus, the size of many mrnas corresponds simply to the size of the encoded protein, but more complex mrnas also exist (wagner, ) ; the functional significance of the latter is not yet clear. internal transcriptional promoter produces the subgenomic mrnas that encode the major capsid proteins (brzeski and kennedy, ; janik et al., ) . hepadnaviruses (hepatitis b and others) cannot yet be classified, since mrnas have been identified for some but not all of the viral proteins (tiollais et al., ) . the major subgenomic mrna is initiated at an internal promoter, and there is no evidence for splicing. the heterogeneous initiation sites for transcription in hepatitis viruses might be a means to regulate translation, as suggested by laub et al. ( ) and enders et al. ( ) . arenaviruses are a special case. the genomic s-rna segment codes for two structural proteins, n and gpc, but only gpc can be translated conceptually directly from the ' half of virion rna; the ' half of the sequence is an antisense version of the n gene (auperin et al., ) . thus, a subgenomic complementary mrna is produced to translate the n protein. although gpc could in theory be translated from the full-length viral s-rna, a subgenomic rna corresponding to the ' portion of s-rna is also present in infected cells. this might be necessary to avoid "hybrid arrest" which could occur if translation were attempted with full-length viral and antiviral transcripts. whereas most eukaryotic mrnas are functionally monocistronic, certain viral mrnas have been shown to synthesize two separately initiated polypeptides. with few exceptions we can rationalize the the mechanisms outlined herein cannot explain the (inefficient) internal initiation that occurs in a mutant form of rous sarcoma virus src mrna (mardon and varmus, ) . poliovirus mrna also initiates translation at more than one site, at least in uitro (celma and ehrenfeld, ) , but one cannot attempt an explanation until the sites have been identified. [dorner et al. ( ) claim to have localized an internal initiation site, but they did not prove that the template rna was intact. the fact that they could demonstrate "internal initiation" in extracts from reticulocytes but not from poliovirusinfected cells hints of an artifact.] because the poliovirus '-noncoding sequence has eight aug triplets upstream from the major translational start site (kitamura et al., ; racaniello and baltimore, ) , spurious initiation events are expected in that region. on the other hand, the upstream aug triplets would not preclude initiation of the polyprotein from the ninth aug codon, because seven of the upstream aug triplets lie in a weak context; the only one that lies in a favorable context is followed by an inframe terminator codon, which would allow reinitiation. the same explanations are compatible with the genomic sequences of many other picornaviruses callahan et al., ; forss et al., ; linemeyer et al., ) . the two structural peculiarities of picornavirus mrnas-presence of upstream aug codons and absence of production of two proteins from a single mrna by invoking one of the following mechanisms, each of which is experimentally supported. these mechanisms (with the exception of reinitiation) might be considered errors, i.e., the results of imprecise execution of some step in translation. a system that functions with less-than-perfect fidelity apparently gains the advantage of versatility. the scanning model postulates that, when the '-proximal aug codon occurs in a suboptimal context, ribosomes will initiate at that site as well as at another aug codon farther downstream. several nucleotides near the aug codon are known to affect the efficiency of initiation, but the most important determinants are a purine (preferably a) in position - , and g in position + ; we can predict the occurrence of leaky scanning by focusing on those two positions. in each of the bifunctional viral mrnas listed in fig. , the more '-proximal initiation site lies in a suboptimal context, thus rationalizing the ability of some ribosomes to reach the start of the second cistron. (in sv s mrna, influenza b, and adenovirus- , which are bracketed in the center of the figure, the sequence flanking the first aug codon is not really weak, but it is not perfect; thus, some - % of the s subunits are expected to bypass the first aug codon and reach the second. that may be adequate to produce the second protein in the case of adenovirus and influenza virus, but it does not seem adequate to explain the synthesis of sv vp , which is an abundant protein. in sv s mrna, however, ribosomes can reinitiate at the vp start site, as explained below.) the scanning model does not necessitate that the second aug codon lie in a stronger context than the first, although that usually is the case; it is necessary only that the first aug codon lie in a context that is less than optimal. each mrna listed in the upper part of fig. produces two unrelated proteins, translated from two different reading frames. the mrnas in the lower part of the figure initiate at two aug codons in the same reading frame, thereby producing long and short versions of the same protein. whereas the relaxed scanning mechanism accounts qualitatively for the dual function of the mrnas listed in fig. , the model is not very a cap-might be related it is possible that, when cap binding proteink) are not part of the s initiation complex, aug codons in suboptimal contexts are recognized even less efficiently than usual, and the barrier effect of the upstream aug codons in poliovirus mrna would thus be minimized. perhaps p is cleaved (see section iii,d) to directly facilitate viral translation, rather than to inhibit host translation. good a t predicting the frequency with which ribosomes initiate a t each site. one problem is that the ratio of initiation a t sites and in vivo is often different from that i n vitro (bos et al., ; clarke et al., ; dethlefsen and kolakofsky, ; , and the ratio changes when salt or other reaction conditions are varied. that is hardly surprising because the fidelity of initiation in vitro is sensitive to reaction conditions (jense et al., ; kozak, b; petersen and hackett, ) . on the other hand, the in vivo ratio might be skewed if one protein is less stable or less efficiently extracted than the other. in addition to the obvious economy of using one mrna to make two proteins, in a fixed ratio, their simultaneous production might allow the polypeptides to interact as the nascent chains grow. it would be amusing to determine whether complementation is less efficient when two proteins that are normally translated from one mrna are instead synthesized from separate templates. the hundreds of eukaryotic cellular genes that have been sequenced to date invariably initiate translation a t aug. when alternate initiator codons were tested experimentally, however, they were not inert. eukaryotic ribosomes can initiate a t gug (kozak, unpublished data) and uug (zitomer et al., ) , but the efficiency is a t least -fold lower than at an aug codon in the same context; initiation at gug, uug, or other nonstandard codons is (barely) detectable only when the codon is preceded by the optimal a in position - (m. k., unpublished data). there is credible, albeit not definitive, evidence that alternate initiator codons are used in two virus systems to produce minor virion components. one is adeno-associated virus capsid protein b, which probably initiates a t an acg codon that lies upstream from the major aug start site (becerra et al., ) . [an acg codon in coliphage t mrna is also recognized as an initiator codon by wheat germ ribosomes i n vitro (anderson and buzash-pollert, ) . although the template is unnatural in that case, the evidence for initiation a t acg is irrefutable.] the second natural example is gpr gag, a nonessential but nonetheless conserved form of gag produced by moloney murine leukemia virus (edwards and fan, ; the nucleotide sequence of the region is given by shinnick et al., ) . gpr gag is analogous to the elongated form of gag produced by feline leukemia virus, except that the latter is presumably initiated at an upstream aug codon in a weak context (fig. l ) , whereas in murine leukemia virus the most likely initiation site(s) are upstream gug and/or cug codons that lie in a favorable context. charles van beveren has shown that gpr gag is produced not only by the left-most column shows that in most cases the sequence around the first functional initiator codon is suboptimal with respect to the nucleotides in positions - and + , thus explaining how some s ribosomal subunits can reach the second initiation site. in the case of the coronaviruses and black beetle virus, the indicated proteins are predicted but have not yet been demonstrated. although the . -kda protein predicted from adenovirus region e has not been seen, its ribosome binding site has been proven functional by demonstrating the synthesis of a fusion protein from an appropriately engineered mutant virus (wold et al., ) . all of the other proteins listed here have been detected in infected cells, and most have also been synthesized in cell-free translation systems. notes: %since the ' ends of hepatitis virus and some bunyavirus mrnas are heterogeneous (laub et al., ; patterson et al., , the second protein could be translated, without invoking leaky scanning, from the portion of the mrna population that lacks upstream aug codons. binfluenza virus rna- is unusual in that the first and second aug codons are separated by only four nucleotides (shaw et al., ) , but that probably does not explain the ability of ribosomes to initiate at both sites. in a version of preproinsulin mrna in which the first and second aug codons (both in the perfect context for initiation) were moloney virus, but also by two other murine leukemia viruses that have no aug codons upstream from the major (pr gag) start site (personal communication). thus, there is no alternative to believing that nonstandard codon(s) are used to initiate the elongated form ofgag. experiments to pinpoint the start sites are in progress in van beveren's laboratory. although reinitiation was documented years ago in prokaryotes, there was no reason to suspect a similar phenomenon in eukaryotes until laboratory manipulations with cloned genes yielded some results separated by five nucleotides, ribosomes were unable to initiate a t the second member of the pair (kozak, b) . 'because the first reading frame terminates upstream from the second in sv mrnas, ribosomes could reach the start site for vp by a combination of leaky scanning and reinitiation. dthe arrangement of aug codons near the ' end of sv late s mrna is gccaugg (out-of-frame at position - ) . . . uccaugg (start of vp ) . . . ccuaugc (out-of-frame a t position - ) . . . ggaaugg (start of vp ) . we postulate that leaky scanning allows some s ribosomal subunits to bypass the first aug triplet (position - ) in order to initiate vp . that does not contradict the fact that, in s mrna, the aug codon in position - initiates the agnogene product. by extrapolating from the systematic measurements carried out in another system (kozak, a) , we would expect - of the ribosomes to initiate a t the aug codon in position - , while - should reach the next aug; that seems sufficient to produce vp , which is a minor component of the virion. synthesis of vp might depend on leaky scanning (bypassing the first three aug codons) as well as reinitiation, inasmuch as ribosomes that initiate a t the first aug codon would terminate before reaching the vp start site. ethe nucleotide in position - varies among strains of foot-and-mouth disease virus, and the relative yields of p a and p vary accordingly (clarke et al., ) . fit is likely that two annaug sequences farther upstream are also used to produce longer forms of surface antigen (heermann et al., ) . most transcripts lack the extreme upstream aug codons, however. nthe two proteins postulated for feline leukemia virus are indeed seen in infected cells, but the mechanism of synthesis postulated here has not been proven. infrequent initiation at weak, upstream aug codons is also suspected with mrnas from some other retroviruses (gruss et al., ; willumsen et al., ) . references: sendai virus: giorgi et al., . measles virus: bellini et al., . reovirus: cashdollar et al., ernst and shatkin, ; kozak, ; sarkar et al., . bunyaviruses: eshita and fuller et al., pasek et al., ; persing et al., . feline leukemia virus: laprevotte et al., . herpes simplex: haarr et al., wagner et al., . that are difficult to explain ~t h e r w i s e .~ the principal observation is that eukaryotic ribosomes can initiate at an internal aug codon, when another aug codon occurs upstream and in a highly favorable context (thus ruling out leaky scanning), provided that a terminator codon occurs in-frame with the first aug codon and upstream from the second (kozak, b; liu et al., ; m. scott and h. varmus, personal communication) . we envision that when a complete "minicistron," i.e., an aug triplet followed by a terminator codon, occurs upstream, it is translated; but the s ribosome does not detach at the terminator codon. rather, the s subunit dissociates while the s subunit remains bound to the message and resumes scanning. when the s subunit reaches the next aug codon, it reinitiates translation. reinitiation is more eficient when the terminator codon precedes, rather than when it overlaps, the aug codon (m. kozak, unpublished) . with respect t o natural mrnas rather than laboratory constructs, elegant genetic manipulations implicate reinitiation in the translation of rous sarcoma virus src mrna (hughes et al., ) and cauliflower mosaic virus mrna , dixon et al., . the latter is the most striking example to date of a functionally polycistronic mrna in eukaryotes. the overlapping arrangement of cistrons rules out the possibility of reinitiation with many other viral mrnas (contreras et al., ; meshi et al., ; schwartz et al., ; . however, in some instances in which adjacent cistrons do not overlap, and reinitiation is therefore expected, it has not been observed (barker et al., ; goelet et al., ; knowland, ; ou et al., ) . reinitiation, together with leaky scanning, could theoretically account for translation of the sv agnogene pro- the alternative to reinitiation is to postulate that eukaryotic ribosomes can initiate directly at an internal aug codon, and that they usually fail to do so only because the downstream site is occluded by the stream of s ribosomes advancing from upstream. occlusion indeed occurs during the translation of polycistronic prokaryotic transcripts, but the inhibitory effect of an overlapping upstream cistron is sometimes only twoor threefold (das and yanofsky, ; hoess et al., ) . berkhaut et al. ( ) claimed to see complete inhibition of translation of the ms lysis protein when the coat protein cistron overlapped, but the unknown sensitivity of their biological assay complicates the interpretation. moreover, their claim that a strong upstream initiation site (for coat protein) suppresses initiation from the much weaker site for lysis protein hardly compares with the situation in eukaryotes, where an upstream aug codon can completely suppress initiation from an equally favorable downstream site (kozak, b, ) . the essential difference between the occlusion and reinitiation mechanisms is that the former postulates direct binding of ribosomes to internal aug codons, while the latter prohibits such binding. there is experimental evidence against direct binding (kozak, a (kozak, , b and against occlusion (kozak, ) . tein and vp from the same mrna, although neither mechanism has been experimentally demonstrated with sv . (the simultaneous occurrence of two phenomena complicates the task of demonstrating either one.) reinitiation is expected within the leader region of rous sarcoma virus genomic rna, where three small open reading frames (orfs), one of them headed by an aug codon in a highly favorable context, precede the gag coding sequence (schwartz et al., ) . it has been difficult to demonstrate synthesis of the predicted leader peptides, perhaps because their small size makes them unstable. with admirable persistence, however, hackett et al. ( ) have devised a sensitive assay with which they have detected small amounts of the peptide encoded in the first minicistron of rous sarcoma virus. parenthetically, when one is designing experiments to probe the function of a particular viral or cellular product, one must remember that introducing a nonsense codon near the beginning of a gene might not abolish its function. if an in-frame aug codon occurs downstream from the nonsense codon, ribosomes will probably reinitiate and the truncated polypeptide might be functional. the mechanism by which reverse transcriptase is synthesized has long puzzled retrovirologists. the pol coding sequence is not preceded by an initiator codon; rather, reverse transcriptase is derived by cleavage from a joint gag-pol precursor (murphy et al., ; oppermann et al., ) . the problem is that the genomic arrangement of gag and pol sequences would seem to preclude their joint translation. in avian retroviruses, gag and pol are in different, partially overlapping, reading frames (schwartz et al., ) ; in murine retroviruses, gag and pol are in the same frame but are separated by a terminator codon (shinnick et al., ) . in both cases, the solution involves a translational "error." with avian retroviruses, about % of the ribosomes shift reading frames somewhere near the end of the gag sequence, thereby producing from one message both gag and a small amount of the gag-pol fusion protein. jacks and varmus ( ) have shown beyond reasonable doubt that frameshifting occurs near the gag-pol junction in a cell-free translation system from reticulocytes. by using mrna that was transcribed in uitro from cloned rous sarcoma virus dna, they excluded the possibility that a low-abundance, spliced transcript served as the template for the fusion protein. inspection of the gag-pol junction sequences in several other retroviruses leads one to expect that frameshifting is not limited to the avian system. neither is it limited to eukaryotes, of course. frameshifting occurs under intrigu-ing circumstances in a few bacterial and phage genes (craigen et al., ; dunn and studier, ; kastelein et al., ) . the excitement that accompanied the old discovery of a "readthrough" version of coliphage qp coat protein (weiner and weber, ) has been rekindled recently by finding a similar phenomenon in eukaryotic systems. in murine retroviruses, for example, the gag and pol sequences are separated by a single uag terminator codon, the occasional suppression of which generates a gag-pol fusion protein. the first hint of this came from supplementing a cell-free translation system with yeast suppressor trna, which indeed enhanced the synthesis of the gag-pol precursor (philipson et al., ) . the notion was confirmed for both murine and feline leukemia viruses when yoshinaka et al. ( a,b) directly determined the amino acid sequence of the protease that constitutes the nh,-terminal portion of the pol gene product. suppression of a terminator codon is not peculiar to retroviruses, for it occurs also with alphaviruses (lopez et al., ; strauss et al., ) , tobacco mosaic virus (pelham, ) , and probably carnation mottle virus (guilley et al., ) . suppression of the uag codon in tobacco mosaic virus rna has been traced to the major tyrosine-specific trnas which, in tobacco cells, have the anticodon sequence gjia (beier et al., a,b) . the most abundant trnaer from wheat germ has the highly modified queuine base (q) in place of g in the wobble position of the anticodon, and it is not able to suppress. thus, minor differences in trna structure can be an important determinant of host range for some viruses. in one sense, suppression solves the problem of how to produce a full-length protein from an interrupted coding sequence. but that probably misplaces the emphasis. the real problem might be how to produce only a small amount of an essential protein that might be toxic if overproduced. an inefficient mechanism, such as suppression or frameshifting, is an ideal solution. whereas the features described in the preceding section are intrinsic to viral mrnas, and can be demonstrated readily in a "universal" reticulocyte lysate, the translation of viral mrnas in uiuo is influenced by specific conditions that prevail in the cytoplasm of infected cells. the way in which the translational machinery is partitioned viral translation between viral and host mrnas is one important consideration. because the literature concerning inhibition of host protein synthesis by animal viruses has already been reviewed at length (fraenkel-conrat and wagner, ; kaariainen and ranki, ; shatkin, ) , i shall be selective in my coverage. an overview of the phenomenology is presented in table i . the general mechanisms of host shutoff defined by these phenomena are described briefly in sections b and c, which are followed by a detailed discussion of two viruses-poliovirus and adenovirus-that seem to merit more attention. the phenomenon of host shutoff is not as widespread as might appear from table i . retroviruses, paramyxoviruses, parvoviruses, and flaviviruses do not suppress host translation, and papovaviruses actually stimulate host protein synthesis. because host shutoff is interesting, and because it is easier to detect viral protein synthesis against a clean background, virologists understandably have focused on systems that demonstrate the phenomenon. the inhibition of host protein synthesis may be of more interest to virologists than to viruses, however. in many cases, the yield of infectious progeny from a virus that fails to shut off host protein synthesis is the same as from another virus strain (or the same virus in a different cell line) in which host protein synthesis is obliterated (detjen et al., ; gillies and stollar, ; jen and thach, ; lodish and porter, ; minor et at., ; munemitsu and samuel, ; read and frenkel, ; sharpe and fields, ) . a virus strain that suppresses host macromolecular synthesis sometimes replicates faster in culture than one that does not, however. whether the inhibition of host protein synthesis is beneficial or harmful or irrelevant to the virus during the course of natural infections is not known. in short, with a few viruses inhibition of host protein synthesis might be a strategic move, necessary for efficient expression of viral genes, but no unequivocal example can be cited. in most instances, host shutoff is likely to be an unintentional side effect of viral gene expression-an effect of no real value, and possibly even harmful, to the virus. it is interesting that poliovirus replicates better during coinfection with cytomegalovirus than during single infection; cytomegalovirus stimulates the cell functions that are turned off by poliovirus (furukawa et al., ) ! there are examples of nonpermissive virus-cell systems in which macromolecular synthesis is inhibited so effectively that neither host nor viral proteins can be made (brown and moyer, ; drillien et al., ; jones et al., ) . in such cases the wild-type virus must have a way to throttle the shutoff mechanism. that notion will be pursued in the section on adenoviruses. throughout this section i have tried to point out wrinkles in the data, uncertainties in some popular interpretations, and alternative mechanisms. this critical slant is intended not to minimize the value of the work that has been done, but to stimulate reconsideration of some paradigms that may have been accepted or rejected too quickly. experiments probing the mechanism of host shutoff are difficult. some of the pitfalls and caveats might be stated at the outset. certain techniques that are used to block virus infection at a particular step, in order to define the extent of viral expression that is needed to effect host shutoff, might inadvertently create a new inhibitory mechanism. in the resulting confusion one learns little about the physiological mechanism of inhibition. for example, treatment of poliovirus-infected cells with guanidine not only blocks the synthesis of progeny rna (which is the intended purpose), but also causes double-stranded rna to accumulate to higher-than-normal levels (baltimore, ); and double-stranded rna is a potent inhibitor of translation. experiments showing that a temperature-sensitive mutant virus which makes no progeny rna nevertheless shuts off host protein synthesis as effectively as wild-type poliovirus suffer the same defect. the mutant-infected cells accumulate massive amounts of partially double-stranded "replicative intermediates" which are likely to inhibit translation, irrespective of the normal shutoff mechanism (hewlett et al., ) . in short, the problem with many experiments is that translation can be inhibited in a variety of ways, and in the process of blocking one pathway, another can be activated. for the same reason, the assumption that the mechanism of host shutoff is the same at high multiplicities of infection as at low multiplicities is untenable. in the case of encephalomyocarditis (emu virus, the effect on host protein synthesis has been shown to differ qualitatively as a function of multiplicity (alonso and carrasco, ) . with poliovirus, the familiar statement that guanidine does not prevent host shutoff is true only when the cells are infected at a high multiplicity (helentjaris and ehrenfeld, ) . at a normal multiplicity of infection, guanidine does block host shutoff, and therefore it is not clear that viral rna synthesis (which is the guanidine-sensitive step) is uninvolved in the normal mechanism of host shutoff by poliovirus. the specific deficiency or alteration in the translational machinery can sometimes be pinpointed by studying protein synthesis in extracts prepared from virus-infected cells, provided that one appreciates the limitations of that approach. the notion that one can study the mechanism of host shutoff by one virus by using a second virus as a stand-in for host mrna is questionable, because proteins encoded by two differ- helentjaris and ehrenfeld ( ) ; nuss et al. ( ) . ( ) bienz et al. ( ) . ( ) bossart and bienz ( ) ; femandez-munoz and darnell ( ) . ( ) celma and ehrenfeld ( ) . ( ) h e a l and . ( ) etchison etal. ( ) ; a. dasgupta, personal communication. ( ) helentjaris and ehrenfeld ( ) . emc uirus in hela cells: ( ) jen e t d . ( ) . ( ) carrasco and lacal ( ). ( ) alonso and carrasco ( ) . ( ) jen et al. ( ) . ( ) alonso and carrasco f b); lacal and c a r r a m ( ). ( ) mosenkis et al. ( ) ; a. p. rice etal. ( ) . sindbis and sfv: ( ) lachmi and kaiiriiiinen ( ) ; wengler and wengler ( ) . ( ) and ( ) simizu ( ) . ( ) van steeg et al. ( ) ; wengler and wengler ( ) . ( ) carrasco and lacal( ) ; gamy et d. ( ) . ( ) van steeg et al. ( ) . ( ) simizu ( ) . vsv: ( ) lodish and porter ( ) ; mcallister and wagner ( ) . ( ) grinnell and wagner ( ) . ( ) jaye et al. ( ) ; lodish and porter ( ) ; nisbioka and silverstein ( a) . ( ) lodish and porter ( ) ; otto and lucas-lenard ( ). ( ) francoeur and stanners ( ) ; . ( ) centrella and lucas-lenard ( ) ; dratewka-kos etal. ( ) . reouirus in l cells: ( ) zweerink and joklik ( ) . ( ) sharpe and fields ( ) . ( ) ( ) . skup and millward ( ) . ( ) sharpe and fields ( ) . influenza virus: ( - ) inglis ( ) ; . ( ) lazarowitz etal. ( ) . ( ) carrasco and lacal ( ) . ( ) katze et al. ( , . adenouirus: ( ) castiglia and flint ( ) . ( ) babich et al. ( ) ; beltz and flint ( ) . ( ) babich et al. ( ) . ( ) castiglia and flint ( ) . ( ) see text. ( ) babiss and ginsberg ( ) . vaccinia uirus: ( ) hruby and ball ( ) ; oppermann and koch ( ) . ( ) salzman etal. ( ) . ( ) cooper and moss ( ) ; rice and roberts ( ) . ( ) oppermann and koch ( ) ; rice and roberts ( ) . ( ) norrie etal. ( ) . ( ) bablanian etal. ( ) . herpes simpler: ( ) pereira et al. ( ) . ( ) fenwick and walker ( ) ; stenberg and pizer ( ) . ( ) stage -see text; stage - inglis ( ) ; nishioka and silverstein ( b) . ( ) silverstein and engelhardt ( ) . ( ) fenwick and walker ( ) ; hackstadt and mallavia ( ) . ( ) fenwick and walker ( ) ; nishioka and silverstein ( b) ; read and frenkel ( ) . frog virus : ( ) cited in mosenkis et al. ( ) . all other entries are from willis et al. ( ) . bthe timing of host shutoff relative to the onset of viral translation is indicated. a capitalized entry in this or any other column identifies the probable major mechanism of host shutoff. "coincident" in capitals means that competition probably underlies host shutoff. cfunctional stability is usually evaluated by the ability of host mrnas, extracted from infected cells, to be translated in a cell-free reticulocyte lysate. dthis column indicates the presence or absence of a temporal correlation between the inhibition of host protein synthesis and the influx of sodium ions that often accompanies virus infection (carrasco and lacal, ) . ea change in cap binding protein was postulated because extracts from sfv-infected cells were unable to translate most capped mrnas, with the exception of emc and sfv late s mrnas (van steeg et al., . although it is true that efficient mrnas like emc and sfv s can be translated without benefit of the m c cap, it does not follow that cap binding protein(s are deficient in every instance where translation of those mrnas persists in the face of an overall decline. efficient mrnas will be selectively translated when any component of the translational machinery is made limiting. the best evidence for this is the ability of both emc and sfv s mrna to be translated in emc virus-infected cells, in which host translation is drastically inhibited by a mechanism that has not been difined, but that clearly does not involve cap binding protein (mosenkis et al., ) . wan steeg et al. ( ) have postulated that capsid protein is responsible for host shutoff by sfv, but the evidence is not compelling: the binding of host mrna to ribosomes was only slightly inhibited in fig. of their paper, and the inhibition was a t the level of s rather than s ribosomes. the fact that translation of late viral s mrna was unaffected is not adequate evidence of specificity, since s mrna-by virtue of its high efficiency-would be relatively resistant to any inhibitor, physiological or otherwise. gwith type- reovirus in l cells, infection at a multiplicity of infection (moi) of caused no significant decrease in translation; a t mol of , translation gradually declined by - % (sharpe and fields, ) . with type- reovirus (moi of zo), overall protein synthesis was initially stimulated in both hela and l cells; translation declined later only in l cells (munoz et al., a) . hrecent data do not corroborate an earlier hypothesis concerning inzctivation of a cap-specific translation factor (skup and millward, ) . although extracts from reovirus-infected cells translate capped reovirus mrnas poorly, other cap-dependent mrnas, such as globin and tobacco mosaic virus, are translated efficiently in such extracts (lemieux et al., ) ; and capped sv mrnas are translated in cells coinfected with reovirus (daher and samuel, ) . perhaps translation of capped reovirus mrnas is inhibited (artificially) in extracts from infected cells because viral structural proteins, which must be abundant in those extracts, adsorb to the homologous mrnas and sequester them from ribosomes. 'in contrast with most other host mrnas, the synthesis of histone mrnas is inhibited in adenovirus-infected cells (flint et al., ) . ]the -kda e b protein probably functions only indirectly to shut off host translation. the protein is required for efficient cytoplasmic accumulation of late viral -as, which might in turn shut off host protein synthesis by competition (see text). proteins from regions e b and e may function as a complex. khost transcripts were stable by hybridization when hela cells were infected in the presence of actinomycin d (rosemond-hornbeak and moss, ) but were degraded during productive infection of l cells by vaccinia virus (rice and roberts, ) . the second observation seems more pertinent. 'ben-hamida et al. ( ) have purified a component from vaccinia virions that blocks the binding of met-trna to s ribosomes in uitro, but the physiological (in uiuo) mechanism of host shutoff seems to require the expression of viral genes. there is no evidence that eif- function is impaired in infected cells. it is possible, however, that some component in the eif- cycle is altered in a positive way, i.e., a way that prevents inactivation by eif- kinase (whitaker- dowling and youngner. ) . "it is clear that host translation can be inhibited rapidly in the presence of drugs that preclude the synthesis of viral mrna (moss, ) . but it is not clear that the normal shutoff mechanism is at work in such cases (see text). "the virion-mediated rapid shutoff of host translation is not usually seen with herpes simplex type . except in vero cells; type virus displays the early shutoff function in all cell types. an important, albeit undeciphered, clue is that type virus interferes with the early shutoff by type virions in doubly infected friend erythroleukemia cells (hill et al., . ent viruses can often be produced simultaneously in cells in which host protein synthesis is suppressed (alonso and carrasco, a,c; otto and lucas-lenard, ) . some features of the intracellular environment, such as ionic changes that favor the translation of viral over host mrnas, are inevitably lost when the cells are lysed, and other features are not easily preserved. for example, phosphorylated initiation factors have sometimes been inadvertently restored to normal during their purification (centrella and lucas-lenard, ; wong et al., ) . phosphorylation of eif- has also been missed on occasion because eif- (ap), retains the ability to function stoichiometrically, and the defect is evident only if one assays for catalytic function (safer, ) . on other occasions, phosphorylation of eif- has been missed because a high concentration of gtp in the lysate masks the functional defect (schneider et al., ) . extreme care is needed also to preclude the artifactual modification of initiation factors-by proteolysis, for example-during the preparation of cell-free extracts. the fact that one can reproduce in vitro the preferential translation of viral over host mrnas does not necessarily mean that one is studying the physiological mechanism of host shutoff. if viral mrnas are even slightly more efficient than host mrnas, as is often the case, any manipulation that establishes competition will favor the viral mrnas. one cannot define how competition is established in uiuo by showing that competition occurs in uitro. for example, the fact that translation of vaccinia mrnas is more resistant than host mrnas to inhibition by poly(a) when translation is studied in cell-free extracts from reticulocytes (bablanian and banerjee, ; coppola and bablanian, ) does not mean that vaccinia virus inhibits host translation by flooding the cytoplasm with short, polyadenylated transcripts. such transcripts are indeed produced in infected cells, but only when drugs are used to block the synthesis of normal viral mrnas (rosemond-hornbeak and moss, ) . the aforementioned problem of an experimental manipulation creating a new inhibitory mechanism, rather than exposing the normal mechanism, almost certainly applies here. the tendency to attribute functional significance to foreign agents that cosediment with polysomes should be resisted. everything cosediments with polysomes to some extent. the presence of a trace of ade- eif- , eukaryotic initiation factor , is responsible for binding initiator methionyl-trna to the s ribosomal unit, and eif- (ap) is eif- phosphorylated on its a-subunit. when eif- is phosphorylated, the reaction in which gdp is exchanged for gtp fails. that reaction is mediated by an accessory protein called gef, which becomes trapped in an inactive complex with eif- (ap). because the pool size of gef is small, phosphorylation of only % of the eif- pool can completely inhibit translation (safer, ; siekierka et al., ) . novirus va-rna in the polysome region of sucrose gradients (schneider et al., ) , for example, is almost certainly unrelated to the function of va-rna. it is common to find viral capsid proteins stuck to ribosomes, and it is wise to treat such contamination as contamination, until it is proven otherwise. the known and suspected mechanisms by which translation of viral mrnas is facilitated, usually to the disadvantage of host mrnas, fall into four categories. . competition may be suspected when the decline in host protein synthesis and the onset of viral protein synthesis coincide. on the other hand, competition is an insufficient explanation when host protein synthesis is severely inhibited before the onset of viral translation, as occurs with poliovirus, herpes simplex virus, and frog virus . often competition is exacerbated by a decline in the overall translational capacity, which may be brought about by changes in the ionic environment or in the translational machinery. when initiation is limiting, most mrnas accumulate in small polysomes, the size of which increases upon exposure to a low concentration of cycloheximide. (cycloheximide slows elongation, thus causing the number of ribosomes to increase on mrnas that were previously limited at the initiation step.) the characteristic shift in polysome size upon exposure to cycloheximide is seen, for example, in cells infected by vsv (jaye et al., ) or adenovirus (perlman et al., ) . the competition between host and viral mrnas that takes place in uiuo is sometimes not maintained when the translation of endogenous mrnas is studied in extracts from infected cells, probably because the concentrations of critical components change during the preparation of such extracts. . inactivation of a normal component of the translational machinery. the resulting deficiency enables only a subset of mrnas, mostly viral, to be translated. the hallmark of this mode of regulation is the ability to restore translation to cell-free extracts by adding back the missing factor, in practice, this is not as easy as it sounds. there is evidence for inactivation of initiation factor eif- in several virus systems, as noted in table i . alterations in the initiation factor that mediates the translation of capped mrnas have been postulated for several viruses (table i) , but the story seems credible only in the case of poliovirus, which is described below. identifying characteristic here is that extracts from infected cells cannot be reactivated by the addition of normal initiation factors, but can be reactivated by washing the ribosomes to remove the inhibitor. based on these criteria, pensiero and lucas-lenard ( ) have postulated the production of an inhibitor during mengovirus infection. because mengovirus and host mrnas are equally sensitive to the inhibitor in cell-free extracts, one must invoke competition (for the residual functional ribosomes) t o explain the selective persistence of viral translation in uiuo. this seems justified in view of the extraordinary efficiency of mengovirus mrna when translation is carried out in uitro under conditions of competition (abreu and lucas-lenard, ) . until the postulated inhibitor has been identified, however, we cannot be certain that mengovirus belongs in category . the hallmark here is that viral mrnas should be translated more efficiently in cell-free extracts from infected than from uninfected cells. frog virus meets this criterion (raghow and granoff, ) . it is possible that some other animal viruses alter the translational machinery in a "positive" way. the best evidence t o date comes from plant viruses, however. a genetic analysis of temperature-sensitive mutants of alfalfa mosaic virus strongly suggests that rnas- and - encode or induce a factor that facilitates the translation of coat protein from rna- (huisman et al., ) . extrapolating that mechanism to brome mosaic virus would explain why rna- fails to synthesize coat protein when it is injected (without rnas- , - , and - ) into barley protoplasts (kiberstis et al., ) . unfortunately, cellfree systems from infected plant cells are not available to test the hypothesis. the aforementioned hints are only hints. no virus has yet been proved to produce a new or alter an old translational factor in a way that specifically promotes its own translation. since competition is the most common mechanism of translational regulation in virus-infected cells, that topic merits more attention. there are at least three variations on the theme. although the overall ability to translate poliovirus mrna is about the same when cell-free systems are reconstituted with factors from infected or uninfected cells (brown and ehrenfeld, ) , there is a qualitative difference in the selection of initiation sites when factors from infected cells are used (brown and ehrenfeld, ) . other experiments support the idea that poliovirus (bernstein et al., ) as well as vaccinia (moss and filler, ) and human t-lymphotropic virus type i (rosen et al., ) produce something that enhances the synthesis of viral proteins. the enhancing substance could viral translation in l cells infected by reovirus type , host protein synthesis is dramatically shut off. the elegant genetic studies of sharpe and fields ( ) revealed that the s gene, which encodes the major capsid protein u , is responsible for the inhibition. the effect of u might be indirect, inasmuch as the same gene product is responsible for inhibiting host rna synthesis. although host shutoff by type reovirus in sc- cells is less dramatic than with type virus in l cells, the mechanism of type shutoff is better understood due to the careful quantitative studies of thach and colleagues (walden et al., ) . their conclusion was rather surprising: the intrinsic translational efficiency of reovirus mrnas is not higher than that of host mrnas, but rather, reovirus translation dominates because viral mrnas accumulate in massive amounts-up to % of the total mrna in the cell! the evidence that reovirus mrnas initiate translation less efficiently than most host mrnas is twofold: ( ) the size of reovirus polysomes is smaller than host polysomes that code for proteins of comparable size; and ( ) whereas a low concentration of cycloheximide reduces the synthesis of host proteins (which is the result expected for mrnas of "normal" efficiency), the translation of reovirus proteins is actually enhanced by a low concentration of cycloheximide. whereas reovirus mrnas appear to be less efficient than most host mrnas, vsv mrnas are probably translated as efficiently as host mrnas, but not more so. competition is simply proportional to the concentration of viral mrnas in the cytoplasm (lodish and porter, ) , and vsv and host mrnas that encode the same-sized proteins are on polysomes of the same size (lodish and porter, ) . in some cell lines infected by some strains of vsv, a portion of the eif- pool seems to be inactivated (centrella and lucas-lenard, ; dratewka-kos et al., ) . although that would intensify the competition, it is obvious that lowering the eif- level per se cannot explain the selective inhibition of host translation. selective shutoff requires that viral mrnas be more abundant than host mrnas, or more efficient, or be a virus-specific translation factor, or a protease inhibitor that stabilizes viral proteins, or a nuclease inhibitor, or something else. recent evidence indeed suggests that vaccinia encodes a function that protects late viral mrnas against degradation (pacha and condit, ) . both. the aforementioned experiments argue against vsv mrnas being unusually efficient, but other experiments have been taken as evidence for the contrary view. because vsv mrnas are more resistant than host mrnas to hypertonic stress, nuss et al. ( ) have suggested that the viral mrnas are intrinsically more efficient. their interpretation seems reasonable, but it must be carefully circumscribed. if high salt exacerbates some deleterious feature in the mrna (such as secondary structure) to the point where it becomes inhibitory, then the hierarchy of mrna strengths that one observes under hypertonic conditions might be irrelevant to normal growth condition^.^ the progressive inhibition of host protein synthesis during infection by vaccinia virus is probably due to competition, in proportion to the concentration of each mrna. viral mrnas are not apparently more efficient than host mrnas (cooper and moss, ; . degradation of host mrnas (table i ) and the massive synthesis of viral transcripts probably tip the balance in favor of viral protein synthesis. have suggested that differential association of mrnas with the cytoskeleton might also play a role, but that is a difficult hypothesis to test. in contrast with reovirus and vsv, the concentration of emc virus mrna in infected cells may be too low for simple competition to effect the observed switch from host to viral translation, even though emc mrna is translated more efficiently than host mrnas both in uiuo (jen et al., ) and in vitro (golini et al., ; svitkin et al., ) . in view of the overall decline in translation that begins hours postinfection, however, the idea that emc virus mrna outcompetes host mrnas for the low, residual translational capacity seems reasonable. the overall decline is most likely due to an influx of monovalent cations, since the two events are temporally correlated (lacal and carrasco, ) . host translation is restored when emc virus-infected cells are shifted to hypotonic medium carrasco, , b , and excess salt, sufficient to inhibit the translation of host mrnas in uitro, dramatically stimulates the translation of emc rna (carrasco and smith, ). recall that, although reovirus mrnas are not more efficient than host mrnas in unperturbed cells, reovirus translation, like that of vsv, dominates when cells are subjected to hypertonic stress (nuss et al., ) . in other studies, the creation of a hairpin (ag - kcalimol) within the '-noncoding region of preproinsulin mrna impaired translation only in hypertonic medium; the hairpin did not inhibit translation under normal culture conditions (kozak, b ). a similar mechanism might mediate the switch from host to viral translation during infection by alphaviruses (see sindbis and sfv, i.e., semliki forest virus, in table i) , since the influx of sodium ions exactly coincides with the overall decline in protein synthesis. the magnitude of the ion influx remains controversial (gray et al., ; munoz et al., b) . translation of sfv mrna is more resistant than host protein synthesis to hypertonic conditions (garry et al., , but the resistance is not as dramatic as with emc virus (alonso and carrasco, . the notion that alphaviruses inhibit host translation by competition seems viable even if something more than enhanced permeability to monovalent cations is needed to explain the overall decline. the fact that sfv mrna can be translated in emc-infected cells (alonso and carrasco, ~ , in which the overall translational capacity is very low, identifies sfv late s mrna as an efficient message. consistent with the competition hypothesis, the time of host shutoff coincides with the production of viral mrna (lachmi and kaariainen, ) and the severity of inhibition correlates with the yield of viral rna in mutant-infected cells (atkins, ) . polysomes containing sfv (wengler and wengler, ) or emc virus mrna do not increase in size upon exposure to cycloheximide, suggesting that those mrnas are efficient enough to be fully loaded with ribosomes even when the overall translational capacity is low. influenza virus mrnas are translated with extraordinarily high efficiency i n uitro (katze et al., ) . because the shutoff of host protein synthesis coincides with the onset of influenza virus protein synthesis and there is no overall decline in translation, simple competition would seem adequate to explain the switch from host to viral translation. in the case of adenovirus, competition is probably exacerbated by a reduction in functional eif- levels. these issues are discussed in more detail in section ii ,e. in some virus systems, competition might dictate the switch from synthesis of early to late viral proteins. picornaviruses, rhabdoviruses, and influenza virus are uninteresting in this regard, as they display little or no temporal control over protein synthesis. the existence of a temporal switch is questionable for reoviruses, but all of the other entries in table i , as well as the papovaviruses, show a striking earlyto-late transition. in every case, the switch is effected primarily a t the level of transcription: the mrnas that encode late proteins are not synthesized until late. in several cases, however, early mrnas persist in the cytoplasm at late times, and some form of translational regulation seems to limit their expression (hruby and ball, ; johnson and spear, ; lachmi and kaariainen, ; vassef et al., ) . with black beetle virus that phenomenon can be attributed to competition, because translation of the late mrna predominates over early mrna in cell-free extracts under conditions of competition (friesen and rueckert, ) . the same explanation probably holds for alphaviruses. on the other hand, late vaccinia virus mrnas do not appear to be more efficient than early viral mrnas (cooper and moss, ; oppermann and koch, ) . instead, degradation of some early vaccinia transcripts (hruby and ball, ) might be part of the switching mechanism. the translation of early mrnas could be further reduced by the accumulation of "anti-early mrna" (boone et al., , which could inhibit translation much as antisense rna does in other experimental systems (izant and weintraub, ) . lo with baculoviruses, temporal switching involves the sequential activation of upstream promoters, such that the small, early mrnas are replaced by progressively longer overlapping transcripts (friesen and miller, ) . the resulting relegation of early protein coding sequences to the ' ends of late transcripts probably prohibits their translation. promoter switching late in sv infection also generates forms of mrna from which t antigen is translated inefficient y.l thus, although transcription plays the dominant role, translational mechanisms-involving competition or other ploys-contribute to the temporal switch in expression of viral genes in some systems. the current thinking is that poliovirus selectively shuts off host protein synthesis by inactivating a -kda protein (~ ) which is a subunit of the initiation factor that mediates the translation of capped mrnasll because the ' end of poliovirus mrna is uncapped, inac- few systems other than vaccinia show much potential for regulating translation by "hybrid arrest." complementary transcripts accumulate in the nuclei of many virusinfected cells, but the complementary sequences are usually edited from cytoplasmic mrnas. in the case of adenoviruses, for example, where transcription switches periodically from one dna strand to the other, the ' ends of the juxtaposed mature mrnas rarely overlap (lemoullec et al., ) . the ' ends of papovavirus early and late mrnas do characteristically overlap, however. although the proteins that can be cross-linked to the m g cap have a disturbing tendency to change from year to year, two proteins in mammalian cells that reproducibly cross-link are p -cbp and p -cbp. p is not a "cap binding protein" inasmuch as it does not cross-link to the cap, but p does copurify with p -cbp and p -cbp. "he aggregate, called eif- f, is considered by most people to be the functional "cap binding factor." the functions of cap binding proteins have been reviewed by shatkin ( ) . tivation of the cap binding factor should not impair the translation of viral mrna. the idea is appealing because it is straightforward, but some of the supporting data are less so. the experiment that gave birth to the hypothesis was provocative. using an antiserum against initiation factor eif- , etchison et al. ( ) showed by immunoblotting that p is clipped during the first few hours after infection of hela cells by poliovirus. because affinitypurified antibodies against p recognized a protein of the same size in some preparations of cap binding factor, the working hypothesis was that cleavage of p inactivated the cap-binding initiation factor. indeed, an activity from uninfected cells that was subsequently purified, based on its ability to restore translation to poliovirus-infected cell-free extracts, contained p and the cap binding proteins. these experiments are described below in more detail. this is the most promising explanation to come forth, but there are some irregularities and many lacunae in the supporting data. . there is a discrepancy between the kinetics of degradation of p and the kinetics of host shutoff (etchison et al., ) . the same anomaly occurs during infection by rhinovirus, which degrades p in a manner similar to poliovirus (etchison and fout, ) : the rate of translation is still half-maximal a t the time when p disappears from the polyacrylamide gels. in many of the experiments carried out with cell-free extracts, the question of timing was disregarded, and cells were routinely harvested hours postinfection (lee and sonenberg, ; lee et al., a) , which is well beyond the point when host translation is precipitously shut off. . the extent of cleavage is difficult to evaluate quantitatively. it seems dangerous to accept the recommendation of bernstein et al. ( ) to focus on the accumulation of the -kda cleavage products without also monitoring the disappearance of p , because cleavage need not always be arrested at the -kda level. in some experiments the concentration of cleavage products in immunoblots from infected cells greatly exceeded the concentration of intact p in uninfected cells (bernstein et al., ) . . a p cleavage pattern qualitatively similar to that which occurs in infected cells, although not nearly as extensive, is evident in some extracts from uninfected cells (bernstein et al., ; fig. in lee et al., a) . because the link between degradation of p and virus infection is not tight, it was not surprising to find that the virus-encoded protease c is not responsible for cleaving p (lee et al., b; lloyd et al., ) . it would seem wise to include a spectrum of protease inhibitors during the preparation of extracts. phenylmethylsulfonyl fluoride is the only one routinely used, at concentrations ranging from mm (which is adequate; etchison et al., ) to mm (bernstein et al., ) or . mm (lee and sonenberg, ) . one is reminded of the old excitement concerning "processing" of sv t antigen (ahmad-zadeh et al., ) which turned out to be an artifact of extraction (smith et al., ) . . in uninfected hela cells, the concentration of p -cbp is -fold lower than p (duncan and hershey, a,b) . [the concentration of p -cbp is also low in reticulocytes (hiremath et al., ) . if p and p (together with p ) function as the cap binding complex, large changes in the p pool-although easy to detect by immunoanalysis-are unlikely to alter the rate of translation; but small changes in the pool of p , which might go undetected with immunological or biochemical probes, would significantly impair translation. . a mutant of poliovirus called hf has been described in which the synthesis of viral rna is normal in cv- cells, but viral protein synthesis is inefficient, host translation is inhibited more slowly than usual, and p is not rapidly cleaved (bernstein et al., ) . (the phenotype of hf in hela cells, which are a more natural host than cv- monkey cells, is more complex. the synthesis of viral rna is greatly reduced and all protein synthesis, host and viral, is inhibited very early, again without the concomitant cleavage of p .) the authors argue cogently that wild-type poliovirus appears to encode a function, absent from hf , that promotes (or "avoids the early inhibition of") viral translation, and they argue less cogently that hf is translated poorly as a consequence of the failure to selectively inhibit host translation. to me, the second postulate seems redundant. lack of the putative positive factor would be sufficient to account for the poor translation of viral mrna, and the failure to inhibit host translation with normal kinetics could as likely be the result of ineffkient viral translation as the cause.i if the slow inhibition of translation by although cleavage of p is not detectable at all in hela cells infected by mutant hf , cleavage products are clearly evident in cv- cells a t hours postinfection (fig. a, lane , in bernstein et al., ) . that is later than normal, and the cleavage is less extensive than normal, but some cleavage does occur. one could argue similarly that cleavage of p in wild-type infected cells is the consequence of the abundant accumulation of viral proteins, rather than a precondition. the issue might be resolved by treating wild-type infected cells with guanidine, which allows only limited synthesis of viral proteins while host translation is inhibited with the usual rapid kinetics. it would be informative to know whether p undergoes cleavage under those circumstances. hf in cv- cells is a delayed version of the normal shutoff mechanism, then cleavage of p must not be central to the normal shutoff mechanism. the authors argue, to the contrary, that shutoff by hf is mechanistically different, inasmuch as the inhibition affects both host and viral mrnas in hf -infected cells, whereas host translation is preferentially inhibited in wild-type-infected cells. however, selective stimulation of viral translation (mediated by the product that is defective in hf ) superimposed on a general inhibition of translation, would mimic selective inhibition. the authors contend that the ability of guanidine to block host shutoff by hf distinguishes it from the normal shutoff mechanism, but the experiment (in cv- cells) was done without testing wild-type virus in parallel, as could have been done by adding guanidine at the start of the infection rather than after hours. . the assumption that tobacco mosaic virus, sindbis virus, and vsv mrnas are appropriate stand-ins for host mrna in the restoring assay is questionable rose et al., ; tahara et al., ) . in cells singly infected by vsv, viral mrnas are translated in preference to host mrnas under some conditions (nuss et al., ) ; thus, vsv mrnas are not equivalent to most host mrnas. when cells are doubly infected with poliovirus and sfv (which is akin to sindbis), or with poliovirus and vsv, conditions can be found that allow the simultaneous translation of poliovirus mrna and the capped mrnas of vsv or sfv (alonso and carrasco, a) . thus, the factor that restores to poliovirus-infected extracts the ability to translate vsv or alphavirus mrnas might not be sufficient to restore the translation of most host mrnas. globin is the only cellular mrna that has been shown to work in the restoring assay (edery et al., , and its translational efficiency resembles viral mrnas more than the average cellular mrna. it would be reassuring to omit the usual micrococcal nuclease pretreatment of lysates, and show that the addition of cap binding factor to poliovirus-infected cell-free extracts restores the translation of authentic endogenous host mrnas. the association of p with p -and p -cbps does not prove that p is a necessary component of the complex. in early studies, preparations of p -cbp that lacked the p subunit did preferentially stimulate the translation of capped mrnas in hela cell-free extracts (tahara et al., ; sonenberg et al., ) . those results were considered wanting because p -cbp failed to reproducibly restore translation to extracts from poliovirus-infected cells, whereas an aggregate of p , p , and p could restore (tahara et al., ) . but there is no reason to reject the aforementioned demonstration that p -cbp by itself does stimulate in uninfected systems. using the restoring assay to define the structure of cap binding factor would be acceptable only if one knew that cap binding factor was deficient in infected-cell extracts. grifo et al. ( ) showed that translation of globin mrna was stimulated by the p o-p -p aggregate, even when the system was saturated with p -and p -cbps. those data prove only that the system which they reconstituted from partially purified subfractions of a reticulocyte lysate was deficient in p ; the data do not prove that p is an essential component of cap binding factor. (indeed, translation of the uncapped mrna from satellite tobacco necrosis virus was stimulated to the same extent as capped globin mrna. the function of p would be clearer if one could show that antibodies against p inhibit the function of cap binding factor. such experiments have not been reported. indeed, the immunological evidence is far from convincing even for the original p -cbp. a monoclonal antibody "directed against cap binding proteins" was shown to inhibit the translation of capped mrnas, but the antibody reacted with higher molecular weight proteins and not with p -cbp (sonenberg et al., ) . the claim that the higher molecular weight polypeptides were related to p -cbp no longer seems valid, because polyclonal antibodies obtained recently against p -cbp react only with that polypeptide (hiremath et al., ) . in extracts from uninfected hela cells, p copurifies to some extent with both cbps and eif- (etchison et al., ) . whereas it is known that p restores translation to poliovirus-infected extracts when it is introduced in association with cap binding proteins, it is not known whether p would also enhance were it introduced in association with eif- . in several studies, eif- failed to restore translation to extracts from poliovirus-infected cells, but it was usually tested on an equal weight basis, vis-a-vis the other initiation factors (table iv in grifo et al., ; rose et al., ) . because eif- is so massive, it must be tested on an equal molar basis. an experiment which intended to show the eif- from poliovirusinfected cells is fully functional failed to prove the point, because the assay for eif- was carried out in the presence of cap binding factor from uninfected cells (etchison et al., ) . the exogenous cap binding factor may have contributed a component (such as p ) which was necessary for, and absent from, infected-cell eif- . the assay would have been more meaningful had an uncapped mrna been used, thus allowing the function of eif- to be evaluated without the necessity of adding cap binding factor. whether p should be considered a component of eif- or of the cap recognition factor involves more than semantics. whereas inactivation of cap binding factor would be sufficient to explain the selective inhibition of host translation, eif- is apparently needed for translating all mrnas; were eif- activity low, poliovirus would have to outcompete host mrnas for the residual activity. casting a wider net might identify other components that are involved in host shutoff by poliovirus. a few candidates have been ruled out. initiation factors eif- a and eif- b, for example, appear to be unaltered (duncan et al., ) . the normal association of host mrnas with the cytoskeleton is disrupted shortly after infection by poliovirus (lenk and penman, ) . whether that is the cause, or the effect, or is unrelated to inhibition of host translation remains unclear. such a dramatic effect seems unlikely to be gratuitous, but in other systems disruption of the cytoskeleton does not preclude all translation (welch and feramisco, ) . follow-up studies in the poliovirus system have not significantly extended penman's original, provocative observation. when bonneau et al. ( ) infected cv- cells first with vsv (which does not dissociate host mrnas from the cytoskeleton) and then superinfected with poliovirus, translation of vsv g and m proteins was inhibited and those mrnas were released from the cytoskeleton; unfortunately, it was not shown that vsv n and ns mrnas, which continued to be translated, remained bound to the cytoskeleton. the conclusion that translation requires association with the cytoskeleton hardly seems warranted. carrasco has suggested that the increased permeability of virusinfected cells to monovalent cations might mediate the switch from host to viral translation (carrasco and lacal, ; carrasco and smith, ) . when infected cells are incubated in medium containing sufficient excess nacl to inhibit the translation of most other proteins, poliovirus translation is fairly resistant (alonso and carrasco, a; nuss et al., , but the resistance is not as striking as with emc virus (alonso and carrasco, b) . the stimulation of in uitro translation by high salt is also less obvious with poliovirus mrna than with some other picornaviruses (bossart and bienz, ) . in the natural course of infection by poliovirus, the precipitous decline in host translation occurs within the first hours, prior to the observed increase in intracellular sodium ions (nair et al., ) . moreover, the synthesis of cellular proteins cannot be reactivated by incubating poliovirusinfected cells in hypotonic medium (alonso and carrasco, b) , a manipulation that works beautifully with emc virus. thus, hypertonicity does not appear to underlie the shutoff of host protein synthesis by poliovirus. morrow et al. ( ) made the astonishing discovery recently that the host-encoded kinase that is responsible for phosphorylating eif- binds to and mediates the replication of poliovirus rna. although that seems about as auspicious as a sheep shaking hands with a wolf, one can think of ways to rationalize such a dangerous move. if the pool of viral rna that serves as template for replication has to be kept free of ribosomes, for example, the presence of eif- kinase in replication complexes could help by phosphorylating the local pool of eif- . indeed, eif- might become globally phosphorylated in infected cells, and the resulting eif- deficiency could contribute to the inhibition of host protein synthesis. whereas older studies suggested that eif- was not deficient in polio-infected cells (brown and ehrenfeld, ; helentjaris and ehrenfeld, ) , the translation of heterologous mrnas in infected-cell extracts was restored to a limited extent by the addition of eif- (rose et al., ) -an effect that the authors chose to ignore. asim dasgupta has reopened the question, and his careful measurements reveal extensive phosphorylation of eif- (a) in poliovirus-infected cells (personal communication). the adenovirus system has generated considerable excitement recently because genetic manipulations, pioneered in shenk's laboratory, have revealed a regulatory mechanism that is novel, and yet connects to the extensive older literature on inactivation of initiation factor eif- . the focal point is a small virus-encoded rna called va-rna,. thimmappaya et al. ( ) found that, in cells infected by an adenovirus mutant that produced no va-rnai, late viral mrnas were synthesized, processed, and transported, but failed to be translated. in the absence of va-rna,, translation was blocked at the level of initiation (schneider et al., ) and the defect was ultimately localized to eif- . overwhelming evidence now supports the hypothesis that, in the absence of va-rna,, a kinase becomes activated that phosphorylates, and thus inactivates, eif- schneider et al., ; siekierka et al., ) . eif- kinase (one of the enzymes involved in the antiviral action of interferon; see lengyel, ) exists in uninfected hela cells in an inactive state, and is apparently activated by double-stranded rna that accumulates in infected cells as a by-product of adenovirus transcription (o'malley et al., ) . l the exact mechanism by which va-rnai blocks the action of eif- kinase is not yet known. an intriguing scenario can be extrapolated from a model that was proposed by rosen et al. ( ) in another context. their model proposes that high molecular weight double-stranded rna activates and targets eif- kinase: because both the kinase and eif- have binding sites for dsrna, high molecular weight dsrna could link the two proteins. by virtue of its small size, va-rna, might be able to bind to eif- or to eif- kinase, but not simultaneously to both. va-rnai would thus block the phosphorylation of eif- much as a monovalent hapten blocks antigen-antibody interactions. the proposal rationalizes the known properties of va-rna,: its small size (about nucleotides), doubled-stranded structure (monstein and philipson, , and the high concentration that is required to confer protection. whether the double-stranded regions of va-rnai are crucial for its function is not yet clear. have mutated va-rna, and found that extensive regions could be deleted without affecting biological activity, although certain other mutations were deleterious. further experiments will be needed to pinpoint the essential region(s) in va-rnai. a second adenovirusencoded species called va-rnai, rescues translation far less efficiently than va-rna, (thimmappaya et al., , and va-rnaii appears less extensively base paired (mathews and grodzicker, ) . in addition to its proven protective effect on eif- , it has been suggested that va-rna, might interact directly with viral mrnas to promote their translation (kaufman, ; schneider et al., ; svennson and akusjarvi, ) . a sequence-specific interaction seems unlikely, however, because the small rnas encoded by epstein-barr virus (which are related to adenovirus va-rnas by size but not sequence) can substitute to some extent for va-rnai (bhat and thimmappaya, , , and the facilitating effect of va-rna, extends a virus that is protected (by va-rna or some other mechanism) from the deleterious effects of its own symmetrical transcription process would also have some resistance to interferon. an interesting story along these lines is emerging with vaccinia virus (rice and kerr, ; whitaker-dowling and youngner, ) . sen et al. ( ) showed that, once kinase has been activated by binding dsrna, incubation with ribonuclease i does not abolish the ability of kinase to phosphorylate histone h (which was chosen as a convenient substrate), but neither the extent of trimming by the nuclease, nor the activity of the trimmed kinase on eif- , were determined. thus, the experiment does not contradict the targeting hypothesis for dsrna. not only to late adenovirus mrnas that carry the standard tripartite leader, but also to early adenovirus mrnas and late mrnas with a truncated version of the tripartite leader (svensson and akusjarvi, , adeno-associated virus mrnas (janik et al., , and various heterologous mrnas (svensson and akusjarvi, ) . the protective effect of va-rna, and the shutoff of host translation in adenovirus-infected cells might be two aspects of a single mechanism. if one postulates that the production of va-rna, by wild-type adenovirus is sufficient to protect only a portion of the eif- pool, the switch from host to viral protein synthesis could occur because late adenovirus mrnas outcompete host mrnas for the small residual translational capacity. the hypothesis that competition occurs during the late stage of infection by adenovirus is not entirely ad hoc. the overall translational capacity is low late in the infection (castiglia and flint, ) ; a portion of the eif- pool is phosphorylated (m. mathews, personal communication) ; polysomes are small and their size increases in response to a low dose of cycloheximide (perlman et al., ) ; and the decline in host translation correlates with the temporal onset and magnitude of late viral translation (castiglia and flint, ) . every mutation that has been shown to prevent host shutoff also prevents the cytoplasmic accumulation of late viral mrnas (babiss et al., ; halbert et al., ) . an interesting set of experiments by logan and shenk ( ) can be rationalized in terms of competition during the late stage of infection. they observed that transposition of the late tripartite leader to the early e a genes had no effect on the efficiency of translation of e a products at early times, but significantly enhanced the translation of e a mrnas at late times. this is understandable if there is no competition early in the infection, allowing efficient and inefficient mrnas to be translated equally well. the facilitating effect of the tripartite leader would become evident only late in the infection, when eif- has been partially inactivated and competition has set in. one might think along similar lines to explain the surprising ability of influenza virus mrnas to be translated in adenovirus-infected cells . in wild-type adenovirus-infected cells, in which host protein synthesis is drastically reduced, both adenovirus and influenza virus mrnas are translated efficiently. in cells infected by d , a deletion mutant that produces no va-rna,, adenovirus proteins are translated inefficiently, as noted above, but influenza virus proteins are still synthesized in abundance. despite that striking observation, there is little support or necessity for the notion that influenza virus establishes its own translational system. a simpler explanation is that the very low capacity for protein synthesis that persists in the absence of va-rna, is sufficient for the translation of influenza virus mrnas. for that explanation to be correct, influenza virus mrnas would have to be translated with extraordinary efficiency, and that prediction has recently been confirmed (katze et al., ) . what makes all this so intriguing is that the ' ends of influenza virus mrnas, which presumably dictate their high translational eficiency, are derived from host mrnas (krug, ) . it appears as if the viral capspecific endonuclease (which selects the cellular mrnas that will serve as donors) is biased toward the same features that facilitate translation. indeed, that deduction has been verified directly, at least in vitro. bouloy et al. ( bouloy et al. ( , found that p-globin mrna, which is translated more efficiently than a-globin mrna, is also a more efficient primer for influenza transcription; and alfalfa mosaic virus rna- , which translates in vitro with extraordinary eficiency, is the best known primer for influenza transcription. [from the fact that mrnas with '-o-methyl groups in the penultimate position of the cap function better than monomethylated caps as primers for influenza virus transcription (bouloy et al., ) , one is tempted to suggest that '- methyl groups enhance translation, although there is little direct evidence for that view!] if the selection of primers in infected cells follows the pattern that is seen in vitro, the influenza virus takeover scheme is indeed remarkable: the most efficient cellular mrnas would be sacrificed to construct viral mrnas that ips facto translate most efficiently. a few other possibilities for regulating translation in virus-infected cells are discussed briefly below. none of these topics is well understood at present, and the musings should be considered little more than that. cauliflower mosaic virus seems to be the only case in which the ability of eukaryotic ribosomes to reinitiate is fully exploited to produce several full-length proteins from one mrna. the structure of a few katze et al. ( ) have shown that influenza virus partially suppresses the activation of eif- kinase, and they suggest that this underlies the ability of influenza virus to replicate in cells infected by adenovirus mutant d . that interesting hypothesis would be stronger if it could be shown that influenza virus replicates even in cells infected by the adenovirus double mutant (vai -vaii -), and if superinfection with influenza virus could be shown not just to reduce the level of activated kinase, but to increase the level of functional eif- . other viral and cellular mrnas leads one t o predict that reinitiation is necessary for ribosomes to reach the major protein coding sequence, but the upstream open reading frames (orfs) in such mrnas are characteristically short. in some cases, however, the small peptides encoded near the ' end of the message might be biologically important. genetic studies indicate that this is certainly the case with the agnogene product of sv (margolskee and nathans, ) . in contrast, the three peptides encoded within the avian retrovirus leader sequence probably are not functional because there is little conservation of amino acid sequences among virus strains (hackett et al., ) . in retrovirus mutants that lack most of the leader sequence, the only known deficiency is the absence of a cis-acting packaging signal (mann et al., ; nishizawa et al., ) . comparison of different strains of poliovirus reveals that the number of upstream aug codons varies and the coding properties of the small orfs are not conserved (toyoda et al., ) . upstream minicistrons that do not encode anything interesting might nevertheless be important for regulation. several possibilities come to mind for retroviruses. the least interesting idea is that upstream aug codons accumulate, not from design, but from defaultbecause the deleterious effects on translation can easily be compensated by using efficient transcription signals to mass-produce retrovirus mrnas. the opposite view is that upstream aug codons are deliberately retained to throttle the synthesis of a protein that would be harmful if overproduced (tarpley and temin, ) . while that seems a reasonable ploy to use for oncogenes, it makes little sense when extended to viral structural genes. a third possibility derives from the observation that reinitiation usually is not % efficient. with preproinsulin constructs, for example, in which the efficiency of reinitiation is routinely % (kozak, b) , one might ask what the remaining % of the ribosomes are up to. one scenario is that, after s ribosomes have moved through the '-proximal orf, % of the s subunits detach at the terminator codon while the rest remain on the message, resume scanning, and reinitiate at the second aug codon. a more interesting possibility is that all s subunits remain bound and resume scanning, but only % reinitiate at the closest aug codon, perhaps because the codon-recognition step in inefficient in the absence of met-trna,, cap binding proteins, and/or other initiation factors-all of which were presumably released at an earlier step. [we do not know the precise sequence of events during initiation, but it seems likely that the factors that mediate the binding of met-trna, and mrna to the s ribosomal subunit are released prior to or during the joining of the s subunit at the first aug codon (moldave, ) .] if the factor-deficient s subunits that are unable to reinitiate at the second aug codon eventually become competent, they might reinitiate father downstream. thus, the effect of an upstream minicistron could be to loosen the process of initiation in a way that permits ribosomes to reach otherwise inaccessible internal aug codons. there is no evidence for this, as yet. we know only that reinitiation at the closest aug codon (following a terminator codon) is less than % efficient. yet another way in which ribosomes might gain access to internal aug codons, even in a message in which the major open reading frame initiates with a "strong" aug codon, relies on the presence of weak, out-of-frame initiator codons in the retrovirus leader sequence and the ability of ribosomes to reinitiate. this hypothetical scheme is best illustrated by using as an example an avian retrovirus mrna that encodes the e m glycoprotein (fig. ) . katz et al. ( ) have studied the effects of mutations in the leader region of this mrna, using as an a hypothesis whereby minor initiation sites in the leader sequence of retroviruses create a shunt that directs ribosomes to internal initiation sites. the diagram represents a subgenomic mrna that encodes the enu protein of avian leukosis virus (katz et al., ) . messenger rna is represented by a wavy line, the pathway followed by s ribosomal subunits is shown above the mrna, and the pathway of s ribosomes is shown below the mrna. a solid black line traces the pathway followed by most s subunits: they scan from the m g cap to the start of the enu coding sequence, marked "major start site," where a s subunit joins and translation begins. (some s subunits will stop and initiate at three upstream aug codons, but in each instance there is a nearby terminator codon, enabling ribosomes to reinitiate. thus, the upstream aug codons are irrelevant for the present discussion and are not shown.) of more significance are the many nonstandard codons (gug, uug, etc.) that lie in the standard context for initiation. such codons occur frequently in the - reading frame which is open (in the functional ev- viral genome) over a stretch of about nucleotides preceding the major enu start site; the open - reading frame ends nucleotides beyond the start of the enu coding sequence at uaa ss- , which is labeled t ~ in the figure. were a few s subunits to recognize the nonstandard upstream codons as initiation sites, the resulting s ribosomes-translating in theframe-would bypass the normal enu start site. a dashed line traces the pathway of this shunt. the main point is that ribosomes that terminate at t- could reinitiate a t an internal site which would be inaccessible were it not for the shunt. assay the ability to complement a replication-defective (env-) strain of rous sarcoma virus. their results are provocative. point mutations in positions - and - (i.e., and nucleotides upstream from the aug codon that initiates enu) caused a -fold reduction in complementation. on the other hand, the translational efficiency of deletion mutants varied from to % of the wild type level, and the variation did not correlate with the presence or absence of any particular portion of the leader sequence. to explain this puzzling pattern (or rather the absence thereof), skalka suggests that the mutations perturb some aspect of secondary structure that is critical for translation. because that idea is difficult to formulate in a way that can be tested, it can do no harm to consider an alternative explanation. the biological assay that was used has the advantage of being exquisitely sensitive, but it has the disadvantage of measuring the yield of enu protein only indirectly: the authors did not show a -fold reduction in enu synthesis; they showed a -fold drop in complementation. what if complementation were to require, in addition to enu, a second minor protein-either a truncated form of enu that initiates a little farther downstream or a small protein encoded in an alternate reading frame? (there is an open reading frame beginning at aug,,,,-,,,,, for example, that could direct the synthesis of a -kda protein.) because the context at the major enu start site is highly favorable, all of s ribosomal subunits that reach that site should initiate there; production of the putative internally initiated protein would therefore require a mechanism for shunting some ribosomes beyond the major enu start site. the hypothesis illustrated in fig. is that a small fraction of the ribosomes initiate within the leader sequence at weak sites (nonstandard codons that lie in a favorable context for initiation) in the - reading frame, and translate in that frame past the major enu start site, terminating at the site labeled tin fig. . the small fraction of ribosomes that follow this shunt could reinitiate to produce the second protein postulated above. the notion that the enu gene encodes two products is certainly ad hoc, but it rationalizes the behavior of skalka's mutants. the deleterious mutation in position - creates a terminator codon in the - reading frame, which would short-circuit the shunt and prevent synthesis of the internally initiated protein. in all of the deletion mutants that fail to complement effkiently, the weak upstream start sites are either in-frame with the major enu start site or terminate upstream from it-again abolishing the shunt. on the other hand, all of the deletion mutants that retain the ability to complement efficiently retain one or more weak upstream start sites (such as gug in position - in mutant , or uug in position - in mutant / ) which can feed ribosomes into the shunt. the hypoth-viral translation esis could be tested in two ways. one is to directly measure the yield of the major enu protein-which we predict will not vary, because it is the internally initiated protein that is deficient in these mutants. the best test would make use of a null mutant called pd / that lacks the major enu start site: that mutant should still make the second protein encoded within the enu gene, and therefore should complement all of the other mutants that have lost the shunt. viruses that replicate in the cytoplasm have the potential for coupling transcription with translation. for example, if ribosomes were to bind the ' end of reovirus mrnas as the nascent chains emerge from the subviral particle, the mrnas would be recruited for translation before the chains grew long enough to fold. that might enhance translation considerably, because the pattern of cleavage by t, rnase suggests that the capped terminus might be sequestered in mature reovirus mrnas (kozak and shatkin, b) . it would be fun to know whether reovirus mrnas are translated more efficiently in naturally infected cells than in cells transfected with cloned viral genes which are transcribed from a plasmid vector. the idea of coupling is ad hoc for reovirus, but there is a glimmer of evidence in the case of silkworm cytoplasmic polyhedrosis virus; whereas performed viral mrnas were inactive in reticulocyte or wheat germ translation systems, viral proteins were synthesized during coupled transcription-translation in frog oocytes (ikegami et al., ) . payne and mertens ( ) obtained somewhat different results, in that some viral proteins were made in vitro in the absence of transcription; but the polyhedron protein that predominates in vivo was still not produced in vitro. in the vaccinia virus system, cooper and moss ( ) observed more efficient synthesis of vaccinia proteins when transcription and translation were coupled. synergism could also occur in the opposite direction; i.e., viral transcription might be facilitated by translation. during the early hours of reovirus infection in l cells, transcription is mainly from genome segments m , s , s , and one of the l segments (nonoyama et al., ) . because mrnas from segments m , s , and s bind ribosomes very efficiently (shatkin and kozak, , one wonders whether preferential transcription is the consequence of preferential tran~ ation.l~ the the hypothesis is complicated, but not necessarily contradicted, by the finding that m , , and s are preferentially transcribed in viuo even in the presence of cycloheximide (shatkin and kozak, ) . although cycloheximide blocks elongation by s ribosomes, s ribosomal subunits could still bind to the nascent transcripts. possibility that coupled translation enhances transcription was fleetingly entertained for some other cytoplasmic viruses (ball and white, ; cooper and moss, ) , but the reticulocyte lysate appeared to enhance transcription only because it conferred protection against nucleolytic degradation . it remains possible that transcription and translation are obligatorily coupled in some less well studied rna viruses, as has been hinted for bunyaviruses (patterson and kolakofsky, ; pattnaik and abraham, ) . in the case of viruses that replicate in the nucleus, the possibility that movement of mrnas out of the nucleus might be coupled with translation has been raised from time to time. coupling clearly is not obligatory, because viral mrnas accumulate in the cytoplasm under many circumstances in which translation is blocked. a good example is the cytoplasmic accumulation of late adenovirus mrnas in the absence of va-rna,. on the other hand, the transport and translation of mrnas are sometimes coordinated. a striking example occurs in adenovirus-infected hela cells that are superinfected with influenza virus : whereas adenovirus blocks both the transport and translation of host mrnas, influenza virus mrnas escape both blocks. the probable mechanism that enables influenza virus mrnas to be translated was discussed in section ii ,e. what mechanism enables influenza mrnas to bypass the block that retains host mrnas in the nucleus? suggested one possibility, namely, an influenza virus-specific transport system. but it seems simpler to look for a single explanation that would account for both the preferential transport and translation. there could be competition at the level of transport, and the same features that make a message highly translatable might make it highly transportable. an alternative view is that the two processes are coupled. one might envision s ribosomal subunits monitoring the nuclear pores, such that only mrnas that can be translated under given circumstances will be transported. along those lines, babiss et az. ( ) have noted that, whereas host mrnas are neither transported nor translated in wildtype adenovirus-infected cells, transport and translation of host mrnas are coordinately restored by mutations in early viral genes that reduce the cytoplasmic accumulation of late viral mrnas. as an extension of the idea that a message will be transported only if it can be translated, one might suggest that mrnas are transported as soon as they become translatable. the consequence would be that translation could sometimes regulate the extent of splicing. some splicing events that could occur, were the transcript kept longer in the nucleus, would be prevented by "prematurely" pulling the mrna out. svensson et al. ( ) invoked this notion to explain some of their observations on the processing of adenovirus early mrnas. coupling of splicing with transport, and transport with translation, would explain why few if any incompletely processed transcripts enter the cytoplasm: no matter how many introns are present in a primary transcript, it remains in the nucleus until every intron has been removed-in effect, until it becomes translatable. it would seem as if the easiest way to judge whether a transcript is translatable is to attempt to translate it. the shutoff of host protein synthesis by herpes simplex virus might not involve a modification in the translational machinery per se. late (second-stage) shutoff is clearly caused by the massive degradation of host mrna. the puzzle of how the nuclease is targeted, such that it degrades host but not viral mrnas, has not yet been solved. a partial explanation might be that herpes virus mrnas are more highly structured, by virtue of their high g + c content. the unusual sensitivity of herpes virus mrnas to hypertonic stress is consistent with the hypothesis that they have extensive secondary structure. the irreversible (read and frenkel, ) early shutoff of host translation by a structural component of the herpes virion also seems to involve cleavage of host mrnas-enough to inactivate them for translation (fenwick and mcmenamin, , although they can still be detected by hybridization, more or less (nishioka and silverstein, b; schek and bachenheimer, ) . since mutants that are defective in stageone shutoff can still induce secondary shutoff of host protein synthesis (read and frenkel, , two distinct viral gene products, either nucleases or activators thereof, are apparently involved. a herpes virus mutant that is defective in stage-one host shutoff is defective in switching off the translation of early viral mrnas as well (read and frenkel, ) . the differential accumulation of adenovirus early mrnas is also mediated, in part, by the regulated degradation of some transcripts (wilson and darnell, ) . degradation of host mrnas might be part of the mechanism by which vaccinia and influenza viruses reduce host translation (see table i ), although clear-cut genetic evidence, such as that described for herpes virus, is lacking in those systems. the extent to which gene expression is regulated by posttranslational proteolytic degradation is probably not fully appreciated. there are striking, isolated examples, for example, the selective degradation of measles virus m protein (sheppard et al., , the rapid turnover of some early adenovirus proteins (spindler and berk, b) , and stabilization of the cellular protein p by its interaction with sv t antigen (oren et al., ) . given the intricacies of the ubiquitin pathway for proteolysis, it might be surprising were that pathway not perturbed by virus infection. some animal viruses might encode a function that protects foreign (i.e., viral) proteins from degradation, analogous to the pin function of bacteriophage t (simon et al., ) . although the pattern of codon usage in viral genes is sometimes different from that of the cellular genome, imbalances in the trna pool probably do not affect the yield of most viral proteins because the rate-limiting step is usually initiation rather than elongation. moreover, while there is convincing evidence for a preferred pattern of codon usage in highly expressed bacterial and yeast genes (bennetzen and hall, ; ikemura, ikemura, , , codon preference seems to be more relaxed in higher eukaryotes (tso et al., , and therefore the cellular trna pool might not be markedly skewed. consistent with the idea that codon usage is not a major regulatory factor in viral gene expression, the close conservation of amino acid sequences between some viruses is not always accompanied by conservation in the choice of codons (ou et al., ) . the degree to which expression might be limited by trna deficiencies has been tested in escherichia coli by using cloned genes that are rich in rare codons. the availability of trna was found to limit translation only when the mrna concentration was extraordinarily high (pedersen, ; robinson et al., ) . codon usage might regulate translation in more subtle ways, however. one possibility with some experimental justification is that ribosomes pause briefly at rare codons (lizardi et al., ; misra and reeves, ; varenne et al., ) . la discontinuous elongation is not incompatible with efficient translation, as pausing has been detected during the synthesis of some very abundant proteins (cepko and sharp, ; lizardi et al., ) . slowing translation in certain positions might facilitate folding of the polypeptide and/or its interaction with other components, however. the pattern of codon usage in the signal peptide portion of some genes encourages this notion (spieth et al., ) . the suppression of nonsense codons and the occurrence of frameshifting (see section ii,c) might also be facilitated by an imbalance between the cellular trna pool and the viral pattern of codon usage. * an alternative explanation for discontinuous elongation is that ribosomes pause when they encounter hairpin structures in the mrna, but that idea is without experimental support. what we have learned about the structure and function of animal virus mrnas can often be extrapolated to cellular mrnas. the mechanism of selecting the initiation site for protein synthesis certainly appears to follow a single formula. the translational machinery displays a certain flexibility (leaky scanning, frameshifting, etc.) that is exploited more frequently by viral than by cellular mrnas. that no (doubt reflects the limited coding capacity of most viral genomes. in contrast, it would seem easier and more efficient for the expansive cellular genome to separately encode two versions of a protein than to attempt to skirt the "monocistronic rule" in the ways described for viruses.lg it is important to remember that there are rules for breaking the monocistronic rule. using those principles, we can correctly predict the qualitative aspects of viral protein synthesis, with very few exceptions. we understand much less about the quantitative aspects of translation, however. although some of the parameters that determine efficiency have been identified in the preceding pages, or at least surmised, we usually cannot predict how efficiently a given mrna will be translated by summing the known parameters. future studies will almost certainly uncover other features that affect translational efficiency: "repressor" proteins, perhaps, or helix-unwinding proteins, or effects of '-noncoding sequences, or aspects of mrna primary and secondary structure that are not yet obvious. the suggestion that it is easier to block translation than to enhance it merits repetition. the most efficient mrnas might be those that cannot interact with regulatory rnas, proteins, etc. it is sometimes but not always true that viral mrnas are translated more efficiently than cellular mrnas. i persist in believing that many viruses inhibit cellular protein synthesis inadvertently, and gain little thereby. understanding the mechanism of host shutoff is nonetheless interesting. it might aid in designing virus vectors, and in our understanding of the conditions that promote persistent virus infections (ahmed and fields, ) . acknowledgments i thank many colleagues who kindly sent reprints and preprints, and especially those who gave me permission to cite their unpublished observations. in several instances the the '-terminal structure of the methylated mrna synthesized in vitro by vsv cellular protein synthesis shutoff by mengovirus: translation of nonviral and viral mrnas in extracts from uninfected and infected ascites tumor cells two forms of sv tantigen in abortive and lytic infection role of the s gene in the establishment of persistent reovirus infection in l cells reversion by hypotonic medium of the shutoff of protein synthesis induced by emc virus translation of capped viral mrnas in poliovirus-infected hela cells protein synthesis in hela cells double-infected with emc virus and poliovirus translation of capped virus mrna in emc virus-infected cells can acg serve as an initiation codon for protein synthesis in eukaryotic cells? sequences at both termini of the genes of reovirus serotype (dearing) the effect of infection with sindbis virus and its temperature sensitive mutants on cellular protein and dna synthesis sequencing studies of pichinde arenavirus s rna indicate a novel coding strategy, an ambisense viral s rna effect of adenovirus on metabolism of specific host mrnas: transport control and specific translational discrimination adenovirus type early region l b gene product is required for efficient shutoff of host protein synthesis adenovirus e b proteins are required for accumulation of late viral mrna and for effects on cellular mrna translation and transport polyriboadenylic acid preferentially inhibits in vitro translation of cellular compared to vaccinia virus mrnas inhibition of protein synthesis by vaccinia virus coupled transcription and translation in mammalian and avian cell-free systems the replication of picornaviruses expression from an internal aug codon of herpes simplex thymidine kinase gene inserted in a retrovirus vector the number of ribosomes on sv late s mrna is determined in part by the nucleotide sequence of its leader complete nucleotide sequence of alfalfa mosaic virus rna sequence analysis of hepatitis a virus cdna coding for capsid proteins and rna polymerase direct mapping of adeno-associated virus capsid proteins b and c: a possible acg initiation codon structure of the fmdv translation initiation site and of the structural proteins uag readthrough during tmv rna translation: isolation and sequence of two trnastyr with suppressor activity from tobacco plants the molecular basis for differential translation of tmv rna in tobacco and wheat germ measles virus p gene codes for two proteins inhibition of hela cell protein synthesis during adenovirus infection regulatory mutants of polyoma virus defective in dna replication and the synthesis of early proteins solubilization of a protein synthesis inhibitor from vaccinia virions codon selection in yeast translational interference a t overlapping reading frames in prokaryotic mrna effect of the tripartite leader on synthesis of a nonviral protein in an adenovirus recombinant poliovirus mutant that does not selectively inhibit host cell protein synthesis two small rnas encoded by epstein-barr virus can functionally substitute for the virus-associated rnas in the lytic growth of adenovirus construction and analysis of additional adenovirus substitution mutants confirm the complementation of vai rna function by two small rnas encoded by epstein-barr virus structural requirements of adenovirus vai rna for its translation enhancement function differential inhibition of host cell rna synthesis in several picornavirus-infected cell lines effect of viral infection on host protein synthesis and mrna association with the cytoplasmic cytoskeletal structure intermolecular duplexes formed from polyadenylated vaccinia virus rna the . kb e b mrna of human ad and ad codes for two tumor antigens starting at different aug triplets regulation of protein synthesis in hep cells and their cytoplasmic extracts after poliovirus infection globin mrnas are primers for the transcription of influenza viral rna in uitro both the -methyl and the '-o-methyl groups in the cap of mrna strongly influence its ability to act as primer for influenza virus rna transcription a transcript from the s segment of the germiston bunyavirus is uncapped and codes for the nucleoprotein and a nonstructural protein sequencing of coronavirus ibv genomic rna: three open reading frames in the ' "unique" region of mrna d translation of poliovirus rna in uitro: changes in cleavage pattern and initiation sites by ribosomal salt wash initiation factor preparations from poliovirusinfected cells restrict translation in reticulocyte lysates the white pock mutants of rabbit poxvirus: in uitro translation of early host range mutant mrna synthesis of alphavirus-specified rna complex regulation of sv earlyregion transcription from different overlapping promoters three intergenic regions of coronavirus mouse hepatitis virus genome rna contain a sequence that is homologous to the '-end of the viral mrna leader sequence molecular cloning and complete sequence determination of rna genome of human rhinovirus type permeabilization of cells during animal virus infection sodium ions and the shut-off of host cell protein synthesis by picornaviruses sequences of the s genes of the three serotypes of reovirus effects of adenovirus infection on rrna synthesis and maturation in hela cells sequence analysis of the viral core protein and membrane proteins of the flavivirus west nile virus primary structure of the west nile flavivirus genome region coding for all nonstructural proteins effect of poliovirus double-stranded rna on viral and host-cell protein synthesis translation of poliovirus rna in uitro: detection of two different initiation sites regulation of protein synthesis in vsvinfected cells by decreased initiation factor activity assembly of adenovirus major capsid protein is mediated by a nonvirion protein differential translation in normal and adenovirus type -infected cells and cell-free systems two initiation sites for foot and mouth disease virus polyprotein in vivo synthesis and processing of sindbis virus nonstructural proteins in uitro overlapping of the vp -vp gene and the vp gene in the sv q genome transcription of vaccinia virus mrna coupled to translation in uitro in uitro translation of immediate early, early, and late classes of rna from vaccinia virus-infected cells discriminatory inhibition of protein synthesis in cell-free systems by vaccinia virus transcripts bacterial peptide chain release factors: conserved primary structure and possible frameshift regulation of release factor identification of a unique guanine- -methyltransferase in semliki forest virus infected cell extracts mechanism of interferon action: differential effect of interferon on the synthesis of sv and reovirus polypeptides in monkey kidney cells structure-function relationship of rous sarcoma virus leader rna a ribosome binding site sequence is necessary for efficient expression of the distal gene of a translationally-coupled gene pair nucleotide sequence of a viral rna fragment that binds to eukaryotic ribosomes sequence of the '-untranslated region of brome mosaic virus coat protein messenger rna genome expression of plant positive-strand rna viruses inhibition of mrna binding to ribosomes by localized activation of dsrna-dependent protein kinase in vitro synthesis of the nonstructural c protein of sendai virus translational specificity in reovirus-infected mouse fibroblasts gene organization of the transforming region of adenovirus type dna initiation of translation of the cauliflower mosaic virus genome from a polycistronic mrna: evidence from deletion mutagenesis oligonucleotide directed mutagenesis of cauliflower mosaic virus dna using a repair-resistant nucleoside analogue identifies an agnogene initiation codon in uitro translation of poliovirus rna: utilization of internal initiation sites in reticulocyte lysate peptide maps and nterminal sequences of polypeptides from early region a of human adenovirus catalytic utilization of eif- and mrna binding proteins are limiting in lysates from vsv infected l cells host range restriction of vaccinia virus in cho cells: relationship to shutoff of protein synthesis regulation of initiation factors during translational repression caused by serum depletion cellular levels and covalent modification of the subunits of the cap binding protein complex, eif- f protein synthesis factors a and b are not altered by poliovirus infection of hela cells complete nucleotide sequence of bacteriophage t dna and the locations of t genetic elements functional characterization of eukaryotic mrna cap binding protein complex: effects on translation of capped and naturally uncapped rnas sequence relationship of glycosylated and unglycosylated gag polyproteins of moloney murine leukemia virus mapping the major transcripts of ground squirrel hepatitis virus: the presumptive template for reverse transcriptase is terminally redundant reovirus hemagglutinin mrna codes for two polypeptides in overlapping reading frames the complete sequence of the m rna of snowshoe hare bunyavirus reveals the presence of internal hydrophobic domains in the viral glycoprotein analyses of the mrna transcription processes of snowshoe hare bunyavirus s and m rna species human rhinovirus infection of hela cells results in the proteolytic cleavage of the p cap binding subunit viral translation proteolysis of a , dalton polypeptide associated with eif and a cap binding protein complex early virion-associated suppression of cellular protein synthesis by herpes simplex virus is accompanied by inactivation of mrna suppression of the synthesis of cellular macromolecules by herpes simplex virus structural difference between the ' termini of viral and cellular mrna in poliovirus-infected cells: possible basis for the inhibition of host protein synthesis effect of adenovirus infection on expression of human histone genes nucleotide sequence and genome organization of foot and mouth disease virus evidence against the role of k + in the shutoff of protein synthesis by vsv temporal regulation of baculovirus rna: overlapping early and late transcripts early and late functions in a bipartite rna virus: evidence for translational control by competition between viral mrnas bunyavirus nucleoprotein, n, and a nonstructural protein, nss, are coded by overlapping reading frames in the s rna reovirus messenger rna contains a methylated, blocked '-terminal structure: m g( ')ppp(s')gmpcp mechanism of formation of reovirus mrna '-terminal blocked and methylated sequence, m gpppgmpc '-terminal structure and mrna stability enhanced poliovirus replication in cytomegalovirus-infected human fibroblasts na+ and k + concentrations and the regulation of the interferon system in chick cells na+ and k + concentrations and the regulation of protein synthesis in sindbis virus-infected chick cells '-conformation of capped alfalfa mosaic virus rna may reflect its independence of cap structure or of cap-binding protein for efficient translation sv early mrnas contain multiple '-termini upstream and downstream from a hogness-goldberg sequence; a shift in ' termini during the lytic cycle is mediated by large t antigen heterogeneity and ' erminal structures of the late rnas of sv protein synthesis in lysates of aedes albopictus cells infected with vsv sendai virus contains overlapping genes expressed from a single mrna translational discrimination between the four rnas of alfalfa mosaic virus cap accessibility correlates with the initiation efficiency of amv rnas competition between cellular and viral mrnas in uitro is regulated by a messenger discriminatory initiation factor protein synthesis in cells infected with semliki forest virus is not controlled by intracellular cation changes new initiation factor activity required for globin mrna translation inhibition of dna-dependent transcription by the leader rna of vsv sv recombinant molecules express the gene encoding p transforming protein of harvey murine sarcoma virus sequence of the black beetle virus subgenomic rna and its location in the viral genome nucleotide sequence and genome organization of carnation mottle virus rna utilization of internal aug codons for initiation of protein synthesis directed by mrnas from normal and mutant genes encoding herpes simplex virusspecified thymidine kinase synthesis in vitro of a seven amino acid peptide encoded in the leader rna of rous sarcoma virus sodium and potassium transport in herpes simplex virus-infected cells adenovirus early region encodes functions required for efficient dna replication, late gene expression, and host cell shutoff effects of deletions on expression of the hsv thymidine kinase gene from the intact viral genome: the amino terminus is dispensable for catalytic activity influenza virus mrnas are incomplete transcripts of the genome rnas attenuation of late sv mrna synthesis is enhanced by the agnoprotein and is temporally regulated in isolated nuclear systems large surface proteins of hepatitis v virus containing the pre-s sequence inhibition of host cell protein synthesis by uvinactivated poliovirus control of protein synthesis in extracts from poliovirus-infected cells isolation and preliminary characterization of temperature-sensitive mutants of poliovirus type virion component of hsv type kos interferes with early shutoff of host protein synthesis induced by hsv type immunological detection of the mrna cap-binding protein site-specific recombination of bacteriophage a: dna sequence of regulatory regions and overlapping structural genes for int and xis characterization of the infections of permissive cells by host range mutants of vsv defective in rna methylation control of expression of the vaccinia virus thymidine kinase gene mapping and identification of the vaccinia virus thymidine kinase gene mutation of a termination codon affects src initiation alfalfa mosaic virus temperature sensitive mutants transcriptioncoupled translation of silkworm cytoplasmic polyhedrosis virus genomic rna in xenopus oocytes correlation between the abundance ofe. coli transfer rnas and the occurrence of the respective codons in its protein genes correlation between the abundance of yeast transfer rnas and the occurrence of the respective codons in protein genes inhibition of host protein synthesis and degradation of cellular mrnas during infection by influenza and hsv constitutive and conditional suppression of exogenous and endogenous genes by anti-sense rna expression of the b u s sarcoma virus pol gene by ribosomal frameshifting biosynthesis of reovirus-specified polypeptides: the reovirus sl mrna encodes two primary translation products biosynthesis of reovirus-specified polypeptides polypeptide cleavages in the formation of poliovirus proteins requirement for adenovirus dna-binding protein and va-i rna for production of adeno-associated virus polypeptides adeno-associated virus proteins: origin of the capsid components identification of the sv agnogene product: a dna binding protein further studies on the inhibition of cellular protein synthesis by vsv inhibition of host translation in emc virus-infected l cells: a novel mechanism comparison of initiation rates of emc virus and host protein synthesis in infected cells shutoff of hela cell protein synthesis by emc virus and poliovirus: a comparative study two initiation sites for translation of poliovirus rna in uitro: comparison of lsc and mahoney strains evidence for translational regulation of hsv type gd expression restriction of vsv in a nonpermissive rabbit cell line is at the level of protein synthesis inhibition of cell functions by rna-virus infections lysis gene expression of rna phage ms depends on a frameshift during translation of the overlapping coat protein gene deletions of n-terminal sequences of polyoma virus t-antigens reduce but do not abolish transformation of rat fibroblasts role of the avian retrovirus mrna leader in expression: evidence for novel translation control metabolism and expression of rna polymerase i transcripts in influenza virus-infected cells nuclear-cytoplasmic transport and vai rna-independent translation of influenza viral mrnas in late adenovirusinfected cells translational control by influenza virus: suppression of the kinase that phosphorylates the alpha subunit of initiation factor eif- and selective translation of influenza viral mrnas identification of the components necessary for adenovirus translational control and their utilization in cdna expression vectors viral protein synthesis in barley protoplasts inoculated with native and fractionated brome mosaic virus rna primary structure, gene organization and polypeptide expression of poliovirus rna interferon regulates c-myc gene expression in daudi cells at the post-transcriptional level protein synthesis directed by the rna from a plant virus in a normal animal cell how do eucaryotic ribosomes select initiation regions in messenger rna? inability of circular mrna to attach to eukaryotic ribosomes migration of s ribosomal subunits on mrna when initiation is perturbed by lowering magnesium or adding drugs evaluation of the "scanning model" for initiation of protein synthesis in eucaryotes possible role of flanking nucleotides in recognition of the aug initiator codon by eukaryotic ribosomes mechanism of mrna recognition by eukaryotic ribosomes during intiation of protein synthesis analysis of ribosome binding sites from the sl message of reovirus: initiation a t the first and second aug codons comparison of initiation of protein synthesis in procaryotes, eucaryotes, and organelles translation of insulin-related polypeptides from mrnas with tandemly reiterated copies of the ribosome binding site compilation and analysis of sequences upstream from the translational start site in eukaryotic mrnas selection of initiation sites by eucaryotic ribosomes: effect of inserting aug triplets upstream from the coding sequence for preproinsulin point mutations define a sequence flanking the aug initiator codon that modulates translation by eucaryotic ribosomes influences of mrna secondary structure on initiation by eucaryotic ribosomes migration of s ribosomal subunits on mrna in the presence of edeine identification of features in '-terminal fragments from reovirus mrna which are important for ribosome binding priming of influenza viral rna transcription by capped heterologous rnas relationship between membrane integrity and the inhibition of host translation in virus-infected mammalian cells characterization of leader , - . virus-infected cells rna sequences on the virion and mrnas of mouse hepatitis virus, a cytoplasmic rna virus nucleotide sequence of the gag gene and gag-pol junction of feline leukemia virus synthesis of hepatitis b surface antigen in mammalian cells: expression of the entire gene and the coding region translation of adenovirus serotype late mrnas influenza virus structural and nonstructural proteins in infected cells and their plasma membranes inactivation of cap-binding proteins accompanies the shut-off of host protein synthesis by poliovirus isolation and structural characterization of cap-binding proteins from poliovirus-infected hela cells poliovirus protease c (p - c) does not cleave p of the eucaryotic mrna cap-binding protein complex expression of vaccinia virus early mrnas in ehrlich ascites tumor cells. part of the polysomes at an early stage of virus infection are not bound to the cytoskeleton translation of cellular and viral early mrna in cell-free systems from uninfected and (vaccinia) virusinfected cells a t the early stage mrna discrimination in extracts from uninfected and reovirus-infected l-cells polyadenylic acid addition sites in the adenovirus type major late transcription unit biochemistry of interferons and their actions the cytoskeletal framework and poliovirus metabolism molecular cloning and partial sequencing of hepatitis a viral cdna initiation of translation at internal aug codons in mammalian cells discontinuous translation of silk fibroin in a reticulocyte cell-free system and in intact silk gland cells poliovirus protease does not mediate cleavage of the , -da component of the cap binding protein complex translational control of protein synthesis after infection by vsv vsv mrna and inhibition of translation of cellular mrna-is there a p function in vsv? adenovirus tripartite leader sequence enhances translation of mrnas late after infection eukaryotic ribosomes can recognize preproinsulin initiation codons irrespective of their position relative to the '-end of mrna the nonstructural proteins of sindbis virus as studied with an antibody specific for the c terminus of the nonstructural readthrough polyprotein differential inhibition of host protein synthesis in l cells infected with rna-temperature-sensitive mutants of vsv arrangment of late rnas transcribed from a . -kb ecori vaccinia virus dna fragment construction of a retrovirus packaging mutant and its use to produce helper-free defective retrovirus frameshift and intragenic suppressor mutations in a rous sarcoma provirus suggest src encodes two proteins suppression of a vp mutant of sv by missense mutations in serine codons of the viral agnogene virus-associated rnas of naturally occurring strains and variants of group c adenoviruses nucleotide sequence of the coat protein cistron and the '-noncoding region of cucumber green mottle mosaic virus rna influence of the host cell on proteins synthesized by different strains of influenza virus intermediates in the synthesis of tolc protein include an incomplete peptide stalled at a rare arg codon eukaryotic protein synthesis restricted translation of the genome of the flavivirus kunjin in vitro the conformation of adenovirus vai-rna in solution the host protein required for in vitro replication of poliovirus is a protein kinase that phosphorylates eif shutoff of host translation by emc virus infection does not involve cleavage of the eif f polypeptide that accompanies poliovirus infection inhibition of hela cell protein synthesis by the vaccinia virion irreversible effects of cycloheximide during the early period of vaccinia virus replication formation of the guanylylated and methylated '-terminus of vaccinia virus mrna biosynthesis of reovirus-specified polypeptides. multiplication rate but not yield of reovirus serotypes and correlates with the level of virus-mediated inhibition of cellular protein synthesis the regulation of translation in reovirus-infected cells modification of membrane permeability during semliki forest virus infection cell-free synthesis of a precursor polyprotein containing both gag and pol gene products by rauscher murine leukemia virus s rna guanidine-sensitive na+ accumulation by poliovirus-infected hela cells adenovirus gene expression: control at multiple steps of mrna biogenesis the complete sequence of the chicken crystallin gene and its ' flanking region alterations in the protein synthetic apparatus of friend erythroleukemia cells infected with vsv or herpes simplex virus requirement of protein synthesis for the degradation of host mrna in friend erythroleukemia cells infected with hsv type unusual features of the leader sequence of rous sarcoma virus packaging mutant tk the '-end of poliovirus mrna is not capped with m'g( ')ppp(s') complete nucleotide sequence of the attenuated poliovirus sabin strain genome control of transcription of the reovirus genome vaccinia virusinduced changes in "a+] and [ k + ] in hela cells selective blockage of initiation of host protein synthesis in rna-virus-infected cells a mechanism for the control of protein synthesis by adenovirus va rnal on the regulation of protein synthesis in vaccinia virus infected cells a joint product of the genes gag and pol of avian sarcoma virus: a possible precursor of reverse transcriptase post-translational regulation of the k cellular tumor antigen in normal and transformed cells the influence of the host cell on the inhibition of virus protein synthesis in cells doubly infected with vsv and mengovirus sequence studies of several alphavirus genomic rnas in the region containing the start of the subgenomic rna characterization of a ts mutant of vaccinia virus , - . novel function that prevents virus-induced breakdown of rna hepatitis b virus genes and their expression in e. coli characterization of lacrosse virus smallgenome transcripts multiple leader rnas and mrnas are transcribed from the lacrosse virus small genome segment identification of four complementary rna species in akabane virus-infected cells the reoviridae methylmercury hydroxide enhancement of translation and transcription of ovalbumin and conalbumin mrnas e . coli ribosomes translate in viuo with variable rate leaky uag termination codon in tobacco mosaic virus rna characteristics of a coupled cellfree transcription and translation system directed by vaccinia cores insertion mutagenesis to increase secondary structure within the 'moncoding region of a eukaryotic mrna reduces translational efiiciency evidence for the presence of an inhibitor on ribosomes in mouse l cells infected with mengovirus regulation of herpesvirus macromolecular synthesis. properties of a polypeptides made in hsv- and hsv- infected cells utilization of messenger in adenovirus- -infected cells at normal and elevated temperatures a frameshift mutation in the pre-s region of the human hepatitis b virus genome allows production of surface antigen particles but eliminates binding to polymerized albumin characterization of ribosome binding on rous sarcoma virus rna in uitro translation of mulv and msv rnas in nuclease-treated reticulocyte extracts: enhancement of the gag-pol polypeptide with yeast suppressor trna proteins specified by avian erythroblastosis virus: coding region localization and identification of a previously undetected erb-b polypeptide molecular cloning of poliovirus cdna and determination of the complete nucleotide sequence of the viral genome cell-free translation of frog virus mrnas herpes simplex virus mutants defective in the virion-associated shutoff of host polypeptide synthesis and exhibiting abnormal synthesis of q (immediate early) viral polypeptides the genome of simian virus regulation of a protein synthesis initiation factor by adenovirus va-rnai interferon-mediated, doubled-stranded rna-dependent protein kinase is inhibited in extracts from vaccinia virus-infected cells vaccinia virus induces cellular mrna degradation double-stranded rnadependent protein kinase and - a system are both activated in interferon-treated, emc virus-infected hela cells nucleotide sequence of yellow fever virus: implications for flavivirus gene expression and evolution structure of the flavivirus genome ' and ' terminal nucleotide sequences of the rna genome segments of influenza virus codon usage can affect efficiency of translation of genes in e. coli inhibition of translation by poliovirus: inactivation of a specific initiation factor transcriptional and translational mapping and nucleotide sequence analysis of a vaccinia virus gene encoding the precursor of the major core polypeptide b inhibition of host protein synthesis by vaccinia virus: fate of cell mrna and synthesis of small poly(a)-rich polyribonucleotides in the presence of actinomycin d messenger rna specificity in the inhibition of eukaryotic translation by double-stranded rna post-transcriptional regulation accounts for the transactivation of the human t-lymphotropic virus type selective and reversible inhibition of initiation of protein synthesis in mammalian cells b or not b: regulation of the catalytic utilization of eif the synthesis of a dna-like rna in the cytoplasm of hela cells infected with vaccinia virus increased phosphorylation of eif- a in interferon-treated, reovirus-infected mouse fibroblasts degradation of cellular mrnas induced by a virion-associated factor during herpes simplex virus infection of vero cells adenovirus vai rna facilitates the initiation of translation in virus-infected cells rna prevents phosphorylation of eif a subsequent to infection nucleotide sequence of rous sarcoma virus interferon, double-stranded rna, and protein phosphorylation reovirus inhibition of cellular rna and protein synthesis: role of the s gene the reoviridae capping of eucaryotic mrnas molecular mechanisms of virus-mediated cytopathology mrna cap binding proteins: essential factors for initiating translation the reoviridae complete nucleotide sequence of the neuraminidase gene of influenza b virus a previously unrecognized b virus glycoprotein from a bicistronic mrna that also encodes the viral neuraminidase measles virus matrix protein synthesized in a subacute sclerosing panencephalitis cell line translation of brome mosaic viral ribonucleic acid in a cell-free system derived from wheat embryo nucleotide sequence of moloney murine leukaemia virus mechanism of translational control by partial phosphorylation of the a subunit of eukaryotic initiation factor translational control by adenovirus: lack of va-rna, during adenovirus infection results in phosphorylation of eif- and inhibition of protein synthesis alterations in the protein synthetic apparatus of cells infected with herpes simplex virus stabilization of proteins by a bacteriophage t gene cloned in e . coli coding sequence of coronavirus mhv-jhm mrna coronavirus mhv-jhm mrna has a sequence arrangement which potentially allows translation of a second, downstream open reading frame reovirus-induced modification of cap-dependent translation in infected l cells regulation of translation in l-cells infected with reovirus cytoplasmic methionine transfer rnas from eukaryotes extraction and fingerprint of sv large and small t-antigens production of human beta interferon in insect cells infected with a baculovirus expression vector differential stimulation of capped mrna translation in uitro by cap binding protein probing the function of the eucaryotic ' cap structure by using a monoclonal antibody directed against cap-binding proteins coronavirus mrna synthesis involves fusion of non-contiguous sequences translation efficiency of zein mrna is reduced by hybrid formation between the '-and '-untranslated region the nucleotide sequence of a nematode vitellogenin gene translation efficiency of adenovirus early region a mrnas deleted in the ' untranslated region rapid intracellular turnover of adenovirus early region a proteins herpes simplex virus-induced changes in cellular and adenovirus rna metabolism in an adenovirus type -transformed human cell line the effect of hypertonic conditions on protein synthesis in cells infected with herpes virus sequence coding for the alphavirus nonstructural proteins is interrupted by an opal termination codon adenovirus va rnai: a positive regulator of mrna translation adenovirus va rnai mediates a translational stimulation which is not restricted to the viral mrnas splicing of adenovirus early region a mrnas is non-sequential a cell-free model of the emc virus-induced inhibition of host cell protein synthesis two forms of purified m g-cap binding protein with different effects on capped mrna translation in extracts of uninfected and poliovirus-infected hela cells the location of v-src in a retrovirus vector determines whether the virus is toxic or transforming adenovirus vai rna is required for efficient translation of viral mrnas a t late times after infection the hepatitis b virus complete nucleotide sequences of all three poliovirus serotype genomes isolation and characterization of rat and human glyceraldehyde- -phosphate dehydrogenase cdnas: genomic complexity and molecular evolution of the gene shutoff of neuroblastoma cell protein synthesis by semliki forest virus: loss of ability of crude initiation factors to recognize early sfv and host mrnas infection of neuroblastoma cells by sfv translation is a nonuniform process. effects of trna availability on the rate of elongation of nascent polypeptide chains translational control of early protein synthesis at the late stage of vaccinia virus infection mutational alterations within the sv leader segment generate altered s and s mrnas individual hsv transcripts: characterization of specific genes. jn "the herpesviruses nucleotide sequence of the thymidine kinase gene of herpes simplex virus type the role of mrna competition in regulating translation a single uga codon functions as a natural termination signal in the coliphage q p coat protein cistron disruption of the three cytoskeletal networks - marilyn kozak in mammalian cells does not affect transcription, translation, or protein translocation changes induced by heat shock protein synthesis in bhk- cells infected with semliki forest virus characterization of a specific kinase inhibitory factor produced by vaccinia virus which inhibits the interferon-induced protein kinase nucleotide sequence of an immediateearly frog virus gene macromolecular synthesis in cells infected by frog virus further genetic localization of the transforming sequences of the p v-ras gene of harvey murine sarcoma virus control of mrna concentration by differential cytoplasmic half-life evidence that a g u a u e a and ccaagmga initiate translation in the same mrna in region e of adenovirus differential phosphorylation of soluble versus ribosome-bound eif in the ehrlich ascites tumor cell resistance to inhibitors of mammalian cell protein synthesis induced by preincubation in hypertonic growth medium murine leukemia virus protease is encoded by the gag-pol gene and is synthesized through suppression of an amber termination codon translational readthrough of an amber termination codon during synthesis of feline leukemia virus protease splicing in adenovirus and other animal viruses s. cereuisiae ribosomes recognize non-aug initiation codons studies on the intracellular synthesis of reovirus-specified proteins key: cord- -wzwpyvll authors: castelló, alfredo; Álvarez, enrique; carrasco, luis title: the multifaceted poliovirus a protease: regulation of gene expression by picornavirus proteases date: - - journal: j biomed biotechnol doi: . / / sha: doc_id: cord_uid: wzwpyvll after entry into animal cells, most viruses hijack essential components involved in gene expression. this is the case of poliovirus, which abrogates cellular translation soon after virus internalization. abrogation is achieved by cleavage of both eif gi and eif gii by the viral protease a. apart from the interference of poliovirus with cellular protein synthesis, other gene expression steps such as rna and protein trafficking between nucleus and cytoplasm are also altered. poliovirus a(pro) is capable of hydrolyzing components of the nuclear pore, thus preventing an efficient antiviral response by the host cell. here, we compare in detail poliovirus a(pro) with other viral proteins (from picornaviruses and unrelated families) as regard to their activity on key host factors that control gene expression. it is possible that future analyses to determine the cellular proteins targeted by a(pro) will uncover other cellular functions ablated by poliovirus infection. further understanding of the cellular proteins hydrolyzed by a(pro) will add further insight into the molecular mechanism by which poliovirus and other viruses interact with the host cell. a great variety of animal viruses encode for proteases that accomplish crucial functions during the biological cycle of the virus [ ] . usually, the main function of these proteases is to proteolyze viral polypeptide precursors to render mature viral proteins that form part of viral capsids or participate in virus vegetative processes [ ] . although both dna and rna viruses can encode proteases, the proteolytic tailoring of polypeptide precursors is most common among viruses with positive single-stranded rna genomes, such as picornaviruses, flaviviruses, caliciviruses, and retroviruses [ ] [ ] [ ] [ ] [ ] . this mechanism of gene expression by proteolytic processing serves to compress the genetic information of viruses in the limited space provided by the genome. in this manner, viruses reduce the genetic space occupied by and untranslated regions (utrs), the signals devoted for mrna transcription and to initiate translation are minimal, such that, for instance, in the case of picornaviruses or flaviviruses, only one and utr is necessary for viral replication, transcription, translation, and morphogenesis, despite the fact that several viral proteins are synthesized by the infected cells. in addition, a number of polypeptide precursors may exhibit functions that differ from those present in their mature products. in the case of poliovirus (pv), eleven mature proteins are produced from a single translation initiation event, and at least two precursors, bc and cd, accomplish functions which are not present in their mature proteins. taking together all these considerations, the "proteolytic strategy" provides the small rna viruses with an advantageous and efficient mechanism for distribution of the genome to accomplish all the viral biological functions with the smaller genetic space. apart from generation of active viral proteins that participate in capsid morphogenesis and genome replication, viral proteases may also target a number of cellular proteins. proteolysis of these cellular substrates can very much affect a variety of cellular processes and play an important role in virus-induced cytopathogenesis [ , ] . in this regard, productive poliovirus infection induces rapid morphological alterations in host-cell. among them, the most prevalent is the accumulation of numerous membranous vesicles in the cytoplasm, derived from endoplasmic reticulum where the viral proteins c and bc play a central role [ ] . in addition, cellular shape is modified upon viral replication giving rise to cell rounding, which is most probably induced by disorders in the cytoskeletal network [ ] . finally, chromatin condensates at late times postinfection, associated with the nuclear envelope except for sites where nuclear pores are placed [ ] . interestingly, individual expression of the viral proteases a pro and c pro leads to the induction of most of these cytopathic effects, supporting the idea that these proteases actively contribute to the viral-induced morphological changes [ ] . indeed, long-term expression of either a pro or c pro triggers the activation of caspases and, thus, cell death by apoptosis [ , ] , reflecting the strong cytotoxicity of both proteases. in addition to the cytopathic effects induced by a pro and c pro , hydrolysis of host proteins may impact on other cellular functions such as the antiviral responses to virus infection. activation of innate immunity pathways, as well as the establishment of an antiviral response, is absolutely dependent on signals traversing the nuclear membrane through the nuclear pore complex. therefore, many viruses block cellular gene expression at different levels, that is, translation, transcription or protein and rna trafficking between nucleus and cytoplasm. the blockade of active trafficking can inhibit the nuclear import of antiviral signals or prevent the export of cellular mrnas detrimental to virus processes. all these effects can be achieved by hydrolysis of specific cellular proteins. the precise number of cellular proteins degraded by a viral protease, which is known as the "degradome," still remains unknown for a given viral protease. perhaps, one of the beststudied proteases in this respect is pv a pro . the discovery that pv a pro bisects the initiation factor of translation eif g leading to the regulation of translation in the infected cells has attracted much attention from many laboratories during the past three decades [ , ] . more recently, a pro has been involved in the alteration of rna and protein trafficking between the nucleus and the cytoplasm upon proteolysis of several nucleoporins [ ] [ ] [ ] . the present paper focuses on the multifaceted activities of a pro and its regulation of different viral and cellular processes. pv is a prototype member of the picornaviridae family that infects cells of human or simian origin cytolytically or persistently and is responsible for poliomyelitis in humans [ ] . the rna genome is housed in a naked capsid formed by copies of each of the four structural proteins: vp , vp , vp , and vp . the infectious cycle commences by the attachment of a viral particle to cellular receptors present at the cell surface [ , ] . this interaction leads to virion internalization and destabilization of the capsid, which adopts a less compact structure. once the rna is released in the cytoplasm, it interacts with the translational machinery, directing the synthesis of viral proteins during the early phase of infection. the pv genome is composed of a single-stranded rna copy of positive polarity of about . kb [ , ] . this rna molecule is uncapped and contains a poly(a) tail at its end and a single open reading frame, which encodes for a polyprotein of about two thousands amino acid residues. this polyprotein is proteolytically processed giving rise to the mature viral proteins [ ] (figure (a) ). three different cleavages can be distinguished on the viral polyprotein: (i) polysomal cleavages that are produced on the nascent polypeptide chain. the first of these cleavages is catalyzed by a pro at its amino terminus separating the p precursor that encodes for the structural proteins from the rest of nonstructural polypeptides (figure (b) ). the second cleavage still on polysomes is performed by c pro , releasing the p precursor ( abc) from p ( abcd); (ii) cytoplasmic cleavages that are mostly exerted by c pro and (iii) hydrolysis of vp (vp -vp ), which is concomitant with the morphogenesis of virus particles [ , ] . all these hydrolytic events lead to the formation of eleven mature proteins and several precursors such as p , p , p , vp , vp , vp , bc, ab, and cd. this last precursor, cd, can be used as substrate by a pro or c pro . the alternative cleavage carried out by a pro renders the mature products c and d , whereas c pro generates the canonical proteins c pro and d pol . however, the biological significance of this alternative cleavage is obscure because pv mutated at a pro -cleavage site on cd does not exhibit defects in virus replication [ ] . the nonstructural proteins that are generated participate in the replication of viral genomes [ , ] . to this end, the positive rna genome is recognized at its end by proteins of the replication complex to synthesize the complementary rna strand of negative polarity. in this process, b protein, also known as vpg, acts as a primer to initiate viral rna transcription [ ] . this leads to the formation of a doublestranded rna molecule, also known as the replicative form. the negative rna synthesized serves in turn as a template to direct the synthesis of several copies of positive rna, so this process leads to the production of several nascent rna molecules with a vpg molecule bound to their ends on the negative rna molecule forming a replicative intermediate. the positive rna molecules synthesized may participate in three processes (i) to serve as templates for synthesizing more negative rna molecules; (ii) as mrnas that will be engaged in translation, and (iii) as genomes that will be encapsidated in new viral particles. in picornaviruses, the only type of mrna molecule known is exactly the same as the genome. once the synthesis of several thousands of positive rna molecules is performed, the late phase of translation takes place, giving rise also during this period to a great amount of viral proteins, some of which will participate in virus morphogenesis. this late phase of infection is preceded by the abrogation of cellular mrna translation, such that only viral proteins are being synthesized late in the pv life cycle [ ] . in the case of picornaviruses, transcription is dependent on continuous viral protein synthesis [ ] . thus, inhibition of viral mrna translation provokes the sudden blockade of viral rna synthesis. moreover, translation is coupled to transcription, such that viral rnas transfected journal of biomedicine and biotechnology figure : (a) structure of poliovirus genome and proteolytic processing of its polyprotein. the pv genome consists of a single-stranded, positive-sense polarity rna molecule, which encodes a single polyprotein. the nontranslated region (ntr) is covalently linked to the viral protein vpg. the pv genome is polyadenylated (a n ) in its end. the polyprotein contains four structural (p ) and seven nonstructural (p and p ) proteins that are released from the polypeptide chain by proteolytic processing mediated by the viral-encoded proteinases a pro and c pro / cd pro . the intermediate products of processing bc, cd, and ab exhibit functions distinct from those of their respective final cleavage products. the alternative cleavage carried out by a pro rendering the mature products c and d is also shown. (b) once the ribosome has synthesized the pv a pro sequence and continues translation on the p region, the autocatalytic activity of pv a pro is manifested by cleaving itself at its amino terminus still on the nascent polypeptide chain. this cleavage liberates the p precursor that will render the capsid proteins on subsequent proteolytic events catalyzed by c pro or cd pro . cleavage at the carboxy terminus of pv a pro on the p precursor is accomplished by pv c pro , leaving free a pro and generating the bc precursor. into picornavirus-infected cells are not able to direct protein synthesis [ ] . therefore, these two processes of viral macromolecular biosynthesis are tightly coupled, making it difficult to determine exactly the function affected in some pv mutants. notably, continuous lipid and cellular membrane synthesis is also necessary for pv rna synthesis [ ] . the morphogenesis of progeny virions in pv-infected cells is observed concomitantly with viral rna translation and replication. the release of new viral particles takes place by cell lysis, due to membrane permeabilization that occurs at the late phase of infection [ ] . viroporin b and its precursor bc are responsible for this permeabilization upon the formation of pore channels in cellular membranes [ , ] . pv has represented a useful model to gain insight into diverse aspects of molecular biology and gene expression. a number of discoveries concerning animal viruses with rna genomes were initially made in pv. for example, the presence of uncapped mrna, the sequencing and development of an infectious cdna clone, the three-dimensional structure of a virus particle, the discovery of the ires elements, the synthesis of an infectious virus in a cell-free system, the chemical synthesis of a complete viral genomes, among others, were initially reported in pv [ ] [ ] [ ] [ ] [ ] . in addition, the first time that eif g was found proteolytically cleaved was in pv-infected cells [ ] . picornaviruses encode different proteases depending on the virus species although it is common to all of them to encode c pro and its precursor cd pro (figure ). in pv, both these exhibit protease activity, and they execute most of the hydrolytic events on the viral polyprotein [ , , ] . apart from these two proteases, c pro and cd pro , picornaviruses also contain a a gene, whose product in some species exhibits proteolytic activity, as is the case for pv ( figure ) . a pro has a limited proteolytic effect on the polyprotein and its function is most probably one of altering cellular functions by the cleavage of a number of cellular proteins. in this regard, the best studied of these cleavages is the bisection of eif g ( figure ) [ ] . the general organization of the picornavirus genomes is to encode for p -p -p precursors giving rise to - - mature products. some picornavirus species, in addition, encode a leader protein (l) placed before p ( figure ) [ ] . in the case of aphthoviruses, such as foot-and-mouth disease virus (fmdv), the l protein has proteolytic activity and it is known as l pro [ , ] . during polyprotein synthesis l pro is the first protein synthesized and its autoproteolytic activity releases itself from the rest of the polypeptide chain. thus, the only known hydrolysis executed by l pro on the polyprotein is to hydrolyze between its carboxy terminus and the amino terminus of vp ( figure ). because l pro does not play a direct role in viral replication [ ] and its protease activity has a limited impact in the viral polyprotein, this protease may be involved in the interaction with the hostcell [ ] [ ] [ ] . indeed, l pro also exhibits proteolytic activity on eif g acting at a position close to that of pv a pro (figure ) [ ] [ ] [ ] . in the case of fmdv, the a protein is reduced to a small peptide of residues that does not hydrolyze eif g; instead it induces the release of a from its carboxy terminus by a ribosomal skip mechanism [ , ] . this model proposes that fmdv a modifies the activity of the ribosome to promote hydrolysis of the peptidyl( a)-trna(gly) ester linkage at the c-terminus of a, thereby releasing the polypeptide from the translational complex. however, not all l or a proteins from picornaviruses exhibit protease activity, since in the case of emcv, which encodes both proteins, neither has been demonstrated to possess proteolytic activity [ ] . all known proteases have been classified in four classes and many subgroups according to three parameters: (i) their catalytic center, (ii) their substrate specificity, and (iii) their three-dimensional structure. the classification of the different picornavirus proteases initially relied upon the effect of protease inhibitors. compounds that blocked sulphydryl groups abrogated the proteolytic activity of a pro and c pro , suggesting that the nucleophilic aminoacid in the active site was cysteine [ , ] . however, another cysteine inhibitor such as e had no effect on these proteases, while l pro activity was inhibited not only by sulphydrylactive compounds but also by e [ , ] . these findings together with structural observations imply that l pro belongs to the class of papain-like cysteine proteinases [ , [ ] [ ] [ ] . comparison of the structure of picornavirus proteases with prototypes of cellular ones revealed that both a pro and c pro are similar in structure to the chymotrypsin like group [ ] [ ] [ ] . picornavirus c pro reflects similarities to the staphylococcus aureus proteinase, whereas a pro is more akin to streptomyces griseus proteinase a. pv a pro is a protein composed of amino acids that belongs to the cysteine protease group [ ] . pv a pro is autocatalytically processed at its amino terminus between the capsid protein vp and a [ ] (see figure (b)). the determinants of substrate specificity of picornaviral pv a pro have been investigated in detail by identification of cleavage sites by n-terminal edman degradation, mutational analysis and using synthetic peptides as substrates [ , [ ] [ ] [ ] . pv a pro can recognize a wide variety of amino acid residues at the p position. the determinants of substrate specificity for pv a pro lie at positions p , p , p , and p , which are preferentially occupied with ile/leu, thr/ser, gly, and pro, respectively. moreover, the determinants of substrate specificity of hrv and coxsackievirus a pro are very similar to those found for pv a pro [ ] [ ] [ ] . the yeast two-hybrid system has been used to identify the substrate sequence interacting with pv a pro . all the sequences identified contain the leu-x-thr-z motif (x for any amino acid; z for a hydrophobic residue) in positions from p to p suggesting the presence of a common interacting site on pv a pro substrates [ ] . several a pro variants have been generated in the entire pv genome or in the isolated a gene. generation of pv a pro mutants was initially used to identify the cys , his , and asp as the residues that form part of the catalytic triad of a pro [ , ] . these data have been confirmed in the structure of hrv a pro [ ] . the role of the conserved cys and his residues in the structure-function relationship has also been studied by mutagenesis. the residues cys , cys , cys , and his play a critical role in the cis and trans proteolytic activity by maintaining a pro structure [ ] . the structure of hrv a pro shows that the zn + ion is coordinated tetrahedrally by the side chains of these conserved cys and his residues. this zn + ion is tightly bound near to the c-terminal domain and may be important for the stability of the a pro [ , , ] . the yeast saccharomyces cerevisiae has been used as a system to obtain pv a pro variants [ ] . the fact that this protease is very toxic for yeast has been exploited to generate a pro variants devoid of this cytotoxicity. using this approach, several pv a pro unable to cleave eif g have been obtained. the characterization of these mutants revealed a region in a pro involved in the interaction with substrates but none of the mutations were found in the catalytic triad. a parallelism has been observed between the ability of these pv a pro variants to block protein synthesis and to cleave eif g [ ] . pv mutants in a gene that lack trans but not cis proteolytic activity have been also identified. normal journal of biomedicine and biotechnology processing of the viral polyprotein is observed upon infection with these pv variants, whereas eif g remains intact in these cells [ ] . interestingly, rna replication of those mutant viruses is hampered, suggesting that there is a correlation between pv rna replication and the trans activity of a pro . it is controversial whether pv a pro contributes directly to viral replication or not. although for many years it was thought that pv a pro plays a direct role in pv replication, more recent studies have shown that a full-length dicistronic pv construct lacking a pro is capable to give rise to progeny viruses. moreover, virus yields of pv variants lacking the p coding region is partially restored when p is expressed in trans, suggesting that cleavage of the viral polyprotein by pv a pro is not essential for viral replication. [ ] . however, it is known that a pro is important for inducing the cytophatic effect and for avoiding the inhibition of pv replication in interferon (ifn) α treated cells [ ] . in agreement with the idea that a pro participates in viral rna replication, a fraction of this protease localizes in pv replicative foci although the majority of a pro is associated with the matrix structure in the cytoplasm of infected cells [ ] . however, the presence of a pro in the proximity of replication complexes does not demonstrate that it participates directly in the replication process. . . structure and functioning of eif g. the process of translation can be divided in different steps: initiation, elongation, termination and ribosome recycling. the synthesis of cellular proteins is highly regulated, and in this sense, the most precisely controlled step is the initiation of translation (for a recent review, see [ , ] ). for most eukaryotic mrnas, the initiation of translation commences with the recognition of the cap structure ( mgpppn) and the poly(a) tail by the heterotrimeric complex eif f and the poly(a)-binding protein (pabp), respectively, followed by the recruitment of the s preinitiation complex containing the s ribosomal subunit, the ternary complex met-trna met i -eif -gtp, and the eukaryotic initiation factors (eifs), , a, , and . then, the preinitiation complex scans along the untranslated region until an aug initiation codon is encountered in a favourable context. the perfect complementarity between the aug start codon and the anticodon of met-trna i leads to the arrest of scanning and the hydrolysis of gtp in the ternary complex. the release of eif -gdp and other factors triggers the interaction of the preinitiation complex with the s ribosomal subunit to form the s initiation complex, proceeding to translation of the mrna coding region. a number of eukaryotic initiation factors participate in both mrna binding and scanning of the utr of the mrna by the small ribosomal subunit. the cap-binding protein eif e, together with the dead-box helicase eif a and the translation initiation factor eif g, forms the protein complex eif f. the heterotrimeric complex eif f is required for recruiting the s preinitiation complex onto the cap structure located at the end of the mrna [ ] . in this sense, eif g, the larger polypeptide of eif f, functions as an adaptor molecule that bridges the mrnas to ribosomes via interactions with factors eif e (which binds the cap structure), pabp (which binds the poly(a) tail), and eif , which interacts with the s ribosome subunit ( figure ) [ , [ ] [ ] [ ] . in addition, eif g also contains binding sites for other polypeptides involved in translation, such as the rna helicase eif a ( figure ) [ ] , which is required to unwind the secondary structure within the mrna leader sequence that would otherwise inhibit ribosome scanning. the simultaneous interaction of eif g with eif e and pabp promotes circularization of the mrna in a closed loop that facilitates the initiation of new rounds of translation by the proximity of the and ends [ , ] . furthermore, eif g also interacts with the mitogen-activated protein kinase (mnk ) (figure ), which phosphorylates eif e, although the role of this phosphorylation in the initiation of translation in still unclear [ ] [ ] [ ] [ ] [ ] . two forms of eif g, known as eif gi and eif gii, have been identified in mammalian cells. both forms show only % amino acid sequence identity but they are thought to be functionally interchangeable due to the high homology in key domains that interact with other factors [ ] . evidence obtained by specific depletion of each eif g form or differential cleavage of each of them by specific proteases (see below) points to the idea that both factors should be lacking for complete abolition of protein synthesis [ , ] . eif gi is the dominant form in hela cells, in which the ratio between eifgi and eif gii is : [ ] . however, the specific role of each form of eif g in the initiation of translation remains unknown. it was proposed that both eif g forms are differentially regulated by different kinases, supporting the hypothesis that eif gi and eif gii could drive differentially translation initiation [ ] . eif gi is phosphorylated in response to serum and in a rapamycin-dependent manner at ser , , and , although the role of these posttranslational modifications is still under investigation [ ] . in addition, eif gi is phosphorylated by p -activated protein kinase (pak- ) that is induced under stress conditions. this phosphorylation takes place in the eif e-binding site of eif g and avoids the interaction between these two factors, inhibiting cap-dependent initiation of translation [ ] . on the other hand, eif gii is phosphorylated during mitosis [ ] by calmodulin-dependent kinase i at ser [ ] . therefore, activity of eif gi and eif gii might be tightly and reversibly regulated by phosphorylation under different physiological conditions. nevertheless, this factor is also subjected to irreversible modifications such as caspasemediated proteolysis, which is triggered during apoptosis and leads to shutoff of protein synthesis. caspase- cleaves directly eif gi in positions and removing pabp, mnk , and one eif a-binding domain from the eif gi core ( figure ) [ , ] . in contrast, eif gii is degraded during apoptosis with a delayed kinetics in relation to eif gi proteolysis, correlating with the shutoff of the protein synthesis [ ] . furthermore, many eif gi isoforms have been detected in hela cells and these are synthesized from several distinct mrnas via alternative promoter usage and alternative splicing [ ] . the largest is the eif gi-a isoform, which contains , residues, while the eif gi-b, -c, -d, ande are shorter variants [ ] . it has been described that the longer isoforms are more active in translation initiation, most probably because they contain the pabp-binding site [ ] . pv results in a rapid shutoff of host-cell protein synthesis, whereas viral mrna translation takes place efficiently [ ] . it was initially observed that the inhibition of host-cell translation in pv-infected cells correlated with the proteolysis of a component of the eif f complex with a molecular mass of about kda (later identified as eif g) [ , ] . this cleavage is exerted by a pro and can be prevented by both insertion of mutations that abolish the protease activity and addition of a inhibitors [ , , ] . interestingly, this proteolysis is more effective when eif e is interacting with eif g, suggesting that pv a pro preferentially acts on the eif g pool involved in translation [ ] . cleavage of eif gi also occurs in cells infected with other picornaviruses such as hrv, coxsackievirus, and fmdv [ , , ] . interestingly, a pro from pv, hrv, and coxsackieviruses cleave eif gi at positions / (figure ), suggesting the conservation of the specificity of enterovirus a proteases for the substrate determinants present in eif gi [ , ] . cleavage at position / separates eif e-and eif binding sites of eif gi, contained in n-terminal and cterminal fragments respectively, thus decoupling mrna and ribosome recruiting activities. the pv a pro cleaves eif gi directly and does not require any additional proteins for this process to occur [ , ] . however, the fact that pv a pro is not copurified with eif gi fragments from pv-infected cell extracts suggest that a pro induced the activation of a host protease, which in turn cleaves eif g during pv infection [ ] . in addition, it has been proposed that eif and an unknown host-cell protein could act as cofactors for eif gi cleavage by pv a pro [ ] . in this sense, zamora and colleagues suggested that pv infection activates at least two host-cell proteases, which together with pv a pro , cleave eif gi [ ] . nevertheless, no additional evidence has been put forward to support this hypothesis and the identity of these host proteases has not yet been determined. many reports have demonstrated that the kinetics of protein synthesis shutoff and eif gi cleavage are not correlated in pv-infected cells [ ] [ ] [ ] . these data clearly indicate that additional translation factors may be cleaved to achieve an efficient inhibition of cellular mrna translation. in this regard, additional reports showed that eif gii is also proteolyzed by a pro during pv and hrv infections and that this cleavage is exerted between amino acids / leading to a proteolytic pattern similar to eif gi. interestingly, eif gii is significantly more resistant to a pro -mediated cleavage than eif gi and the kinetics of protein synthesis shutoff close correlates with eif gii cleavage in pv-and rhv-infected cells [ , ] . in those studies, gradi and colleagues proposed that hydrolysis of both eif gi and eif gii is required for achieving pv-and hrv-mediated inhibition of host-cell mrna translation and that the cleavage of eif gii is the rate-limiting step in the shutoff of host-cell translation after infection with those viruses [ , ] . a variety of approaches have been devised to express pv a pro in order to cleave eif g in culture cells or in cellfree systems. of these approaches, the most straightforward system has been the addition of the purified pv a pro , usually as a hybrid protein such as mbp- a pro , to cellfree systems such as rabbit reticulocyte lysates (rrls), hela and krebs- extracts [ , , [ ] [ ] [ ] [ ] . in this sense, the addition of about to μg mbp- a pro suffices to hydrolyze eif g in those cell-free systems [ , [ ] [ ] [ ] . an alternative method to cleave eif g in an in vitro system is the translation of an mrna encoding pv a pro in translation competent extracts [ ] . this assay has the advantage of providing genuine and freshly made pv a pro , leading to total cleavage of eif g in the test tube after several minutes of translation. many different approaches have been explored in culture cells, the most popular being transfection of plasmids encoding pv a pro in different eukaryotic cell types [ , , [ ] [ ] [ ] [ ] [ ] [ ] [ ] . several plasmids have been utilized in this respect, and perhaps the most successful one is ptm - a, which is transfected in mammalian cells that transiently express t rna polymerase by infection with a recombinant vaccinia t virus [ , , , , ] . the amount of protease synthesized in this system is similar to that found in pv-infected cells at late times of infection, but these amounts are reached - hours after transfection. similar results have been obtained in cells constitutively expressing t polymerase, which comprises a less pleiotropic system, because vaccinia virus proteins are not expressed (unpublished data). since pv a pro targets a number of different cellular proteins, which affect several cellular functions depending on the amount of protease synthesized (see below), in some instances, it is useful to express a pro at low levels. we have explored many alternative methods trying to get a system that allows us to control the levels of pv a pro into the cells. novoa and colleagues developed a protocol based on the addition of hybrid proteins bearing pv a pro . these recombinant proteins enter into the cytoplasm on cell membrane permeabilization by different methods such as addition of mbp- a pro mixed with replicationally inactive chicken adenovirus particles [ ] . cleavage of eif g following these protocols takes place after incubation for - hours, suggesting that the amount of protease internalized is probably low but sufficient to hydrolyze eif gi in virtually all culture cells [ ] . probably one of the most attractive systems is a stable cell line that inducibly express pv a pro , obtained in two different laboratories including ours [ , ] . in these cell lines, pv a pro is synthesized when tretracycline is removed from the culture medium, leading to low expression of pv a pro that induces efficient cleavage of eif g after h post induction correlating with a potent inhibition of cellular translation. finally, long term expression of pv a pro in tet off cell lines triggers apoptosis [ , , ] . the main drawback of this cell line is the low pv a pro escape under repression conditions that gives rise to a basal cytotoxicity. probably, the most efficient method is electroporation of an mrna encoding a pro under the control of emcv leader sequence (ires- a) [ ] . the biggest advantage of this method is the capacity to regulate levels of a pro expression by controlling the amounts of ires- a transfected. for example, electroporation of μg of ires- a into ∼ . · hela cells leads to total cleavage of both eif gi and eif gii in only h, resulting in an almost complete shutoff of cellular protein synthesis. in contrast, electroporation of low amounts of ires- a ( μg) into hela cells induces efficient cleavage of eif gi, whereas eif gii remains largely intact. therefore, -fold more ires- a mrna is required to cleave eif gii compared to eif gi [ ] . based on the ires- a mrna electroporation method we were able to induce the differential proteolysis of eif gi and eif gii in a timeand dose-dependent manner. kinetics of protein synthesis shutoff and eif gii cleavage is closely correlated in hela cells, resembling what was found in pv-infected cells [ ] . in agreement with what was observed with the addition of exogenous recombinant proteins [ ] , translation of de novo synthesized mrnas showed higher susceptibility to low doses of pv a pro than mrnas already engaged in translational machinery [ ] . these results suggested a possible specific role of eif gi in the pioneer round of translation in agreement with a previous report [ ] . however, specific ablation of eif gi using sirnas induced a moderate inhibition of luciferase synthesis from de novo synthesized and preexisting mrna (about % in both cases) [ ] . these findings reported by welnowska and colleagues indicated that the higher susceptibility of de novo synthesized mrna translation to low doses of ires- a might be produced by an additional effect of pv a pro on another gene expression step. in this regard, further studies demonstrate that the stronger impact of a pro on de novo synthesized mrnas is due to the concomitant inhibition of rna nuclear export by nucleoporin cleavage, which is also achieved under these conditions (see below) [ ] . interestingly, cellular mrnas are able to initiate translation after a polysome runoff with high salt treatment when eif gi is totally cleaved by pv a pro , whereas it is completely abolished when both forms of eif g are proteolyzed [ ] . taken together these set of data from cell expressing a pro [ , ] as well as from pv-infected cells [ ] [ ] [ ] we can conclude that complete shutoff of the protein synthesis induced by pv a pro is achieved when both eif gi and eif gii are completely cleaved. therefore, when the levels of one of the two populations of eif g remain unaffected either because it is not cleaved by pv a pro [ ] or it is not depleted by sirnas [ ] , extensive host protein synthesis takes place. the infection of pv and coxsackievirus also leads to hydrolysis of pabp [ , ] . this cleavage is carried out by pv c pro and coxsackievirus a pro and c pro and it might actively contribute to the host translational shutoff induced by these viruses [ , ] . in conclusion, the proteolysis of different components of the translation initiation machinery by picornavirus proteases can account for the shutoff of host translation induced after infection although the specific contribution of hydrolysis of eif gi, eif gii, and pabp remains still unclear. infection of animal cells with fmdv also leads to proteolysis of eif g and to rapid inhibition of cellular translation [ ] . the proteolysis of eif g is carried out by the two virally encoded proteases l pro and c pro [ , ] . l pro cleaves both eif gi and eif gii extremely rapidly at positions ( figure ) and , respectively, located seven and one amino acids upstream of the a pro cleavage sites on eif gi and eif gii [ , ] . the cleavage of eif g by fmdv l pro results in the rapid shutoff of host-cell protein synthesis [ ] . although the initial cleavage of eif gi can be carried out by fmdv l pro in the absence of virus replication [ ] , a sequential cleavage of the c-terminal fragment of eif gi by fmdv c pro also occurs in bhk cells at early stages of infection concomitant with the shutdown of viral translation. the c pro cleavage site on eif gi has been located at position , amino acids downstream of the l pro cleavage site [ ] although the role of this sequential cleavage is still unclear. the amino acid segment of eif g located between the l pro and c pro cleavage sites binds rna and was suggested to be critical for mrna scanning by the preinitiation complex [ ] . interestingly, this secondary cleavage does not occur in human cell lines due to an amino acid substitution at the cleavage site on eif gi [ ] . infection of cells with other picornaviruses such as cardioviruses (emcv and mengovirus) leads to a shutoff of host-cell protein synthesis. however, eif g remains unaffected in these cells. these findings indicate that apart from eif g cleavage, there are other mechanisms that may block host translation by picornaviruses. in addition to picornavirus l pro and a pro , proteases from other viruses can also cleave eif g. the protease of human immunodeficiency virus type- (hiv- ) hydrolyzes eif gi during infection of human cd + cells [ ] . the cleavage of eif gi takes place at positions , , and journal of biomedicine and biotechnology , separating it in three domains ( figure ) [ , ] . interestingly, hiv- protease efficiently cleaves eif gi, but not eif gii, both in cell-free systems and in mammalian cells [ ] . the differential sensitivity of eif gi and eif gii to hiv- protease is more selective than that observed with picornaviral a proteases [ , ] . hiv- protease also cleaves pabp at positions and separating the two first rna-recognition motifs from the c-terminal domain of pabp [ ] . cleavage of eif gi and pabp by hiv- protease is sufficient to inhibit the translation of capped and polyadenylated mrnas in cell-free systems, as well as in transfected cells [ , ] . in contrast, ires-driven translation is unaffected or even enhanced by hiv- pr after cleavage of both eif gi and pabp [ , ] . moreover, the translation of capped and polyadenylated hiv- genomic mrna remains unaffected in hela extracts under these conditions suggesting that viral protein synthesis might persist at late phases of hiv- infection where those factors are cleaved [ ] . in contrast, a previous report claimed that the hydrolysis of eif gi impaired the translation of both capped and ires-driven mrnas in reticulocyte lysate assays [ ] . however, the different effect observed by these authors on ires-driven translation can be due to differences already reported between hela extract and rrl [ , ] . in addition, eif g cleavage is executed by proteases from other retrovirus species, such as hiv- , simian immunodeficiency virus (siv), human t-cell leukemia virus (htlv- ), moloney murine leukemia virus (momlv), and mouse mammary tumor virus (mmtv). these proteases hydrolyze eif gi and eif gii with different cleavage patterns and kinetics [ ] . and indeed, several retroviruses, including hiv, siv, and momlv, promote the translation of their gag gene products by internal ribosome entry, indicating that eif g cleavage could be compatible with viral protein synthesis in infected cells [ ] [ ] [ ] . furthermore, cleavage of eif gi and eif gii also occurs in feline calicivirusinfected cells although the cleavages occur at different sites to those observed for picornavirus proteases [ ] . in addition, the c-like protease of two caliciviruses, like pv c pro , cleaves pabp perhaps as a complementary strategy to inhibit cellular translation [ ] . the fact that proteases from many picornaviruses, retroviruses and caliciviruses, target eif g, and, in some cases, pabp, strongly suggest that those viruses may share a common mechanism to regulate cellular and viral translation. however, further investigation is required to determine the specific contribution of eif gi, eif gii and pabp to the shutoff of host-cell translation and virus protein synthesis. the biological cycle of picornaviruses is confined to the cytoplasm of infected cells. however, some of the pv proteins are able to target nuclear proteins such as transcription and splicing factors and proteins involved in nuclear-cytoplasmic trafficking. one of the best studied cases in this regard is the cleavage of nucleoporins (nups), components of the nuclear pore complex (npc), by pv and hrv a pro , which directly impacts on nuclear-cytoplasmic trafficking of proteins and rnas [ , , ] . complementarily, emcv l protein affects on the phosphorylation status of nup and induces similar effects to those described for pv a pro in the transport of macromolecules through npc [ ] . nevertheless, nups are not only targets for picornavirus proteins but also for matrix (m) protein from vesicular stomatitis virus (vsv) [ ] and nonstructural protein (ns ) from influenza virus [ ] , which also impair components of the nuclear export-import host machinery following analogous mechanisms. taking into account that proteins from different positive and negative strand rna viruses target nups, we highlight in this paper npc as a key target for viral proteins, although the possible role of npc in virus biological cycle is still under intensive research. nucleus and cytoplasm are physically separated by a semipermeable barrier known as the nuclear envelope. due to this compartmentalization, a large number of macromolecules traverse the nuclear envelope to reach their biological destination. for example, proteins are synthesized by ribosomes in the cytoplasm, but some of them such as polymerases, transcription factors, nucleosome components and splicing factors, have to traverse the npc to reach the nucleoplasm. conversely, all rna species are transcribed and processed in the nucleus and later, most of mature rnas are exported through the npc to the cytoplasm, where their biological roles take place. therefore, the regulation of rna and protein trafficking between nucleus and cytoplasm directly impacts on gene expression [ , ] . npc forms large structures (∼ mda) embedded in the nuclear envelope with a polarized eightfold symmetrical core. it is sandwiched by a cytoplasmic and nuclear ring, which projects eight filaments of about nm into the cytoplasm and a basket-like structure of about nm into the nucleoplasm [ , ] . the npc is composed of multiple copies ( , or ) of ∼ different proteins, called nups, that are grouped in three major classes: (i) the phenylalanineglycine (fg)-containing nucleoporins that actively work in the nuclear-cytoplasmic trafficking of macromolecules; (ii) the structural components, which lack fg-rich domains and (iii) the membrane integral proteins, which anchor the npc to the nuclear envelope [ , ] . whereas the two last groups of nucleoporins play a role in the architecture and localization of the npc, the fg-nucleoporins directly regulate the transport of rnas and proteins through the npc, and they are the main nucleoporin class targeted by viruses (figure ) . movement of ions, metabolites and other small molecules between nucleus and cytoplasm takes place by passive diffusion; however, transport cargos larger than kda require the participation of specific receptors and carriers [ , ] . fg nucleoporins are placed on both cytoplasmic and nucleoplasmic sides of the npc and play a central role in the active transport of macromolecules. the fg domains of nucleoporins are unfolded regions that participate in energy-independent transient interactions with the cargo receptors during the docking and translocation processes [ ] . nevertheless, the delivery of the cargo and some of the directional steps require hydrolysis of gtp. trafficking of proteins through npc is mediated by a family of conserved transport receptors named karyopherins, which recognize short peptides known as nuclear localization signal (nls) and nuclear export signal (nes) [ , ] . in addition, karyopherins also recognize nucleotide sequences during the export of some classes of rnas [ ] . due to their key role, karyopherins involved in cargo import are known as importins and those involved in export are known as exportins. the rangtp cycle also plays a central role in karyopherin activity and, therefore, in trafficking of macromolecules through the npc. importins bind the cargo in the cytoplasm and release it on binding to rangtp in the nucleus. in contrast, exportins bind the cargo in the nucleus together with rangtp; and then the ran-associated gtp is hydrolyzed in the cytoplasm by rangap and cargo is liberated [ , ] . proteins containing constitutively active nls are predominantly nuclear; but in some cases, the accessibility of nls or the nls itself is modified to selectively regulate the localization of the protein [ ] . a similar regulatory mechanism is also exerted for control of nes activity [ ] . for example, the nls of the transcription factor nfκb is masked by the interaction with the inhibitor iκbα. however, iκbα is degraded on proinflammatory stimuli, exposing the nls of nfκb for importin recognition. therefore, under proinflammatory conditions, nfκb is imported to the nucleus, where it triggers a specific gene response [ ] . in addition to "masking strategies," phosphorylation, ubiquitination or methylation of nls or nes also influences (negatively or positively) their recognition by importins or exportins. therefore, trafficking of proteins between nucleus and cytoplasm could be finely regulated by posttranslational modifications in the nls and nes [ , ] . conceptually, export of most of nuclear rna follows a similar mechanism to that described above for protein trafficking. this process also involves cargo receptors, export factors, and nucleoporins to deliver mature rna to the cytoplasm, but in this case, structure and function of nuclear export signals are not well understood. aminoacylated trnas are necessary in the cytoplasm for protein synthesis, but trnas are transcribed in the nucleus. trna export to the cytoplasm is mediated by exportin-t, which belongs to the karyopherin family. exportin-t forms a complex together with rangtp in the nucleus, and once the exportin-t-rangtp-trna complex reaches the cytoplasm, rangap induces the hydrolysis of the ran-associated gtp and the release of the trna [ , ] . it has been proposed that exportin- could act as an auxiliary protein in this process [ ] . however, later reports in yeast have opened the possibility of alternative pathways for trna nuclear export [ ] . traffic of snrna follows a complex mechanism involving adaptor proteins. snrnas are transcribed by the rna polymerase (pol) ii (with the exception of u snrna which is produced by poliii) and then they are capped but not polyadenylated. cap-binding proteins (cbp) and interact with the cap of snrnas in the nucleus and recruit an export adaptor known as phax [ ] . this adaptor is phosphorylated and in this state, it is able to interact with crm , another member of karyopherin family. this interaction together with the joining of rangtp is essential for snrna trafficking [ ] . hydrolysis of rangtp and dephosphorylation of phax lead to the release of the snrna in the cytoplasm. ribosomal proteins are assembled together with the different rrnas in the nucleolus following a complex process of maturation to give rise to the ribosomal subunits s and s. although is known that preribosomal subunits are exported by separate routes that involve crm and rangtp, nowadays the exact mechanism followed by s preribosomal subunits to leave the nucleus remains unclear [ ] . however, s preribosomal subunit relies on nmd adaptor, which mediates the interaction with crm [ , ] . the release of the s preribosomal subunit requires two gtpase steps: (i) the hydrolysis of the ran-associated gtp induces the liberation of crm ; and (ii) the hydrolysis of gtp mediated by the cytoplasmic gtpase lsg , which induces the release of nmd [ ] . finally, mrnas compose the most heterogeneous group of rnas, varying in length and structure. thus, different export factors and adaptor proteins associate with each subpopulation of mrnas [ ] . mrnas are transcribed by polii, and, concomitantly with this process, a number of rna-binding proteins assemble with them. these rnabinding proteins exert different modifications in immature mrnas such as polyadenylation, splicing and capping. in addition, export factors and adaptor proteins are also recruited to nascent pre-mrnas, playing a further function in nuclear export. trex (transcription-coupled export) complex is recruited to the end of nascent pre-mrnas in a splicing-dependent manner by means of the interaction of one of its components, namely aly/ref, with the subunit of the nuclear cap-binding complex cbp [ ] . the trex complex also recruits the conserved rna-helicase uap that is important for mrnp biogenesis [ ] . tap-p , (also known as nxf -nxt ) directly binds the mrna immediately after splicing and actively participates in mrna export. aly/ref, tap-p and uap associate with exon junction complexes (ejc), which are deposited in a splicingdependent manner at - nt of every exon-exon junction [ , ] . this interaction network makes spliced mrnas more susceptible to export and couple splicing and mrnatrafficking. in fact, unspliced mrnas are exported by an alternative and less efficient pathway that involves crm and rangtp. this alternative route is also followed by mrnas encoding some protoncogenes and cytokines [ , ] . it is important to mention that although mrna can follow different nuclear export pathways, in all cases the interaction of export receptors with nucleoporins plays an essential role in the transport of the mrnps throughout the npc [ ] . protein and rna trafficking by pv a pro . pv infection strongly impacts on hostcell protein localization, giving rise to an unusual cytoplasmic distribution of nuclear proteins [ ] . this particular effect has been characterized by different laboratories for a number of nuclear factors involved in several cellular processes and containing different types of nlss [ ] . nevertheless, not all nuclear proteins are re-localized after pv infection, evidencing the presence of a viral-specific mechanism affecting protein subcellular distribution [ ] . gustin and colleagues proposed the inhibition of nuclear protein import machinery as the cause of the cytoplasmic accumulation of nuclear proteins in pv-infected cells (figure ). they demonstrated that pv impairs protein trafficking across the npc by expressing gfp proteins encoding classical or transporting nlss in mock and infected cultured cells [ , , ] . these recombinant proteins accumulate in the cytoplasm after pv infection, being almost completely depleted from the nucleus. nevertheless, pv does not affect gfp distribution when the nls is mutated or deleted, since the small size of gfp allows its inefficient efflux throughout the npc [ ] . interestingly, cell-free nuclear import assays demonstrated that nls-containing gfp is unable to traverse the npc when cells are previously infected with pv [ ] . in agreement with these findings, shuttling endogenous proteins, such as heterogeneous nuclear ribonucleoprotein (hnrnp) a and hnrnp k, are detected in the cytoplasm of infected cells from hpi but are undetectable in the nucleus at . hpi. however, cytoplasmic accumulation of nuclear resident proteins such as hnrnp c requires longer times of infection [ ] . these findings support the idea that the distribution of a protein that shuttles between nucleus and cytoplasm may be strongly altered by the disruption of protein trafficking pathways, as compared to nuclear resident proteins. however, not all the nuclear factors are redistributed to the cytoplasm of pv-infected cells. this is the case for sc (a serine/arginine-rich splicing factor), fibrillarin or tata-binding protein (tbp), which remain in the nucleus for the duration of infection. the different behaviour of these groups of proteins could arise as a consequence of different turnover; thus, the distribution of highly stable nuclear proteins might be less affected by the inhibition of protein nuclear import than those proteins with low stability. alternatively, pv might not impair all protein import pathways, and some might remain operative, thereby allowing the nuclear import of several families of nuclear proteins. interestingly, some nuclear factors such as la antigen [ ] , ptb [ ] , sam [ ] , or nucleolin [ ] have been shown to interact with pv rna or viral proteins. these proteins accumulate in the cytoplasm of pv-infected cells and, consequently, their availability for viral replication is increased [ , [ ] [ ] [ ] . cytoplasmic accumulation of nuclear factors might be produced mostly by the blockade of nuclear-cytoplasmic trafficking. however, loss of the nls after the cleavage of ptb and la proteins by pv c pro may also contribute to the subcellular relocalization of those proteins in pv-infected cells [ , ] . redistribution of nuclear proteins as well as impairment of cellular import machinery was also observed in cells infected with other picornaviruses such as hrv [ ] or emcv [ ] . these findings indicate that most picornaviruses might share similar strategies to impair nuclear-cytoplasmic trafficking machinery. an alternative hypothesis has been proposed as the cause of the redistribution of nuclear factors. belov and colleagues reported that the nuclear envelope is permeabilized on pv infection, allowing nuclear proteins to diffuse across the nuclear membrane [ ] . indeed, electron microscopy revealed that pv a pro induces severe structural damage in npc [ ] . however, this hypothesis did not clarify why other proteins resident in the nucleus do not diffuse to the cytoplasm after pv infection. both protein nuclear import and permeabilization of nuclear envelope could be integrated together as sequential steps in pv biological cycle. protein import blockade is detected early after infection [ ] , correlating with the first modifications in the npc (see below). however, prolonged expression of viral proteins might induce nuclear membrane leakiness, reflected by stronger alterations in npc architecture [ ] . nevertheless, what might the biological relevance of these events be for a cytoplasmic virus? the most evident answer, which was extensively commented above, is that inhibition of nuclear protein import and further nuclear envelope leakiness might increase the presence of nuclear proteins in the cytoplasm of infected cells, which has been proposed in some cases to play a relevant role in pv replication. in addition, many transcription factors are arrested in the cytoplasm of uninfected cells (e.g., nf-κb, irf , and irf ), but they are immediately activated and imported to the nucleus after proinflammatory extracellular signals or on the activation of intracellular sensors as a consequence of the viral replication. once in the nucleus, these factors trigger the transcription of a set of genes involved in the antiviral response [ ] . inhibiting the import of these transcription factors, pv might prevent or, at least attenuate, the establishment of a hostile intracellular environment. further effort will be made in the future to explore this attractive hypothesis. interestingly, nup , nup and nup , components of the npc belonging to fg nup family, were found to be degraded in pv-as well as hrv-infected cells (figure ) [ , , ] . these proteins are essential factors of the nuclear-cytoplasmic trafficking machinery since their nterminal fg-rich domains serve as docking sites for soluble transport factors [ , ] . nup , nup and nup are proteolyzed in pv-infected cells following different kinetics. thus, nup is the cleaved early after infection (from hpi), whereas nup and nup are targeted at late times after infection (from hpi) [ , ] . in agreement with these findings, cleavage of nup is induced even in presence of inhibitors of pv replication, suggesting that small amounts of viral proteins are sufficient for this proteolysis to occur. however, cleavage of nup , and nup are efficiently prevented on arrest of viral replication, probably because they are only efficiently achieved when large amounts of viral proteins are produced [ ] . therefore, nup , nup , and nup exhibit different susceptibilities to pv replication, thus pv might have a gradual impact on the nuclear-cytoplasmic trafficking machinery. nup is also proteolyzed by caspase- and caspase- during apoptosis induction; however, the involvement of these cellular proteases in pv-induced nup cleavage has been ruled out by different laboratories. first, nup cleavage journal of biomedicine and biotechnology products generated upon caspase activation differ to those found in pv-infected cells [ ] . in addition, nup is cleaved in pv-infected cells, but it remains intact despite caspase activation [ ] . most importantly, pv-induced npc structural damage takes place in cells lacking caspase and [ ] , and nup and nup are efficiently cleaved in pv-infected cells even in presence of the caspase inhibitor z-vad [ ] . all together, these data support the idea that one or more viral proteins play a direct role in the cleavage of those nups. in agreement with this hypothesis, pv-induced npc damage is prevented by pv a pro inhibitors such as elastatinal, elastase, and mpcmk, suggesting an involvement of this viral protease in the alteration of npc [ ] . indeed, individual expression of pv a pro in hela cells as well as addition of this protease to cell-free systems gives rise to nup , nup , and nup cleavage [ , , ] . in agreement with the data obtained from pv-infected cells, on pv a pro expression in hela cells, nup is cleaved faster than nup and nup , which suggests the presence of optimal cleavage sites in this protein [ ] . proteolysis of nup in pv-infected cells as well as in cell-free systems generates two different cleavage products of around - kda and kda [ ] . there are two optimal cleavage sites in nup for pv a pro located between aminoacids - and - , containing gly at p , thr at p , and leu at p . hydrolysis at both sites results in n-and cterminal products with predicted molecular masses of and or and kda, in good agreement with the size of the peptides detected experimentally in pv-infected cells and a-treated hela extracts [ , ] . an explanation for the delayed kinetics of nup and nup with respect to nup is that optimal pv a pro cleavage sites were not found in these nups (unpublished data). recently, park and colleagues have reported that pv a pro directly cleaves nup at six different positions rendering multiple proteolytic products. these cleavage sites are located between aminoacids and , thus releasing the fg-rich region from the protein core [ ] . functionally, loss of the fg-rich region might make nup inactive for interaction with cargo receptors. this hypothetical mechanism of nup functional decoupling could be extrapolated to nup and nup (figure ) . however, it remains unknown whether pv a pro is able to directly cleave nup and nup or where cleavage might occur. nup , nup and nup are also involved in rna export from the nucleus and therefore, cleavage by pv a pro might also impact on this process (figure ) . however, oligo d(t) hybridization studies showed that pv infection does not affect distribution of the polyadenylated mrna bulk after hpi [ ] . a more detailed analysis revealed that expression of pv a pro in hela cells induces a number of disorders in rna location. nuclear export of cellular mrnas is inhibited in a pro -expressing cells in a dose dependent manner concomitantly with nup , nup and nup cleavage [ ] . interestingly, mrna export of constitutively expressed mrnas such as β-actin is less affected than that of newly synthesized mrnas. for example, tetracycline-induced luciferase mrna was almost totally retained in the nucleus when these nups are cleaved by pv a pro . this effect was also observed for endogenous mrnas such as il- , c-myc or p mrnas which are induced on pv a pro expression. therefore, pv a pro could counteract the induction of proapoptotic (c-myc and p ) and proinflammatory (il- ) responses by accumulating the c-myc, p and il- mrnas in the nucleus [ ] . this export blockage may prevent the establishment of a hostcell response against pv infection. these findings could explain why pv a pro is essential for replication of pv in cells pre-treated with ifn-α [ ] . furthermore, impairment of mrna export strongly alters the localization of mrnas with high turnover as compared to constitutively expressed and highly stable mrnas such as β-actin [ ] . as observed in pv-infected cells, oligo d(t) hybridization revealed that pv a pro expression hardly affects the distribution of the polyadenylated mrna pool after short times of expression ( h). however, nuclear accumulation of polyadenylated mrna bulk is detected when cells are exposed to pv a pro for longer times ( and h) . in this regard, the progression of the alterations of polyadenylated mrna localization in a pro -expressing cells takes place as follows: (i) disruption of nuclear mrna-containing foci (ii) appearance of mrnacontaining granules in the cytoplasm (most probably stress granules) and (iii) depletion of cytoplasmic mrnas. these events were more clearly observed when high amounts of pv a pro are synthesized, reflecting that nuclear accumulation of mrnas is a time-and dose-dependent process [ ] . nevertheless, pv a pro is not only able to block mrna export, but also rrna and snrna transport. both s rrna and u snrna accumulate in the nucleus of a proexpressing cells in a dose-dependent manner. most probably, cleavage of nup , nup and nup is involved in these effects, since rrna, snrna, and mrna are exported using different cargo receptors and auxiliary proteins (see above) but all of them relay in nup activity to traverse npc [ ] . importantly, trnas (val-trna) are exported normally despite pv a pro expression, indicating that some rna nuclear export pathways are not affected by this viral protease. in fact, nup , nup , and nup are not directly involved in trna export [ ] , reinforcing the idea that nucleoporin cleavage plays a central role in the impairment of protein and rna trafficking by pv (figure ) . notably, ifn-γ induced a specific increase of nup levels in hela cells that counteracts the inhibition of mrna export by pv a pro [ , ] . collectively, these findings reflect the central role of nup in pv infection and in antiviral response, since its overexpression by itself prevents, at least in part, blockade of nuclear rna export. therefore, secretion of ifn-γ by immune cells might allow the induction of antiviral response by neighbouring cells by increasing the levels of nup in order to protect nuclearcytoplasmic protein and rna trafficking pathways. the physiological relevance of the crosstalk between nup -pv a pro in pv (and other viruses) infection might be studied in the future with cellular and animal systems. that target the nuclear pore. as mentioned above, hrv also induces cleavage of nup , nup and most probably nup , leading to the impairment of nuclear protein import [ ] . nevertheless, pv and hrv a pro exhibit high homology and both proteases are therefore expected to share common targets. in contrast, the a gene of cardioviruses encodes a short peptide with autoproteolytic activity but lacks trans-protease activity. however, emcv and mengovirus are able to damage npc [ ] . as occurs in pv-infected cells, these cardioviruses induce both protein nuclear import inhibition and late membrane leakiness but they do not induce the cleavage of nups [ , , ] . cardioviruses encode an additional protein known as l protein, which is highly cytopathic although it lacks protease activity (in contrast to aphthovirus l pro ). l protein contains a zinc finger domain, and an acidic region, which is proposed to be phosphorylated in infected cells [ ] . individual expression of this protein in cultured cells or in cell-free systems induces several cellular disorders including the inhibition of protein nuclear import that resembles that observed in emcv-infected cells [ , ] . several studies reported emcv l protein mutations that resulted in defective virus growth phenotypes in cell culture [ , ] . in particular, mutations in the zinc finger domain (cys ala and cys ala) or in the acidic region (thr ala) partially avoid blockade of protein trafficking between the nucleus and the cytoplasm [ ] . taken together, these data support the involvement of cardiovirus l protein in npc damaging and in the inhibition of protein nuclear import, inducing nups phosphorylation rather than their cleavage. indeed, nup is quickly and strongly phosphorylated after emcv infection ( hpi), and it was clearly detected by conventional western blotting. nevertheless, analysis with pro-q diamond phosphoprotein stain revealed that nup and nup are also phosphorylated to a certain extent upon emcv infection. notably, nups phosphorylation was avoided when cys ala mutation was inserted in the l protein sequence, suggesting that the zinc finger domain is essential for this posttranslational modification to occur [ ] . however, the exact role of nups phosphorylation in nuclear-cytoplasmic trafficking is still unknown. the idea that nups phosphorylation could regulate protein import and rna export as a switch that turns the different pathways on/off should be pursued in more detail. as mentioned above, ran is essential for the regulation of most of the nuclear export and import pathways, because it acts as a cofactor modulating the affinity of importins and exportins for the cargo. the rangtp cycle is described in detail in section . . it has been described that emcv l directly interacts with ran, and this interaction is abrogated by the insertion of c a mutation in l [ ] . however, the potential role of l/ran interaction in the modulation of rangtp cycle and its impact in nuclear import and export pathways have not yet been studied. alteration of nuclear-cytoplasmic trafficking is not only restricted to picornaviruses but has also been observed with negative strand viruses such as vesicular stomatitis virus (vsv) and influenza virus. her and collaborators reported that rna export and protein import are strongly inhibited by vsv matrix (m) protein, by microinjection of oocytes with radiolabeled rnas and proteins. radiolabeled trnas, mrnas, u snrnas, and rrnas were injected directly into the nucleus of xenopus laevis oocytes, and the subcellular localization of those rnas was monitored by autoradiography. the conclusion of this work is that mrna, u snrna, and rrna but not trna trafficking is blocked by vsv m protein [ ] , in agreement with our findings on a proexpressing hela cells [ ] . in addition, protein nuclear import was monitored by microinjection of radiolabeled proteins containing nls in the oocytes cytoplasm. this assay revealed that vsv m protein abrogates protein nuclear import to the same extent as treatment with specific inhibitors of this pathway such as wga [ ] . these interesting findings support the idea that pv a pro and vsv m protein could target similar host proteins to impair macromolecule trafficking between nucleus and cytoplasm. in agreement with this possibility, it was found that the vsv m protein interacts with nup [ ] , which is one of the primary targets of pv a pro [ ] . the n-terminal domain of vsv m is sufficient to block rna nuclear export and aa - may play an essential role in this blockade, because their mutations to ala completely abrogate this inhibitory effect. indeed, the n-terminal domain of m protein is involved in the interaction with nup and, in particular, aa - , because their mutation to ala blocks the binding of vsv m to nup [ ] . in addition, binding of m to nup requires active mrna export pathways since, treatment with inhibitors such as wga hampers this interaction. vsv m is also able to induce the accumulation of endogenous polyadenylated mrnas in the nucleus of hela cells, and this effect is prevented again by mutations in aa - of m [ ] . furthermore, vsv m interacts with rae , which plays an essential role in mrna nuclear export by its interaction with nup and mrnps. overexpression of either nup or rae- prevents the nuclear accumulation of polyadenylated mrnas, suggesting that both factors may play a role in the blockade of mrna nuclear export by vsv m protein [ ] . interestingly, ifn-γ specifically increases the level of both nup and rae- and indicates a potential antiviral effect of these proteins. indeed, overexpression of both proteins by ifn-γ treatment counteracts the inhibitory effects of vsv m protein on mrna nuclear export, highlighting the possibility of a crosstalk between m and ifn-γ that might control the fate of the viral replication in infected animals [ , ] . influenza virus replication also impacts on nuclearcytoplasmic trafficking and leads to the nuclear accumulation of host mrnas [ , ] . influenza virus ns protein is a major virulence factor that is essential for pathogenesis, because it impairs innate and adaptive immunity by inhibiting host signal transduction and gene expression [ , ] . ns forms a complex with nxf /tap, p /nxt [ ] , rae , and e b-ap , which are components of the mrna nuclear export machinery (see above). individual expression of ns in t cells induces the accumulation of polyadenylated mrnas in the nucleus, suggesting that the interaction of ns with these export factors yields an inactive complex for mrna export. influenza virus also induces a strong reduction of nup steady-state levels although the viral mechanisms involved in this process are still unknown [ ] . expression of reporter luciferase mrna synthesized from a nuclear plasmid is inhibited by ns . however, this inhibition is overcome by overexpression of nxf , p , rae- or nup , evidencing the role of ns interaction with these factors in the impairment of mrna nuclear export [ ] . furthermore, mouse cells expressing low levels of nup or/and rae- show greater susceptibility to influenza infection, resulting in a significant increase in cell death and virus production. in addition, mrnas encoding antiviral factors or immunomodulators such as irf- , mhc i and icam accumulated more in the nucleus of those cells than in cells expressing normal levels of nup or rae- [ ] . all these data support the physiological role of ns interaction with rna export factors as well as the reduction of nup levels in influenza pathogenicity. interestingly, vsv m, influenza ns and pv a pro expression gives rise to similar effects on mrna trafficking. all these viral proteins target nup and other components of the cellular machinery involved in nuclear-cytoplasmic trafficking. survival of motor neurons (smn) complex is composed by smn and a class of proteins called gemins, which localize in both cytoplasm and nucleoplasm [ , ] . gemin and gemin constitute the core of the complex where the other gemins associate by means of numerous proteinprotein interactions from the periphery. the smn complex is involved in the biogenesis of uridine-rich small nuclear ribonucleoprotein (u snrnp) in the cytoplasm and then the u snrnp carries out the splicing of pre-mrnas in the nucleus [ ] [ ] [ ] . the snrnps are composed of the major u snrnas u , u , u , u , and u as well as a group of seven proteins known as sm ribonucleoproteins that collectively make up the extremely stable sm core of the snrnp. gemins (except gemin- ) associate with sm proteins to form a heptameric ring structure in the presence of u snrnas [ ] . after sm core assembly, the u snrnps are imported to the nucleus, localizing in foci known as cajal bodies, where further maturation processes take place [ ] . gemin- , one of the main components of smn complex, is cleaved in pv-infected hela cells leading to a kda cleavage product. scission of gemin- negatively impacts on the kinetics of sm core assembly, which is prevented in presence of inhibitors of pv replication [ ] . these results indicated that high levels of pv proteins are required for this process to occur, as is the case of nup , nup , and eif gii, [ , ] . pv a pro is able to hydrolyze purified gemin- in vitro, rendering a cleavage product similar to that found in pv-infected hela cells [ ] . only one potential a procleavage site, between the amino acids tyr and gly (vhtyg), was found in this smn complex component. proteolysis of gemin- at this position would render two cleavage products of about - kda, in agreement with the polypeptide of about kda found in pv-infected and a pro -expressing cells. in addition, g e mutation avoids direct hydrolysis of gemin- exerted by pv a pro in vivo and in vitro [ ] . taken together, these findings support the notion that vhtyg is the cleavage site for pv a pro in gemin- . although hydrolysis of gemin- is exerted in cells transfected with plasmid encoding pv a pro , it does not take place when this protease is expressed from exogenous mrnas; contrary to that found with eif gi, eif gii, nup , nup and nup [ ] (and unpublished data). a probable explanation for this difference is that pv a pro is expressed at lower levels from transfected mrnas than from plasmids [ ] (castello et al., ) . thus, gemin- cleavage may be a very late event in pv-infected cells because it requires expression of high amounts of pv a pro . gemin- hydrolysis may directly impact on pre-mrna splicing since this event reduces the availability of smn complexes, which is involved in u snrnps biogenesis. nevertheless, alstead and colleagues could not detect any apparent effect of gemin- proteolysis in splicing of cellular pre-mrnas [ ] . therefore, the physiological relevance of gemin- cleavage in pv biological cycle remains unknown. in addition to eifs, nups and proteins from smn complex, pv a pro is able to cleave proteins involved in other cellular processes, such as transcription. tbp is cleaved by pv a pro between amino acids tyr and gly in vitro, although this cleavage only removes the first aa located at the n-terminus and does not inhibit transcription carried out by rna polymerase ii [ , ] . these findings are in agreement with the fact that host mrna transcription takes place in a pro -expressing cells when both translation and rna nuclear export are inhibited, upon cleavage of eif g and nups [ ] . one attractive hypothesis is that pv a pro could cleave specific initiation factors affecting specific rather than general mrna transcription in order to modulate host-cell response to viral infection. further studies in this direction can be carried out using microarray platforms to detect precise alterations in cellular transcriptome after pv-infection or a pro expression. these studies could be complemented by screening for new host factors cleaved by pv a pro using different in silico and experimental approaches. the study of viral proteases is crucial to understand the mechanism used by animal viruses to replicate their genomes and to translate viral mrnas at the molecular level. in addition, we wish to draw attention to the concept that viral proteases can be used as tools to reveal the exact functioning of their target cellular proteins. in this regard, pv a pro has been very useful for examining the requirements for eif g to translate different cellular or viral mrnas. in addition to this, in this paper, we have highlighted the role of pv a pro not only in the processing of the poliovirus polyprotein but also in the interaction with the host-cell. interestingly, a single viral protein is able to modulate many steps of gene expression in order to generate an optimal intracellular environment for the viral biological cycle. in particular, pv a pro cleaves cell proteins involved in transcription, pre-mrna splicing, nucleus/cytoplasm transport and translation, in order to hijack those host functions and to concentrate the cellular resources on the production of the viral progeny. for example, pv a pro inactivates host translational machinery for capped cellular mrnas by cleaving eif g, whereas viral protein synthesis takes place under those conditions by ires-driven translation [ ] . because of the decrease of cellular mrna translatability in pv-infected cells, host ribosomes are available for viral protein synthesis. probably, the inhibition of host protein synthesis may prevent the production of antiviral proteins. similarly, the blockade of nucleus/cytoplasm transport of macromolecules might isolate nuclear processes from cytoplasmic cellular ones, hampering the arrival of specific proinflammatory transcription factors to the nucleus and of mrnas encoding proinflammatory, antiviral or proapoptotic proteins to the cytoplasm of infected cells. finally, transcription and pre-mrna splicing could be also modulated by pv a pro and might reduce the availability of mature mrnas. nevertheless, very little is known about the potential role of pv a pro in transcription and pre-mrna splicing, and further studies of these steps of gene expression in pv-infected and a-expressing cells should be carried out. making use of the "omics" technologies it would be possible to identify changes in the host transcriptome in those cells, allowing us to understand the readjustment of host gene expression to viral infection and to the cytotoxic effect of pv a pro . gene ontology tools can be used to cluster the pathways (kegg), molecular activities and biological functions of the genes which are transcriptionally up-or downregulated or not affected after these unfavourable stimuli. in silico analysis could be carried out in order to identify whether these gene clusters belong to particular networks controlled by specific transcription factors. this approach will give us additional information about potential host targets for pv a pro . complementarily, deep sequence (rnaseq), specific microarrays types and conventional rt-pcr could be used to screen for nonspliced or abnormal spliced mrna variants in pv-infected and a pro -expressing cells. finally, microarrays can be employed to identify the cytoplasmic and nuclear transcriptome in those cells to determine whether mrna nuclear export inhibition induced by pv a pro impacts on the distribution of the entire host mrna bulk or in specific mrna pools. taken together, all this information may provide us with a general and deep vision of the modification induced by pv a pro on the different steps of mrna metabolism. additionally, the physiological role of nups and gemin- cleavage might be studied using different models. one possible and interesting approach is to engineer stable cell lines expressing noncleavable versions of nup , nup , nup and/or gemin- and then to analyze the fitness of pv in the different cell types, especially in presence of extracellular antiviral stimuli such as ifns or interleukins, which will activate different epigenetic programs. these studies will provide essential information to help us understand the specific role that those host factors play in pv infection. the total number of cellular proteins targeted by pv a pro (degradome) remains unknown, but it can be anticipated that with the expanding use of proteomic methodologies, this analysis will be known soon not only for pv a pro , but also for other viral proteases of interest. furthermore, the analysis of the pv a pro -induced degradome in human cells will be of general interest for many researchers, including virologists and cellular biologists. this goal could be achieved combining in silico prediction of a pro cleavage sites and experimental tools such as proteomics. in the first case, blom and collaborators developed a bioinformatics tool using neural network algorithms to predict cellular targets for picornavirus proteases [ ] . this approach has been successfully used to predict the cleavage of dystrophin by coxsackievirus a pro [ ] although most of the predicted human targets for rhinovirus and enterovirus a pro have not been proved yet. in addition, the algorithm did not predict the cleavage of cellular targets that have been later demonstrated to be proteolyzed by picornaviral a pro such as nup and cytokeratin [ , , ] . thus, it would be necessary to develop an improved algorithm able to find optimal cleavage sequences in the host proteome by implementing the proteolytic sites known for newly described a pro targets. many parameters have to be taken into account, including the protein localization (cytoplasmic and nuclear protein will be considered, but not proteins resident in the lumen of other organelles such as re or peroxisomes), the exposure of the cleavage site to the solvent (the sequence must be accessible to the protease), and the secondary structure in which the proteolytic site is included (optimally, unstructured regions). potential targets could be ordered by their degree of homology with optimal cleavage sequences, as well as with the degree in which they fulfill the above prerequisites. on the other hand, novel pv a pro targets can be identified by proteomic tools such as two-dimensional differential gel electrophoresis (dige) or quantitative proteomics such as stable isotope labeling by amino acids in cell culture (silac) coupled to monodimensional electrophoresis. following these two methods, it will be feasible to identify proteins with reduced levels on a pro expression and, in addition, to detect the cleavage products that will appear as lower size peptides. by the uncovering of novel a pro targets we will be able to map the cellular networks impacted by pv a pro and to integrate them in the context of pv infection. in fact, the role of a pro hijacking host processes could be potentially expanded to other cellular pathways with direct impact on control of viral infection. such knowledge will provide more insight into our understanding of the cytopathogenicity of viral proteases at the molecular level. viral proteases structure and function of picornavirus proteinases hiv- : fifteen proteins and an rna the structures of picornaviral proteinases organization and expression of calicivirus genes proteolytic processing of foamy virus gag and pol proteins new insights into flavivirus nonstructural protein poliovirus, pathogenesis of poliomyelitis, and apoptosis poliovirus and apoptosis induction of membrane proliferation by poliovirus proteins c and bc two types of death of poliovirus-infected cells: caspase involvement in the apoptosis but not cytopathic effect individual expression of poliovirus a and c induces activation of caspase- and parp cleavage in hela cells picornavirus rna translation: roles for cellular proteins translational control by viral proteinases effects of poliovirus infection on nucleo-cytoplasmic trafficking and nuclear pore complex composition differential targeting of nuclear pore complex proteins in poliovirusinfected cells rna nuclear export is blocked by poliovirus a protease and is concomitant with nucleoporin cleavage the pathogenesis of poliomyelitis: what we don't know poliovirus cell entry: common structural themes in viral cell entry pathways one hundred years of poliovirus pathogenesis genetics of poliovirus picornavirus genome: an overview processing determinants and functions of cleavage products of picornavirus polyproteins proteolytic processing of poliovirus polyprotein: elimination of a(pro)-mediated, alternative cleavage of polypeptide cd by in vitro mutagenesis poliovirus rna-dependent rna polymerase ( dpol): structure, function, and mechanism possible unifying mechanism of picornavirus genome replication protein-primed rna synthesis by purified poliovirus rna polymerase inhibitors of poliovirus uncoating efficiently block the early membrane permeabilization induced by virus particles viral translation is coupled to transcription in sindbis virus-infected cells phospholipid biosynthesis and poliovirus genome replication, two coupled phenomena effects of viral replication on cellular membrane metabolism and function viroporinmediated membrane permeabilization: pore formation by nonstructural poliovirus b protein viroporins primary structure, gene organization and polypeptide expression of poliovirus rna three-dimensional structure of poliovirus at . Å resolution internal initiation of translation of eukaryotic mrna directed by a sequence derived from poliovirus rna cell-free, de novo synthesis of poliovirus synthetic viruses: a new opportunity to understand and prevent viral disease inhibition of hela cell protein synthesis following poliovirus infection correlates with the proteolysis of a , -dalton polypeptide associated with eucaryotc initiation factor and a cap binding protein complex virus-encoded proteinases of the picornavirus super-group genome organization and encoded proteins structure of the foot-and-mouth disease virus leader protease: a papain-like fold adapted for self-processing and eif g recognition structural and biochemical features distinguish the foot-and-mouth disease virus leader proteinase from other papain-like enzymes the foot-and-mouth disease virus leader proteinase gene is not required for viral replication the leader proteinase of foot-and-mouth disease virus inhibits the induction of beta interferon mrna and blocks the host innate immune response a conserved domain in the leader proteinase of foot-and-mouth disease virus is required for proper subcellular localization and function differential ifnα/β production suppressing capacities of the leader proteins of mengovirus and foot-and-mouth disease virus leader protein of foot-and-mouth disease virus is required for cleavage of the p component of the cap-binding protein complex relationship of p cleavage during picornavirus infection to a proteinase sequencing extremely efficient cleavage of eif g by picornaviral proteinases l and a in vitro foot-and-mouth disease virus a oligopeptide mediated cleavage of an artificial polyprotein analysis of the aphthovirus a/ b polyprotein 'cleavage' mechanism indicates not a proteolytic reaction, but a novel translational effect: a putative ribosomal 'skip the cleavage activities of aphthovirus and cardiovirus a proteins purification and partial characterization of poliovirus protease a by means of a functional assay poliovirus proteinase c: large-scale expression, purification, and specific cleavage activity on natural and synthetic substrates in vitro multiple proteases in foot-and-mouth disease virus replication antiviral effects of a thiol protease inhibitor on foot-and-mouth disease virus putative papain-related thiol proteases of positive-strand rna viruses. identification of rubi-and aphthovirus proteases and delineation of a novel conserved domain associated with proteases of rubi, α-and coronaviruses identification of the active-site residues of the l proteinase of foot-and-mouth disease virus identification of critical amino acids within the foot-and-mouth disease virus leader protein, a cysteine protease structural similarity of poliovirus cysteine proteinase p - c and cellular serine proteinase of trypsin cysteine proteases of positive strand rna viruses and chymotrypsin-like serin proteases: a distinct protein superfamily with a common structural fold viral cysteine proteases are homologous to the trypsin-like family of serine proteases: structural and functional implications a second virus-encoded proteinase involved in proteolytic processing of poliovirus polyprotein substrate requirements of a human rhinoviral a proteinase determinants of substrate recognition by poliovirus a proteinase cleavage specificity on synthetic peptide substrates of human rhinovirus proteinase a genetic selection of poliovirus a(pro)-binding peptides identification of essential amino acid residues in the functional activity of poliovirus a protease mutational analyses support a model for the hrv a proteinase the structure of the a proteinase from a common cold virus: a proteinase responsible for the shut-off of host-cell protein synthesis characterization of the roles of conserved cysteine and histidine residues in poliovirus a protease the a proteinase of human rhinovirus is a zinc containing enzyme spectroscopic characterization of rhinoviral protease a: zn is essential for the structural integrity the yeast saccharomyces cerevisiae as a genetic system for obtaining variants of poliovirus protease a mutational analysis of poliovirus a(pro): distinct inhibitory functions of a(pro) on translation and transcription defective rna replication by poliovirus mutants deficient in a protease cleavage activity a protease is not a prerequisite for poliovirus replication proteinase a is essential for enterovirus replication in type i interferontreated cells viable polioviruses that encode a proteins with fluorescent protein tags regulation of translation initiation in eukaryotes: mechanisms and biological targets the mechanism of eukaryotic translation initiation and principles of its regulation eif initiation factors: effectors of mrna recruitment to ribosomes and regulators of translation molecular mechanisms of translational control eif g: a multipurpose ribosome adapter elf g: translation's mystery factor begins to yield its secrets human eukaryotic translation initiation factor g (eif g) possesses two separate and independent binding sites for eif a a newly identified n-terminal amino acid sequence of human eif g binds poly(a)-binding protein and functions in poly(a)-dependent translation circularization of mrna by eukaryotic translation initiation factors mnk , a new map kinaseactivated protein kinase, isolated by a novel expression screening method for identifying protein kinase substrates mitogen-activated protein kinases activate the serine/threonine kinases mnk and mnk human eukaryotic translation initiation factor g (eif g) recruits mnk to phosphorylate eif e phosphorylation and dephosphorylation events that regulate viral mrna translation regulation of capdependent translation by eif e inhibitory proteins differential cleavage of eif gi and eif gii in mammalian cells: effects on translation translation of mrnas from vesicular stomatitis virus and vaccinia virus is differentially blocked in cells with depletion of eif gi and/or eif gii eukaryotic translation initiation factor g is targeted for proteolytic cleavage by caspase during inhibition of translation in apoptotic cells selective modification of eukaryotic initiation factor f (eif f) at the onset of cell differentiation: recruitment of eif gii and long-lasting phosphorylation of eif e serum-stimulated, rapamycin-sensitive phosphorylation sites in the eukaryotic: translation initiation factor gi inhibition of capdependent translation via phosphorylation of eif g by journal of biomedicine and biotechnology protein kinase pak suppression of cap-dependent translation in mitosis phosphorylation screening identifies translational initiation factor gii as an intracellular target of ca + /calmodulin-dependent protein kinase i cleavage of polypeptide chain initiation factor eif gi during apoptosis in lymphoma cells: characterisation of an internal fragment generated by caspase- -mediated cleavage cleavage of eukaryotic translation initiation factor gii correlates with translation inhibition during apoptosis translation of eukaryotic translation initiation factor gi (eif gi) proceeds from multiple mrnas containing a novel capdependent internal ribosome entry site (ires) that is active during poliovirus infection generation of multiple isoforms of eukaryotic translation initiation factor gi by use of alternate translation initiation codons specific isoforms of translation initiation factor gi show differences in translational activity regulation of protein synthesis in hela cells. . inhibition during poliovirus infection control of protein synthesis in extracts from poliovirus-infected cells. i. mrna discrimination by crude initiation factors poliovirus proteinase a induces cleavage of eucaryotic initiation factor f polypeptide p a poliovirus a(pro) mutant unable to cleave cd shows inefficient viral protein synthesis and transactivation defects direct cleavage of elf g by poliovirus a protease is inefficient in vitro human rhinovirus infection of hela cells results in the proteolytic cleavage of the p cap-binding complex subunit and inactivates globin mrna translation in vitro mapping the cleavage site in protein synthesis initiation factor eif- γ of the a proteases from human coxsackievirus and rhinovirus a proteinases of coxsackie-and rhinovirus cleave peptides derived from eif- gamma via a common recognition motif the eif g-eif e complex is the target for direct cleavage by the rhinovirus a proteinase poliovirus a proteinase cleaves directly the eif- g subunit of eif- f complex relationship of eukaryotic initiation factor to poliovirus-induced p cleavage activity multiple eif gi-specific protease activities present in uninfected and poliovirus-infected cells lack of direct correlation between p cleavage and the shut-off of host translation after poliovirus infection monensin and nigericin prevent the inhibition of host translation by poliovirus, without affecting p cleavage proteolysis of human eukaryotic translation initiation factor eif gii, but not eif gi, coincides with the shutoff of host protein synthesis after poliovirus infection eukaryotic initiation factor gii (eif gii), but not eif gi, cleavage correlates with inhibition of host cell protein synthesis after human rhinovirus infection high level expression in escherichia coli cells and purification of poliovirus protein a(pro) cleavage of p by purified poliovirus a(pro) in cell-free systems: effects on translation of capped and uncapped mrnas poly(a)-binding protein interaction with elf g stimulates picornavirus ires-dependent translation hiv- protease inhibits cap-and poly(a)-dependent translation upon eif gi and pabp cleavage human immunodeficiency virus tat-activated expression of poliovirus protein a inhibits mrna translation the effect of poliovirus proteinase a(pro) expression on cellular metabolism. inhibition of dna replication, rna polymerase ii transcription, and translation expression of poliovirus a(pro) in mammalian cells: effects on translation efficient cleavage of p by poliovirus a(pro) expression in mammalian cells: effects on vaccinia virus hybrid proteins between pseudomonas exotoxin a and poliovirus protease a(pro) effects of poliovirus a(pro) on vaccinia virus gene expression the eukaryotic translation initiation factor gi is cleaved by different retroviral proteases hybrid proteins between pseudomonas aeruginosa exotoxin a and poliovirus apro cleave p in hela cells cleavage of eukaryotic translation initiation factor g by exogenously added hybrid proteins containing poliovirus a(pro) in hela cells: effects on gene expression a stable hela cell line that inducibly expresses poliovirus a(pro): effects on cellular and viral gene expression poliovirus a protease induces apoptotic cell death eif g is required for the pioneer round of translation in mammalian cells cleavage of poly(a)-binding protein by enterovirus proteases concurrent with inhibition of translation in vitro cleavage of poly(a)-binding protein by coxsackievirus a protease in vitro and in vivo: another mechanism for host protein synthesis shutoff? efficient cleavage of ribosome-associated poly(a)-binding protein by enterovirus c protease cleavage of poly(a)-binding protein by poliovirus c protease inhibits host cell translation: a novel mechanism for host translation shutoff foot-andmouth disease virus leader proteinase: purification of the lb form and determination of its cleavage site on eif- γ foot-and-mouth disease virus c protease induces cleavage of translation initiation factors eif a and eif g within infected cells cleavage of eukaryotic translation initiation factor gii within foot-and-mouth disease virus-infected cells: identification of the l-protease cleavage site in vitro sequential modification of translation initiation factor elf gl by two different foot-andmouth disease virus proteases within infected baby hamster kidney cells: identification of the c cleavage site conducting the initiation of protein synthesis: the role of eif g hiv- protease cleaves eukaryotic initiation factor g and inhibits cap-dependent translation in vitro cleavage of eif gi but not eif gii by hiv- protease and its effects on translation in the rabbit reticulocyte lysate system cleavage of eif g by hiv- protease: effects on translation hiv protease cleaves poly(a)-binding protein cap-poly(a) synergy in mammalian cellfree extracts. investigation of the requirements for poly(a)-mediated stimulation of translation initiation characterization of an internal ribosomal entry segment within the ' leader of avian reticuloendotheliosis virus type a rna and development of novel mlv-rev-based retroviral vectors an internal ribosome entry segment promotes translation of the simian immunodeficiency virus genomic rna the leader of human immunodeficiency virus type genomic rna harbors an internal ribosome entry segment that is active during the g/m phase of the cell cycle cleavage of eukaryotic initiation factor elf g and inhibition of host-cell protein synthesis during feline calicivirus infection calicivirus c-like proteinase inhibits cellular translation by cleavage of poly(a)-binding protein inhibition of nuclear import and alteration of nuclear pore complex composition by rhinovirus leader-induced phosphorylation of nucleoporins correlates with nuclear trafficking inhibition by cardioviruses vesicular stomatitis virus matrix protein inhibits host cell gene expression by targeting the nucleoporin nup influenza virus targets the mrna export machinery and the nuclear pore complex exporting rna from the nucleus to the cytoplasm crossing the nuclear envelope: hierarchical regulation of nucleocytoplasmic transport the yeast nuclear pore complex: composition, architecture, transport mechanism proteomic analysis of the mammalian nuclear pore complex the part and the whole: functions of nucleoporins in nucleocytoplasmic transport nuclear pore complex is able to transport macromolecules with diameters of about nm nucleocytoplasmic transport: taking an inventory the nuclear pore complex: bridging nuclear transport and gene regulation nuclear localization signal and protein context both mediate importin α specificity of nuclear import substrates regulation of nuclear transport: central role in development and transformation inhibition of crm -p interaction and nuclear export of p by poly(adp-ribosyl)ation arginine methylation of rna helicase a determines its subcellular localization identification of a trna-specific nuclear export receptor identification of a nuclear export receptor for trna exportin- -mediated nuclear export of eukaryotic elongation factor a and trna phax, a mediator of u snrna nuclear export whose activity is regulated by phosphorylation nmd p is a crm p-dependent adapter protein for nuclear export of the large ribosomal subunit s pre-ribosome formation viewed from assembly in the nucleolus until export to the cytoplasm release of the export adapter, nmd p, from the s ribosomal subunit requires rpl p and the cytoplasmic gtpase lsg p human mrna export machinery recruited to the ' end of mrna splicing factor sub p is required for nuclear mrna export through its interaction with yra p the exon-exon junction complex provides a binding platform for factors involved in mrna export and nonsense-mediated mrna decay magoh, a human homolog of drosophila mago nashi protein, is a component of the splicing-dependent exon-exon junction complex a ranindependent pathway for export of spliced mrna delineation of mrna export pathways by the use of cell-permeable peptides viral ribonucleoprotein complex formation and nucleolar-cytoplasmic relocalization of nucleolin in poliovirus-infected cells inhibition of nucleo-cytoplasmic trafficking by rna viruses: targeting the nuclear pore complex a cellular protein that binds to the '-noncoding region of poliovirus rna: implications for internal translation initiation a cytoplasmic -kda protein that is required for translation of picornavirus rna by internal ribosomal entry is identical to the nuclear pyrimidine tract-binding protein human protein sam relocalization and interaction with poliovirus rna polymerase in infected cells intracellular redistribution of truncated la protein produced by poliovirus c(pro)-mediated cleavage kh domain integrity is required for wild-type localization of sam translation of polioviral mrna is inhibited by cleavage of polypyrimidine tractbinding proteins executed by polioviral c(pro) bidirectional increase in permeability of nuclear envelope upon poliovirus infection and accompanying alterations of nuclear pores cytoplasmic nucleic acid sensors in antiviral immunity versatility at the nuclear pore complex: lessons learned from the nucleoporin nup caspasedependent proteolysis of integral and peripheral proteins of nuclear membranes and nuclear pore complex proteins during apoptosis specific cleavage of the nuclear pore complex protein nup by a viral protease role of nucleoporin induction in releasing an mrna nuclear export block nucleocytoplasmic traffic disorder induced by cardioviruses a picornavirus protein interacts with ran-gtpase and disrupts nucleocytoplasmic transport leader protein of encephalomyocarditis virus binds zinc, is phosphorylated during viral infection, and affects the efficiency of genome translation inhibition of ran guanosine triphosphatase-dependent nuclear transport by the matrix protein of vesicular stomatitis virus vsv disrupts the rae /mrnp mrna nuclear export pathway influenza virus ns protein inhibits pre-mrna splicing and blocks mrna nucleocytoplasmic transport rig-i-mediated antiviral responses to single-stranded rna bearing -phosphates inhibition of retinoic acid-inducible gene i-mediated induction of beta interferon by the ns protein of influenza a virus gemin is a novel component of the survival motor neuron complex and functions in small nuclear ribonucleoprotein assembly gemin is required for the architecture and function of the survival motor neuron complex a multiprotein complex mediates the atp-dependent assembly of spliceosomal u snrnps smnrp is an essential pre-mrna splicing factor required for the formation of the mature spliceosome specific sequence features, recognized by the smn complex, identify snrnas and determine their fate as snrnps biogenesis of small nuclear rnps inhibition of u snrnp assembly by a virus-encoded proteinase poliovirusencoded protease a(pro) cleaves the tata-binding protein but does not inhibit host cell rna polymerase ii transcription in vitro the interaction of cytoplasmic rna viruses with the nucleus cleavage site analysis in picornaviral polyproteins: discovering cellular targets by neural networks enteroviral protease a cleaves dystrophin: evidence of cytoskeletal disruption in an acquired cardiomyopathy a proteinase of human rhinovirus cleaves cytokeratin in infected hela cells our group is supported by a dgicyt grant (no. bfu - ). the institutional grant awarded to the centro de biología molecular "severo ochoa" by the fundación ramón areces is also acknowledged. a. castello is a beneficiary of marie curie ief fellowship (seventh framework programme). alfredo castelló and enriqueÁlvarez contributed equally to this work. key: cord- - jzzy authors: reggiori, fulvio; komatsu, masaaki; finley, kim; simonsen, anne title: autophagy: more than a nonselective pathway date: - - journal: int j cell biol doi: . / / sha: doc_id: cord_uid: jzzy autophagy is a catabolic pathway conserved among eukaryotes that allows cells to rapidly eliminate large unwanted structures such as aberrant protein aggregates, superfluous or damaged organelles, and invading pathogens. the hallmark of this transport pathway is the sequestration of the cargoes that have to be degraded in the lysosomes by double-membrane vesicles called autophagosomes. the key actors mediating the biogenesis of these carriers are the autophagy-related genes (atgs). for a long time, it was assumed that autophagy is a bulk process. recent studies, however, have highlighted the capacity of this pathway to exclusively eliminate specific structures and thus better fulfil the catabolic necessities of the cell. we are just starting to unveil the regulation and mechanism of these selective types of autophagy, but what it is already clearly emerging is that structures targeted to destruction are accurately enwrapped by autophagosomes through the action of specific receptors and adaptors. in this paper, we will briefly discuss the impact that the selective types of autophagy have had on our understanding of autophagy. three different pathways can deliver cytoplasmic components into the lumen of the lysosome for degradation. they are commonly referred to as autophagy (cell "self-eating") and include chaperone-mediated autophagy (cma), microautophagy, and macroautophagy. cma involves the direct translocation of specific proteins containing the kferq pentapeptide sequence across the lysosome membrane [ , ] . microautophagy, on the other hand, entails the invagination and pinching off of the lysosomal limiting membrane, which allows the sequestration and elimination of cytoplasmic components. the molecular mechanism underlying this pathway remains largely unknown. the only cellular function that so far has been indisputably assigned to microautophagy is the turnover of peroxisomes under specific conditions in fungi [ ] . recently, it has been reported the existence of a microautophagy-like process at the late endosomes, where proteins are selectively incorporated into the vesicles that bud inward at the limiting membrane of these organelles during the multivesicular bodies biogenesis [ ] . in contrast to cma and microautophagy, macroautophagy (hereafter referred to as autophagy) entails the formation of a new organelle, the autophagosome, which allows the delivery of a large number of different cargo molecules into the lysosome. autophagy is a primordial and highly conserved intracellular process that occurs in most eukaryotic cells and participates in stress management. this pathway involves the de novo formation of vesicles called autophagosomes, which can engulf entire regions of the cytoplasm, individual organelles, protein aggregates, and invading pathogens ( figure ). the autophagosomes fuse with endosomal compartments to form amphisomes prior to fusion with the lysosome, where their contents are degraded and the resulting metabolites are recycled back to the cytoplasm (figure ). unique features of the pathway include the double-membrane structure of the autophagosomes, which were originally characterized over years ago from detailed electron microscopy studies [ ] . starting in the s yeast mutational studies began the genetic and molecular characterization of the key components required to initiate and build an autophagosome in response to inactivation of mtorc (but also other cellular and environmental cues), the ulk complex is activated and translocates in proximity of the endoplasmic reticulum (er). thereafter, the ulk complex regulates the class iii pi k complex. atg l, a multimembrane spanning protein, is also involved in an early stage of autophagosome formation by probably supplying part of the membranes necessary for the formation and/or expansion. local formation of pi p at sites called omegasomes promotes the formation of the phagophore, from which autophagosomes appear to be generated. the pi p-binding wipi proteins (yeast atg homolog), as well as the atg -atg -atg l complex and the lc -phosphatidylethanolamine (pe) conjugate play important roles in the elongation and closure of the isolation membrane. finally, the complete autophagosome fuses with endosomes or endosome-derived vesicles forming the amphisome, which subsequently fuses with lysosomes to form autolysosomes. in the lysosomes, the cytoplasmic materials engulfed by the autophagosomes are degraded by resident hydrolases. the resulting amino acids and other basic cellular constituents are reused by the cell; when in high levels they also reactivate mtorc and then suppress autophagy. [ ] . subsequently, genetic and transgenic studies in plants, worms, fruit flies, mice, and humans have underscored the pathway's conservation and have begun to unveil the intricate vital role that autophagy plays in the physiology of cells and multicellular organisms. for a long time, autophagy was considered a nonselective pathway induced as a survival mechanism in response to cellular stresses. over the past several years, however, it has become increasingly evident that autophagy also is a highly selective process involved in clearance of excess or dysfunctional organelles, protein aggregates and intracellular pathogens. in this introductory piece, we will briefly discuss the molecular mechanisms of selective types of autophagy and their emerging importance as a quality control to maintain cellular and organismal health, aspects that will be presented in deep in the reviews of this special issue of the international journal of cell biology and highlighted by the research papers. . . the function of the atg proteins. autophagosomes are formed by expansion and sealing of a small cistern known as the phagophore or isolation membrane ( figure ). once complete, they deliver their cargo into the hydrolytic lumen of lysosomes for degradation. a diverse set of components are involved in the biogenesis of autophagosomes, which primarily includes the proteins encoded by the autophagyrelated genes (atg). most atg genes have initially been identified and characterized in yeast. subsequent studies in higher eukaryotes have revealed that these key factors are highly conserved. to date, atg proteins have been identified and are part of the core atg machinery essential for all autophagy-related pathways [ ] . upon autophagy induction, these proteins associate following a hierarchical order [ , ] to first mediate the formation of the phagophore and then to expand it into an autophagosome [ , ] . while their molecular functions and their precise contribution during the biogenesis of double-membrane vesicles remain largely unknown, they have been classified in functional groups of genes: ( ) the atg /ulk complex, ( ) the phosphatidylinositol -kinase (pi k) complex, ( ) the atg trafficking system, and ( ) the two parallel ubiquitinlike conjugation systems ( figure ). the atg /ulk complex consists of atg , atg , and atg in yeast, and ulk / , atg , fip and atg in mammals [ ] [ ] [ ] [ ] . this complex is central in mediating the induction of autophagosome biogenesis and as a result it is the terminal target of various signaling cascades regulating autophagy, such as the tor, insulin, pka, and ampk pathways [ ] (figure ). increased activity of the atg /ulk kinase is the primary event that determines the acute induction and upregulation of autophagy. it is important to note that ulk is part of a protein family and two other members, ulk and ulk , have been shown play a role in autophagy induction as well [ , ] . the expansion of this gene family may reflect the complex regulation and requirements of the pathway in multicellular long-lived organisms. stimulation of the ulk kinases is achieved through an intricate network of phosphorylation and dephosphorylation modifications of the various subunits of the atg /ulk complex. for example, atg is directly phosphorylated by tor and the phosphorylation state of atg modulates its binding to atg and atg . inactivation of tor leads to a rapid dephosphorylation of atg , which increases atg -atg -atg complex formation, stimulates the atg kinase activity and induces autophagy [ , ] . the matg is also essential for autophagy, but seems to directly interact with ulk , ulk and fip independently of its phosphorylation state [ , ] . in addition, there are several phosphorylation events within this complex as well, including phosphorylation of matg by ulk , ulk , and tor; phosphorylation of fip by ulk and ulk ; phosphorylation of ulk and ulk by tor [ , ] . additional studies are required to fully characterize the functional significance of these posttranslational modifications. autophagy is also regulated by the activity of pi k complexes. yeast contains a single pi k, vps , which is present in two different tetrameric complexes that share common subunits, vps , vps , and atg [ ] . complex i is required for the induction of autophagy and through its fourth component, atg , associates to the autophagosomal membranes where the lipid kinase activity of vps is essential for generating the phosphatidylinositol- -phosphate (pi p) that permits the recruitment of other atg proteins [ , ] (figure ). complex ii contains vps as the fourth subunit and it is involved in endosomal trafficking and vacuole biogenesis [ ] . there are three types of pi k in mammals: class i, ii, and iii. the functions of class ii pi k remains largely unknown, but both classes i and iii pi ks are involved in autophagy. while class i pi k is principally implicated in the modulation of signalling cascades, class iii pi k complexes regulate organelle biogenesis and, like yeast, contain three common components: hvps , p (vps ortholog), and beclin (atg ortholog). the counterparts of atg and vps are called atg l/barkor and uvrag, respectively [ ] [ ] [ ] . the atg l-containing complex plays a central role in autophagy and functions very similarly as the yeast complex i by directing the class iii pi k complex i to the phagophore to produce pi p and initiate the recruitment of the atg machinery ( figure ). atg l is thought to be present on the er irrespective of autophagy induction [ ] . upon starvation, atg l localizes to autophagosomal membranes [ ] . importantly, depletion of atg l reduces pi p production, impairs the formation of autophagosomal precursor structures, and inhibits autophagy [ , , , ] . the uvrag-containing class iii pi k complex also regulates autophagy but it appears to act at the intersection between autophagy and the endosomal transport pathways. uvrag initially associates with the bar-domain protein bif- , which may regulate matg trafficking from the trans-golgi network (tgn) [ , ] . uvrag then interacts with the class c vps/hops protein complex, promoting the fusion of autophagosomes with late endosomes and/or lysosomes [ ] . finally, the uvrag-containing class iii protein complex binds to rubicon, a late endosomal and lysosomal protein that suppresses autophagosome maturation by reducing hvps activity [ , ] . importantly, both the atg l-and uvrag-containing complexes interact through beclin with ambra , which in turn tethers these protein complexes to the cytoskeleton via an interaction with dynein [ , ] . following the induction of autophagy, ulk phosphorylates ambra thus releasing the class iii pi k complexes from dynein and their subsequent relocalization triggers autophagosome formation. therefore, ambra constitutes a direct regulatory link between the atg /ulk and the pi k complexes [ ] . together with the atg /ulk and the pi k complexes, atg is one of the first factors localizing to the preautophagosomal structure or phagophore assembly site (pas), the structure believed to be the precursor of the phagophore [ , ] (figure ). atg is the only conserved transmembrane protein that is essential for autophagy. it is distributed to the pas and multiple additional cytoplasmic tubulovesicular compartments derived from the golgi [ ] [ ] [ ] . atg cycles between these two locations and consequently it is thought to serve as a membrane carrier providing the lipid building blocks for the expanding phagophore [ ] . one of the established functions of atg is that it leads to the formation of the yeast pas when at least one of the cytoplasmic tubulovesicular compartments translocates near the vacuole [ ] . atg is also essential to recruit the pi k complex i to the pas [ ] . retrieval transport of yeast atg from the pas and/or complete autophagosome is mediated by the international journal of cell biology atg -atg complex [ ] and appears to be regulated by the atg /ulk and pi k complexes [ ] . mammalian atg (matg ) has similar characteristics to its yeast counterpart. matg localizes to the tgn and late endosomes and redistributes to autophagosomal structures upon the induction of autophagy (figure ) [ ] , further promoting pathway activity [ , [ ] [ ] [ ] . as in yeast, cycling of matg between locations also requires the atg /ulk complex and kinase activity hvps [ , ] . the core atg machinery also entails two ubiquitin-like proteins, atg and atg /microtubule-associated protein (map )-light chain (lc ), and their respective, partially overlapping, conjugation systems [ ] [ ] [ ] (figure ). atg is conjugated to atg through the activity of the atg (e like) and the atg (e -like) enzymes. the atg -atg conjugate then interacts with atg , which oligomerizes to form a large multimeric complex. atg /lc is cleaved at its c terminus by the atg protease to generate the cytosolic lc -i with a c-terminal glycine residue, which is then conjugated to phosphatidylethanolamine (pe) in a reaction that requires atg and the e -like enzyme atg . this lipidated form of lc (lc -ii) is attached to both faces of the phagophore membrane. once the autophagosome is completed, atg removes lc -ii from the outer autophagosome surface. these two ubiquitination-like systems appear to be closely interconnected. on one hand, the multimeric atg -atg -atg complex localizes to the phagophore and acts as an e -like enzyme, determining the site of atg /lc lipidation [ , ] . on the other hand, the atg /lc conjugation machinery seems to be essential for the optimal functioning of the atg conjugation system. in atg -deficient mice, atg -atg conjugation is markedly reduced, and normal dissociation of the atg -atg -atg complex from the phagophore is delayed [ ] . some evidences suggest that these two conjugation systems also function together during the expansion and closure of the phagophore. for example, overexpression of an inactive mutant of atg inhibits the lipidation of lc and leads to the accumulation of a number of nearly complete autophagosomes [ ] . while controversial [ ] , it has been postulated that atg /lc also possesses fusogenic properties, thus mediating the assembly of the autophagic membrane [ , ] . it has to be noted that mammals possess at least genes coding for lc /atg proteins that can be grouped into three subfamilies: ( ) the lc subfamily containing lc a, lc b, lc b and lc c; ( ) the gammaaminobutyrate receptor-associated protein (gabarap) subfamily comprising gabarap and gabarapl (also called gec- ); ( ) the golgi-associated atpase enhancer of kda (gate- ) protein (also called gabarap-l /gef ) [ ] . although in vivo studies show that they are all conjugated to pe, they appear to have evolved complex nonredundant functions [ ] . membranes. the origin of the membranes composing autophagosomes is a long-standing mystery in the field of autophagy. a major difficulty in addressing this question has been that phagophores as well as autophagosomes do not contain marker proteins of other subcellular compartments [ , ] . a series of new studies has implicated several cellular organelles as the possible source for the autophagosomal lipid bilayers. the plasma membrane and elements of the trafficking machinery to the cell surface have been linked to the formation of an early autophagosomal intermediate, perhaps the phagophore [ ] [ ] [ ] [ ] [ ] . it is possible that early endosomal-and/or golgiderived membranes are also key factors in the initial steps of autophagy [ , , ] . the golgi, moreover, appears also important for autophagy by supplying at least in part the extra lipids required for the phagophore expansion [ , [ ] [ ] [ ] [ ] . the endoplasmic reticulum (er) is also central in this latter event. while the relevance of the er in autophagosome biogenesis was already pointed out a long time ago [ , , , ] , recently two electron tomography studies have demonstrated the existence of a physical connection between the er and the forming autophagosomes [ , ] . these analyses have revealed that the er is connected to the outer as well as the inner membrane of the phagophore through points of contact, supporting the notion that lipids could be supplied via direct transfer at the sites of membrane contact. in line with this view, it has been found that atg l is associated to the er and pi p is generated on specific subdomains of this organelle from where autophagosomes emerge under autophagy-inducing conditions [ , ] (figure ). it has also been proposed that the outer membrane of the mitochondria is the main source of the autophagosomal lipid bilayers, but while the experimental evidences appear to show that mitochondria are essential for the phagophore expansion, it remains unclear whether these organelles play a key role in the phagophore biogenesis [ ] . the discrepancy between the conclusions of the various studies has not allowed yet drawing a model about the membrane dynamics during autophagosome biogenesis. the different results could be due to the different experimental conditions and model systems used by the various laboratories. alternatively, the lipids forming the autophagosomes could have different sources depending on the cell and the conditions inducing autophagy [ , ] . a third possibility is that the source of phagophore membrane could depend on the nature of the double-membrane vesicle cargo. additional investigations are required to shed light on these issues. despite the potential of curing, quite a substantial range of specific pathological conditions by inducting autophagy, there are currently no small molecules that allow to exclusively stimulate this pathway [ ] . nevertheless, there is a variety of chemicals that by acting on signaling cascades that also regulate autophagy permit to trigger this degradative process. these agents fall into two distinct categories based on the mechanism of action; whether they work through an mtordependent (rapamycin or torin) or mtor-independent pathway (e.g., lithium or resveratrol) [ ] . in addition to these compounds, there are biological molecules such as interferon γ (ifnγ) and vitamin d that can be used to stimulate autophagy especially in experimental setups [ , ] . international journal of cell biology inhibition of autophagy can also be beneficial in specific diseases but as for the inducers there are no compounds that exclusively block this pathway without affecting other cellular processes. the small molecules inhibiting autophagy include wortmannin and -methyladenine, which hamper the activity of the pi k; bafilomycin a and chloroquine, which impair the degradative activity of lysosomes [ ] . they are currently solely used in the basic research on autophagy. it is becoming increasingly evident that autophagy is a highly selective quality control mechanism whose basal levels are important to maintain cellular homeostasis (see below). a number of organelles have been found to be selectively turned over by autophagy and cargo-specific names have been given to distinguish the various selective pathways, including the er (reticulophagy or erphagy), peroxisomes (pexophagy), mitochondria (mitophagy), lipid droplets (lipophagy), secretory granules (zymophagy), and even parts of the nucleus (nucleophagy). moreover, pathogens (xenophagy), ribosomes (ribophagy), and aggregate-prone proteins (aggrephagy) are specifically targeted for degradation by autophagy [ ] . selective types of autophagy perform a cellular quality control function and therefore they must be able to distinguish their substrates, such as protein aggregates or dysfunctional mitochondria, from their functional counterparts. the molecular mechanisms underlying cargo selection and regulation of selective types of autophagy are still largely unknown. this has been an area of intense research during the last years and our understanding of the various selective types of autophagy is starting to unravel. a recent genomewide small interfering rna screen aimed at identifying mammalian genes required for selective autophagy found candidate genes to be required for viral autophagy and of those genes were also required for parkin-mediated mitophagy [ ] . in general, these pathways appear to rely upon specific cargo-recognizing autophagy receptors, which connect the cargo to the autophagic membranes. the autophagy receptors might also interact with specificity adaptors, which function as scaffolding proteins that bring the cargo-receptor complex in contact with the core atg machinery to allow the specific sequestration of the substrate. the selective types of autophagy appear to rely on the same molecular core machinery as non-selective (starvation-induced) bulk autophagy. in contrast, the autophagy receptors and specificity adapters do not seem to be required for nonselective autophagy. autophagy receptors are defined as proteins being able to interact directly with both the autophagosome cargo and the atg /lc family members through a specific (wxxl) sequence [ ] , commonly referred to as the lc -interacting region (lir) motif [ ] or the lc recognition sequences (lrs) [ ] . based on comparison of lir domains from more than autophagy receptors it was found that the lir consensus motif is an eight amino acids long sequence that can be written d/e-d/e-d/e-w/f/y-x-x-l/i/v. although not an absolute requirement, usually there is at least one acidic residue upstream of the w-site. the terminal l-site is occupied by a hydrophobic residue, either l, i, or v [ ] . the lir motifs of several autophagy receptors have been found to interact both with lc and gabarap family members in vitro, but whether this reflects a physiological interaction remains to be clarified in most cases. it should be pointed out that not all lir-containing proteins are autophagy cargo receptors. some lir-containing proteins, like atg and atg b, are recruited to autophagic membranes to perform their function in autophagosome formation [ , ] , whereas others like fyve and coiled-coil domain-containing protein (fyco ) interact with lc to facilitate autophagosome transport and maturation [ ] . others might use an lir motif to become degraded, like dishevelled, an adaptor protein in the wnt signalling pathway [ ] . the adaptor proteins are less well-described, but seem to interact with autophagy receptors and work as scaffold proteins recruiting and assembling the atg machinery required to generate autophagosomes around the cargo targeted to degradation. examples of autophagy adaptors are atg and alfy [ , ] . the list of specific autophagy receptors is rapidly growing and the role of several of them in different types of selective autophagy will be described in detail in the reviews of this special issue. here we will briefly discuss the best studied form of selective autophagy, the yeast cytosol to vacuole targeting (cvt) pathway, as well as the best studied mammalian autophagy receptor, p /sequestosome (sqstm ) (figure ). the cvt pathway is a biosynthetic process mediating the transport of the three vacuolar hydrolases, aminopeptidase (ape ), aminopeptidase (ape ) and α-mannosidase (ams ), and the ty transposome into the vacuole [ , ] . ape is synthesized as a cytosolic precursor (prape ), which multimerizes into the higher order ape oligomer, to which ape , ams , and ty associate to form the socalled cvt complex, prior to being sequestered into a small autophagosome-like cvt vesicle. sequestration of the cvt complex into cvt vesicles is a multistep process, which requires the autophagy receptor atg , which facilitates binding to atg at the pas, as well as the adaptor protein atg (figure (a) ) [ ] . atg acts as a scaffold protein by directing the cvt complex and atg reservoirs translocation to the pas in an actin-dependent way and then recruiting the atg /ulk complex [ , ] . the pi p-binding proteins atg , atg , and atg are also required for the cvt pathway [ , ] , but their precise function remains to be elucidated. interestingly, atg overexpression was found to recruit more atg and atg to the pas resulting in more cvt vesicles. this observation indicates that atg levels could regulate the rate of selective autophagy, and maybe also the size of the cargo-containing autophagosomes in yeast [ , ] . indeed, a series of studies has revealed that atg is also involved in other types of selective autophagy such as mitophagy and pexophagy. however, the autophagy receptors involved in the different atg -dependent types figure : representative selective autophagy. (a) the cytoplasm-to-vacuole targeting (cvt) pathway. ape is synthesized as a cytoplasmic precursor protein with a propeptide and rapidly oligomerizes into dodecamers that subsequently associate with each other to form a higher order complex. the autophagy receptor atg directly binds to the complex and mediates the recruitment of another cvt pathway cargo, ams , leading to the formation of the so-called cvt complex. atg also interacts with the autophagy adaptor atg and this protein allows the transport of the cvt complex to the site where the double-membrane vesicle will be generated. at this location, atg tethers the atg proteins essential for the cvt vesicle formation and the direct binding of atg to atg permits the exclusive sequestration of the cvt complex into the vesicle. (b) a model for p and nbr as autophagy receptors for ubiquitinated cargos. p and nbr bind with ubiquitinated cargos via their ubiquitin-associated (uba) domain and this interaction triggers the aggregate formation through the oligomerization of p via its phox and bem p (pb ) domain. furthermore, p interacts with both autophagy-linked fyve protein (alfy), which serves to recruit atg and to bind pi p, and directly with lc . this latter event appears to organize and activate the atg machinery in close proximity of the ubiquitinated cargos, which allows to selectively sequester them in the autophagosomes in analogous to the cvt pathway. of selective autophagy are different as atg is required for mitophagy [ , ] , whereas atg is essential for pexophagy [ ] . like atg , these two proteins have an atg -binding lir motif and directly interact with atg . mammalian cells appear to not possess an atg homologue, and further studies are necessary to delineate the molecular machinery involved in sequestration and targeting of different cargoes for degradation by autophagy in higher eukaryotes. the mechanism of the cvt pathway is reminiscent of the selective form of mammalian autophagy called aggrephagy, which involves degradation of misfolded and unwanted proteins by packing them into ubiquitinated aggregates. in both cases aggregation of the substrate (prape or misfolded proteins) is required prior to sequestration into cvt vesicles or autophagosomes, respectively [ ] [ ] [ ] . similar to cvt vesicles, aggregate-containing autophagosomes appear to be largely devoid of cytosolic components suggesting that the vesicle membrane expands tightly around its cargo [ ] . aggrephagy also depends on proteins with exclusive functions in substrate selection and targeting [ , , , ] . the autophagy receptors p and neighbour of brca gene (nbr ) bind both ubiquitinated protein aggregates through an ubiquitin-associated (uba) domain and to lc via their lir motifs and, thereby, promote the specific autophagic degradation of ubiquitinated proteins (figure (b) ) [ , , , , ] . nbr and p also contain an nterminal phox and bem p (pb ) domain through which they can oligomerize, or interact with other pb -containing binding partners [ ] . in addition to being a cargo receptor for protein aggregates, p has been implicated in autophagic degradation of other ubiquitinated substrates such as intracellular bacteria [ ] , viral capsid proteins [ ] , the midbody remnant formed after cytokinesis [ ] , peroxisomes [ , ] , damaged mitochondria [ , ] , and bacteriocidal precursor proteins [ ] . the pb domain was recently found to be required for p to localize to the autophagosome formation site adjacent to the er [ ] , suggesting that it could target ubiquitinated cargo to the site of autophagosome formation or alternatively promote the assembly of the atg machinery at this location. international journal of cell biology the large scaffolding protein autophagy-linked fyve (alfy) appears to have a similar function as the specificity adaptor atg . alfy is recruited to aggregate-prone proteins through its interaction with p [ ] and through a direct interaction with atg and pi p it serves to recruit the core atg machinery and allow formation of autophagic membranes around the protein aggregate [ ] (figure (b) ). interestingly, alfy is recruited from the nucleus to cytoplasmic ubiquitin-positive structures upon cell stress suggesting that it might regulate the level of aggrephagy [ ] . in line with this, it was found that overexpression of alfy in mouse and fly models of huntington's disease reduced the number of protein inclusions [ ] . it will be interesting to determine whether alfy, as p , is involved in other selective types of autophagy such as the one eliminating midbody ring structures or mitochondria. it is well known that posttranslational modifications like phosphorylation and ubiquitination are involved in the regulation of the activity of proteins involved in autophagy and degradation of autophagic cargo proteins, respectively. however, little is known about how these modifications may regulate selective autophagy. the fact that the core atg machinery is required for both nonselective and selective types of autophagy gives raise to the question of whether these two types of autophagy may compete for the same molecular machinery. such a competition could be detrimental for the cells undergoing starvation and to avoid this, there might be a tight regulation of the expression level and/or activity of the proteins specifically involved the selective autophagy. it has recently been proposed that phosphorylation of autophagy receptors might be a general mechanism for the regulation of selective autophagy. dikic and coworkers noted that several autophagy receptors contain conserved serine residues adjacent to their lir motifs and indeed, the tank binding kinase (tbk ) was found to phosphorylate a serine residue close to the lir motif of the autophagy receptor optineurin. this modification enhances the lc binding affinity of optineurin and promotes selective autophagy of ubiquitinated cytosolic salmonella enterica [ ] . in yeast, phosphorylation of atg , the autophagy receptor for mitophagy, by mitogen-activated protein kinases was found to be required for mitophagy [ , ] . the atg /lc proteins themselves have also been found to become phosphorylated and recent works have identified specific phosphorylation sites for protein kinase a (pka) [ ] and protein kinase c (pkc) [ ] in the nterminal region of lc . interestingly, the n-terminal of lc is involved in the binding of lc to lir-containing proteins [ ] . it is therefore tempting to speculate that phosphorylation of the pka and pkc sites might facilitate or prevent the interaction of lc with lir-containing proteins such as p . it has been found that phosphorylation of the pka site, which is conserved in all mammalian lc isoforms, but not in gabarap, inhibits recruitment of lc into autophagosomes [ ] . the role of ubiquitin in autophagy has so far been ascribed as a signal for cargo degradation. ubiquitination of aggregate prone proteins, as well as bacteria and mitochondria, has been found to serve as a signal for recognition by autophagy receptors like p and nbr , which are themselves also degraded together with the cargo that they associate with [ ] . the in vivo specificity of p and nbr toward ubiquitin signals remains to be established under the different physiological conditions. interestingly, it was recently found that casein kinase -(ck -) mediated phosphorylation of the p uba domain increases the binding affinity of this motif for polyubiquitin chains leading to more efficient targeting of polyubiquitinated proteins to autophagy [ ] . ck overexpression or phosphatase inhibition reduced the formation of aggregates containing the polyglutamine-expanded huntingtin exon fragments in a p -dependent manner. the e ligases involved in ubiquitination of different autophagic cargo largely remains to be identified. however, it is known that the e ligases parkin and rnf both regulate mitophagy [ , ] . smurf (smad-specific e ubiquitin protein ligase ) was recently also implicated in mitophagy, as well as in autophagic targeting of viral particles [ ] . interestingly, the role of smurf in selective autophagy seems to be independent of its e ligase activity, but it rather depends on its membrane-targeting c domain, although the exact mechanism involved remains to be elucidated. it is also not clear whether ubiquitination could serve as a signal to regulate the activity or binding selectivity of proteins directly involved in autophagy, and whether this in some way could regulate selective autophagy. the role of ubiquitinlike proteins as sumo and nedd in autophagy is also unexplored. acetylation is another posttranslational modification that only recently has been implicated in selective autophagy. the histone de-acetylase (hdac ), initially found to mediate transport of misfolded proteins to the aggresome [ ] , was lately implicated in maturation of ubiquitinpositive autophagosomes [ ] . the fact that hdac overproduction in fly eyes expressing expanded polyq proteins is neuroprotective further indicates that hdac activity stimulates aggrephagy [ ] . furthermore, the acetylation of an aggrephagy cargo protein, muntant huntingtin, the protein causing huntington's disease, is important for its degradation by autophagy [ ] . hdac has been also implicated in parkin-mediated clearance of damaged mitochondria [ ] . the acetyl transferase(s) involved in these forms of selective autophagy is currently unknown, but understanding the role of acetylation in relation to various aspects of autophagy is an emerging field and it will very likely provide more mechanistic insights into these pathways. basal autophagy acts as the quality control pathway for cytoplasmic components and it is crucial to maintain the homeostasis of various postmitotic cells [ ] . while this quality control could be partially achieved by nonselective autophagy, growing lines of evidence have demonstrated international journal of cell biology that specific proteins, organelles, and invading bacteria are specifically degraded by autophagy (figure ). mice deficient in autophagy die either in utero (e.g., beclin and fip knockout mice) [ ] [ ] [ ] or within hours after birth due, at least in part, to a deficiency in the mobilization of amino acids from various tissues (e.g., atg , atg , atg , atg , and atg l knockout mice) [ , [ ] [ ] [ ] [ ] . as a result, to investigate the physiological roles of autophagy, conditional knockout mice for atg , atg , or fip and various tissue-specific atg knockout mice have been established and analyzed [ , , ] . for example, the liver-specific atg -deficient mouse displayed severe hepatomegaly accompanied by hepatocyte hypertrophy, resulting in severe liver injuries [ ] . mice lacking atg , atg , or fip in the central nervous system exhibited behavioral deficits, such as abnormal limb-clasping reflexes and reduction of coordinated movement as well as massive neuronal loss in the cerebral and cerebellar cortices [ ] [ ] [ ] . loss of atg in cardiac muscle caused cardiac hypertrophy, left ventricular dilatation, and systolic dysfunction [ ] . skeletal muscle-specific atg or atg knockout mice showed age-dependent muscle atrophy [ , ] . pancreatic β cell-specific atg knockout animals exhibited degeneration of islets and impaired glucose tolerance with reduced insulin secretion [ , ] . podocytespecific deletion of atg caused glomerulosclerosis in aging mice and these animals displayed increased susceptibility to proteinuric diseases caused by puromycin aminonucleoside and adriamycin [ ] . proximal tubule-specific atg knockout mice were susceptible to ischemia-reperfusion injury [ ] . finally, deletion of atg in bronchial epithelial cells resulted in hyperresponsiveness to cholinergic stimuli [ ] . all together, these results undoubtedly indicate that basal autophagy prevents numerous life-threatening diseases. how does impairment of autophagy lead to diseases? ultrastructural analyses of the mutant mice revealed a marked accumulation of swollen and deformed mitochondria in the mutant hepatocytes [ ] , pancreatic β cells [ , ] , cardiac and skeletal myocytes [ , ] and neurons [ ] , but also the appearance of concentric membranous structures consisting of er or sarcoplasmic reticulum in hepatocytes [ ] , neuronal axons [ , ] and skeletal myocytes [ ] , as well as an increased number of peroxisomes and lipid droplets in hepatocytes [ , ] . in addition to the accumulation of aberrant organelles, histological analyses of tissues with defective autophagy showed the amassment of polyubiquitylated proteins in almost all tissues (although the level varied from one region to another) forming inclusion bodies whose size and number increased with aging [ ] . consequently, basal autophagy also acts as the quality control machinery for cytoplasmic organelles (figure (a) ). although this could be partially achieved by bulk autophagy, these observations point to the existence of selective types of autophagy, a notion that is now supported by experimental data. p /sqstm is the best-characterized disease-related autophagy receptor and a ubiquitously expressed cellular protein conserved among metazoan but not in plants and fungi [ ] . besides a role of p as the receptor, this protein itself is specific substrate for autophagy. suppression of autophagy is usually accompanied by an accumulation of p mostly in large aggregates also positive for ubiquitin (figure (a) ) [ , ] . ubiquitin and p -positive inclusion bodies have been detected in numerous neurodegenerative diseases (i.e., alzheimer's disease, parkinson's disease, and amyotrophic lateral sclerosis), liver disorders (i.e., alcoholic hepatitis and steatohepatitis), and cancers (i.e., malignant glioma and hepatocellular carcinoma) [ ] . very interestingly, the p positive aggregates observed in hepatocytes and neurons of liver-and brain-specific atg deficient mice, respectively, as well as in human hepatocellular carcinoma cells, are completely dispersed by the additional loss of p strongly implicating involvement of p in the formation of diseaserelated inclusion bodies [ , ] . through its self-oligomerization, p is involved in several signal transduction pathways. for example, this protein functions as a signaling hub that may determine whether cells survive by activating the traf -nf-κb pathway, or die by facilitating the aggregation of caspase and the downstream effector caspases [ , ] . on the other hand, p interacts with the nrf -binding site on keap , a component of the cullin -type ubiquitin ligase for nrf , resulting in stabilization of nrf and transcriptional activation of nrf target genes including a battery of antioxidant proteins [ ] [ ] [ ] [ ] [ ] . it is thus plausible that excess accumulation or mutation of p leads to hyperactivation of these signaling pathways, resulting in a disease onset (figure (b) ). paget's disease of bone is a chronic and metabolic bone disorder that is characterized by an increased bone turnover within discrete lesions throughout the skeleton. mutations in the p gene, in particular in its uba domain, can cause this illness [ ] . a proposed model explaining how p mutations lead to the paget's disease of bone is the following: mutations of the uba domain cause an impairment in the interaction between p and ubiquitinated traf and/or cyld, an enzyme deubiquitinating traf , which in turn enhances the activation of the nf-κb signaling pathway and the resulting increased osteoclastogenesis (figure (b) ) [ ] . if proven, this molecular scenario could open the possibility of using autophagy enhancers as a therapy to cure paget's disease of bone. it is established that autophagy has a tumor-suppressor role and several autophagy gene products including beclin and uvrag are known to function as tumor suppressor proteins [ ] . the tumor-suppressor role of autophagy appears to be important particularly in the liver. spontaneous tumorigenesis is observed in the livers of mice with either a systemic mosaic deletion of atg or a hepatocytespecific atg deletion [ , ] . importantly, no tumors are formed in other organs in atg mosaically deleted mice. enlarged mitochondria, whose functions are at least partially impaired, accumulate in atg -or atg -deficient hepatocytes [ , ] . this observation is in line with the previous data obtained in ibmk cell lines showing that both the oxidative stress and genomic damage responses are activated by loss of autophagy [ , ] . again, it is clear that accumulation of p , at least partially, contributes to tumor growth because the size of the atg −/− liver tumors is reduced by the additional deletion of p [ ] , which may cause a dysregulation of nf-κb signaling [ ] and/or a persistent activation of nrf [ ] . almost all tissues with defective autophagy are usually displaying an accumulation of polyubiquitinated proteins [ ] . loss of autophagy is considered to lead to a delay in the global turnover of cytoplasmic components [ ] and/or to an impaired degradation of substrates destined for the proteasome [ ] . both observations could partially explain the accumulation of misfolded and/or unfolded proteins that is followed by the formation of inclusion bodies. as discussed above, p and nbr act as autophagy receptors for ubiquitinated cargos such as protein aggregates, mitochondria, midbody rings, bacteria, ribosomal proteins and virus capsids [ , ] (figure ). although these studies suggest the role of p as an ubiquitin receptor, it remains to be established whether soluble ubiquitinated proteins are also degraded one-by-one by p and possibly nbr . a mass spectrometric analysis has clearly demonstrated the accumulation of all detectable topologies of ubiquitin chain in atg deficient livers and brains, indicating that specific polyubiquitin chain linkage is not the decisive signal for autophagic degradation [ ] . because the increase in ubiquitin conjugates in the atg deficient liver and brain is completely suppressed by additional knockout of either p or nrf [ ] , accumulation of ubiquitinated proteins in tissues defective in autophagy might be attributed to p mediated activation of nrf , resulting in global transcriptional changes to ubiquitin-associated genes. further studies are needed to precisely elucidate the degradation mechanism of soluble ubiquitinated proteins by autophagy. concomitant with the energy production through oxidative phosphorylation, mitochondria also generate reactive oxygen species (ros), which cause damage through the oxidation of proteins, lipids and dna often inducing cell death. therefore, the quality control of mitochondria is essential to maintain cellular homeostasis and this process appears to be achieved via autophagy. it has been postulated that mitophagy contributes to differentiation and development by participating to the intracellular remodelling that occurs for example during haematopoiesis and adipogenesis. in mammalian red blood cells, the expulsion of the nucleus followed by the removal of other organelles, such as mitochondria, are necessary differentiation steps. nix/bnip l, an autophagy receptor whose structure resembles that of atg , is also an outer mitochondrial membrane protein that interacts with gabarap [ , ] and plays an important role in mitophagy during erythroid differentiation [ , ] (figure (c) ). although autophagosome formation probably still occurs in nix/bnip l deficient reticulocytes, mitochondrial elimination is severely impaired. consequently, mutant reticulocytes are exposed to increased levels of ros and die, and nix/bnip l knockout mice suffer severe anemia. depolarization of the mitochondrial membrane potential of mutant reticulocytes by treatment with an uncoupling agent results in restoration of mitophagy [ ] , emphasizing the importance of nix/binp l for the mitochondrial depolarization and implying that mitophagy targets uncoupled mitochondria. haematopoietic-specific atg knockout mice also exhibited severe anaemia as well as lymphopenia, and the mutant erythrocytes markedly accumulated degenerated mitochondria but not other organelles [ ] . the mitochondrial content is regulated during the development of the t cells as well; that is, the high mitochondrial content in thymocytes is shifted to a low mitochondrial content in mature t cells. atg or atg deleted t cells fail to reduce their mitochondrial content resulting in increased ros production as well as an imbalance in pro-and antiapoptotic protein expression [ ] [ ] [ ] . all together, these evidences demonstrate the essential role of mitophagy in haematopoiesis. recent studies have described the molecular mechanism by which damaged mitochondria are selectively targeted for autophagy, and have suggested that the defect is implicated in the familial parkinson's disease (pd) [ ] (figure (c) ). pink , a mitochondrial kinase, and parkin, an e ubiquitin ligase, have been genetically linked to both pd and a pathway that prevents progressive mitochondrial damage and dysfunction. when mitochondria are damaged and depolarized, pink becomes stabilized and recruits parkin to the damaged mitochondria [ , [ ] [ ] [ ] . various mitochondrial outer membrane proteins are ubiquitinated by parkin and mitophagy is then induced. of note, pd-related mutations in pink and parkin impair mitophagy [ , [ ] [ ] [ ] , suggesting that there is a link between defective mitophagy and pd. how these ubiquitinated mitochondria are recognized by the autophagosome remains unknown. although p has been implicated in the recognition of ubiquitinated mitochondria, elimination of the mitochondria occurs normally in p -deficient cells [ , ] . when specific bacteria invade host cells through endocytosis/phagocytosis, a selective type of autophagy termed xenophagy, engulfs them to restrict their growth [ ] (figure (d) ). although neither the target proteins nor the e ligases have yet been identified, invading bacteria such as salmonella enterica, listeria monocytogenes, or shigella flexneri become positive for ubiquitin when they access the cytosol by rupturing the endosome/phagosome limiting membrane [ , ] . these findings raise the possibility that ubiquitin also serves as a tag during xenophagy. in fact, to date, three proteins, p [ , , ] , ndp [ ] , and optineurin [ ] have been proposed to be autophagy receptors linking ubiquitinated bacteria and lc . an ubiquitin-independent mechanism has recently been revealed; recognition of a shigella mutant that lacks the icsb gene requires the tectonin domaincontaining protein (tecpr ), which appears to be a new type of autophagy adaptor targeting shigella to atg -and wipi- -positive membranes [ ] . interestingly, the shigella icsb normally prevents autophagic sequestration of this bacterium by inhibiting the interaction of shigella virg with atg indicating that some bacteria have developed mechanism to inhibit or subvert autophagy to their advantage [ ] . this latter category of pathogens also includes viruses such as herpes simplex virus- (hsv- ), which express an inhibitor (icp . ) of atg /beclin [ ] . however, it was recently shown that a mutant hsv- strain lacking icp . becomes degraded by selective autophagy in a smurf dependent manner [ ] , suggesting that selective autophagy plays an important role in our immune system. recently, a different antimicrobial function has been assigned to autophagy and this function appears to be selective. during infection, ribosomal protein precursors are transported by autophagy in a p -dependent manner into lysosomes [ ] . these ribosomal protein precursors are subsequently processed by lysosomal protease into small antimicrobial peptides. importantly, it has been shown that induction of autophagy during a mycobacterium tuberculosis infection leads to the fusion between phagosomes containing this bacterium and autophagosomes, and the production of the antimicrobial peptides in this compartment kills m. tuberculosis [ ] . while the molecular mechanism is largely unknown, autophagy contributes at least partially to the supply of free fatty acids in response to fasting (figure (e) ). fasting provokes the increase of the levels of free fatty acids circulating in the blood, which are mobilized from adipose tissues. these free fatty acids are rapidly captured by various organs including hepatocytes and then transformed into triglycerides by esterification within lipid droplets. these lipid droplets appear to be turned over by a selective type of autophagy that has been named lipophagy in order to provide endogenous free fatty acids for energy production through β-oxidation [ ] . indeed, liver-specific atg deficient mice display massive accumulation of triglycerides and cholesterol in the form of lipid droplets [ ] . agoutirelated peptide-(agrp-) expressing neurons also respond to increased circulating levels of free fatty acids after fasting and then induce autophagy to degrade the lipid droplets [ ] . similar to the case in hepatocytes, autophagy in the neurons supplies endogenous free fatty acids for energy production and seems to be necessary for gene expression of agpr, which is a neuropeptide that increases appetite and decreases metabolism and energy expenditure [ ] . originally, it was assumed that autophagy was exclusively a bulk process. recent experimental evidences have demonstrated that through the use of autophagy receptors and adaptors, this pathway can be selective by exclusively degrading specific cellular constituents. the list of physiological and pathological situations where autophagy is selective is constantly growing and this fact challenges the earliest concept whether autophagy can be nonselective. it is believe that under starvation, cytoplasmic structures are randomly engulfed by autophagosomes and delivered into the lysosome to be degraded and thus generate an internal pool of nutrients. in yeast saccharomyces cerevisiae, however, the degradation of ribosomes, for example, ribophagy, as well as mitophagy and pexophagy, and the transport of the prape oligomer into the vacuole under the same conditions requires the presence of autophagy receptors [ , [ ] [ ] [ ] . as a result, these observations suggest that autophagy could potentially always operate selectively. this is a conceivable hypothesis because this process allows the cell to survive stress conditions and the casual elimination of cytoplasmic structure in the same scenario could lead to the lethal depletion of an organelle crucial for cell survival. future studies will certainly provide more molecular insights into the regulation and mechanism of the selective types of autophagy, and this information will also be important to determine if indeed bulk autophagy exists. agrp: agouti-related peptide ampk: amp-activated protein kinase alfy: autophagy-linked fyve protein ams : α-mannosidase ape : aminopeptidase ape : aminopeptidase atg: autophagy-related gene bnip l: b-cell leukemia/lymphoma (bcl- )/adenovirus e b interacting protein ck : casein kinase cma: chaperone-mediated autophagy cvt: cytoplasm to vacuole targeting er: endoplasmic reticulum fip : focal adhesion kinase family interacting protein of kd fyco : fyve and coiled-coil domain-containing protein gabarap: gamma-aminobutyrate receptor-associated protein gate- : golgi-associated atpase enhancer of kda hdac : histone de-acetylase hops: homotypic fusion and protein sorting hsv- : herpes simplex virus- keap : kelch-like ech-associated protein lc : microtubule-associated protein (map )-light chain lir: lc -interacting region lrs: lc recognition sequences nbr : neighbour of brca gene ndp : nuclear dot protein (ndp) nf-κb: nuclear factor κb nix: nip-like protein x nrf : nf-e related factor pas: phagophore assembly site pb : phox and bem p international journal of cell biology pe: phosphatidylethanolamine pd: parkinson's disease pi k: phosphatidylinositol -kinase pi p: phosphatidylinositol -phosphate pka: protein kinase a pkc: protein kinase c ros: reactive oxygen species rubicon: run domain and cysteine-rich domain containing beclin -interacting protein smurf : smad-specific e ubiquitin protein ligase sumo: small ubiquitin-like modifier sqstm : p /sequestosome tbk : tank binding kinase tecpr : tectonin domain-containing protein traf : tumour necrosis factor receptor-associated factor tor: target of rapamycin tgn: trans-golgi network uba: ubiquitin associated ulk : unc- -like kinase uvrag: uv-resistance associated gen vps: vacuolar protein sorting. chaperone-mediated autophagy in protein quality control chaperone-mediated autophagy: molecular mechanisms and physiological relevance microautophagy in mammalian cells: revisiting a -year-old conundrum microautophagy of cytosolic proteins by late endosomes seeing is believing: the impact of electron microscopy on autophagy research eaten alive: a history of macroautophagy dynamics and diversity in autophagy mechanisms: lessons from yeast characterization of autophagosome formation site by a hierarchical analysis of mammalian atg proteins hierarchy of atg proteins in pre-autophagosomal structure organization autophagosome formation: core machinery and adaptations toward unraveling membrane biogenesis in mammalian autophagy ulk ·atg ·fip complex mediates mtor signaling and is essential for autophagy nutrient-dependent mtorcl association with the ulk -atg -fip complex required for autophagy ulk-atg -fip complexes mediate mtor signaling to the autophagy machinery a novel, human atg binding protein, atg , interacts with ulk and is essential for macroautophagy regulation mechanisms and signaling pathways of autophagy autophagy mediates the mitotic senescence transition tor-mediated induction of autophagy via an apg protein kinase complex an overview of the molecular mechanism of autophagy two distinct vps phosphatidylinositol -kinase complexes function in autophagy and carboxypeptidase y sorting in saccharomyces cerevisiae assortment of phosphatidylinositol -kinase complexes-atg p directs association of complex i to the pre-autophagosomal structure in saccharomyces cerevisiae beclin forms two distinct phosphatidylinositol -kinase complexes with mammalian atg and uvrag autophagic and tumour suppressor activity of a novel beclin -binding protein uvrag identification of barkor as a mammalian autophagy-specific factor for beclin and class iii phosphatidylinositol -kinase autophagy requires endoplasmic reticulum targeting of the pi -kinase complex via atg l two beclin -binding proteins, atg l and rubicon, reciprocally regulate autophagy at different stages amino terminus of the sars coronavirus protein a elicits strong, potentially protective humoral responses in infected patients bif- interacts with beclin through uvrag and regulates autophagy and tumorigenesis bif- regulates atg trafficking by mediating the fission of golgi membranes during autophagy beclin -binding uvrag targets the class c vps complex to coordinate autophagosome maturation and endocytic trafficking distinct regulation of autophagic activity by atg l and rubicon associated with beclin -phosphatidylinositol- -kinase complex the dynamic interaction of ambra with the dynein motor complex regulates mammalian autophagy ambra regulates autophagy and development of the nervous system an atg -containing compartment that functions in the early steps of autophagosome biogenesis apg p/cvt p is an integral membrane protein required for transport vesicle formation in the cvt and autophagy pathways membrane delivery to the yeast autophagosome from the golgi-endosomal system the atg -atg complex regulates atg and atg retrieval transport from the pre-autophagosomal structure the atg -atg complex is recruited to autophagic membranes via phosphatidylinositol -phosphate and exerts an essential function starvation and ulk -dependent cycling of mammalian atg between the tgn and endosomes recruitment of atg to the preautophagosomal structure by atg is essential for selective autophagy in budding yeast atg recruits atg to organize the pre-autophagosomal structure coordinated regulation of autophagy by p alpha mapk through matg and p ip evolution of atg function and regulation a ubiquitin-like system mediates protein lipidation a protein conjugation system essential for autophagy mammalian autophagy: core molecular machinery and signaling regulation the atg l complex specifies the site of lc lipidation for membrane biogenesis in autophagy the atg -atg conjugate has a novel e -like activity for protein lipidation in autophagy the atg conjugation system is indispensable for proper development of autophagic isolation membranes in mice snare proteins are required for macroautophagy atg , a ubiquitin-like protein required for autophagosome formation, mediates membrane tethering and hemifusion lc and gate- n termini mediate membrane fusion processes required for autophagosome biogenesis atg : an autophagy-related ubiquitin-like protein family lc and gate- /gabarap subfamilies are both essential yet act differently in autophagosome biogenesis studies on cellular autophagocytosis. the formation of autophagic vacuoles in the liver after glucagon administration purification and characterization of autophagosomes from rat hepatocytes ralb and the exocyst mediate the cellular starvation response by direct international journal of cell biology activation of autophagosome assembly quantitative analysis of autophagy-related protein stoichiometry by fluorescence microscopy autophagosome precursor maturation requires homotypic fusion molecular mechanisms and regulation of specific and nonspecific autophagy pathways in yeast plasma membrane contributes to the formation of pre-autophagosomal structures exit from the golgi is required for the expansion of the autophagosomal phagophore in yeast saccharomyces cerevisiae the golgi complex as a source for yeast autophagosomal membranes characterization of the isolation membranes and the limiting membranes of autophagosomes in rat hepatocytes by lectin cytochemistry the conserved oligomeric golgi complex is involved in double-membrane vesicle formation during autophagy studies on the mechanisms of autophagy: maturation of the autophagic vacuole studies on induced cellular autophagy. i. electron microscopy of cells with in vivo labelled lysosomes a subdomain of the endoplasmic reticulum forms a cradle for autophagosome formation d tomography reveals connections between the phagophore and endoplasmic reticulum autophagosome formation from membrane compartments enriched in phosphatidylinositol -phosphate and dynamically connected to the endoplasmic reticulum mitochondria supply membranes for autophagosome biogenesis during starvation the emergence of autophagosomes the origin of the autophagosomal membrane chemical modulators of autophagy as biological probes and potential therapeutics autophagy and cytokines vitamin d, vitamin d receptor, and macroautophagy in inflammation and infection guidelines for the use and interpretation of assays for monitoring autophagy in higher eukaryotes how shall i eat thee image-based genome-wide sirna screen identifies selective autophagy factors structural basis of target recognition by atg /lc during selective autophagy p /sqstm binds directly to atg /lc to facilitate degradation of ubiquitinated protein aggregates by autophagy structural basis for sorting mechanism of p in selective autophagy selective autophagy mediated by autophagic adapter proteins the structure of atg b-lc complex reveals the mechanism of lc processing and delipidation during autophagy autophagy-related protein (atg ) family interacting motif in atg mediates the atg -atg interaction and is crucial for the cytoplasm-to-vacuole targeting pathway fyco is a rab effector that binds to lc and pi p to mediate microtubule plus end-directed vesicle transport autophagy negatively regulates wnt signalling by promoting dishevelled degradation the selective acroautophagic degradation of aggregated proteins requires the pi p-binding protein alfy mechanism of cargo selection in the cytoplasm to vacuole targeting pathway the cvt pathway as a model for selective autophagy selective autophagy regulates insertional mutagenesis by the ty retrotransposon in saccharomyces cerevisiae autophagy in organelle homeostasis: peroxisome turnover autophagy: molecular machinery for self-eating cooperative binding of the cytoplasm to vacuole targeting pathway proteins, cvt and cvt , to phosphatidylinositol -phosphate at the pre-autophagosomal structure is required for selective autophagy atg is a phosphoinositide binding protein required for efficient lipidation and localization of atg during uptake of aminopeptidase i by selective autophagy peroxisome size provides insights into the function of autophagy-related proteins atg is a mitochondrial protein that confers selectivity during mitophagy mitochondria-anchored receptor atg mediates degrada-tion of mitochondria via selective autophagy ppatg tags peroxisomes for turnover by selective autophagy p /sqstm forms protein aggregates degraded by autophagy and has a protective effect on huntingtin-induced cell death p /sqstm and alfy interact to facilitate the formation of p bodies/alis and their degradation by autophagy the propeptide of aminopeptidase mediates aggregation and vesicle formation in the cytoplasm-to-vacuole targeting pathway a role for nbr in autophagosomal degradation of ubiquitinated substrates homeostatic levels of p control cytoplasmic inclusion body formation in autophagy-deficient mice the adaptor protein p /sqstm targets invading bacteria to the autophagy pathway autophagy protects against sindbis virus infection of the central nervous system midbody ring disposal by autophagy is a post-abscission event of cytokinesis ubiquitin signals autophagic degradation of cytosolic proteins and peroxisomes peroxisomal dynamics nix is critical to two distinct phases of mitophagy, reactive oxygen species-mediated autophagy induction and parkin-ubiquitin-p -mediated mitochondrial priming pink /parkinmediated mitophagy is dependent on vdac and p /sqstm delivery of cytosolic components by autophagic adaptor protein p endows autophagosomes with unique antimicrobial properties p targeting to the autophagosome formation site requires self-oligomerization but not lc binding alfy, a novel fyve-domain-containing protein associated with protein granules and autophagic membranes phosphorylation of the autophagy receptor optineurin restricts salmonella growth phosphorylation of serine on atg mediates mitophagy two mapk-signaling pathways are required for mitophagy in saccharomyces cerevisiae regulation of the autophagy protein lc by phosphorylation protein kinase c inhibits autophagy and phosphorylates lc the nterminus and phe residue of lc recruit p /sqstm into autophagosomes serine phosphorylation of p /sqstm regulates selective autophagic clearance of ubiquitinated proteins parkin is recruited selectively to impaired mitochondria and promotes their autophagy rnf , a novel mitochondrial ubiquitin e ligase, regulates autophagy through interaction with bnip the deacetylase hdac regulates aggresome formation and cell viability in response to misfolded protein stress hdac controls autophagosome maturation essential for ubiquitin-selective quality-control autophagy hdac rescues neurodegeneration and provides an essential link between autophagy and the ups acetylation targets mutant huntingtin to autophagosomes for degradation disease-causing mutations in parkin impair mitochondrial ubiquitination, aggregation, and hdac -dependent mitophagy autophagy: renovation of cells and tissues role of fip in cardiac and liver development and its regulation of tnfα and tsc-mtor signaling pathways promotion of tumorigenesis by heterozygous disruption of the beclin autophagy gene beclin , an autophagy gene essential for early embryonic development, is a haploinsufficient tumor suppressor impairment of starvation-induced and constitutive autophagy in atg -deficient mice the role of autophagy during the early neonatal starvation period atg a controls dsdnadriven dynamic translocation of sting and the innate immune response loss of the autophagy protein atg l enhances endotoxin-induced il- β production suppression of basal autophagy in neural cells causes neurodegenerative disease in mice neural-specific deletion of fip leads to cerebellar degeneration caused by increased neuronal death and axon degeneration loss of autophagy in the central nervous system causes neurodegeneration in mice the role of autophagy in cardiomyocytes in the basal state and in response to hemodynamic stress autophagy is required to maintain muscle mass suppression of autophagy in skeletal muscle uncovers the accumulation of ubiquitinated proteins and their potential role in muscle damage in pompe disease autophagy is important in islet homeostasis and compensatory increase of beta cell mass in response to high-fat diet loss of autophagy diminishes pancreatic β cell mass and function with resultant hyperglycemia autophagy influences glomerular disease susceptibility and maintains podocyte homeostasis in aging mice autophagy protects the proximal tubule from degeneration and acute ischemic injury inducible disruption of autophagy in the lung causes airway hyper-responsiveness autophagy regulates lipid metabolism autophagy in mammalian development and differentiation ref( )p, the drosophila melanogaster homologue of mammalian p , is required for the formation of protein aggregates in adult brain p is a common component of cytoplasmic inclusions in protein aggregation diseases persistent activation of nrf through p in hepatocellular carcinoma cells cullin -based polyubiquitination and p -dependent aggregation of caspase- mediate extrinsic apoptosis signaling p at the crossroads of autophagy, apoptosis, and cancer physical and functional interaction of sequestosome with keap regulates the keap -nrf cell defense pathway keap facilitates p -mediated ubiquitin aggregate clearance via autophagy p /sqstm is a target gene for transcription factor nrf and creates a positive feedback loop by inducing antioxidant response elementdriven gene transcription the selective autophagy substrate p activates the stress responsive transcription factor nrf through inactivation of keap a noncanonical mechanism of nrf activation by autophagy deficiency: direct interaction between keap and p recent advances in understanding the molecular basis of paget disease of bone autophagy regulation in cancer development and therapy autophagydeficient mice develop multiple liver tumors autophagy mitigates metabolic stress and genome damage in mammary tumorigenesis autophagy suppresses tumor progression by limiting chromosomal instability autophagy suppresses tumorigenesis through elimination of p oncogene-induced nrf transcription promotes ros detoxification and tumorigenesis autophagy inhibition compromises degradation of ubiquitin-proteasome pathway substrates biogenesis and cargo selectivity of autophagosomes ubiquitin accumulation in autophagy-deficient mice is dependent on the nrf -mediated stress response pathway: a potential role for protein aggregation in autophagic substrate selection nix is a selective autophagy receptor for mitochondrial clearance nix directly binds to gabarap: a possible crosstalk between apoptosis and autophagy essential role for nix in autophagic maturation of erythroid cells nix is required for programmed mitochondrial clearance during reticulocyte maturation the autophagy protein atg is essential for hematopoietic stem cell maintenance a critical role for the autophagy gene atg in t cell survival and proliferation autophagy is essential for mitochondrial clearance in mature t lymphocytes identification of atg -dependent transcriptional changes and increases in mitochondrial mass in atg -deficient t lymphocytes mechanisms of mitophagy pink stabilized by mitochondrial depolarization recruits parkin to damaged mitochondria and activates latent parkin for mitophagy pink is selectively stabilized on impaired mitochondria to activate parkin pink -dependent recruitment of parkin to mitochondria in mitophagy p /sqstm is required for parkin-induced mitochondrial clustering but not mitophagy; vdac is dispensable for both p /sqstm cooperates with parkin for perinuclear clustering of depolarized mitochondria ubiquitination-mediated autophagy against invading bacteria shigella phagocytic vacuolar membrane remnants participate in the cellular response to pathogen invasion and are regulated by autophagy recognition of bacteria in the cytosol of mammalian cells by the ubiquitin system listeria monocytogenes acta-mediated escape from autophagic recognition the tbk adaptor and autophagy receptor ndp restricts the proliferation of ubiquitin-coated bacteria a tecpr -dependent selective autophagy pathway targets bacterial pathogens escape of intracellular shigella from autophagy chronic oxidative stress sensitizes hepatocytes to death from -hydroxynonenal by jnk/c-jun overactivation autophagy in hypothalamic agrp neurons regulates food intake and energy balance cvt /gsa functions in sequestering selective cytosolic cargo destined for the vacuole mature ribosomes are selectively degraded upon starvation by an autophagy pathway requiring the ubp p/bre p ubiquitin protease cvt is a receptor for the cytoplasm-to-vacuole targeting pathway the authors thank shun kageyama (tokyo metropolitan institute of medical science) for helping in the creation of the figures used in the paper. f. reggiori key: cord- -s otdtwq authors: mandal, nakul; lewis, geoffrey p.; fisher, steven k.; heegaard, steffen; prause, jan u.; la cour, morten; vorum, henrik; honoré, bent title: proteomic analysis of the vitreous following experimental retinal detachment in rabbits date: - - journal: j ophthalmol doi: . / / sha: doc_id: cord_uid: s otdtwq purpose. the pathogenesis of rhegmatogenous retinal detachment (rrd) remains incompletely understood, with no clinically effective treatment for potentially severe complications such as photoreceptor cell death and proliferative vitreoretinopathy. here we investigate the protein profile of the vitreous following experimental retinal detachment using a comparative proteomic based approach. materials and methods. retinal detachment was created in the right eyes of six new zealand red pigmented rabbits. sham surgery was undertaken in five other rabbits that were used as controls. after seven days the eyes were enucleated and the vitreous was removed. the vitreous samples were evaluated with two-dimensional polyacrylamide gel electrophoresis and the differentially expressed proteins were identified with tandem mass spectrometry. results. ten protein spots were found to be at least twofold differentially expressed when comparing the vitreous samples of the sham and retinal detachment surgery groups. protein spots that were upregulated in the vitreous following retinal detachment were identified as albumin fragments, and those downregulated were found to be peroxiredoxin , collagen-iα fragment, and α- -antiproteinase f. conclusions. proteomic investigation of the rabbit vitreous has identified a set of proteins that help further our understanding of the pathogenesis of rhegmatogenous retinal detachment and its complications. rhegmatogenous retinal detachment (rrd) is characterized by the accumulation of subretinal fluid between the neurosensory retina and retinal pigment epithelium following the formation of a retinal break [ ] . the pathogenesis of rrd is complex and incompletely understood, involving age-related and/or inherited structural and molecular changes of the vitreous extracellular matrix and vitreoretinal interface, and the process of posterior vitreous detachment [ ] . the annual incidence of the condition has been estimated at . per , , and although primary surgical reattachment is successful in the great majority of cases, photoreceptor cell death, growth factors as a result of rrd significantly contributes to the pathogenesis of pvr, though the basic cause as well as a clinically effective therapeutic approach for this condition remains elusive [ , ] . proteomics studies proteins on a large scale in pursuit of a global and integrated view of disease processes at the protein level, which may potentially lead to the identification of novel biomarkers and therapeutic targets useful in clinical practice [ ] [ ] [ ] [ ] [ ] . rrd would likely be associated with alterations in the proteomic profiles of both the retina and vitreous. indeed, we initially undertook the first such retinal study from which a number of potentially important proteins were identified [ ] . the present study extends the proteomic investigation to the vitreous of this rabbit model of retinal detachment, building upon previous such analyses of human vitreous [ ] [ ] [ ] [ ] , in order to add further knowledge of the underlying pathophysiology [ , ] . inferior retinal detachment was created in the right eyes of six new zealand red pigmented rabbits. the eyes were normal with no evidence of disease on examination. combined injections of xylazine ( . mg/kg) and ketamine ( . mg/kg) were administered intramuscularly to induce anesthesia and analgesia. the pupils were dilated with topical drops of atropine and tropicamide ( % solutions). a pipette tip, with an external diameter of approximately m, was inserted into the eye through a pars plana incision. sodium hyaluronate (healon, . % in a balanced salt solution; pharmacia, piscataway, nj) was infused via a glass pipette between the neurosensory retina and retinal pigment epithelium. healon was necessary to prevent spontaneous retinal reattachment, and . % is the most dilute solution that maintains the detachment for extended periods. approximately % of the retina beneath the medullary rays, which included the central retina, was detached ( figure ). sham surgery was performed in the right eyes of five other rabbits that were used as controls, which involved surgical entry of the vitreous cavity without disruption of the retina. scleral incisions were closed with - nylon suture. seven days postoperatively the animals were euthanized by the administration of sodium pentobarbital ( mg/kg; butler schein, dublin, oh) and the eyes enucleated. after removal of the cornea and lens, the associated vitreous of the sham and detached retinas was extracted and immediately snap-frozen in liquid nitrogen within separate vials. there was no gross evidence of blood or other contamination of the vitreous samples at the time of tissue harvesting. the vitreous samples were stored at − ∘ c until further use. all of the animal experiments undertaken in this study were in accordance with the standards of the national institutes of health animal care and use committee protocols, the arvo statement for the use of animals in ophthalmic and vision research, and the guidelines of the animal resource center, university of california, santa barbara. the rabbit vitreous samples were homogenized and dissolved in a lysis buffer containing m urea, % (v/v) triton x- , % (v/v) immobilized ph gradient (ipg) buffer (ph - nonlinear), and % (w/v) dithiothreitol (dtt). the total protein content in each vitreous sample was determined with non-interfering protein assay (calbiochem, san diego, ca). the protein samples were stored at − ∘ c until further use. the extracted proteins were first fractionated by isoelectric focusing (ief) using ph - nonlinear cm ipg strips (ge healthcare, chalfont st. giles, buckinghamshire, uk). the ipg strips were rehydrated for h at room temperature in l lysis buffer each containing g protein from individual vitreous samples and l rehydration buffer ( m urea, % (w/v) -[( -cholamidopropyl)dimethylammonio]- -propanesulfonate (chaps), . % (w/v) dtt, and % (v/v) ipg buffer), using the immobiline drystrip reswelling tray (ge healthcare). the ief was undertaken on a multiphor ii electrophoresis system (ge healthcare) at v for h and v in two steps for h and . h in a gradient mode at ∘ c with the use of a multitemp iii thermostatic circulator (ge healthcare). before the second-dimension sodium dodecyl sulfate (sds) polyacrylamide gel electrophoresis (page), the ipg strips were equilibrated firstly for min with gentle agitation in ml of equilibration solution ( . % (w/v) tris-hcl, ph . , m urea, % (v/v) glycerol, % (w/v) sds, and . % (w/v) dtt) and secondly using . % (w/v) iodoacetamide and bromophenol blue. the ipg strips were then transferred to % polyacrylamide gels for electrophoresis, which was performed at a maximum voltage of v for approximately h to separate the proteins vertically on the basis of molecular mass. staining. the two-dimensional ( d) gels were silver stained using a protocol optimized for protein identification with mass spectrometry [ ] . in brief, the gels were fixed overnight in % (v/v) ethanol, % (v/v) acetic acid, and . % (v/v) formaldehyde. the gels were washed times for min in % (v/v) ethanol and pretreated for min in . % (w/v) na s o ⋅ h o. they were then rinsed in water and stained for min in . % (w/v) agno and . % (v/v) formaldehyde. following further rinsing with water, development was undertaken for approximately min in % (w/v) na co , . % (v/v) formaldehyde, and . % (w/v) na s o ⋅ h o. the development was arrested in a fixative solution of % (v/v) ethanol and % (v/v) acetic acid. the d gels were then dried between cellophane sheets and sealed in plastic envelopes. silver stained d gels were scanned on a gs- calibrated imaging densitometer (bio-rad, hercules, ca) using the quantity one program (bio-rad), and the pdquest software (bio-rad) was used to define, quantify, and match the protein spots on each of the d gels. all well-defined protein spots that were at least twofold (mann-whitney test, < . ) differentially expressed between the sham and retinal detachment vitreous groups were selected for identification with nanoliquid chromatographyelectrospray ionization tandem mass spectrometry (lc-ms/ms). the d gels were removed from their plastic envelopes and rehydrated in water. the selected protein spots were carefully excised from the gels with a scalpel and subjected to in-gel digestion with trypsin gold (mass spectrometry grade; promega, madison, wi). the peptide samples that were obtained were analyzed by lc-ms/ms as previously described [ ] . in brief, peptides generated by trypsin digestion were separated on an inert nano-lc system (lc packings, san francisco, ca) connected to a q-tof premier mass spectrometer (waters, milford, ma). the masslynx sp (waters) was used to obtain spectra and the raw data was processed using proteinlynx global server . (waters). the processed data were used to search the total part of the swiss-prot database using the online version of the mascot ms/ms ions search facility (matrix science, ltd.). the search was undertaken with doubly and triply charged ions with up to two missed cleavages, a peptide tolerance of ppm, one variable modification, carbamidomethyl-c, and a ms/ms tolerance of . da. contaminating peptides such as trypsin, keratin, bovine serum albumin, and all peptides originating from previous samples were disregarded. at least one "bold red" (matrix science ltd., http://www .matrixscience.com/) peptide match was required in the search for protein hits. individual peptide ions scores above approximately indicated identity or extensive homology giving a less than % probability that the observed match was a random event. all peptides for the protein hits are reported (table ) . blotting. in each case three micrograms of vitreous sample protein was separated on novex - % gradient tris-glycine polyacrylamide gels (invitrogen corporation, carlsbad, ca) and subsequently transferred to nitrocellulose hybond-c extra membranes (ge healthcare). the membranes were blocked overnight with % skimmed milk in mm na hpo , mm nah po , mm nacl, and . % tween buffer, ph . . membranes were incubated with anti-albumin (genway biotech, ca, usa; : ) and anti-peroxiredoxin (abcam, cambridge, uk; : ). no suitable antibodies were commercially available for the rabbit f isoform of - -antiproteinase or the rabbit collagen-i fragment that was identified with lc-ms/ms. following washing, the membranes were further incubated with appropriate horseradish peroxidase-conjugated secondary antibodies: p sheep and p mouse (both : ; dako, glostrup, denmark). proteins were visualized with the enhanced chemiluminescence system (ge healthcare) and imaging system (fujifilm las- , tokyo, japan). up to approximately protein spots were clearly resolved on each of the d gels. ten protein spots were found to be significantly and at least twofold differentially expressed between the sham and detachment vitreous groups (figure ). three protein spots were upregulated and seven spots were downregulated. from the three upregulated protein spots, two were identified as fragments of albumin (spots and ), whilst spot could not be identified. four of the seven downregulated protein spots were identified as fragment of collagen-i (spot ), - -antiproteinase f (spots and ), and peroxiredoxin (spot ). protein spots , , and could not be identified ( figure ; table ). western blotting developed with anti-albumin showed a heavy band at approximately kda, which is likely to represent the full length protein, whilst multiple bands below this suggest the presence of several fragments, some of which may correspond with those identified with the d-page analysis (figure , left) . peroxiredoxin has a deduced molecular mass of approximately kda. however, spot containing peroxiredoxin migrates with a molecular mass around kda with d-page (figure ) , and this size was verified by western blot analysis (figure , right). though a single and specific band was achieved with anti-peroxiredoxin , western blotting could not be reliably used for quantification due to a weak signal near the detection limit and variable background reaction. analysis with d-page revealed fragments of albumin to be upregulated in the vitreous following retinal detachment. albumin is the most abundant protein in plasma, aqueous, and vitreous humor, where in the latter it constitutes around - % of total protein [ ] [ ] [ ] . serum proteins such as albumin are present in the aqueous and vitreous humor at a relatively lower level compared to the vascular circulation from where they may have in part originated [ , ] . western blot analysis showed an intense band at approximately kda corresponding with the full length albumin protein, with multiple lower molecular mass bands that are likely its fragments. increase of albumin and its fragments may signify increased proteolysis and the passage of albumin into the vitreous. indeed, the breakdown of the blood-retinal barrier that occurs with retinal detachment has also been implicated in the increase of other such proteins in the vitreous [ , [ ] [ ] [ ] [ ] [ ] [ ] . it is also possible that albumin in the vitreous may arise from de novo synthesis in the retina, similar to the reported increased gene and protein expression of albumin in the corneal epithelium during wound healing [ , ] . extraocular albumin is known to have diverse and important functions, which include maintenance of colloid osmotic pressure, transport of biomolecules, and inactivation of toxins through intermolecular binding [ , ] . albumin can also act as an antioxidant by scavenging reactive oxygen species and sequestration of metal ions and has anti-inflammatory and apoptotic regulatory abilities [ ] [ ] [ ] [ ] . vitreal albumin has been proposed to transport long chain fatty acids into the lens for biosynthesis of lenticular lipids [ , , ] . indeed, albumin is likely to have many such important roles in the eye, which requires further investigation. in the present study we observed peroxiredoxin to have a molecular mass above kda using both d-page and western blot analyses. however, the predicted molecular mass of the peroxiredoxin family of proteins is approximately kda- kda. this variation may represent the well-studied property of these proteins to undergo oligomerization, which can be promoted by a number of factors including overoxidation of cysteine residues of peroxiredoxin [ ] . although the present experiments were conducted in standard reducing conditions that aim to break cysteine bonds, we obtained a band well above kda. this is in keeping with another study, which also showed some peroxiredoxin western blot bands appearing at molecular mass much higher than kda that was suggested to result from oligomerization or posttranslational modification [ ] . our finding could also represent a novel alternative splicing variant of peroxiredoxin , as reported for peroxiredoxin [ ] . the peroxiredoxins are a group of ubiquitous antioxidant proteins that currently comprise six members in mammals [ ] . these proteins are primarily found at high levels intracellularly, mainly within the cytosol, but are also present in the mitochondria, peroxisomes, and nuclei, and they may be exported [ ] . furthermore, presence of peroxiredoxin has been shown in plasma, not only as a result of hemolysis but also possibly by secretion from the t lymphocytes [ , ] . these multifunction enzymes act as antioxidants by using redox active cysteines for the reduction and degradation of hydrogen peroxide, peroxynitrite, and organic hydroperoxides [ , ] . oxidative stress is thought to result from an imbalance between reactive oxygen species production and antioxidant ability and is recognized to be an important factor in the pathogenesis of a number of age-related and neurodegenerative diseases, which include age-related cataract, agerelated macular degeneration, glaucoma, diabetic retinopathy, retinal detachment, and pvr [ ] [ ] [ ] [ ] . indeed, the present study showed a decrease in the vitreal levels of peroxiredoxin following retinal detachment. this may be in keeping with reported reductions in the levels of other members of the antioxidant defense system such as glutathione and ascorbic acid both in vitreal and in blood samples of patients suffering from pvr [ , ] . furthermore, apart from their role as antioxidants, the peroxiredoxins can affect a diverse range of biological processes that include cellular proliferation, differentiation, and apoptosis by influencing signal transduction pathways that employ hydrogen peroxide as a secondary messenger [ , ] . recent studies on tears from patients with glaucoma have also identified peroxiredoxin as having a possible involvement in inflammation [ , ] . indeed, peroxiredoxin and other members of this family of proteins are liable to have a significant role in the pathophysiology of retinal detachment. a fragment of collagen-i was identified in the vitreous of the rabbit; however, type i collagen has not previously been identified as a natural component of the mammalian vitreous and is rather known to be a constituent of early pvr membranes [ ] [ ] [ ] and retinal blood vessels [ , ] . a mixture of type ii, ix, and v/xi hybrid collagen fibrils, which are separated out mainly by water and ions attracted to hyaluronan, characterizes the vitreous body [ ] . collagen, possibly with the aid of adhesive-like intermediate molecules, may provide the basis of vitreoretinal adhesion by connecting the vitreous with the retinal inner limiting membrane (ilm). this attachment is extremely strong in the vitreous base since the fibrils pass through the ilm to merge into underlying collagen networks and crypts [ , , ] . collagen is also a significant component of both epiretinal and subretinal pvr membranes [ , ] , and type i collagen is recognized to be a principal constituent during their early development [ ] [ ] [ ] . the presence of collagen in the subretinal space, a place normally devoid of this protein, suggests that certain cells, particularly the rpe and müller cells associated with membranes, are able to synthesize collagen under certain pathological conditions such as retinal detachment and pvr [ ] [ ] [ ] [ ] . however, the present analysis suggests collagen-i fragment to be found in sham vitreous, which furthermore showed a decreased concentration following retinal detachment that may indicate perturbed proteolytic activity. matrix metalloproteinases (mmp) and other proteolytic enzymes that are able to degrade and remodel vitreal collagen have been found to be increased with rrd and pvr [ ] [ ] [ ] [ ] , which could be in keeping with the decrease in - -antiproteinase shown in the present study. further studies will be necessary to confirm the source and nature of collagen-i in the vitreous and the possible mechanisms of collagen fragmentation that may be an important feature of vitreous liquefaction and rrd [ , , ] . alpha- -antiproteinase. d-page showed - -antiproteinase (also called - -antitrypsin or - -proteinase inhibitor) at two closely positioned spots, which were largely in keeping with their predicted molecular mass but differing by their charge. currently, four isoforms of - -antiproteinase have been identified in the rabbit, termed f, s , s , and e, which is a similar picture to the multiple variants identified in humans [ , ] . alpha- -antiproteinase is an acute phase protein and archetypal member of the superfamily of serine protease inhibitors (serpin), which are involved in a wide range of biological processes that includes inflammation, angiogenesis, blood coagulation, ecm remodeling, and tumor suppression [ ] . this protein has the ability to inhibit a large number of serine proteases though its principle target is neutrophil elastase [ ] . indeed, - -antiproteinase originally received much attention because its deficiency increases the risk of a variety of clinical conditions, such as chronic obstructive pulmonary disease, which can result from unrestrained elastase activity. we found the f isoform of rabbit - -antiproteinase to be downregulated in the vitreous following retinal detachment. the f isoform of - -antiproteinase is the only one of the rabbit isoforms so far identified that has been shown to have the oxidizable methionine residue site that is present in human - -antiproteinase [ ] . the oxidation of methionine to methionine sulfoxide, which can occur during episodes of inflammation as a result of oxygen-free radicals secreted by leucocytes, has an inhibiting effect upon - -antiproteinase function. this process is thought to enhance the ability of proteinases such as elastase to locally degrade tissue debris that occurs at sites of inflammation [ , ] . alpha- -antiproteinase is primarily produced in the liver and circulated to the rest of the body tissues via the blood; however, extrahepatic sites of its synthesis have been identified, which include blood monocytes, alveolar macrophages, bronchial and gastrointestinal epithelial cells, and the cornea [ ] [ ] [ ] [ ] . the protein has also been localized to the tear film, aqueous humor, and vitreous, where in the latter a phosphorylated form of - -antiproteinase has been suggested as a potential biomarker of idiopathic macular hole and rhegmatogenous retinal detachment [ ] [ ] [ ] . it has been postulated that one of the main functions of corneal - -antiproteinase is to protect against the damaging effects of neutrophil elastase produced during corneal inflammation [ ] , and it may be expected that a similar role in addition to others is applicable to vitreal - -antiproteinase, though this requires further investigation. this proteomic investigation of the rabbit vitreous has identified a set of proteins that assist our understanding of the pathogenesis of rhegmatogenous retinal detachment and its journal of ophthalmology complications. further studies will be necessary to clarify the role of these proteins. certain proteins, such as those of low abundance and at the extremes of molecular mass, together with membrane proteins, can be difficult to resolve and detect using the d-page technique. therefore, complementary proteomic methods such as gel-free mass spectrometry should be considered in future work in order to help address these limitations. the authors alone are responsible for the content and writing of the paper. recent trends in the management of rhegmatogenous retinal detachment pathogenesis of rhegmatogenous retinal detachment: predisposing anatomy and cell biology the epidemiology and socioeconomic associations of retinal detachment in scotland: a two-year prospective population-based study cellular effects of detachment and reattachment on the neural retina and the retinal pigment epithelium epithelial-mesenchymal transition and proliferation of retinal pigment epithelial cells initiated upon loss of cell-cell contact a novel strategy to develop therapeutic approaches to prevent proliferative vitreoretinopathy ocular proteomics with emphasis on two-dimensional gel electrophoresis and mass spectrometry proteomics of uveal melanomas suggests hsp- as a possible surrogate marker of chromosome loss proteomic analysis of human vitreous associated with idiopathic epiretinal membrane analytical platforms in vitreoretinal proteomics dye-free porcine model of experimental branch retinal vein occlusion: a suitable approach for retinal proteomics protein changes in the retina following experimental retinal detachment in rabbits proteome profiling of vitreoretinal diseases by cluster analysis vitreous proteomic analysis of proliferative vitreoretinopathy elucidation of the pathogenic mechanism of rhegmatogenous retinal detachment with proliferative vitreoretinopathy by proteomic analysis proteomic analyses of the vitreous humour translational vitreous proteomics improved silver staining protocols for high sensitivity protein identification using matrix-assisted laser desorption/ionization-time of flight analysis components of the fibrinolytic system in the vitreous body in patients with vitreoretinal disorders proteome analysis of human vitreous proteins role of albumin as a fatty acid carrier for biosynthesis of lens lipids shifting the paradigm of the blood-aqueous barrier blood-aqueous barrier breakdown associated with rhegmatogenous retinal detachment risk factors for proliferative vitreoretinopathy levels of pentosidine in the vitreous of eyes with proliferative diabetic retinopathy, proliferative vitreoretinopathy and retinal detachment elevation of vitreous leptin in diabetic retinopathy and retinal detachment immunoglobulins in paired specimens of vitreous and subretinal fluids from patients with rhegmatogenous retinal detachment experimental retinal detachment. ix. aqueous, vitreous, and subretinal protein concentrations elevated albumin in retinas of monkeys with experimental glaucoma changes in albumin precursor and heat shock protein expression and their potential role in response to corneal epithelial wound repair all about albumin: biochemistry, genetics, and medical applications albumin: biochemical properties and therapeutic potential the anti-apoptotic activity of albumin for endothelium is mediated by a partially cryptic protein domain and reduced by inhibitors of g-coupled protein and pi- kinase, but is independent of radical scavenging or bound lipid mitochondria are the major targets in albumin-induced apoptosis in proximal tubule cells albumin is a major serum survival factor for renal tubular cells and macrophages through scavenging of ros in vivo passage of albumin from the aqueous humor into the lens structure, mechanism and regulation of peroxiredoxins peroxiredoxin and peroxidase enzymatic activity of mammalian spermatozoa peptides with dual binding specificity for hla-a and hla-e are encoded by alternatively spliced isoforms of the antioxidant enzyme peroxiredoxin cloning of bovine peroxiredoxins-gene expression in bovine tissues and amino acid sequence comparison with rat, mouse and primate peroxiredoxins plasma proteome of severe acute respiratory syndrome analyzed by twodimensional gel electrophoresis and mass spectrometry advances in our understanding of peroxiredoxin, a multifunctional, mammalian redox protein peroxiredoxins, oxidative stress, and cell proliferation interleukin- , nitric oxide and glutathione status in proliferative vitreoretinopathy and proliferative diabetic retinopathy determination of ascorbic acid in human vitreous humor by high-performance liquid chromatography with uv detection vitreous levels of oxidative stress biomarkers and the radical-scavenger -microglobulin/a m in human rhegmatogenous retinal detachment activation of signaling pathways and stress-response genes in an experimental model of retinal detachment peroxiredoxin evolution and the regulation of hydrogen peroxide signaling differential protein expression in tears of patients with primary open angle and pseudoexfoliative glaucoma shotgun proteomics reveals specific modulated protein patterns in tears of patients with primary open angle glaucoma naïve to therapy proliferative vitreoretinopathy: pathobiology, surgical management, and adjunctive treatment promotion of adhesion and migration of rpe cells to provisional extracellular matrices by tnf proliferative vitreoretinopathy membranes. an immunohistochemical study collagens and collagen-related matrix components in the human and mouse eye retinal microvessel extracellular matrix: an immunofluorescent study adult vitreous structure and postnatal changes structure and composition of the inner limiting membrane of the retina. sem on frozen resin-cracked and enzyme-digested retinas of macacca mulatta collagen formation by periretinal cellular membranes matrix and the retinal pigment epithelium in proliferative retinal disease variation in epiretinal membrane components with clinical duration of the proliferative tissue human retinal müller cells synthesize collagens of the vitreous and vitreoretinal interface in vitro enzymatic breakdown of type ii collagen in the human vitreous correlation of the extent and duration of rhegmatogenous retinal detachment with the expression of matrix metalloproteinases in the vitreous a prospective study of matrix metalloproteinases in proliferative vitreoretinopathy correlation of matrix metalloproteinase levels with the grade of proliferative vitreoretinopathy in the subretinal fluid and vitreous during rhegmatogenous retinal detachment age-related liquefaction of the human vitreous body: lm and tem evaluation of the role of proteoglycans and collagen various forms of rabbit plasma - -antiproteinase mammalian -antitrypsins: comparative biochemistry and genetics of the major plasma serpin the serpins are an expanding superfamily of structurally similar but functionally diverse proteins. evolution, mechanism of inhibition, novel functions, and a revised nomenclature expression of the alpha -proteinase inhibitor gene in human monocytes and macrophages biosynthesis of -proteinase inhibitor by human lung-derived epithelial cells regulation of -proteinase inhibitor release by proinflammatory cytokines in human intestinal epithelial cells corneal synthesis of -proteinase inhibitor ( -antitrypsin) alpha- -antitrypsin in aqueous humour from patients with corneal allograft rejection identification of phosphotyrosyl proteins in vitreous humours of patients with vitreoretinal diseases by sodium dodecyl sulphate-polyacrylamide gel electrophoresis/western blotting/ matrix-assisted laser desorption time-of-flight mass spectrometry the authors appreciate the excellent technical support pro- the authors report no conflict of interests. key: cord- -d szap authors: permyakova, n. v.; uvarova, e. a.; deineko, e. v. title: state of research in the field of the creation of plant vaccines for veterinary use date: - - journal: russ j plant physiol doi: . /s sha: doc_id: cord_uid: d szap transgenic plants as an alternative of costly systems of recombinant immunogenic protein expression are the source for the production of cheap and highly efficient biotherapeuticals of new generation, including plant vaccines. in the present review, possibilities of plant system application for the production of recombinant proteins for veterinary use are considered, the history of the “edible vaccine” concept is briefly summarized, advantages and disadvantages of various plant systems for the expression of recombinant immunogenic proteins are discussed. the list of recombinant plant vaccines for veterinary use, which are at different stages of clinical trials, is presented. for many thousands of years plants have served humanity as a source of medicinal substances. how ever, only at the turn of the century xxi with the appearance of dna technologies, it became possible to modify plant genomes and to create new types of plants (transgenic plants), which are capable of syn thesizing and accumulating in their tissues recombi nant proteins from various heterologous systems. to date transgenic plants in which nuclear and chloro plast genomes have been transformed with genes encoding heterologous proteins that are important in the treatment of various diseases -antigens of infec tious agents, antibodies, immunomodulators, etc. have been created [ , ] . of principal importance of this work development was the creation of the "edible vaccine" concept, the essence of which is the use of genetically modified plants containing protein anti gens of infectious agents for oral delivery of relevant antigens to the mucosa of the gastrointestinal tract of warm blooded animals. vaccination based on the programming of the spe cific mechanisms of warm blooded animal protection against pathogens is the most efficient method for the struggle against infectious diseases, which often result in a mass mortality. in agriculture, there is no alterna tive to livestock vaccination, because there are no anti viral drugs that are suitable for a wide use in animal husbandry. the importance of animal vaccination indirectly affects human health, because the use of vaccines significantly reduces the amount of pharma ceuticals in the food chain. as a rule, animal immune mechanisms are acti vated by the direct introduction of infectious agents or their components. at present, most of used vaccines are preparations on the basis of inactivated agents. although these vaccines manifest the high immunoge nicity, they are not without serious shortcomings. among such disadvantages are the increased sensitiv ity of the organism to them, the large load on the immune system, the reactogenicity of vaccines (side effects), their toxicity. etc. the application of molecular biology and genetic engineering methods opened wide prospects for the manufacturing of vaccines of new generation, which immunogenic components can be biological mole cules or their fragments. dna fragments or proteins of the infectious agent cell envelopes can serve as immu nogenic components. when the gene encoding the envelope protein of the infectious agent is transferred into the genome of another organism, for example, plant, then the cells of such plant will synthesize the protein antigen capable of formation of resistance to this agent. thus, the introduction into the organism not the whole pathogen but only its part, which is not capable of inducing infection development, will pro vide for the effect of vaccination. one fourth of the total pharmaceutical market of drugs for veterinary use, and it is constantly expanding. [ ] . preparation of medicinal substances for the pro duction of veterinary products is based on various approaches, including biotechnology using genetically modified (transgenic) organisms for these purposes; such expression systems as bacteria, yeast, cells of insects and mammals are used. the application of genetically modified plants with genes encoding phar maceutic proteins inserted in the genome opens new prospects for obtaining recombinant proteins, includ ing plant vaccines [ ] . this review is devoted to the analysis of possibilities of producing recombinant immunogenic proteins for veterinary use on the basis of plant expression systems, and the history of the concept of "edible vaccines" for animal immunization. of plant vaccines the idea of the usage of plant cells for the synthesis and accumulation of recombinant protein antigens was for the first time successfully realized in by c. arntzen and his colleagues [ ] . just this team of researchers not only demonstrated a possibility of the accumulation of the surface hbsag antigen of hepati tis b virus but also its capability for self assembling in the virus like particles in transgenic tobacco plants. the virus like particles isolated from plant tissues were identical to the particles of hbsag antigen of indus trial recombinant vaccine obtained in the yeast expres sion system and also to virus like particles from the blood plasma of patients infected with hepatitis b virus. thus, it became obvious that genetically modi fied plants producing and accumulating protein anti gens of various infectious agents can be used for oral delivery of corresponding agents to the mucosa of the gastrointestinal tract of warm blooded animals, i.e., as "edible vaccines." the next important step in the development of the "edible vaccine" concept on the basis of genetically modified plants was the creation of transgenic plants producing the heat labile enterotoxin of escherichia coli [ , ] and b subunit of the cholera toxin [ ] . heat labile toxin of e. coli consists of two parts: lt a (enzyme) and lt b (pentamer of receptor binding polypeptides). lt b binds to the receptors on the sur face of membranes of epithelial cells of the mammal small intestine and transports lt a in the intestinal cells, where it induces changes in the cell metabolism and cell dehydration. when the two parts of the heat labile enterotoxin are separated, the appearance of the lt b protein complex on the surface of epitheliocytes will stimulate a strong immune response of intestine mucosa without the appearance of any disease signs. just this feature was the basis for the research of c. arntzen [ ] team on the creation of plant vaccine providing for the resistance to enterotoxigenic e. coli toxins. the authors established that lt b synthesized in transgenic tobacco and potato plants and also lt b isolated from e. coli delivered orally to mice induced similar immune responses. later, lt b sequence was optimized for expression in plant cells and transferred into the potato genome [ ] . in potato tubers, the protein assembled correctly in oligomers and was accumulated in amounts suffi cient for the induction of the immune response at oral delivery to the organism. on the basis of clinical trials of the "candidate" plant lt b vaccine, it was estab lished that the consumption of raw potato tubers con taining . - mg lt b by volunteers resulted in the formation of serum and mucosal immune responses with high titers of antibodies [ ] . the initial concept of "edible vaccine" was heavily criticized by researchers who believed that in the aggressive medium of the gastrointestinal tract the recombinant protein should be destroyed. however, later it was experimentally established that the recom binant b subunit of the cholera toxin fused with green fluorescent protein (gfp) protected by the plant cellu lose cell wall at oral delivery is capable of passing through the gastrointestinal tract and reaching the antigen containing cells of the mouse intestine [ ] . the results obtained confirmed the possibility of using plants synthesizing protein antigens of various infec tious agents for oral delivery of antigens to the mucosa of the gastrointestinal tract and experimentally con firmed a consistency of the "edible vaccine" concept. it became evident that genetically modified plants can be used for the creation of plant vaccines as a raw mate rial, and separate plant parts (fruits, roots, berries, leaves, etc.) can be used directly in food without pre liminary heat treatment. response formation in the process of evolution, mammals developed secondary lymphoid mucosal tissue capable of antigen absorption, processing them, and using for the induc tion of the mucosal response. it was established that in this case both cellular and humoral immunity were formed. it is of importance that adaptive mucosal immunity can distinguish between usual food and symbiotic antigens and infectious agents [ ] . the scheme of the mechanism of mucosal immune response formation is presented in fig. . in the digestive tract, which is one of the pathways for the penetration of a variety of pathogens into organism, the main associated lymphoid tissue is the peyer's patches. peyer's patches, inductive sites of the intestine, which contain the dome, underlying the fol licle (b zones with germinal center) and interfollicu lar region containing t cells. the surface of the dome is covered by a specialized follicle associated epithe lium containing the folded cells (m cells), which are able to absorb and transport antigens from the intesti nal lumen. after successful capture, the antigen is par tially cleaved and enters into dendritic cells (see fig. , step ). dendritic cells are especially important in the initiation of adaptive immune responses, since they migrate to the lymph nodes (see fig. , step ), and the mediators act in the development of various subpopu lations of t helper cells from naive t lymphocytes and also can interact with b lymphocytes (see fig. , step ). the activated b and t cells leave the peyer's patches and penetrate into the circulatory system. mature t helper cells then return to the mucosa sur pathogens (antigen) epithelium t helper cells face to function as effectors (see fig. , step ). t helper expressing interleukin (il ) increases the expression of the receptors of polymeric immunoglobulin (pig) and secretion of antigen spe cific iga (fig. , step ) . subsequent generation of mature plasma cells producing iga leads to the induction of antigen specific protection of local and distal mucosal surfaces. since the mucosal immune response has a general ized nature, oral (i.e., mucosal) vaccination is not only the immune response of the mucous membranes, but also the overall immune response of the organism [ ] [ ] [ ] [ ] . it is proved that vaccination via the surface of mucosa membrane can specifically activate the immune response to infection without the develop ment of such processes related to the disease as inflammation or toxicity [ , ] . of plant vaccines in comparison with traditional expression systems, plant systems are attractive to researchers in many ways, primarily due to the absence of the risk of plant cell infection with animal pathogens, viruses, prions, etc. plants are capable of the synthesis of most recom binant antigens with the same posttranslational modi fications as in animal cells [ ] . plant vaccines can play especially important role in the protection of ani mals against diarrheal diseases and diseases, which infectious agents penetrate into the organism through the mucosal tissues. modern techniques of genetic engineering allow to selectively direct the recombinant proteins expressed in plant cells to various plant organs (seeds, tubers, fruits, etc.) [ ] . this possibility greatly simplifies the large scale production of plant vaccines and reduces their cost. according to the experts, the final price of the product (recombinant protein) produced in a plant expression system will be much less than the price of a similar protein produced, for example, in mammalian cell culture [ ] . since recombinant proteins can be accumulated in the storage organs or seeds, they are able to be main tained without any changes and the loss of biological activity for a long time (months and years). it was established that the recombinant protein of cholera toxin b subunit remained stable in transgenic rice grains for at least months, when grains were stored under room temperature conditions [ ] . recombi nant protein antigens remained stable in rice grains during three years and provided for the formation of the protective immunity in mice against cholera agent or against enterotoxigenic e. coli [ ] ; in soybean seeds and soy milk, the preservation of antigen stabil ity was observed for four years [ ] .thus, grains of transgenic plants can be transported to the site of final destination without additional freezing and treatment, and this ensures the retention of activity of the recom binant protein activity, its stability, and the constancy of dosage. a somewhat different picture is observed when lyo philization is used as a method of the conservation of protein antigens synthesized by plant cells. it was established that the recombinant protein of the norovirus envelope retained its immunogenicity in tis sues of both lyophilized and air dried tomato fruits [ ] , but the tested samples differed in immunogenic ity. the immunogenicity of this recombinant protein in air dried tomato fruits was somewhat higher in comparison with lyophilized fruits. similar results were obtained in experiments on the lyophilization of potato tubers synthesizing the protein of norovirus envelope, e.g., the immunogenicity of lyophilized tubers was lower than that of fresh tubers [ ] . how ever, additional studies are required for the final solu tion of this question. despite these advantages, plant vaccines are not without disadvantages. one of them is differences in protein posttranslational modifications in plants and animals, e.g., in glycosylation of the recombinant pro tein [ ] . it is known that more than a half of proteins synthesized by eukaryotic cells are glycosylated and more than a third of currently applied biopharmaceu tics are glycoproteins [ ] . although the activity of most proteins does not depend on glycosylation, in some cases it may be critical. specific features of pro tein glycoforms can affect their folding, stability, trans portation, and changes in their functional activity and immunogenicity. examples of biopharmaceuticals, which functional activity depends on the specific gly coform, are erythropoietin, antibodies, blood anti gens, some interferons and hormones [ ] . scheme of the n glycan complex formation in plants and animals (humans, for example) is shown in fig. . the most significant difference in glycosylation is that the plant β , xylose is attached to the core mannose residue and α , fucose -to n acetylglu cosamine residue of the core glycan. in human cells xylose is not used at all in glycosylation and a proximal fucose residue is attached to glycans through the α , bond. it was established that sugar residues attached at posttranslational modifications of recom binant proteins in plant cells themselves were capable of exhibiting the immunogenicity. approximately in one quarter of patients with allergy symptoms, ige antibodies specific for complex glycans, which include xylose or fucose, were revealed [ ] . differ ences in glycosylation during posttranslational modi fications of recombinant proteins in plant and mam mal cells can be eliminated by genetic engineering techniques, which was successfully demonstrated on the moss physcomitrella patens, in which the genes encoding the enzymes β , xylosyltransferase and α , fucosyltransferase responsible, respectively, for xylosylation and fucosylation of proteins were switched off by the "knock down" method [ ] . it is known that when the foreign gene is inserted in the genome of the transgenic plant, the level of its expression depends on the site of its insertion, which determines the level of protein (antigen in particular) accumulation in plant tissues. in this connection, one of disadvantages of plant vaccines is the difficulty in the standardization of their dosage. it is at this stage the development of the "edible vaccine" concept has undergone substantial revision, since the possibility of oral delivery of the recombinant antigen with the raw plant material was untenable because of the variability of the recombinant protein content. according to the researchers involved in the development of "candi date" plant vaccines, the antigen dosage problem in plant tissues can be successfully solved by the intro duction of additional treatment of the plant material: its refinement (to equalize the concentration of the antigen), drying or lyophilization. it is also necessary to introduce an additional stage associated with the development of rapid methods for determining and monitoring the dosage of the recombinant antigen. after appropriate preparation, plant vaccines can be encapsulated, tableted, and used in practice under corresponding medical supervision [ , ] . thus, performed studies allow a suggestion that the plant vaccine based on the genetically modified plants is capable of inducing protective immunity and open new opportunities for the creation of low cost and easy handling vaccines against infectious diseases of animals. it is of importance that developed to date, highly effective methods of cultivation of agro eco nomically important plant species, as well as the seed production system for a particular culture make the plants attractive to be used as "biofactories" for man ufacturing cheap recombinant proteins for medical purposes. roplasts). among plant expression systems with stable integration of the transgene in the nuclear genome, a separate group includes the cultivated duckweed, microalgae, and cell culture systems in vitro: cell sus pensions, cultures of "hairy roots", moss protonemas, which cultivation conditions are a completely closed environment (bioreactors). promising is the transient expression system, in which the target gene is intro duced into plant cells and is expressed for a short period of time (several days), but is not integrated into the genome. each of these expression systems has its advantages and disadvantages; the main details of these systems are considered below. the most widely used system of heterologous gene expression, in particular for plant vaccine production, are transgenic plants with the stable integration of the transgene into the nuclear genome. the creation of these plants involves the transfer of foreign genes into the genome of plant cells using agrobacterium tumefa ciens or bioballistics and subsequent regeneration of transformed plants from these cells. being inserted into the nuclear genome, transgene becomes its resi dent part, is stably expressed, and is maintained in subsequent generations. using this expression system for producing recom binant proteins, including plant vaccines, is still ham pered by relatively low levels of transgene expression, which is, as a rule, less than % of total soluble protein (tsp) and also by its variability in different plant organs and tissues within a single plant and in different plants. most often, the variability in expression of the transgene is due to the random nature of its insertion into the nuclear genome (effect of position) and may be associ ated with partial or complete transgene inactivation [ , ] . experts believe that the use of plant expression sys tems for obtaining plant vaccines is economically bene ficial at the level of expression of a target gene, which allows the accumulation of recombinant protein in an amount not less than % of tsp [ ] . a lot of ways to increase the level of foreign gene expression in transgenic plants is developed; they are very fully discussed in the reviews [ , , ] . among them are the optimization of the codon composition of the target sequence, the usage of strong promoters, the addition of introns or regions of binding with the nuclear matrix (sar), and others. the search for tis sue and organ specific promoters, i.e., promoters providing for target gene expression in definite plant tissues or organs, is of a special interest. for example, the usage of promoters directing transcription of the target gene predominantly in the seeds can increase the yield of the target protein by an order or several fold: up to - % of tsp in rice grains and up to % of tsp in tobacco seeds [ ] . it is of importance that as compared with leaves, seeds contain less pro teases and much less water; therefore, recombinant protein is saved better. despite the fact that transgenic plants with stable transgene expression in the nuclear genome are used most widely, it is just this expression system that induces a cautious attitude of human society. one of the fears is associated with the possibility of transgene transfer from the cross pollinated plants into the genomes of wild relatives at growing biotechnological crops in open field. to solve this problem, researchers developed various agricultural technologies as well as fixing male sterility in transgenic plants to prevent unwanted cross pollination of cultivated and wild spe cies. examples of production of various immunogenic proteins using for this purpose transgenic plants with stable transgene integration into the nuclear genome are presented in table . chloroplasts are most attractive among plant expression systems. genetically modified plants with the stable transgene integration into the chloroplast genome were called transplastome plants. the specific organization of chloroplasts allows achieving a high dose of foreign gene in transplastome plants, which provides for the efficient production of the target pro tein. the record of the recombinant protein yield was achieved by transplastomic tobacco plants with the bacteriophage lysin gene plygbs, encoding a hydro lase of the bacterial cell wall, the level of which accu mulation in the leaves amounted to % of tsp [ ] . although in some cases, negative physiological changes were observed in plants with such high level of foreign gene expression, usually there were no devia tions in the development of such plants. problems of transplastome plant adaptation to the high level of for eign gene expression are discussed in the review of bally et al. [ ] . transgene delivery to chloroplasts is performed using bioballistics (gene gun); its integration into the chloroplast genome occurs via homological recombi nation [ ] . the advantage of chloroplast expression system is in the absence of the effect of position observed in the case of random pattern of transgene distribution in the nuclear genome. in plastids, there is no transgene splicing (inactivation); therefore, its expression is stably preserved in subsequent genera tions. due to the prokaryotic organization of chloro plast genome, there is a possibility of co expression of several genes within a single operon [ ] . an impor tant specificity of plastids is that they are inherited through the maternal line and usually are absent from pollen. therefore, as distinct from usual transgenic plants, transplastome plants are safe for environment because the uncontrolled spread of the transgene into other plants is prevented [ , ] . it should be noted that protein posttranslational modifications, e.g., assembling multimer proteins, the tmv-tobacco mosaic virus; camv-cauliflower mosaic virus; tsp-total soluble protein; (+) immune response is revealed at oral immunization; (++) immune response is revealed at oral immunization, animals did not die after virus infection. species of immunized animals are indicated, the way of antigen delivery is indicated in the cases when it was not oral; the amount of survived infected animals is indicated in the cases when the survival was less than %. no. formation of disulfide bridges, lipid modifications, etc., occur successfully in chloroplasts. it is estab lished that recombinant proteins synthesized in chlo roplasts do not differ from native ones in their func tional activities [ , ] . chloroplast attractiveness as the system of recombinant protein expression is in the fact that they are closed structures and it preserves the metabolites, which when released into the cytosol are toxic to plant cells, such as b subunit of cholera toxin [ ] and trihalose in tobacco cells [ ] . on the basis of the above works, it becomes appar ent that transplastome plants can be regarded as the most promising system for efficient production of pro tein antigens. however, the main disadvantage of such expression system at plant vaccine manufacturing is that chloroplasts cannot glycosylate proteins. at present, the creation of transplastome plants is also associated with some technological problems related to the absence of the efficient system of regeneration for most plant species and also with the absence of the efficient system of transgen delivery to the chloroplast genome providing for the high percent of transformed plant yield. the suspension cell cultures on the basis of geneti cally modified plants attract the attention of research ers as promising potential systems for biopharmaceu tics production. such cultures can be obtained from loose callus tissues induced from genetically modified explants or on the basis of co culturing of the cell sus pension and a. tumefasciens. after the assessment of growth characteristics and cell line screening to obtain promising lines capable of the accumulation of great amounts of recombinant proteins, such lines can be cultivated in bioreactors for target protein obtaining. an attractive feature of the cell cultures as expres sion systems for recombinant protein production, as compared to the use of whole plants for this purpose, is that cell culture can be unified in growth character istics, cell dimensions and types. moreover, cells are grown under strictly controlled conditions, when the product accumulation does not depend on the sea sonal weather changes and allows the permanent product obtaining in bioreactors. the additional insertion of signal peptide nucleotide sequences into the construct permits a protein secretion into the intercellular space, which allows target protein isola tion directly from the culture liquid. the addition of recombinant protein stabilizers to the suspension cul ture increases the yield of the target protein [ ] . by their capabilities, plant cell cultures are compa rable in production of therapeutic recombinant pro teins with the conventionally used mammalian cell cultures, such as chinese hamster ovary cells. how ever, as distinct from the mammal suspension cultures, they are not infected by any animal pathogens. to data, there are many examples of plant cell cultures with the yield of recombinant protein in the amount more than mg/l, which is a threshold value for starting the commercial manufacture of the product [ ] . an example of a commercially successful produc tion of veterinary vaccine products is developed by dow agro sciences company (united states) system concert™, patented as an effective and safe system for the production of vaccine proteins in cultured plant cells, cultured in a bioreactor. the first plant vaccine against newcastle disease virus of birds obtained in the tobacco cell culture was approved for use by the min istry of agriculture of the united states in . the disadvantages of this expression system are still insufficiently high yield of the target recombinant pro tein and the instability of foreign genes in cultured plant cells due to the epigenetic silencing of the trans gene transcription [ ] . advantages, disadvantages, and specific features of recombinant protein produc tion in the plant cell cultures are discussed in reviews [ , ] . there are many examples of successful use of aqueous plants, such as duckweed, unicellular algae, and mosses, which are cultivated similarly as plant cell suspensions in bioreactors, for recombinant pro tein expression. duckweed attracts the attention of researchers as a potential highly efficient system for recombinant pro tein expression due to its capability of a rapid biomass accumulation: it can be doubled for - h. geneti cally modified duckweed as a potential producer of biopharmaceutic proteins can be used by animals as a raw or dried food. duckweed is a monocotyledonous angiosperm; for foreign gene transfer into its genome, the methods of agrobacterial transformation and bioballistics are used. examples are known when genetically modified duckweed accumulated recombi nant protein in the amounts up to % of tsp, as assessed after the accumulation of gfp [ ] . sequenc ing the duckweed chloroplast genome is close to com pleting, which opens up some prospects for a signifi cant increase in the yield of recombinant proteins. the systems of biopharmaceuticals production using genetically modified microalgae are actively developed. algae combine advantages of both bacteria (rapid growth and simplicity of cultivation) and higher plants (a capability of posttranslational modifications and photosynthesis). chlamydomonas reinhardtii is most promising among algae: it has a short time of bio mass doubling (about h), it is easily subjected to nuclear and chloroplast transformation, it can be grown under photoautotrophic conditions or with the addition of acetate as the source of carbon. nuclear transformants usually give rather low yield of protein product; therefore, recombinant protein production by this alga is based on the transformation of chloro no. plast, which occupies about % of the cell volume [ ] . c. reinhardtii nuclear and chloroplast genomes are sequenced, and this simplifies substantially any genetic engineering manipulations. known examples of protein antigen production in chloroplasts of c. reinhardtii are b subunit of cholera toxin fused with the coat protein of foot and mouth disease virus [ ] or with d fibronectin binding domain of staphylococ cus aureus [ ] , as well as protein virus cryptokary osis (shrimp disease) [ ] and e protein of swine fever [ ] . the green moss physcomitrella patens is the only representative of bryophytes, the genome of which is currently completely sequenced and approaches to its transformation are developed. the peculiarity of this moss is that at the stage of the haploid juvenile game tophyte (protonema) this moss is morphologically similar to filamentous algae and easily enough culti vated in a bioreactor [ , ] . under certain culture conditions the moss can be in the stage of protonema indefinitely long. the attractiveness of this moss spe cies as the system for recombinant protein expression is that, as distinct from plants, fragments of foreign dna can be integrated in its genome through homo logical recombination, which reduces substantially a possibility of transferred gene inactivation. the p. pat ens cells are capable of postranslational modification of proteins of eukaryotic origin. since at the step of gametophyte the moss has the haploid number of chromosomes, it becomes possible to modify the func tioning of individual genes, in particular the moss lines conducting glycosylation of recombinant proteins as in mammalian cell type were obtained [ ] . the firm grenovation (germany) is developing the technology of biopharmaceutical protein production on the basis of p. patens in bioreactors. expression system for pro ducing biopharmaceuticals based on the green moss is not the part of the food chain and is characterized by a high degree of biosafety. as distinct from above described systems based on the stable expression of foreign genes integrated into nuclear or chloroplast genomes, during transient expression target proteins are synthesized in the plant cell during relatively short time (several days) without insertion into the plant genome. at present, the fol lowing approaches are used for transient gene expres sion in plants: gene delivery with the help of agrobac terium, the use of plant virus vectors, and magnifec tion [ ] [ ] [ ] . tobacco mosaic virus (tmv), potato x virus, alfalfa mosaic virus, and cowpea mosaic virus are used as virus vectors [ ] . the availability of infec tious cdna clones, the small size of the viral genomes, the short time required for the expression of a target gene, and a high level of expression provides for a high attraction of this system. the rapid development of this expression system led to substantial modifications of the first gene inser tion vectors or full virus vectors, which is a recombi nant virus that behaves as wild type virus but is capable of expressing additional genes. the next step was the creation of "disarmed vectors" (deconstructed vec tors) lacking a number of original virus genes, and gene replacement vectors, in which a portion of the viral genes is replaced by alien genes [ ] . viral vectors have several substantial disadvantages: a tendency to the loss of foreign insertion in the process of virus spreading over the plant and a potential risk for envi ronment related to the presence of infectious recombi nant viral particles. launch vectors represent an alternative to recom binant plant viruses; cdna of these viruses is deliv ered to plants within t dna region of agrobacterial ti plasmid. firstly, primary transcription of t dna occurs in the nucleus; then viral rna is released into the cytoplasm, where its further amplification, trans lation, and protein synthesis occur [ ] . by , the system of agroinfiltration based on the use of plant viruses and agrobacterial binary plasmids was upgraded and named as magnifection [ ] . at magnifection, multiple agrobacterial lines carrying different parts of the tmv genome are used simulta neously. after agrobacterial transfer into the plant cell nucleus, separate parts of viral genome are assembled in plants in the completely functional viral replicon [ ] . the substantial modification of the viral genome, including numerous point mutations for the removal of potential sites of splicing, intron insertion, and the removal of the gene encoding envelope proteins, pro vided for the highly efficient system capable of recom binant protein synthesis (up to g/kg of fresh tissue), which is more than % of tsp [ ] . among disadvantages of transient system is a necessity for recombinant protein isolation and purifi cation immediately after its accumulation in the plant, because, as distinct from seeds and fruits, plant leaves and stems cannot be stored for a long time. the systems of transient expression of recombinant proteins are rather promising in the cases when a rapid production of a small amount of proteins is required. experiments with transient expression in plants are held indoors, which reduces the risks associated with biosafety to almost zero. examples of manufacturing immunogenic proteins for veterinary using transient expression systems are presented in table . the sys tems of transient expression for the production of recombinant proteins are described in more details in reviews [ , , , , ] . "candidate" plant vaccines for veterinary use table presents examples of using various expres sion systems for the production of "candidate" plant vaccines for veterinary use. main specific immunogenic no. proteins synthesized at respective diseases (structural proteins, hemagglutinins, glycoproteins) are usually used as antigens. the most commonly used method of transgene construct delivering into plant cells is still the agrobacterial transformation. in some cases, the level of target protein expression was rather high [ ] [ ] [ ] [ ] , espe cially in the systems of transient expression [ ] [ ] [ ] , and suitable for product commercialization. in all experiments using the "candidate" plant vac cines, the formation of a specific immune response was demonstrated in vivo, and in most experiments immunogenic proteins were delivered to animals just orally. "candidate" plant vaccines were usually tested on mice, but in approximately a quarter works the ani mals subjected to the disease were tested. protein s of the transmissible swine gastroenteritis virus synthe sized in the cells of transgenic maize [ , , ] or tobacco [ ] and delivered into the body of pigs as a food supplement, provided for % survival of ani mals after infection [ ] . rabbit protein vp virus synthesized in potato [ ] and other plants (tobacco, pea, rape) [ ] and delivered orally enhanced protective immunity: after infecting rabbits with this virus all ani mals survived [ ] . the effect of plant recombinant antigen was comparable with commercially used vac cines. the use of plant vaccines for the vaccination of wild animals using edible baits (e.g., vaccine against rabies) will lead to an increase in the proportion of wild animal populations having immunity to the rabies virus. a potential possibility to reduce the cost of produc tion of biopharmaceutics using genetically modified plants served at the end of the xx century as an impe tus for more than twenty biotechnological companies to initiate commercial programs. as seen from the table , many biological products of plant origin are developed, expressed in different types of plants and plant cell cultures. for a variety of reasons, including the still skeptical attitude of the human community to the biosafety of genetically modified plants, many of these works remained in the framework of laboratory tests. at the moment three companies function on the biotechnology market of veterinary preparations, two from the united states and one from canada (table ). in the united states the dow agro sciences company presented a recombinant plant viral hn protein of the newcastle disease virus (approved by usda) and a mixture of antiviral vaccines at the first stage of clinical trials. the second american company at thomas jef ferson university has developed a plant anti rabies vaccine (completion of phase ). the canadian guardian biosciences company presented plant vac cine against chicken coccidiosis at the second phase of clinical trials. production and wide distribution of biopharmaceu ticals is hampered by a number of circumstances. the first of them is related to the problem of biosafetythe cultivation of genetically modified plants in the field can lead to the accidental introduction of foreign genes into crops grown for human consumption. therefore, most companies producing biopharma ceutics focused on plant species, which are absent from the food chain of humans and animals and also on growing of genetically modified plants preventing their cross pollination with other crops. the second difficulty is related to the necessity of plant material treatment for the removal of various undesired com pounds, such as lignin, proteases, phenolic com pounds, and pigments, especially in the case of plant species, which are not consumed. all these facts result in the requirement of additional studies. the third cir cumstance is due to the fact that until now all aspects of maintaining and growing of plants producing biop harmaceuticals are not settled at the legislative level. ambiguity and vagueness of the existing legislation in this area lead to the fact that large biopharmaceutical and biotechnology companies do not tend to invest in the development of technological lines and research programs in this area, which significantly inhibits the development of the industry. a significant problem for the development of vac cines for veterinary use, especially those used in agri culture, is the need to minimize the price of the final product. the vaccine should be inexpensive for entre preneurs engaged in commercial animal breeding and fully subsidized, if you intend to use the vaccine for mass immunization and the prevention of the disease spread in underdeveloped regions. as a result, the potential income of manufacturers of vaccines for ani mals is much less than that for vaccines intended for humans. for example, in , the volume of the mar ket for the vaccine against human papilloma virus was estimated at more than billion dollars, but the mar ket for the most popular animal vaccines (against foot and mouth disease of cattle and against mycoplasma hyopneumoniae in pigs) together amounted to only - % of this amount [ ] . animal vaccines are cheaper and the volume of market for them is less; therefore, the investments in their development are substantially less as compared with investments in the production of vaccines for humans, whereas the com plexity and diversity of both hosts and pathogens in the case of vaccines for animals is much higher. expression systems based on the use of plant cells still have a limited application or are used primarily in some laboratories. nevertheless, biopharming (the biotechnological production of various substances for medicine) in plants has attracted the attention of researchers and manufacturers in developed countries. first biopharmaceuticals of plant origin, such as anti bodies (anti hbsag required to purify the hepatitis b vaccine), therapeutic and dietary proteins ("intrinsic factor" required at vitamin b deficiency, gastric lipase), have entered the market, that is an excellent illustration of this progress [ ] . despite the fact that today the number of biophar maceutical proteins expressed in plant cells is enor mous, many questions still remain unresolved. the methods for target recombinant protein quantification and purification are still not developed for most of products. the problems of transgene silencing and increased expression of target protein genes are still at the stage of research. the important task that has yet to be solved is to achieve a stable level of expression in different batches of plant raw material. not much work appeared for judging about maintaining the sta bility of recombinant proteins after the harvest, pro cessing, and storage. all of these problems require additional expenses for further research. one of the main obstacles for the leading research groups working on the development and production of plant vaccines, given the financial constraints, is the fulfillment of the relevant official regulations govern ing the use of oral medications. to date, the purified vaccines and therapeutic proteins of plant origin must meet the same standards relating to the production, biosafety, purification, storage, dosage, etc. as any other recombinant proteins for medical purposes. nevertheless, despite these difficulties, there were first biopharmaceuticals of plant origin that passed all the necessary tests and were approved for use by the relevant authorities. some new products having spe cific advantages over similar products obtained in mammalian cell cultures were developed. such com panies as sembiosys genetics inc. (calgary, can ada), medicago inc. (quebec, canada), protalix bio therapeutics (karmiel, israel), and orf genetics (iceland) proved the possibility of quick establishing of the production of purified plant proteins, which are quite competitive in today's market. progress has been made in the formation of the legal framework related to the cultivation of transgenic plants, testing and use of plant biopharmaceuticals. several production pro cesses based on transgenic plants have already received a brand gmp (good manufacturing practice), the interest of manufacturers to this field of biotechnology began to increase again. this work was performed within the framework of the project vi. . . (no. ) development and improvement of genetic constructs to optimize the expression of target genes and the production of recombi nant proteins for medical purposes in transgenic plants and animals. plant produced vaccines: promise and reality evolution of plant made pharmaceu ticals current status of veterinary vac cines clinical trials fuel the promise of plant derived vaccines expression of hepatitis b surface antigen in transgenic plants oral immunization with a recombinant bacterial anti gen produced in transgenic plants edible vaccine protects mice against escher ichia coli heat labile enterotoxin (lt): potatoes express ing a synthetic lt b gene effi cacy of food plant based oral cholera toxin b subunit vaccine immunogenicity in humans of a recombinant bacte rial antigen delivered in a transgenic potato receptor mediated oral delivery of a bioencapsulated green fluorescent protein expressed in transgenic chlo roplasts into the mouse circulatory system delivery of plant made vaccines and therapeutics defending the mucosa: adjuvant and carrier formula tions for mucosal immunity induction of secretory immunity and memory at mucosal surfaces, vaccine oral delivery of human biopharmaceuti cals, autoantigens and vaccine antigens bioencapsu lated in plant cells the mucosal immune response to plant derived vaccines posttranslational modifica tion of therapeutic proteins in plants seed based expression systems for plant molecular farming the economic potential of plant made pharmaceuticals in the manu facture of biologic pharmaceuticals rice based mucosal vac cine as a global strategy for cold chain and needle free vaccination secretory iga mediated protection against v. cholerae and heat labile enterotoxin producing enterotoxigenic escherichia coli by rice based vaccine stability of a soybean seed derived vaccine antigen following long term storage, processing and transport in the absence of a cold chain tomato is a highly effective vehicle for expression and oral immunization with norwalk virus capsid protein production of plant made pharma ceuticals: from plant host to functional protein post translational modifica tions in the context of therapeutic proteins current achievements in the production of complex biopharmaceuticals with moss bioreactors low dose oral immunization with lyophilized tissue of herbicide resistant lettuce expressing hepatitis b surface antigen for prototype plant derived vaccine tablet formulation rna mediated chromatin based silencing in plants transcriptional gene silencing in plants plant based production of biopharma ceuticals production of heterologous proteins in plants: strategies for optimal expression expression of heterologous genes in plant systems: new possibilities exhaustion of the chloroplast protein synthesis capac ity by massive expression of a highly stable protein anti biotic metabolic adaptation in transplastomic plants massively accumu lating recombinant proteins transplastomic plants overexpression of the bt cry aa operon a ¸n´l in chloroplasts leads to formation of insecticidal crys tals chloroplast vector systems for biotechnology applications determining the transgene containment level provided by chloroplast transformation expression of the native cholera toxin b subunit gene and assembly as functional oligomers in transgenic tobacco chloroplasts chloroplast expression of his tagged gus fusions: a general strategy to over produce and purify foreign proteins using transplas tomic plants as bioreactors accumulation of trehalose within transgenic chloro plasts confers drought tolerance plants as bioreactors for the production of vaccine anti gens towards high yield production of pharmaceutical proteins with plant cell suspension cultures position effects and epigenetic silencing of plant transgenes bioreactor systems for in vitro production of foreign proteins using plant cell cultures high expression of transgene protein in spirodela micro algae come of age as a platform for recombinant protein production foot and mouth disease virus vp pro tein fused with cholera toxin b subunit expressed in chlamydomonas reinhardtii chloroplast heat stable oral alga based vaccine pro tects mice from staphylococcus aureus infection factors effecting expression of vaccines in microalgae recombination and expression of classical swine fever virus (csfv) structural protein e gene in chlamydomonas rein hardtii chroloplasts viral vec tors for the expression of proteins in plants agrobacterium mediated transient expression as an approach to production of recombi nant proteins in plants a novel two component tobacco mosaic virus based vector system for high level expression of multiple ther apeutic proteins including a human monoclonal anti body in plants magnifec tion -a new platform for expressing recombinant vac cines in plants plant based vaccines: unique advan tages expression of the newcastle disease virus fusion protein in transgenic maize and immunological studies multimerization of peptide antigens for pro duction of stable immunogens in transgenic plants generation and immunogenicity of japanese encepha litis virus envelope protein expressed in transgenic rice induction of protective immunity in swine by recombinant bamboo mosaic virus express ing foot and mouth disease virus epitopes in planta production of two peptides of the classical swine fever virus (csfv) e glycoprotein fused to the coat protein of potato virus x expression in plants and immunogenicity of plant virus based exper imental rabies vaccine delivery of subunit vaccines in maize seed a corn based deliv ery system for animal vaccines: an oral transmissible gastroenteritis virus vaccine boosts lactogenic immu nity in swine immunogenicity of porcine transmissible gastroenteritis virus spike protein expressed in plants oral immunization using tuber extracts from transgenic potato plants expressing rabbit hemorrhagic disease virus capsid protein pea derived vaccines demonstrate high immunoge nicity and protection in rabbits against rabbit haemor rhagic disease virus plant made pharmaceuti cals: leading products and production platforms pro duction of immunogenic vp protein of bovine group a rotavirus in transgenic potato plants rotavirus vp expressed by pvx vectors in nicotiana benthamiana coats pvx rods and also assembles into viruslike parti cles expression of rotavirus capsid protein vp in transgenic potato and its oral immuno genicity in mice protective lactogenic immunity conferred by an edible peptide vaccine to bovine rotavirus produced in transgenic plants bovine herpes virus gd protein pro duced in plants using a recombinant tobacco mosaic virus (tmv) vector possesses authentic antigenicity induction of a protective antibody response to foot and mouth disease virus in mice fol lowing oral or parenteral immunization with alfalfa transgenic plants expressing the viral structural protein vp induction of a protective antibody response to fmdv in mice following oral immunization with transgenic stylosanthes spp. as a feedstuff additive expression of hemagglutinin protein of rinderpest virus in transgenic tobacco and immunogenicity of plant derived protein in a mouse model systemic and oral immunogenicity of hemagglutinin protein of rinderpest virus expressed by transgenic peanut plants in a mouse model expression of hemagglutinin protein of rinderpest virus in trans genic pigeon pea oral immunogenicity of the plant derived spike protein from swine transmissible gastroenteritis coronavirus cloning and sequence analysis of the korean strain of spike gene of porcine epidemic diarrhea virus and expression of its neutraliz ing epitope in plants successful oral prime immunization with vp from rabbit haemorrhagic disease virus pro duced in transgenic plants using different fusion strate gies mucosal and sys temic immunization elicited by newcastle disease virus (ndv) transgenic plants as antigens expres sion of the fusion glycoprotein of newcastle disease virus in transgenic rice and its immunogenicity in mice expression of immunogenic s glycoprotein of infectious bronchitis virus in trans genic potatoes transient expression of the ectodomain of matrix protein (m e) of avian russian immunization with plant expressed hemagglutinin protects chickens from lethal highly pathogenic avian influenza virus h n challenge infection immunogenicity study of plant made oral subunit vaccine against porcine reproductive and res piratory syndrome virus (prrsv) expression of the rabies virus glycoprotein in transgenic tomatoes immunization against rabies with plant derived antigen development of an edible rabies vaccine in maize using the vnukovo strain induction of a protective immune response to rabies virus in sheep after oral immunization with transgenic maize, expressing the rabies virus glycoprotein expression of the rabies virus nucleoprotein in plants at high levels and evaluation of immune responses in mice expression of rabies virus g pro tein in carrots (daucus carota) key: cord- -li sc z authors: ma, jingjiao; wu, rujuan; xu, guanlong; cheng, yuqiang; wang, zhaofei; wang, heng’an; yan, yaxian; li, jinxiang; sun, jianhe title: acetylation at k of the ns protein is important for the replication and virulence of influenza virus date: - - journal: vet res doi: . /s - - - sha: doc_id: cord_uid: li sc z non-structural protein (ns ) of influenza virus is a multifunctional protein that plays an important role in virus replication and virulence. in this study, an acetylation modification was identified at the k residue of the ns protein of h n influenza virus. to further explore the function of the k acetylation modification of the ns protein, a deacetylation-mimic mutation (k r) and a constant acetylation-mimic mutation (k q) were introduced into the ns protein in the background of a/wsn/ h n (wsn), resulting in two mutant viruses (wsn-ns - r and wsn-ns - q). in vitro and mouse studies showed that the deacetylation-mimic mutation k r in the ns protein attenuated the replication and virulence of wsn-ns - r, while the constant acetylation-mimic mutant virus wsn-ns - q showed similar replication and pathogenicity as the wild-type wsn virus (wsn-wt). the results indicated that acetylation at k of the ns protein has an important role in the replication and virulence of influenza virus. to further explore the potential mechanism, the type i interferon (ifn-i) antagonistic activity of the three ns proteins (ns - q, ns - r, and ns -wt) was compared in cells, which showed that the k r mutation significantly attenuated the ifn-β antagonistic activity of the ns protein compared with ns -wt and ns - q. both ns -wt and ns - q inhibited the ifn-β response activated by rig-i card domain, mavs, tbk , and irf more efficiently than the ns - r protein in cells. taken together, the results indicated that acetylation at ns k is important for the ifn antagonistic activity of the ns protein and virulence of the influenza virus. influenza virus non-structural protein (ns ) is a multifunctional protein that is responsible for interacting with cellular factors to antagonize the host antiviral response during viral infection [ ] . the major role of the ns protein is inhibition of both interferon (ifn) and ifn-stimulated proteins by different mechanisms. ns inhibits the transcription of type i ifn by binding the ′ triphosphate viral double-stranded rnas generated during viral replication to prevent the recognition of viral genomic material by host pattern recognition receptors (prrs), including rig-i, dsrna-dependent protein kinase r (pkr), and ′ ′-oligoadenylate synthetase (oas)/rnase l [ ] [ ] [ ] . ns can also interact directly with rig-i in the absence of rna binding to inhibit the conformational change of rig-i required for mavs activation [ ] . moreover, ns is able to interrupt mrna maturation by inhibiting the nuclear export of host mrnas by binding to host poly(a)-binding protein ii (pabpii) and cleavage and polyadenylation specific factor (cpsf ), which are required for host mrna processing, resulting in the accumulation of ifn pre-mrnas in the nucleus of infected cells [ ] . in addition, ns also antagonizes the open access *correspondence: lijinxiang@caas.cn; sunjhe@sjtu.edu.cn shanghai key laboratory of veterinary biotechnology, key laboratory of urban agriculture (south), ministry of agriculture, school of agriculture and biology, shanghai jiao tong university, shanghai , china chengdu national agricultural science and technology center, sichuan, china full list of author information is available at the end of the article ifn signalling response by regulating other host factors, such as phosphoinositide -kinase (pi k) activity, crklike protein (crkl), and the jak-stat signalling pathway [ ] [ ] [ ] [ ] [ ] . the ns protein, typically - aa in length depending on the strain, contains four functional regions: an rna binding domain (rbd, - aa), linker region (lr, - aa), effector domain (ed, - aa), and c-terminal "tail" (ctt, - aa) [ ] . multiple basic amino acids (e.g., r, r, k, and r) in the rbd are important for rna binding activity and suppressing the activation of pkr [ , ] . the ed plays an important role by targeting multiple host factors, such as pkr, cpsf , and p β (pi k), to inhibit antiviral responses and enhance viral replication [ ] . the residue w in the ed domain is important for the dimerization of the ns protein, and the w r substitution impaired ns dimerization and attenuated the virus in vivo [ ] . in addition, residues e, d, and v play important roles in the binding of ns to cleavage and polyadenylation specificity factor (cpsf ), and mutations in those residues weaken the binding of ns to cpsf and impair the ability of the ns protein to shut off host gene expression [ , ] . post-translational modifications, such as phosphorylation, sumoylation, and acetylation, are important for protein function. phosphorylation at t, t, and t of the ns protein are important for interferon antagonistic activity and replication of human influenza virus [ , ] . sumoylation at positions and of ns are crucial for host protein expression shutoff and replication of h n influenza virus [ ] . acetylation is an important post-translational modification that occurs in two forms [ ] . one is the co/post-translational acetylation at the n α -termini of the nascent polypeptide chains [ ] . the other form is acetylation of the ε-amino group of lysine, which was first recognized in histones regulating gene translation [ ] and later was found in non-histone proteins [ ] . the acetylation status is reversible and well balanced by lysine (k) acetyltransferases (kats) and lysine deacetylases (kdacs), which are tightly regulated to perform many cellular functions [ ] . dysfunction of the acetylation machinery can inhibit protein functions and consequently lead to severe diseases [ , ] . acetylation has been found in multiple proteins of influenza viruses. acetylation was identified in the np protein of influenza virus, and deacetylation of the np protein prevented the virus from assembling functional virus particles [ ] . a histone-like sequence (histone mimic) was identified in the ns protein of influenza a h n , which contributes to suppression of the antiviral response [ ] . the n-terminal acetylation of pa-x is required for the host shutoff activity of pa-x and for viral polymerase activity [ , ] . acetylation of pa has been reported to be crucial for polymerase activity, and deacetylated pa protein restricts iav rna transcription and replication. the influenza virus haemagglutinin (ha) has three conserved cysteine residues ( , , and ) at its c terminus serving as acylation sites that are essential for the formation of fusion pores and infectivity [ ] . in the present study, an acetylation modification at k of the ns protein was identified and characterized. the results showed that the deacetylation-mimic mutation k r in the ns protein attenuated the replication and virulence of the virus in vitro and in vivo. ifn-β antagonist assays indicated that the k r mutation attenuated ifn antagonistic ability compared with the ns -wt or ns - q (constant acetylation-mimic) proteins. overall, this study indicated that acetylation at k of the ns protein plays an important role in the replication and virulence of influenza virus. madin-darby canine kidney (mdck) cells were maintained in eagle's minimal essential medium (emem, hyclone, grand island, usa) with % foetal bovine serum (fbs, gibco, grand island, usa), l-glutamine (gibco), and % antibiotic (gibco). human embryonic kidney (hek) t cells and adenocarcinomic human alveolar basal epithelial cells (a cells) were maintained in dulbecco's modified eagle's medium (dmem, hyclone) supplemented with % fbs (gibco), l-glutamine (gibco), and % antibiotic (gibco). the virus strain a/wsn/ h n (wsn), a mouse-adapted human influenza virus, was propagated and titrated in mdck cells. to identify the putative acetylation sites in influenza viral proteins, mass spectrometry was conducted with concentrated influenza virus. briefly, the h n virus was propagated in mdck cells, and then a total of ml of virus stock was prepared. to concentrate the virus, the collected virus was pelleted by centrifugation at g for min to remove the cell debris. clarified virus supernatants were layered on a % (w/v) sucrose cushion and centrifuged at g for h. the virus pellet was suspended in water and subjected to mass spectrometry analysis performed by ptm biolabs llc (hangzhou, china). to generate mutant viruses, the site mutations k r (deacetylation-mimic mutation) and k q (constant acetylation-mimic) were introduced into the reverse genetic plasmid phw -wsn-ns by a commercial site-directed mutagenesis kit (invitrogen, grand island, usa). the mutant viruses were rescued in the background of wsn-h n virus as described previously [ ] , resulting in wsn-ns - r and wsn-ns - q viruses, and all the mutant viruses were verified by sequencing. then, the ns -wt, ns - r, and ns - q genes were amplified and cloned into the pcdna . -flag expression vector (flag-ns -wt, flag-ns - r, and flag-ns - q). the three viruses were inoculated on monolayer mdck and a cells cultured in -well plates with multiplicities of infection (mois) of . and . for each virus, respectively. each time point was set up in triplicate, and then the samples were collected at , , , and hours post-inoculation (hpi). the supernatants were titrated on mdck cells cultured in -well plates following the reed and muench method to calculate tcid /ml. the cells were collected and subjected to western blotting. briefly, the cell lysates were separated on a sodium dodecyl sulfate (sds)-polyacrylamide gel and transferred to a pvdf membrane. the membrane was blocked in pbs containing % skim milk and then incubated with rabbit anti-ns polyclonal antibody (genscript, piscataway, usa) and then with horseradish peroxidase-conjugated secondary anti-mouse antibody (thermo fisher, grand island, usa). the proteins were visualized by using an ecl kit (yeasen, shanghai, china). additionally, to determine the protein expression levels of the three ns expression plasmids, the flag-ns -wt, flag-ns - q, and flag-ns - r plasmids were transfected into t cells. forty-eight hours post-transfection, the cells were collected and subjected to western blotting. forty-eight -week-old female balb/c mice were randomly allocated into four groups, and each group contained mice. three groups were challenged with the indicated viruses, and one group was challenged with pbs as a control. the mice were inoculated with virus intranasally with . tcid of virus in µl solution under slight anaesthesia with co . the mice were monitored for body weight, clinical signs, and survival rate each day until days post-infection (dpi), and they were euthanized if they lost more than % of their original body weight. three mice from each group were euthanized at and dpi. the mouse lungs were collected for viral titration and cytokine analysis. to quantify the cytokine levels of il- β, ifn-β, and tnf-α in mouse lungs, total rna was extracted from lung tissues, reverse-transcribed and subjected to quantitative realtime polymerase chain reaction as described previously [ ] . to detect the ifn-β antagonistic ability of ns proteins, t cells were transfected with the indicated ns expression plasmids ( . μg/well) together with a plasmid expressing firefly luciferase under the control of the ifn-β promoter (pgl-ifn-β-luc, . μg/well), the renilla luciferase expressing plasmid prl-tk ( . μg/ well), and the ifn-β stimulator poly(i:c) ( . μg/well) or a plasmid expressing the active caspase recruitment domain (card) of rig-i (pcdna-rig-i . μg/well), pcdna-mavs ( . μg/well), pcdna-tbk ( . μg/well) or pcdna-irf ( . μg/well) as described previously [ ] . twenty-four hours post-transfection, the cells were lysed and subjected to a dual-luciferase reporter assay kit (promega, madison, usa). mdck cells cultured on glass slides were infected with wsn-ns -k r, wsn-ns -k q, or wsn-wt at an moi of . all cells were fixed with % paraformaldehyde (pfa) and permeabilized with . % triton x- in pbs at , , and hpi. to detect the ns proteins, the fixed cells were incubated with rabbit anti-ns polyclonal antibody (genscript), followed by fitc-conjugated antirabbit igg antibody (yeasen), and then the cells were stained with dapi. all images were obtained on a leica tcs sp confocal microscope (leica microsystems inc., buffalo grove, usa). the animal study was conducted in accordance with the guidelines of the animal care and use committee of shanghai jiao tong university, and the animal study protocols were approved by shanghai jiao tong university (approval no. ). all data were analysed using analysis of variance (two-way anova) in graph-pad prism version . (graphpad software inc., la jolla, usa); a p-value of . or less was considered significant. the mass spectrometry results showed that one acetylation modification was identified at position k of the ns protein of the wsn-wt virus (figure ). to further explore whether k is subtype-specific, we compared the ns amino acid sequences of randomly selected influenza virus strains of each endemic subtype in birds and humans from genbank. most avian h n ( %), h n ( . %), h n ( %), human h n ( . %), and human h n ( . %, isolated before ) viruses contained k in the ns genes, whereas . % of pandemic h n viruses contained r in the ns protein. these data demonstrated that the k residue is relatively conserved in influenza viruses except the pandemic h n . since the ns protein is an important virulence marker and antagonist of host innate immunity, we chose to further explore the influence of acetylated k on virus replication and virulence in this study. to mimic deacetylated lysine, a k r substitution was introduced into the ns protein, since an r substitution prevents acetylation but preserves the positive charge, and a mutant virus containing the ns -k r substitution was generated in the background of wsn virus (wsn-ns - r). moreover, to mimic constantly acetylated lysine at k of the ns protein, a k q substitution that is a known acetylation mimic was introduced into ns , resulting in a mutant virus containing ns -k q (wsn-ns - q). to determine the effect of acetylated k on virus replication, mdck and a cells were infected with wsn-wt or the two mutant virus wsn-ns - r (deacetylation mimic) or wsn-ns - q (constant acetylation mimic) at the indicated mois to obtain multicycle growth curves. all three viruses replicated efficiently in mdck and a cells, and wsn-ns - q and wsn-wt replicated to similar levels at each time point. however, the growth of the deacetylated mutant virus wsn-ns - r was significantly impaired compared with that of the other two viruses in mdck and a cells at and hpi, which indicated that the acetylated k of ns is important for virus replication in vitro at the late stage of infection (figures a and b) . similarly, ns protein levels were detected by western blotting. the results showed that the ns expression levels of wsn-ns - r were lower than those of wsn-wt and wsn-ns - q at different time points in mdck and a cells (figures c and d) . all the mice infected with viruses showed clinical signs such as ruffled fur, depression, and inappetence. the mice infected with constantly acetylated wsn-ns - q displayed more severe clinical signs and started to show mortality earlier than wsn-wt-infected mice. both wsn-ns - q and wsn-wt caused % mortality in infected mice. however, the deacetylated wsn-ns - r virus infection resulted in % mortality, and the mortality was delayed by days compared with other viruses, which indicated that the deacetylation-mimic k r substitution attenuated the wsn-ns - r virus in mice ( figures a and b) . virus titers were slightly lower in the lungs of mice infected with wsn-ns - r than in the other two groups ( figure c) . notably, significantly higher levels of ifn-β, il- β, and tnf-α mrna were detected in wsn-ns - r-infected mice than in the other two groups at dpi but not at dpi ( figures d-f) , which indicated that wsn-ns - r was less efficient at inhibiting the innate immune response than wsn-ns - q and wsn-wt at dpi in mice. to determine whether the k r and k q mutations affect the expression of the ns protein, the expression levels of flag-ns -wt, flag-ns - q, and flag-ns - r in t cells were compared. the three proteins were expressed at similar levels in transfected t cells, which indicated that the k r and k q mutations did not influence protein expression ( figure e) . the major function of the ns protein is inhibition of type i ifn induction, and the acetylated k residue is located in the effector domain of ns , which is important for its ifn antagonistic ability. to determine the effect of the acetylated k residue on ifn suppression by the ns protein, the inhibition of ifn-β promoter activity by flag-ns -wt, flag-ns - q, and flag-ns - r was evaluated. the results showed that flag-ns -wt and acetylation-mimic flag-ns - q suppressed the ifn-β promoter activity stimulated by poly(i:c) ( figure a) ; however, the deacetylation-mimic flag-ns - r protein was significantly less capable of inhibiting the activation of the ifn-β promoter compared with the acetylated ns proteins ( figure a ). this result indicated that the impaired ifn-β antagonistic ability might be responsible for the attenuation of the wsn-ns - q virus in vitro and in vivo. influenza virus infection stimulates type i ifn production by signal transduction from rig-i to tbk to irf . to further explore how the k r mutation attenuated the ifn-β antagonistic ability of the ns protein, we co-transfected ns expression plasmids, an ifn-β reporter plasmid and different type i interferon pathway components, including rig-i card, tbk , and the active form of irf , into t cells. the results showed that ns - q and ns -wt inhibited the ifn-β response stimulated by each component more efficiently than ns - r, which suggested that the acetylated k residue is important for inhibiting the ifn-β response of ns that targets factors downstream of irf or other proteins (figures b-d) . two nuclear localization signals ( - and - ) have been identified in the ns protein, which drive ns to the nucleus during the early stage of infection. to determine whether k r changes the subcellular localization of ns during infection, mdck cells were infected with the three viruses. the ns protein of wsnwt was located in the nucleus and cytoplasm of infected cells at and hpi ( figures a and b) . at the late stage of infection ( hpi), the ns protein was mainly located in the nucleus and perinuclear area of infected cells (figure c ). the ns - q protein was located in both the nucleus and cytoplasm at hpi and hpi, while it was mainly located in the perinuclear area of infected cells at hpi. in contrast, the ns - r protein accumulated mostly in the cytoplasm during the whole infection course. this result indicated that the deacetylation-mimic k r substitution retained ns protein in the cytoplasm of infected cells, suggesting that the acetylated k residue is important for the nuclear localization of the ns protein ( figures a-c ). post-translational modification is important for protein function, stability, cellular localization, and protein-protein interactions. recent studies have shown that posttranslational modifications of viral proteins modulate the virus life cycle, e.g., phosphorylation of influenza viral proteins (ns , m , and np) plays important roles in virus replication [ ] [ ] [ ] [ ] . the ubiquitination of np and m proteins is crucial for viral rna replication and the production of infectious virus particles [ ] . acetylation is an important post-translational modification in eukaryotes, but the occurrence and function of acetylation in influenza viral proteins remain largely unclear. giese et al. reported that acetylation of k, k, and k of np proteins is important for virus polymerase activity and replication [ ] . in the present study, we identified and characterized the acetylation of k in the ns protein, and the deacetylation of k q affected viral replication in cells at and hpi. the expression levels of ns -k q and ns -k r in transfected t cells were similar ( figure e ), while the ns levels in infected mdck and a cells were different ( figures c and d) . this result could be attributed to the deacetylation affecting virus replication, which resulted in low expression of ns in infected cells. moreover, the acetylation of q contributes to the ifn antagonistic ability of the ns protein. the mrna levels of ifn-β, il- β, and tnf-α in mouse lungs of the wsn-ns -k r-infected group were significantly higher than those in the other two groups at dpi, which indicated that ns - r was less efficient at inhibiting the production of innate antiviral cytokines at dpi in mice. the ns protein of influenza virus is a virulence factor that inhibits the antiviral immunity of the infected host, and c-terminal truncation has been widely used as a strategy to generate attenuated virus vaccine candidates [ ] . one mechanism used by ns to inhibit the ifn response is through direct binding and sequestration of rna as well as direct interaction with trim and complex formation with the rna sensor rig-i, resulting in inhibition of the activation of the rig-i card and hence inhibition of irf activation [ ] . the rna binding, rig-i and trim interacting domains are located in the n-terminus ( - aa) of the ns protein. however, the acetylated k is located in the ed domain, and acetylation of k may not affect the rna binding capability of the ns protein. the ns -k r substitution impaired the suppression of ifn promoter activation by poly(i:c), rig- card, tbk- , and irf , which suggested that the ns -k r substitution affected the ifn antagonism of ns either through targeting downstream of irf or a general mechanism that ns uses to inhibit ifn, such as interaction with cpsf , resulting in inhibition of the processing of mrna, including ifn mrna [ ] . notably, the cpsf protein is mainly located in the nucleus and is required for the ′ end processing of all host pre-mrnas. interestingly, the deacetylation-mimic k r substitution retained ns protein in the cytoplasm of infected cells, resulting in a possible impaired interaction between cpsf and the ns protein, subsequently leading to attenuated ifn antagonism. benjamin hale and colleagues found that the a/california/ / (h n ) virus has r in ns , and ns was unable to suppress general host gene expression. nevertheless, the rk substitution in the ns / protein restored its ability to block general gene expression and bind cpsf [ ] . this could explain why attenuation of the ifn antagonistic ability of ns -k r is independent of rig-i card, tbk- , and irf activation. in addition, anastasina et al. [ ] reported that the ns protein binds to cellular dna to block the cellular transcription of ifns and isgs; thus, ns proteins retained in the cytoplasm lose their cellular dna binding function, resulting in impaired ifn-β antagonistic ability. two known nuclear localization sequences (nlss) of ns proteins are located at the - and - positions. the - nls of the ns protein is highly conserved among influenza a virus strains [ ] . the second nls ( - ) is virus strain specific, and the pandemic h n lacks the second nls in the ns protein [ ] . single point mutations, either r a, r a, or k a, completely eliminated importin protein binding, which transports target proteins to the nucleus [ ] . notably, in this study, acetylation of k located outside of the nls affected the cellular localization of ns protein, and the deacetylation-mimic k r substitution blocked the nuclear localization of the ns protein in infected cells; however, the underlying mechanism remains unknown. interestingly, the ns -k residue is relatively conserved in most influenza viruses, except for the pandemic h n . the pandemic h n has r in ns , which causes inefficient general host gene expression shutoff, while r k restores its ability to block general host genes and bind cpsf [ ] . potentially, the pandemic h n virus might use different strategies to overcome the ifn response compared with the other influenza viruses. overall, we identified an acetylation of k of the ns protein of influenza virus, and the acetylation of k plays an important role in the cellular localization, ifn antagonistic ability, replication, and virulence of influenza virus. conformational plasticity of the influenza a virus ns protein inhibition of retinoic acid-inducible gene i-mediated induction of beta interferon by the ns protein of influenza a virus rig-i-mediated antiviral responses to single-stranded rna bearing ′-phosphates immunogenicity and protection efficacy of replication-deficient influenza a viruses with altered ns genes structural basis for a novel interaction between the ns protein derived from the influenza virus and rig-i structural basis for suppression of a host antiviral response by influenza a virus influenza virus non-structural protein (ns ) disrupts interferon signaling a site on the influenza a virus ns protein mediates both inhibition of pkr activation and temporal regulation of viral rna synthesis the primary function of rna binding by the influenza a virus ns protein in infected cells: inhibiting the ′- ′ oligo (a) synthetase/rnase l pathway influenza a virus inhibits type i ifn signaling via nf-kappabdependent induction of socs- expression influenza a virus abrogates ifn-gamma response in respiratory epithelial cells by disruption of the jak/stat pathway the multifunctional ns protein of influenza a viruses influenza a virus virulence depends on two amino acids in the n-terminal domain of its ns protein to facilitate inhibition of the rnadependent protein kinase pkr contribution of ns effector domain dimerization to influenza a virus replication and virulence ns protein amino acid changes d n and v i affect interferon responses, thermosensitivity, and virulence of circulating h n human influenza a viruses the k e amino acid substitution in the canine influenza virus h n ns protein restores its ability to inhibit host gene expression roles of the phosphorylation of specific serines and threonines in the ns protein of human influenza a viruses threonine phosphorylation of non-structural protein regulates the replication of influenza a virus by reducing the binding affinity with rig-i modification of nonstructural protein of influenza a virus by sumo years of protein acetylation: from gene regulation to epigenetics, metabolism and beyond proteomics analyses reveal the evolutionary conservation and divergence of n-terminal acetyltransferases from yeast and humans acetylation and methylation of histones and their possible role in the regulation of rna synthesis acetylation and deacetylation of non-histone proteins the world of protein acetylation genetic dissection of histone deacetylase requirement in tumor cells the many roles of histone deacetylases in development and physiology: implications for disease and therapy role of influenza a virus np acetylation on viral growth and replication suppression of the antiviral response by an influenza histone mimic n-terminal acetylation by natb is required for the shutoff activity of influenza a virus pa-x hdac restricts influenza a virus by deacetylation of the rna polymerase pa subunit acylation-mediated membrane anchoring of avian influenza virus hemagglutinin is essential for fusion pore formation and virus infectivity analysis of recombinant h n wild-type and mutant viruses in pigs shows that the q l mutation in ha is important for transmission quantification of murine cytokine mrnas using real time quantitative reverse transcriptase pcr herpes simplex virus ubiquitinspecific protease ul inhibits beta interferon production by deubiquitinating traf mapping the phosphoproteome of influenza a and b viruses by mass spectrometry effects of the s residue of the h n swine influenza virus ns protein on interferon responses and virus replication ubiquitination of the cytoplasmic domain of influenza a virus m protein is crucial for production of infectious virus particles phosphorylation and dephosphorylation of threonine in nucleoprotein is crucial for the replication of influenza a virus attenuation of the virulence of a recombinant influenza virus expressing the naturally truncated ns gene from an h n equine influenza virus in mice inefficient control of host gene expression by the pandemic h n influenza a virus ns protein influenza virus ns protein binds cellular dna to block transcription of antiviral genes nuclear and nucleolar targeting of influenza a virus ns protein: striking differences between different virus subtypes influenza a h n subtype virus ns protein targets into the nucleus and binds primarily via its c-terminal nls /nols to nucleolin and fibrillarin this study was supported by the national natural science authors' contributions jm, jl, and js designed the study; jm was involved in the acquisition of data, analysis, and figure preparation; rw, gx, yc, and zw contributed to some of the laboratory experiments and data analysis; hw and yy helped revise the manuscript; jl and js supervised the study; jm drafted the original paper. all authors read and approved the final manuscript. the authors declare that they have no competing interests. key: cord- -cav n w authors: hoppe, sebastian; bier, frank f.; nickisch-rosenegk, markus v. title: rapid identification of novel immunodominant proteins and characterization of a specific linear epitope of campylobacter jejuni date: - - journal: plos one doi: . /journal.pone. sha: doc_id: cord_uid: cav n w campylobacter jejuni remains one of the major gut pathogens of our time. its zoonotic nature and wide-spread distribution in industrialized countries calls for a quick and reliable diagnostic tool. antibody-based detection presents a suitable means to identify pathogenic bacteria. however, the knowledge about immunodominant targets is limited. thus, an approach is presented, which allows for the rapid screening of numerous cdna derived expression clones to identify novel antigens. the deeper understanding of immunodominant proteins assists in the design of diagnostic tools and furthers the insight into the bacterium’s pathogenicity as well as revealing potential candidates for vaccination. we have successfully screened clones of an expression library to identify proteins that have not been described as immunodominant before. after subcloning the corresponding genes and expression of full-length proteins, we investigated the immunodominant character by microarrays and elisa. subsequently, seven proteins were selected for epitope mapping. for cj and cj c linear epitopes were identified. for cj , specificity assays revealed a specific linear epitope site. consequently, an eleven amino acid residue sequence tlikelkrlgi was analyzed via alanine scan, which revealed the glycine residue to be significant for binding of the antibody. the innovative approach presented herein of generating cdnas of prokaryotes in combination with a microarray platform rendering time-consuming purification steps obsolete has helped to illuminate novel immunodominant proteins of c.jejuni. the findings of a specific linear epitope pave the way for a plethora of future research and the potential use in diagnostic applications such as serological screenings. moreover, the current approach is easily adaptable to other highly relevant bacteria making it a formidable tool for the future discovery of antigens and potential biomarkers. consequently, it is desirable to simplify the identification of structural epitopes, as this would extend the spectrum of novel epitopes to be detected. c. jejuni is a gram-negative, microaerophilic bacterium possessing a helical-shaped morphology [ ] . in industrialized countries, c. jejuni has been one of the primary causal agents of gastroenteritis. in , in germany alone cases have been reported [ ] . campylobacteriosis predominantly induces mild, self-limiting diarrhoea, however severe cases have been reported [ ] . several studies have shown the potential contribution of campylobacteriosis in the development of neuropathies such as the guillain-barré syndrome [ , ] . the prominent route of infection is the improper handling and insufficient cooking of poultry [ ] . the broad distribution of this pathogen in combination with a high clinical relevance necessitates fast and reliable diagnosis. although several genomic typing methods exist [ , ] , these are often timeconsuming and inappropriate for a point-of-care application. instead, a direct approach detecting the whole bacterium is beneficial. this can be achieved by using specific antibodies to membrane-associated antigens similar to the latex-agglutination-test that is already available for several bacterial pathogens including campylobacter [ ] . in order to achieve this, copious knowledge of potentially suitable targets, i.e. immunodominant proteins, is indispensable. in the past, screening for immunogenic proteins has been carried out on nitrocellulose membranes [ ] or using microarray library screening with extensive protein purification [ ] . however, these methods have some major drawbacks as the former is prone to non-specific binding and cross-reactivity when using polyclonal sera for screening of bacterial libraries [ ] , while the latter method is time-consuming and laborious due to the purification steps needed prior to microarray printing. as we have shown elsewhere, an approach using halotagh and specifically coated halolink tm slides is better suited to detect immunodominant proteins while reducing cross-reactivity to a minimum [ ] . the halotagh provides several advantages to other commonly used tags as it enhances the amount of soluble proteins expressed [ ] reducing the formation of inclusion bodies. in addition, the interaction of tag and its specific ligand is based on covalent binding [ ] . this negates the need for additional purification steps as the crude lysate can simply be spotted onto coated microarrays. only the target proteins presented as fusion constructs bearing a halotagh will bind to the surface, whereas the remaining proteins are washed off. combining the described screening method with expression libraries derived from c. jejuni cdna allows for the fast analysis of hundreds of different proteins. thus, suitable immunodominant proteins can be detected, isolated and identified via sequencing the encoding cdna sequence. the generation of a cdna derived expression library offers advantages in contrast to genomic libraries. the latter demand excessive screenings as the genetic information is mostly truncated or of little relevance representing areas within the genome that do not encode for proteins, whereas the former focuses on the genes transcribed [ ] . this reduces the amount of clones to be screened. nevertheless, for effective cdna library screening normalization is needed, as rrna is mainly overrepresented due to its extreme abundance within a total rna extraction prior to reverse transcription [ ] . bacteria only posses a poly(a)-tail on their mrna in rare cases [ , ] . although methods exist to isolate mrna from bacteria [ , ] , it is generally considered to be more challenging as compared to eukaryotic rna, where oligo(dt) primers are sufficient [ ] . therefore, we refrained from isolating the mrna prior to reverse transcription. instead, the generated cdna was normalized, i.e. trimmed down, afterwards by the use of a duplex-specific nuclease [ ] . this approach has been shown to effectively reduce the amount of rrna-derived molecules, thereby altering the overall composition in favour of the mrnaderived cdna without including a bias [ ] . further optimization of library construction was achieved by using a ligationindependent cloning as well as electroporation, which have been shown to enhance overall cloning efficiency [ , ] . using this approach, a relatively small number of clones can be screened to illuminate immunodominant proteins effectively. the identification of previously unknown or incompletely described antigens offers several benefits and potential applications. first, these proteins might serve in a diagnostic tool to rapidly identify c. jejuni in biological samples, e.g. in food manufacturing industry or hospitals. secondly, proteins eliciting an immune response might be suitable candidates for vaccination. finally, gaining insight into the structure and function of novel immunodominant proteins might improve the overall understanding of a bacterium's pathogenicity as it could accelerate the elucidation of novel virulence-associated factors. in this paper, we show the successful screening of an expression library of c. jejuni identifying several potentially immunodominant proteins. after further investigations, we selected a subset of these proteins for epitope mapping and succeeded in identifying linear epitopes for two proteins, namely cj and cj c not described before. furthermore, assays were performed to assess specificity of the binding as well as investigating the relevance of the amino acid residues involved via alanine scanning. additionally, the structure and antigenicity of the proteins and epitopes were modelled to analyze the suitability of the identified sequences for future applications like diagnostic tools or vaccine development. the rin values for the c. jejuni rnas isolated were above . in all cases (see s : rin). after polyadenylation of the rna, it was reverse transcribed and subsequently normalized using a duplexspecific nuclease (dsn). assessing the performance of normalization, we sequenced a sample set of individual clones after transformation with trimming and without trimming. without dsn treatment % of clones contained s rrna derived cdna and % other rrna derived cdna ( s and s). in comparison, after incubation with dsn only one clone in showed a s rrna derived cdna. a total number of different clones were screened using halolink tm slides. we used hisj (uniprot/swiss-prot: q ), cjaa (q p s ) and peb a (q p x ) as positive reference markers, as they have been described previously to be immunodominant [ ] [ ] [ ] . in contrast, argc (q pis ) and pyrc (q pbp ) were used as negative reference proteins. after comparing the median value and the standard deviation of each sample to the values of the positive references, a selection of clones were picked to be sequenced. generally, we grouped clones into three categories after screening. group i included samples showing a higher median contrast value than all the positive references used, group ii encompassed the samples with median contrast values in between those of different positive references and group iii the remaining samples which albeit below the lowest positive reference were still above all the negative reference signals. after screening clones, % fell into group i, % encompassed group ii and % made up group iii while the remaining % were below the negative controls. the sequenced clones were ''blasted'' against the genome of c. jejuni nctc (refseq: nc_ ) to identify the corresponding genes and proteins. for further experiments, known antigens of c. jejuni found within the sequenced clones were discarded and we focused solely on novel potentially immunogenic proteins. several of the identified clones carried only truncated fragments of the corresponding genes. in addition, some of the sequenced inserts possessed a frame shift. in order to overcome these limitations fulllength inserts were prepared and subcloned to express the fulllength proteins. the genes and their corresponding proteins are summarized in table including the length as well as the size of the protein without the attached halotagh. recombinant expression of the full-length fusion proteins was assessed by sds-page using a halotagh alexa ligand, see figure . the correct-sized proteins are highlighted as they are expressed fused to the kda halotagh. bands at roughly kda in size are visible throughout, likely representing early-termination transcripts comprised only of the halotagh. evaluating immunodominance of these proteins, we performed microarray and elisa analyses using three different primary antibodies against c. jejuni. summing up these results, table shows the respective mean q values and errors (n = ) for each of the proteins identified. the highest scores were attained by cj and cj with . . and . . respectively, indicating a potentially stronger immunogenicity as compared to the known immunodominant proteins hisj and cjaa. however, we restrained from further characterizing the former two proteins via epitope mapping as both are highly conserved and show multiple homologies in cellular organisms (cj ) or epsilonproteobacteria (cj ) according to the ncbi protein cluster database [ ] . the same rationale was applied for cj , which is conserved in cellular organisms and peaks at . . . instead, we focused on cj , cj and cj , which are highly conserved in campylobacter only as well as cj c and cj , which in turn are conserved in campylobacteraceae. cj , cj and . for almost all proteins, bands were visible with the respective sizes of their molecular weight and the halotag tm . the halotag tm alone at kda is visible throughout. the proteins were as follows: -cj , -cj , -cj , -cj , -cj c, -cj , -cj , -cj , -cj , -cj , -cj , -cj , -cj , -cj , -cj , - , -cj , -cj , -cj , -cj , -cj and -cj . the pictures of the two gels as well as the righthand marker were fused by coreldraw to minimize space. for all proteins except cj c ( ) and cj ( ) the overlapping peptide sequences corresponding to each protein were mapped with three different primary antibodies to c. jejuni. each peptide contains amino acid residues with an overlap of to each adjacent peptide. cj , an abc-transporter atp-binding protein, showed significant intensities in two adjacent peptide spots as shown in figure . the consensus sequence derived from these spots is tlikelkrlgi, which is highly conserved in c. jejuni (see s : tlikelkrlgi is conserved in c. jejuni). next, we assessed specificity of these signals by applying antibodies reactive to other bacteria then c. jejuni in the epitope mapping to assess whether the binding occurs via the paratope of the antibody or in a nonspecific manner. figure shows the results of these investigations. the mean rfis (n = ) of peptide and are at and a.u. respectively after incubation with polyclonal antibody to c. jejuni, whereas these intensities drop below a.u. if an e.coli antibody is used. the remaining proteins painted a different picture. for cj (s :epitope mapping of cj ) -a fragment with only residues -no linear epitope was identified. the same was true for cj (s :epitope mapping of cj ), cj (s :epitope mapping of cj ) as well as cj (s :epitope mapping of cj ) which harboured no linear epitopes. for the protein cj (s :epitope mapping of cj ) only one peptide showed signal intensities above the positive control. another abc-transporter protein, cj c, possesses several regions, where signal intensities were above those of the positive controls, namely peptides to (aa - ), to (aa - ) as well as and (aa - ), see s : epitope mapping of cj c. the three potential antigenic regions were further assessed using transmembrane prediction tool tmhmm . and the antigenic prediction tool by emboss. here, peptides - showed a consensus sequence, namely spfavwkfldal, which ought to be presented extracellular as well as being antigenic according to the prediction tools. for the other two sites, either no antigenicity was predicted or most of the amino acids lie within a transmembrane region. this is true for amino acids - , - and - . for a summary of the predicted characteristics, see s : transmembrane and antigenic potential of three potential epitope sites for cj c.further, specificity control assays revealed that these signals do not drop significantly when using antibodies to salmonella enterica indicative of non- specific binding to occur, see s : specific vs. non-specific binding to potential linear epitopes of cj c. the influence of each amino acid within the consensus sequence tlikelkrlgi was assessed by performing an alanine scan. as figure shows, the majority of amino acids in the sequence do not confer a change in binding intensity if replaced by alanine. however, in case of glycine a substantial drop in rfi values to less than % of the original value is observed. this indicates an important role of the glycine residue in the binding of the antibody. further, these results were highly reproducible in a second set of experiments and the specificity of the binding was evaluated using antibodies reactive to helicobacter pylori as summarized in figure . the green boxes on the left represent the values of original sequence tlikelkrlgi and altered sequence tlikelkrlai after incubation with polyclonal antibodies to c. jejuni. as was observed in the previous figure, the drop in intensity is significant from a mean value of approximately a.u. to less than equal to roughly % decrease in the signal. in comparison, the adjacent boxes represent the data of the original sequence as well as tlikelkrlai after incubation with antibodies reactive to helicobacter pylori (orange).here, no significant difference between unaltered and altered sequence could be observed. furthermore, the mean intensities were below a.u. thus, binding of an anti-helicobacter pylori antibody to the sequence is approximately % as compared to the original binding of polyclonal antibodies to campylobacter jejuni against this linear epitope. the same trend was observed when using antibodies reactive to s. enterica (s : binding specificity assay of tli-kelkrlgi with anti-salmonella antibodies). this indicates low non-specific binding. modelling of cj 's structure was performed by swiss-model using automated mode. the template chosen during automatic identification was pdb ji a from the rcsb protein data bank [ ] , which referred to the crystal structure analysis of the abc transporter from thermotoga maritima [ ] . the model spans amino acids to of the molecule incorporating more than % of the residues. figure shows a graphical display of the structure with the distinct secondary structures highlighted. in addition, the epitope sequence tli-kelkrlgi is marked in orange. the first nine residues comprise parts of an alpha helix, whereas glycine and isoleucine form a loop to attach the alpha helix with a beta strand further down the primary sequence of the protein. the modelling was supported by statistical analyses including determination of a z-value [ ] based on published data in the protein data bank. figure shows the accuracy of the model based on the z-value calculations. we analyzed the consensus peptide sequence tlikelkrlgi by geneious pro . . . notably, when predicting secondary structure and antigenic regions by emboss garnier and antigenic tool respectively, the results matched the previous observations. an alpha helix precedes a loop that is connected to a beta-sheet. the only difference to the structural modelling by swiss model is the number of residues within the loop as emboss only includes the glycine residue in the loop. for antigenic potential, emboss antigenic predicts all amino acids within the consensus sequence to be part of antigenic sites except for lysine. the development of a cdna derived expression library and subsequent screening of the expressed fusion proteins has revealed several novel immunodominant proteins. optimization during library construction by reducing the amount of rrna molecules via dsn treatment has proven to greatly reduce the amount of rrna clones as was shown for other applications elsewhere [ , ] . this enhances the number of clones carrying mrnaderived cdna, thus increasing the number of clones potentially expressing proteins of interest. for screening purposes antibodies were used, which were generated by immunizing rabbits with whole and partly lysed cells of c. jejuni. therefore, both membrane-associated as well as soluble proteins were available during immune response. thus, it was to be expected that the antibodies used in screening target both membrane and cytosolic proteins, which could be observed in the proteins identified. the screening not only detected the proteins listed here but included other already known immunogenic proteins. these were excluded from further analyses as the main goal of this work was to identify novel proteins and their linear epitopes. the screening and analysis of immunodominant behaviour by microarrays was highly reproducible. some of the proteins showing the strongest signals during screening and microarray experiments were discarded, as they possess strong homologies within a wide spectrum of organisms. however, we focused on proteins specific for campylobacter due to the potential application in a diagnostic tool. thus, linear epitope mapping was performed on a subset of seven proteins. this revealed linear epitopes for two proteins, cj and cj c, both abc-type transporters, which were expected to be immunogenic [ ] . the identified region tlikelkrlgi of cj proved to be specific showing little to none non-specific interactions with other antibodies. most notably, h. pylori antibodies did not reveal any binding to the linear epitope. in the wake of the other antibodies tested, i.e. anti-e.coli and anti-s. enterica, this strongly suggests non-specific binding to be minute. furthermore, the applicability of tlikelkrlgi as a specific target for campylobacter identification, antibody or vaccine production seems plausible. alanine scan has further identified the glycine residue to be of utmost importance for the antibody binding as replacing it by alanine reduces the intensity by at least %. this has been somewhat surprising as glycine is a non-charged, achiral residue. however, the presence of glycine might cause the protein to turn more easily provided by the glycine's flexibility and consequently leading to a better accessibility. thus, immunogenic ability might be rendered by the presence of glycine. in fact, epitopes containing glycine as an important residue have been reported before [ ] [ ] [ ] . next, we modelled the -dimensional structure of cj in order to evaluate the accessibility of the potential epitope. this is an important feature when bearing a diagnostic application in mind. from the model, we could conclude that the glycine residue and the linear epitope are most likely presented as part of a loop structure adjacent to the end of an alpha helix. these are located on the outer rim of the protein, hence easily accessible for antibody binding to occur. although, the structural information is merely based on modelling, the z-values of this model fall within close proximity to zero (- . ) commonly associated with x-ray crystallographic results [ ] . consequently, the probability of this model to match the real structure is high. these findings are supported by further calculations performed using the emboss garnier and emboss antigenic algorithms. these results underline the antigenic potential of the consensus sequence tlikelkrlgi as well as predicting almost identical secondary figure . binding specificity assay of epitope sequence in alanine scan. box-whisker-plot (n = ) of the eleven residue long consensus peptide sequence tlikelkrlgi and its derivative tlikelkrlai of cj after incubation with antibodies reactive to c. jejuni (green) and h. pylori (blue). each box represents % of the values, while % fall within the whiskers. the median is represented by a horizontal line within each box and the small rectangle corresponds to the mean for each sample. for references the rfi values of the positive control, rabbit igg, and negative control, mbp, are indicated on the right for both antibody incubations. for the original sequence a mean value of a.u. can be observed after incubation with antibody to c. jejuni. this dropped to less than a.u. after alanine substitution. this represents a drop of roughly % in overall intensity. in contrast, neither original nor substituted peptide showed any significant mean values after incubation with the anti-h. pylori antibodies indicating specific binding to the peptide by the anti-campylobacter antibody. doi: . /journal.pone. .g structures as the automated swiss model. moreover, the secondary structure prediction changes when glycine is replaced by alanine, which further underlines the important role glycine most likely plays in antibody binding due to its ability to create a loop structure which would otherwise be absent with high probability. in conclusion, tlikelkrlgi seems a suitable candidate to be studied further and shows high potential for applications in a diagnostic tool or vaccination due to its good accessibility and antigenicity. this gains further momentum as tlikelkrlgi is highly conserved in c. jejuni, whereas other species of campylobacter, e.g. c. coli, c. upsaliensis or c. lari possess other amino acid residues within this region. for helicobacter pylori, this effect is even more pronounced. specifically, the glycine residue at the tenth position that was revealed paramount for the binding is not present in helicobacter pylori, see s . thus, tlikelkrgli seems a suitable candidate for specific diagnostic applications targeting c. jejuni. the analysis of the remaining six proteins revealed linear epitopes only in the case of cj c, another abc-type transporter. the region encompassing amino acids - spfavwkfldal seems to be the most promising as its rfi values are high in microarray experiments. furthermore, the prediction tools determined this sequence to be antigenic and located extracellularly, thus easily accessible. this is not true for the other two regions detected from cj c, which are either lacking antigenic potential or are located within the cell or in transmembrane regions. still, the results indicate non-specific binding to contribute mainly to the positive signal. this exempts spfavwkfldal from a suitable diagnostic application; however, further analysis might be helpful to investigate the full potential of this sequence as a specific epitope. although the analyses of full-length proteins on microarrays hint at the presence of antigenic sites within each protein, a lack of linear epitopes was observed. however, this was expected, as most naturally occurring epitopes are conformational and not linear [ ] [ ] [ ] [ ] . the current method used for epitope mapping cannot detect conformational epitopes. still, the presence of linear epitopes and their detection are important features in serological applications. this is particularly true for the high-throughput analysis of sera on microarrays as described elsewhere [ , ] . the present work has accomplished two main goals. first, we have been able to identify a previously unknown antigenic site tlikelkrlgi of cj from c. jejuni and were able to determine the important residue involved in antibody binding as well as modelling the epitope's accessibility within the full-length protein. for c. jejuni the generation of monoclonal antibodies against cj is mandatory to further investigate the affinity and kinetics of the observed binding event by biacore. this might grant further insight whether or not this sequence is a suitable candidate for specific detection in a biosensor or if cj is a suitable vaccine candidate. mutagenesis of cj might help to illuminate the function of this protein and its potential role if any in pathogenicity. once this is determined, a monoclonal antibody might be used to cocrystallize the antigen for x-ray crystallography to assign a measured structure to the predicted model. on top of that, antibody validation by elisa with a wide array of c. jejuni isolates is needed to evaluate the applicability of the derived antibody for future clinical point-of-care devices. additionally, next-generation sequencing of transcriptomes of a broad spectrum of c. jejuni strains ought to be beneficial in analyzing the presence and expression of the protein. this might provide further insight into the suitability of the protein for clinical applications and vaccine development. consequently, it could help to identify potential sequence homologies or discrepancies, which as of now are limited to already published datasets. thus, nextgeneration sequencing might be an attractive procedure to complement the approach presented herein. secondly, we have established and applied a quick and easy method for the screening of cdna expression libraries in order to rapidly detect novel immunodominant proteins of bacteria, see figure for a summary. this approach is easily transferable to other bacterial pathogens such as methicillin-resistent staphylococcus aureus, klebsiella pneumonia, neisseria gonorrhoea, pseudomonas aeruginosa, salmonella enterica and others. the information needed prior to analysis is minute, yet using fully sequenced strains is advantageous as it speeds up the identification of genes and proteins. still, even unsequenced strains such as clinical isolates can easily be analyzed and should not pose any hindrance, as homologous proteins ought to be identified via blast. the more important prerequisite for the screening is the availability of polyclonal antibodies or patient sera. in the context of a diagnostic tool to be used in hospitals, patient sera are beneficial, as they resemble the desired immune response better, thus narrowing down the results to the clinically most relevant proteins. finally, suitable references are useful, yet in most cases, some immunodominant antigens have already been described. regardless, even without a suitable known antigen, this method could identify novel immunoreactive proteins and their linear epitopes with some minor adjustments to the overall calculations. in conclusion, we have successfully shown the application of this method while gaining insight into some novel immunodominant proteins of an important gut pathogen, c. jejuni. the current research emphasizes the need for future investigations in two distinct areas. first, more insight into the newly identified proteins of c. jejuni might foster the understanding of this enigmatic pathogen and help to illuminate its pathogenicity while providing suitable means for rapid detection and combating its spreading. second, transferring this method to other bacterial pathogens will help to discover other immunodominant proteins, potentially leading to a broad spectrum of clinical applications and serological assays. additionally, it might be preferable to simplify the identification of conformational epitopes as this could greatly enhance the retrieval rate of suitable antigenic sites. the state-of-the-art technique involves the cocrystallization of an antigen with a monoclonal antibody [ , ] . besides, mutation analysis [ ] , mass-spectrometry [ ] or clips technology [ ] are other methods to identify structural epitopes. however, the existing methods are extremely costly, time-consuming and demand high material inputs. therefore, advances to simplify structural epitope detection are essential and would be ideally suited to be combined with our current approach. nevertheless, linear epitopes provide an important tool for clinical applications. while their low abundance poses a problem, their simplicity grants major benefits. they are rapidly synthesized and modified. this enables them to be used in a broad spectrum of assays. in a clinical context where antibody titer determination of patient sera is necessary, short peptides are extremely valuable. production costs using chemical synthesis are relatively low and easier than recombinant expression of full-length proteins. furthermore, specific peptides of pathogenic bacteria remove the need to use the entire pathogen, thus reducing the associated risk. although prediction-based strategies for antigens and linear epitopes have been published [ ] [ ] [ ] [ ] , their accuracy is often lacking [ ] . in contrast, our approach offers an attractive dual procedure as it rests on experimental data that are complemented by a widespread support of bioanalytical tools. thus, such a thorough approach facilitates to focus on relevant epitopes quickly, while rapidly evaluating their suitability for prospective diagnostic applications as compared to prediction-based or experimental methods alone. consequently, we believe our present findings to be of outstanding interest for diagnostic applications and to pave the way for future implementation. moreover, the ability to quickly generate cdna libraries and identify novel immunodominant figure . schematic summary of the methods involved for library construction, screening and characterization. a bacterial pathogen is selected and its rna isolated. after cdna generation, normalization is performed to minimize the number of rrna derived clones. next, screening is performed using polyclonal antibodies and potentially immunogenic proteins are selected. the corresponding clones are sequenced and the genes and proteins identified, either by exact match (sequenced strain) or homology (clinical isolate). a set of candidate proteins is selected focusing only on previously unknown antigens, while known antigens are discarded. immunodominant nature of each full-length protein is assessed by elisa and microarray analyses to further narrow down the number of proteins to be tested via epitope mapping, if desired. linear epitope mapping reveals potential antigenic sites, which are tested for specificity and in alanine scan. finally, bioinformatic tools are used to model the -dimensional structure, accessibility and antigenicity of each protein. afterwards, the most promising candidates can be used in future applications including but not limited to monoclonal antibody generation, kinetic measurements, structural and functional analyses and diagnostic applications. doi: . /journal.pone. .g proteins independent of the bacterium used should lay the foundations for future research with highly relevant pathogens. creating a method to extend the detection to structural epitopes would add another tier to the current approach and greatly enhance the knowledge about antigenic sites. further, we are expanding our focus to other pathogens to help elucidate novel antigens within a wide array of clinically relevant bacteria. the strain c. jejuni nctc was grown on solid karmali media for h at uc under microaerophilic conditions ( % n , % o and % co ) within a hypoxic workstation (coylabs). for rna isolation, a liquid culture was prepared by inoculating ml of brain-heart-infusion broth (bhi) with a single colony and incubated overnight at uc, rpm under microaerophilic conditions. this overnight culture was used to inoculate a flask containing ml fresh bhi medium and incubated for h prior to harvesting. for initial screening, a rabbit polyclonal igg antibody to c. jejuni (acris ap pu-n) was used. for further microarray analyses of a subset of candidate proteins, elisa, epitope mapping and alanine scanning this antibody was used as well as two other rabbit polyclonal igg antibody to c. jejuni (abcam ab and abd serotec - ). the immunogen used for generation of antibodies to c. jejuni was c. jejuni atcc . for specificity assays rabbit polyclonal igg antibody to e.coli bl (micromol # ), s.enterica (abcam ab ), s. aureus (fitzgerald c-cr rp) and h. pylori (abcam ab ) were used respectively. detection was achieved by usage of secondary antibodies. a goat polyclonal to rabbit igg conjugated with chromeo tm - (abcam ab ) for fluorescent and conjugated with horseradish peroxidase (abcam ab ) for a colorimetric readout was applied. the cells were harvested by centrifugation ( x g, min). the supernatant was discarded and the pellets resuspended in fresh bhi medium. subsequently, . ml of the bacterial suspension were added to ml of rnaprotect bacteria reagent (qiagen) in a ml tube, vigorously vortexed for s and incubated for min at room temperature. after centrifugation ( x g, min) the pellets were resuspended in ml of lysis buffer ( mm tris-cl, mm edta, mg/ml lysozyme, . mau proteinase k) by pipetting and vortexing for s. the solution was incubated for min using a thriller thermomixer (peqlab) at room temperature and rpm. after addition of ml buffer rlt and ml % ethanol, the lysate was applied to rneasy bacteria mini kit spin columns (qiagen) for rna isolation following the manufacturer's instructions. after loading of the lysate, an on-column dnase digest was performed using rnasefree dnase i solution (qiagen) according to manufacturer's instructions. the isolated total rna was eluted in ml of rnasefree water and its concentration and purity analyzed by nanodrop (peqlab) measurements. the quality of isolated rna was assessed using the rna pico kit and bioanalyzer (agilent). the total rna was diluted to a working concentration of - pg/ml. the analysis was performed following manufacturer's instructions and the rna integrity number (rin) calculated by the expert software (agilent). the rin is defined to fall into a range of to , with a higher score indicating an intact rna, whereas lower numbers are associated with degraded rna molecules [ ] . in order to use bacterial mrna as a substrate in cdna synthesis, polyadenylation was mandatory. the tailing was achieved using the poly(a) polymerase tailing kit (epicentre) following the alternate protocol offered by the manufacturer. briefly, up to mg of total rna were combined with ml poly(a) polymerase reaction buffer, ml mm atp, . ml riboguard rnase inhibitor and ml poly(a) polymerase ( u) in a total reaction volume of ml. the reaction was incubated for min at uc, terminated by the addition of ml . m edta and purified by rneasy mini kit (qiagen) following manufacturer's instructions. yield and purity were determined by nanodrop measurements. for cdna synthesis, the in-fusionh smarter tm directional cdna library construction kit (clontech) was used according to manufacturer's instructions with slight modifications. . ml total, polyadenylated rna were mixed with ml of in-fusion smarter cds primer, heated first for min at uc and then incubated for additional min at uc. after addition of . ml mastermix ( ml x first strand buffer, . ml mm dtt, ml mm dntps, ml mm smarter v oligonucleotide, . ml rnase inhibitor and ml smartscribe tm reverse transcriptase) the tubes were incubated for min at uc. the reaction was terminated at uc for min. for second strand cdna synthesis two ml aliquots of first strand reaction were used in long distance pcr using phusion polymerase (finnzymes). each pcr reaction was comprised as follows: ml first-strand reaction, ml rnase-free water, ml x phusion hf buffer and ml each of dntp mix ( mm), pcr primer ii a ( mm), in-fusion smarter pcr primer ( mm) and phusion polymerase with a total reaction volume of ml. the pcr reactions were subjected to the cycling program with uc for min as initial denaturation followed by cycles of s denaturation at uc, s of primer annealing at uc and min extension at uc. for improved pcr results optimization was performed as follows; ml of the cycle experimental tube were transferred to a separate pcr tube, cycling commenced and aliquots of ml each were collected after , , , and cycles total. the different cycles were compared by gel electrophoresis and the experimental tubes subjected to additional cycles if necessary. finally, pcr reactions were purified using the qiaquick pcr purification kit (qiagen). the purity and yield of each reaction were analyzed by nanodrop measurements. normalization of double-stranded cdna was achieved with the trimmer- cdna normalization kit (evrogen) to reduce the number of cdna molecules derived from rrnas. briefly, ml of cdna (approx. ng/ml) were mixed with ml of x hybridization buffer. for the trimming reaction ml of this mixture were distributed to four different pcr tubes and overlaid with a drop of pcr-grade mineral oil. after centrifugation ( x g, min), the tubes were incubated for min at uc followed by h at uc. next, pre-heated ( uc) duplex-specific nuclease (dsn) master buffer was added to each tube and incubation prolonged for min. dsn was added to the first three tubes in decreasing concentrations - u/ml, . u/ml and . u/ml -with the fourth tube receiving dsn storage buffer and no enzyme as a control reaction. the incubation was continued for min at uc. after addition of ml dsn stop solution and subsequent incubation for min at uc, the tubes were placed on ice. the chilled reaction was diluted by addition of ml sterile, rnase-free water. for amplification of normalized cdna, ml of each reaction was used as template in pcr. each pcr reaction contained ml of template from the normalization reaction, ml nuclease-free water, ml x phusion hf buffer, ml mm dntp mix (neb), ml of each primer pcr primer ii a ( mm), in-fusion smarter pcr primer ( mm) and ml phusion polymerase. the pcr was performed with initial denaturation at uc for min and seven cycles of denaturation at uc for s, primer annealing at uc for s and extension at uc for min, respectively. for optimization, the control tube was subjected to , , , and cycles with ml aliquots taken every two cycles. the optimization samples were analyzed by gel electrophoresis ( % agarose, tae, v) and the optimal cycle number determined. the remaining three tubes were subjected to + x cycles with x being the differential of the optimized cycles to the originally performed seven cycles. after the second pcr, the experimental reactions were compared to the optimal control reaction using gel electrophoresis as above. reactions showing a successful normalization were combined and used in a third pcr reaction. after the final pcr, the reactions were purified by qiaquick pcr purification kit. for cloning pfn a (promega) was used as a vector, as it features a n-terminal encoded halotagh fusion protein, which allows for specific and covalent binding to a unique ligand, thus reducing background and minimizing cross-reactivity in immunoassays with halolink tm microarrays harbouring the ligand on its surface. first, the vector needed to be linearized to be used with the in-fusion cloning technology. this was achieved by reverse pcr using ifs a for ( ttgataccactgcttttc-catggcgatcgcgttatc ) and ifs a rev ( tctcatcgtaccccgtgtttaaacgaattcgggctcg ) . each reaction contained ml each of : diluted pfn a ( ng/ml) and the two primers, ml x phusion hf buffer, ml mm dntps, . ml phusion polymerase and . ml nucleasefree water to reach a total reaction volume of ml. the pcr was run using a cycle two-step program with uc denaturation for s and min extension at uc. after completion, ml of dpni ( u/ml) were added to the reaction and incubated at uc for h. the presence of a single band was checked by gel electrophoresis and the remaining reaction purified by qiaquick pcr purification kit. cloning of normalized cdna and linearized pfn a vector was performed following the manufacturer's instructions within the in-fusion smarter directional cdna library construction kit (clontech). electroporation ml of the cloning reaction were mixed with ml of electrocompetent acella tm e.coli cells (mobitec), a bl derivative, and electroporated in mm cuvettes using the easyject plus electroporator (peqlab). conditions for electroporation were as follows: voltage = v, capacity = mf, resistance = v and pulse duration of ms. the electroporated cells were added to ml of super optimal broth with catabolite expression (soc) and incubated at uc for h with shaking at rpm. afterwards, ml of the transformation reaction were plated on lysogeny broth (lb) agar containing ampicillin. for each reaction, at least two plates were prepared and incubated at uc for h. a total number of clones were selected and transferred to . ml u deepwell tm plates (nunc) containing . ml lbamp. the plates were incubated overnight at uc, rpm. on the next day, the deepwell tm plates were centrifuged, the supernatant discarded and the pellets resuspended in ml of lbamp. a new set of u deepwell tm plates was prepared with ml of fresh lb-amp and inoculated with ml each from the resuspended overnight cultures. the remaining ml of resuspended overnight culture were mixed with ml of sterile-filtered dmso and stored at - uc. the newly inoculated plates were incubated for h at uc, rpm. afterwards, the temperature was reduced to uc, incubation continued for h and protein expression induced by addition of ml of . m b-d- thiogalactopyranoside (iptg). incubation persisted overnight at uc, rpm. the cells were harvested by centrifugation ( x g, min), the supernatant discarded and the pellets frozen at - uc. after min the pellets were resuspended in ml of easylyse tm bacterial protein extraction solution (epicentre) and incubated for min at room temperature. dnase i was mixed with dnase reaction buffer ( mm tris-hcl, . mm mgcl , mm cacl ), added to the reaction and incubation was carried on for min at uc. the plates were centrifuged to collect cell debris for min at x g. for each sample ml of lysate were transferred to microtiter plates (genetixx), which were used as reservoirs for the spotting procedure. the samples were spotted onto halolink tm slides (promega) using the qarray microarray spotter (molecular devices). different samples were spotted per slide with three replicate slides per screening. in total samples were screened on slides (n = ). each sample was spotted as quadruplicates with controls in two identical sets of eighteen subarrays each (total number of spots per slide ). the controls used included ht-hisj, ht-cjaa and ht-peb a as positive reference proteins as these have been described as immunodominant before. as specificity controls ht-argc and ht-pyrc were used, representing proteins without known immunodominant behaviour, thus binding of the polyclonal antibodies is not expected. in addition, two different e.coli strains -acella tm electrocompetent cells and krx single-step competent cells (promega) -were spotted as further controls. as those two lack proteins expressed with a halotagh, they are used as negative controls. after spotting of the samples, the slides were incubated for h at room temperature in a humidity chamber. next, slides were washed with pbs + . % igepalh ca- (pbsi, sigma aldrich) and dried by nitrogen flow. the well proplate tm module (grace biolabs) was attached to each slide. the top chamber was filled with . ml of rabbit-polyclonal antibody to c. jejuni (acris, mg/ml) in pbs. the bottom chamber was incubated with pbs only. after h of incubation at room temperature with gentle rocking, both chambers of each slide were washed three times with ml of pbsi. secondary antibody (goat-polyclonal to rabbit igg conjugated with chromeo tm - , abcam, mg/ml) was subjected to each chamber in pbs and the slides were incubated at room temperature for h in the dark under gentle rocking. finally, slides were washed three times with pbsi, the proplate tm modules removed and the slides dried by nitrogen flow. the slides were scanned on an axon genepix a laser scanner (molecular devices) with the following settings: nm laser, pmt gain , % laser power, lines to average , mm resolution and standard green emission filter at nm. the raw data sets of all the microarray analyses in this publication have been deposited in ncbi's gene expression omnibus [ ] and are accessible through geo series accession number gse . the median fluorescence intensity of each spot corrected by the local background (median f -b ) was used. further, relative fluorescence intensity (rfi) was calculated by subtracting the signals of the bottom chamber from the raw data signals of the top chamber to account for non-specific binding of secondary antibodies. for screening of expression libraries we used the contrast method with either argc or pyrc as specificity control to determine the contrast via the formula: with rf f i control the median of all rfis of the control used: clones harbouring strong signals in microarray screening were selected to be sequenced. sequencing was performed externally by lgc genomics using ht f ( acatcggcccgggtct-gaatc ) and flxr ( cttcctttcgggctttgttag ) primers. after sequencing and identification of potentially immunodominant proteins, primers were designed to generate full-length clones for each identified gene, see s for a list of the primers used. cloning was performed as mentioned above with slight modifications. the pfn a vector was linearized using the following primer set; a if linear for ( gtttaaac-gaattcgggctc ) and a if linear rev ( ggcgatcgcgttatcgctctg ) with pcr conditions as mentioned before. protein expression, lysis and spotting of fulllength proteins were performed as described above. the slides were incubated for h at room temperature in a humidity chamber. for incubation with antibodies well or well proplate tm modules (grace biolabs) were attached to the halolink tm slides. processing of the slides was done similar to the original screening, however several different antibodies were used, see section antibodies. for testing of immunodominant characteristics with elisa, the crude lysate was first purified using halolink tm magnetic beads (promega) following the manufacturer's instructions. the proteins of interest were subsequently cleaved off by digestion with protev protease (promega) and concentration was determined by nanodrop measurements. the samples were diluted to a total protein content of mg/ml in pbs and ml of each sample was added to maxisorb plates (nunc). each sample was analyzed at least in triplicate. the elisa plate was covered with a lid and incubated overnight at uc in a humidity chamber. after five washing steps each with pbs + . % tween- (pbst), the plates were blocked using ml % non fat dried milk in pbs per well for h. afterwards, plates were washed three times with pbst. ml of primary antibody solution (c = mg/ml) in pbs containing % non fat dried milk were applied to each well using the respective desired antibody or pbs for controls. the plates were incubated for h at room temperature and washed four times with pbst. next, ml of conjugated secondary antibody (goat polyclonal to rabbit igg conjugated with horseradish peroxidase, abcam ab , c = ng/ml) were added to each well and incubation carried on for h. finally, plates were washed once again four times with pbst and ml , , , -tetramethylbenzidine (tmb, sigma-aldrich) was added to each well for detection. after min of incubation at room temperature in the dark, the reaction was stopped by applying ml of m h so to each well. the optical density of each well was measured using the omega fluostar (bmg labtech) at a wavelength of nm. primers were designed using primer [ ] within geneious pro . . [ ] . the sequenced inserts were identified by blast [ ] .peptide sequence secondary structures were predicted using the emboss garnier [ ] algorithm and the transmembrane regions predicted by tmhmm . [ , ] . antigenic sites were predicted by emboss antigenic [ , ] . data evaluation was performed by originpro g (originlab) and microsoft excel. dimensional structure predictions were performed using the swiss model automated mode [ ] [ ] [ ] [ ] [ ] and pdb files were visualized and analyzed by the ucsf chimera package [ ] . chimera is developed by the resource for biocomputing, visualization, and informatics at the university of california, san francisco (supported by nigms p -gm ). analysis of full-length proteins was achieved by combining the results from elisa and microarray data. hence, the rfi of each sample was calculated. next, a normalized rfi was generated by dividing the rfi of each sample by the median rfi of all the samples within an area of interest, i.e. incubation compartment. from these normalized rfis a median and standard deviation was calculated. if the median normalized rfi of the positive control was below the median normalized rfi of any of the negative references whilst considering the standard deviations, the test was rendered invalid. if the test passed the above criterion, the q values were calculated as follows: with rf f i sample the median of normalized rfis of the sample and rf f i pos:control the median of the normalized rfis of the positive control hisj respectively cjaa. the resulting error was calculated by error propagation according to gauss. finally, incorporating all valid tests, the mean q value was determined along with its resulting error following error propagation by gauss, see table . several proteins were chosen for epitope mapping. these were the proteins encoded by cj , cj , cj , cj , cj , cj and cj . the proteins were divided into -mer oligopeptides with an overlap of amino acids in silico. the synthesis and coupling to microarray slides was performed externally by jpt peptide technologies gmbh. each peptide sequence was applied times to one slide. the slides were used with proplate -well chamber system (grace) allowing for incubation with different antibodies. first, the slides were blocked with superblock blocking buffer (thermo fischer) for h, washed five times with pbs + . % tween- , primary antibodies applied, incubated overnight at uc with mild rocking, washed again, secondary antibodies applied for h in the dark and after a final washing procedure, dried and scanned as above. three different primary antibodies to c.jejuni were tested. the bottom chamber was always used as a control chamber, incubated only with secondary antibody. the peptide tlikelkrlgi and derivatives thereof substituting one amino acid for alanine were synthesized and coupled to microarray slides by jpt peptide technologies gmbh. each peptide was applied times to one slide. incubation procedure was performed as described above for epitope mapping. the expression of the desired halotagh fusion proteins was checked by sds-page. after lysis of cells, ml of each protein extract was mixed with ml of mm halotagh alexa ligand. after addition of ml x tbs ( mm tris, mm nacl, ph . ) the reaction was incubated at room temperature for minutes. ml of each reaction were removed, mixed with ml of x loading buffer (fermentas) and ml dtt and heated for min at uc. the separation was performed on a mini-protean tgx gel (biorad, any kd, wells) in a protean ii xi cell chamber (biorad) for min at v. as a size reference benchmark fluorescent protein standard (life technologies) was used. fluorescence was measured in a fla- (fujifilm) with excitation at nm. figure s rin. electropherogramm and rna integrity number (rin) for sample , a total rna isolated from c. jejuni nctc , after analysis using the rna pico kit and the agilent bioanalyzer . the rin equals and the ratio of s to s rrna is . . on the right hand, a virtual gel picture is presented as calculated by the agilent expert software. (tif) figure s likelkrlgi is conserved in c. jejuni. the protein sequence of cj was blasted and subsequently aligned according to a blosum matrix. as a reference sequence tlikelkrlgi of q pak , the protein encoded by cj from c. jejuni nctc was used. the dots indicate an agreement to the reference, while differences are given by the one letter amino acid code. the first sequences including the references are % identical and are all derived from c. jejuni proteins. for other species of campylobacter such as c. coli or c. upsaliensis several of the residues are replaced. for other organisms, especially helicobacter pylori the degree of replaced residues increases. the positions showing the most conservation throughout are residues , , and . however, residue , the glycine which was revealed to be paramount for the binding of the antibody is not found in helicobacter pylori proteins. (pdf) figure s specific vs. non-specific binding to potential linear epitopes of cj c. the bars represent the mean values (n = ) of rfi values after incubation with polyclonal antibody to c. jejuni (green) and salmonella enterica (orange). the mean values for each peptide fall within the same range or possess overlapping standard deviations. thus, no specific interaction of the antibody to the epitope is present; rather a non-specific binding seems likely. (tif) figure s binding specificity assay of tlikelkrlgi with anti-salmonella antibodies. the different sequences tested in alanine scanning are shown in the box-whisker-plot (n = ) with each box representing % of the values. the whiskers encompass % of the values, the median is indicated by a horizontal line and the mean represented by a small rectangle. the boxes in green on the left represent the results after incubation with polyclonal antibody to s. enterica. for comparison, the two red boxes show the original signals from fig. for the sequence tlikelkrlgi as well as tlikelkrlai, after incubation with polyclonal antibodies to c. jejuni. all the green boxes fall into the same range as the altered sequence tlikelkrlai, where alanine replaced the glycine residue, which possessed only % intensity of the original sequence. thus, no specific interaction of the antibody to the epitope is present; rather a non-specific binding seems likely. (tif) figure s primers used in this study. the name of each primer, the corresponding target gene or vector and its sequence in to direction is given. for each gene, f represents the forward primer and r the reverse. the primers were used for cloning in the in-fusion smarter tm directional cdna library construction kit. (xls) proposal for a new family epidemiologisches bulletin campylobacter jejuni and related species campylobacter and guillain-barré syndrome genetic basis for the variation in the lipooligosaccharide outer core of campylobacter jejuni and possible association of glycotransferase genes with post-infectious neuropathies risk factors for sporadic campylobacter infection in the united states. a case-control study in foodnet sites typing of campylobacter jejuni and campylobacter coli isolated from live broilers and retail broiler meat by flaa-rflp, mlst, pfge and rep-pcr rapid pulsed-field gel electrophoresis protocol for subtyping of campylobacter jejuni evaluation of three commercial latex agglutination tests for identification of campylobacter spp molecular cloning: a laboratory manual, third edition severe acute respiratory syndrome diagnostics using a coronavirus protein microarray immunogenic cross-reaction among outer membrane proteins of gram-negative bacteria microarray-based method for screening of immunogenic proteins from bacteria proteinprotein interactions: analysis of a false positive gst pulldown result halotag: a novel protein labeling technology for cell imaging and protein analysis validation of two ribosomal rna removal methods for microbial metatranscriptomics polyadenylic acid sequences in e. coli messenger rna identification of the gene for an escherichia coli poly(a) polymerase a simple method to enrich mrna from prokaryotic rna magnetic capturehybridization method for purification and probing of mrna for neutral protease of bacillus cereus direct detection of recombinant gene expression by two genetically engineered yeasts in soil on the transcriptional and translational level normalization of full-length-enriched cdna duplex-specific nuclease efficiently removes rrna for prokaryotic rna-seq ligation-independent cloning of pcr products (lic-pcr) high efficiency transformation of e. coli by high voltage electroporation the campylobacter jejuni/coli cjaa (cj c) gene encodes an n-glycosylated lipoprotein localized in the inner membrane genetic diversity of the campylobacter genes coding immunodominant proteins immunogenicity and immunoprotection of recombinant peb in campylobacter-jejuni-infected mice the national center for biotechnology information's protein clusters database characterization of genetically matched isolates of campylobacter jejuni reveals that mutations in genes involved in flagellar biosynthesis alter the organism's virulence potential nutrient acquisition and metabolism by campylobacter jejuni identification of campylobacter jejuni genes contributing to acid adaptation by transcriptional profiling and genome-wide mutagenesis structure, function, and evolution of bacterial atp-binding cassette systems the protein data bank: a computer-based archival file for macromolecular structures the . crystal structure of abc transporter from thermatoga maritima toward the estimation of the absolute quality of individual protein structure models construction and evaluation of normalized cdna libraries enriched with full-length sequences for rapid discovery of new genes from sisal (agava sisalana perr.) different development stages construction of normalized rna-seq libraries for next-generation sequencing using the crab duplex-specific nuclease atp-binding cassette transporters are targets for the development of antibacterial vaccines and therapies a glycine-rich bovine herpesvirus (bhv- ) ge-specific epitope within the ectodomain is important for bhv- neurovirulence identical igm antibodies recognizing a glycine-alanine epitope are induced during acute infection with epstein-barr virus and cytomegalovirus glycine-rich cell wall proteins act as specific antigen targets in autoimmune and food allergic disorders epitope mapping x-ray crystallography of antibodies b-cell epitopes: fact and fiction antigenic diversity among helicobacter pylori vacuolating toxins functional peptide microarrays for specific and sensitive antibody diagnostics serodiagnosis of echinococcus spp. infection: explorative selection of diagnostic antigens by peptide microarray three-dimensional structure of an antibody-antigen-complex epitope mapping using the x-ray crystallographic structure of complement receptor type (cr )/cd : identification of a highly inhibitory monoclonal antibody that directly recognizes the cr -c d interface high-resolution epitope mapping of hghreceptor interactions by alanine-scanning mutations characterization of an anti-borrelia burgdorferi ospa conformational epitope by limited proteolysis of monoclonal antibody-bound antigen and mass spectrometric peptide mapping functional reconstruction and synthetic mimicry of a conformational epitope using clips technology diagnostic peptide discovery: prioritization of pathogen diagnostic markers using multiple features best: improved prediction of b-cell epitopes from antigen sequences an introduction to epitope prediction methods and software immunoinformatics and the in silico prediction of immunogenicity. an introduction benchmarking b cell epitope prediction: underperformance of existing methods the rin: an rna integrity number for assigning integrity values to rna measurements gene expression omnibus: ncbi gene expression and hybridization array data repository primer on the www for general users and for biologist programmers geneious v basic local alignment search tool analysis of the accuracy and implications of simple methods for predicting the secondary structure of globular proteins predicting transmembrane protein topology with a hidden markov model: application to complete genomes a hidden markov model for predicting transmembrane helices in protein sequences a semi-empirical method for prediction of antigenic determinants on protein antigens new hydrophilicity scale derived from high-performance liquid chromatography peptide retention data: correlation of predicted surface residues with antigenicity and x-ray derived accessible sites the swiss-model workspace: a web-based environment for protein structure homology modeling the swiss-model repository and associated resources swiss-model: an automated protein homology-modeling server swiss-model and the swiss-pdbviewer: an environment for comparative protein modeling protein modeling by e-mail ucsf chimera -a visualization system for exploratory research and analysis the c. jejuni strain nctc was a kind gift of the group of s. bereswill (department of microbiology and hygiene, charité -university medicine berlin, berlin, germany). sh is greatly indebted to martina obry for her assistance during expression library construction and clone isolation. the authors would like to thank simone aubele for technical assistance. we also gratefully acknowledge michaela schellhase for microarray printing. conceived and designed the experiments: sh mvnr. performed the experiments: sh. analyzed the data: sh ffb mvnr. contributed reagents/materials/analysis tools: sh. wrote the paper: sh ffb mvnr. key: cord- - l fthw authors: jin, fan; yu, chen; lai, luhua; liu, zhirong title: ligand clouds around protein clouds: a scenario of ligand binding with intrinsically disordered proteins date: - - journal: plos comput biol doi: . /journal.pcbi. sha: doc_id: cord_uid: l fthw intrinsically disordered proteins (idps) were found to be widely associated with human diseases and may serve as potential drug design targets. however, drug design targeting idps is still in the very early stages. progress in drug design is usually achieved using experimental screening; however, the structural disorder of idps makes it difficult to characterize their interaction with ligands using experiments alone. to better understand the structure of idps and their interactions with small molecule ligands, we performed extensive simulations on the c-myc( – ) peptide and its binding to a reported small molecule inhibitor, ligand -a . we found that the conformational space of the apo c-myc( – ) peptide was rather dispersed and that the conformations of the peptide were stabilized mainly by charge interactions and hydrogen bonds. under the binding of the ligand, c-myc( – ) remained disordered. the ligand was found to bind to c-myc( – ) at different sites along the chain and behaved like a ‘ligand cloud’. in contrast to ligand binding to more rigid target proteins that usually results in a dominant bound structure, ligand binding to idps may better be described as ligand clouds around protein clouds. nevertheless, the binding of the ligand and a non-ligand to the c-myc( – ) target could be clearly distinguished. the present study provides insights that will help improve rational drug design that targets idps. intrinsically disordered proteins (idps), discovered in the s, are proteins that lack a stable three-dimensional native structure under physiological conditions [ ] [ ] [ ] [ ] [ ] . idps are sometimes described as ''protein clouds'' because of their structural flexibility and dynamic conformation ensemble [ ] . various bioinformatics methods have been developed to predict idps based on their sequences [ , ] . it was revealed that idps are abundant in all kingdoms of life; for example, more than % of the proteins in eukaryotic cells possess disordered regions longer than residues [ , ] . because of the flexibility of the chain and the resulting advantages in protein-protein interactions [ , , ] , idps play important roles in various critical physiological processes such as the regulation of transcription and translation [ ] , cellular signal transmission, protein phosphorylation and molecular assemblies [ , , ] . on the other hand, idps also have some adverse effects. it was revealed that many idps are associated with human diseases such as cancer, cardiovascular disease, amyloidosis, neurodegenerative diseases, and diabetes [ ] . it was also reported that the swiss-prot keywords for eleven severe diseases are strongly correlated with idps [ ] . given their abundance and their biological importance, idps are regarded as promising and potential drug targets [ , [ ] [ ] [ ] . compared with rational drug design targeting ordered proteins [ ] [ ] [ ] , drug design targeting idps is still in its infancy. though some general strategies have been proposed [ ] , most of the studies [ ] [ ] [ ] [ ] [ ] [ ] [ ] have been limited to only a few systems, namely, p -mdm , ews-fli and c-myc-max. among them, the oncoprotein c-myc is an encouraging example. c-myc is a transcription factor with a basic helix-loop-helix leucine zipper (bhlhzip) domain which becomes active by forming a dimer with its partner protein max [ ] . in their unbound forms, both c-myc and max are disordered. however, in the dimerized forms, they undergo coupled folding and binding. in most cancers cells, c-myc protein is expressed persistently by a mutated myc gene, causing its unregulated expression in cell proliferation and signal transmission. therefore, inhibiting either the overexpression of c-myc and/or its dimerization with max may provide a therapy for cancer. yin et al. [ ] have used high-throughput experimental screening to successfully identify seven compounds that inhibit dimerization between c-myc and max. further biophysical studies using nuclear magnetic resonance (nmr), circular dichroism (cd) and fluorescence assays have verified three different binding sites (residues - , - , and - ) in the bhlhzip domain of c-myc [ ] . these binding sites contain several successive residues that can independently bind different small molecules [ ] [ ] [ ] . it should be noted that, after binding with the small molecule inhibitors, the c-myc sequence remains disordered, making the detailed experimental characterization of the molecular interactions almost impossible. therefore, the inhibition mechanism is still unclear. for example, a recent study using drifttime ion mobility mass spectrometry suggested that the binding between c-myc and these inhibitors is not as specific as previously thought [ ] . the lack of conformation data also hampers the application of the well-developed structure-based drug design approach to optimize the inhibition. molecular simulations are useful in understanding the characteristics of idps because they can provide an atomic description of molecular interactions. coarse-grained models [ , [ ] [ ] [ ] and allatom simulation [ ] [ ] [ ] [ ] [ ] [ ] [ ] have both been used to investigate idps. recently, knott and best [ ] used large-scale replica exchange molecular dynamics (remd) simulations with a well-parameterized force field to obtain a conformational ensemble of the nuclear coactivator binding domain of the transcriptional coactivator cbp. their simulation results were in good agreement with nmr and small-angle x-ray scattering measurements, validating the efficacy of all-atom simulations in exploring the highly dynamic conformations of idps. for the c-myc/inhibitor complex described above, michel and cuchillo [ ] built a structural ensemble using all-atom simulations for c-myc - with and without an inhibitor ( -f ) and found that -f bound to multiple distinct binding sites and interacted with c-myc - . however, because the c-myc segment used in their simulation contained only the residues that covered the binding sites of -f (residues - ), it is unclear how the inhibitors would interact with longer segments of c-myc and how specific the interaction would be. in the present study, we conducted extensive all-atom molecular dynamic (md) simulations to investigate the c-myc - conformational ensemble and its interactions with a smallmolecule inhibitor ( -a ). first, we performed implicitsolvent remd simulations to clarify the conformational features of the unbound c-myc - . next, we performed md simulations with an explicit water model to explore in detail the interactions between c-myc - and -a . finally, a negative control using a different peptide segment (c-myc - ) was simulated to address the issue of interaction specificity. the conformational ensemble that we obtained will be useful not only in clarifying the structural features of c-myc and the binding mechanism with inhibitors, but also in providing reference structures for drug design targeting c-myc via structure-based approaches. conformational sampling of idps for molecular modeling is challenging because the energy landscapes of idps are relatively flat [ , ] . in the present study, extensive remd simulations using an implicit solvent model were performed to explore the conformational characteristics of c-myc - . the accumulative total of simulation time reached . ms (see methods). c-myc - is a -residue truncated construct of a full-length c-myc. the conformational properties of c-myc - in its bound state (with -a ) and more dynamic unbound state, have been studied experimentally using cd and nmr spectroscopy, and a likely average conformation was built based on chemical shift data which is not meant to (and cannot) define detailed structural features [ ] . we compared our simulation results with the available experimental results. to assess the sampling quality of the remd simulations, we computed h and c chemical shifts from the simulated conformational ensemble using shifts [ ] and compared the computed values with the experiment values ( figure ). the agreement is reasonable, though not excellent. deviations between the average chemical shift values for a simulated ensemble and experimental values have been observed previously in several studies on idps [ , , ] . the chemical shift calculation performed using several other software (shiftx [ ] , camshift [ ] , sparta+ [ ] ) also showed deviations between the computed and experimental values ( figure s ). a possible reason is that chemical shifts are difficult to calculate accurately and the underlying parameterizations applied in current software for the calculation of chemical shifts have been optimized for ordered proteins but not for idps [ ] . interestingly, when we backcalculated chemical shifts from the nmr-refined structure using either the shifts [ ] or shiftx [ ] software, the resulting values also deviated from the experimental ones ( figure s ). in addition, the ensemble nature of idp conformations suggests that the chemical shifts of idps should be described as a distribution, and not merely as average values. the calculated distributions of the h a chemical shifts obtained from our simulations are summarized in figure . all the h a chemical shifts are distributed over a broad range. the experimental values, indicated by arrows in figure , are located close to the centers of the distributions, indicating the validity of the conformational sampling. data for the h n , c a and c b chemical shifts are given in figure s , showing similar behaviors as the h a chemical shifts. we also computed the distribution of the backbone dihedral angles (ramachandran (w,y) plot) for the simulations and the dihedral angles of the nmrrefined apo structure lie well within the simulation distributions ( figure s ). the secondary structure content of the simulated structures was also calculated [ , [ ] [ ] [ ] and compared with that estimated from the experimental chemical shifts (figure ). the helix and polyproline ii content of the simulated structure were consistent with the experimental structures ( figure a ). however, the sheet content of the simulated structures was much lower than the sheet content of the experimental structures. in a previous study [ ] on a shorter c-myc segment, c-myc - , a similar underestimation of sheet content was observed in the simulated structures. the deficiency of sheet content in the simulated structures might be caused by a bias in the force fields. although c-myc - is intrinsically disordered, it possesses a high content of residual helical structure (. %). the simulated helix propensity ( figure b ) showed three helical regions separated by proline residues, pro and pro . intrinsically disordered proteins (idps) exist as conformational ensembles that change rapidly. they are an important and common class of proteins in all kingdoms of life. idps are widely associated with human diseases and may serve as potential drug design targets. however, drug design targeting idps is difficult and only limited examples have been reported. one example is the oncoprotein, c-myc, for which seven inhibitors were discovered by experimental screening. understanding how small inhibitor molecules bind to c-myc may help in understanding the binding mechanism of idps with ligands. in the present study, we conducted extensive molecular dynamics simulations to explore the binding mechanism for the c-myc peptide with an inhibitor -a . we found that -a could bind to c-myc - at different sites along the peptide chain and its binding behavior could be described as a 'ligand cloud'. even in the bound state, the structure of the c-myc - peptide remained a dynamic ensemble. compared to c-myc peptides that do not bind to -a , c-myc - binds selectively with -a , but the specificity of binding was not high. the interactions of idps with ligands can perhaps be described as a scenario in which ligand clouds around protein clouds. to clarify the conformational features of c-myc - , backbone-rmsd clustering with a cutoff of . Å of the conformations was performed. representative structures (the central structure of each group) of the first eight groups were depicted in figure . they are all somewhat collapsed compared to the fully extended structure and possess a rich residual helical structure. these states with considerable population will be useful references for rational drug design targeting c-myc. the existence of residual structure may be related to the functional misfolding that prevents idps from unwanted interactions with non-native partners [ ] . a quantitative analysis on the distributions of dimension and helix content was provided in figure s . the mean radius of gyration is around . . Å , which is much smaller than the expected value of random coils ( . Å ) under the same chain length. the mean helix content of the conformational ensemble is . . %, showing a broad distribution. these results indicated that c-myc - is disordered in nature and interconversions between dispersed structures occur. to reveal how the conformations of apo c-myc - were stabilized, we analyzed the lennard-jones and electrostatic residue-residue interactions among all the residues ( figure s ). the lennard-jones interaction matrix was rather weak ( figure s a ), indicating that the conformations were disordered and that the packing in the collapsed structures was poor. this finding is consistent with the contact map, which showed that residueresidue contacts were dispersed and low in magnitude ( figure s b ). the electrostatic interactions, on the other hand, were comparatively strong ( figure s c ), probably because nearly onethird of the residues in c-myc - ( out of the ) are charged residues. the favorable electrostatic interactions of the arg , arg , lys , lys and lys residues with the asp , glu , glu and glu residues ( figure s c ) are the result of the electrostatic attraction between residues with opposite charges. residues like ser and gln also contributed to the electrostatic interactions by forming hydrogen bonds ( figure s d ). therefore, charge-pair interactions and hydrogen bonds were the main stabilized factors for the c-myc - conformations. we conducted md simulations with an explicit solvent model to investigate the interactions between c-myc - and the inhibitor -a . -a is the only inhibitor (among seven inhibitors of c-myc) that binds to the - sites in loop region of the bhlhzip domain of c-myc and we wanted to see whether or not stable local structures were induced when -a interacted with the flexible loop region. in the experimental study, -a is a mixture of two chiral forms, the s and r forms ( figure ). in the simulations, both chiral forms were tested. for comparison, the apo c-myc - was simulated with the same explicit solvent model. the accumulative simulation time for each group was ms (see methods). we calculated and compared the simulated chemical shifts with experimental chemical shifts for both implicit solvent remd and explicit solvent simulations ( figures s , s , s , s ). reasonable agreements were found. for example, the average discrepancy between the simulated and experimental chemical shifts for ha atoms of apo c-myc - is . and . in the md simulations with explicit solvent model and remd simulations, respectively (see table s ). the relative binding free energy of c-myc - with the two chiral -a forms was analyzed from the md trajectories for the remd simulations (red), the helix and sheet content was computed using the dssp [ ] method; the polyproline ii content was computed with the pross software [ ] . for the experimental data (black), the secondary structure content was estimated from the chemical shifts using d d [ ] . b helix propensity from the remd simulations using the dssp method. doi: . /journal.pcbi. .g using the molecular mechanic/poisson-boltzmann surface area (mm/pbsa) method [ ] . the results of this analysis, together with the average non-bonded interactions u non-bonded (lennard-jones and electrostatic potentials) between c-myc - and -a , are given in table . we found that the interaction between c-myc - and the s form of -a was much stronger than the interaction with the r form. the difference of u non-bonded between the s and r forms ( . kcal/mol) was close to the difference of dh from mm/pbsa ( . kcal/mol). the difference of binding free energy between the s and r forms was . kcal/mol, resulting in a binding-affinity ratio of for the s and r forms. therefore, compared with the binding of the s form to c-myc - , the binding of the r form can be ignored. thus, only the holo system with the s form of -a is discussed further. hammoudeh et al. [ ] reported an induced circular dichroism (icd) effect on c-myc - by the binding of a racemate ( : mixture of the s and r forms) of -a . there were two possible reasons for the observed icd effect [ ] ; either the chiral surroundings affected the absorption transition of the compound, or the enantiomer-specific effect (the different binding affinity of the s and r forms) led to the icd effect. we have shown above that the s form of -a bound much stronger with c-myc - than the r form. therefore, we suggest that it was the enantiomer-specific effect that was responsible for the observed icd effect. further experiments using single chiral forms of -a would be helpful in clarifying this observation. we clustered the conformations from md simulations with the explicit solvent model for both the apo and holo c-myc - peptide based on rmsd of the backbone atoms. figure and showed the representative conformations for the top eight clusters of the apo and holo peptides. it is clear that both the apo and the holo peptides have a rather broad conformation distribution, which is typical of disordered proteins. upon binding to the ligand -a , the conformational distribution became more condensed. the top eight conformation clusters of the holo peptide were more highly populated compared to that of the apo peptide, with a total of about % occupancy compared to %. similar to the apo c-myc - structure, the holo c-myc - structure is rich in helical structures. a quantitative analysis indicated that the helix and polyproline ii content was almost unaffected by the binding of -a ( figure s ), while the sheet content was enhanced (see also in figure ). the electrostatic interactions (from both charged residues and hydrogen bonding) dominated the intramolecular stabilizing force for holo c-myc - ( figure s ). the binding free energy was calculated using the mm/pbsa method [ ] . all quantities are in kcal/mol. *u non-bonded is averaged non-bonded potential, which is composed of lennard-jones and electrostatic potential. -a usually binds simultaneously to two or more regions that are flanked by several residues. the binding was highly dynamic and could switch between different modes within a trajectory. the time percentage of binding for each residue was calculated and is shown in figure . three binding sites were detected, which included site i (residues to ), site ii ( to ), and site iii ( to ). site i was near the n-terminal and showed stronger potency than that of the other two sites. this result was supported by the intermolecular interaction analysis (figure ), which showed that both the electrostatic and lennard-jones interactions for site i were much stronger than those of the other two sites. in fact, in the latter cases, hydrogen bonds hardly formed and the electrostatic interactions were weak. site i was similar to the experimentally determined binding site of -a on c-myc at residues - [ ] . binding at all the other sites generated in our simulations was much weaker, which would make them difficult to be observed experimentally. the low residue interaction specificity that we observed in the simulations is consistent with a recent simulation on an residue peptide of c-myc - that suggested that ligand binding was driven by weak and nonspecific interactions [ ] . the mass spectrometry experiment on c-myc reported by harvey et al. [ ] also supported this conclusion. to further investigate the inherent specificity features of idps, we conducted a negative control study in which we chose another segment of c-myc (residues - ) that does not bind with -a [ ] . the simulated binding between c-myc - and -a is shown in figure the specificities of idps in molecular recognition are complicated [ ] . our simulation results showed that the specificity of c-myc in binding the small-molecule ligand -a was not high. c-myc is a typical example of idps. it is sticky and binds the ligands at different regions with different interaction strengths. because of the lack of coupled folding and binding, after binding, c-myc is still in an ensemble with diverse conformations and the distinct conformations are all capable of binding the ligand. furthermore, for a given c-myc structure, the binding of ligand occurred at disperse sites ( figure ). we named this phenomenon ligand clouds. ligand clouds are remarkably different from the type of binding that is found in ordered proteins where a dominant binding structure is formed. we expect that ligand clouds may be a general feature for idps binding with small-molecule ligands. for idps binding with macromolecule partners, it was reported that some idps remain disordered in the holo state [ ] ; for example, b-catenin/tcf , b-catenin/apc peptide, b-catenin/apc phosphorylated, vif/elob/eloc, and errclbd/pgc- a. these idp complexes assume dynamic structures upon binding, suggesting that idps may interact with their partners in a similar manner to the ligand clouds. the ligand clouds concept supports the idea that there is no definite binding mode in the interactions between idps and small-molecule inhibitor [ ] . it suggests that the interactions could be described as protein clouds interacting with ligand clouds. the ligand cloud concept describes a scenario for the interactions between idps and small-molecule ligands and may provide a basis for drug design targeting idps. a straightforward strategy for rational drug design on idps is to extract metastable structures from simulations and then to conduct a virtual screen on them to identify potential inhibitors. a similar strategy was applied successfully in designing an inhibitor for ab fibrillation [ ] . however, the ligand clouds concept for small molecules binding with idps implies that different strategies from those used for ordered proteins should be developed for better rational drug design on idps. for example, because ligand binding on idps occurs in disperse locations and in different orientations, multimode interactions should be considered in the scoring functions instead of the single-mode interaction that is commonly used for other proteins. therefore, schemes that can consider binding energy landscapes [ ] might be expected to perform better when designing small molecule ligands for idps. on the other hand, in contrast to the conventional ordered proteins that are in either ''binding'' or ''non-binding'' states with small molecules, idps are ''sticky'' and would be either in ''strong binding'' or ''weak binding'' with small molecules. so more cares should be paid to the problem of specificity in drug design targeting idps. for conventional ordered proteins, the binding conformation is unique which could be selected from pre-existing conformations (the conformational selection mechanism) or be induced (the induced fit mechanism) by particular ligands. the scenario of ligand clouds around protein clouds for idps indicates that multiple protein conformations are selected and/or induced by the binding of a ligand on idps. this may extend the conformational selection-induced fit continuum in a new dimension. in conclusion, we conducted extensive simulations to explore the conformational ensemble of c-myc - and its complex with a small-molecule inhibitor -a . the conformational space was found to be rather dispersed. in contrast to conventional structured proteins, the conformations of c-myc - were mainly stabilized by charge interactions and hydrogen bonds. upon binding to -a , c-myc - remained disordered. the -a ligand bound at different sites throughout the c-myc - chain with different strength. accordingly, a ligand cloud concept was proposed, that is, the interactions between small molecule ligands and idps were like ligand clouds around protein clouds. the different binding probabilities between the protein clouds and ligand clouds indicated that the ligand could be selective and thus specific. though the specificity of the binding was not high, the binding of ligand and non-ligand to the target idp could be clearly distinguished. hammoudeh et al. [ ] measured chemical shifts and several noe signals of c-myc - and predicted dihedral angle distributions and atomic contacts. to build the c-myc - peptide, we first built a completely extended conformation with the following sequence: lkrsffalrdqipelenne-kapkvvilkkatayilsvqae (accession number: p ). we then built the initial structures from the reported dihedral angles [ ] using pymol [ ] . the apo and holo structures for c-myc - were refined further using the gromacs . . software package [ ] and the amber sb force field, with the nmr data [ ] as the dihedral angle and distance restraints in the simulation. each initial structure was minimized in vacuum. then, it was solvated, minimized, and equilibrated as described below. the time step was set to . fs. finally, a ns production simulation was performed and the final structure was adopted as the refined structure. the conformations of the c-myc - peptide were sampled by remd simulations with a generalized born/surface area (gb/sa) implicit solvent model. the amber molecular simulations package was used with amber sb force fields [ ] . a total of replicas were adopted with temperatures ranging between . k and . k. all adjacent replicas attempted to exchange temperature every ps with the average exchange rate between % and %. to produce the starting conformations for an remd simulation, an initial structure (described below) was minimized using steepest descent for steps and then switched to conjugate gradient for another steps. the minimized conformation was then heated to the defined temperature over a time of ps for each replica. the obtained conformations were adopted as starting conformations in the remd simulations, which were run with a time step of fs. replica temperature was controlled with a coupling time constant of ps. bonds involving hydrogen atoms were constrained with shake. chirality restraints on the backbone were employed to prevent non-physical chiralities. ionic strength was set to . m. the cutoff for non-bonded interactions and for the gb pairwise summations involved in calculating born radii was Å to consider all probable interactions entirely. snapshots from each trajectory were stored every ps. we conducted four groups of remd simulations with different initial structures: (a) the extended structure of the peptide; (b) apo nmr refined structure; (c) the structure after a -ns md simulation at k starting from the extended conformation; and (d) the most occupied representative conformation generated previously from the remd simulations of the extended structure in (a). the simulation time for the four groups of remds was ns, ns, ns and ns, respectively. the total simulation time was . ms ( . ms per replica). the trajectories of . k, k and k were used in the further analyses except that only the trajectory of k was used in the chemical shifts calculations. to investigate the interactions between c-myc - and -a , md simulations for the complex structure were carried out with an explicit solvent model [ ] . the apo c-myc - was also simulated with the same explicit solvent model for comparison. three groups of simulations were performed, one for the apo and two for the holo (with the two chiral -a forms (see figure ) ). each group contained seven trajectories of ms, therefore, the total simulation time was ms. one of the seven initial structures was the nmr refined structures (apo and holo); the other six initial structures were adopted from representative conformations generated previously in the -ns remd simulations (for the holo structures, the -a isomers were docked using the autodock . program [ ] ). md simulations with the explicit solvent model were performed with the gromacs . . software package [ ] and am-ber sb force field under particle mesh ewald periodic boundary conditions. the tip p-ew water model [ ] was used with amber sb force field because of its previously reported good performance in other simulations of idps [ , , ] . in the holo simulations, the small molecule -a ligand involved was parameterized using a general amber force field (gaff) with acpype software [ ] . an am -bcc charge model [ ] was used to assign charges to the ligand. each initial structure was immersed in an explicit tip p-ew truncated octahedral water box. the dimensions of the box, defined as the distance between the farthest atoms of the peptide and the edge of the box, was set to Å . the system was neutralized by adding ions, and extra nacl was added to represent a solution with an ionic strength of . m. the system was minimized using the steepest descent minimization approach. after the minimization, the system was equilibrated in the nvt ensemble with all-heavy atom restrained with a force constant of kcal/mol. the temperature was maintained at k using a v-rescale thermostat with a coupling constant of . ps. further equilibration was carried on in the npt ensemble without strains, and where the pressure was maintained at atmosphere using a parrinello-rahamn barostat with the coupling constant set to . ps. both equilibrations were performed for ps with a time step of fs. for the production run, the thermostat and barostat settings were the same as for the npt run. to enable fs time steps, bonds involving hydrogen atoms were constrained to equilibration length using the lincs algorithm [ ] . a realspace cutoff of Å was used for the electrostatic and lennard-jones forces. snapshots from each trajectory were stored every ps. to further investigate the inherent specificity features of idps, we conducted a negative control study using the c-myc - truncated peptide ( eqkliseedllrkrreqlkhk-leqlrns ), which did not bind to -a . the extended structure of the peptide was used as the initial structure in an ns implicit solvent md simulation and the final structure that was generated was applied in all-atom explicit simulations. two groups of simulations were performed for each of the two chiral -a isomers. each group contained one trajectory of ms; the other parameters were the same as the parameters used for the holo c-myc - simulations described above. all the simulations were analyzed using the gromacs utilities [ ] with either pymol [ ] or in-house scripts. dsasa was used in determinations of the binding sites. upon small molecule binding, for each residue in the peptide there would be a clear decrease of sasa related to the difference between the sasa of the bound and unbound states. backbone rmsd clustering of peptide conformations was performed to identify distinct structural clusters and to estimate their populations. the relative binding free energy was calculated every ps using mm/pbsa [ ] methods. figure s comparisons of the computed and experimental chemical shifts for apo c-myc - . the computed values using shiftx (red circles), camshift (blue square) and sparta+ (green squares) are from the remd simulations and the experimental values are from hammoudeh et al. [ ] (black triangle). note that the experimental values for some residues were not available. chemical shifts are for the atoms : a h a , b h n natively unfolded proteins: a point where biology waits for physics intrinsic disorder and protein function intrinsically unstructured proteins and their functions intrinsically unstructured proteins intrinsically disordered proteins: the new sequencestructure-function relations drugs for 'protein clouds': targeting intrinsically disordered transcription factors predicting intrinsic disorder in proteins: an overview inherent relationships among different biophysical prediction methods for intrinsically disordered proteins prediction and functional analysis of native disorder in proteins from the three kingdoms of life comparing and combining predictors of mostly disordered proteins kinetic advantage of intrinsically disordered proteins in coupled folding-binding process: a critical assessment of the ''fly-casting'' mechanism smoothing molecular interactions: the ''kinetic buffer'' effect of intrinsically disordered proteins malleable machines take shape in eukaryotic transcriptional regulation exploring the binding diversity of intrinsically disordered proteins involved in one-to-many binding intrinsically disordered proteins in human diseases: introducing the d concept functional anthology of intrinsic disorder. . ligands, post-translational modifications, and diseases associated with intrinsically disordered proteins intrinsically disordered proteins are potential drug targets rational drug design via intrinsically disordered protein novel strategies for drug discovery based on intrinsically disordered proteins (idps) dynamic modeling of human -lipoxygenase-inhibitor interactions helps to discover novel inhibitors discovery of multitarget inhibitors by combining molecular docking with common pharmacophore matching virtual screening of novel noncovalent inhibitors for sars-cov c-like proteinase rational drug design via intrinsically disordered protein inhibition of the p -mdm interaction: targeting a proteinprotein interface in vivo activation of the p pathway by small-molecule antagonists of mdm a small molecule blocking oncogenic protein ews-fli interaction with rna helicase a inhibits growth of ewing's sarcoma improved low molecular weight myc-max inhibitors multiple independent binding sites for small-molecule inhibitors on the oncoprotein c-myc structural rationale for the coupled binding and unfolding of the c-myc oncoprotein by small molecules low molecular weight inhibitors of myc-max interaction and function x-ray structures of myc-max and mad-max recognizing dna: molecular bases of regulation by proto-oncogenic transcription factors smallmolecule inhibition of c-myc:max leucine zipper formation is revealed by ion mobility mass spectrometry polyelectrostatic interactions of disordered ligands suggest a physical basis for ultrasensitivity electrostatically accelerated coupled binding and folding of intrinsically disordered proteins binding-induced folding of a natively unstructured transcription factor optimizing protein-solvent force fields to feproduce intrinsic conformational preferences of model peptides anchoring intrinsically disordered proteins to multiple targets: lessons from n-terminus of the p protein energy landscape analyses of disordered histone tails reveal special organization of their conformational dynamics a free-energy landscape for coupled folding and binding of an intrinsically disordered protein in explicit solvent from detailed all-atom computations a preformed binding interface in the unbound ensemble of an intrinsically disordered protein: evidence from molecular simulations residual structures, conformational fluctuations, and electrostatic interactions in the synergistic folding of two intrinsically disordered proteins binding of two intrinsically disordered peptides to a multi-specific protein: a combined monte carlo and molecular dynamics study the impact of small molecule binding on the energy landscape of the intrinsically disordered protein c-myc ensemble modeling of protein disordered states: experimental restraint contributions and validation constructing ensembles for intrinsically disordered proteins automated prediction of n, ca, cb and c' chemical shifts in proteins using a density functional database structure and dynamics of the ab( - ) peptide from the interplay of nmr experiments and molecular simulations homogeneous and heterogeneous tertiary structure ensembles of amyloid-b peptides rapid and accurate calculation of protein h, c and n chemical shifts fast and accurate predictions of protein nmr chemical shifts from interatomic distances sparta+: a modest improvement in empirical nmr chemical shift prediction by means dictionary of protein secondary structure: pattern recognition of hydrogen-bonded and geometrical features a physical basis for protein secondary structure determination of secondary structure populations in disordered states of proteins using nuclear magnetic resonance chemical shifts intrinsically disordered proteins may escape unwanted interactions via functional misfolding mmpbsa.py: an efficient program for end-state free energy calculations do intrinsically disordered proteins possess high specificity in protein-protein interactions inhibitor discovery targeting the intermediate structure of beta-amyloid peptide on the conformational transition pathway: implications in the aggregation mechanism of beta-amyloid peptide binding energy landscape analysis helps to discriminate true hits from high-scoring decoys in virtual screening the pymol molecular graphics system algorithms for highly efficient, load-balanced, and scalable molecular simulation development of an improved four-site water model for biomolecular simulations: tip p-ew autodock and autodocktools : automated docking with selective receptor flexibility atomic-level characterization of the ensemble of the ab( - ) monomer in water using unbiased molecular dynamics simulations and spectral slgorithms acpype -antechamber python parser interface fast, efficient generation of highquality atomic charges. am -bcc model: i. method lincs: a linear constraint solver for molecular simulations the authors thank daqi yu, dr. changsheng zhang and dr. fangjin chen for helpful discussions. f.j. gratefully acknowledges the help of prof. tsun-mei chang and dr. shuangyu bi in preparing the manuscript. key: cord- -b mlt n authors: jacomin, anne-claire; samavedam, siva; charles, hannah; nezis, ioannis p. title: ilir@viral: a web resource for lir motif-containing proteins in viruses date: - - journal: autophagy doi: . / . . sha: doc_id: cord_uid: b mlt n macroautophagy/autophagy has been shown to mediate the selective lysosomal degradation of pathogenic bacteria and viruses (xenophagy), and to contribute to the activation of innate and adaptative immune responses. autophagy can serve as an antiviral defense mechanism but also as a proviral process during infection. atg -family proteins play a central role in the autophagy process due to their ability to interact with components of the autophagy machinery as well as selective autophagy receptors and adaptor proteins. such interactions are usually mediated through lc -interacting region (lir) motifs. so far, only one viral protein has been experimentally shown to have a functional lir motif, leaving open a vast field for investigation. here, we have developed the ilir@viral database (http://ilir.uk/virus/) as a freely accessible web resource listing all the putative canonical lir motifs identified in viral proteins. additionally, we used a curated text-mining analysis of the literature to identify novel putative lir motif-containing proteins (lircps) in viruses. we anticipate that ilir@viral will assist with elucidating the full complement of lircps in viruses. autophagy is a multistep process that consists of the isolation of cytoplasmic components into double-membrane vesicles, called autophagosomes, that shuttle to lysosomes, which serve as end-point degradative organelles. it is a catabolic mechanism that enables the removal of damaged or excess cellular organelles and proteins, thereby contributing to the maintenance of cell homeostasis and survival. the autophagic machinery is highly conserved from unicellular eukaryotes to metazoans. among the proteins that take part in this process, the atg -family proteins play a central role. indeed, these proteins are involved in the elongation and maturation of the autophagosome and its fusion with lysosomes. , phosphatidylethanolamine-conjugated atg -family proteins reside on autophagosomal membranes where they can contribute to the recruitment of other core autophagy machinery proteins essential for the effective course of the autophagy process. , [ ] [ ] [ ] [ ] although originally considered to be a nonselective bulk degradation mechanism, a gathering body of evidence over the past decade suggests that autophagy is much more selective than initially appreciated. selective targeting of cellular components to autophagosomes for degradation relies on the existence of selective autophagy receptors able to recognize and tether cargos toward nascent autophagosomes. , examples of selective autophagy include aggrephagy, mitophagy, lipophagy, and xenophagy. [ ] [ ] [ ] [ ] the interaction between selective autophagy receptors and atg -family proteins is essential for the proper steering of the cargo for degradation. these receptors typically contain an lc -interacting region (lir, also known as lrs, aim or gim; the latter correspond to lc recognition sequence, atg -interacting motif and gabarap interaction motif respectively) critical for the binding to atg -family proteins. , [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] the lir motif consists of a short amino acid sequence with a core motif originally described as w/f/yxxi/l/ v (where 'x' represents any amino acids, and referred to as wxxl hereafter). , this sequence has lately been relaxed and extended to amino acids to integrate most of the experimentally verified lirs. [ilv] , where the residues in positions and correspond to the most crucial ones for the interaction with atg -family proteins. [ ] [ ] [ ] besides its role in cellular homeostasis, autophagy is also involved in the innate immune response against pathogens. , , recent years have seen an outburst of studies on autophagy and viral infections. autophagy may exert a variety of antiviral functions, including the degradation of viral components (known as virophagy), the activation of innate immunity by the delivery of viral nucleic acids to the toll-like receptors, and adaptive immunity through the major histocompatibility complex ii (mch-ii/hla class ii), or the control of the production of reactive oxygen species (ros). [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] however, to be successful, viruses have evolved mechanisms to evade host defense. several viruses have thus developed strategies to use the autophagy machinery or even thrive in the autophagosomes and promote their replication, spread, and survival. [ ] [ ] [ ] [ ] other viruses susceptible to autophagy have evolved mechanisms to counteract autophagy activation by expressing proteins that interfere with the cellular machinery, essentially inhibiting the autophagosome-lysosome fusion or interfering with early stages of autophagy activation. [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] a few proteins from viruses infecting mammals and plants that interfere with the host autophagy process have been shown to associate with atg -family proteins. , [ ] [ ] [ ] [ ] [ ] [ ] yet, only one lirdependent interaction has been reported. we have developed a database, ilir@viral (http://ilir.uk/ virus/), that organizes information on the presence of lir motifs in viral proteins. additionally, a curated text-mining analysis of the literature permitted us to predict functional lir motifs in viral proteins that have already been shown to associate with atg -family members. the ilir@viral database is a web resource freely available to the academic community at http://ilir.uk/virus/. various functionalities are accessible under specific menus. the 'classification' menu gives access to the complete list of putative lir motifs in viral proteins. two virus taxonomic systems have been used: the nomenclature used by the international committee on taxonomy of viruses (ictv) to name the species, genus and families of each of the viruses cited in the database, and the baltimore classification system that groups viruses depending on their genome and kind of replication (dsdna, dsrna, ssdna, ssrna and reverse-transcribing viruses) (see methods). , for each specific family or group of viruses, the data are presented in a table containing (i) the clickable uniprotkb accession number of the protein, (ii) the information related to the lir-motif (position, sequence, pssm score, if the pattern is recognized as an xlir or wxxl motif and the presence of the motif in an intrinsically disordered region (anchor)), (iii) the name of the protein and (iv) the name of the species. for some genera, no putative lir-containing protein (lircp) could be identified; for accuracy in the classification, these are listed but appear in red (fig. ) . the 'search' menu allows the user to look in the database for a specific protein or virus order, family, subfamily, genus, species or common name. uniprot identifiers can also be used in the search function. the blast (basic local alignment search tool) menu offers the user to search the database using a protein sequence as query against the reviewed set of viral proteins from uniprot database. we have used position-specific iterative (psi) blast to search against the database. this search can be run against any of the viral classification systems described above. by default, the blast search runs against all the data available in the database with a default e-value . ; nevertheless, the user has the possibility to run the blast search against a specific category (baltimore classification) or order taxonomic rank (ictv classification), and define a different e-value. the results page displays the subject sequences from the database that match the query sequence. the lir patterns are highlighted with red asterisks. the menus 'bibliography' and 'help' provide users with additional information. finally, the 'links' menu gives access to other ilir web resources, inducing the ilir search tool and the ilir database for eukaryote model organisms. , analysis of the content of the database we have used the ilir web resource (https://ilir.warwick.ac.uk) to identify lircps in viruses (see methods for details). , out of , reviewed viral sequences available from uniprot across individual viral species we found that , of them contain either xlir or wxxl motifs. proteins figure . screenshot of the ilir@viral database 'classification' menu. example for the ictv classification system. the genera for which no lircps were found appear in red. contain xlir motifs whereas , contain wxxl motifs. proteins contain both xlir and wxxl motifs whereas proteins contain only xlir motifs (without containing wxxl patterns) and proteins contain only wxxl motifs (without xlir patterns) (table s ). we found a correlation between the total number of putative lir motifs identified in a family and number of sequences ( fig. a) . on average, we found . lir motifs per sequence. the fact that viral sequences often refer to polyproteins instead of individual proteins is possibly an explanation of the high proportion of patterns identified. the ilir web resource can make the distinction between xlir and wxxl patterns. , we noticed that a vast majority of the motifs identified in viral proteins correspond to the wxxl pattern ( fig. b and table s ). the identification of putative lir motifs has been done for all the reviewed sequences for viral proteins. among the sequences sorted as putative lir-containing proteins, sequences belong to species of bacteriophages (table s ). however, it is likely that these sequences correspond to false-positive hits as bacteria don't have an autophagy process. using a hypergeometric test (see supplementary information), we compared the enrichment fold of proteins containing lir motifs (lircps) in bacteriophages with viruses infecting eukaryotes. we noticed that the enrichment fold for both xlir and wxxl patterns in lircps was higher in viruses infecting eukaryotes than in bacteriophages (table s ) which is in line with the fact that autophagy has been reported only in eukaryotes. we have previously identified all the verified and putative lir motifs in eukaryotic model organisms. we also compared the enrichment of putative lir motifs in viruses infecting some of the model organisms (human, mouse, rat, and chicken) (table s ) . two hypergeometric tests have been conducted to compare the enrichment fold of lircps in viruses against the host: one for proteins containing all combination of lir patterns (i.e., either xlir or wxxl, or both kind) and another considering only the proteins containing at least one xlir motif (i.e., xlir patterns alone or along with one or more wxxl patterns). we observed that when all the possible combinations of lir patterns are taken into account, there is an enrichment of putative lir motifs in viruses compared with the host organism for all model organisms tested. however, there is an enrichment of xlir patterns in the host compared with the infecting viruses. the lir motifs can be divided into subtypes depending on the residue at the first position: w-, f-and y-types. it has been shown that the f-type lir motif of mammalian ulk and atg has a preference for gabarap proteins, thus suggesting that the subtype of the lir motifs could be related to a specificity toward atg family proteins. additionally f-type and y-type lir motifs are mostly contained in selective autophagy adaptor proteins. we thus analyzed the distribution of w-, f-and y-type of wxxl and xlir patterns at the viral order and family levels (table s and fig. c, d) . we observed that % and % of the putative lir motifs matching the wxxl pattern are of f-type and y-type respectively. w-type motifs are the least represented, with about % of the patterns (fig. c) . similar distribution could be observed for the putative xlir motifs with a higher variability across families, probably due to the low representation of xlir motifs compared with wxxl patterns (fig. d) . to assess the trustworthiness of our in silico screening for lircps in viruses, we compared our data with the already published viral proteins listed on the web resource viral-zone as modulators of autophagy. viralzone classifies entries related to the activation of the host autophagy, and entries linked to the inhibition of host autophagy. we found all these entries in our database. among viruses that inhibit autophagy, only proteins have been shown to interact directly with mammalian atg - also table s ). (d) distribution of the w-, f-and y-type of xlir patterns across the different families (see also table s ). family proteins: viral infectivity factor (vif) from hiv- binds to all atg -family proteins, and matrix protein (m ) from influenza was shown to bind to lc . yet, influenza m protein is the only one that contains a lir motif that has been experimentally validated. , other viral proteins listed in viralzone as being related to inhibition of the autophagy process are the neurovirulence factor icp . and rna-binding protein us from human herpesviruses, the protein nef from hiv- , and the protein trs from human cytomegalovirus. , , , negative regulation of autophagy by icp . and trs proteins depends on their ability to interact with becn /beclin ; - while us function has been linked to the protein kinase eif ak /pkr. putative, or functional, lir motifs could be identified for all these proteins using the ilir web resource, except for the rna-binding protein us . the protein m from influenza a virus is necessary and sufficient for the inhibition of the autophagic degradation of the virus by blocking the fusion between the autophagosomes and lysosomes. these results were further confirmed and extended by beale and colleagues who show that the cytoplasmic tail of m interacts in a lir-dependent manner with lc and promotes the relocalization of lc at the plasma membrane. we have identified a wxxl motif at positions to that has the highest pssm score ( to ), and corresponds to the one experimentally verified (fvsi). , it is an f-type lir motif and the fact that m protein has been shown to block the autophagosome-lysosome fusion suggests that it may act as a an adaptor protein. we were also able to identify one xlir motif in the sequence of the accessory viral protein nef of the virus hiv- group m subtype b (strain . ). nef colocalizes with lc and becn , and contributes to the inhibition of autophagic maturation, thus protecting the virus from elimination by autophagy. , the proteins trs from human cytomegalovirus and neurovirulence factor icp . from human herpesvirus (hsv) both interact with becn via a specific becn -binding domain. this interaction is required for the inhibition of autophagosome maturation and fusion with the lysosomes. [ ] [ ] [ ] while ilir could detect several wxxl motifs in trs sequence, a single one in an intrinsically disordered region was identified for icp . whose sequence ( -rqwlhv- ) is quite well conserved among strains of hsv- and one strain of hsv- . however, to date, there is no evidence of association between icp . and lc proteins. we observed that % of the reviewed sequences that contains at least one putative lir motif correspond to the genome polyprotein from various viruses. because polyproteins are processed co-and post-translationally by both host and viral proteases, we ran a systematic pubmed search with the terms 'name of the virus c autophagy' followed by 'name of the virus c lc ' to restrain the result outcome as necessary, finally we looked for papers (excluding reviews and commentaries) that specifically mention proteins derived from the processing of the viral genome polyprotein. our literature searching strategy pinpointed several nonstructural viral proteins; one of those was the protein ns from dengue viruses. studies have shown that ns protein from dengue virus type (denv- ) colocalizes with lc and that denv- particles and autophagosomes travel together during viral infection. , in contrast to denv- , ns protein from denv- displays a low level of colocalization with lc . sequence alignment of ns proteins from denv- and denv- showed that they are highly conserved. however, checking for the presence of lir motifs revealed a discrepancy between them. we observed that denv- ns has an xlir motif (asfiev) that is not recognized in any denv- ns sequences due to the substitution f to l, as well as an additional wxxl motif with a pssm score (rawnsl) that is absent in denv- (sl to vw) (fig. s ). it is possible that the absence of either the xlir or wxxl motifs (or both) in denv- is responsible for its lower affinity to lc . other nonstructural proteins from different viruses interact with lc proteins. for instance, the nonstructural protein ns a from hepatitis c virus that colocalizes and can be coimmunoprecipitated with lc proteins when ectopically expressed in various hepatoma cell lines. , , also, the viral peptide bc and the protein a encoded by the genome polyprotein from poliovirus type interact with lc -ii. , all these proteins contain wxxl motifs. finally, we were able to identify proteins from zika virus that have been just recently related to autophagy. independent studies have shown that zika virus activates autophagy and that the formation of autophagosomes is crucial to the replication of the virus. [ ] [ ] [ ] it appears that the nonstructural proteins ns a and ns b are responsible for the induction of autophagy in infected cells by inhibiting the akt-mtor signaling, and both of them contain wxxl motifs. , very little is known about the relation between zika virus infection and autophagy modulation. we have found several proteins encoded by zika genome polyprotein that contain lir motifs, that could be good candidates for experimental validation. autophagy is an evolutionarily conserved and highly regulated, intracellular catabolic mechanism that is essential for maintaining homeostasis and coping with nutrient starvation. it is increasingly appreciated that autophagy can be highly selective, and that xenophagy, the selective autophagy of pathogens, is an important aspect of the immune response, which protects against infection. a vast array of viruses are associated with autophagy, and we have found several viral proteins containing putative lir motifs that are thought to interact with the autophagic machinery via atg -family proteins. a continued research effort to better understand how these viral proteins interact with the autophagic machinery may provide therapeutic strategies and ultimately lead to the discovery of novel pharmacological agents to fight viral infections. protein sequences of all reviewed viral proteins were downloaded from uniprot database vailable at: http://www.uni prot.org/ [accessed september ]. xlir and wxxl patterns for these proteins were identified using the approach suggested previously. , for a given protein, information related to the start and end of lir pattern, actual lir sequence, pssm score (position specific scoring matrix), similar lircps and presence or absence xlir and wxxl in intrinsically disordered region were obtained. the taxonomic lineage obtained from uniprot for all the reviewed viral proteins correspond to the baltimore classification system. to do the conversion between the baltimore and ictv classification systems, we matched the organism name for each reviewed sequence from uniprot with the organism names obtained from the ictv master species list v , available at: https://talk.ictvonline.org/files/master-species-lists/ m/msl/ [accessed september ]. the differences between the classification systems were identified through a battery of sql queries. the details of these are attached in tables s , s and s . levenshtein distance (also known as edit distance) was used to compare the species name that belongs to a particular genus and family between both the classification systems. levenshtein distance is defined as the minimal number of characters required to replace, insert or delete to transform one string into another. the levenshtein distance is symmetric and it holds: here, 'x' and 'y' are strings and d(x,y) is the distance between 'x' and 'y' put as minimal cost of operations to transform 'x' to 'y'. the complexity of the algorithm is o(m à n), where n and m are the lengths of strings. we used perl extension for approximate matching (search.cpan. org, ( ). string-approx- . . retrieved from: http://search.cpan.org/ cpan/authors/id/j/jh/jhi/string-approx- . .tar.gz) the closer the value of levenshstein distance to zero, the closer are the species names. using this approach, we could reliably justify the ictv classification of out proteins loaded into the database. to share the information beyond our mysql (v . . ) dbms (database management system), we built a website using html, css, javascript and php (v . no potential conflicts of interest were disclosed. the machinery of macroautophagy network organization of the human autophagy system atg family lc /gabarap proteins are crucial for autophagosome-lysosome fusion but not autophagosome formation during pink /parkin mitophagy and starvation role of the mammalian atg /lc family in autophagy: differential and compensatory roles in the spatiotemporal regulation of autophagy lc and gate- /gabarap subfamilies are both essential yet act differently in autophagosome biogenesis plekhm regulates autophagosome-lysosome fusion through hops complex and lc /gabarap proteins atg family proteins act as scaffolds for assembly of the ulk complex: sequence requirements for lc -interacting region (lir) motifs the lir motif -crucial for selective autophagy the lc interactome at a glance aggrephagy: selective disposal of protein aggregates by macroautophagy mechanisms of mitophagy lipophagy: selective catabolism designed for lipids the role of 'eat-me' signals and autophagy cargo receptors in innate immunity p /sqstm binds directly to atg /lc to facilitate degradation of ubiquitinated protein aggregates by autophagy structural basis for sorting mechanism of p in selective autophagy the autophagy-related protein kinase atg interacts with the ubiquitin-like protein atg via the atg family interacting motif to facilitate autophagosome formation structural basis of target recognition by atg /lc during selective autophagy atg -family interacting motif crucial for selective autophagy the structure of atg b-lc complex reveals the mechanism of lc processing and delipidation during autophagy a role for nbr in autophagosomal degradation of ubiquitinated substrates nix is a selective autophagy receptor for mitochondrial clearance the tbk adaptor and autophagy receptor ndp restricts the proliferation of ubiquitin-coated bacteria nix directly binds to gabarap: a possible crosstalk between apoptosis and autophagy phosphorylation of the autophagy receptor optineurin restricts salmonella growth structural and functional analysis of the gabarap interaction motif hfaim: a reliable bioinformatics approach for in silico genome-wide identification of autophagy-associated atg -interacting motifs in various organisms ilir: a web resource for prediction of atg -family interacting proteins ilir database: a web resource for lir motif-containing proteins in eukaryotes autophagy in antiviral innate immunity autophagy in infection, inflammation and immunity autophagy protects against sindbis virus infection of the central nervous system image-based genome-wide sirna screen identifies selective autophagy factors pkr-dependent autophagic degradation of herpes simplex virus type autophagydependent viral recognition by plasmacytoid dendritic cells replication-independent activation of human plasmacytoid dendritic cells by the paramyxovirus sv requires tlr and autophagy pathways production of interferon alpha by human immunodeficiency virus type in human plasmacytoid dendritic cells is dependent on induction of autophagy antigen-loading compartments for major histocompatibility complex class ii molecules continuously receive input from autophagosomes autophagy enhances the presentation of endogenous viral antigens on mhc class i molecules during hsv- infection absence of autophagy results in reactive oxygen species-dependent amplification of rlr signaling autophagy facilitates ifngamma-induced jak -stat activation and cellular inflammation hepatitis c virus upregulates beclin for induction of autophagy and activates mtor signaling peroxisomal protein pex functions in selective autophagy fam b, the selective autophagy receptor for endoplasmic reticulum turnover, inhibits replication of ebola virus strains makona and mayinga sustained autophagy contributes to measles virus infectivity autophagy and the effects of its inhibition on varicella-zoster virus glycoprotein biosynthesis and infectivity host cell autophagy modulates early stages of adenovirus infections in airway epithelial cells the autophagy elongation complex (atg - / l ) positively regulates hcv replication and is required for wild-type membranous web formation herpes simplex virus gamma . interferes with autophagosome maturation and antigen presentation in dendritic cells matrix protein of influenza a virus blocks autophagosome fusion with lysosomes coronavirus membrane-associated papain-like proteases induce autophagy through interacting with beclin to negatively regulate antiviral innate immunity human immunodeficiency virus type- infection inhibits autophagy hiv- viral infectivity factor interacts with microtubule-associated protein light chain and inhibits autophagy irgm is a common target of rna viruses that subvert the autophagy network multi-layered control of galectin- mediated autophagy during adenovirus cell entry through a conserved ppxy motif in the viral capsid foot-and-mouth disease virus infection suppresses autophagy and nf-small ka, cyrillicb antiviral responses via degradation of atg -atg by cpro berlioz-torrent c. lc c contributes to vpu-mediated antagonism of bst /tetherin restriction on hiv- release through a non-canonical autophagy pathway autophagy functions as an antiviral mechanism against geminiviruses in plants co-localization of constituents of the dengue virus translation and replication machinery with amphisomes replication of hepatitis c virus rna on autophagosomal membranes modification of cellular autophagy protein lc by poliovirus autophagy pathway intersects with hiv- biosynthesis and regulates viral yields in macrophages a lc -interacting motif in the influenza a virus m protein is required to subvert autophagy and maintain virion stability taxonomy of viruses., king amq. virus taxonomy: classification and nomenclature of viruses: ninth report of the international committee on taxonomy of viruses expression of animal virus genomes gapped blast and psi-blast: a new generation of protein database search programs viralzone: a knowledge resource to understand virus diversity human immunodeficiency virus type nef inhibits autophagy through transcription factor eb sequestration hsv- icp . confers neurovirulence by targeting the beclin autophagy protein the human cytomegalovirus protein trs inhibits autophagy via its interaction with beclin analysis of the role of autophagy inhibition by two complementary human cytomegalovirus becn /beclin -binding proteins the herpes simplex virus us protein inhibits autophagy through its interaction with the protein kinase pkr analysis of the native conformation of the lir/ aim motif in the atg /lc /gabarap-binding proteins herpes simplex virus type i induces an incomplete autophagic response in human neuroblastoma cells single-virus tracking approach to reveal the interaction of dengue virus with autophagy during the early stage of infection a role for autophagolysosomes in dengue virus production in hepg cells interferoninducible protein scotin interferes with hcv replication through the autolysosomal degradation of ns a a functional role for ns atp in the induction of hcv ns a-mediated autophagy subversion of cellular autophagosomal machinery by rna viruses biology of zika virus infection in human skin cells zika virus ns a and ns b proteins deregulate akt-mtor signaling in human fetal neural stem cells to inhibit neurogenesis and induce autophagy zika virus infection induces mitosis abnormalities and apoptotic cell death of human neural progenitor cells anchor: web server for predicting protein binding regions in disordered proteins a guided tour to approximate string matching we would like to thank professor andrew easton (university of warwick) and professor keith leppard (university of warwick) for helpful discussions. this work is supported by bbsrc grants bb/l / and bb/p / awarded to i.p.n. key: cord- -x t lf x authors: matthews, david; emmott, edward; hiscox, julian title: viruses and the nucleolus date: - - journal: the nucleolus doi: . / - - - - _ sha: doc_id: cord_uid: x t lf x the nucleolus is a dynamic sub-nuclear structure integral to the function of a eukaryotic cell. some of its major roles involve ribosome subunit biogenesis, rna processing, cell cycle control and responding to cellular stress, such as infection. our understanding of the relationship between viruses and the nucleolus has moved from a phenomenological approach describing protein localisation to functional studies involving genetic analysis and proteomic approaches. these advances have provided fundamental insights as to how and why the nucleolus is targeted by many different viruses both to usurp normal functioning and to recruit nucleolar proteins to facilitate virus replication. this knowledge has been exploited for therapeutic strategies involving targeted inhibition of virus replication and live-attenuated recombinant vaccines. ; lee et al. ( ) ; lutz and kedinger ( ) ; tollefson et al. ( ) ; matthews and russell ( ) asfaviridae lower et al. ( ) many of these proteins have been shown to interact with nucleolar proteins the reason why rna viruses, and positive-strand rna viruses in particular, interact with the nucleolus when the site of genome replication is in the cytoplasm is less intuitive. in this latter case, viral proteins that are normally required in the cytoplasm must transit through the nuclear pore complex both to and from the nucleus. this process is crucial for virus biology because if the viral proteins that are required for cytoplasmic functions such as rna synthesis and encapsidation are sequestered in the nucleolus or nucleus, then progeny virus production will be affected as has been revealed by inhibitor and genetic studies (lee et al. ; tijms et al. ) . viruses may interact with the nucleolus to usurp host cell functions and recruit nucleolar proteins to facilitate virus replication. investigating the interactions between viruses and the nucleolus may facilitate the design of novel anti-viral therapies both in terms of recombinant vaccines (pei et al. ) and molecular intervention (rossi et al. ) , and also contribute to a more detailed understanding of the cell biology of the nucleolus. for many years our understanding of the interaction of viruses and the nucleolus was phenomenological and focused on identifying viral proteins that localised to this structure, their mechanisms of trafficking and potential interaction with nucleolar proteins (e.g. see table . ). however, recent research capitalising on advances in proteomics, viral genetics and cellular imaging techniques are beginning to increase our understanding of the mechanisms viruses use to subvert host cell nucleoli and facilitate virus biology . new data are now emerging that support the view that many viruses interact with the nucleus and nucleolus, particularly to facilitate virus replication. one of the best-studied viruses in terms of viral interactions with the nucleolus is hiv- and is described in detail in chap. . although hiv has clearly defined cytoplasmic and nuclear replication strategies, the virus has a positive-sense rna genome in the sense that the viral capsid contains two copies of positive-sense rna, but these are reverse transcribed in the cytoplasm and then trafficked to the nucleus, where ultimately the new genome is transcribed and trafficked back to the cytoplasm. part of the reasoning for the interaction of hiv- with the nucleolus is the trafficking of intronless mrna from the nucleus into the cytoplasm (michienzi et al. ) . this is a property shared with herpes viruses and indicated that different viruses have evolved similar strategies involving subversion of nucleolar function for the benefit of virus biology (boyne and whitehouse ) . in the case of hiv- , this knowledge has also led to the design and implementation of effective genetic therapies against the virus (unwalla et al. ). a large number of viruses with dna genomes have been shown to interact with nucleolus, and this perhaps is not surprising as most dna viruses replicate in the nucleus. a genome-wide screen of three distinct herpesviruses, herpes simplex virus (hsv- ), cytomegalovirus (cmv) and epstein-barr virus (ebv), has shown that at least herpesvirus-encoded proteins specifically localise to the nucleolus (salsman et al. ) , which are implicated in many aspects of the herpesvirus life cycle. therefore, a number of proteomic studies are currently being undertaken to study changes, in a global context, within the nucleolar proteome during virus infections, and are discussed later (lam et al. ) . several different herpes virus proteins have been shown to cause the redistribution of nucleolar proteins and hence disruption of the nucleolus. these include herpes simplex virus , the major tegument structural protein vp (lopez et al. ) , and the us (xing et al. ) and ul proteins (bertrand and pearson ; lymberopoulos and pearson ) . such disruption in many cases may have a direct effect on nucleolar function. a significant area of virus biology that has been investigated is the role of viral proteins that traffic through the nucleolus. for example, a number of hiv proteins that traffic through the nucleolus have been implicated in virus mrna processing (dundr et al. ) . similar observations have also been made in herpesviruses whitehouse , ; leenadevi and dalziel ) . initial studies utilising the prototype g- herpesvirus, herpes virus saimiri (hvs), demonstrated that the hvs nucleolar trafficking orf protein induces nucleolar redistribution of the host cell human trex proteins, which are involved in mrna nuclear export (boyne and whitehouse ) . intriguingly, ablating orf nucleolar trafficking led to a failure of orf -mediated viral mrna nuclear export (boyne and whitehouse ) . the precise role of this nucleolar sequestration is yet to be determined, but possible effects on viral mrna/protein processing and viral ribonucleoprotein particle assembly are currently being investigated. this property may also be conserved in other orf homologues as recent analysis has shown that the orf protein from kaposi's sarcoma associated herpesvirus (kshv) also dynamically traffics through the nucleolus (boyne et al. b) . moreover, on the rapid disorganisation of the nucleolus a reduction is observed in virus mrna nuclear export (boyne and whitehouse ). the formation of an orf -mediated export competent ribonucleoprotein particle within the nucleolus may also have implications for the translation of viral mrnas. for example, it has recently been demonstrated that the cellular nucleo-cytoplasmic shuttle protein, pym, which is involved in translation enhancement, is redistributed to the nucleolus in the presence of the kshv orf protein (boyne et al. ). this interaction effectively enhances the translation of the predominantly intronless transcripts made by kshv, and draws parallels with potential translation enhancement of positive strand rna virus genomes through their interaction with the nucleolus (discussed later). a second area of virus replication where nucleolar proteins are sequestered involves the replication of the virus dna genome. for example, we (matthews) and others have observed that nucleolar antigens upstream binding factor (ubf) and nucleophosmin (b . ) are both sequestered into adenovirus dna replication centres where they promote viral dna replication (hindley et al. ; lawrence et al. ; okuwaki et al. ) . similarly, in hsv- infected cells, a number of nucleolar proteins including nucleolin and ubf are recruited into viral dna replication centres . these are specific sites where replication and encapsidation of the hsv- genome occurs. evidence suggests that sequestration of ubf is essential for viral dna replication as overexpression of tagged version of ubf acts in a dominant-negative manner inhibiting virus dna replication (stow et al. ). moreover, depletion of nucleolin results in reduced virus gene expression and infectious virion production (calle et al. ; sagou et al. ) . in addition to enhancing virus replication, nucleolar proteins are redistributed to alter cellular pathways during infection. for example, the nucleolar targeted hsv- us protein has been shown to interact with homeodomain-interacting protein kinase (hipk ), which plays a role in p -mediated cellular apoptosis and hypoxic response (calzado et al. ) and also participates in the regulation of the cell cycle (calzado et al. ). this interaction alters the sub-cellular localisation of hipk and protects against hipk -mediated cell cycle arrest (giraud et al. ) . in contrast, the cellular protein, protein interacting with the carboxyl terminus- (pict- ), can sequester the virally encoded apoptosis suppressor protein, ks-bcl- protein, from the mitochondria into the nucleolus to down-regulate its anti-apoptotic activity (kalt et al. ). this is a potential interesting interplay between two sub-cellular structures involved in the viral stress response (olson ), and maybe more common and widespread. for example, bacterial infection has been shown to disrupt the nucleolus through regulating mitochondrial dysfunction (dean et al. ). although many rna virus proteins have been shown to localise to the nucleolus, most attention has focused on viral capsid proteins. these are proteins that associate with the viral genome for encapsidation and assembly of new virus particles. these proteins may also modulate replication (and transcription, where appropriate) of the viral genome. increasingly, capsid proteins have also been shown to have a number of roles in modulating host cell signalling pathways and functions. these capsid proteins are referred to as capsid, nucleoproteins or nucleocapsid proteins, depending on the virus. in many cases, they are phosphorylated (chen et al. ) , which can modulate activity (spencer et al. ) . many examples of these proteins have been shown to localise to the nucleolus both when over-expressed and also in infected cells. these include proteins from positive-strand animal and plant rna viruses, including the coronavirus nucleocapsid protein (chen et al. ; hiscox et al. ; wurm et al. ) , the arterivirus nucleocapsid protein (rowland et al. ) , the alphavirus capsid protein (jakob ) and non-structural protein nsp (rikkonen et al. (rikkonen et al. , and the umbravirus orf protein (ryabov et al. ). capsid proteins from negative-strand rna viruses also localise to the nucleolus. these have strain dependent localisation of a number of different influenza virus proteins (emmott et al. c; han et al. ; melen et al. ; volmer et al. ) . for many years this has followed a phenomenological pattern and viral capsid and rna-binding proteins might simply localise to the nucleolus because they diffuse through the nuclear pore complex and associate with compartments in the nucleus that have high rna contents -the nucleolus in particular because it is transcriptionally active. in this case, sub-cellular localisation to the nucleolus would have no physiological consequence for the virus or the cell. however, rna virus replication is error prone and selection pressure might apply to such a fortuitous localisation (given the ~ , + nucleolar proteins and their diverse roles (ahmad et al. )), with the concomitant effect that the virus could select for changes that ultimately disrupt nucleolar function and/or recruit nucleolar proteins to aid virus replication. there is a potential correlation between the nucleolar localisation of a viral protein and the loss of an essential nucleolar function. the molecular mechanisms responsible for this effect are unknown, but the displacement and re-localisation of nucleolar proteins by viral proteins could increase or decrease the nucleolar, nuclear and/or cytoplasmic pool of these proteins. certainly, the accumulation of viral proteins in the nucleolus could potentially cause volume exclusion or crowding effects, which have been proposed to play a fundamental role in the formation of nuclear compartments including the nucleolus, and can be addressed by proteomic strategies. therefore, disruption of nucleolar architecture and function might be common in virus-infected cells if viral proteins target the nucleolus or a stage of the virus lifecycle disrupts nucleolar proteins. for example, poliovirus infection results in the selective redistribution of nucleolin from the nucleolus to the cytoplasm (waggoner and sarnow ) and inactivation of ubf, which shuts off rna polymerase i transcription in the host cell. the infection of cells with ibv has been shown to disrupt nucleolar architecture (dove et al. b ) and cause arrest of the cell cycle in the g /m phase and failure of cytokinesis (dove et al. a) . the ibv and arterivirus nucleocapsid proteins associate with nucleolin and fibrillarin, respectively. similarly, the hiv- rev protein has been shown to localise to the dfc and gc and over-expression of rev protein alters the nucleolar architecture and is associated with the accumulation of nucleophosmin (dundr et al. ) . many different virus proteins localise to the nucleolus (table . ). however, predicting viral (and cellular) nucleolar targeting signals has historically been problematic and only recently has bioinformatic software been developed to fascilitate this (scott et al. ) . nucleolar trafficking might be mediated by virtue of the fact that viral proteins that are trafficked to the nucleolus contain motifs that resemble host nucleolar targeting signals, that is, a form of molecular mimicry is used . the discovery of specific nucleolar trafficking signals in viral proteins has indicated a functional mechanism behind this observed localisation (lee et al. ; reed et al. ; . analysis of the different nucleolar trafficking signals identified in viral proteins using dynamic live-cell imaging has certainly demonstrated that different proteins can confer differential trafficking rates and localisation patterns (emmott et al. ) . this is very similar to cellular nucleolar proteins (lechertier et al. ). in some virus proteins, both nlss and nucleolar targeting signals act in concert to direct a protein to the nucleolus. the arterivirus porcine reproductive and respiratory syndrome virus (prrsv) nucleocapsid protein localises to the nucleolus and has been shown to contain two potential nlss, a pat and a downstream pat motif (rowland et al. . analysis revealed that a amino acid sequence incorporating the pat motif could direct the nucleocapsid protein to both the nucleus and nucleolus. the protein also contains a predicted nes, presumably to allow the protein to traffic back into the cytoplasm to contribute to viral function in this compartment. this is common with other similar related proteins. for example, in the avian coronavirus nucleocapsid protein an eight amino acid sequence is necessary and sufficient to target the protein to the nucleolus (reed et al. ) and contains an nes (reed et al. ). intriguingly, genetic analysis (lee et al. ) , dynamic livecell imaging (you et al. ) and use of trafficking inhibitors (tijms et al. ) paint a picture of the requirement of these positive sense rna virus capsid proteins localising to the nucleolus as soon as they are translated, prior to their involvement in virus replication or assembly. this may be related to subversion of host cell function, protein modification (e.g. phosphorylation) or recruitment of nucleolar proteins. viral proteins might also traffic to the nucleolus through association with cellular nucleolar proteins . for example, the hepatitis delta antigen has been shown to contain a nucleolar targeting signal that also corresponded to a site that promoted binding to nucleolin (lee et al. ) . mutating this region prevented nucleolin binding to the delta antigen and nucleolar trafficking. by implication, this relates nucleolin binding to nucleolar trafficking (lee et al. ) . certainly, interaction with nucleophosmin and hepatitis delta antigens can modulate viral replication (huang et al. ) and more recently combined proteomic-rnai screens have revealed many other nucleolar proteins that can be associated with this viral protein (cao et al. ). trafficking and accumulation of viral proteins to and from the nucleolus, similar to cellular proteins, may also be cell cycle related. for example, the coronavirus nucleocapsid protein localises preferentially to the nucleolus in the g phase of the cell cycle (cawood et al. ) , as does the human cytomegalovirus protein ul in the g phase (arcangeletti et al. ) . again these trafficking profiles may be related to the interaction with cellular nucleolar proteins (emmott and hiscox ). semliki forest virus non-structural protein nsp can localise to the nucleolus (peranen et al. ; rikkonen et al. rikkonen et al. , and disruption of this localisation through a single amino acid change results in a reduction in neurovirulence (fazakerley et al. ) . such in vitro data has also been backed up by persuasive in vivo data. mutation of the arterivirus nucleocapsid protein pat nls motif in the context of a full-length clone revealed that this sequence could have a key role in virus pathogenesis in vivo, as animals infected with mutant viruses had shorter viraemia than wild-type viruses (lee et al. ; pei et al. ) . interestingly, reversions occurred in the mutated nucleocapsid gene sequence and although the amino acid sequence of the pat motif was altered, its function was not; this new signal was defined as a pat motif (lee et al. ) . the clear implications of this groundbreaking work is that disruption of nucleolar trafficking of a viral protein proves functional relevance and illustrates the potential of exploiting this knowledge for the generation of growth attenuated recombinant vaccines (pei et al. ; reed et al. reed et al. , . similarly, point mutations in the japanese encephalitis virus (jev) core protein that abolished nuclear and nucleolar localisation resulted in recombinant viruses with impaired replication in mammalian cells, compared to wild type virus (mori et al. ; tsuda et al. ) . curiously, replication of recombinant viruses was not impaired in insect cells, illustrating this could potentially be related to differences in nucleolar architecture and proteomes between these cell types (thiry and lafontaine ) . the jev core protein has been shown to interact with nucleophosmin and is translocated from the nucleolus to the cytoplasm. flaviviruses in general (jev, dengue virus and west nile virus) appear to have a part-nuclear stage to the synthesis of viral rna and several components of the viral replicase together with newly synthesised rna have been found in the nucleus of infected cells (uchil et al. ) . one intriguing question that has yet to be elucidated is how such viral rna traffics from the nucleus to the cytoplasm. most cellular mrnas are spliced and it is part of the splicing process that signals nuclear export. certain dna viruses, such as herpesvirus saimiri, produce intron-less mrna and these viruses have evolved specific viral proteins (such as herpesvirus saimiri orf (boyne et al. a) ), which interact with the cellular mrna export machinery (e.g. the mrna processing and export factor aly) to traffic viral mrna from the nucleus to the cytoplasm (boyne et al. b (boyne et al. , boyne and whitehouse ) and a similar process might be required by rna viruses. for example, tomato bushy stunt virus (tbsv) redistributes aly from the nucleus to the cytoplasm, and this might be a way the virus mediates host cell protein synthesis (uhrig et al. ). in plants rna silencing, a host defence mechanism targets virus rnas for degradation in a sequence-specific manner and viruses use several mechanisms to counteract this system (canto et al. ) . tbsv encodes a protein, p , which interferes with this pathway. however, aly might transport p from the cytoplasm to the nucleus or nucleolus and disrupt its silencing suppression activity. nucleolin has also been shown to be involved in the trafficking of herpes simplex virus type nucleocapsids from the nucleus to the cytoplasm (sagou et al. ) , drawing parallels with the involvement of nucleolar proteins in the movement of plant viruses (kim et al. a, b) . different plant virus proteins involved in long-distance phloem-associated movement of virus particles or with roles in binding to the rna virus genomes localise to the nucleolus and other sub-nuclear structures (kim et al. b; ryabov et al. ). this may be mediated by association with nuclear proteins, as is the case with fibrillarin and the orf protein of plant umbraviruses (kim et al. a) . hijacking the nucleolus is not exclusive to plant viruses and may also occur with mammalian viruses. similar to the plant rhabdovirus maize fine streak virus (mfsv), whose nucleocapsid and phosphoproteins localise to the nucleolus (tsai et al. ) , the animal negative-stranded rna virus borna disease virus has been reported to use the nucleolus as a site for genome replication, and its rna-binding protein has the appropriate trafficking signals for import to and export from the cytoplasm to the nucleus (pyper et al. ). the hepatitis delta virus genome also has differential synthesis in the nucleus with rna being transcribed in the nucleolus (huang et al. ) ; this is similar to the potato spindle tuber viroid where rnas of opposite polarity are sequestered in different nuclear compartments, with the positive-sense rna being transported to the nucleolus. again localisation to different sub-nuclear strcutures may have different roles in the virus life cycle (li et al. ). an intriguing recent discovery has been made showing that adenoassociated virus (aav) encodes an additional protein called assembly-activating protein (aap) that localises to the nucleolus and promotes assembly of the viral capsid (sonntag et al. ) . as a result of their limited genomes and coding capacities, recruitment of cellular proteins with defined functions in rna metabolism would be a logical step to facilitate rna virus infection. as nucleolar proteins have many crucial functions in cellular rna biosynthesis, processing and translation, it comes as no surprise that nucleolar proteins are incorporated into the replication and/or translation complexes formed by rna viruses. given that some nucleolar proteins have many different functions, the same nucleolar protein might be used by a virus for different aspects of the replication pathway. studies suggest that the human rhinovirus c protease ( cpro) pre-cursors, cd' and/or cd, localise in the nucleoli of infected cells early in infection and inhibit cellular rna transcription via proteolytic mechanisms (amineva et al. ). this general property is not restricted to human rhinovirus and in terms of the inhibition of cellular translation has also been described for encephalomyocarditis virus (aminev et al. a, b) , again suggesting roles in translational regulation. given the many roles of the nucleolus in the life cycle of the cell, including as stress sensor (boulon et al. ; mayer and grummt ) , it would seem reasonable that comprehensive unbiased analysis of the nucleolar proteome would yield interesting data, particularly, with providing clues as to what cellular nucleolar functions may be altered by virus infection and what mechanisms the nucleolus may use to respond to this. how the nucleolar proteome changes in response to virus-infection has been investigated using stable isotope labelling with amino acids in cell culture (silac) coupled to lc-ms/ms and bioinformatics (fig. . ) . these studies, led by our laboratories, have analysed purified nucleoli and the nucleus, and have directly stemmed from the pioneering work of the lamond laboratory in analysing purified nucleoli using quantitative proteomics (andersen et al. (munday et al. ) . overall, our data indicates that only a small proportion of nucleolar proteins change in abundance in virus-infected cells, fig. . diagram of a "classic" silac experiment. this technology allows high-throughput quantitative proteomics and has been readily applied to the nucleolus, especially when coupled with dynamic live-cell imaging (andersen et al. ) . the ability to simultaneously compare up to three different conditions through selection of the appropriate isotope label has enabled the recent studies of how the nucleolar proteome changes in virus-infected cells (emmott et al. a; emmott et al. b; emmott et al. c; hiscox et al. ; lam et al. ) and these tend to be virus-specific. for example, in adenovirus infected cells just % of proteins identified show a twofold or greater change compared to almost a third of nucleolar antigens showing a greater than twofold change when cells are treated with actd which inhibits rrna synthesis (lam et al. ) . what is notable is that direct comparison between the adenovirus data set and the actd dataset shows no clear correlation lam et al. ) , further supporting the case that adenovirus induces effects on the nucleolus distinct from that of a generalised, non-specific shut down of nucleolar function. this fits well with a previous observation that adenovirus infection does not affect rrna synthesis even h post-infection (lawrence et al. ). these results were initially surprising given the number of different viral proteins that can localise to this structure and how they interact with nucleolar proteins. this suggests that the nucleolar proteome and architecture is resilient during early stages of infection but may become disrupted as more and more damage accumulates inside cells because of virus activity, as clearly evidenced in live-cell imaging experiments (bertrand and pearson ; dove et al. b; ). coupling quantitative proteomic analysis of the nucleolus and deep sequencing throughout infection in time-course experiments of lytic, latent, acute and persistent viruses would reveal valuable insights into the response of the nucleolus to virus infection. likewise, being able to move from studying cell culture-adapted laboratory strains into clinical isolates replicating in primary cells would yield more biologically relevant information, particularly with regard to the severity of disease and nucleolar changes. these technologies could also be applied to large-scale analysis of viral proteins that traffic to the nucleolus and the cellular nucleolar proteins that they associate with (e.g. using silac and egfp-traps (trinkle-mulcahy et al. )), thus generating and integrating interactome networks with the nucleolar proteome during infection. nopdb: nucleolar proteome database- update encephalomyocarditis viral protein a localizes to nucleoli and inhibits cap-dependent mrna translation encephalomyocarditis virus (emcv) proteins a and bcd localize to nuclei and inhibit cellular mrna transcription but not rrna transcription rhinovirus c protease precursors cd and cd' localize to the nuclei of infected cells nucleolar proteome dynamics human cytomegalovirus proteins pp and iep are targeted to distinct compartments in nuclei and nuclear matrices of infected human embryo fibroblasts cell-cycle-dependent localization of human cytomegalovirus ul phosphoprotein in the nucleolus and modulation of viral gene expression in human embryo fibroblasts in vitro np protein of human endogenous retrovirus k interacts with ligand of numb protein x functional role of px open reading frame ii of human t-lymphotropic virus type in maintenance of viral loads in vivo a temporal study of the expression of the capsid, cytoplasmic inclusion and nuclear inclusion proteins of tobacco etch potyvirus in infected plants visualization of the interaction between the precursors of vpg, the viral protein linked to the genome of turnip mosaic virus, and the translation eukaryotic initiation factor iso e in planta the conserved n-terminal domain of herpes simplex virus ul protein is sufficient to induce the spatial redistribution of nucleolin m r and m r are two virulence factors for myxoma virus pathogenesis in the european rabbit the nucleolus under stress nucleolar trafficking is essential for nuclear export of intronless herpesvirus mrna nucleolar disruption impairs kaposi's sarcoma-associated herpesvirus orf -mediated nuclear export of intronless viral mrnas herpesvirus saimiri orf : a post-transcriptional regulatory protein recruitment of the complete htrex complex is required for kaposi's sarcoma-associated herpesvirus intronless mrna nuclear export and virus replication kaposi's sarcoma-associated herpesvirus orf protein interacts with pym to enhance translation of viral intronless mrnas nucleolin is required for an efficient herpes simplex virus type infection hipk : a versatile switchboard regulating the transcription machinery and cell death from top to bottom: the two faces of hipk for regulation of the hypoxic response nuclear localization of nucleocapsid-like particles and hcv core protein in hepatocytes of a chronically hcv-infected patient a single amino acid change in the nuclear localization sequence of the nsp protein affects the neurovirulence of semliki forest virus analysis of the subcellular localization of the proteins rep, rep' and cap of porcine circovirus type bovine immunodeficiency virus tat gene: cloning of two distinct cdnas and identification, characterization, and immunolocalization of the tat gene products human t-cell leukemia virus type i p nuclear/nucleolar retention is mediated through interactions with rna and a constituent of the s ribosomal subunit us of herpes simplex virus type interacts with hipk and antagonizes hipk -induced cell growth arrest nuclear and nucleolar localization of an african swine fever virus protein, i l, that is similar to the herpes simplex virus-encoded virulence factor icp . the nucleoprotein and the viral rna of infectious salmon anemia virus (isav) are localized in the nucleolus of infected cells the bovine immunodeficiency virus rev protein: identification of a novel lentiviral bipartite nuclear localization signal harboring an atypical spacer sequence cucumber mosaic virus b protein subcellular targets and interactions: their significance to rna silencing suppressor activity involvement of the nucleolus in replication of human viruses identification of nucleolus localization signal of betanodavirus ggnnv protein alpha characterization of the nuclear and nucleolar localization signals of bovine herpesvirus- infected cell protein new regulatory mechanisms for the intracellular localization and trafficking of influenza a virus ns protein revealed by comparative analysis of a/pr/ / and a/sydney/ / imaging of viroids in nuclei from tomato leaf tissue by in situ hybridization and confocal laser scanning microscopy distinctions between bovine herpesvirus and herpes simplex virus type vp tegument protein subcellular associations nucleolar localization of potato leafroll virus capsid proteins relationship between adenovirus dna replication proteins and nucleolar proteins b . and b . direct interaction between nucleolin and hepatitis c virus ns b brief review: the nucleolus -a gateway to viral infection? the interaction of animal cytoplasmic rna viruses with the nucleus to facilitate replication rna viruses: hijacking the dynamic nucleolus the coronavirus infectious bronchitis virus nucleoprotein localizes to the nucleolus nucleolar proteomics and viral infection the hbz-sp isoform of human t-cell leukemia virus type i represses junb activity by sequestration into nuclear bodies nucleolar localization of mouse mammary tumor virus proteins in t-cell lymphomas identification and characterization of the ul gene product of herpes simplex virus type the nucleolar phosphoprotein b interacts with hepatitis delta antigens and modulates the hepatitis delta virus rna replication expression and processing of a small nucleolar rna from the epstein-barr virus genome a novel, mouse mammary tumor virus encoded protein with rev-like properties nucleolar accumulation of semliki forest virus nucleocapsid c protein: influence of metabolic status, cytoskeleton and receptors gltscr /pict- , a putative tumor suppressor gene product, induces the nucleolar targeting of the kaposi's sarcoma-associated herpesvirus ks-bcl- protein interaction of a plant virus-encoded protein with the major nucleolar protein fibrillarin is required for systemic virus infection cajal bodies and the nucleolus are required for a plant virus systemic infection electron microscopy of ribonucleic acid in nuclear particulate aggregates of hepatitis d using nuclease-gold complexes functional similarity of hiv-i rev and htlv-i rex proteins: identification of a new nucleolar-targeting signal in rev protein nucleo-cytoplasmic redistribution of the htlv-i rex protein: alterations by coexpression of the htlv-i p x protein proteomics analysis of the nucleolus in adenovirus-infected cells immunocytology shows the presence of tobacco etch virus p protein in nuclear inclusions nucleolar protein upstream binding factor is sequestered into adenovirus dna replication centres during infection without affecting rna polymerase i location or ablating rrna synthesis a b -interacting sequence as a tool to visualize protein interactions in a cellular context the nucleolin binding activity of hepatitis delta antigen is associated with nucleolus targeting adenovirus core protein vii contains distinct sequences that mediate targeting to the nucleus and nucleolus, and colocalization with human chromosomes precursor of human adenovirus core polypeptide mu targets the nucleolus and modulates the expression of e proteins mutations within the nuclear localization signal of the porcine reproductive and respiratory syndrome virus nucleocapsid protein attenuate virus replication the alcelaphine herpesvirus- orf encodes a nuclear shuttling protein functional interaction and colocalization of the herpes simplex virus major regulatory protein icp with eap, a nucleolar-ribosomal protein rna-templated replication of hepatitis delta virus: genomic and antigenomic rnas associate with different nuclear bodies capsid protein of cucumber mosaic virus accumulates in the nuclei and at the periphery of the nucleoli in infected cells nucleolar and nuclear localization properties of a herpesvirus bzip oncoprotein, meq the major tegument structural protein vp targets areas of dispersed nucleolin and marginalized chromatin during productive herpes simplex virus infection identification of a rev-related protein by analysis of spliced transcripts of the human endogenous retroviruses htdv/ herv-k properties of the adenovirus iva gene product, an effector of late-phase-dependent activation of the major late promoter involvement of ul in herpes-simplex-virus- -induced dispersal of nucleolin relocalization of upstream binding factor to viral replication compartments is ul independent and follows the onset of herpes simplex virus dna synthesis involvement of the ul protein in herpes simplex virus -induced dispersal of b and in nuclear egress the products of gene us of herpes simplex virus type are dna-binding and localize to the nucleoli of infected cells stable expression of hepatitis delta virus antigen in a eukaryotic cell line adenovirus core protein v is delivered by the invading virus to the nucleus of the infected cell and later in infection is associated with nucleoli cellular stress and nucleolar function identification of nuclear and nucleolar localization signals in the herpes simplex virus regulatory protein icp nuclear and nucleolar targeting of influenza a virus ns protein: striking differences between different virus subtypes karyophilic properties of semliki forest virus nucleocapsid protein ribozyme-mediated inhibition of hiv suggests nucleolar trafficking of hiv- rna localization and importance of the adenovirus e orf protein during lytic infection the lactate dehydrogenase-elevating virus capsid protein is a nuclear-cytoplasmic protein the protein icp of herpes simplex virus type is targeted to nucleoli of infected cells nuclear localization of japanese encephalitis virus core protein enhances viral replication quantitative proteomic analysis of a cells infected with human respiratory syncytial virus functional domain structure of human t-cell leukemia virus type rex nucleolar localization of human hepatitis b virus capsid protein nucleolar targeting signal of human t-cell leukemia virus type i rex-encoded protein is essential for cytoplasmic accumulation of unspliced viral mrna identification of nucleophosmin/b , an acidic nucleolar protein, as a stimulatory factor for in vitro replication of adenovirus dna complexed with viral basic core proteins induction of apoptosis by viruses: what role does the nucleolus play? inhibition of human immunodeficiency virus type and type tat function by transdominant tat protein localized to both the nucleus and cytoplasm nuclear entry and nucleolar localization of the newcastle disease virus (ndv) matrix protein occur early in infection and do not require other ndv proteins functional mapping of the porcine reproductive and respiratory syndrome virus capsid protein nuclear localization signal and its pathogenic association nuclear localization of semliki forest virus-specific nonstructural protein nsp the nucleolus is the site of borna disease virus rna transcription and replication the complex subcellular distribution of satellite panicum mosaic virus capsid protein reflects its multifunctional role during infection control of nuclear and nucleolar localization of nuclear inclusion protein a of picorna-like potato virus a in nicotiana species proapoptotic effect of hepatitis c virus core protein in transiently transfected cells is enhanced by nuclear localization and is dependent on pkr activation delineation and modelling of a nucleolar retention signal in the coronavirus nucleocapsid protein characterization of the nuclear export signal in the coronavirus infectious bronchitis virus nucleocapsid protein nuclear and nucleolar targeting signals of semliki forest virus nonstructural protein nsp nuclear targeting of semliki forest virus nsp intracellular transport of the murine leukemia virus during acute infection of nih t cells: nuclear import of nucleocapsid protein and integrase functional analysis of proteins involved in movement of the monopartite begomovirus, tomato yellow leaf curl virus genetic therapies against hiv nucleolar-cytoplasmic shuttling of prrsv nucleocapsid protein: a simple case of molecular mimicry or the complex regulation by nuclear import, nucleolar localization and nuclear export signal sequences the localisation of porcine reproductive and respiratory syndrome virus nucleocapsid protein to the nucleolus of infected cells and identification of a potential nucleolar localization signal sequence peptide domains involved in the localization of the porcine reproductive and respiratory syndrome virus nucleocapsid protein to the nucleolus structural and functional characterization of human immunodeficiency virus tat protein human endogenous retrovirus herv-k(hml- ) encodes a stable signal peptide with biological properties distinct from rec intracellular location of two groundnut rosette umbravirus proteins delivered by pvx and tmv vectors identification of a nuclear localization signal and nuclear export signal of the umbraviral long-distance rna movement protein nucleolin is required for efficient nuclear egress of herpes simplex virus type nucleocapsids genome-wide screen of three herpesviruses for protein subcellular localization and alteration of pml nuclear bodies identification of the caprine arthritis encephalitis virus rev protein and its cis-acting rev-responsive element the rev protein of visna virus is localized to the nucleus of infected cells pnac: a protein nucleolar association classifier characterization of signals that dictate nuclear/nucleolar and cytoplasmic shuttling of the capsid protein of tomato leaf curl java virus associated with dna beta satellite sequence requirements for nucleolar localization of human t cell leukemia virus type i px protein, which regulates viral rna processing a viral assembly factor promotes aav capsid formation in the nucleolus role of phosphorylation clusters in the biology of the coronavirus infectious bronchitis virus nucleocapsid protein upstream-binding factor is sequestered into herpes simplex virus type replication compartments nucleolin associates with the human cytomegalovirus dna polymerase accessory subunit ul and is necessary for efficient viral replication reversible nucleolar translocation of epstein-barr virus-encoded ebna- and hsp proteins after exposure to heat shock or cell density congestion birth of a nucleolus: the evolution of nucleolar compartments nuclear localization of non-structural protein and nucleocapsid protein of equine arteritis virus identification of a new human adenovirus protein encoded by a novel late l-strand transcription unit identifying specific protein interaction partners using quantitative mass spectrometry and bead proteomes complete genome sequence and in planta subcellular localization of maize fine streak virus proteins nucleolar protein b interacts with japanese encephalitis virus core protein and participates in viral replication nuclear localization of flavivirus rna synthesis in infected cells relocalization of nuclear aly proteins to the cytoplasm by the tomato bushy stunt virus p pathogenicity protein use of a u snorna-containing ribozyme library to identify ribozyme targets in hiv- avian reovirus sigmaa localizes to the nucleolus and enters the nucleus by a nonclassical energy-and carrier-independent pathway nucleolar localization of influenza a ns : striking differences between mammalian and avian cells viral ribonucleoprotein complex formation and nucleolarcytoplasmic relocalization of nucleolin in poliovirus-infected cells interactions of minute virus of mice and adenovirus with host nucleoli intracellular localization and determination of a nuclear localization signal of the core protein of dengue virus proteins c and ns b of the flavivirus kunjin translocate independently into the nucleus subcellular compartmentalization of adeno-associated virus type assembly the n-terminal domain of pmtv tgb movement protein is required for nucleolar localization, microtubule association, and long-distance movement localisation to the nucleolus is a common feature of coronavirus nucleoproteins and the protein may disrupt host cell division molecular anatomy of subcellular localization of hsv- tegument protein us in living cells nucleolar localization of the ul protein of herpes simplex virus type colocalization and interaction of the porcine arterivirus nucleocapsid protein with the small nucleolar rna-associated protein fibrillarin subcellular localization of the severe acute respiratory syndrome coronavirus nucleocapsid protein a model for the dynamic nuclear/ nucleolar/cytoplasmic trafficking of the porcine reproductive and respiratory syndrome virus (prrsv) nucleocapsid protein based on live cell imaging g /g arrest and apoptosis induced by sars-cov b protein in transfected cells intracellular localization of the ul protein of herpes simplex virus type acknowledgements dam and jah would like to acknowledge their co-workers and collaborators over the years for developing viral interactions with the nucleolus. dam's research on the nucleolus is supported by the wellcome trust and jah's by the bbsrc and a leverhulme trust research fellowship. ee is supported by a bbsrc astbury dtg studentship. key: cord- -zq kqf h authors: shen, hsin-hui; lithgow, trevor; martin, lisandra l. title: reconstitution of membrane proteins into model membranes: seeking better ways to retain protein activities date: - - journal: int j mol sci doi: . /ijms sha: doc_id: cord_uid: zq kqf h the function of any given biological membrane is determined largely by the specific set of integral membrane proteins embedded in it, and the peripheral membrane proteins attached to the membrane surface. the activity of these proteins, in turn, can be modulated by the phospholipid composition of the membrane. the reconstitution of membrane proteins into a model membrane allows investigation of individual features and activities of a given cell membrane component. however, the activity of membrane proteins is often difficult to sustain following reconstitution, since the composition of the model phospholipid bilayer differs from that of the native cell membrane. this review will discuss the reconstitution of membrane protein activities in four different types of model membrane—monolayers, supported lipid bilayers, liposomes and nanodiscs, comparing their advantages in membrane protein reconstitution. variation in the surrounding model environments for these four different types of membrane layer can affect the three-dimensional structure of reconstituted proteins and may possibly lead to loss of the proteins activity. we also discuss examples where the same membrane proteins have been successfully reconstituted into two or more model membrane systems with comparison of the observed activity in each system. understanding of the behavioral changes for proteins in model membrane systems after membrane reconstitution is often a prerequisite to protein research. it is essential to find better solutions for retaining membrane protein activities for measurement and characterization in vitro. the cell membrane separates intracellular components from the outside environment and is constituted by various phospholipids, cholesterol, glycolipids and proteins. integral membrane proteins have at least one polypeptide segment spanning the membrane bilayer whereas peripheral membrane proteins are temporarily attached to the lipid bilayer or to integral membrane proteins by various interactions such as hydrophobic, electrostatic and other types of non-covalent interactions. membrane proteins work as a selective filter to regulate molecules entering cells and also serve in communicating with the surrounding environment. thus, membrane proteins play an essential role in the physiological functions needed for cell survival. the functional activities of membrane proteins are modulated by the structure of the surrounding lipids molecules in the membrane [ , ] ; thus the composition of the lipid bilayer can affect the inter-or intra-molecular interactions between the lipid bilayer and membrane proteins [ ] . investigating membrane proteins in vivo is difficult because the membrane proteins are associated with a complex mixture of other proteins, and are prone to aggregation in solution [ ] . it is still a major challenge at this stage to extract information needed in vivo to address specific questions in the function of the cell membrane. to simplify cell membrane systems, model membranes such as monolayers, bilayers, liposomes and nanodiscs have been developed, enabling detailed investigation of membrane protein structure in lipid membranes. model membrane environments more closely resemble the natural lipid bilayer than alternatives such as detergents. however, many features of phospholipid structure need to be considered and optimized in the creation of a suitable model membrane. for example, the hydrophobicity of the lipid chain defined by the lengths of the fatty acid chains, is an important parameter for retaining protein activity. other factors affecting the reconstituted membrane protein activity are the chemical properties of the lipid head groups which control membrane hydrophilicity. both parameters are crucial in stabilizing membrane protein structure. there are a number of approaches used to create a model membrane in order to mimic properties of the native cell membrane, and we will review these various approaches for reconstituting membrane proteins into different types of model membrane-monolayers [ ] , supported planar lipid bilayer [ ] and liposomes [ ] as shown in figure a -c. we will also discuss the emerging technology of nanodiscs [ ] ( figure d ). nanodiscs are a new class of model membrane, with attractive properties that address shortcomings of other approaches in the study of membrane proteins. the first section gives a brief summary of each method and a comparison of their strengths and weaknesses. in the following section, we describe four case studies and will compare the protein activity changes when the membrane proteins are reconstituted into different model membranes. in these case studies, we demonstrate how protein activities are modulated by the lipid environment and discuss how this environment helps to retain protein activities. and black in b represent water and a substrate respectively. nanodiscs contain membrane scaffold proteins, shown in green. one of the most common approaches to study the membrane protein structure and activity uses a langmuir monolayer at the air-water interface. this method has been extensively used for more than a century [ , ] . reconstitution of membrane proteins helps obtain further information on their organization and structure in the langmuir membrane [ , ] . it is a simple method to create a phospholipid monolayer at an air-water interface. basically, a desired amount of lipid or lipid mixtures are dissolved in organic solvents such as chloroform or chloroform/ethanol mixtures, followed by spreading the lipid/solvent mixtures on the water surface. by evaporating out the solvent, the phospholipid molecules self-assemble vertically as a monolayer film at the air-water interface, with their hydrophilic head groups immersed in the water and their hydrophobic tail pointed to the air as shown in figure a [ ] . a major advantage of using the langmuir monolayer system is that parameters such as thickness, surface pressure, molecular area and subphase thickness can be well controlled [ ] . more advanced characterization techniques, such as Π-a isotherm uv-vis adsorption, x-ray reflectivity, ellipsometry and rheology, have been developed to gain detailed information on the binding of proteins onto the phospholipid monolayer and to monitor enzyme activities when binding to the monolayer [ ] . however, a limitation of langmuir monolayers is that the lack of a layer comparing to the natural cell structure (bilayer) and the high surface tension of water that can cause protein denaturation. despite this limitation, there are several successful studies using this approach. two types of membrane proteins in monolayer model membrane system will be briefly described below: rhodopsin [ , ] , bacteriorhodopsin [ , ] and the aliphatic peptide gramicidin [ , ] have been successfully reconstituted and studied in monolayers at the air-water interface. in order to obtain information on the secondary structure and orientation, the protein layer can be investigated in situ at the air-water interface by either polarization modulation infrared reflection absorption spectroscopy (pm-irras) or x-ray reflectivity in combination with surface pressure-area isotherms [ ] . the study of gramicidin is an example of such an approach, and while gramicidin is unfolded at high molecular area (low pressure), it is refolded upon compression and retains its precise structure and orientation. likewise, for both rhodopsin and bacteriorhodopsin, the secondary structures measured in monolayers are indistinguishable from that in native membranes when appropriate conditions are used. while some experiments have suggested that spreading of rhodopsin in certain conditions (> m/n) leads to denaturation [ ] , bacteriorhodopsin, in contrast, is very stable in most testing conditions (compression and temperature change). the different properties of the protein are probably due to the ability of baceriorhodopsin to form a stable two-dimensional crystalline structure at the air-water interface [ ] . phospholipid monolayers are simple model membrane systems that are perfectly suited to study the binding of peripheral proteins onto a membrane surface. peripheral membrane proteins spontaneously bind onto phospholipid monolayers at the air-water interface by injecting themselves into the subphase underneath the lipid monolayer. in most cases, useful information can be obtained by measuring the binding of peripheral proteins onto the monolayer. for example, the kinetics and dynamics of adsorption of myristoylated and nonmyristoylated recoverin onto phospholipid monolayers have been investigated using surface pressure isotherm described in figure [ ] . the curve can be fitted with stretched exponential which can convert into the rate of adsorption of myristoylated and nonmyristoylated which is . s − and . s − , respectively. this indicates that the adsorption of myristoylated recoverin is six times faster than nonmyristoylated recoverin. reconstituting enzymes into the langmuir monolayers at the air-water interface has been found to be a very useful approach to understand the hydrolysis of membrane phospholipids. for example, the interfacial recognition and adsorption of phospholipases a (pla ) and phospholipases c (plc) to the phospholipid membrane interface are poorly understood. by using this approach, it appears that both pla and plc are active at the monolayer model membrane, indicating that the kinetics of phospholipid hydrolysis at the air-water interface can be monitored by biophysical characterization techniques in situ such as pm-irras and infrared reflection adsorption spectroscopy [ ] . moreover, it has been found that in the presence of calcium, phospholipid hydrolysis by pla resulted in the production of calcium-palmitate complexes. this suggests that calcium is necessary for pla secretion. the formation of a supported lipid bilayer on a solid substrate was reported by tamm and mcconnell in as a new model membrane system to study the physical properties of biological membranes and their constituent lipid and protein molecules [ , ] . supported planar lipid bilayers are prepared by several methods [ , ] . vesicle fusion is the simplest method for supported bilayer formation and the fusion mechanism on a hydrophilic support is well understood [ , ] . essentially, the bilayer is prepared by the fusion of small unilamellar vesicles on solid supports such as sio , glass and modified gold surface by van der waals, electrostatic, hydration and steric forces. the supported lipid bilayer has polar hydrophilic headgroups facing the aqueous surroundings and two hydrophobic tails that face the interior of the membrane which more closely resembles biological membranes than the langmuir monolayer. the supported lipid bilayer can confer many key functions to biological membranes. however, one side of the hydrophilic head group is still tightly attached to the solid support and this may, in some cases, affect the fluidity of the model membrane. this matters, since integral membrane proteins may not diffuse in the plane of the membrane. furthermore the orientation of membrane proteins cannot be controlled in the supported planer lipid bilayer. to alleviate some of these problems, a new tethered polymer-supported planar lipid bilayer system was developed to investigate the reconstitution of integral membrane proteins in a laterally mobile form into the supported lipid bilayer [ ] . wagner and tamm [ ] have successfully designed a supported lipid bilayer on a polyethyleneglycol cushion shown in figure . the polymer cushion minimizes the interactions of the proteins with the substrate and the polymer. it also provides a soft support and, for increased stability, covalent linkage of the membranes to the supporting quartz or glass substrates. in low polyethyleneglycol concentration regimes, the bilayers were assembled with high lateral lipid diffusion coefficients ( . - . × − cm /s). cytochrome b and annexin v were used to test the polyethyleneglycol cushion system. two populations of laterally mobile proteins were observed in the polyethyleneglycol cushion-supported bilayers. approximately a quarter of cytochrome b diffused with a diffusion coefficient of . - . × − cm /s, and more than half of the cytochrome b diffused with a diffusion coefficient of ~ × − cm /s. similar results were found in the annexin v system. annexin v diffused with two populations with diffusion coefficients of × − cm /s and × − cm /s. the new polymer-supported lipid bilayer system has increased the mobile fraction and retained the full lateral mobility of both cytochromes b and annexin when integrated or bound to the supported lipid. although polymer cushions allow for successful integration of small membrane proteins into bilayers, further challenges stem from studies with large transmembrane proteins. polymer cushions cannot provide large transmembrane proteins with good solvent accessibility, or enough space for the motion; required for the activity. while several types of polymer cushions have been developed, including polymethyl methacrylate diblock polymer cushions [ ] , poly(ethylene imine) [ , ] cushions and poly(ethylene glycol) tethered lipopolymers [ ] , these cushions are mostly limited to a thickness of up to nm. a recent development of a maleic anhydride copolymer thin film has film thickness up to nm [ ] . the hydrophilic polymer-cushioned supported lipid bilayers provide a higher mobility and homogeneous distribution of the incorporated beta-amyloid precursor protein cleaving enzyme (bace) on the bilayer surface, and enhances the enzymatic activity of bace (increased from % to %). even so, the activity of the incorporated bace remains significantly lower ( %) than that of the native enzyme ( %). another important classic category of membrane proteins are the transporters of ions and small molecules. studies of how ion channels regulate the transport of substrates [ ] are important for fundamental biology. however, it is challenging to incorporate ion channels in supported lipid bilayers due to leakage or instability issues. detailed studies of ion channel conduction or gating require considerable period of time (possibly > h), and it is difficult to set up a stable and electrically quiet environment for the ion channel in planar lipid bilayer. a better alternative has proven to be reconstitution of ion channels into proteoliposomes. lipid vesicles, also known as liposomes, consist of a self-closed lipid bilayer. they have been widely used for more than years to reconstitute the membrane proteins in unilamellar phospholipid vesicles. liposomes are relatively easy to construct by procedures such as extrusion method or ultrasonication, with reverse-phase evaporation. furthermore, giant vesicles of unilamellar or multilamellar nature can be "micro-manipulated" under an optical microscope. reconstitution of membrane proteins in liposomes usually requires detergents wherein purified membrane proteins are solubilized in detergent, then mixed with the desired phospholipid vesicles forming an isotropic solution of mixed phospholipid-protein-detergent micelles. the detergent can then be removed slowly by dialysis, gel filtration or biobead adsorption. when the detergent concentration reaches a critical level, the protein will spontaneously associate with the phospholipid membrane to form biologically active liposomes, called proteoliposomes. however, it has been a hard feat to control the final orientation of protein in the proteoliposomes [ ] , as well as the amount of protein inserted due to the limited area available. in many cases, disorientation of the protein causes aggregation. despite these difficulties, there have been many successful cases of membrane proteins reconstituted in the proteoliposomes, and we describe two examples below. several integral membrane proteins have been successfully reconstituted into proteoliposomes such as rhodopsin [ ] , g proteins [ , ] , proapoptotic bcl- proteins and t-bid [ ] , phosphocholine cytidylyltransferase (ct) [ ] and p protein kinase c (pkc) [ ] . however, these studies also found that the resulting protein activities are sensitive to the membrane curvature of the liposomes. this indicates that different phospholipids can cause considerable curvature stress changes in the liposomes [ ] . specifically, the curvature stress has been suggested to modulate the free energy and folding of the integral membrane proteins [ ] . sometimes the activity of different enzymes is modulated by the same driving force of the membrane curvature, but there may also be variation of activity through different mechanisms. for example, the activities of both ct [ ] and pkc [ ] are enhanced by increasing the negative curvature strain of the membrane. the activity of ct appears to be directly coupled with the membrane curvature, in contrast, the activity of pkc does not have a direct relationship with the curvature strain and enzymatic activity [ ] . the activity of pkc is instead modulated by nonlamellar-forming lipids via a less direct mechanism. liposomes have been commonly used for reconstituting different types of transporters to allow for the free diffusion of solution or catalysis of obligatory co-transporters. a large number of functional membrane proteins have been successfully reconstituted into liposomes but only a few examples will be discussed here. the reconstitution of colicin ia and e in either soybean phospholipids or e. coli phospholipids show that there is channel formation in the liposomes but there are unspecific channels allowing passage of ions, such as rubidium, sodium, chlorine, potassium or phosphate but not of sugars [ , ] . an example of the reconstitution of selective transport comes from the d-glucose transporter, purified from human erythrocytes and extracted from detergents followed by incorporation into proteoliposomes. with incorporation of the d-glucose transporter, the proteoliposomes become permeable to d-glucose but not to l-glucose. the transport was inhibited by cytochalasin b which is a potent inhibitor of d-glucose transporter [ , ] . several types of atp-dependent ion transporters such as ca + /mg + -atpase, na + /k + -atpase, and h + /k + -atpase have been reconstituted into proteoliposomes [ ] . upon addition of atp, ions are observed to be transported inwards and can form a complex. the single-channel property of channels incorporated into proteoliposomes can be investigated using the well-known patch-clamp method [ ] . channel activity is monitored following excision of the patch from the proteoliposomes. ion-channel reconstitution makes possible the investigation of the influence of membrane lipid composition on channel function. the kinetic investigation of these channels under physiological conditions has been discussed elsewhere [ ] . another up-to-date method is using organic solvent or oil mixed with water that creates water-in-oil (w/o) microdroplets coated by phospholipid. the hydrophilic head group immerses in the water and the hydrophobic tail locates in the oil/organic solvent phase. the application of the water-in-oil system could cover a wide range of applications from monolayer, planer lipid bilayer and liposomes. funakoshi et al. [ ] and maglia et al. [ ] used a planer lipid bilayer formed by two microdroplets driven to come in contact to reconstitute ion channels in the bilayer. this method is extremely simple and reproducible. recently, the water-in-oil microdroplets are extended to form liposomes by using droplet-transfer method invented by yoshikawa [ ] . by using this approach, it is possible to modulate the lipid compositions of outer and inner leaflets and furthermore to orient a reconstituted membrane protein in liposomes [ ] . nanodiscs offer a solution to some of the challenges described in the previous sections. the first attempt to reconstitute membrane proteins in the phospholipid bilayers using nanodisc technology was initiated by sligar's group a decade ago [ ] . the nanodisc is a self-assembly of phospholipids and a membrane scaffold protein derived from human serum apolipoprotein a . the detergent, cholate, can be used to solubilize phospholipids and membrane scaffold proteins into a micelle mixture. following detergent removal with dialysis or bio-beads adsorbent, a nanodisc self-assembles. the phospholipid associates as a bilayer domain while the membrane scaffold protein wraps around the edges of the discoidal structure in a belt-like configuration ( figure d ). it is possible to modify the diameter of the bilayer disc by genetically engineering the apolipoprotein a by changing the number of amphipathic helices. by this approach, the diameter of nanodiscs can be made anywhere from . to nm, and therefore accommodate a range of membrane proteins. the ratio of phospholipid: membrane scaffold protein is precisely defined which helps engineer the different size of membrane proteins in the nanodiscs. detailed formation of different types of nanodiscs has been described elsewhere [ , ] . the great advantage of using nanodiscs is keeping the membrane proteins in aqueous solution, in native-like phospholipid bilayer environment that is soluble, stable, monodisperse and detergent-free. most important, it isolates proteins or complexes as individual particles in monomeric or oligomeric states for analysis by techniques that range from activity assays to electron microscopy. since , there have been more than membrane proteins reconstituted into nanodiscs [ ] , ranging from signaling receptors to transport machines. we will discuss the applications separately below. nanodiscs have been used to analyze the influence of binding substrate on monodisperse receptors which are isolated from the cell-surface membrane. those receptors include g protein-coupled receptors (gpcr) [ , ] , cholera toxin receptor ganglioside g m , bacterial chemoreceptor [ ] and epidermal growth factor receptor. introduced into nanodiscs, the receptors stay in monodispersed, controllable, predefined oligomeric states in which it is possible to characterize the oligomeric status. for example, two different gpcr proteins, the beta-adrenergic receptor (β ar) and rhodopsin [ ] have been extensively studied using nanodiscs. β ar was one of the first receptors assembled into nanodiscs which was found to be functionally active ( % of starting activity recovered) and shown coupling to its g-protein. rhodopsin is a light-activated gpcr present in the photoreceptor cells of the retina and transducin is an important g-protein naturally expressed in retina rods and cones. assembly of functional rhodopsin into nanodiscs was found to activate transducin with high efficiency and to isolate the high affinity of transducin-metharhodopsin ii complex. this provides strong evidence that the monomeric state of rhodopsin can activate and interact with the transducin. a dimeric rhodopsin nanodisc was separated for monomeric forms using sucrose density gradients. even with two rhodopsins in the nanodiscs, interaction with a single transducin molecule was observed and found to activate the transductin with high efficiency [ ] . numerous membrane associated enzymes have been incorporated into nanodiscs. cytochrome p (cyp) enzymes have been extensively studied, including cyp b [ ] , cyp b [ ] , cyp a [ ] and cyp [ , ] . this system has provided a means for studying the extensive collection of membrane bound cytochromes p with the same biochemical and biophysical tools that have been previously limited to use with the soluble p s. for example, the cytochrome p a (cyp a ) is a membrane-bound protein which is a human hepatic drug-metabolizing enzyme. most studies of the ligand binding by cyp a are carried out in the presence of detergents below their critical micelle concentrations [ , ] but are compared by the propensity of cyp a to aggregate. even in studies attempting to use liposomes, cyp a is unlikely to exist in its native state because the detergent concentrations are much higher than the phospholipid concentrations. as a result, the understanding of the structure and composition of cyp a in the lipid phase was limited and the membrane effect on cyp a ligand binding behavior is unclear. nanodiscs have been utilized to study cyp a which displays monophasic reduction kinetics. with a high lipid-protein ratio, cyp a is captured as a monomer. however, at lower lipid ratio, cy a self-associates and heterogeneous behaviors are induced. the nanodiscs prohibit self-association in this case as there is only one cyp a per nanodisc and show significant improvement in homogeneity and stability. this opens up new possibilities for detailed analysis of equilibrium and steady-state kinetic characteristics of catalytic mechanisms of cyp enzymes [ ] . the sec translocon is a membrane-embedded protein assembly that drives protein translocation into or across membranes. the core translocon is formed from a trimeric arrangement of secy, sece and secg [ ] . the secyeg promoter has transmembrane helices sitting in the phospholipid membrane. the oligomerization of secyeg has been proposed to be necessary to proper function. researchers were successful in reconstituting sec into membrane vesicles in and have had great success in characterizing several partial reactions of secyeg functions [ ] . reconstituting a single secyeg into a nanodisc with different types of lipids [ ] suggests that the acidic lipids can stabilize the secyeg channel in the nanodisc bilayer and trigger dissociation of the seca dimer. a model has been proposed by alami et al. [ ] , suggesting that the dissociation of the seca dimer provoked by the secyeg complex is followed by activation of the seca atpase. furthermore, dalal et al. [ ] , using the nanodisc technology, have also shown that only the secy dimer together with acidic lipids supports the activation of the seca translocation atpase. recently, a high resolution single-particle cryo-em structure of single secyeg complexes in nanodiscs, bound to translating ribosomes was first solved at subnanometer resolution [ ] . it allows the secyeg complex to be investigated in a natural lipid bilayer environment and identifies the ribosome-lipid interactions. wu et al. [ ] also used surface plasmon resonance to investigate the competitive binding of ribosomes and seca. the data suggest that both ribosomes and seca can interact simultaneously with secyeg complex during membrane protein insertion, but seca competes with ribosome when it binds to the secyeg complex. in the previous section, we have shown that membrane proteins can be assembled into four different types of model membrane and the activities of some of the membrane proteins can be retained, allowing their physicochemical properties to be studied. but is there a model membrane system that is the best for membrane-protein reconstitution? the reconstitution of the same membrane protein into different model membranes has been compared and, in this section, we list four membrane proteins with varying activities in different model membranes. ganglioside g m is a naturally occurring native receptor that binds to cholera toxin via hydrogen bonds [ ] . it is an excellent receptor for studying lipid-receptor interaction. several different approaches to reconstituting the glycolipid receptor g m in model membranes have enabled the measurement of binding of its interaction partner cholera toxin. in liposomes and supported lipid bilayer systems, the ganglioside g m is free to diffuse across long distances and exhibits a non-uniform lateral distribution, i.e., self-aggregation, even at low incorporation ratios. therefore the binding activity of ganglioside g m with cholera toxin b is restricted [ ] . investigations of ganglioside g m incorporated into nanodiscs found reduced protein aggregation. bricarello et al. [ ] found that the reconstitution of a low concentration of ganglioside g m in nanodiscs, shows binding of cholera toxin with a significantly higher affinity than in liposomes or supported lipid bilayers. this is due to the interaction of ganglioside g m with the headgroup region of the disc which reduces the oligomerization, thereby causing a potential effect on the affinity of toxin binding. thus, nanodisc technology restricts the ganglioside g m oligomerization by controlling the number of ganglioside g m monomer isolated by each nanodisc. borch et al. [ ] have also used sensor chip-based surface plasmon resonance (spr) technology to measure the detailed kinetic binding of the interaction between soluble molecules and membrane receptors inserted in the bilayer of nanodiscs. the corresponding spr sensorgrams are displayed in figure . overall, the change of the sensorgram indicates that the spr sensorchip is immobilized with histidine-modified nanodisc or the cholera toxin b bound to the nanodiscs. the sensorgrams in both figure a ,b shows the binding of nanodiscs ( ru) on the antibody immobilization surface on the sensor chip. by injecting the cholera toxin b over two flow cells presented in figure a and b, the spr sensorgrams can detect the interaction of the cholera toxin b with nanodisc with or without the existence of g m . it has been revealed that the captured % g m -nanodiscs bound ru of the cholera toxin b without binding to the capturing nanodiscs without g m ( figure b ). the measured kinetic values of the interaction are in agreement with those reported by previous studies on the interaction of the cholera toxin with the g m receptor embedded in different membrane systems. this, therefore, serves as a proof of concept that nanodiscs can be employed in kinetic spr studies. the nucleus envelope is composed of two bilayers (the outer nuclear membrane and inner nuclear membrane) and contains abundant ion channels, through which ions and small molecules diffuse between the cytoplasm, nucleoplasm and perinuclear (i.e., intermembrane) space. the nuclear ionic channels represent a ubiquitous structure in the nuclei in a wide range of cells, although little is known about its functional properties. to characterize nuclear ionic channels, guihard et al. [ ] attempted to reconstitute nuclear envelope vesicles derived from the canine liver nuclei into a planar lipid bilayer and giant proteoliposomes. they found that the success rate of nuclear envelope fusion into planar lipid bilayers was extremely low although cardiac nuclear ionic channels were successfully incorporated into planar lipid bilayers. the detection of the nuclear ionic channels activity was not possible. such a low efficiency can be explained by the clustering of nuclear envelope vesicles, and the low density of single vesicles, as well as the presence of residual chromatin and/or nuclear proteins (histones or lamins) which would prevent fusion events with the bilayer. another approach is reconstituting nuclear envelope vesicles into giant proteolipsosmes and detecting the single ion channel by the patch-clamp technique [ ] . large conductance, voltage-gated, k + and cl − selective nuclear ionic channels are characterized and plotted as a current-voltage relationship presented in figure a ,b respectively. it has been found that under asymmetrical / mm kcl conditions, the zero current potential for unitary currents is at mv ( figure a ). calculated from the goldman-hodgkin-katz (ghk) flux equation, a p k +/p cl − ratio is . . this value indicates the k + selectivity for this channel. in figure b , the cl − selective nuclear ionic channel yields a positive zero current potential of + . mv, with a p cl −/p k + ratio of , indicative of a high cl − selectivity over k + . this suggests super fusion of the channel under asymmetrical ( / mm) kcl conditions. the current-voltage relationship curves indicate that the nuclear ion channels can be functionally characterized by incorporating the proteins into the giant proteoliposomes where it is possible to retain their channel activity. furthermore, the measured activities are consistent with those described for native nuclear ion channels. p-glycoprotein, the most extensively studied atp-binding cassette transporter, has been implicated in the phenomenon of multidrug-resistance in tumor cells and has been suggested to play a significant role in drug absorption and deposition. how p-glycoprotein interacts with its substrates is still unknown. functional studies are limited because of the difficulty of obtaining large quantities of stable p-glycoprotein. besides that, no atpase activity of p-glycoprotein solubilized in detergent could be detected. when p-glycoprotein is reconstituted into proteolipsomes, it has detectable atpase activity; however, the whole complex is very unstable. heikal et al. [ ] have further found that p-glycoprotein reconstituted in the proteoliposomes has a half-life of less than one day. in , ritchie et al. [ ] performed a detailed study of drug-stimulated atpase kinase activity of p-glycoprotein using the nanodisc technology. the p-glycoprotein protein was reconstituted into both msp e d disc and liposomes in order to compare its atpase kinase activities. the results described in figure demonstrate that p-glycoprotein is functionally active when reconstituted into the nanodiscs (close squares). comparing to the atpase kinase activity of p-glycoprotein reconstitution in lipsosomes (close circles), there is a twofold increase in the maximum atpase activity in the nanodiscs. this could be due to the uniform orientations of p-glycoprotein in the nanodiscs while there are two possible orientations in liposomes. these data not only show that p-glycoprotein is functionally active when reconstituted into the nanodiscs, but that it also exhibits higher specific activity than the current standard reconstitution system. figure . comparsions of the atpase activity of p-glycoprotein in nanodiscs (square) and proteoliposomes (circle). open symbols: basal activity in the absence of drug; filled-in symbols: activity in the presence of nicardipine [ ] . atp-binding cassette transporters utilize the energy of atp hydrolysis to transport a wide range of substrates across cellular membranes and for non-transport-related processes such as translation of rna and dna repair [ ] . a member of the atp-binding cassette super family, the maltose transporter malfgk from e. coli, together with the substrate-binding protein male, is one of the best-characterized atp-binding cassette binding cassette transporters suitable for various reconstitution techniques. bao and fuong have reported the reconstitution of the maltose transporter in nanodiscs, in detergent and in proteoliposomes. the atpase activity of the malfgk complex in various environments is shown in figure . the data presented in the first column of figure show that the basal atpase activity for assembly in the nanodiscs and detergent (~ nmol/min/mg) is -fold higher than in proteoliposomes because of the decrease in the activation energy barrier of the transporter [ ] in detergent micelles and nanodiscs. however, in the presence of male, the rate of atp hydrolysis increases in all assembly conditions. this is because male captures maltose and delivers the sugar to the transporter. note that the basal atpase activity assembly in the nanodiscs dramatically increases from to nmol/min/mg. the maltose alone has no effect on the basal atpase activity in the nanodiscs and detergent. however, in nanodisc and detergent, an inhibition of the atpase activity was observed in the presence of both maltose and male in the nanodiscs. this is because that maltose reduces the binding affinity of the male-malfgk complex, which therefore has reduced the atpase activity. in proteoliposomes, the atpase activity (~ nmol/min/mg) shows a further -fold increase in the presence of both maltose and mele in the figure. the author used another type of male mutant which binds maltose with higher affinity. this male mutant, in contrast, shows a reduction of the atpase activity in proteoliposomes which has the same effect as the nanodiscs and detergents. overall, proteoliposomes have shown a low basal atpase activity because the lipid stabilized the transporter. however, the nanodiscs have been shown to be a better medium than proteoliposomes for studying the atp hydrolysis ability of atp-binding cassette transporters. this review summarizes and compares the most up-to-date methods for reconstituting membrane proteins into model membranes. there is no superior method for reconstituting membrane proteins in the model membrane; instead two or more model membranes should be considered, depending on the particular needs of the system and the proteins of interest. in general, systems based on lipid bilayers supported on a solid substrate are still the most favored and well-developed of the methods to study membrane proteins in the bilayer. this approach allows detailed study of the fundamental properties of biological membranes and is practical to reproduce the bilayer system. on the other hand, the proteoliposome is more suitable for ion channel reconstitution in the bilayer, as well as for combination with the patch-clamp method to detect the ionic selectivity of the channel. finally the self-assembled nanodiscs system provides a robust and common means for rendering these targets soluble in aqueous media while providing a native-like bilayer environment that maintains functional activities. nanodisc technology offers another way to prepare monodisperse samples of membrane proteins in the bilayer environment, and it is emerging as the favored approach in studies concerning membrane protein complexes. biochemical and functional characterization of the membrane association and membrane permeabilizing activity of the severe acute respiratory syndrome coronavirus envelope protein lipid-protein interactions in human erythrocyte-membrane acetylcholinesterase. modulation of enzyme activity by lipids correlation between the effect of the anti-neoplastic ether lipid -o-octadecyl- -o-methyl-glycero- -phosphocholine on the membrane and the activity of protein kinase calpha weak dependence of mobility of membrane protein aggregates on aggregate size supports a viscous model of retardation of diffusion structure and phase transitions in langmuir monolayers allogeneic stimulation of cytotoxic t cells by supported planar membranes effect of colicins ia and e on ion permeability of liposomes direct solubilization of heterologously expressed membrane proteins by incorporation into nanoscale lipid bilayers surface enhanced raman scattering of a lipid langmuir monolayer at the air-water interface lipid monolayers: why use half a membrane to characterize protein-membrane interactions? langmuir monolayer of artificial pulmonary surfactant mixtures with an amphiphilic peptide at the air/water interface: comparison of new preparations with surfactant polymyxin b-lipid interactions in langmuir-blodgett monolayers of escherichia coli lipids: a thermodynamic and atomic force microscopy study langmuir balance investigation of superoxide dismutase interactions with mixed-lipid monolayers modern physicochemical research on langmuir monolayers structure of rhodopsin in monolayers at the air-water interface: a pm-irras and x-ray reflectivity study formation, structure, and spectrophotometry of air-water interface films containing rhodopsin proton transport by bacteriorhodopsin in planar membranes assembled from air-water interface films structural and spectroscopic characteristics of bacteriorhodopsin in air-water interface films spectroscopic and structural properties of valine gramicidin a in monolayers at the air-water interface effects of gramicidin-a on the adsorption of phospholipids to the air-water interface organization, structure and activity of proteins in monolayers monitoring of phospholipid monolayer hydrolysis by phospholipase a by use of polarization-modulated fourier transform infrared spectroscopy supported planar membranes in studies of cell-cell recognition in the immune system supported phospholipid bilayers formation of high-resistance supported lipid bilayer on the surface of a silicon substrate with microelectrodes supported lipid bilayer self-spreading on a nanostructured silicon surface simulations of temperature dependence of the formation of a supported lipid bilayer via vesicle adsorption simulations of lipid vesicle rupture induced by an adjacent supported lipid bilayer patch membrane lateral mobility obstructed by polymer-tethered lipids studied at the single molecule level tethered polymer-supported planar lipid bilayers for reconstitution of integral membrane proteins: silane-polyethyleneglycol-lipid as a cushion and covalent linker reversible activation of diblock copolymer monolayers at the interface by ph modulation, : lateral chain density and conformation polymer-cushioned bilayers. i. a structural study of various preparation methods using neutron reflectometry polymer-cushioned bilayers. ii. an investigation of interaction forces and fusion using the surface forces apparatus controlled enhancement of transmembrane enzyme activity in polymer cushioned supported bilayer membranes orientation and reactivity of nadh kinase in proteoliposomes modulation of rhodopsin function by properties of the membrane bilayer role of lipid polymorphism in g protein-membrane interactions: nonlamellar-prone phospholipids and peripheral protein binding to membranes influence of the membrane lipid structure on signal processing via g protein-coupled receptors the apoptotic protein tbid promotes leakage by altering membrane curvature modulation of ctp:phosphocholine cytidylyltransferase by membrane curvature elastic stress the role of membrane biophysical properties in the regulation of protein kinase c activity membrane lipid polymorphism: relationship to bilayer properties and protein function elastic coupling of integral membrane protein stability to lipid bilayer forces reconstitution of colicin e into dimyristoylphosphatidylcholine membrane vesicles the permeability of bilayer lipid membranes on the incorporation of erythrocyte membrane extracts and the identification of the monosaccharide transport proteins binding of cytochalasin b to human erythrocyte glucose transporter conformational dynamics of na+/k+-and h+/k+-atpase probed by voltage clamp fluorometry the extracellular patch clamp: a method for resolving currents through individual open channels in biological membranes lipid bilayer formation by contacting monolayers in a microfluidic device for membrane protein analysis droplet networks with incorporated protein diodes show collective properties cell-sized liposomes and droplets: real-world modeling of living cells oriented reconstitution of a membrane protein in a giant unilamellar vesicle: experimental verification with the potassium channel kcsa phospholipid phase transitions in homogeneous nanometer scale bilayer discs membrane protein assembly into nanodiscs functional reconstitution of beta -adrenergic receptors utilizing self-assembling nanodisc technology transducin activation by nanoscale lipid bilayers containing one and two rhodopsins using nanodiscs to create water-soluble transmembrane chemoreceptors inserted in lipid bilayers atomic-force microscopy: rhodopsin dimers in native disc membranes single-molecule height measurements on microsomal cytochrome p in nanometer-scale phospholipid bilayer disks co-incorporation of heterologously expressed arabidopsis cytochrome p and p reductase into soluble nanoscale lipid bilayers the critical iron-oxygen intermediate in human aromatase the ferrous-oxy complex of human aromatase kinetics of dithionite-dependent reduction of cytochrome p a : heterogeneity of the enzyme caused by its oligomerization ligand binding to cytochrome p a in phospholipid bilayer nanodiscs the effect of model membranes the atpase activity of seca is regulated by acidic phospholipids, secy, and the leader and mature domains of precursor proteins nanodiscs unravel the interaction between the secyeg channel and its cytosolic partner seca two copies of the secy channel and acidic lipids are necessary to activate the seca translocation atpase cryo-em structure of the ribosome-secye complex in the membrane environment competitive binding of the seca atpase and ribosomes to the secyeg translocon crystal structure of cholera toxin b-pentamer bound to receptor gm pentasaccharide self-aggregation-an intrinsic property of g(m ) in lipid bilayers ganglioside embedded in reconstituted lipoprotein binds cholera toxin with elevated affinity nanodiscs for immobilization of lipid bilayers and membrane receptors: kinetic analysis of cholera toxin binding to a glycolipid receptor patch-clamp study of liver nuclear ionic channels reconstituted into giant proteoliposomes the stabilisation of purified, reconstituted p-glycoprotein by freeze drying with disaccharides chapter -reconstitution of membrane proteins in phospholipid bilayer nanodiscs abc transporters, mechanisms and biology: an overview discovery of an auto-regulation mechanism for the maltose abc transporter malfgk the authors gratefully acknowledge financial support from the australian research council (arc). hhs is an arc super science fellow and tl is an arc federation fellow. tl and lm were awarded the arc super science fellowships and grant (fs ). we thank victoria hewitt for her critical reading of the manuscript. key: cord- -ke az nf authors: schlake, thomas; thess, andreas; thran, moritz; jordan, ingo title: mrna as novel technology for passive immunotherapy date: - - journal: cell mol life sci doi: . /s - - - sha: doc_id: cord_uid: ke az nf while active immunization elicits a lasting immune response by the body, passive immunotherapy transiently equips the body with exogenously generated immunological effectors in the form of either target-specific antibodies or lymphocytes functionalized with target-specific receptors. in either case, administration or expression of recombinant proteins plays a fundamental role. mrna prepared by in vitro transcription (ivt) is increasingly appreciated as a drug substance for delivery of recombinant proteins. with its biological role as transient carrier of genetic information translated into protein in the cytoplasm, therapeutic application of mrna combines several advantages. for example, compared to transfected dna, mrna harbors inherent safety features. it is not associated with the risk of inducing genomic changes and potential adverse effects are only temporary due to its transient nature. compared to the administration of recombinant proteins produced in bioreactors, mrna allows supplying proteins that are difficult to manufacture and offers extended pharmacokinetics for short-lived proteins. based on great progress in understanding and manipulating mrna properties, efficacy data in various models have now demonstrated that ivt mrna constitutes a potent and flexible platform technology. starting with an introduction into passive immunotherapy, this review summarizes the current status of ivt mrna technology and its application to such immunological interventions. our bodies are continuously exposed to molecules that may indicate disease or parasitic invasion. the immune system is responsible for the detection and clearance of such molecules and for control of the underlying causes. immediate discrimination between self and foreign upon first exposure is mediated by innate immunity. signals that induce innate immunity are called pathogen-associated and damageassociated molecular patterns (pamps and damps) [ , ] . the innate immune system is characterized by induction of generic defenses against a broad spectrum of infectious agents and is mainly aimed at clearance of tissue damage at the site of infection and interruption of further pathogen replication. responses against specific targets following prolonged or repeated exposures are processed by the adaptive immune system [ ] . important effectors of adaptive immunity are highly specialized cells: b cells secrete antibodies against soluble or cell-associated antigens [ ] . cytotoxic cd + t cells (ctls) recognize and kill infected or neoplastic cells [ ] . regulatory cd + t cells augment b cell maturation or inhibit auto-reactive immune cells [ ] . dendritic cells (dcs) process and present antigen for the transition from innate to adaptive immunity [ ] . some b and t cells progress towards persisting memory cells that react faster and with greater affinity to the foreign molecules upon re-exposure. compared to innate immunity, adaptive immunity usually requires weeks (as opposed to minutes) until an effective response against novel targets is achieved. advantages of the adaptive immune system include the ability to launch specific immune responses also against antigens of endogenous (not only microbial) origin that may be associated with degenerative or neoplastic disease. manipulation of the immune system is an important component of many prophylactic and therapeutic applications against infectious, degenerative and neoplastic diseases. the diverse repertoire of methods can be roughly divided into four approaches: active immunization (vaccination) prepares adaptive memory responses, usually prior to first exposure, and constitutes one of the most efficacious and cost-effective medical interventions. immunization with antigens generally only leads to the induction of antibody production, whereas for instance inoculation with attenuated viruses also elicits cytotoxic effector cells [ ] . in addition, active immunization can be accomplished by adoptive cell transfer. in a typical setting, dendritic cells are loaded with tumor antigens ex vivo and subsequently infused into patients to induce an adoptive immune response against cancer cells [ ] . in contrast, passive immunotherapy circumvents the initial steps required for immune responses to be launched and directs the immune system efficiently to the desired medical targets. for cellular approaches, ctls are equipped with recombinant receptors ex vivo. these cells are designed to attack neoplastic cells that express the cognate tumorassociated antigen immediately after infusion [ ] . passive immunization by administration of processed antibodies derived from human or animal donors is a well-established emergency procedure for treatment of snake-bite envenomation or post exposure prophylaxis against, for example, rabies [ ] . the advantage of passive immunization is that protective antibodies can be provided in a very short time. application of recombinant antibodies further expands the number of available targets and is increasingly important for augmentation of conventional therapies against cancer [ , ] . passive immunotherapies either require or can exploit modern nucleic acid-based methods, among which mrna is the latest technology. the present review is dedicated to provide an overview on mrna in passive immunotherapy after introducing the immunological approach as well as mrna technology as such. it is well known that protection raised by most of today's licensed vaccines is primarily antibody-dependent (fig. a ) [ , ] . basically, this explains the long and successful history of passive immunotherapy (fig. b) . the protective capacity of serum against bacterial toxins was discovered in the early s [ ] . the avoidance or control of infection by such passive immunization is based on the transfer of serum and later polyclonal immune globulin (= antibody) preparations from convalescent or vaccinated humans or animals [ , ] . prior to the discovery of antibiotics, serum was the only antidote for bacterial diseases [ ] . thus, following successful passive immunization against diphtheria toxin [ ] , a whole plethora of serum or immune globulin therapies for viral and bacterial diseases as well as to neutralize snake toxins was developed [ ] . clinical benefits of serum and immune globulin therapy were demonstrated for viral diseases such as influenza, measles, and polio and bacterial infections with meningococcus or pneumococcus [ ] [ ] [ ] . with the advent of antibiotics, the use of serum or polyclonal immune globulin preparations as antibacterial agent was largely discontinued in late . however, such therapies retained a niche as treatment for venoms, toxins, and certain viral infections. in the second half of the twentieth century, polyclonal antibody preparations were developed and in part licensed for the prophylaxis and treatment of hepatitis a and b, cytomegalovirus, varicella-zoster virus, vaccinia virus, rabies, respiratory syncytial virus, west nile virus, and various hemorrhagic fevers [ ] . for instance, passive immunization against the argentine hemorrhagic fever shows beneficial effects when applied within week after the emergence of symptoms, and post-exposure treatment with human or equine immunoglobulins are recommended for rabies [ ] [ ] [ ] . moreover, botulism is treated with equine antitoxin [ , ] . such immune globulin preparations of non-human origin are particularly prone to elicit an immune response that obstructs their therapeutic use or efficacy [ ] . further disadvantages or difficulties associated with the use of serum or polyclonal globulin preparations are the often high content of non-neutralizing antibodies, batch-to-batch variations, and in case of human sources the availability of appropriate immune donors [ , ] . in , groundbreaking work described the production of monoclonal antibodies (mab) by immortalization of b cells [ ] . the resulting hybridoma technology was then rapidly exploited for clinical use, for instance, to produce a mab to cd for preventing organ rejection [ ] . recombinant technologies further expanded the available therapies based on mabs. in vitro antibody selection technologies like phage or ribosome display were developed to enable the generation of highly specific human mabs out of libraries that may even be naïve for the specific antigens [ ] [ ] [ ] [ ] [ ] [ ] schematic illustration comparing active immunization and passive antibody immunotherapies. a during active immunization triggered by natural infection or vaccination, antigenic patterns are presented by antigen-presenting cells in the lymph nodes. this leads to t cell mediated activation of antigen-specific b cells. as a consequence, b lymphocytes differentiate into plasma cells which produce and secrete antigen-specific antibodies that bind to cognate structures, finally leading to their clearance. b instead of being produced by plasma b cells, antibodies can be manufactured recombinantly and administered for instance by subcutaneous injection for passive immunization. after injection, antibodies enter circulation by diffusion and act like endogenous antibodies. c for dna-based passive immunization, dna is often packaged in nanoparticles, e.g., virus capsids, which for instance can be injected intramuscularly. after uptake by muscle cells, dna is released into the cytosol. for transcription into mrna, the dna has to enter the cell nucleus first. mrna is then translated into antibodies which are secreted to bind their cognate targets a high throughput technique to amplify and clone antibody genes from single human b cells was described [ , ] . although recombinant mab technology is exploiting a large variety of different antibody formats (fig. ) , the prevalent type is still a full-size antibody of the igg class. in addition to the variable domains essential for antigen binding, they contain constant domains including the fc-region. the latter is important for antibody function and can mediate antibody-dependent cellular cytotoxicity (adcc), complement-dependent cytotoxicity (cdc), and antibodydependent cellular phagocytosis (adcp) [ ] . furthermore, binding to the neonatal fc receptor (fcrn) plays a role in controlling the antibody half-life which is - days for human igg [ ] [ ] [ ] . fcrn rescues bound antibody from degradation by transporting it back to the cell surface where it is released into the extracellular space [ , ] . specific mutations in the fc region that increased the affinity to fcrn have been shown to prolong antibody half-life up to fivefold [ ] . in addition to the impact of fcrn binding, half-life also benefits from the large size of iggs. it obstructs antibody clearance by the kidney as well as metabolization by cytochrome p [ , ] . the downside of the large size is the correspondingly low access to and penetration of tissue which can affect therapeutic efficacy [ ] . full-size antibodies are often posttranslationally glycosylated, modulating fc function. although aglycosylated iggs can be produced in bacteria [ ] , modern production processes usually take advantage of the cellular machinery for advanced posttranslational maturation and secretion of eukaryotic cells [ ] [ ] [ ] [ ] . the vast majority of approved therapeutic antibodies are currently produced in mammalian cells [ , ] . production processes have been optimized especially for the predominant production system, the continuous chinese hamster ovary (cho) cell line. cho cells secrete antibodies with negligible non-human glycoforms [ ] and are also amenable to glycoengineering [ ] . differences in glycoforms depend on the production system and affect distribution and stability of the antibody, fc effector function and immunogenicity in recipients [ ] . since traditional expression hosts such as e. coli do not allow efficient production of full-size antibodies, smaller proteins consisting of fragments derived from the variable domains were developed as promising alternatives. such single-chain variable fragments (scfv) and various derivatives thereof preserve antigen binding while facilitating manufacturing (fig. b, c) [ ] . another type of antibody fragment is derived from camelids or cartilaginous fish. these animals produce single-domain antibodies devoid of light chains (fig. e) [ , ] . since antigens are recognized by a heavychain-only v h domain (v h h) in camelids [ ] , the variable v h h fragment can be easily engineered into nanobodies that offer additional advantages such as improved heat and ph stability [ ] . moreover, they can also be assembled into v h h-based neutralizing agents (vnas) (fig. e) [ ] . various studies demonstrated that multivalent formats were more effective than monovalent single-domain antibodies [ , ] . notably, all formats based on antibody fragments can be relatively efficiently produced with less expensive bacterial expression systems, typically employing e. coli [ , ] . the antibody fragments produced in this system are often targeted to the oxidative environment of the periplasm using specific signal peptides to foster disulfide bond formation and proper folding [ , ] . moreover, enhanced expression of chaperones and cytoplasmic oxidases has been demonstrated to increase the yield of antibody fragments [ , ] . small antibody fragments were also the basis for developing the concept of bispecific antibodies more than years ago. initially, single chain antibodies with a different binding specificity were fused to the c-terminal ends of heavy chains of iggs [ ] . generation of first bispecific igg molecules benefited from the knob-into-hole technology [ ] . today, many different bispecific antibody formats combining two different antigen binding domains in one molecule are available ( fig. d ) [ ] [ ] [ ] [ ] . among them, bispecific diabodies (bi-(scfv) ) and bite antibodies are prominent examples [ , ] . in general, bispecific antibodies can be deployed to target therapeutic substances such as toxins, radionuclides, and drugs as well as effector cells like ctls to the site of cognate antigen expression [ ] . associated with their small size, many formats using antibody fragments are cleared by renal elimination [ , ] . moreover, in the absence of an fc region, recycling by the fcrn rescue mechanism cannot take place [ ] . as a consequence, these formats usually reveal short plasma half-lives [ ] . for instance, bi-(scfv) antibodies have a serum half-life of less than h which usually requires continuous infusion [ ] . in case of the bite blinatumomab, the antibody is usually administered daily due to its short halflife [ ] . possible strategies to extend serum half-lives are site-specific pegylation and fusion to an fc region [ , ] . however, the latter approach would negate various advantages of antibody fragments including their better and faster tissue penetration [ , ] . it has been shown that small single-domain antibodies could even cross the blood-brain barrier [ ] . in case of an anti-rabies antibody, this allowed partial rescue of mice challenged with virus injection into the brain in contrast to full-size immunoglobulins [ , ] . today, monoclonal antibodies play an important role in the therapeutic armamentarium. dozens of antibodies have been licensed to treat cancer, rheumatoid arthritis, multiple sclerosis, psoriasis, allergy, systemic lupus, and other diseases. in addition, mabs have shown promise in protecting against various microorganisms, viruses, and fungal infections as well as in treating neurodegenerative diseases [ ] [ ] [ ] [ ] . however, mabs for infectious disease indications are still scarce among licensed products. palivizumab for respiratory syncytial virus prophylaxis in high-risk infants was the first antiviral mab approved by the fda [ ] . since then, antibodies against anthrax and rabies in india became available. in addition, bezlotoxumab binding to clostridium difficile toxin is used to prevent recurrent bacterial infections [ ] . mabs for treating cancer are still the largest group of licensed products. here, rituximab, directed against the transmembrane protein cd on the surface of b lymphocytes, was the first mab in clinical use [ ] . however, as for many other mabs in antitumor therapy, high doses are needed to obtain clinical efficacy [ ] . a successful example for a bispecific antibody is the first-in-class bite against cd /cd , blinatumomab, which is approved for the treatment of acute lymphoblastic leukemia [ ] [ ] [ ] . in addition to the administration of immunoglobulins, passive immunity can also be conferred by transferring functionalized immune cells. adoptive transfer of ctls was shown to be a potent therapeutic means to treat both viral infections and cancers [ ] [ ] [ ] . to this end, t cells can be equipped with an additional t-cell receptor (tcr) or a chimeric antigen receptor (car) [ ] . while cars are limited to the binding of surface antigens, tcrs recognize mhc-presented peptides derived also from intracellular proteins. the first successful clinical trial with an engineered tcr on ctls demonstrating tumor regression was reported in [ ] . subsequently, passive immunotherapies with tcr-engineered t cells became an important approach for antitumor treatments [ ] . efficient targeting and killing of cancer cells expressing the respective antigen in patients with various forms of cancer including metastatic melanoma, synovial sarcoma, and colorectal carcinoma have been demonstrated [ , [ ] [ ] [ ] . a possible problem specific to the use of tcr-engineered t cells is the presence of an endogenous tcr. mispairing of endogenous and introduced α-and β-chains may create new specificities with potential reactivity to host molecules [ , ] . to avoid such mispairing, the use of γ/δ t cells has been suggested, since γ/δ-chains do not pair with α-or β-chains [ ] [ ] [ ] . alternatively, gene editing is now being explored to disable endogenous tcr expression [ , ] . the concept of cars was developed in [ ] and then refined using a scfv fragment to obtain antibody-like receptor specificity without the need to transfer multiple genes [ ] . first generation cars consisted of an antigenspecific scfv, fused to a transmembrane and intracellular cd ζ tcr signaling domain, conferring transient activation and cytotoxicity to t cells [ ] . upon target binding through the scfv domain, the engineered t cell is activated in an mhc-independent manner [ ] . subsequent generations of cars were improved with respect to cytotoxicity and persistence by including additional co-stimulatory domains such as cd , ox- or - bb [ , [ ] [ ] [ ] . t cells engineered with such cars targeting specific tumor antigens are remarkably successful in treating hematological malignancies like leukemia and lymphoma [ , ] . for instance, cd -directed car t cells repeatedly revealed complete and durable remissions in patients with b-cell acute lymphoblastic leukemia (b-all) [ ] [ ] [ ] . in contrast, car t cell therapies face various challenges for solid tumors [ ] . at present, the most common techniques for generating tcr-or car-engineered t cells utilize viral gene transduction with retro-or lentiviral vectors [ ] . however, permanent expression of the transgenic receptor mediated by this efficient technology can be disadvantageous in case of therapy-related severe toxicities due to accidental cross reactivity [ ] [ ] [ ] . on-target/off-tissue and off-target toxicities by engineered t lymphocytes attacking healthy host cells as well as cytokine release syndrome are feared side-effects which were repeatedly reported when virus vector transduced cells were applied [ ] [ ] [ ] [ ] . another concern of using retroviral vectors is the potential to induce insertional mutagenesis and genotoxicity in effector cells [ ] [ ] [ ] [ ] . hence, more precise cell manipulations are currently under investigation. among them, gene editing of primary human t cells was recently demonstrated to be an efficient approach [ ] . while t cell engineering for adoptive transfer inevitably requires the use of nucleic acids encoding a target-specific receptor, antibody immunotherapy can deploy recombinant proteins. however, as pointed out above, maintaining therapeutically effective levels may require frequent administrations dependent on clearance and indication [ ] . thus, dna-mediated antibody expression directly in the body may represent an attractive alternative to administration of recombinant proteins (fig. c ). both, plasmids as well as viral vectors have been used for passive immunization. although efficient in small animal models, strong expression of recombinant genes using unformulated dna does not scale well to larger animals (including primates) [ ] . consequently, application of recombinant adeno-associated viruses (aavs) is currently the preferred method for transduction of the antibody gene of interest [ ] . early work reached single digit µg/ml serum titers [ ] . later studies with advanced vector designs reported expression levels in the high µg/ml or even in the single digit mg/ml range [ ] [ ] [ ] . much work has been done in the field of hiv prophylaxis. here, a single intramuscular injection of recombinant aav was demonstrated to elicit peak antibody titers above µg/ml in mice [ ] . treated mice revealed substantial anti-hiv antibody levels for more than a year and were protected from hiv- challenge. although aav usually maintains an episomal state, this vector still harbors an inherent risk of insertional mutagenesis. in patients with hepatocellular carcinomas integration of aav into known cancer genes was observed [ ] . moreover, aav-based immunotherapy faces various issues regarding immunogenicity [ ] [ ] [ ] . a substantial percentage of the population has already been in contact with the used virus and consequently shows pre-existing immunity limiting the efficacy of treatment [ , ] . the induction of anti-viral responses during immunotherapy may have similar consequences if a single virus serotype is used in repeated treatments [ ] . pre-existing or induced immunity could lead to clearance of the viral vector and/or aavtransduced cells. finally, gene delivery by aav has been reported to induce immune responses against the encoded protein. even when using an endogenous gene such as erythropoietin (epo), some macaques receiving epo-encoding aav intramuscularly developed severe autoimmunity against the protein [ ] . when non-human primates were treated with aav vectors expressing antibody or antibody-like proteins, titers dropped rapidly in some animals [ ] . one reason is the possibility of inducing anti-idiotype antibodies which requires immune suppression to obtain sustained expression [ ] . further studies indicated that primates may be more prone to develop a robust t cell response to the aavencoded protein compared to mice [ ] . in summary, the risks of insertional mutagenesis and genotoxicity, long-lasting expression without control in case of adverse events, and various potential issues regarding immunogenicity associated with viral vectors that may limit efficacy emphasize the high demand for other vectors for passive immunotherapy than dna. how mrna offers a viable option to meet the demands is reviewed in the following sections. the cellular machinery uses mrna as a transient carrier of genetic information for the synthesis of proteins. based on this fundamental biological concept administering exogenous mrna represents an alternative to dna-mediated protein delivery in vitro and in vivo. using mrna instead of dna as therapeutic substance is attractive due to the absent risk of insertional mutagenesis. moreover, efficient expression is even obtained in non-dividing cells, since mrna does not require a nuclear phase for activity. compared to the delivery of proteins and peptides, mrna may prolong the availability of effector molecules, however, not as much as dna. in contrast to the latter, mrna therapy, therefore, has to cope with the short half-life in vivo of exogenously delivered mrna as for instance indicated by mrna-mediated vegf expression in myocardium returning to baseline within h [ ] . while this may be a therapeutic disadvantage in various instances, it can be considered advantageous from a safety perspective, particularly in case of adverse events. mrna was first employed for the expression of a protein of interest in the early s when rna preparations were microinjected into xenopus oocytes and synthesis of the encoded protein was demonstrated [ , ] . in , the group of inder verma presented a reliable method to efficiently introduce rna into a variety of cells using a cationic lipid [ ] . almost at the same time, mrna-mediated protein expression in vivo was demonstrated after direct injection into mouse muscle [ ] . much of the early work on a potential therapeutic use of mrna focused on the development of active vaccination approaches, in part since low amounts of antigen suffice due to the amplifying nature of the immune response. subcutaneous injection of liposomeencapsulated, antigen-encoding mrna was the first example of eliciting an antigen-specific ctl response in mice [ ] . gene gun delivery of mrna into mouse epidermis provided the first evidence of an antigen-specific antibody response [ ] . in addition, mrna turned out to be a potent means to load dendritic cells with antigens to convert them to tailored antigen-presenting cells in vitro and in vivo [ ] . later, a new vaccination protocol was introduced which elicited a complete adaptive immune response consisting of antigenspecific antibodies and t cells with lytic activity without requiring any transfection reagents, special equipment or heterologous boost [ ] . from a retrospective view, this event marked the starting point of the commercial development of mrna vaccines. in the minimal structure, mrna contains a protein-encoding open reading frame (orf) flanked at the ′-and ′-end by two elements essential for the function of eukaryotic mrna: the "cap", a -methyl-guanosine residue (m g) bound to the ′-end of the rna via a ′- ′ triphosphate bond, and, with the exception of histone mrnas, a poly(a) tail at the ′-end [ ] [ ] [ ] . synthetic mrna is transcribed in vitro from a plasmid dna template that contains at least a bacteriophage promoter, the orf, and a unique restriction site for linearization of the plasmid to ensure defined termination of transcription. typically, the template also contains a poly(d[a/t]) sequence transcribed into poly(a). alternatively, the poly(a) tail can be generated by enzymatic in vitro polyadenylation of transcribed rna. the cap may be incorporated into the transcript during transcription by including an m gpppg dinucleotide as a structural homolog of the endogenous cap structure in the transcription reaction. different options for how to derive ivt mrna are summarized in fig. . various elements of an mrna molecule contribute to the level and duration of expression of the encoded protein. the cap structure is required for efficient translation and stabilizes mrna towards exonucleolytic decay [ ] [ ] [ ] . various structures have been repeatedly used for ivt mrna (fig. ) . the basic m gpppg cap analog is incorporated in both orientations into the rna by the bacteriophage rna polymerase [ ] . however, reverse incorporation of the cap analog results in mrna molecules that lack the m methylation at the cap and are not recognized by the translational machinery [ ] . substitution of the hydroxyl group in c ′ or c ′ position of the m g with a methoxy group prevents reverse incorporation of the cap analog by inhibiting elongation at the m g. the dinucleotide is, therefore, called 'antireverse cap analog' or arca [ , ] . arca-capped mrna revealed increased as well as prolonged protein expression in cultured cells and enhanced reporter protein expression in mouse dendritic cells up to -fold [ , ] . for further optimization of the cap structure, modifications were introduced within the triphosphate linkage to inhibit decapping. substitution of a non-bridging oxygen in the β-phosphate moiety of an arca by sulfur results in the β-s-arca dinucleotide. while β-s-arca maintains recognition by the translational machinery, protein expression from mrna capped with β-s-arca was extended in hc cells and immature dcs, but not in mature dcs [ , ] . enzymatic capping represents an alternative to the cotranscriptional approach, avoiding a n -unmethylated cap as does arca. to this end, mrna is transcribed without cap analog and subsequently capped using the vaccinia virus capping complex [ ] . this complex with triphosphatase, guanylyltransferase and (guanine- )-methyltransferase activity adds a natural cap to the ′-triphosphate of an rna molecule. such a cap structure can be further converted into a cap group by o-methylation of the ribose of the cap-proximal nucleotide using the vaccinia virus ′o-methyltransferase [ ] . cap (and cap harboring a further o-methyl-ribose at the subsequent nucleotide) structures are typical of eukaryotic mrna and are recognized less by cytosolic rna sensors of the innate immune system, thereby rendering the mrna less immunostimulatory. for instance, rig-i is activated by cap but not cap mrna and ifit binds cap mrna more tightly than cap molecules [ , ] . such sensor-mediated immune stimulation may in turn negatively impact mrna translation [ ] . it is interesting to note that since recently a new cap analog, cleancap, is available which enables the cotranscriptional incorporation of a cap structure into ivt mrna. like the cap structure at the ′-end, details of the poly(a) tail at the ′-end influence translation and stability of mrna [ , ] . while numerous studies demonstrated a positive effect of a poly(a) tail and some correlation of effectiveness and length of the element, details of observations were remarkably variable. the majority of studies indicate an enhancement of translation when extending the poly(a) length from approximately - to - nucleotides or by tail extension using enzymatic polyadenylation [ , [ ] [ ] [ ] . however, effects were mostly moderate, but reached a maximum of a -fold increase in one particular setting. in contrast, another study reported an optimum of approximately nucleotides; protein expression declined with further increasing poly(a) length [ ] . in addition to poly(a) length, one report suggests a positive role of using a type iis restriction enzyme for dna template linearization to obtain a free poly(a) end rather than one extended with unrelated nucleotides [ ] . further elements that can affect mrna translation and/or half-life are untranslated region (utr) sequences flanking fig. schematic representation of different cap structures. a the typical ′ cap of eukaryotic mrnas. a guanosine is methylated at position and linked to the first nucleotide of the mrna by an unusual ′ to ′ triphosphate bridge. depending on the degree of methylation of the first two bases of the mrna, the full ′ terminal structure is referred to as cap , cap or cap . the cleancap™ analog, a trinucleotide introducing a cap structure during ivt, is indicated in blue. b a plain cap analog (orange) is incorporated in two orientations during ivt. c inverse orientation can be avoided using anti-reverse cap analogs (arcas, highlighted in orange). such analogs are characterized by the presence of a methoxy group at either c ′ or c ′ of m g. to improve resistance to decapping, a phosphorothioate was positioned in the ′- ′ bridge of arca (β-s-arca) the orf sequence. trans-acting regulatory rna-binding proteins (rbps) interact with distinct rna sequence elements, thereby affecting ribosome recruitment and transit [ ] . for instance, β globin ′-and ′-utrs, duplicating the β globin ′-utr, the ′-utr of tobacco etch virus, and a structure of the ′-utr of human heat shock protein all enhanced mrna translation in mammalian cells [ , , , ] . according to a very recent survey of a combinatorial utr library, ′-utr sequences appear to be most critical for protein expression [ ] . in contrast, ′-utrs seem to be the key driver for mrna half-life as exemplified by the stabilizing effects of the α globin ′-utr as well as a duplication of the β globin ′-utr [ , ] . codon usage of the orf sequence also affects translation efficacy in many species. although in humans codon usage bias does not correlate with trna levels and gene expression [ , ] , still increased protein expression from mrna upon codon usage optimization has been reported. for instance, codon usage adaptation of hiv- gag improved protein yield from mrna approximately . -fold in a human t lymphocyte cell line [ ] . a more pronounced increase in protein expression as a result of codon optimization was reported upon transfection of mrna encoding angiotensinconverting enzyme into a and hepg cells [ ] . however, alternatively to a direct effect of codon usage, the enhancement may be an indirect result of the use of modified nucleotides, the content of which was altered by codon optimization. furthermore, coding sequence engineering exploiting more advanced concepts like codon optimality may prove valuable in designing therapeutic mrnas of high efficacy. according to recent insights, codon usage may also affect fidelity of translation or the stability of transcripts [ ] . in addition, orf as well as utr sequences can have an effect on immunostimulation and thus on translational activity. initially, researchers applied mrna containing only the four unmodified bases a, u, c, and g [ ] [ ] [ ] ] . in this context, for instance, u-rich sequences, as well as several rna structural features were described as immunostimulatory due to interactions with various rna sensors such as toll-like receptors (tlr), rig-i, and protein kinase r (pkr) [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] . as a consequence, development of mrna therapies was hampered by the immunogenicity of in vitro transcribed mrna. the consideration of mrna for therapeutic purposes gained momentum by the finding that incorporation of modified bases into in vitro transcribed mrna reduced immunostimulation of such preparations. various modified bases found in natural rnas suppressed recognition by tlrs in vitro [ ] . among these, particularly pseudouridine increased translation and stability of mrna [ ] . in addition to the effect on tlr binding, replacement of uridine by pseudouridine affected binding to and activation of further sensors such as pkr and ′- ′-oligoadenylate synthetase, contributing to higher and longer protein expression [ , ] . however, the effects of mrna modifications on translation, immunostimulation and resulting protein expression appear to be variable, for instance dependent on the type of target cells. in vitro testing of various modifications and combinations thereof revealed decreased protein yield for any of them in a context dependent manner, while most of them reduced immunostimulation in raw . macrophages [ ] . as found in vitro, pseudouridine modification of mrnas reduced immunostimulation and increased level and duration of protein expression after intravenous (iv) or intraperitoneal administration of formulated mrna in mice [ , ] . in contrast, another study on systemic administration of nanoparticle-complexed mrna concluded that neither immunostimulation nor protein expression benefited from pseudouridine modification [ ] . after intradermal or intramuscular injection of formulated mrna, only n -methyl-pseudouridine, but not pseudouridine, substantially enhanced expression [ ] . in line, a different study applying lipid nanoparticle (lnp)-formulated mrna intradermally found that replacement of uridine with n -methylpseudouridine resulted in much increased and longer lasting protein expression [ ] . notably, recent studies revealed that also endogenous eukaryotic mrna harbors various modified nucleotides [ ] [ ] [ ] [ ] . however, the total level of modification is rather low which is in strong contrast to the usually % replacement of an unmodified nucleotide in ivt mrna. moreover, heavy nucleotide modification appears to interfere with the function of translation-enhancing rna elements such as utrs and internal ribosomal entry sites (ireses) [ ] . at the time when modified nucleotides were introduced to minimize immunostimulation of ivt mrna, the importance of stringent purification of such preparations was recognized. chromatographic purification, particularly hplc, can separate mrna according to size, thereby removing smaller or larger by-products such as abortive transcripts, mrna from traces of non-linearized dna template or double-stranded rna (fig. ) [ , ] . such purification enhanced protein yield, most probably by enriching functional transcripts and depleting contaminants causing detrimental immunostimulation [ , ] . the latter is corroborated by the finding that stringent purification reduced the beneficial effect of chemical modification or even made it dispensable, particularly in combination with specific sequence-engineering of the mrna [ , ] . importantly, the specifics of mrna formulation are another layer that can influence activation of the innate immune system by masking mrna from recognition by sensors, particularly tlrs. for example, immune responses after mrna administration into the central nervous system were effectively suppressed by the use of a nanomicelle formulation compared to naked mrna [ ] . after in vivo administration of mrna was proven to be feasible, the concept of using mrna as a basis for therapeutics was pursued almost immediately. the very first report on a therapeutic effect with exogenous mrna was already published in and described a temporary reversion of diabetes insipidus in rats by intrahypothalamic injection of vasopressin mrna [ ] . thereafter, it took almost two decades until further studies started to demonstrate the broad potential of mrna-based protein therapies. meanwhile, there is a plethora of publications on a huge variety of indications comprising anemia [ , ] , hemophilia [ , ] , myocardial infarction [ , ] , cancer [ , ] , lung disease such as surfactant b deficiency and asthma [ ] [ ] [ ] , metabolic disorders [ ] [ ] [ ] [ ] [ ] , fibrosis [ ] , skeletal degeneration [ ] , tendon impairment [ ] , and neurological disorders such as sensory nerve dysfunction, friedreich's ataxia and alzheimer's disease [ ] [ ] [ ] . whereas evidence for the therapeutic potential of mrna is mostly restricted to mouse models, first data in swine indicate that mrna-based protein therapies are feasible also in large animals [ , ] . in view of the various indications, it is hardly surprising that this diversity goes along with different routes of administration and various formulations. only very few studies looking at local administration used uncomplexed and thus unprotected mrna [ , , , ] . the majority of investigations built on lipid-based formulations with a clear tendency to the application of lnps [ , , [ ] [ ] [ ] ] . most if not all groups purified their ivt mrna before in vivo administration. while some simply precipitated the mrna [ , , ] , most used commercial purification kits. only a few researchers applied hplc purification [ , , ] . with respect to the mrna, the vast majority of studies used long poly(a) tails of at least nucleotides. likewise, there is a clear prevalence to chemically modified mrna, although various examples suggest that this is not mandatory [ , , , ] . while most mrnas harbored -methyl-cytosine and/or pseudouridine initially [ , , ] , there appears to be a trend towards the use of n -methyl-pseudouridine at present [ , ] . regarding the cap structure, almost all early studies cotranscriptionally generated cap mrnas using arca [ , , ] . however, since about years, research groups prefer to apply mrnas with a cap ′-end [ , , ] . initial attempts to apply mrna to passive immunotherapy focused on cellular approaches for various reasons. adoptive transfer of ctls equipped with either an additional tcr or a car had shown great promise in cancers and viral infections. in contrast to typical scenarios of antibody therapy, receptor expression usually requires much lower protein levels. furthermore, t cells are loaded with receptor-encoding nucleic acid (dna or mrna) ex vivo. hence, passive cellular immunotherapy does not require sophisticated and highly efficient formulations for in vivo delivery but can build on the armamentarium of cell transfection methods. previous work on active cellular vaccination with antigen-presenting cells that had been transfected with antigen-encoding mrna revealed electroporation as easy and efficient means to load cells [ ] . comparison to passive cell pulsing and lipofection demonstrated that electroporation was also most efficient for transfection of t lymphocytes [ ] . rna electroporation had up to % efficiency without eliciting any critical toxicity [ ] . onset of transgene expression was very rapid and lasted about days [ ] . receptor transfer into t cells by mrna electroporation has now been well established for many years [ , ] . moreover, gmpcompliant protocols for manufacturing receptor-expressing retention time t cell preparations via mrna electroporation are available today [ , ] . electroporation of human t lymphocytes with various antigen-specific tcrs redirected them to recognize cancer cells in an mhc-dependent manner in vitro [ ] . mrnamediated tcr expression conferred in vitro cytotoxicity to t cells for at least h [ ] . the lytic efficacy of such cells was comparable to retrovirally transduced lymphocytes [ , ] . likewise, transfection of car-encoding mrnas generated cells that were lytically active in vitro. using an optimized ivt mrna for a cd -specific car, surface expression and cytotoxic function were detectable for up to days [ ] . to avoid as many manipulation steps as possible in generating t cells for adoptive transfer, it was demonstrated that human peripheral blood lymphocytes instead of purified t lymphocytes could be used as well to elicit strong cytotoxicity in vitro upon electroporation of a car mrna [ ] . currently, most car approaches deploy αβ t cells. however, γδ t lymphocytes are an attractive target as well due to their antitumor effector function which is not mhc restricted and does not require co-stimulation. accordingly, mrna-mediated tcr and car expression in such cells was investigated very recently and shown to kill target cells in an antigen-specific manner [ ] . to the best of our knowledge, there is so far just one study that started to systematically analyze the role of different mrna elements for receptor expression in t cells. to this end, the group of carl june built on previous findings in dendritic cells which revealed the superiority of a duplicated β globin ′-utr over a single copy of the same element and of a long ( nt) over a short ( - nt) poly(a) tail [ ] . with respect to receptor expression, as enhanced expression compared to as [ ] . a tandem repeat of the β globin ′-utr had also a beneficial effect, particularly in combination with a long poly(a). in contrast, the vegf translational enhancer as ′-utr element had even detrimental consequences. the authors speculated that this may be due to reduced capping efficacy but did not provide data corroborating their hypothesis. however, they demonstrated the important role of the cap structure. co-transcriptional cap using arca and an enzymatically generated cap structure were equivalent and outperformed the basic cap analog as well as an enzymatic cap . besides expression level, capping also appeared to have an effect on the persistence of expression. modification of the orf sequence by removing all internal orfs had no effect on receptor production. t lymphocytes transfected with tcr-or car-encoding mrna proved to be functional also in vivo. robust antitumor effects were observed in various preclinical models [ , ] . they were mrna-specific, since mock-transfected t cells had no or very little unspecific impact. although mrna-mediated receptor expression is transient, a single injection of car t cells against cd was sufficient to prolong survival of mice [ ] . using peripheral blood lymphocytes instead of purified t cells for car mrna transfection (see above) also enabled a strong antitumor response in vivo although those cells could not persist long-term in vitro [ ] . very recently it was shown that mrna cannot only be used to drive receptor expression, but can support the generation of t cells for adoptive immunotherapy. by expressing a chimeric membrane protein against cd , cells could be efficiently stimulated and expanded in vitro [ ] . after transfection with an mrna for an anti-cd bite, those cells mediated sustained reduction in tumor burden upon intraperitoneal injection. based on encouraging preclinical data, adoptive t cell therapies using mrna were already subjected to first clinical testing. in a phase trial on solid tumors addressing the safety and feasibility of using such cells, car transfected lymphocytes migrated to tumor sites after iv injection [ ] . in addition, the study appeared to provide initial evidence of antitumor activity. due to the transient nature of mrna expression, subjects received repeated infusions of t cells. this led to an anaphylactic response in one patient who developed antibodies specific to the scfv domain of the car [ ] . however, this could be a consequence of the murine origin of this domain. in another trial, mrnatransfected car t cells were injected intratumorally in metastatic breast cancer patients [ ] . treatment was well tolerated and elicited an inflammatory response within tumors. as discussed above, the use of viral vectors for adoptive t cell therapy has potential safety issues. in various studies, authors ascribed toxicities particularly to the persistence of receptor-expressing t cells. regarding such concerns, mrna-mediated tcr or car expression offers at least two advantages. first, mrna does not integrate into a cell's genome, thus excluding genotoxicity. second, due to its transient nature, any potential toxicities accompanying treatment are temporary as well [ ] . however, the increased level of safety has a substantial drawback. apparently, clinical efficacy correlates with long-term persistence of receptor-engineered t cells [ , ] . as a consequence, mrna-transfected cells are expected to have limited antitumor activity because of rapidly declining receptor expression. substantial mrna translation only lasts about day which translates into efficacious receptor levels on the cell surface for several days [ , ] . importantly, clinical studies demonstrated that iv infused t lymphocytes reached tumor sites only - days after administration [ , ] . as a consequence, intratumoral or intraarterial administration was suggested to counteract the delayed cell arrival. the problem of transience of mrna is further enhanced by the well-known ligand-induced receptor internalization. upon target recognition, tcrs as well as cars are rapidly internalized, a mechanism which is important for proper signal transduction [ , ] . this explains why lentiviral vectors generated a more robust treatment effect than mrna [ ] . mrna transfection can give rise to high receptor expression, equaling lentiviral vectors [ ] . however, mrna was only similarly effective during the first hours after electroporation. later, contact to target cells strongly down-regulated receptor on the cell surface while lentiviral expression remained constant [ ] . in comparison to a single transfer of cells with retroviral car expression, an mrna-encoded receptor required three consecutive lymphocyte infusions to obtain a comparable antitumor effect [ ] . these observations and considerations may be put into perspective at least in part by the finding that transferred t cells can become tolerized rather rapidly, thereby losing their ability to function in the tumor microenvironment [ ] . thus, frequent injection of t cells may be desired even for viral vector transduced t cells. notably, mrna was also considered to be of use in settings that require long-term expression for therapeutic efficacy and preclude repeated cell administration. the greater safety of mrna-transfected t cells may make the initial testing of novel antigens and receptors with unknown on-target/off-tissue toxicity less hazardous [ ] . passive immunization with antibodies often requires considerable amounts of polypeptide to obtain therapeutically active concentrations after systemic administration. this poses a substantial challenge to the broad applicability of any nucleic acid mediated passive immunization strategy. thus, compared to the very first attempts regarding in vivo protein expression with mrna in the s, the optimization of ivt mrna to enhance and extend expression is a prerequisite for many mrna-based passive immunotherapies. as reviewed above, great progress towards this goal was made during the last almost three decades. another potential hurdle for passive mrna immunization is related to delivery. to obtain high levels of in vivo antibody expression, the mrna should be targeted to as many "producer" cells as possible which in turn should be transfected with high efficiency. to this end, the mrna which is prone to degradation by rnases should also be protected against these ubiquitous nucleases in an appropriate manner. moreover, a viable mrna complexation reagent needs to be well tolerated. notably, various commercial transfection reagents can be used to formulate mrna and suffice research purposes. among them, transit has been repeatedly used for in vivo studies [ , , ] . with respect to potential therapeutic applications, the class of lipid nanoparticles (lnps) became the most widely deployed means of complexation [ , , [ ] [ ] [ ] . after iv delivery, lnps mainly route to the liver on the basis of an apolipoprotein e (apoe)-dependent mechanism [ ] . however, such nanoparticles were demonstrated to be also applicable to intramuscular and subcutaneous administration [ ] . advances in mrna and formulation technology led to a couple of recent intriguing studies as to passive immunization with mrna (fig. ) . while the group of drew weissman described successful passive mrna immunization for prophylaxis of viral infections [ ] , stadler et al. demonstrated the applicability of mrna-mediated antibody expression for cancer immunotherapy [ ] . the feasibility of using mrna for such indications was confirmed by thran et al. who applied different mrnaencoded antibody formats to diverse biological threats, viruses, toxins, and tumors [ ] . finally, sabnis and colleagues presented first antibody expression data in nonhuman primates (nhps) in a publication dealing with the development of novel lnp formulations [ ] . schematic illustration of mrna-mediated passive antibody immunotherapy. for in vivo administration, mrna is usually formulated in nanoparticles which for instance can be administered by iv injection. for many formulations, liver is the main target organ. upon uptake of nanoparticles by hepatocytes and release of the mrna into the cytosol, it is translated into antibodies that are typically secreted into circulation and finally bind their cognate antigens mrna design and formulation fundamental designs of antibody-encoding mrnas reveal only a few common features but several differences (table ) . among the latter, the exclusive use of chemically unmodified nucleotides by thran et al. as in a previous publication from the same group [ ] may be the most prominent one, since it contrasts to all other reports. other differences are much more diverse among these studies. hence, they do not provide an unequivocal guidance for future work, but at least commonalities may be taken as a recommendation. although obviously not mandatory, mrna with cap structure was clearly preferred. in addition, all mrnas harbored a poly(a) tail. the use of bipartite poly(a) elements by some groups may be owed to the experience that maintenance of long poly(d[a/t]) vector sequences is challenging and strongly dependent on bacterial strains [ ] . beyond these common rna elements, the publications suggest that mrna should be subjected to further optimizations to exploit its full potential for antibody expression. however, different strategies appear to be applicable, but little is known about the interchangeability of individual elements. finally, chromatographic purification of ivt mrna appears to be generally recommended as well (table ) . whether pardi et al. actually used fplc as stated throughout their report instead of hplc applied by other groups is not fully clear, since they referred to earlier publications describing the use of hplc [ , ] . pardi et al. encoded a well-known, broadly neutralizing antibody against hiv- , vrc [ ] . to this end, heavy and light chains of the full igg antibody were represented on separate mrna molecules. for delivery, heavy and light chain mrnas were mixed in a molar ratio of : . likewise, thran et al. used separate mrnas to encode heavy and light chain of various full igg antibodies [ ] . titration of heavy and light chain mrna found a molar ratio of approximately . : to be optimal for co-delivery. neither report provides a rationale for encoding chains on separate molecules. in principle, a bicistronic construct separating heavy and light chain by an ires sequence or an mrna for a polypeptide where a a sequence between heavy and light chain would lead to separate antibody chains by ribosome skipping could have been used [ ] . for pardi et al. the observation that modified nucleotides can hamper the function of ires elements may have affected the selection [ ] . sabnis et al. also worked with a full igg antibody, directed against influenza a, but did not provide any details on how heavy and light chain were represented [ ] . in contrast, stadler et al. chose bite antibodies directed against tcr-associated cd and one of three different tumor-associated antigens (taas) [ ] . they displayed the bites as fab(scfv) or scfv molecules but focused on the latter format. their findings on single-chain antibodies are complemented by thran et al. whose work covers single domain-derived vnas in addition to igg antibodies [ ] . for iv administration of antibodyencoding mrna all but the group of ugur sahin used lnp formulations which, however, may differ from each other in composition ( table ). the latter team exploited transit but switched the route of administration which had been intraperitoneal in previous studies [ , ] . as with lnps, nanoparticles were shown to mainly target the liver upon iv injection [ ] . drew weissman's group administered µg of vrc encoding mrna in most in vivo studies [ ] . this corresponded to doses between and . mg/kg due to differences in mouse weight among experiments. antibody serum titers h after administration, the earliest time of analysis, ranged between approximately and µg/ml in various mouse strains. obviously, slight differences in dosage, as well as the respective strain contributed to varying peak levels. notably, increasing the administered mrna dose in steps of two enhanced serum titers by more than twofold with each step. moreover, µg of mrna in lnps generated higher serum titers than µg of recombinant vrc protein. the kinetics of antibody serum titers revealed an accelerated decline after about a week in balb/c mice. the kinetics in nsg mice appeared to be basically the same, since the level at week after single administration was largely the same as in balb/c at this time. the observed kinetics may also explain why weekly injections of mrna-lnps in nsg mice did not show additive effects on serum titers at the times of analyses. since measurements were conducted days after each treatment, antibodies from the preceding injection probably dropped to background levels within this -week period as observed in balb/c animals. such accelerated decline of protein titers after a few days is often indicative of the induction of an anti-drug antibody (ada) response [ ] . the likelihood of such a response may be particularly high for the reported experiments, since the authors expressed a human antibody in mice. the emergence of adas cannot be fully ruled out because animals were not analyzed accordingly. however, the apparently similar kinetics in immunocompromised nsg mice which are unable to develop adas suggests a different explanation for the pharmacokinetics. possibly, the mrna continues to express antibody for a few days which would inevitably lead to a seemingly extended antibody serum half-life during that period. only after expression ceases, the actual shorter antibody half-life becomes evident. while this could also easily explain the serum profile of repeated treatment of nsg mice, it remains hypothetical due to the lack of respective analyses. stadler et al. first characterized their mrna in vitro demonstrating expression and secretion of functional antibodies [ ] . in a pbmc-mediated killing assay, mrna-derived bite antibodies targeted ctls to tumor cells via binding to cd on pbmcs and to the cognate taa on tumor cells, thereby inducing t cell activation and tumor cell lysis. these antibodies were equally potent as the corresponding recombinant protein. in immunodeficient nsg mice, antibody plasma levels peaked within h, but rapidly declined by more than % within the next h. subsequently, the decrease of bite titers became much slower. the authors did not provide an explanation of this striking kinetics. a pharmacokinetic analysis in non-tumor-bearing mice could have elucidated whether the initial kinetics reflects the trapping of antibody in the engrafted tumor until saturation of binding sites. bite plasma levels above background for a few days were in accordance with the sustained ex vivo cytotoxicity of plasma from mrna-treated mice. in contrast to the antibody plasma kinetics, cytotoxicity showed a steady and slow decline during the observation period. . µg of mrna were already sufficient to obtain strong plasma activity in the ex vivo killing assay. µg of mrna (approx. . mg/kg) were comparable to - µg of recombinant antibody with respect to peak plasma concentrations that were in the range of . µg/ml in nsg mice. as opposed to this modified and hplc-purified mrna, antibody plasma levels were almost undetectable with mrna preparations without modification and chromatographic purification. plasma titers declined much faster for recombinant protein compared to mrna, thereby demonstrating the substantial impact of mrna on bite pharmacokinetics. consequently, only mrna was able to maintain a sustained cytotoxic activity of plasma by weekly administrations. the various mrna-encoded antibodies of thran et al. included vrc which had been used in drew weissman's work [ , ] . however, analyses were limited to in vitro characterization, preventing a direct comparison between studies. as observed for bites, igg and vna antibodies produced from mrna in vitro revealed potencies comparable to that of the respective recombinant proteins. iv administration of µg of unmodified mrna (approx. mg/kg) gave rise to antibody serum titers between and µg/ ml in immunocompetent mice. this contrasts strongly with the finding of stadler et al. who found unmodified mrna to be basically inactive. differences in purification and mrna design may be responsible for this striking discrepancy. similar to the weissman work, thran and colleagues observed a disproportionate increase of antibody serum titers with elevated mrna doses. onset of antibody expression was rather rapid, being already substantial h after injection and reaching peak levels after approximately h. this confirms findings on other mrna-mediated protein therapies showing that mrna starts accumulating in hepatocytes within minutes after administration and leads to substantial protein levels within a couple of hours [ , ] . serum half-life of igg antibodies appeared to be in the range of week and thus slightly longer compared to pardi et al. [ ] . as in the latter study, one of two iggs showed an accelerated decline after about week, however, only in approximately half of the animals. here, the expedited clearance could be assigned to the development of an ada response against the mrna-encoded antibody. importantly, this response was antibody-dependent and not intimately linked with the use of mrna. as expected, vnas revealed a much shorter serum half-life of about - days. compared to published kinetics data on recombinant vnas, mrna appeared to contribute to extended antibody availability during the first days after administration as it has been observed for bite-expressing mrna by ugur sahin's group. however, the lack of a headto-head comparison hampers a detailed analysis. while previous in vivo studies on mrna-mediated antibody expression were limited to mice, sabnis et al. presented expression results in nhps using a proprietary lnp formulation [ ] . a . mg/kg dose of mrna gave rise to antibody serum titers of about µg/ml h after iv administration which is at least at the lower end of the range of efficacy observed in mouse studies. however, data on a different protein suggest that efficacy of the formulation may be slightly lower in nhp than in mouse. whereas a . mg/ kg dose induced protein levels of approximately µg/ml in mice, a . mg/kg dose generated protein titers between and ng/ml in nhps. all mouse studies on mrna-encoded antibodies investigated their therapeutic efficacy. pardi et al. used two different humanized mouse models to demonstrate that mrnaderived vrc protects from hiv- challenge [ ] . mrna encoding a reporter protein was utilized as control. mrna-lnps were administered h prior to challenge with one of two hiv- isolates. in the authors' first model, a vrc mrna dose of . mg/kg was ineffective, but . mg/kg already reduced viral rna copies in the plasma to undetectable levels as assessed by quantitative real-time polymerase chain reaction (qrt-pcr). the latter dose is well below the - mg/kg doses that are typically used for prophylactic immunization with recombinant antibody in humanized mice to reach therapeutic concentrations [ , ] . however, the authors did not titrate the dose of recombinant vrc but used a mg/kg dose as control which was sufficient to completely eradicate viral rna copies in the plasma. mrna efficacy could be also demonstrated in the second mouse challenge model. to show in vivo efficacy of mrna-mediated bite expression, stadler et al. implanted tumor cells subcutaneously in immunodeficient nsg mice [ ] . about week before mrna treatment, human pbmcs were engrafted into these animals. µg of bite mrna (approx. . mg/ kg) given three times with an interval of week could eliminate tumors entirely. in contrast, tumors progressed in control animals that received mrna encoding a reporter protein. the recombinant bite required three injections per week and a total of ten injections of - µg each to obtain a comparable antitumor effect as with bite mrna. the need for a more frequent administration corroborated the previous finding that mrna substantially improved antibody plasma half-life. due to the diversity of antibodies included in their study, thran et al. utilized various disease models for demonstrating therapeutic efficacy [ ] . in contrast to all other studies, the authors applied mrna encoding irrelevant antibodies instead of a reporter protein as control. a µg dose (approx. mg/kg) of antibody mrna could protect mice from challenges with either rabies virus or botulinum toxin. in the intoxication model, mrna was proven to be equally protective as recombinant antibody. however, mice received approximately . mg/kg of recombinant vna compared to approximately mg/kg of mrna. based on protein expression levels from mrna dose titrations, lower doses than mg/kg may still confer full protection but this remains hypothetical, since the authors did not conduct an mrna dose titration in their challenge model. notably, mrna was effective in pre-as well as in post-exposure settings. the latter is important for some indications of passive immunization and confirms the aforementioned rapid onset of antibody expression. the post-exposure scenario for botulinum toxin requires very rapid availability of neutralizing antibodies. whereas recombinant protein can act immediately after administration, mrna needs more time to provide the antibody. hence, it may well be that in such instances higher doses of mrna than of protein are required, not for obtaining the same peak level but for reaching meaningful titers in a timely manner. in a further model, thran et al. evaluated their mrna approach with respect to anti-tumor efficacy. using a disseminated tumor model for rituximab, they showed efficient tumor growth control with injections of µg (approx. . mg/kg) of rituximab mrna twice a week. higher doses ( µg, approx. mg/kg) of recombinant rituximab were less potent. this finding is reminiscent of results of drew weissman's group and contrasts those of ugur sahin and colleagues who required similar doses of mrna and recombinant protein (but less frequent dosing with mrna) to obtain equivalent therapeutic effects [ , ] . notably, the difference among studies may be related to the use of igg antibodies on the one hand and a scfv protein on the other hand. moreover, the irrelevant antibody control used by thran et al. appeared to have a slight unspecific anti-tumor effect. it may have contributed to the superiority of mrna compared to recombinant protein regarding dosing. amongst other explanations, the potential unspecific effect may be due to an mrna-lnp-independent response to repeated treatment or may be the consequence of a weak and transient cytokine response observed after mrna-lnp administration. however, the phenomenon was not investigated further. in line with previous reports, pardi et al. confirmed the importance of highly purified ivt mrna. in combination with lnps, only modification with n -methyl-pseudouridine plus chromatographic purification was sufficient to avoid cytokine release by innate immune activation [ ] . for this analysis, however, the authors deployed an mrna encoding a different protein than the vrc antibody which had been used for in vivo expression and efficacy experiments. tolerability of mrna-lnps was also addressed by repeated mrna treatments. translation of vrc mrna was not compromised over time, but the analysis was conducted in immunodeficient nsg mice. to overcome this caveat, the authors complemented their study by repetitive treatment of immune competent balb/c mice. to this end, they switched to an endogenous protein, since human vrc may be recognized as foreign and thus elicit an immune response. again, mrna injections did not lose efficacy over time. however, the authors also changed the formulation (transit instead of lnp) as well as the route of administration (intraperitoneal instead of iv) compared to the use of vrc mrna. hence, evidence for immune silence and overall tolerability of vrc mrna in lnps is just circumstantial yet. stadler et al. did not observe any liver toxicity upon treatment with mrna in transit according to liver enzyme analyses [ ] . moreover, bite mrna administration did not elevate murine cytokines such as ifnα and tnfα above background in plasma. likewise, analysis of systemic human cytokine release from engrafted pbmcs did not show any unspecific t cell activation. as opposed to modified and hplc-purified mrna, preparations without nucleotide modification and chromatographic purification induced detectable levels of murine cytokines. similar to the weissman group, the authors also assessed the tolerability of formulated mrna by repeated injections. administrations did not lose efficacy over time, but as in the corresponding weissman experiment immunodeficient nsg mice were used. using chemically unmodified mrna formulated in lnps, thran et al. did not observe any liver toxicity in histopathological analyses [ ] . only a few animals developed an ada response which was dependent on encoded antibody and was, thus, no intrinsic consequence of treatment with mrna-lnp. in addition, treatment appeared to elicit a transient weak cytokine release which, however, neither suppressed antibody expression nor induced adverse effects. since there are ample differences among studies on mrna-mediated antibody expression and no detailed analyses of the issue, the role of mrna, lnp, and/or encoded antibody/protein in cytokine induction remains elusive. an earlier study on erythropoietin showing the absence of any appreciable immunostimulation suggests that the use of chemically unmodified instead of modified mrna is not the decisive parameter [ ] . quite a few in vivo studies provided compelling evidence for the principle feasibility of mrna-based immunotherapies. as discussed above, challenges and open questions regarding adoptive t cell transfer are less related to the mrna and its formulation or transfection but more of fundamental character. in contrast to ex vivo loading of cells, mrnamediated antibody expression is strongly affected by body size. thus, while there are now convincing efficacy data in diverse small rodent models, the translation to larger animals and finally humans has still to be demonstrated. first data suggest that substantial expression can be obtained in small nhps. however, the utilized lnps appeared to lose some efficacy when switching from mouse to nhp. hence, the development of human therapies may perhaps require further advancements of the mrna technology as well as primatespecific formulations with improved efficacy. in addition, tolerability of formulations has to be analyzed further and in more depth in the future. for instance, repeated dosing of nanoparticles can induce complement activation-related pseudoallergy (carpa) [ ] . however, this can in principle be counteracted by optimization towards better biocompatibility. in case of lnps, fast degradation was shown to be particularly important [ , , ] . while antibodies for cancer treatment were initially developed for iv administration, there is a trend towards subcutaneous injections today. for instance, rituximab was initially formulated for iv infusion which is typically administered over a period of . - h [ ] . this treatment schedule poses a substantial burden to patients as well as the healthcare system. thus, a formulation which reduces the time and required resources would be advantageous. to meet these goals by subcutaneous administration, the antibody solution was concentrated -fold [ , ] . since this volume was still too large for subcutaneous injection, rituximab was co-formulated with human hyaluronidase which limits swelling and associated pain by increasing the dispersion and absorption of co-administered substances [ , ] . now, median administration time for rituximab using the subcutaneous route is min. as a consequence, antibody immunotherapy with mrna does not only require competitive efficacy and costs but also routes of administration to become a viable alternative to recombinant proteins. although other routes than iv have been shown to be possible for mrna, there are still a few open questions to be addressed by future studies. where are the advantages of using mrna for antibody immunotherapies? compared to dna it may be primarily the safety aspect. concerning recombinant proteins various points matter. as reviewed above, mrna provides benefits as to the pharmacokinetics when short-lived antibodies such as scfv, (bi)-scfv or vna are used. moreover, solving the challenges of antibody cocktails may be easier using mrna. different mrna sequences are much more similar with respect to their physicochemical characteristics than different proteins are. hence, producing a cocktail may be less demanding for mrna compared to protein. however, co-delivery and thus co-expression implicates the risk of antibody chimerism and thus requires specific solutions such as knob-into-hole concepts [ ] . last but not least, while proteins are difficult to deliver directly through the cell membrane [ ] , mrna-mediated protein expression makes a large number of potential intracellular targets accessible to antibody immunotherapy. particularly single-chain and single-domain formats are amenable to functional expression in the cytosol and thus suited as intrabodies, since they are less dependent on disulfide bond formation [ , ] . the value of targeting intracellular proteins has already been demonstrated by various studies. a bispecific scfv could restore p function in mutant p colon cancer cells and trapping ccr in the er via an intrabody reduced hiv cell entry [ , ] . support for the potential of intrabodies as therapeutics also comes from further work in the field of oncology or neurodegenerative diseases [ ] [ ] [ ] . although it has been recently demonstrated that even a full antibody can be delivered into cells in vivo [ ] , it has been recognized that fusions of cell-penetrating peptides (cpps) and macromolecules are often trapped in endosomes instead of being released to the cytoplasm [ ] . in contrast, nucleic acids including mrna can be very efficiently transfected into cells, making them ideal for the delivery of intrabodies. however, while lnps provide very efficient solutions for systemic delivery to the liver, formulations for routing mrna to other tissues are scarce today. with the recent burst of publications on successful applications of mrnabased antibody immunotherapy it is likely that there will be much to follow in the near future. pathogen recognition and innate immunity inside, outside, upside down: damage-associated molecular-pattern molecules (damps) and redox fundamentals and methods for t-and b-cell epitope prediction b-lymphocyte lineage cells and the respiratory system current advances in t-cell-based cancer immunotherapy foxp + regulatory t cells in the human immune system human dendritic cell subsets: an update correlates of protection induced by vaccination cancer immunotherapy via dendritic cells recombinant snakebite antivenoms: a cost-competitive solution to a neglected tropical disease? strategies and challenges for the next generation of therapeutic antibodies contributions of humoral and cellular immunity to vaccine-induced protection in humans ueber das zustandekommen der diphtherie-immunitat und der tetanus-immunitat bei thieren passive immunity in the prevention of rabies passive antibody therapy for infectious diseases monoclonal antibody-based therapies for microbial diseases passive immunity in prevention and treatment of infectious diseases use of concentrated human serum gammaglobulin in the prevention and attenuation of measles evaluation of red cross gamma globulin as a prophylactic agent for poliomyelitis. iv. final report of results based on clinical diagnoses serum therapy revisited: animal models of infection and development of passive antibody therapy the growth and potential of human antiviral monoclonal antibody therapeutics efficacy of immune plasma in treatment of argentine haemorrhagic fever and association between treatment and a late neurological syndrome treatment of argentine hemorrhagic fever rabies in the americas and remarks on global aspects clostridial infections botulism in the united states: a clinical and epidemiologic review heterologous antisera and antivenins are essential biologicals: perspectives on a worldwide crisis comparison of an anti-rabies human monoclonal antibody combination with human polyclonal anti-rabies immune globulin continuous cultures of fused cells secreting antibody of predefined specificity an overview of the use of the monoclonal antibody okt in renal transplantation overview of antibody phage-display technology and its applications a surface expression vector for antibody screening selecting and screening recombinant antibody libraries ribosome display human monoclonal antibody as prophylaxis for sars coronavirus infection in ferrets a neutralizing human monoclonal antibody protects african green monkeys from hendra virus challenge predominant autoantibody production by early human b cell precursors efficient generation of monoclonal antibodies from single human b cells by single cell rt-pcr and expression vector cloning therapeutic antibodies for autoimmunity and inflammation fcrn: the neonatal fc receptor comes of age clinical pharmacokinetics of therapeutic monoclonal antibodies monoclonal antibody pharmacokinetics and pharmacodynamics a theoretical model of gamma-globulin catabolism mediator of transmission of immunity and protection from catabolism for igg enhanced antibody half-life improves in vivo activity pharmacokinetics and pharmacodynamics of monoclonal antibodies: concepts and lessons for drug development challenges in monoclonal antibody-based therapies expression of recombinant antibodies current state and recent advances in biopharmaceutical production in escherichia coli, yeasts and mammalian cells production of recombinant protein therapeutics in cultivated mammalian cells therapeutic antibody expression technology high level transient production of recombinant antibodies and antibody fusion proteins in hek cells biopharmaceutical benchmarks antibody glycosylation and its impact on the pharmacokinetics and pharmacodynamics of monoclonal antibodies and fc-fusion proteins production of non-fucosylated antibodies by co-expression of heterologous gdp- -deoxy-d-lyxo- -hexulose reductase engineered antibody fragments and the rise of single domains camelid and shark single domain antibodies: structural features and therapeutic potential llama-derived single domain antibodies to build multivalent, superpotent and broadened neutralizing anti-viral molecules comparison of physical chemical properties of llama vhh antibody fragments and mouse monoclonal antibodies prolonged prophylactic protection from botulism with a single adenovirus treatment promoting serum expression of a vhh-based antitoxin protein nanobodies with in vitro neutralizing activity protect mice against h n influenza virus infection single-domain antibodies targeting neuraminidase protect against an h n influenza virus challenge design and evaluation of a diabody to improve protection against a potent scorpion neurotoxin identification of potent nanobodies to neutralize the most poisonous polypeptide from scorpion venom systematic screening of soluble expression of antibody fragments in the cytoplasm of design and production of novel tetravalent bispecific antibodies knobs-into-holes' engineering of antibody ch domains for heavy chain heterodimerization systemic administration of a bispecific antibody targeting egfrviii successfully treats intracerebral glioma bispecific antibodies and trispecific immunocytokines for targeting the immune system against cancer: preparing for the future targeting and killing of glioblastoma with activated t cells armed with bispecific antibodies dual targeting strategies with bispecific antibodies diabodies": small bivalent and bispecific antibody fragments single-chain mono-and bispecific antibody derivatives with novel biological properties and antitumour activity from a cos cell transient expression system immunotherapeutic perspective for bispecific antibodies antibody pharmacokinetics and pharmacodynamics elimination mechanisms of therapeutic monoclonal antibodies bispecific antibodies rise again clinical use of blinatumomab for b-cell acute lymphoblastic leukemia in adults tailoring structure-function and pharmacokinetic properties of single-chain fv proteins by site-specific pegylation strategies to extend plasma half-lives of recombinant antibodies properties, production, and applications of camelid single-domain antibody fragments transmigration of beta amyloid specific heavy chain antibody fragments across the in vitro blood-brain barrier protective effect of different anti-rabies virus vhh constructs against rabies disease in mice post-exposure treatment with anti-rabies vhh and vaccine significantly improves protection of mice from lethal rabies infection a mab recognizing a surface antigen of mycobacterium tuberculosis enhances host survival antibodies to a cell surface histone-like protein protect against histoplasma capsulatum monoclonal antibodies for prophylactic and therapeutic use against viral infections single-chain fragment variable passive immunotherapies for neurodegenerative diseases human respiratory syncytial virus and other viral infections in infants receiving palivizumab bezlotoxumab (zinplava) for prevention of recurrent clostridium difficile infection depletion of b cells in vivo by a chimeric mouse human monoclonal antibody to cd rituximab, an anti-cd monoclonal antibody: history and mechanism of action targeted therapy with the t-cell-engaging antibody blinatumomab of chemotherapy-refractory minimal residual disease in b-lineage acute lymphoblastic leukemia patients results in high response rate and prolonged leukemia-free survival blinatumomab: a bispecific t cell engager (bite) antibody against cd /cd for refractory acute lymphoid leukemia alternative molecular formats and therapeutic applications for bispecific antibodies adoptive t-cell immunotherapy t-cell therapy: options for infectious diseases cancer regression in patients after transfer of genetically engineered lymphocytes chimeric antigen receptor-and tcr-modified t cells enter main street and wall street adoptive cell transfer as personalized immunotherapy for human cancer gene therapy with human and mouse t-cell receptors mediates cancer regression and targets normal tissues expressing cognate antigen tumor regression in patients with metastatic synovial cell sarcoma and melanoma using genetically engineered lymphocytes reactive with ny-eso- t cells targeting carcinoembryonic antigen can mediate regression of metastatic colorectal cancer but induce severe transient colitis adoptive transfer of t-cell immunity: gene transfer with mhc-restricted receptors transgenic tcr expression: comparison of single chain with full-length receptor constructs for t-cell function independent association of t cell receptor beta and gamma chains with cd in the same cell surface expression of only gamma delta and/or alpha beta t cell receptor heterodimers by cells with four (alpha, beta, gamma, delta) functional receptor chains alpha beta t cell receptor transfer to gamma delta t cells generates functional effector cells without mixed tcr dimers in vivo talenmediated editing of endogenous t-cell receptors facilitates efficient reprogramming of t lymphocytes by lentiviral gene transfer optimizing t-cell receptor gene therapy for hematologic malignancies expression of immunoglobulin-t-cell receptor chimeric molecules as functional receptors with antibody-type specificity the t-body approach: potential for cancer immunotherapy the basic principles of chimeric antigen receptor design specific activation and targeting of cytotoxic lymphocytes through chimeric single chains consisting of antibody-binding domains and the gamma or zeta subunits of the immunoglobulin and t-cell receptors chimeric antigen receptors for t cell immunotherapy: current understanding and future directions structural design of engineered costimulation determines tumor rejection kinetics and persistence of car t cells chimeric antigen receptors modified t-cells for cancer therapy adoptive t cell transfer for cancer immunotherapy in the era of synthetic biology adoptive immunotherapy for cancer: harnessing the t cell response chimeric antigen receptor t cells for sustained remissions in leukemia cd car-t cells of defined cd + :cd + composition in adult b cell all patients t cells expressing cd chimeric antigen receptors for acute lymphoblastic leukaemia in children and young adults: a phase dose-escalation trial chimeric antigen receptor-modified t cells for the treatment of solid tumors: defining the challenges and next steps state-of-the-art gene-based therapies: the road ahead going viral: chimeric antigen receptor t-cell therapy for hematological malignancies case report of a serious adverse event following the administration of t cells transduced with a chimeric antigen receptor recognizing erbb treatment of metastatic renal cell carcinoma with autologous t-lymphocytes genetically retargeted against carbonic anhydrase ix: first clinical experience treatment of metastatic renal cell carcinoma with caix car-engineered t cells: clinical evaluation and management of on-target toxicity b-cell depletion and remissions of malignancy along with cytokineassociated toxicity in a clinical trial of anti-cd chimericantigen-receptor-transduced t cells clinical development of car t cells-challenges and opportunities in translating innovative treatment concepts current concepts in the diagnosis and management of cytokine release syndrome preventing and exploiting the oncogenic potential of integrating gene vectors gene therapy. safer and virus-free? lmo -associated clonal t cell proliferation in two patients after gene therapy for scid-x genotoxicity of retroviral integration in hematopoietic cells generation of knock-in primary human t cells using cas ribonucleoproteins pharmacokinetics and immunogenicity of broadly neutralizing hiv monoclonal antibodies in macaques state of play and clinical prospects of antibody gene transfer vector-mediated antibody gene transfer for infectious diseases generation of neutralizing activity against human immunodeficiency virus type in serum by antibody gene transfer antibody-based protection against hiv infection by vectored immunoprophylaxis broad protection against influenza infection by vectored immunoprophylaxis in mice stable antibody expression at therapeutic levels using the a peptide recurrent aav -related insertional mutagenesis in human hepatocellular carcinomas general considerations on the biosafety of virus-derived vectors used in gene therapy and vaccination pre-existing immunity against ad vectors: humoral, cellular, and innate response, what's important? hum vaccines immunother promise and problems associated with the use of recombinant aav for the delivery of anti-hiv antibodies preexisting anti-adeno-associated virus antibodies as a challenge in aav gene therapy in situ production of therapeutic monoclonal antibodies erythropoietin gene therapy leads to autoimmune anemia in macaques vector-mediated gene transfer engenders long-lived neutralizing activity and protection against siv infection in monkeys broadly neutralizing human immunodeficiency virus type antibody gene transfer protects nonhuman primates from mucosal simian-human immunodeficiency virus infection adeno-associated virus-mediated gene transfer to nonhuman primate liver can elicit destructive transgene-specific t cell responses modified mrna directs the fate of heart progenitor cells and induces vascular regeneration after myocardial infarction use of frog eggs and oocytes for the study of messenger rna and its translation in living cells translation of encephalomyocarditis viral rna in oocytes of xenopus laevis cationic liposome-mediated rna transfection direct gene transfer into mouse muscle in vivo induction of virusspecific cytotoxic t lymphocytes in vivo by liposomeentrapped mrna gene gun delivery of mrna in situ results in efficient transgene expression and genetic immunization dendritic cells pulsed with rna are potent antigen-presenting cells in vitro and in vivo in vivo application of rna leads to induction of specific cytotoxic t lymphocytes and antibodies ′-terminal cap structure in eucaryotic messenger ribonucleic acids how the messenger got its tail: addition of poly(a) in the nucleus formation of the ′ end of histone mrna the cap and poly(a) tail function synergistically to regulate mrna translational efficiency the enzymes and control of eukaryotic mrna turnover concerted action of poly(a) nucleases and decapping enzyme in mammalian mrna turnover reverse ′ caps in rnas made in vitro by phage rna polymerases synthesis and properties of mrnas containing the novel "anti-reverse" cap analogs -methyl( ′-o-methyl)gpppg and -methyl ( ′-deoxy)gpppg novel "anti-reverse" cap analogs with superior translational properties effective delivery with enhanced translational activity synergistically accelerates mrna-based transfection mrna transfection of dendritic cells: synergistic effect of arca mrna capping with poly(a) chains in cis and in trans for a high protein expression level phosphorothioate cap analogs stabilize mrna and increase translational efficiency in mammalian cells phosphorothioate cap analogs increase stability and translational efficiency of rna vaccines in immature dendritic cells and induce superior immune responses in vivo modification of the ′ end of mrna. association of rna triphosphatase with the rna guanylyltransferase-rna (guanine- -)methyltransferase complex from vaccinia virus cap-specific mrna (nucleoside-o ′-)-methyltransferase and poly(a) polymerase stimulatory activities of vaccinia virus are mediated by a single protein a conserved histidine in the rna sensor rig-i controls immune tolerance to n - ′o-methylated self rna inhibition of translation by ifit family members is determined by their ability to interact selectively with the ′-terminal regions of cap -, cap -and ′ppp-mrnas viral stressinducible protein p inhibits translation by blocking the interaction of eif with the ternary complex eif .gtp.met-trnai mrna poly(a) tail, a ′ enhancer of translational initiation modification of antigen-encoding rna increases stability, translational efficacy, and t-cell stimulatory capacity of dendritic cells multiple injections of electroporated autologous t cells expressing a chimeric antigen receptor mediate regression of human disseminated tumor mrna with a < -nt poly(a) tail imparted by the poly(a)-limiting element is translated as efficiently in vivo as long poly(a) mrna optimized transfection of mrna transcribed from a d(a/t) tail-containing vector trans-acting translational regulatory rna binding proteins increased erythropoiesis in mice injected with submicrogram quantities of pseudouridine-containing mrna encoding erythropoietin an element within the ′ untranslated region of human hsp mrna which acts as a general enhancer of mrna translation optimization of mrna untranslated regions for improved expression of therapeutic mrna an mrna stability complex functions with poly(a)-binding protein to stabilize mrna in vitro codon usage and trna genes in eukaryotes: correlation of codon usage diversity with translation efficiency and with cgdinucleotide usage as assessed by multivariate analysis evolution of synonymous codon usage in metazoans quantitative effect of suboptimal codon usage on translational efficiency of mrna encoding hiv- gag in intact t cells translation of angiotensin-converting enzyme upon liver-and lung-targeted delivery of optimized chemically modified mrna codon optimality, bias and usage in translation and mrna decay reversal of diabetes insipidus in brattleboro rats: intrahypothalamic injection of vasopressin mrna species-specific recognition of single-stranded rna via toll-like receptor and recognition of double-stranded rna and activation of nf-kappab by toll-like receptor innate antiviral responses by means of tlr -mediated recognition of single-stranded rna ′-triphosphate rna is the ligand for rig-i differential roles of mda and rig-i helicases in the recognition of rna viruses rig-i-mediated antiviral responses to single-stranded rna bearing ′-phosphates rig-i detects viral genomic rna during negative-strand rna virus infection ′-triphosphate-dependent activation of pkr by rnas with short stem-loops suppression of rna recognition by toll-like receptors: the impact of nucleoside modification and the evolutionary origin of rna incorporation of pseudouridine into mrna yields superior nonimmunogenic vector with increased translational capacity and biological stability incorporation of pseudouridine into mrna enhances translation by diminishing pkr activation nucleoside modifications in rna limit activation of ′- ′-oligoadenylate synthetase and increase resistance to cleavage by rnase l screening of mrna chemical modification to maximize protein expression with reduced immunogenicity efficacy and immunogenicity of unmodified and pseudouridine-modified mrna delivered systemically with lipid nanoparticles in vivo n( )-methylpseudouridine-incorporated mrna outperforms pseudouridine-incorporated mrna by providing enhanced protein expression and reduced immunogenicity in mammalian cell lines and mice nucleoside-modified mrna vaccines induce potent t follicular helper and germinal center b cell responses chemical and structural effects of base modifications in messenger rna chemical pulldown reveals dynamic pseudouridylation of the mammalian transcriptome a mettl -mettl complex mediates mammalian nuclear rna n -adenosine methylation transcriptome-wide mapping reveals reversible and dynamic n( )-methyladenosine methylome sequence-engineered mrna without chemical nucleoside modifications enables an effective protein therapy in large animals messenger rna-based vaccines generating the optimal mrna for therapy: hplc purification eliminates immune activation and improves translation of nucleoside-modified, protein-encoding mrna spontaneous cellular uptake of exogenous messenger rna in vivo is nucleic acid-specific, saturable and ion dependent in vivo messenger rna introduction into the central nervous system using polyplex nanomicelle therapeutic efficacy in a hemophilia b model using a biosynthetic mrna liver depot system systemic delivery of factor ix messenger rna for protein replacement therapy biocompatible, purified vegf-a mrna improves cardiac function after intracardiac injection week post-myocardial infarction in swine systemic delivery of modified mrna encoding herpes simplex virus thymidine kinase for targeted cancer gene therapy exploring cytotoxic mrnas as a novel class of anti-cancer biotherapeutics expression of therapeutic proteins after delivery of chemically modified mrna in mice in vivo genome editing using nuclease-encoding mrna corrects sp-b deficiency modified foxp mrna protects against asthma through an il- -dependent mechanism systemic messenger rna therapy as a treatment for methylmalonic acidemia targeted mrna therapy for ornithine transcarbamylase deficiency g pc mrna therapy positively regulates fasting blood glucose and decreases liver abnormalities in a mouse model of glycogen storage disease a quantitative systems pharmacology model of hugt a -modrna encoding for the ugt a enzyme to treat crigler-najjar syndrome type mrna treatment produces sustained expression of enzymatically active human adamts in mice chemically modified rna induces osteogenesis of stem cells and human tissue explants as well as accelerates bone healing in rats tendon healing induced by chemically modified mrnas treatment of neurological disorders by introducing mrna in vivo using polyplex nanomicelles intrathecal delivery of frataxin mrna encapsulated in lipid nanoparticles to dorsal root ganglia as a potential therapeutic for friedreich's ataxia messenger rna-based therapeutics for brain diseases: an animal study for augmenting clearance of beta-amyloid by intracerebral administration of neprilysin mrna loaded in polyplex nanomicelles efficient genetic modification of murine dendritic cells by electroporation with mrna highly efficient gene delivery by mrna electroporation in human hematopoietic cells: superiority to lipofection and passive pulsing of mrna and to electroporation of plasmid cdna for tumor antigen loading of dendritic cells high-efficiency transfection of primary human and mouse t lymphocytes using rna electroporation a new way to generate cytolytic tumor-specific t cells: electroporation of rna coding for a t cell receptor into t lymphocytes a gmp-compliant protocol to expand and transfect cancer patient t cells with mrna encoding a tumor-specific chimeric antigen receptor rna-transfection of gamma/delta t cells with a chimeric antigen receptor or an alpha/beta t-cell receptor: a safer alternative to genetically engineered alpha/beta t cells for the immunotherapy of melanoma transfer of mrna encoding recombinant immunoreceptors reprograms cd + and cd + t cells for use in the adoptive immunotherapy of cancer treatment of advanced leukemia in mice with mrna engineered t cells adoptive immunotherapy using human peripheral blood lymphocytes transferred with rna encoding her- /neu-specific chimeric immune receptor in ovarian cancer xenograft model novel t cells with improved in vivo anti-tumor activity generated by rna electroporation mesothelin-specific chimeric antigen receptor mrna-engineered t cells induce anti-tumor activity in solid malignancies t cells expressing chimeric antigen receptors can cause anaphylaxis in humans ) safety and efficacy of intratumoral injections of chimeric antigen receptor (car) t cells in metastatic breast cancer nonviral rna transfection to transiently modify t cells with chimeric antigen receptors for adoptive therapy adoptive t cell therapy for cancer in the clinic adoptive cell transfer: a clinical path to effective cancer immunotherapy survival and tumor localization of adoptively transferred melan-a-specific t cells in melanoma patients phase i trial of adoptive immunotherapy with cytolytic t lymphocytes immunized against a tyrosinase epitope signal transduction and endocytosis: close encounters of many kinds endosomes: a legitimate platform for the signaling train redirecting t cells to ewing's sarcoma family of tumors by a chimeric nkg d receptor expressed by lentiviral transduction or mrna transfection immunological quality and performance of tumor vessel-targeting car-t cells prepared by mrna-ep for clinical research interleukin- rescues tolerant cd + t cells for use in adoptive immunotherapy of established tumors elimination of large tumors in mice by mrna-encoded bispecific antibodies structure, activity and uptake mechanism of sirna-lipid nanoparticles with an asymmetric ionizable lipid expression kinetics of nucleoside-modified mrna delivered in lipid nanoparticles to mice by various routes administration of nucleoside-modified mrna encoding broadly neutralizing antibody protects humanized mice from hiv- challenge ) mrna mediates passive vaccination against infectious agents, toxins, and tumors a novel amino lipid series for mrna delivery: improved endosomal escape and sustained pharmacology and safety in non-human primates hplc purification of in vitro transcribed long rna comparison of ires and f a-based locus-specific multicistronic expression in stable mouse lines hiv therapy by a combination of broadly neutralizing antibodies in humanized mice passive immunization with a human monoclonal antibody protects hu-pbl-scid mice against challenge by primary isolates of hiv- complement activation-related pseudoallergy: a stress reaction in blood triggered by nanomedicines and biologicals phase iii safety study of rituximab administered as a -minute infusion in patients with previously untreated diffuse large b-cell and follicular lymphoma subcutaneous administration of rituximab (mabthera) and trastuzumab (herceptin) using hyaluronidase non-clinical pharmacokinetic/pharmacodynamic and early clinical studies supporting development of a novel subcutaneous formulation for the monoclonal antibody rituximab a recombinant human enzyme for enhanced interstitial transport of therapeutics delivery of antibodies to the cytosol: debunking the myths intracellular and cell surface displayed single-chain diabodies phenotypic lentivirus screens to identify functional single domain antibodies construction and expression of a bispecific single-chain antibody that penetrates mutant p colon cancer cells and binds p barbas cf rd ( ) functional deletion of the ccr receptor by intracellular immunization produces cells that are refractory to ccr -dependent hiv- infection and cell fusion a novel intracellular antibody against the e oncoprotein impairs growth of human papillomavirus -positive tumor cells in mouse models a human singlechain fv intrabody blocks aberrant cellular effects of overexpressed alpha-synuclein trapping prion protein in the endoplasmic reticulum impairs prpc maturation and prevents prpsc accumulation a cell-penetrating whole molecule antibody targeting intracellular hbx suppresses hepatitis b virus via trim -dependent pathway delivery of macromolecules using arginine-rich cell-penetrating peptides: ways to overcome endosomal entrapment acknowledgements we thank mariola fotin-mleczek for discussion on the manuscript. we are grateful to nigel horscroft and michael stolz for critical reading of the review. we also thank bettina danker for her graphical illustrations. finally, we apologize to those authors whose work was not cited owing to space limitations. key: cord- -li x m authors: hung, ling-chu title: the monoclonal antibody recognized the open reading frame protein in porcine circovirus type -infected peripheral blood mononuclear cells date: - - journal: viruses doi: . /v sha: doc_id: cord_uid: li x m the purpose of this study in the context of the open reading frame (orf ) protein of porcine circovirus type (pcv ) was especially its location and its relation to the capsid protein and the apoptosis protein in pcv -infected porcine peripheral blood mononuclear cells (pbmcs). to detect the orf protein, monoclonal antibodies (mabs) were generated in this study. the mab d binds to the orf peptide (residues – ) and the native orf protein in pcv -infected pbmcs, as shown by immunofluorescence assay (ifa). the data show that – % of pbmcs were positive for orf protein or p protein. further, – % of pbmcs were positive for the capsid. this study confirmed the orf protein not only colocalized with the capsid protein but also colocalized with the p protein in pbmcs. immunoassays were conducted in this study to detect the capsid protein, the orf protein, anti-capsid igg, and anti-orf igg. the data show the correlation (r = . ) of the orf protein and the capsid protein in the blood samples from the pcv -infected herd. however, each anti-viral protein igg had a different curve of the profile in the same herd after vaccination. overall, this study provides a blueprint to explore the orf protein in pcv -infected pbmcs. the porcine circovirus (pcv) is a small virus and contains closed circular single-stranded dna [ ] . previous studies indicated that porcine circovirus type (pcv ) is non-pathogenic to porcine herd [ , ] . however, porcine circovirus type (pcv )-infected pigs showed dullness, thinness, lymphadenopathy, jaundice, hepatomegaly, and other manifestations [ ] [ ] [ ] [ ] [ ] . lymphocyte depletion and apoptosis of lymphocytes were the significant histopathological lesions in lymphoid tissues of pcv -infected pigs [ , ] . pcv infection causes a reduction of t-and b-lymphocytes in the pbmc [ ] . likewise, pcv -associated disease (pcvad) manifested as severe swine herd problems [ , [ ] [ ] [ ] . recently, pcvad has become one of the significant swine diseases worldwide, and it impacts pork production [ ] . at least four commercial pcv a vaccines have been widely used to reduce viremia and pcv infectious pressure in swine herds [ ] [ ] [ ] [ ] . that might have caused the global trend shift from pcv a to pcv b first [ ] [ ] [ ] [ ] , and then pcv d (pcv b mutant strain) has become a predominant genotype globally [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] . the open reading frame (orf ), open reading frame (orf ), and open reading frame (orf ) genes are the three principal orfs among orfs in pcvs [ ] . the orf encodes for the replicase peptide c and peptide n were appended with an n-terminal cysteine during synthesis, which was required for conjugation with maleimide-activated carriers. animal experiments in this study were performed following the current legislation on ethical and welfare recommendations. the experiments followed the standards of the guide of the care and use of laboratory animals. all animal work was approved by the institutional animal care and use committee (iacuc) of the livestock research institute, and the iacuc of the animal health research institute. the iacuc approval numbers lriiacuc - (for swine and murine experiments), a (for murine and rabbit experiments), and a (for murine and rabbit experiments) were given in this study. antisera against pcv were generated by immunizing new zealand white rabbits (from livestock research institute, council of agriculture, taiwan). rabbits were maintained in isolation rooms, and the room temperature was at - • c. they were fed with a commercially pelleted diet (fwusow in., taiwan), and pure water was available ad libitum. then, animals were immunized with peptide immunogen (the peptide n conjugated with klh) or commercial vaccine five times at two-week intervals. for peptide immunization, an initial dose of µg of the conjugated peptide was mixed with complete freund's adjuvant (sigma-aldrich, st. louis, mo, usa) at the first injection. each rabbit was re-immunized with the same amount of immunogenic mixture with incomplete freund s adjuvant (sigma-aldrich, st. louis, mo, usa). for virus-like particles (vlp) of pcv immunization, the rabbit was injected intramuscularly in the legs with . ml of the pcv vaccine (circoflex ® , boehringer ingelheim). the blood was harvested two weeks after the last injection. the antibody titers were measured by indirect enzyme-linked immunosorbent assay (ielisa) [ ] (performed as described previously for mouse anti-pcv elisa, with the exception that peroxidase-conjugated donkey anti-rabbit igg (h + l, jackson immunoresearch, west grove, pa, usa) secondary antibody was used). hybridomas were generated following established methods, as previously described [ ] . briefly, four five-week-old, female, balb/cbyjnarl mice were purchased from a specific pathogen-free (spf) colony (the national applied research laboratories, taiwan). the mice were maintained in isolation rooms in filtertop cages, and the room temperature was at - • c. mice were fed with a commercially pelleted diet (labdiet, , st. louis, mo, usa), and pure water was available ad libitum. mice were inoculated with µg of immunogen (the peptide n conjugated with klh) emulsified in freund's adjuvant. subsequently, mice were boosted fortnightly with the same amount of immunogen. three days after the final booster, the spleens of the mice were harvested for hybridoma generation. hybridomas were screened for the secretion of anti-orf -specific mabs by ielisa [ ] using the peptide n as a coating antigen. that secondary antibody solution (hrp-conjugated goat anti-mouse iga/igg/igm, h/l chain (novus, saint charles, mo, usa) was used at : dilution for hybridoma screening. the hybridomas that secreted anti-n mabs were subcloned by limited dilution of the cells. then, ascites containing mabs were collected as previously described [ ] . subsequently, the mabs were purified from the ascites by using the nab protein g spin column (thermo scientific, rockford, il, usa), then collecting solutions were exchanged for the buffer and concentrated using the amicon ultra k (millipore, burlington, ma, usa). the concentration of mabs was determined using the nano photometer ® (implen, munich, germany). the heavy chain and the light chain of the two mabs secreted by each hybridoma were determined by using an sba clonotyping system/hrp (southern biotechnology, birmingham, al, usa). it was performed according to the manufacturer's instructions and a previously published method [ ] . briefly, the nunc maxisorp flat-bottom -well plate (thermo fisher scientific, inc., waltham, ma, usa) was coated with the capture antibody (goat anti-mouse ig, µg/ml) in . -m bicarbonate buffer (sigma-aldrich, st. louis, mo, usa) at • c overnight. after three washes with pbs containing . % tween (pbst), the plate was blocked with the blocking buffer (pbst containing % casein hydrolysate) at • c for min. after washing, µl of each culture supernatant of hybridoma were added sequentially, and the plate was incubated at • c for h. after washing, µl of hrp labeled goat anti-mouse igg , -igg a, -igg b, -igg , -iga, -igm, -κ light chain, and -λ light chain were added to appropriate wells of the plate. pbst served as background. the plate was then incubated at • c for h and after that washed with pbst. finally, µl of , -azino-bis ( -ethylbenzothiazoline- -sulfonic acid) diammonium salt (abts, sigma-aldrich, st. louis, mo, usa) buffer were added to each well of the plate. after min, the optical density (od) of each well was measured at nm using a spectramax m microplate reader (molecular devices, san jose, ca, usa). the epitope mapping of these mabs was performed by ielisa [ ] using truncated peptides (as shown in table ) as coating antigens. briefly, peptides contained the sequence of orf protein of pcv b between residues and , associated -mer peptides, truncated derivatives, and the control (peptide c ). ninety-six-well maxisorp plates were coated with each peptide ( µg/ml) in bicarbonate buffer and incubated at • c overnight. after three washes in pbst, the plates were blocked with the blocking buffer at • c for min. after washing, the culture supernatant of hybridoma was added, and plates were again incubated at • c for h. the anti-peptide n mouse serum (at a : dilution) was used as the positive control, and the anti-orf (peptide c ) mab ( h ) was served as the negative control. after rinsing three times with pbst, the secondary antibody solution was applied to the appropriate wells. peroxidase-conjugated goat anti-mouse igg (subclasses + a + b + , fcγ, jackson immunoresearch, west grove, pa, usa) was used at a : dilution for binding to the mab d . hrp-conjugated goat anti-mouse lambda light chain (novus, saint charles, mo, usa) was used at a : dilution for binding to the mab d . after h, the plates were washed three times. the colorimetric reaction was developed by using the abts solution. following a -min incubation, the od was measured at nm. both of the isotype and molecular weight of the mab d were determined by western blotting. the culture supernatant of the hybridoma (clone d ) was separated by bolttm bis-tris plus gel (invitrogen, carlsbad, ca, usa). one gel was stained with fastain protein staining solution (yeastern biotech, taiwan). the other gel was transferred to the polyvinylidene difluoride (pvdf) membranes (millipore, burlington, ma, usa) with fast semi-dry transfer buffer (thermo, waltham, ma, usa) by using a yrdimes semi-dry transfer system (wealtec, new taipei city, taiwan). the membrane was blocked with the blocking buffer for min at • c. after three washes in pbst, each strip of the membrane was incubated with one appropriate horseradish peroxidase (hrp)-conjugated antibody to detect mab at • c overnight and then at room temperature for h. peroxidase-conjugated goat anti-mouse iga/igg/igm, h/l chain (novus, saint charles, mo, usa) at : , hrp-conjugated goat anti-mouse iga (southern biotechnology, birmingham, al, usa) at : , hrp-conjugated goat anti-mouse lambda light chain (novus, saint charles, mo, usa) at : , hrp-conjugated goat anti-mouse igm, µ chain (jackson immunoresearch, west grove, pa, usa) at : , and hrp-conjugated goat anti-mouse igg (subclasses + a + b + , fcγ, jackson immunoresearch, west grove, pa, usa) at : were used in this assay. then, they were washed three times with pbst, min each wash. each strip was incubated with , -diaminobenzidine (dab, horseradish peroxidase substrate, millipore, burlington, ma, usa) buffer for min. protein molecular weight markers were included in each western blot analysis. after revelation with the dab substrate, the target protein was visualized and calculated the molecular weight by using molecular weight markers. homemade immunofluorescence assay (ifa) slides were performed as previously described [ ] . pcv -infected peripheral blood mononuclear cells (pbmcs) were collected from the pcv -infected conventional piglet. the pig serum was detected as seropositive for pcv by anti-capsid peptide (c ) specific immunoassay [ ] . the whole blood sample was confirmed as positive for pcv antigen by using the ingezim pcv das kit (ingenasa, madrid, spain). briefly, pbmcs were isolated from ethylenediaminetetraacetic acid (edta)-treated whole blood samples by ficoll-paque plus (ge healthcare bio-sciences, uppsala, sweden) and according to the manufacturer's instructions. then, mononuclear cells were drawn from the interface of the ficoll-paque plus-containing tube and resuspended in ml pbs for washing and centrifugation ( × g for min at • c). after washing three times with pbs, the cell pellet was resuspended in . ml of fetal bovine serum. an aliquot ( µl) of cell suspension was loaded onto a glass slide for the cell smear. after the cell smear, the slides were air-dried at room temperature. thereby, the pbmcs were stuck on slides. these slides were fixed with pbs containing % paraformaldehyde at • c for min and then washed three times with pbs. following that, the slides were soaked in pbs containing . % triton x- at • c for min. subsequently, slides were washed with pbs and prepared for ifa. two anti-capsid (the peptide c ) mabs ( h and b ) [ ] , one anti-pcv mab a (ingenasa, madrid, spain), one anti-p protein (wt-p ) rabbit polyclonal antibody (bioss, woburn, ma, usa), the homemade anti-orf peptide (n ) mab ( d ), and rabbit antisera (as the method mentioned above) were used in this study. the anti-capsid peptide mabs ( h and b ) recognized native capsid proteins and reacted with core peptides (p , kdpplnp ); however, the mab h bound the three minimal linear epitopes (dpplnp, dpplnpk, and lkdpplkp) and the mab b bound two minimal linear epitopes (kdpplnp and kdpplnpk). these mabs produced a variable definite staining pattern in pbmcs by ifa [ ] . the pcv -infected pbmcs slides were incubated with a : dilution of rabbit antiserum and a : dilution of mab (ascites). after incubation at • c for h, the slides were gently rinsed briefly in pbs and then soaked for min in pbs at • c. the slides were then incubated at • c with tritc-conjugated goat anti-mouse igg (subclasses + a + b + , fcγ) at a : dilution and fitc-conjugated goat anti-rabbit igg (h + l) (all from jackson immunoresearch, west grove, pa, usa, and each minimal antibody cross-reaction to other animal's serum protein) at a : dilution. after min of incubation, the slides were washed with pbs and then incubated with , -diamidino- -phenylindole, dihydrochloride (dapi, aat bioquest, sunnyvale, ca, usa) at a : dilution in pbs at room temperature for min. the slides were mounted under % glycerol and observed with an olympus bx fluorescence microscope and spot flex camera (diagnostic instrument, model . mp, usa). terminal deoxynucleotidyl transferase dutp nick end labeling (tunel) assay was performed according to the manufacturer's instructions (the cell meter™ tunel apoptosis assay kit, aat bioquest ® , inc., sunnyvale, ca, usa) to detect excessive dna breakage in the pbmc. briefly, the pcv -infected pbmcs slides were incubated with the tf -dutp/reaction buffer. after min of incubation, the slides were washed with pbs. they were then incubated with hoechst solution at room temperature for min. the slides were mounted under % glycerol and observed with an olympus bx fluorescence microscope and spot flex camera. blood samples were collected from the pcv -infected herd with the following vaccination at two weeks of age with a pcv vaccine ( ml, porcilis ® pcv, msd animal health) in the conventional pig farm with the farrow-to-finish operation. pigs were selected from animals of different ages ( , , , , , , , , , and ≥ weeks of age) at one time. then, four samples were randomly chosen per age group. blood was collected in the vacutainer ® edta tubes (becton and dickinson bv) and sored at − • c. all whole-blood samples had been treated by the freeze-thaw process twice before the test. the capsid protein was detected from µl of whole-blood sample by using the ingezim pcv das kit (ingenasa, madrid, spain) according to the manufacturer's instructions, and the absorbance of each well was measured at nm. meanwhile, the homemade antigen-elisa was conducted in this study for detecting the orf protein in whole-blood specimens. it was based on the double-antibody-sandwich principle. ninety-six-well maxisorp plates were coated with µl/ml ( µl per well) of the trapping sera (rabbit anti-orf peptide) in bicarbonate buffer. the plates were incubated at • c overnight and washed three times with pbst. then, the plates were blocked with viruses , , of the blocking buffer at • c for min. after the plates were washed, µl of pbst and µl of whole-blood samples were mixed and added to the well. the plates were incubated at • c for h and then washed in pbst. then, µl/well of mab d at µg/ml were added to the plates, and the plates were incubated at • c for h. after the plates were washed, µl/well of peroxidase-conjugated goat anti-mouse igg fcγ at a : dilution were added, and the plates were incubated at • c for h. after washing, abts buffer was added to each well. following a -min incubation, the od was measured at nm. . . detection of the specific antibodies against the capsid protein or the orf protein in plasm samples from the pcv -infected herd with a pcv vaccine plasm samples were separated from the aforementioned fresh whole-blood samples and sored at− • c. the antibody titers were measured by ielisa [ ] (performed as described previously for pig anti-peptide elisa, with the exception that the capsid protein (circoflex ® , boehringer ingelheim) or peptide p was used as the coating antigen, and pig plasm at a : dilution was used). the subsequent procedures of blocking, pig antibody, washing, secondary antibody, substrate, and the od reading, were the same as described previously. the data were analyzed by using one-way analysis of variance (anova) and tukey's studentized range (hsd, honestly significant difference) multiple comparisons test using the sas enterprise guide . ® software (sas institute inc., cary, nc, usa). a p-value < . was considered significant. the orf peptide n or vlp of pcv was capable of inducing each specific antibody. the antibody titer of each post-immune serum (two weeks after the fourth booster) was detected at the dilution : ( figure s ). the binding of the rabbit antiserum to the virus was confirmed by using an immunofluorescence assay (ifa) ( figure s ). although pre-immune serum had a low background (od value less than . ) at a dilution of : , the pre-immune serum did not bind the virus as detected by ifa (data not shown). the author checked for the cross-reactivity of post-immune antiserum to other antigens. the result shows they have a minor cross-reaction at a dilution of : ( figure s ), but the od value of cross-reaction has not significantly higher than that of the negative control serum (p ≥ . ) ( figure s ). following the fusions, hybridoma supernatants were screened for the presence of anti-orf (peptide n ) specific antibodies by ielisa. among them, hybridomas supernatants reacted with the peptide n at the first screening (data not shown). after repeatedly subcloning by the limiting dilution and selection, two stable hybridomas secreting anti-orf mabs ( d and d ) were obtained. the hybridoma produced igg mab ( d ) with kappa-light chains. the other produced mab ( d ) with a lambda-light chain ( figure a) ; however, no heavy chain was detected in the supernatant of clone d by sba clonotyping system/hrp assay. the supernatant of a hybridoma (clone d ) was separated directly by bolttm bis-tris plus gel to confirm the heavy chain of this mab. the gel showed a broad staining region within - kda. the author speculated that the supernatant of a hybridoma contains abundant serum proteins from the fetal bovine serum-containing medium. that is to say, the supernatant of a hybridoma (clone d ) contains a little mab d (light chain). further, the western blotting was used to confirm the heavy chain of mab d and its molecular weight. the hrp-conjugated antibodies (goat anti-mouse iga/igg/igm, h/l chain, and goat anti-mouse lambda light chain) gave specific reactions with the supernatant. two bands were observed at about (between and ) and kda, respectively ( figure ). the lower band ( kda) implied that some degradation of this mab. however, other anti-ig antibodies (goat anti-mouse ig a (α chain), goat anti-mouse igm (µ chain), and goat anti-mouse igg (fcγ)) gave negative reactions with the supernatant. this study confirmed mab d contains a lambda light chain, but it does not include any intact heavy chain. since the molecular weight of the light chain was kda, it may be assumed that the mab d might contain a short fragment of a protein. a peptide scan analysis was performed to determine the binding site for each mab ( figure ). the mab d bound to two linear peptides (peptide n and p ). the peptide n was appended with a cysteine residue at the n-terminal of the orf peptide (residues - , p ). both of them contained the sequence of the orf protein of pcv b between residues and . however, the non-igg mab ( d ) not only reacted with peptide n but also showed minor reactions against other peptides, which compared with the mab d in the elisa test ( figure ) . interestingly, the anti-peptide n mouse serum reacted all test peptides and bound firmly to six linear peptides (peptide n , p , p , p , and p ). notably, anti-n poly antibodies reacted with these peptides (expect p ) which contained the common core peptide (p and tllhfpahfq ). nevertheless, this study did not get any mab which bound firmly to the peptide p . viruses , , x for peer review of a peptide scan analysis was performed to determine the binding site for each mab ( figure ). the mab d bound to two linear peptides (peptide n and p ). the peptide n was appended with a cysteine residue at the n-terminal of the orf peptide (residues - , p ). both of them contained the sequence of the orf protein of pcv b between residues and . however, the non-igg mab ( d ) not only reacted with peptide n but also showed minor reactions against other peptides, which compared with the mab d in the elisa test ( figure ) . interestingly, the antipeptide n mouse serum reacted all test peptides and bound firmly to six linear peptides (peptide n , p , p , p , and p ). notably, anti-n poly antibodies reacted with these peptides (expect p ) which contained the common core peptide (p and tllhfpahfq ). nevertheless, this study did not get any mab which bound firmly to the peptide p . anti-orf mabs ( d and d ) bound the linear peptide spanning from residues to . the antipeptide n mouse serum was used as the positive control, and the anti-orf (peptide c ) mab ( h ) served as the negative control. the mab binding was tested by using an ielisa. peptides contained the sequence of the orf protein between residues and , associated -mer peptides, truncated derivatives, and the control (peptide c ) (as shown in table ). the experiments were performed three times. data represent the mean ± standard error (se). for these homemade pbmcs slides, all slides were made from the same batch and contained the same proportion of pcv -infected cells. all wells contained both negative and positive cells (pcv infected cells) confirmed by ifa, as previously noted in a similar study [ ] . in this study, pcv viral protein (orf protein and capsid protein) locations were initially assessed by ifa. the previous study indicated that the mab h bound the three minimal linear epitopes (dpplnp, dpplnpk, and lkdpplkp), which were located at carboxyl-terminus (cterminus) of the capsid proteins of pcv b, pcv d, and pcv a, respectively [ ] . likewise, the mab b bound two minimal linear epitopes (kdpplnp and kdpplnpk), which were located at cterminus of the capsid proteins of pcv b, and pcv d, respectively [ ] . first, the percentage was determined by counting the number of positive cells per nuclei in an ifa slide. the data show that - % of pbmcs were positive for anti-capsid peptide mab ( h ) staining (pcv a-, pcv b-, the anti-peptide n mouse serum was used as the positive control, and the anti-orf (peptide c ) mab ( h ) served as the negative control. the mab binding was tested by using an ielisa. peptides contained the sequence of the orf protein between residues and , associated -mer peptides, truncated derivatives, and the control (peptide c ) (as shown in table ). the experiments were performed three times. data represent the mean ± standard error (se). for these homemade pbmcs slides, all slides were made from the same batch and contained the same proportion of pcv -infected cells. all wells contained both negative and positive cells (pcv -infected cells) confirmed by ifa, as previously noted in a similar study [ ] . in this study, pcv viral protein (orf protein and capsid protein) locations were initially assessed by ifa. the previous study indicated that the mab h bound the three minimal linear epitopes (dpplnp, dpplnpk, and lkdpplkp), which were located at carboxyl-terminus (cterminus) of the capsid proteins of pcv b, pcv d, and pcv a, respectively [ ] . likewise, the mab b bound two minimal linear epitopes (kdpplnp and kdpplnpk), which were located at cterminus of the capsid proteins of pcv b, and pcv d, respectively [ ] . of pbmcs were positive for anti-orf mab ( d ) staining ( figure s ). it might be due to different abundance of the protein, the degradation of the protein, different pcv strains, conformational epitopes, or linear epitopes, which interacted with the antibody. interestingly, the anti-vlp of pcv rabbit serum produced cytoplasmic staining in pcv -infected pbmcs ( figure b,f) . when pcv -infected pbmcs were co-stained with anti-vlp of pcv rabbit serum and anti-capsid peptide mabs ( b ), some vlp/capsid peptide dual-positive cells were shown ( figure d ). the overlap between the anti-vlp rabbit serum staining and the anti-orf mab ( d ) staining is presented in figure h , and the anti-orf polyclonal antibody and anti-capsid mab ( a ) overlapped the same in figure p ; however, the polyclonal antibody staining was brighter than the mab staining. this result shows orf protein overlaps the capsid protein (or vlp) of pcv ( figure h,p) . both the capsid proteins and the orf protein were detected mainly in the cytoplasm. according to the previous study, the n-terminal half of orf protein (residues - ) was discovered in the cytoplasm, and the c-terminal half of orf protein (residues - ) was majorly located in the nucleus [ ] . the medial region of orf protein (residues - ) might be primarily located in the cytoplasm in this study. of pcv ( figure h,p) . both the capsid proteins and the orf protein were detected mainly in the cytoplasm. according to the previous study, the n-terminal half of orf protein (residues - ) was discovered in the cytoplasm, and the c-terminal half of orf protein (residues - ) was majorly located in the nucleus [ ] . the medial region of orf protein (residues - ) might be primarily located in the cytoplasm in this study. a previous study suggested that orf protein led to increased p protein levels and apoptosis of the infected cells [ ] . to identify the subcellular location of the orf protein in pcv -infected pbmcs, anti-capsid, anti-orf , and anti-p antibodies were used. the p protein also colocalized with capsid peptide (figure l ), and the orf peptide ( figure x ). the p protein was mainly distributed in the cytoplasm and around the nucleus ( figure j,v) . the orf protein, as detected with mab d , seemed to colocalize quite nicely with the p protein ( figure x ). the p /orf dual-positive cell presented a segmented nucleus ( figure w,y) . the nuclei of negative cells were round or oval, and nuclear segmentation was very rare in negative cells ( figure y ). the apoptosis in pcv -infected pbmcs was observed by using a terminal deoxynucleotidyl transferase dutp nick end labeling (tunel) assay (figures and s ) . the data show that - % pbmcs were positive for tunel staining. it is very similar to the percentage of anti-orf mab ( d ) staining. however, these similar percentages are not sufficient proof to show that the orf protein of pcv causes dna damage. a more careful analysis would be needed to reveal whether pcv infection causes dna damage. unfortunately, it was not easy to label both the orf protein and the signals of dna breakage simultaneously by ifa and tunel assay (data not shown). further research would be needed to determine whether the orf protein colocalizes with the signals of tunel. a previous study suggested that orf protein led to increased p protein levels and apoptosis of the infected cells [ ] . to identify the subcellular location of the orf protein in pcv -infected pbmcs, anti-capsid, anti-orf , and anti-p antibodies were used. the p protein also colocalized with capsid peptide (figure l ), and the orf peptide ( figure x ). the p protein was mainly distributed in the cytoplasm and around the nucleus ( figure j ,v). the orf protein, as detected with mab d , seemed to colocalize quite nicely with the p protein ( figure x ). the p /orf dual-positive cell presented a segmented nucleus ( figure w,y) . the nuclei of negative cells were round or oval, and nuclear segmentation was very rare in negative cells ( figure y ). the apoptosis in pcv -infected pbmcs was observed by using a terminal deoxynucleotidyl transferase dutp nick end labeling (tunel) assay ( figure and figure s ). the data show that - % pbmcs were positive for tunel staining. it is very similar to the percentage of anti-orf mab ( d ) staining. however, these similar percentages are not sufficient proof to show that the orf protein of pcv causes dna damage. a more careful analysis would be needed to reveal whether pcv infection causes dna damage. unfortunately, it was not easy to label both the orf protein and the signals of dna breakage simultaneously by ifa and tunel assay (data not shown). further research would be needed to determine whether the orf protein colocalizes with the signals of tunel. b (a); mab h (i); and mab a (m)), and the orf peptide of pcv (mab d (e,u)). staining with spf mouse serum was used as the negative control (q). (b,f,j,n,v) staining with rabbit antiserum (green) was used to identify the vlp of pcv (b,f), the p (j,v), and the orf peptide of pcv (n). (c,g,k,o,s,w,y) nuclei were stained with dapi (blue). the arrow points to the positive staining cell (z) and its irregular shaped nucleus (y), and the enlarged image in the sixth row (x,w). (d,h,l,p,t,x,z) the merged images are also shown. the scale bar is μm. the viral proteins were detected in the whole-blood samples using antigen-elisa techniques. the cut-off value in the commercial pcv capsid elisa and the orf protein elisa were . and . , respectively. the capsid protein of pcv was mainly detected in pigs at , , , and ≥ weeks of age in this study farm ( figure a ). noticeably, these groups (at , , , and ≥ weeks of age) showed more elevated standard error of od value than other groups in this statistical analysis. this result implies that pigs at that age would be susceptible to pcv infection. interestingly, non-vaccinated piglets showed the highest od value of capsid protein at one week of age. after piglets were inoculated with the pcv vaccine at two weeks of age, the od value of capsid protein gradually decreased until weeks of age. however, the od value of capsid protein was raised at ≥ weeks of age (two gilts and two sows). there is evidence to suggest that the vaccine could not protect pigs against pcv infection at weeks postimmunization. the orf protein of pcv showed the same result except for od values of the orf protein were lower than the capsid protein. the correlation coefficient (r) was . . the mean od values of the orf protein were significantly higher for suckers at one week of age than postweaning ones at - weeks of age (p < . ; figure b ). there is an apparent probability that the amount of orf protein was fewer than the amount of capsid protein in pcv -infected blood from suckers at one week of age. protect pigs against pcv infection at weeks postimmunization. the orf protein of pcv showed the same result except for od values of the orf protein were lower than the capsid protein. the correlation coefficient (r) was . . the mean od values of the orf protein were significantly higher for suckers at one week of age than postweaning ones at - weeks of age (p < . ; figure b ). there is an apparent probability that the amount of orf protein was fewer than the amount of capsid protein in pcv -infected blood from suckers at one week of age. figure . assessment of the capsid protein and the orf protein in the whole-blood samples from pigs of different age groups. this study used the commercial capsid kit (a) and the orf protein elisa (b) to measure each specimen, and the absorbance was measured at or nm, respectively. the peptide n and psbt were used as a positive (pos) control and a negative (neg) control, respectively, in the orf protein elisa. the data represent the mean ± se. treatments with different letters have statistically significant differences (p < . ). figure . assessment of the capsid protein and the orf protein in the whole-blood samples from pigs of different age groups. this study used the commercial capsid kit (a) and the orf protein elisa (b) to measure each specimen, and the absorbance was measured at or nm, respectively. the peptide n and psbt were used as a positive (pos) control and a negative (neg) control, respectively, in the orf protein elisa. the data represent the mean ± se. treatments with different letters have statistically significant differences (p < . ). the anti-viral protein-specific antibodies were detected in the plasm samples using ielisa techniques. the cut-off value of in the anti-capsid igg elisa and the anti-orf peptide (p ) igg elisa were . and . , respectively. non-vaccinated suckers showed the highest mean od value of anti-capsid igg and anti-orf igg at one week of age. it may mean that non-vaccinated suckers absorbed large amounts of maternally derived antibodies after birth. notably, the non-vaccinated group showed the most elevated standard error of od value (anti-capsid igg) in this statistical analysis. a gradual decrease of the mean od value of anti-capsid igg for piglets from the suckling period (one week of age) to the weaning period (six weeks of age) was observed. from to weeks old, the mean od value of anti-capsid igg increased significantly (p < . ; figure ). the mean od value of anti-capsid igg also increased significantly for pigs from to weeks old (p < . ) and reached a plateau at weeks old. the same trend was observed in the result of the anti-orf igg elisa test, except for od values of anti-orf igg were lower than anti-capsid igg. notwithstanding, it is worth mentioning that there was a sharp decrease of the mean od value of anti-orf igg for suckers from one to three weeks old. the mean od value of anti-orf igg increased significantly for pigs from six to nine weeks old (p < . ; figure ) and reached a plateau at nine weeks old. there was a weak correlation (r = . ) between the od value of anti-capsid igg and the od value of anti-orf igg in the plasm samples. it points to the probability that the amount of anti-capsid igg reflects humoral immunity in pcv -infected pigs before and after the injection of a pcv vaccine. besides, the amount of anti-orf igg reflects humoral immunity in pigs, which were caused by pcv infecting only. the mean od value of anti-capsid igg also increased significantly for pigs from to weeks old (p < . ) and reached a plateau at weeks old. the same trend was observed in the result of the anti-orf igg elisa test, except for od values of anti-orf igg were lower than anti-capsid igg. notwithstanding, it is worth mentioning that there was a sharp decrease of the mean od value of anti-orf igg for suckers from one to three weeks old. the mean od value of anti-orf igg increased significantly for pigs from six to nine weeks old (p < . ; figure ) and reached a plateau at nine weeks old. there was a weak correlation (r = . ) between the od value of anti-capsid igg and the od value of anti-orf igg in the plasm samples. it points to the probability that the amount of anti-capsid igg reflects humoral immunity in pcv -infected pigs before and after the injection of a pcv vaccine. besides, the amount of anti-orf igg reflects humoral immunity in pigs, which were caused by pcv infecting only. figure . assessment of the anti-capsid igg and the anti-orf peptide (p ) in the plasm samples from pigs of different age groups. the experiments were repeated three times, and data represent the mean ± se. treatments with different letters at the same kind of antibody assay have statistically significant differences (p < . ). according to a previous study, the orf peptide (residues - ) of pcv interacts with the p -binding domain of ppirh [ ] . the peptide n (residues - ) of the orf protein was synthesized and used in this study. subsequently, anti-orf mabs were generated and characterized in this study. this work created one hybridoma producing the mab d with igg , and the other producing the defective-ig mab ( d ) with a lambda-light chain. the mab d (igg ) bound to the linear peptide n (c hndvyislpi tllhfpahfq kfsqpaeisdkr ) and p ( hndvyislpi tllhfpahfq kfsqpaeisdkr ). however, the defective-ig mab d minorly reacted with all truncated peptides. this finding is similar to the previous study [ ] . the defective-ig mabs likely have broad binding, moderate specificity, and low affinity with the associated peptide. there were concerns about this defective-ig mab since this seems to be very rare. further, the molecular weight of the mab d was determined by western blotting. two bands were observed figure . assessment of the anti-capsid igg and the anti-orf peptide (p ) in the plasm samples from pigs of different age groups. the experiments were repeated three times, and data represent the mean ± se. treatments with different letters at the same kind of antibody assay have statistically significant differences (p < . ). according to a previous study, the orf peptide (residues - ) of pcv interacts with the p -binding domain of ppirh [ ] . the peptide n (residues - ) of the orf protein was synthesized and used in this study. subsequently, anti-orf mabs were generated and characterized in this study. this work created one hybridoma producing the mab d with igg , and the other producing the defective-ig mab ( d ) with a lambda-light chain. the mab d (igg ) bound to the linear peptide n (c hndvyislpi tllhfpahfq kfsqpaeisdkr ) and p ( hndvyislpi tllhfpahfq kfsqpaeisdkr ). however, the defective-ig mab d minorly reacted with all truncated peptides. this finding is similar to the previous study [ ] . the defective-ig mabs likely have broad binding, moderate specificity, and low affinity with the associated peptide. there were concerns about this defective-ig mab since this seems to be very rare. further, the molecular weight of the mab d was determined by western blotting. two bands were observed at about and kda, respectively. the small one ( kda) implied some degradation of this mab. since the molecular weight of the light chain was kda, the author suggested that the mab d might contain a short fragment of a protein. interestingly, the author also tested the molecular weight of another defective-ig mab c [ ] (which against peptide c ), and two molecules at about and kda were again shown (data not shown). according to previous documents, human patients with abnormal serum immunoglobulin-free light chain production should only be due to monoclonal plasmaproliferative disorders (included multiple myeloma, light chain myelomas, and light chain amyloidosis) [ ] [ ] [ ] . arguably, the defective-ig mab ( d ) might be related to the phenotype of splenocytes, which fused with the myeloma cell. further research should be carried out using advanced techniques (such as liquid chromatography with mass spectrometry and spectroscopic techniques) to investigate the constitution of the mab d . based on previous reports, lymphocyte depletion and apoptosis in lymphoid tissues are histological hallmarks in pcv -infected pigs [ ] [ ] [ ] . interestingly, these histopathological lesions are similar to marek's disease and african swine fever [ , ] . more recent evidence shows that these viruses caused an early cytolytic infection in lymphocytes by apoptosis [ ] [ ] [ ] . therefore, this study hypothesized that lymphocyte and pbmcs lineage cells should be the primary target cells for pcv infection. to the best of the author's knowledge, the study of the orf protein by indirect immunofluorescence assay in pcv -infected pbmcs has never been performed, and only a single article mentions the transient expression of the orf in porcine pbmcs and detecting apoptosis with a tunel assay [ ] . therefore, the author explored the relation of the capsid, p protein, and the orf protein by ifa and shed light on p protein (the marker of apoptosis) accompanied the peptide (residues - , p ) of the orf protein in pcv -infected pbmcs. it is worth highlighting that - % of pbmcs were positive for some antibodies staining, including anti-orf mab ( d ) staining, anti-p protein rabbit polyclonal antibody staining, and tunel assay ( figure s ). there is a strong probability that - % of pbmcs were undergoing the process of apoptosis. however, the data revealed that the variable percentage ( - %) of pbmcs was positive for anti-capsid mabs ( h , b , and a ) staining and anti-vlp rabbit serum staining. it was because these mabs or antiserum recognized different epitopes of proteins presented in pcv -infected cells. that means that the variety of interaction between antigen-binding sites of antibodies and epitopes, which includes linear form epitopes, conformational epitopes, degradation, and different pcv strains. curiously, this study ( figure s ) showed that the percentage of pcv -infected pbmcs (by mab h staining) was about -fold the rate of orf -positive pbmcs (by mab d staining) or the percentage of p -positive pbmcs. it seems likely that not all pcv -infected cells contained orf proteins or p protein. this finding concurs with the previous study, which indicated that the apoptosis statistically decreases in the initial pcv -infected pigs [ ] . on the other hand, this study showed the orf protein colocalized with the capsid protein marker of pcv . moreover, the p protein was also colocalized with the orf protein. remarkably, the p /orf dual-positive cell presented a segmented nucleus ( figure w ). however, these p /orf dual-positive cells were very few in these samples. this finding will be confirmed by flow cytometry in the future. although a previous study indicated that pcv -infected pigs had a reduction of lymphocytes in the peripheral blood [ ] , the mechanism of lymphocyte lysis was still unclear. this study confirmed that the orf protein was related to the p protein (apoptosis marker) in pcv -infected pbmcs, while other factors (proliferative activity [ ] , caspases and [ , ] , granzymes [ ] , or corticosteroids [ , ] ) causing lymphocyte lysis or depletion need to be considered. although the experiment on apoptosis in pcv -infected pbmcs was carried out by the tunel assay (figure ) , it did not clarify the relationship between the tunel result and the orf protein in pcv -infected cells. only the ifa data confirm previous reports that exogenous orf protein was related to the accumulation of p protein [ , ] , but more proof are needed to make sure the orf protein is a major factor leading to apoptosis in pcv -infected cells [ , , ] and depletion of lymphocytes. pcv is one of the most critical pathogens in modern swine production and causes endemic disease in pig farms [ ] . the commercial vaccines were confirmed protective in the field against pcv and mainly administered to suckers in herds with pcv infection [ , , [ ] [ ] [ ] . most researchers utilized the quantitative real-time pcr to detect pcv nucleic acid, and then they evaluated the efficacy of vaccination in regard to pcv viremia in pigs [ , , ] . however, little is known about the capsid protein or the orf protein in blood from the pcv -infected herd with vaccination. although the capsid antigen-elisa could detect capsid protein, this assay could not differentiate from native viral proteins or vaccine ones in blood. the orf protein-elisa, by contrast, only identified native orf proteins in pig blood since no commercial vaccine contains the orf protein of pcv . for these purposes, this study used the commercial capsid antigen-elisa and homemade orf protein-elisa (anti-n polyclonal antibodies and mab d based) to detect viral proteins in pig blood. this study found non-vaccinated suckers had the highest of pcv proteins in blood at one week of age. the author suggests that in these suckers it was caused by pcv infection, and these viruses majorly stemmed from the sow-to-newborn transmission [ ] . after inoculated with the pcv subunit vaccine, the capsid protein, and the orf protein were gradually decreased. however, these proteins were detected again in gilts and sows (≥ weeks of age). that means pcv viral proteins (the capsid protein or the orf protein) in gilts and sows were higher than that of pigs at - weeks of age. to put it another way, vaccinated-pigs reduced viremia compared to non-vaccinated suckers, and the immunity could not continue to protect vaccinated pigs after weeks post-vaccination. since pcv is highly resistant to environmental conditions, being able to remain in the farm environment and thus represent a risk for infection maintenance [ ] , even mass pcv vaccination (without implementing further farm management practices or biosafety measures) was not able to clear out pcv infection, and the virus became detectable again when vaccination was stopped [ , ] . another key point to mention is that pcv -specific antibodies response play roles in the pcv -infected herd. it is worth noting that there were two different antibody profiles in this pcv -infected herd with the pcv vaccine. the data show that pcv -infected herd had a higher od value of anti-capsid igg at age , , and ≥ weeks, compared with - weeks. in addition, pcv -infected herd had a higher od value of anti-orf igg at age and ≥ weeks, compared with - weeks. according to previous studies, newborn suckers received colostral antibodies from seropositive sows and showed various levels of maternally derived antibodies [ , ] . these maternally derived antibodies might interfere with the viral protein-specific antibodies response while suckers immunized with the pcv subunit vaccine. it may be assumed that the artificial capsid protein (pcv vaccine) elicited anti-capsid igg producing while maternally derived antibodies progressed to degrade. the total level of anti-capsid igg decreased slowly but still maintained a high level of anti-capsid igg during the age of - weeks. this phenomenon could neutralize the pcv virus and prevent piglets from suffering pcv infection. in contrast with dual-source capsid proteins (pcv virus and subunit vaccine), the piglet's immunity response to the orf protein was only caused by pcv virus infection. the total level of anti-orf igg decreased quickly at - weeks of age. then, it increased significantly at - weeks of age. the weak correlation (r = . ) between the anti-capsid igg and the anti-orf igg is worth mentioning. it might reflect two kinds of antibody responses in virus infection and post-immunization. these data interpret the interaction of the viral protein with the host immune system. according to previous reports, orf encodes a -kda protein that is involved in viral capsid formation and contributed to self-assembled virus-like particles (vlp) [ ] . the monomer structures were assembled into a vlp model consisting of capsid subunits to form an icosahedron [ ] . vaccination with this vlp (subunit vaccine) induced both humoral and cell-mediated responses against pcv [ , ] . however, orf encodes an . -kda protein [ ] that is the non-structural protein and involved in pcv -induced apoptosis [ , , ] . until now, no report mentions that the orf protein was solely used as the vaccine against pcv , reducing viremia or pathological lesions in pcv -infected herds. these differences contributed to their application in different ways [ , ] . in this study, the percentage of orf -positive pbmcs was significantly lower than capsid-positive pbmcs. similarly, the antigen-elisa result shows that the amount of orf protein was less than the amount of capsid protein in pcv -infected blood from suckers at one week of age. that might be because mab d only recognizes the orf protein (peptide p ) for pcv b strain (genbank: aac . ), and it causes low detection. however, pcv d (pcv b mutant strain) and pcv b are still predominant genotypes in the field farms. besides the similar percentages of pbmcs were positive for these antibodies (mab d and anti-p protein rabbit polyclonal antibody) staining or tunel assay. in contrast to the orf protein, the detection of the capsid protein could be the variable result in pcv -infected pbmcs via different antibodies, due to different pcv strains [ , ] , conformational epitopes [ ] , or linear epitopes [ , ] . previous studies indicated that antibody binding residues were often on the exterior of the capsid of pcv [ , , , , , ] . although an epitope might be on the surface of the single capsid unit, it could bury in the vlp and be inaccessible to the antibody [ , ] . in general, these results confirm that mab d recognized the native orf protein in pcv -infected pbmcs. this study used mab d or its minimal linear epitope to design various immunoassays. these assays evaluate the viral load and the immune response of viral proteins stimuli. these may serve as surveillance tools for monitoring natural pcv infection in the herd. overall, the author generated anti-orf mabs against the orf peptide (residues - ) of pcv . this study confirmed the defective-ig mab ( d ) with a lambda-light chain, and its molecular weight was about kda. the mab d contained heavy chains (γ ) and kappa-light chains, and it bound to one minimal linear epitope (hndvyislpitllhfpahfq kfsqpaeisdkr). the data show that - % of pbmcs were positive for orf protein or p protein. otherwise, - % of pbmcs were positive for anti-capsid peptide mab ( h ) staining. this study confirmed the orf protein colocalized with the p protein in pcv -infected pbmcs. the author devised the orf protein-elisa (anti-orf antisera and mab d -based) to detect the orf protein in blood samples. the results show that the amount of orf protein was less than the amount of capsid protein in pcv -infected blood from suckers at one week of age. the correlation between the orf protein and the capsid protein is worth noting (r = . ). furthermore, the antibody level of anti-capsid igg and anti-orf igg could imply the immunity response of pig herd, but the sample size needs to be considered. the author declares no conflict of interest. a very small porcine virus with circular single-stranded dna characterization of novel circovirus dnas associated with wasting syndromes in pigs studies on epidemiology and pathogenicity of porcine circovirus experimental reproduction of severe wasting disease by co-infection of pigs with porcine circovirus and porcine parvovirus reproduction of lesions of postweaning multisystemic wasting syndrome by infection of conventional pigs with porcine circovirus type alone or in combination with porcine parvovirus enteritis associated with porcine circovirus in pigs porcine circovirus type associated disease: update on current terminology, clinical manifestations, pathogenesis, diagnosis, and intervention strategies porcine circovirus type (pcv ) infections: clinical signs, pathology and laboratory diagnosis porcine circovirus induces b lymphocyte depletion in pigs with wasting disease syndrome postweaning mulstisystemic wasting syndrome (pmws) in pigs. a review changes in peripheral blood leukocyte populations in pigs with natural postweaning multisystemic wasting syndrome (pmws) can porcine circovirus type (pcv ) infection be eradicated by mass vaccination? comparative efficacy of three commercial pcv vaccines in conventionally reared pigs efficacy of the porcine circovirus (pcv ) vaccination under field conditions evaluation of natural porcine circovirus type (pcv ) subclinical infection and seroconversion dynamics in piglets vaccinated at different ages a new emerging genotype subgroup within pcv- b dominates the pmws epizooty in switzerland epidemiology and horizontal transmission of porcine circovirus type (pcv ) genotypic shift of porcine circovirus type from pcv- a to pcv- b in spain from efficacy and future prospects of commercially available and experimental vaccines against porcine circovirus type (pcv ) genetic variation analysis of chinese strains of porcine circovirus type porcine circovirus type (pcv ): genetic variation and newly emerging genotypes in china genetic analysis of porcine circovirus type (pcv ) in queensland detection and genotyping of porcine circovirus (pcv- ) and detection of porcine circovirus (pcv- ) in sera from fattening pigs of different european countries genomic analysis of porcine circovirus type from southern china genetic diversity and prevalence of porcine circovirus type (pcv ) and type (pcv ) in the midwest of the usa during global molecular genetic analysis of porcine circovirus type (pcv ) sequences confirms the presence of four main pcv genotypes and reveals a rapid increase of pcv d nucleotide sequence of porcine circovirus associated with postweaning multisystemic wasting syndrome in pigs the essential and nonessential transcription units for viral protein synthesis and dna replication of porcine circovirus type molecular biology of porcine circovirus: analyses of gene expression and viral replication replacement of the replication factors of porcine circovirus (pcv) type with those of pcv type greatly enhances viral replication in vitro open reading frame of porcine circovirus type encodes a major capsid protein a porcine circovirus type (pcv ) mutant with amino acids in capsid protein showed more virulence in vivo, compared with classical pcv a/b strain characterization of a previously unidentified viral protein in porcine circovirus type -infected cells and its role in virus-induced apoptosis the orf protein of porcine circovirus type is involved in viral pathogenesis in vivo attenuation of porcine circovirus in spf piglets by abrogation of orf function the orf protein of porcine circovirus type interacts with porcine ubiquitin e ligase pirh and facilitates p expression in viral infection pirh , a p -induced ubiquitin-protein ligase, promotes p degradation pcdh , a novel p transcriptional target in regulating cell migration maf transcriptionally activates the mouse p promoter and causes a p -dependent cell death p and apoptosis: it's not just in the nucleus anymore porcine circovirus type orf protein competes with p in binding to pirh and mediates the deregulation of p homeostasis the orf protein of porcine circovirus type promotes secretion of il- and il- in porcine epithelial cells by facilitating proteasomal degradation of regulator of g protein signalling through physical interaction orf of porcine circovirus enhances the in vitro and in vivo spread of the of the virus t lymphocyte epitope mapping of porcine circovirus type . viral immunol characterization of specific antigenic epitopes and the nuclear export signal of the porcine circovirus orf protein analysis of putative orf gene within porcine circovirus type peptides mimicking viral proteins of porcine circovirus type were profiled by the spectrum of mouse anti-pcv antibodies versatile carboxyl-terminus of capsid protein of porcine circovirus type were recognized by monoclonal antibodies with pluripotency of binding the porcine circovirus type nonstructural protein orf induces apoptosis in porcine peripheral blood mononuclear cells light chain multiple myeloma: an evaluation of its biochemical investigations international myeloma working group guidelines for serum-free light chain analysis in multiple myeloma and related disorders treatment of al amyloidosis suppression of lymphocyte apoptosis in spleen by cxcl after porcine circovirus type infection and regulatory mechanism of cxcl expression in pigs apoptosis in lymphoid organs of pigs naturally infected by porcine circovirus type immunosuppression in postweaning multisystemic wasting syndrome affected pigs pathology of african swine fever: the role of monocyte-macrophage atrophy of primary lymphoid organs induced by marek's disease virus during early infection is associated with increased apoptosis, inhibition of cell proliferation and a severe b-lymphopenia transcriptomic characteristics of bronchoalveolar lavage fluid and peripheral blood mononuclear cells in covid- patients apoptosis and proliferative activity in lymph node reaction in postweaning multisystemic wasting syndrome (pmws). veterinary immunology and immunopathology porcine circovirus type (pcv ) causes apoptosis in experimentally inoculated balb/c mice granzyme b-activated p interacts with bcl- to promote cytotoxic lymphocyte-mediated apoptosis calcium dependence of glucocorticoid-induced lymphocytolysis stress-induced enhancement of activity of lymphocyte lysosomal enzymes in pigs of different stress-susceptibility studies on porcine circovirus type vaccination of -day-old piglets inactivated pcv one shot vaccine applied in -week-old piglets: improvement of production parameters and interaction with maternally derived immunity ten years of pcv vaccines and vaccination: is eradication a possibility? fetal infections and antibody profiles in pigs naturally infected with porcine circovirus type (pcv ) environmental distribution of porcine circovirus type (pcv ) in swine herds with natural infection the . -angstrom structure of porcine circovirus one dose of a porcine circovirus (pcv ) sub-unit vaccine administered to -week-old conventional piglets elicits cell-mediated immunity and significantly reduces pcv viremia in an experimental model immunogenicity of empty capsids of porcine circovius type produced in insect cells rüütel boudinot, s. porcine circovirus type orf protein induces apoptosis in melanoma cells antigenic subtyping and epitopes' competition analysis of porcine circovirus type using monoclonal antibodies identification of one critical amino acid that determines a conformational neutralizing epitope in the capsid protein of porcine circovirus type fine mapping of antigenic epitopes on capsid proteins of porcine circovirus, and antigenic phenotype of porcine circovirus type in vitro and in silico studies reveal capsid-mutant porcine circovirus b with novel cytopathogenic and structural characteristics structure analysis of capsid protein of porcine circovirus type from pigs with systemic disease antibody recognition of porcine circovirus type capsid protein epitopes after vaccination, infection, and disease viruses , key: cord- -bcw f b authors: nan title: abstracts: th ebsa european biophysics congress, august rd– th , budapest, hungary date: - - journal: eur biophys j doi: . /s - - -z sha: doc_id: cord_uid: bcw f b nan o- structure determination of dynamic macromolecular complexes by single particle cryo-em holger stark max-planck-institute for biophysical chemistry, goettingen, germany macromolecular complexes are at the heart of central regulatory processes of the cell including translation, transcription, splicing, rna processing, silencing, cell cycle regulation and repair of genes. detailed understanding of such processes at a molecular level requires structural insights into large macromolecular assemblies consisting of many components such as proteins, rna and dna. single-particle electron cryomicroscopy is a powerful method for threedimensional structure determination of macromolecular assemblies involved in these essential cellular processes. it is very often the only available technique to determine the d structure because of the challenges in purification of complexes in the amounts and quality required for x-ray crystallographic studies. in recent years it was shown in a number of publications that it is possible to obtain near-atomic resolution structures of large and rigid macromolecules such as icosahedral viruses. due to a number of methodological advances there are now also great perspectives for high-resolution single particle cryo-em studies of large and dynamic macromolecules. successful high-resolution structure determination of dynamic complexes requires new biochemical purification strategies and protocols as well as state of the art electron microscopes and high-performance computing. in the future cryo-em will thus be able to provide structures at near-atomic resolution and information about the dynamic behavior of macromolecules simultaneously. detection and rapid manipulation of phosphoinositides with engineered molecular tools tamas balla section on molecular signal transduction, program for developmental neuroscience, nichd, nih, bethesda, md , usa polyphosphoinositides (ppis) are ubiquitous lipid regulators of a variety of cellular processes serving as docking sites and conformational switches for a large number of signaling proteins. the localization and dynamic changes in ppis in live cells have been followed with the use of protein domain gfp chimeras. in this presentation we will show experimental systems that allow rapid manipulation of the levels of ppis in specific membrane compartments. we are also actively pursuing strategies that will allow us to map the distribution and possible functional diversity of the phosphatidylinositol (ptdins) pools within intact cells since they are the precursors of ppis. we will show our most recent progress in addressing this question: the use of a ptdins specific plc enzyme isolated from listeria monocytogenes together with a highly sensitive diacylglycerol sensor to determine the distribution and also to alter the level of ptdins in living cells. these studies reveal that a significant metabolically highly active ptdins pool exists associated with tiny mobile structures within the cytoplasm in addition to the known er and pm ptdins pools. we will show our most recent data on the consequences of ptdins depletion within the various ptdins pools on ppi production and on the morphology and functions of various organelles. the functionality of proteins is known to be intimately related to the motion of their constituents on the atomic/molecular level. the study of microscopic motion in complex matter is often reduced to the observation of some average mean square atomic displacement, a first, very partial characterization of the dynamics. the marked crossover in the temperature dependence of such quantities in hydrated proteins around k, the so called ''dynamic transition'' has been originally observed a quarter of century ago. the origin, nature and the key characteristics of the atomic motions behind this remarkable evolution of the mean square displacement in proteins remained controversial over the past decades. recent analysis of mö ssbauer, dielectric relaxation and neutron scattering spectroscopic data provide unambiguous evidence that this phenomenon is caused by the temperature dependence of a relaxation process spread over several orders of magnitude in the time domain, similarly to the b relaxation process observed in glasses. the review and critical analysis of the available data highlights the inherent ambiguities of commonly used data fitting approaches. emerging evidence from model independent observations tend to exclude some of the proposed mechanisms. microbial rhodopsins: light-gated ion channels and pumps as optogenetic tools in neuro-and cell biology e. bamberg, c. bamann, r.e. dempski, k. feldbauer, s. kleinlogel, u. terpitz, p. wood department of biophysical chemistry, max-planck-institute of biophysics, frankfurt, germany microbial rhodopsins are widely used in these days as optogenetic tools in neuro and cell biology. we were able to show that rhodopsins from the unicellar alga chlamydomonas reinhardtii with the transmembrane helix motif act as light-gated ion channels, which we named channelrhodopsins(chr , ). together with the light driven clpump halorhodopsin chr is used for the non-invasive manipulation of excitable cells and living animals by light with high temporal resolution and more important with extremely high spatial resolution the functional and structural description of this new class of ion channels is given (electrophysiology, noise analysis,flash photolysis and d crystallography). new tools with increased spatial resolution and extremely enhanced light sensitivity in neurons are presented. a perspective for basic neurobiology and for medical applications is given. extracellular signals consists of the induction of specific gene expression patterns and the re-organization in space and time of stereo-specific macromolecular interactions that endow the cell with its specific morphology. we develop quantitative experimental and computational approaches to derive and conceptualize physical principles that underlie these dynamics of signal processing and cellular organization. we have an experimental emphasis on functional microscopic imaging approaches at multiple resolutions to study the localization and dynamics of protein reactions/ interactions, maintaining the inherent spatial organization of the cell. we have a strong recursion between computation of molecular dynamics in realistic cell geometries as sampled by microscopy, and experiments that reveal the dynamic properties of networks in living cells. we investigate the cellular topography of activities that transmit signals from receptors at the cell surface. here we ask, how spatial partitioning of intracellular signalling activities is achieved by the causality structure of the signalling network, and how this partitioning affects signal response. this entails the experimental elucidation of connections between reactions and the determination of enzyme kinetic parameters in living cells. o- molecular photovoltaics mimic photosynthesis michael grä tzel laboratory of photonics and interfaces, institute of chemical science and engineering, station , ecole polytechnique fé dé rale, ch- lausanne, switzerland e-mail: michael.graetzel@epfl.ch the field of photovoltaic cells has been dominated so far by solid state p-n junction devices made e.g. of crystalline or amorphous silicon, profiting from the experience and material availability of the semiconductor industry. however, there is an increasing awareness of the possible advantages of devices referred to as ''bulk'' junctions due to their interconnected three-dimensional structure. their embodiment departs completely from the conventional flat p-n junction solid-state cells, replacing them by interpenetrating networks. this lecture focuses on dye sensitized mesoscopic solar cells (dscs), which have been developed in our laboratory. imitating natural photosynthesis, this cell is the only photovoltaic device that uses a molecular chromophore to generate electric charges from sunlight and that accomplishes the separation of the optical absorption from the charge separation and carrier transport processes. it does so by associating the molecular dye with a film constituted of tiny particles of the white pigment titanium dioxide. the dsc has made phenomenal progress, present conversion efficiencies being over percent for single junction and percent for tandem cells, rendering the dsc a credible alternative to conventional p-n junction devices. molecularly single-molecule imaging and tracking techniques that are applicable to living cells are revolutionizing our understanding of the plasma membrane dynamics, structure, and signal transduction functions. the plasma membrane is considered the quasi- d non-ideal fluid that is associated with the actinbased membrane-skeleton meshwork, and its functions are likely made possible by the mechanisms based on such a unique dynamic structure, which i call membrane mechanisms. my group is largely responsible for advancing highspeed single molecule tracking, and based on the observations made by this approach, i propose a hierarchical architecture of three-tiered meso-scale ( - nm) domains as fundamental organizing principles of the plasma membrane. the three tiers i propose are the following. [tier ] - nm compartments made by partitioning the entire plasma membrane by the membrane-associated actinbased meshwork (membrane skeleton: fences) and its associated transmembrane proteins (pickets). since the entire plasma membrane is partitioned by these structures, and the membrane skeleton provides important platforms for the molecular interactions and pools, membrane compartments are the most basic tier for the plasma membrane organization. [tier ] meta-stable - nm raft domains that can be turned into stable * - -nm domains (receptor-cluster rafts), based on ligand-induced homo-dimers of glycosylphosphatidylinositol (gpi)-anchored receptors (coupling with [tier ]) and facilitated by raft-lipid interactions. [tier ] protein complexes of various sizes ( - nm) and lifetimes. i will also talk about how domains of tiers and are coupled to the membrane partitioning (tier ). the concept of the three-tiered domain architecture of the plasma membrane and the cooperative interactions of different tiers provides a good perspective for understanding the mechanisms for signal transduction and many other functions of the plasma membrane. introduction: in the present study we investigate the effects of electromagnetic fields (emf) on the binding of norfloxacin (nrf) to human serum albumin (hsa) by fluorescence, three-dimensional fluorescence and uv-visible spectroscopic approaches. hsa is the most abundant protein in human blood plasma which works as a carrier that transports different materials in the body. nrf is used to treat variety of bacterial infections. it works by stopping the bacterial growth. methods: hsa, nrf and potassium phosphate buffer were purchased from sigma. fluorescence spectrofluorometer, uv-vis spectrophotometer, three-dimensional fluorescence and a home-built emf generator apparatuses were used. results: results obtained from this study indicated that nrf has a strong ability to quench hsa in nm. in addition, there was a slight blue shift, which suggested that the microenvironment of protein became more hydrophobic after addition of nrf. moreover, synchronous fluorescence demonstrated that the microenvironment around tyrosine (tyr) had a trivial increase. these, and the results of hsa-nrf in the presence of emf with khz, illustrates the same results inferred from quenching and blue shift. however, there was a significant decrease in k sv of nrf with hsa in presence of emf exposure. moreover, the binding parameters including the number of binding sites and the binding constant were calculated form hill equation. conclusion: it was shown that nrf could induce conformational changes in hsa both in the absence and presence of emf with no significant difference. yet, the affinity is decrease significantly in the presence of emf. the clinical implications are discussed in detail. characterization of the biochemical properties and biological function of the formin homology domains of drosophila we characterised the properties of drosophila melanogaster daam-fh and daam-fh -fh fragments and their interactions with actin and profilin by using various biophysical methods and in vivo experiments. the results show that while the daam-fh fragment does not have any conspicuous effect on actin assembly in vivo, in cells expressing the daam-fh -fh fragment a profilindependent increase in the formation of actin structures is observed. the trachea specific expression of daam-fh -fh also induces phenotypic effects leading to the collapse of the tracheal tube and lethality in the larval stages. in vitro, both daam fragments catalyze actin nucleation but severely decrease both the elongation and depolymerisation rate of the filaments. profilin acts as a molecular switch in daam function. daam-fh -fh , remaining bound to barbed ends drives processive assembly of profilin-actin, while daam-fh forms an abortive complex with barbed ends that does not support profilinactin assembly. both daam fragments also bind to the sides of the actin filaments and induce actin bundling. these observations show that the drosophila melanogaster daam formin represents an extreme class of barbed end regulators gated by profilin. electron spin echo studies of free chain-labelled stearic acids interacting with b-lactoglobulin rita guzzi, luigi sportelli, rosa bartucci dipartimento di fisica, università della calabria, rende (cs), italy b-lactoglobulin (blg) binds non-covalently fatty acids within its central calyx, a cavity in the barrel formed by the strands ba-bh. we present results of pulsed electron paramagnetic resonance (epr) spectroscopy on the interaction of blg with stearic acids spin-labelled at selected positions, n, along the acyl chain (n-sasl, n = , , , , ). d o-electron spin echo envelope modulation (eseem) fourier transform spectra indicate that all segments of the bound chains in the protein binding site are accessible to the solvent. the extent of water penetration decreases progressively on moving from the first segments toward the terminal methyl end of the chain. about % of the nitroxides in the upper part of the chain (n = , ) are h-bonded by a single water molecule and this fraction reduces to % at the chain terminus (n = , ). a lower fraction of the nitroxides are h-bonded by two water molecules, and it decreases from about % to a vanishingly small value on going down the chain. echo-detected ed-epr spectra reveal subnanosecond librational motion of small amplitude for both -and -sasl in the protein cavity. the temperature dependence of the librations is more marked for -sasl and it arises mainly from an increase in librational amplitude with increasing temperature. fusion peptides (fp) pertaining to the spike glycoprotein from severe acute respiratory syndrome (sars) coronavirus are essential for the fusion between viral and host cellular membranes. here we report a biophysical characterization of the interaction of two putative fps with model membranes. fluorescence and dsc experiments showed that both peptides bind stronger to anionic than to zwitterionic lipid membranes. esr spectra showed that toac-sars ifp rotational dynamics is modulated by lipid composition and ph as compared to the spectrum of this peptide in solution. however, stearic acid spin labels reported no changes on the dynamic structure of zwitterionic micelles, whereas the whole chain of anionic surfactants was perturbed by the peptides. finally, cd data revealed a predominant b-strand structure for sars fp and an a-helix for sars ifp in the presence of micelles, in contrast to their disordered structures in buffer. overall the results point out that electrostatic and hydrophobic interactions are both important to the energetic behavior of peptide membrane interaction. these findings might provide a useful rationale for the elucidation of one of the steps involved in the fusion process, and thus help understanding the more general way of action of fps at a molecular level. interaction of filamentous actin and ezrin within surface modified cylindrical nanopores daniela behn , and claudia steinem institute for organic and biomolecular chemistry, university of gö ttingen, tammannstraße , gö ttingen, germany, ezrin is a member of the ezrin-radixin-moesin (erm) protein family that acts as a dynamic linker between the plasma membrane and the actin cytoskeleton and is hence involved in membrane organization, determination of shape and surface structures and other cellular processes. the protein is highly enriched in microvilli of polarized epithelial cells, where it binds filamentous actin (f-actin) with its c-terminal domain, while the n-terminal domain is connected to the plasma membrane via specific binding to l-a-phosphatidylinositol- , -bisphosphate (pip ). nanoporous anodic aluminum oxide (aao) films provide similar dimensions as microvilli and are thus a versatile template to investigate the interaction of ezrin with f-actin within spationally confined areas. owing to their optical transparency, functionalized aaos can be used to measure the binding process of ezrin to a pip containing solid supported membrane by means of time resolved optical waveguide spectroscopy (ows). confocal laser scanning microscopy (clsm) will elucidate, whether f-actin binding to ezrin takes place within or atop the nanopores. furthermore, elasticity mapping of f-actin filaments by means of atomic force microscopy will allow determining binding forces and the lateral tension of the actin cytoskeleton. in vitro application of porphyrin photosensitisers on mcf , hela and g tumour cell lines binder s., kolarova h., bajgar r., tomankova k., daskova a. deparment of medical biophysics, faculty of medicine of palacky university, olomouc, czech republic tumour treatment presents a challenge to all scientists and clinicians. contemporary methods like radiotherapy, chemotherapy or surgery have many undesirable side effects. photodynamic therapy (pdt) seems to be one of alternatives which can be helpful in malignant cell therapy. pdt is not only limited to cancer treatment but is also used as an alternative for cardiovascular, skin and eye disease treatment. pdt employs photosensitive agents which need to be activated by light which is not harmful to a patient. the activated photosensitive agent provokes a formation of reactive oxygen species leading to cell damage or death. the phototoxicity of the two porphyrin photosensitizer (tmpyp, zntpps . h o) on the malignant cell lines (g , hela, mcf ) irradiated with the jcm - doses was evaluated by ros production assay, mtt assay and comet assay. our results indicate higher efficiency of tmpyp over zntpps . h o. as for the photodynamic effectiveness of the used photosensitizers on chosen cell lines we found that hela cell line is the most sensitive to phototoxic damage induced by tmpyp. p- nmr analysis of the respiratory syncytial virus m - protein structure and of its interaction with some of its targets c. sizun the respiratory syncytial virus (rsv) is a major cause of acute respiratory tract infections (bronchiolitis, pneumonia) in human and a leading cause of viral death in infants and immunocompromised patients. rsv genome is formed of a single non-segmented negative strand rna which transcription and replication is ensured by a specific rna-dependent rna polymerase complex formed of the large (l) polymerase subunit and of several cofactors. this complex has no cellular counterpart and represents an ideal target for antiviral drugs. among the cofactors, m - acts as an antitermination factor and increases the polymerase processivity. its central domains has been shown, in vitro, to bind the phosphoprotein p and genomic rna in a competitive manner. here we report the nmr structure of this central domain and its interaction with p and rna fragments. m - shares structural similarity with vp , a transcription factor of ebola virus. the binding surfaces for rna and p are distinct but overlapping. rna binds to a basic cluster located next to residues found to be critical for transcription both in vitro and in vivo by mutational analysis. we speculate that m - might be recruited by p to the transcription complex, where interaction with rna takes place, stabilized by additional elements. force spectroscopy at the membrane-cytoskeleton interface: interactions between ezrin and filamentous actin julia a. braunger , , ingo mey and claudia steinem institute for organic and biomolecular chemistry, georg-august-university of gö ttingen, tammannstraße , gö ttingen, germany, ggnb doctoral program: imprs - ezrin, a member of the erm (ezrin/radixin/moesin) protein family, provides a regulated linkage between the plasma membrane and the actin cytoskeleton. it contributes to the organization of structurally and functionally distinct cortical domains participating in adhesion, motility and fundamental developmental processes. ezrin is negatively regulated by an intramolecular interaction of the terminal domains that masks the f-actin binding site. a known pathway for activation involves the interaction of ezrin with phosphatidylinositol , bisphosphate (pip ) in the membrane, followed by phosphorylation of the threonine residue in the c-terminal domain. to date, it is unclear to what extent both regulatory inputs contribute to the activation. we developed an in vitro system that facilitates the specific analysis of the interaction forces between ezrin and f-actin by means of atomic force spectroscopy (afm). applying ezrin wild type and the pseudophosphorylated mutant protein ezrin t d, respectively, permits to monitor the individual influence of phosphorylation on the f-actin-ezrin interaction. thus, a thorough characterization of the acting forces at the ezrin-actin interface will elucidate the activation mechanism of ezrin. delivery system even more efficient, we have constructed nano-carrier by coating of ldl by polyethylene glycol (peg) . the hydrophilicity of peg should reduce the interaction of ldl with other serum proteins and consequently decrease the redistribution of loaded drug from ldl to the (lipo)proteins. dynamic light scattering was used for determination of hydrodynamic radius of ldl-peg particles. cd spectroscopy measurements didn't reveal structural changes of apoliprotein b- (ligand for ldl receptors on cell surface), after conjugation of peg with ldl. interaction of ldl-peg complexes with hypericin (hyp) a natural photosensitizer was studied by fluorescence spectroscopy. we have demonstrated accumulation of higher number of hyp in ldl-peg than ldl particles. however, the kinetics of hyp redistribution from hyp/ldl-peg complex to free ldl have similar parameters as those for the kinetics of hyp transfer between non-modified ldl molecules. we suggest that hyp molecules are mostly localized in the vicinity of the surface of the ldl-peg particles and they are prone to redistribution to other serum proteins. grant support: lpp- - , vega- - . modification of the head-group of aminophospholipids by glycation and subsequent lipid oxidation affect membrane's structure causing cell death. these processes are involved in the pathogenesis of aging and diabetes. non-enzymatic glycation forms in the first step a schiff base (sb), which rearranges to a more stable ketoamine, amadori product, which leads to the formation of a heteregenous group of compounds (ages). although several studies have been focused on identification of aminophospholipid glycation products, less attention has been paid to kinetic mechanism of the reaction. for that reason, in the present work, we compare the kinetic reactivity of polar head-group of phosphatidylethanolamine (pe) and phosphatidylserine (ps), the two target phospholipids components of mammalian cell membranes. the reaction of pe and ps's head-group with glycating compounds (glucose and arabinose) was studied in physiological conditions by using nmr spectroscopy. the obtained formation rate constants for sb are lower than those determined for the sb of the peptide ac-phe-lys with the same carbohydrates. it suggests that the phosphate group may delay the glycation process. moreover, the ps's head-group has a carboxilic group in the structure, which affects the stability of the sb. we developed ultrasensitive, elisa-like nanoimmuno assays suitable for proteomics/interactomics studies in low sample volumes. we exploit the approach of dna microarray technologies applied to proteomics [ ] , in combination with atomic force microscopy (afm) to generate functional protein nanoarrays: semisynthetic dna-protein conjugates are immobilized by bioaffinity within a nanoarray of complementary ssdna oligomers produced by afm nanografting (ng). a nanoarray of different antibodies or synthetic molecular binders can be generated in a single operation, once the dna nanoarray is produced. moreover, ng allows adjusting the packing density of immobilized biomolecules to achieve optimum bio-recognition. afm-based immunoassays with these nanoarrays were shown to achieve detection limit of hundreds of femto molar, in few nanoliters volumes, with very high selectivity and specificity [ ] . to detect the hybridization efficiency of our devices, we run a combined experimental-computational study that provides quantitative relations for recovering the surface probe density from the mechanical response (afmcompressibility measurements) of the sample. nucleoside analogues used as anticancerous drugs can be rapidly degraded within treated cells, constituting a major obstacle of their therapeutic efficiency. among the enzymes responsible for this degradation, the cytosolic 'nucleotidase ii (cn-ii) catalyses the hydrolysis of some nucleoside monophosphates. in order to improve the efficacy of anticancerous drugs and to define the precise role of cn-ii, new original inhibitors have been developed against cn-ii. virtual screening of chemical libraries on the crystal structure has allowed us to identify very promising candidates that turned to be competitive inhibitors of cn-ii. one molecule was included in the anticancerous treatment of tumoral cell lines in order to evaluate the potential benefit and could induce in fine a sensitization of certain anticancerous drugs. we also explore other inhibitors targeting the allosteric sites of this enzyme using a strategy that takes into account the dynamics of cn-ii. the chemical structures of the newly identified allosteric inhibitors as well as the atomic interactions with enzyme residues will be presented. the final goal of this study is to find molecules that can freeze the enzyme in a conformation for which its dynamics is severely limited and therefore its function. native mass spectrometry to decipher interactions between biomolecules sarah cianferani laboratoire de spectromé trie de masse bio-organique, université de strasbourg, iphc, rue becquerel strasbourg, france. cnrs, umr , strasbourg, france mass spectrometry is generally understood as ''molecular mass spectrometry'' with multiple applications in biology (protein identification using proteomic approaches, recombinant protein and monoclonal antibody characterization). an original and unexpected application of mass spectrometry emerged some twenty years ago: the detection and the characterization of intact biological noncovalent complexes. with recent instrumental improvements, this approach, called native ms, is now fully integrated in structural biology programs as a complementary technique to more classical biophysical approaches (nmr, crystallography, calorimetry, spr, fluorescence, etc.). native ms provides high content information for multiprotein complexes characterization, including the determination of the binding stoichiometries or oligomerization states, sitespecificities and relative affinities. recent developments of ion mobility / mass spectrometry instruments (im-ms) provide a new additional level for ms-based structural characterization of biomolecular assemblies allowing size and shape information to be obtained through collisional cross section measurements. these different aspects of native ms for structural characterization of biomolecular assemblies will be illustrated through several examples, ranging from multiprotein-complexes to protein/nucleic acid assemblies. complex coacervation is a process which may result by electrostatic interaction between charged polysaccharides. it depends essential on ph, ionic strength and biopolymers properties like ratio, concentration and charge density. in this case, the main work was to study the structural properties of a colloidal system of opposite charge -chitosan and gum-arabic by atomic force microscopy (afm). according to some of complexes show tendency to agglomerate. this depends on the molar ration of the macromolecules and their relative molecular weights. afm micrographs show, too, that some formation of irregular aggregates by both polymers were due to presence of noncharged polar monomers in chitosan molecule. at higher gum-arabic/chitosan ratios biopolymer concentrations, coacervates appear like a core-shell miccelar structure composed of hydrophobic core (charge neutralized segments) stabilized by the excess component (positive zeta potential) and non-charged segments of gum arabic. interaction of human serum albumin with rutin theoretical and experimental approaches Ícaro p caruso human serum albumin (hsa) is the principal extracellular protein with a high concentration in blood plasma and carrier for many drugs to different molecular targets. flavonoids are a large class of naturally occurring polyphenols widely distributed in plants. rutin (quercetin- -rutinoside) is the glycoside between flavonoids quercetin and disaccharide rutinose. like other flavonoids, rutin displays anti-inflammatory and anti-oxidant properties. the interaction between hsa and rutin was investigated by fluorescence spectroscopy, ab initio and molecular modeling calculations. fluorescence titration was performed by keeping the hsa concentration ( lm) constant and stoichiometrically varying the rutin concentration ( - lm) . the emission spectra were obtained in the range of to nm, with the excitation wavelength at nm. the obtained fluorescence data were corrected for background fluorescence and for inner filter effects. the stern-volmer quenching constant values were . and . m - at and k, respectively. from the modified stern-volmer association constants . (at k) and . m - (at k) were calculated the thermodynamic parameters dh = . kj mol - , dg k = - . kj mol - and dg k = - . kj mol - , and ds = . kj mol - k - . fluorescence quenching method was used also to study the binding equilibria thus determining the number of binding sites . and . , and binding constant . m - and . m - at and k, respectively. the efficient quenching of the trp fluorescence by rutin indicates that the binding site for the flavonoid is situated within subdomain iia of hsa. the distance r = . nm between the donor (hsa) and the acceptor (rutin) was obtained according to fluorescence resonance energy transference (fret). wavelength shifts in synchronous fluorescence spectra showed the conformation of hsa molecules is changed in the presence of rutin. the structure of rutin utilized in molecular modeling calculation was obtained by gaussian program. the optimization geometry of rutin was performed in its ground states by using ab initio dft/b lyp functional with - g(d,p) basis set used in calculations. the molecular electrostatic potential (mep) was calculated to provide the molecular charge distribution of rutin. the gap energy value between the homo and lumo of the rutin molecule was about . ev which indicates that rutin is classified as a reactive molecule. from molecular modeling calculation the interaction between hsa and rutin was investigated using the autodock program package. the three-dimensional coordinates of human serum albumin were obtained from the protein data bank (entry pdb code ao ) and of rutin were obtained from output optimization geometry of dft. the best energy ranked result shows that rutin is localized in the proximity of single tryptophan residue (trp ) of hsa that is in agreement with the fluorescence quenching data analysis. the effect of toxofilin on the structure of monomeric actin lívia czimbalek, veronika kollá r, roland kardos, gá bor hild university of pé cs, faculty of medicine, department of biophysics, pé cs, hungary actin is one of the main components of the intracellular cytoskeleton. it plays an essential role in the cell motility, intracellular transport processes and cytokinesis as well. toxoplasma gondii is an intracellular parasite, which can utilise the actin cytoskeleton of the host cells for their own purposes. one of the expressed proteins of t. gondii is the kda-sized toxofilin. the long protein is a monomeric actinbinding protein involved in the host invasion. in our work we studied the effect of the actin-binding site of toxofilin - on the g-actin. we determined the affinity of toxofilin to the actin monomer. the flourescence of the actin bound e-atp was quenched with acrylamide in the presence or absence of toxofilin. in the presence of toxofilin the accessibility of the bound e-atp decreased, which indicates that the nucleotide binding cleft is shifted to a more closed conformational state. the results of the completed experiments can help us to understand in more details what kind of cytoskeletal changes can be caused in the host cell during the invasion of the host cells by intracellular parasites. t bacteriophage, as a surrogate on non-enveloped viruses was selected as a test system. both tmpcp and bmpcp and their peptide conjugates proved to be efficient photosensitizers of virus inactivation. the binding of porphyrin to phage dna was not a prerequisite of phage photosensitization, moreover, photoinactivation was more efficiently induced by free than by dna bound porphyrin. mechanism of photoreaction (type i. versus type ii) and the correlation between dna binding, singlet oxygen production and virus inactivation capacity was also analyzed. dna binding reduced the virus inactivation due to the reduced absorbance and singlet oxygen production of bound photosensitizer, and altered mechanism of photoinactivation. as optical melting studies of t nucleoprotein revealed, photoreactions of porphyrin derivatives affected the structural integrity of dna and also of viral proteins, even if the porphyrin did not bind to np or was selectively bound to dna. anesthesia is a medical milestone (friedman & friedland, medicine's greatest discoveries, ) and local anesthetics (la) are the most important compounds used to control pain in surgical procedures. however, systemic toxicity is still a limitation for la agents as well as low solubility, as for tetracaine (ttc). approaches to improve la effects include macrocyclic systems formation, such as in cyclodextrins (cd). we have studied complexes formed between ttc and b-cd or hydroxylpropyl (hp)-b-cd through nmr and other (uv-vis, fluorescence, dsc and x-ray diffraction) techniques. at ph . a : stoichiometry of complexation was detected for both complexes, with association constants of m - and m - for ttc:b-cd and ttc:hp-b-cd, respectively. the nuclear overhauser nmr data disclosed trough the space proximities between hydrogens h h and h iat the aromatic ring of ttc -and hydrogens from the inner cavity of the cyclodextrins, allowing us to propose the topology of ttc:cd interaction. complex formation did not curb ttc association with model (liposomes) and biological membranes since the total analgesic effect (infraorbital nerve blockade in rats) induced by mm ttc increased % upon complexation. supported by (fapesp # / - , - ) brazil. p- itc as a general thermodynamic and kinetic tool to study biomolecule interactions philippe dumas , dominique burnouf , eric ennifar , sondes guedich , guillaume bec , guillaume hoffmann isothermal titration calorimetry (itc) is a powerful technique for thermodynamic investigations that is little used to obtain kinetic information. we have shown that, in fact, the shape of the titration curves obtained after each ligand injection is strictly governed by the kinetics of interaction of the two partners. a simple analysis allowed us to explain several facts (e.g. the variation of time needed to return to equilibrium during a titration experiment). all simplifications were further released to obtain a very realistic simulation of an itc experiment. the method was first validated with the binding of the nevirapine inhibitor onto the hiv- reverse transcriptase by comparison with results obtained by biacore tm . importantly, for more complex systems, the new method yields results that cannot be obtained in another way. for example, with the e. coli transcription-regulator thiamine pyrophosphate riboswitch, we could resolve kinetically and thermodynamically the two important successive steps: ( ) the binding of the tpp ligand and ( ) the subsequent rna folding. our results show that initial tpp binding is controlled thermodynamically by tpp concentration, whereas the overall transcription regulation resulting from rna folding is kinetically controlled. gfps, due to their tendency to dimerize at high concentration. we have characterized for the first time the selfassociation properties of cfp (cyan fluorescent protein), the fluorescent protein mostly used as fret donor. we found that the fluorescence quenching observed at high expression level in the cell cytoplasm and the fluorescence depolarization measured at high concentration in vitro are insensitive to the a k mutation, shown to dissociate other gfp dimers. both phenomena are satisfactorily accounted for by a model of non-specific homo-fret between cfp monomers due to molecular proximity. modeling the expected contributions to fluorescence depolarization of rotational diffusion, homo-fret within a hypothetical dimer and proximity homo-fret shows that cfp has a homo-affinity at least times lower than gfp. this difference is due to an intrinsic mutation of cfp (n i), originally introduced to increase its brightness and that by chance also disrupts the dimers. biomolecular recognition typically proceeds in an aqueous environment, where hydration shells are a constitutive part of the interacting species. the coupling of hydration shell structure to conformation is particularly pronounced for dna with its large surface to volume ratio. conformational substates of the phosphodiester backbone in b-dna contribute to dna flexibility and are strongly dependent on hydration. we have studied by rapid scan ftir spectroscopy the isothermal b i -b ii transition on its intrinsic time scale of seconds. correlation analysis of ir absorption changes induced by an incremental growth of the dna hydration shell identifies water populations w (po --bound) and w (non-po --bound) exhibiting weaker and stronger h-bonds, respectively, than those in bulk water. the b ii substate is stabilized by w . the water h-bond imbalance of - kj mol - is equalized at little enthalpic cost upon formation of a contiguous water network (at - h o molecules per dna phosphate) of reduced !(oh) band width. in this state, hydration water cooperatively stabilizes the b i conformer via the entropically favored replacement of w -dna interactions by additional w -water contacts, rather than binding to b i -specific hydration sites. such water rearrangements contribute to the recognition of dna by indolicidin, an antimicrobial -mer peptide from bovine neutrophils which, despite little intrinsic structure, preferentially binds to the b i conformer in a water-mediated induced fit. in combination with cd-spectral titrations, the data indicate that in the absence of a bulk aqueous phase, as in molecular crowded environments, water relocation within the dna hydration shell allows for entropic contributions similar to those assigned to water upon dna ligand recognition in solution. segmental-labeling expression of sh domains of cd ap protein to study interaction with their ligand i.f. herranz-trillo , j.l. ortega-roldan , n.a.j. van transient and low affinity interactions within the cell can be enhanced by the combination of more than one domain. up to now most of the effort has been put on the study of the regulation in the affinity and specificity of the binding to isolated single domains but little is known about the effect of the presence of a second or third domain. multiple examples of proteins containing tandem domains exist in the genome like the cin /cms family of adaptor proteins. in this family all three n-terminal sh domains are involved in a wide variety of different interactions, they share higher similarity among themselves than to any other sh domains, suggesting an overlapping specificities in binding. cd associated protein (cd ap) is an adaptor protein and belongs to this family, its n-terminus consists of three sh domains and the interaction of each one of them with its target(-s) might be ultimately modulated by the presence of its next-door-neighbor. in this work we present the expression and purification of the tandem cd ap-sh a/ sh b produced by segmental labeling techniques that allow us to express the domains with different isotopic label, improving the nmr signal and facilitating to study the interaction of the natural ligand in the presence of nextdoor-neighbor domain. there are plenty of molecules that exert their effects at the cell membrane. the evaluation of these interactions, frequently quantified by the nernst lipid/water partition constant (kp), helps to elucidate the molecular basis of these processes. we present here a recently derived and tested method to determine kp for single solute partitions using fpotential measurements. the concept was then extended to the interaction of supramolecular complexes with model membranes. a simultaneous double partition with an aqueous equilibrium is considered in this partition model. the results were validated by dynamic light scattering -dls, f-potential, fluorescence spectroscopy and laser confocal microscopy experiments. we evaluated the interaction of supramolecular complexes (peptides derived from dengue virus proteins with oligonucleotides) with luv to study our biophysical models. dengue virus (dv) infects over - million people every year and may cause viral hemorrhagic fever. no effective treatment is available and several aspects of its cellular infection mechanism remain unclear. the extension of the interactions of these complexes with biomembranes helps to elucidate some steps of dv life cycle. the aggregation of amphotericin b in the lipid membranes induced by k + and na + ions: langmuir monolayers study marta arczewska, mariusz gagoś department of biophysics, university of life sciences in lublin, poland the polyene antibiotic amphotericin b (amb) is currently the drug of choice in the treatment of fungal infections despite its undesirable side effects. according to the general conviction, the biological action of the drug is based on the formation of transmembrane channels which affect physiological ion transport, especially k + ions. this work reports the results of langmuir monolayers study of the effect of k + and na + ions on the molecular organizations of amb in the model lipid membrane. the two-component monolayers containing amb and phospholipid (dppc) have been investigated by recording surface pressure-area isotherms spread on aqueous buffers containing physiological concentration of k + and na + ions. the strength of the amb-dppc interactions and the stability of the mixed monolayers were examined on the basis of surface pressure measurements, the compressional modulus and the excess free energy of mixing. the obtained results proved a high affinity of amb towards lipids in the presence of k + than na + ions. the most stable mixed monolayers were formed with the : and : stoichiometry in the presence of k + and na + ions, respectively. this research was financed by ministry of education and science of poland within the research project n n . microcalorimetric study of antibiotic amphotericin b complexes with na + , k + and cu + ions arkadiusz matwijczuk, grzegorz czernel, mariusz gagoś department of biophysics, university of life sciences in lublin, poland amphotericin b (amb) as a metabolite of streptomyces nodosus is one of the main polyene antibiotics applied in the treatment of deep-seated mycotic infections. we presented microcalorimetric (dsc) study of molecular organization of amphotericin b in lipid membranes induced by na + , k + and cu + ions. the analysis of dsc curves indicates the influence of na + and k + ions on the main phase transition of pure dppc lipid. for the molar fractions of , , , mol% amb in dppc we observed the thermal shift towards higher temperatures in respect to pure lipid, both in the presence of na + and k + ions. this result may be connected with the changes in dynamic properties of the model membrane system. in case of amb-cu + complexes in aqueous solution at two ph values, . and . , the dsc measurements reported endothermic heat effect. this phase transition was related to the dissociation process of amb-cu + complexes. the formation of amb-cu + complexes are accompanied by changes of the molecular organization of amb especially disaggregation. these all observed effects might be significant from a medical point of view. this research was financed by ministry of education and science of poland within the research project n n . membrane proteins and peptides are acting in an environment rich in other proteins or peptides. aim of our study was to understand how such molecular crowding and resulting intermolecular interactions can influence the behavior of membrane proteins, using various antimicrobial peptides and membrane proteins as examples. in the case of antimicrobial peptides we have previously described a change in their alignment in the membrane at a characteristic threshold concentration. to understand whether this change is due to unspecific crowding or specific peptidepeptide interactions, we tested if this re-alignment depends on the presence of additional peptides. in most cases we found a similar re-orientation behavior irrespective of the added peptide type, indicating unspecific crowding. when pairing pgla and magainin- , however, we observed a distinctly different sequence of pgla re-orientation in the membrane, indicating a specific interaction between these two peptides, which correlates well with their known synergistic activity. a rather different effect of crowding was observed for the larger channel protein mscl, which was found to form clusters of functionally active proteins in the membrane. we propose that this clustering is caused by lipid-mediated protein-protein interactions. water, hydrophobic interaction, and protein stability j. raul grigera and c. gaston ferrara instituto de física de líquidos y sistemas bioló gicos (iflysib), conicet-unlp, la plata, argentina although there are several forces maintaining protein structure, it is well know that hydrophobic interaction is the dominant force of protein folding. then, we can infer that any factor that alters hydrophobic interaction will affect the protein stability. we have study by computer simulation a model system consisting in solution of lenard-jones particles in water (spc/e model) at different pressures and temperatures and analyzed the solubility i.e. the aggregation properties, of such a system. from the obtained data we are able to build up the phase surface determining the critical point. the computing results where compared with experimental data of binary mix of non polar substance in water and of protein denaturation, finding high coincidence on the critical point. since the behavior of our model system can only be due to hydrophobic effects, the coincidences with the denaturation of proteins allow us to conclude that the dominant factor that determine temperature and pressure denaturation of proteins is the hydrophobic interaction. the temperature and pressures at which the denaturation, as well the disaggregation of simple non-polar particles, starts agree with what we could expect based on the cross over line of the low to high density structure water transition. the functional reconstitution of a mitochondrial membrane protein into a lipid bilayer was studied using a quartz crystal microbalance. the xhis-tagged protein was immobilised via specific binding to a cu + terminated sensor surface, with a change in frequency indicating approximately % coverage of the sensor surface by the protein. a lipid bilayer was reconstituted around the protein layer, with a final change in frequency that is consistent with the remaining area being filled by lipid. incubation with a specific ligand for the protein resulted in a significant change in frequency compared to the interaction with the surface or lipid alone. the change is greater than expected for the mass of the ligand, indicating a possible conformational change of the protein, such as the opening of a channel and increased water content of the layer. electrical impedance measurements on the same system have provided additional evidence of protein-lipid bilayer formation, and it is intended that this system will be studied with neutron reflectometry to characterise potential ligand induced channel formation. valuable functional and structural information about this membrane protein was obtained by using surface sensitive techniques to study the protein in a biomimetic lipid bilayer. visualizing and quantifying hiv-host interactions with fluorescence microscopy jelle hendrix , *, zeger debyser , johan hofkens and yves engelborghs laboratory for biomolecular dynamics, university of leuven, belgium, laboratory for molecular virology and gene therapy, university of leuven, belgium, laboratory for photochemistry and spectroscopy (*present address), university of leuven, belgium protein-chromatin interactions are classically studied with in vitro assays that only provide a static picture of chromatin binding. fluorescence correlation spectroscopy (fcs) is a non-invasive technique that can be used for the same purpose. being applicable inside living cells it provides dynamic real-time information on chromatin interactions. transcriptional co-activator ledgf/p has well characterized protein and chromatin interacting regions. we studied ledgf/p in vitro and inside living cells with fcs and other techniques (luminescent proximity assay, spot/half-nucleus fluorescence recovery after photobleaching, continuous photobleaching). protein-protein interactions in living cells can be monitored with fluorescence cross-correlation spectroscopy (fccs) using fluorescent proteins as genetic labels. advantages over using fö rster resonance energy transfer (fret) are the independence from intermolecular distance and knowledge of absolute protein concentrations. we characterized fccs with fluorescent proteins in vitro and then studied the intracellular complex of ledgf/p and the hiv- integrase (in) enzyme both with fret and fccs. nucleus and its compartment nucleolus are a seat of enormous biosynthetic activity in human cancer cells. nucleolar proteins, e.g. b or c , play an important role in regulation of cell division and proliferation. one of the strategies how to intermit malignant cell proliferation is affecting, e.g. by drug treatment, a net of intracellular protein interactions to bring the cell on a way of apoptosis. a cytostatic agent actinomycin d initiates apoptosis in human cancer cells, as well as in normal peripheral blood lymphocytes. at the same time, translocation of b and c into nucleoplasm is observed in the treated cells. therefore interaction between nucleolar and apoptotic proteins comes into a question. co-immunoprecipitation, fluorescence microscopy and yeast two hybrid analysis are used to answer it. in co-immunoprecipitation experiments, tumor suppressor p showed up to be a promising candidate for the interaction. fluorescence deposits mostly constituted by variants of transthyretin (ttr), a homotetrameric plasma protein implicated in the transport of thyroxine and retinol [ ] . nowadays, the only effective therapy for ttr amyloidosis is liver transplantation. new therapeutic strategies are being developed taking advantage of our current understanding of the molecular mechanisms of amyloid formation by ttr [ ] . a significant effort has been devoted to the search and rational design of compounds that might decrease ttr tetramer dissociation, for example, through ligand binding at the thyroxine binding sites of ttr [ , ] . here, we use isothermal titration calorimetry (itc) to characterize the thermodynamic binding signature of potential ttr tetramer stabilizers, previously predicted by computerassisted methods [ ] . itc allows the measurement of the magnitude of the binding affinity, but also affords the characterization of the thermodynamic binding profile of a protein-ligand interaction. high affinity/specificity ttr ligands, enthalpically and entropically optimized, may provide effective leads for the development of new and more effective drug candidates against ttr amyloidosis. we have established a set of vectors to promote easy cloning of ecfp and eyfp fusions with any protein of interest. we exploit these fluorescent fusion proteins to study protein-protein interactions by fluorescence lifetime of ecfp. the decrease of ecfp lifetime reveals fret between ecfp and eyfp and hence the interaction between proteins in question. groel-groes chaperonin complex is required for the proper folding of eschericia coli proteins. bacteriophage t and its distant relative coliphage rb encode co-chaperon proteins (respectively gp and coco) that can replace groes in the chaperonin complex. gp is also required in the folding of the major capsid protein of the phage. prd is a large membrane-containing bacteriophage infecting gram-negative bacteria such as e. coli and salmonella enterica. it has kb long linear dsdna genome and the capsid has an icosahedral symmetry. the groel-groes chaperonin complex is needed in the assembly of prd . we have found evidence that prd protein p can work similar way as other viral co-chaperones and substitute groes in chaperonin complex. fluorescence lifetime studies between proteins groel and p reveals an interaction that backs up the theory. structural modification of model membranes by fibrillar lysozyme as reaveled by fluorescence study a.p. kastorna v.n. karazin kharkiv national university, svobody sq., kharkiv, , ukraine recent experimental findings suggest that protein aggregation, leading to the formation and depositions of amyloids play a central role in the neurodegenerative diseases, type ii diabetes, systemic amyloidosis, etc. in the present study we focused our efforts on investigation of the influence of fibrillar lysozyme on the structural state of model lipid membranes composed of phosphatidylcholine and its mixtures with cardiolipin ( mol %) and cholesterol ( mol %). to achieve this purpose, two fluorescent probes with different bilayer location, , -diphenyl- , , -hexatriene (dph) distributing in membrane hydrocarbon core and -lauroyl- -dimethylaminonaphthalene (laurdan) locating at lipid-water interface, have been employed. the changes in membrane viscosity under the influence of amyloid lysozyme were characterized by fluorescence anisotropy of dph. this fluorescence parameter was not markedly affected by fibrillar protein in all types of model membranes. the changes in emission spectra of laurdan were analysed by the generalized polarization value (gp). it was found that adding of amyloid lysozyme resulted in the increment of gp value. our data suggest that lysozyme fibrils cause reduction of bilayer polarity and increase of lipid packing density. isothermal titration calorimetry (itc) is the gold standard for the quantitative characterisation of protein-ligand and protein-protein interactions. however, reliable determination of the dissociation constant (k d ) is typically limited to the range lm [ k d [ nm. nevertheless, interactions characterised by a higher or lower k d can be assessed indirectly, provided that a suitable competitive ligand is available whose k d falls within the directly accessible window. unfortunately, the established competitive itc assay requires that the high-affinity ligand be soluble at high concentrations in aqueous buffer containing only minimal amounts of organic solvent. this poses serious problems when studying protein binding of small-molecule ligands taken from compound libraries dissolved in organic solvents, as is usually the case during screening or drug development. here we introduce a new itc competition assay that overcomes this limitation, thus allowing for a precise thermodynamic description of highand low-affinity protein-ligand interactions involving poorly water-soluble compounds. we discuss the theoretical background of the approach and demonstrate some practical applications using examples of both high-and low-affinity protein-ligand interactions. interaction of myoglobin with oxidized polystyrene surfaces studied using rotating particles probe m. kemper , , d. spridon , l.j. van ijzendoorn , m.w.j. prins , eindhoven university of technology, department of applied physics, eindhoven, the netherlands, dutch polymer institute, eindhoven, the netherlands, philips research, eindhoven, the netherlands the interaction of proteins with polymer surfaces is of profound importance for the sensitivity of biosensors. polymer surfaces are often treated in order to tune their chemical and physical properties, for example by oxidation processes. to get a better understanding of the association of proteins to treated polymer surfaces, we use the rotating particles probe (x.j.a. janssen et al., colloids and surfaces a, vol. , p. , ). in this novel technique, protein coated magnetic particles are in contact with a substrate and the binding is recorded for all individual particles using a rotating magnetic field. we investigate the interaction of myoglobin coated magnetic particles to spincoated polystyrene surfaces that have been oxidized with a uv/ozone treatment. the surfaces have been characterized by xps, afm and water contact angle measurements. we will demonstrate a clear influence of polystyrene oxidation on the binding fractions of the myoglobin coated particles. we interpret the results in terms of dlvo-theory: electrostatic as well as electrodynamic properties of the surfaces will be influenced by the oxidation. interact with monomeric and/or filamentous actins. twinfilin is a - kda protein composed of two adf-homologue domains connected by a short linker. in our work we studied the effects of the mouse twinfilin- (twf ) on the monomeric actin. we determined the affinity of twf to the atp-actin monomer with fluorescence anisotropy measurement (k d = . lm). the fluorescence of the actin bound e-atp was quenched with acrylamide in the presence or absence of twf . in the presence of twinfilin the accessibility of the bound e-atp decreased, which indicates that the nucleotide binding cleft is shifted to a more closed conformational state. it was confirmed with stopped-flow experiments that the kinetics of nucleotide-exchange of actin decreased in the presence of twf . we determined the thermodynamic properties of twf and investigated the effect of twinfilin on the stability of actin monomer with differential scanning calorimetry. the twf stabilized the stucture of the g-actin. our results can help us to understand the regulation of actin cytoskeleton in more details. magnetic np have attracted attention due to their potential of contrast enhancement of magnetic resonance imaging and targeted drug delivery, e.g. tumor magnetic hyperthermia therapy. potential nephrotoxicity of single i.v. administration of fe o np was studied in female wistar rats i.v. administered either placebo ( % v/v rat serum in . % nacl), suspension of tio np (positive control, bimodal / nm distribution), or fe o np (bimodal / nm distribution) in doses of . , . or . mg/kg. rats were sacrificed h, -, -and -days after np injection (n= - /each group). administration of np did not alter kidney size significantly; renal function of np administered rats as monitored by plasma creatinine and urea concentrations, creatinine clearance and protein excretion rate did not differ significantly in either interval from rats administered placebo. one week after administration significant rise in plasma ca, its urinary and fractional excretion was observed in rats administered mg fe o /kg. plasma mg levels rose in this group and weeks after administration. no significant changes in the expression of tnf-a, tgf-b, and collagen iv genes in renal cortex were revealed. no obvious nephrotoxic effects were observed in rats after a single i.v. dose of fe o np. study was supported by fp ec eu: nanotest (development of methodology for alternative testing strategies for the assessment of the toxicological profile of nanoparticles used in medical diagnostics.), grant no.: . biomimetic supramolecular assemblies for studying membrane interactions in vitro and in vivo s. kolusheva, r. jelinek ben-gurion university, beer-sheva, israel we designed a novel biomimetic sensor, composed of conjugated polydiacetylene (pda) matrix embedded within lipid vesicles. the system is capable of detecting various compounds occurring within lipid membranes through rapid colorimetric as well as fluorescent transitions. the colorimetric response of the sensor is correlated to the extent of compound-membrane binding and permeation and quantified binding sensitivity to lipid composition. we describe a new disease diagnostic approach, denoted ''reactomics'', based upon reactions between blood sera and an array of vesicles comprising different lipids and polydiacetylene (pda), a chromatic polymer. we show that reactions between sera and such a lipid/pda vesicle array produce chromatic patterns which depend both upon the sera composition as well as the specific lipid constituents within the vesicles. through attachment of chromatic polydiacetylene (pda) nanopatches onto the plasma membrane, real-time visualization of surface processes in living cells is possible. the ras protein is mutated in % of human tumors. ras acts as a switch, transmitting a growth signal in an active gtp-bound form and turning the signal off in an inactive gdp-bound form. the switch off is accomplished by gtp hydrolysis, which is catalyzed by ras and can be further accelerated by gtpase activating proteins (gaps). mutations which prevent hydrolysis cause severe diseases including cancer. we investigate the reaction of the ras gap protein-protein complex by time-resolved ftir spectroscopy. detailed information on the mechanism and the thermodynamics of the reaction was revealed: first, the catalytic arginine-finger of gap has to move into the gtp binding pocket, then cleavage of gtp is fast and h po hydrogen-bonded in an eclipsed conformation to the b-phosphate of gdp is formed. further, we performed for the first time atr-ftir spectroscopy of ras in its native environment, a lipid membrane. in this setup we are able to do difference spectroscopy of the immobilized protein. interactions with other proteins can be determined in a similar way as in spr experiments but with the additional information from the infrared spectra. galectins are a family of animal lectins that specifically bind b-galactosides and have gained much attention due to their involvement in several biologic processes such as inflammation, cell adhesion and metastasis. in such processes, several issues are still not clear including the mechanisms of interaction with different carbohydrates. galectin- (gal- ) is a tandem-repeat type galectin that contains two carbohydrate recognition domains (crd-i and crd-ii) connected by a linking peptide. in this study, we performed spectroscopic studies of the carbohydrate-recognition domains from human gal- . our goals are two-fold: ( ) to monitor conformational changes in each domain upon its binding to specific ligands and then to correlate the observed changes with structural differences between the crds and ( ) to investigate the interaction between the crds and lipid model membranes. to achieve such objectives we used a combined approach of spectroscopic techniques involving circular dichroism and electron spin resonance. overall the results obtained so far show that crd-i and crd-ii have distinct behaviors in terms of carbohydrate recognition and membrane binding. this may be due to specific differences in their structures and certainly suggests a non-equivalent role in protein function. hemoglobin influence on lipid bilayer structure as revealed by fluorescence probe study o.k. kutsenko, g.p. gorbenko, v.m. trusova v.n. karazin kharkov national university, kharkov, ukraine hemoglobin (hb) is a red blood cell protein responsible for the oxygen transport. its affinity for lipid bilayers represents interest for gaining insight into protein biological function as well as for some applied aspects such as development of blood substitutes or biosensors. hb influence on lipid bilayer structure was investigated using fluorescent probes pyrene and prodan. model membranes were prepared of phosphatidylcholine (pc) and its mixtures with phosphatidylglycerol (pg) and cholesterol (chol). hb penetration into membrane interior is followed by the increase of relative intensity of pyrene vibronic bands and decrease of prodan general polarization value suggesting an enhancement of bilayer polarity. this implies that hb incorporation into membrane interior decreases packing density of lipid molecules, promoting water penetration into membrane core. chol condensing effect on lipid bilayer prevents protein embedment into bilayer, thus decreasing membrane hydration changes as compared to pc bilayers. in the presence of anionic lipid pg hb-induced increase of bilayer polarity was found to be most pronounced, pointing to the modulatory role of membrane composition in hb bilayer-modifying propensity. we present optimized sialic acid-based mimics binding in the low nanomolar range. molecular interactions were determined with surface plasmon resonance (spr), characterizing the affinity and the kinetics of binding. furthermore, isothermal titration calorimetry (itc) was applied to dissect the standard free energy of binding (dg°) into the standard enthalpy of binding (dh°) and the standard entropy of binding (ds°). in order to pass the cell membranes, most of these medicines has to be administrated to patients as nucleoside pro-drugs and not directly as triphosphorylated forms. because of the poor phosphorylation of the nucleoside analogues used in therapy, it is important to understand and to optimize their metabolism. our aim is to understand how compounds of chirality l turn away -phosphoglycerate kinase (pgk) from its normal glycolytic function to be converted into the triphosphate forms. in order to elucidate pgk mechanism and substrate specificity, we have measured the kinetics of the different steps of the enzymatic pathways by rapid mixing techniques and studied the influence of the nature of the nucleotide substrate thereon. we first performed an extensive study with d-and l-adp (see poster by p. lallemand). we are now extending the studies to other nucleotide diphosphates (some of them used in therapies). changes in the nature of the nucleobase or deletion of hydroxyl group of the sugar affect the efficiency of phosphorylation by pgk, either by decreasing dramatically their affinity or by altering the phospho-transfer step itself. structural explanations are given based on docking data. probing drug/lipid interactions at the molecular level represents an important challenge in pharmaceutical research, drug discovery and membrane biophysics. previous studies showed that enrofloxacin metalloantibiotic has potential as an antimicrobial agent candidate, since it exhibits antimicrobial effect comparable to that of free enrofloxacin but a different translocation route. these differences in uptake mechanism can be paramount in counteracting bacterial resistance. in view of lipids role in bacterial drug uptake, the interaction of these compounds with different e. coli model membranes were studied by fluorescence spectroscopy. partition coefficients determined showed that lipid/antibiotic interactions were sensitive to liposomes composition and that the metalloantibiotic had a higher partition than free enrofloxacin. these results corroborate the different mechanism of entry proposed and can be rationalized on the basis that an electrostatic interaction between the metalloantibiotic positively charged species, present at physiological ph, and the lipids negatively charged head groups clearly promotes the lipid/antimicrobial association. oligomerization and fibril assembly of amyloid b peptide amyloid b peptide (ab) forms a large amount of extracellular deposits in the brain of alzheimer's disease (ad) patients and it is believed that this peptide is related to the pathogenesis of that disease. the most abundant monomeric form of physiological ab (* %) is constituted by amino acids and is benign, but by an unknown mechanism this endogenous material becomes aggregated and neurotoxic. increasing evidence suggests that membrane interaction plays an important role in ab neurotoxicity. in this work it will be studied the interactions of ab( - ) with ctac (cationic), sds (anionic), pfoa (anionic with fluorine atoms) and og (nonionic) amphiphiles in monomeric and micellar forms. the results demonstrated that ab( - ) forms fibrils with different morphologies in the presence of micelles. in addition, the presence of micelles accelerates the formation of fibrils and decreases the lifetime of oligomers. we present here the exploitation of the powerful approach of surface plasmon resonance imaging and mass spectrometry coupling for protein fishing in biological fluids such as human plasma at the same sensitivity. on one hand, multiplex format spri analysis allows direct visualization and thermodynamic analysis of molecular avidity, and is advantageously used for ligand-fishing of captured bio-molecules on multiple immobilized receptors on a spri-biochip surface. on the other hand, maldi mass spectrometry is a powerful tool for identification and characterization of molecules captured on specific surface. therefore, the combination of spri and ms into one concerted procedure, using a unique dedicated surface, is of a great interest for functional and structural analysis at low femtomole level of bound molecules. to reach these goals, particular surface engineering has been engaged to maintain a high level of antibody grafting and reduce non-specific adsorption. thus, various chemistries have been tested and validated towards biological fluids such plasma, keeping in mind the capacity of the in situ investigation by ms. finally, signal to noise ratio was magnified leading to the characterization of protein lag , a potential marker of breast cancer, in human plasma. atenolol incorporation into pnipa nanoparticles investigated by isothermal titration calorimetry mihaela mic, ioan turcu, izabell craciunescu, rodica turcu, national institute for r&d of isotopic and molecular technologies, cluj-napoca, romania e-mail: mihaela.mic@itim-cj.ro poly(n-isopropylacrylamide) (pnipa) is a thermo-sensitive hydrogel undergoing a volume phase transition at about of °c close to the body temperature. this volume phase transition is envisaged as a key property for drug binding and release. the purpose of our research is the thermodynamic characterization of the binding of atenolol by pnipa polymeric nanoparticles. the thermodynamic parameters which characterize the binding process are obtained using the isothermal titration calorimetry (itc) as the main investigation technique. when polymeric nanoparticles bind drug molecules, heat is either generated or absorbed depending on the amount of bond molecules and also on the exothermic or endothermic character of the binding process. the heat measurement allows the determination of binding constants, reaction stoichiometry and the thermodynamic profile of the interaction. itc technique has been used to investigate the binding properties of nanoparticles which shrink from the swollen to the collapsed state. the capacity of such nanogels to bind atenolol molecules is directly related to relevant differences between the binding properties in the swollen and in the collapsed state respectively. aggregation study of x-(alkyldimethylamonium)alkylaldonamide bromides p. misiak , b. ró _ zycka-roszak , e. woźniak , r. skrzela , k.a. wilk department physics and biophysics, wrocław university of environmental and life sciences, wrocław, poland, department of chemistry, wrocław university of technology, wrocław, poland sugar-based surfactants are of considerable research interest because they have improved surface and performance properties, reduced environmental impact, and have potential pharmaceutical and biomedical applications. x-(alkyldimethylamonium)alkylaldonamide bromides (c n gab) with different chain lenghs (n = , , , ) belonging to cationic sugar-based surfactants were newly synthesised. the aggregation processes of c n gabs were studied by means of isothermal titration calorimetry (itc), electric conductance method and molecular modelling methods. the critical micelle concentrations (cmc), the degree of micelle ionization (b), the enthalpies (dh m ) and the entropies (ds m ) of micellization as well as the contributions of the headgroups to the gibbs free energies (dg m (hy)) were calculated. the obtained values were compared with those for dodecyldimethylethylammonium bromide and literature data for analogical glucocationic surfactants. the latest compounds differ from c n gab surfactants by substitution of sugar chain by gluco ring. molecular modelling methods were used to relate the molecular properties of the compounds with their experimentally studied properties in solution. this work was supported by grant n n . every year over million people are infected with dengue virus (denv), transmitted by a mosquito (aedes aegypti). this enveloped virus, member of the flaviviridae family, has four distinct serotypes. it has a single stranded positive rna molecule with a single open reading frame that encodes a single poliprotein, which, after appropriate processing by viral and host proteases, gives rise to three structural proteins (c, prm and e) and seven non-structural proteins (ns , ns a, ns b, ns , ns a, ns b and ns ) [ ] . the surface of the immature virion is composed of e and prm heterodimers that are arranged as trimers protruding from the membrane [ ] . the virus is thought to enter the host cell via a receptor-mediated endocytosis, although, if any, the specific dengue receptors have not been described. once inside the cell, the acidified environment inside the endocytic vesicle triggers an irreversible trimerization of the envelope (e) protein, inducing the release of the nucleocapsid (composed of viral rna and multiple copies of c protein) to the cytoplasm, thus starting the infection process, where the poliprotein is translated and processed, originating all viral proteins. considering the structural proteins c and e, these are essential for the viral infection, specifically, protein c is thought to be involved in the viral assembly and specific encapsidation of the genome and protein e (a class ii fusion protein) plays a major role in the fusion process. as recently described by some studies [ ] , protein c is composed of four a helices connected by four short loops and has a highly hydrophobic region forming a concave groove that could interact with lipid membranes and a region with an increased concentration of positive charges, possibly interacting with the viral rna. as for protein e, it is composed of three b stranded domains. it is proposed that the fusion loop is located in domain ii of this protein and the putative receptor binding sites, considered essential for the viral entry, are supposedly located in domain iii. in this work, we describe the identification of the membrane active regions of both these proteins, considering both theoretical studies, hydrophobic moments, hydrophobicity and interfaciality values as well as experimental ones, namely fluorescence spectroscopy, where a fluorescent probe is encapsulated in model membrane systems, and differential scanning calorimetry [ ] . we have found one region in protein c and four regions in protein e with membranotropic activity. this is the first work describing experimentally the putative membrane interacting zones of both these proteins. this work was funded by grant bfu - -bm from ministerio de ciencia y tecnologia, spain, granted to jose villalaín. investigation of membrane-membrane interaction mediated by coiled coil lipopeptides gesa pä hler, andreas janshoff georg-august-university, tammannstrasse , gö ttingen, germany e-mail: gpaehle@gwdg.de specific cellular membrane interaction and fusion are crucial points in vivo which are in eukaryotic cells mediated by snare proteins. the definite mechanism behind these processes is still poorly understood, but the coiled coil formation of a snare core complex consisting of four a-helices seems to generate a fusogenic driving force. this offers the possibility to design a straightforward experimental setup to mimic the complex protein-mediated membrane-membrane interaction by using mere protein fragments or peptides attached to artificial lipid bilayers which self-assemble to a coiled coil structure. in our approach, two artificial three heptad repeat coiled coil forming peptides were synthesized and attached to maleimide functionalized membranes via an in situ-coupling reaction. thus, secondary structure changes, kinetic characteristics and binding energetics were monitored during coiled coil formation with real time ellipsometry, ir and cd spectroscopy. the lipopeptide mediated membrane-membrane interaction itself is investigated by colloidal probe spectroscopy and tirfm. these techniques and the setup of our model system allow screening the energetic and structural properties of variable coiled coil forming peptides, i.e. linker-modified or biologically inspired sequences. enzymatic reactions in nanostructured surfaces: unzipping and cutting the double helix pietro parisse , matteo castronovo , bojana lucic , alessandro vindigni , giacinto scoles and loredana casalis sincrotrone trieste s.c.p.a., trieste, italy, temple university, philadelphia, usa, protein-dna interactions are vital for living organisms. from viruses to plants and humans, the interactions of these two different classes of biopolymers control processes as important and diverse as the expression of genes and the replication, rearrangement, and repair of dna itself. to understand these processes at the molecular level, and to follow changes in cellular pathways due to different kinds of perturbations and/or diseases it is necessary the identification and quantification of proteins and their complex network of interactions. we have exploited the high spatial resolution given by atomic force microscopy to generate dna arrays of variable density by means of nanografting. on such nanostructures, we investigate the mechanism of different enzymatic reactions (from restriction enzymes to helicases). registering with high precision the height variation due to the action of the enzyme onto the engineered dna sequences (in the case of restriction enzymes) or taking advantage of the different mechanical properties of single and double stranded dna (in the case of helicases, where for the first time kinetic data were obtained on recq human helicase), we were able to monitor either the activity and/or the action mechanisms of these two important classes of enzymes. in this study an attempt has been made to investigate the structure, dynamics and stability of cyclic peptide nanotubes (cpnts) formed by the self-assembly of cyclic peptides (cps) using classical molecular dynamics (md) simulation and semiempirical quantum chemistry calculation employing pm . the structure and energetics of monomer and various oligomeric cpnts have been investigated by considering the (cyclo-[(d-ala-l-ala) ]) peptide as the model for cp. various geometrical parameters extracted from the md simulation reveal that the terminal residues are loosely hydrogen bonded to the inner subunits regardless of degree of oligomerization. the hydrogen bonds present in the inner core regions are stronger than the terminal residues. as the degree of oligomerization increases, the stability of the tube increases due to the hydrogen bonding and stacking interactions between the subunits. the results show that the binding free energy increases with the extent of oligomerization and reaches saturation beyond cpnt . in addition, hydrophobic and electrostatic interactions play crucial roles in the formation of cpnts. analysis of both structural and energetics of formation of cpnts unveils that the selfassembly of dimer, trimer and tetramer cpnts are the essential steps in the growth of cnpts. monolayers on a langmuir trough constitute a great biomimetic model to characterize protein-protein or protein-lipid interaction, where the physical state of the interfacial layer is completely controlled. we present here three studies performed on monolayers, with a wide panel of experimental (optical, spectroscopical, rheological) techniques. i) surface properties and conformation of nephila clavipes spider recombinant silk proteins (maspi and masp ) was studied at the air-water interface: we show that the mechanism of assembly of both proteins is different, although both proteins share the same sequence pattern and a close hydrophobicity. they both exhibit a certain propensity to form b-sheets that may be important for the efficiency of the natural spinning process. ii) the dystrophin molecular organization and its anchoring in a lipidic environment depend on the rod fragment used and on the lipid nature. moreover the interaction is guided by the lateral surface pressure. this lipid packing variation is essential to understand the role of the dystrophin during compression-extension cycle of the muscle membrane. iii) we evidence that non additive behavior of mixtures of food globular proteins leads to enhanced foaming properties or to self assembled objects. nucleolar-targeting peptides (nrtps) were designed by structural dissection of crotamine, a toxin from the venom of a south-american rattlesnake. at lm concentration, nrtps penetrate different cell types and exhibit exquisite nucleolar localization. the aim of this work was to pursue with the study of nrtps molecular mechanism for translocation, as well as to determine the ability of nrtp to delivery large molecules into cells. for the translocation experiments, rhodamine b-labeled nrtps were used and tested with giant multilamellar vesicles. confocal microscopy results show that there is an efficient translocation across model membranes. high levels of intracellular peptide were also seen in different cell lines and pbmc, soon after incubation with nrtp. furthermore, a conjugate of nrtp (nrtp c) bound to b-galactosidase was prepared by chemical synthesis and tested in hela cells. this conjugate maintains enzymatic activity and is stable at °c for several days. the work done so far with this new family of cell-penetrating peptides revealed strong interaction and translocation with lipid model systems. moreover, successfully cellular delivery of bgalactosidase was observed and quantified. interaction of zinc phthalocyanine with ionic and non ionic surfactants: uv-vis absorption and fluorescence spectroscopy for application in photodynamic therapy m. p. romero, s. r. w. louro physic department, pontifícia universidade cató lica de rio de janeiro puc-rio, brazil among the second-generation photosensitizer (ps) developed for the treatment of neoplastic diseases by photodynamic therapy (pdt), metallo-phthalocyanines (mpc) have been proposed as an alternative to the currently used ps in clinical application. unsubstituted mpc are not soluble in physiological solvents and their in vivo administration relies upon their incorporation into carriers or their chemical conversion into water-soluble dyes by the attachment of selected substituents. in this work, uv-vis absorption and fluorescence spectroscopy were used to study the ability of different micelles for dispersing zinc phthalocyanine (znpc). the following surfactants were tested: sds, ctab, hps, tween , tween , and pluronic f . znpc has low solubility in virtually all solvents, but dmf and dmso are observed to dissolve znpc in concentrations of the order of . and . mmol/l, respectively. stock solutions of znpc in dmf and dmso were prepared. micelles of the different surfactants containing znpc were prepared by dissolving in aqueous medium (milli-q water or phosphate buffer ph . ) small amounts of the stock solutions previously mixed with each surfactant. the amounts of each surfactant were calculated to give an average ratio of one znpc molecule per micelle in the final solution. the absorption and fluorescence spectra of znpc in the micellar systems were obtained, and were observed to change in time. immediately after dissolution the spectra are characteristic of monomeric znpc, suggesting formation znpccontaining nanoemulsions with the mixture of znpc-organic solvent in the hydrophobic region of the micelle. since dmso and dmf are miscible with water, the solvent diffuses out of the micelle and znpc stays inside the micelle in a monomeric or aggregated form. the different surfactants lead to different time evolution of znpc aggregation. aggregation lifetimes vary from one hour, in the case of pluronic f , to more than twelve hours, in the case of ctab and hps. it was observed that the ionic surfactants were more efficient than non ionic ones for monomeric delivery of znpc . work partially supported by cnpq, inami and faperj. nucleobase-containing peptides are an important class of molecules comprising both artificial (synthetic nucleopeptides) and natural (peptidyl nucleosides and willardiine-containing peptides) compounds characterized in many cases by interesting biological properties. , in this work, we report a spectroscopic study on the properties of a chiral nucleobase-bearing peptide obtained by chemical synthesis starting from commercial sources. the findings of this research strongly encourage further efforts in the field of the use of nucleopeptides as supramolecular assembling systems and open the way to novel drug delivery approaches based on nucleobase recognition. conformational plasticity. their structure depends tremendously on their local environment and confinment, and may accommodate several unrelated conformations, that are a strong challenge for the traditional characterizations of structure, supramolecular assembly and biorecognition phenomena. atomic force microscopy (afm) has been successfully exploited for both highly controllable nanolithography of biomolecules and for biorecognition studies, such as oriented prion protein -antibody interaction (sanavio et al., acs nano ( ) ( ): , bano et al. nano lett ( ) ( ): - ). here, we report different strategies for selective, oriented confinement of alphasynuclein at the nanoscale for sensitive and accurate direct detection, via precise topographic measurements on ultraflat surfaces, of biomolecular interactions in confined assemblies. a new class of cell penetrating peptides (cpps) was generated by splicing the ( - ) and ( - ) segments of crotamine, a toxin from crotalus durissus terrificus venom [ ] . as they localize preferably on the nucleolus, these novel cpps were named nucleolar-targeting peptides (nrtps). the extent of nrtp partition to zwitterionic (popc; popc:cholesterol : ) and anionic (popg; popc:popg : ) lipid vesicles was studied following the intrinsic tyr or trp fluorescence of the peptides. the partition curves into popc zwitterionic vesicles were characterized by downward slopes and higher partition coefficients (k p * - ). for pure popg, an upward curve and smaller partition coefficient point out for a different type of membrane-peptide interaction. popc:popg membranes present characteristics of both types of interaction. from red edge excitation shift and quenching experiments similar conclusions were attained. leakage assays ruled out lipid vesicle disruption by crotamine or nrtps. further studies on nrtp cellular translocation mechanism and large molecule delivery are currently in progress. dystrophin is essential to skeletal muscle function and confers resistance to the sarcolemma by interacting with cytoskeleton and membrane. we characterized the behaviour of dys r - , a five spectrin-like repeats from the central domain of human dystrophin, in the presence of liposomes and monolayers as membrane models. interaction of dys r - depends on the lipid nature, anionic or zwitterionic, with suvs, and on the lipid packing when comparing luvs to suvs. lateral pressure of lipid monolayers modifies the protein organization and leads dys r - to form a regular network as revealed by afm. trypsin proteolysis assays show that the protein conformation is modified following its binding to monolayer and suvs. label free quantification by nano-lc/ms-ms allowed identifying the helical amino acid sequences in repeats and that are involved in the interaction with anionic suvs. results indicate that dys r - constitutes a part of dystrophin that interacts with anionic as well as zwitterionic lipids and adapts its interaction and organization depending on lipid-packing and lipid nature. we provide here strong experimental evidence for a physiological role of the central domain of dystrophin on sarcolemma scaffolding through modulation of lipid-protein interactions. extracellular matrix proteins. overexpression of the mmps has been associated with a variety of diseases ranging from periodontal disease and arthritis to tumor invasion and metastasis. the majority of the more powerful synthetic inhibitors of mmps incorporate a hydroxamate group, but exhibit low selectivity and are toxic. in a recent modeling study, astaxanthin (ast), a carotenoid with potent antioxidant property, has been shown to be a potential inhibitor of mmp- function by occupying a binding site near the active center of the enzyme (bika di et al. ). in our ongoing project, we investigate the binding of ast to the catalytic domain of mmps using biochemical and ultimately crystallization to validate the proposed action of ast. along these lines, the catalytic domain of mmp- (cdmmp- ) was expressed in e.coli bl (de ) codon-plus and refolded using a novel effective refolding method. our results reveal that ast has a potent inhibitory effect on cdmmp- activity, however, determination of ic % or k i is difficult due to fast oxidation and structural instability of ast. ongoing work aims at optimizing the inhibition conditions and improving the refolding yield to allow analyzing structure and function of the ast-bound mmp- in more detail. hyaluronic acid (hyaluronan, ha) is a linear polysaccharide with a molar mass in the range of to da and is built from alternating units of glucuronic acid and n-acetylglucosamine. synthesized in the cellular plasma membrane, it is a network-forming and space-filling component in the extracellular matrix of animal tissues. here, we create hyaluronic acid films atop a porous alumina substrate, where they act as a barrier for macromolecular transport depending on their length and geometry. the geometry of the hyaluronic acid switches between a fully stretched and a mushroomlike state and is dependent on the concentration of hyaluronic acid. to bind hyaluronic acid selectively atop the nanoporous anodic aluminum oxide (aao), the aao is orthogonally functionalized by silane chemistry. by means of time resolved optical waveguide spectroscopy (ows) the transport of macromolecules, e.g. avidin, across the hyaluronic acid barriers can be recorded as a function of the pore diameter and hyaluronic acid concentration in a time resolved and label free manner. confocal laser scanning microscopy (clsm) provides an alternative method to investigate the orthogonal functionalization of the pores and to elucidate whether a molecule can cross the barrier at the pore entrance. we functionalized gold surfaces with a hydroxy-terminated self-assembled thiol monolayer exposing an adjustable fraction of biotin moieties. [ ] by in-situ acetylation or fluorination, surface properties could be fine-tuned to different protein immobilization scenarios. using streptavidin as a linker protein, immobilization of human abcc [ ] in liposomal and planar bilayer systems was possible. abcc -containing proteoliposomes doped with a biotinylated anchor lipid were successfully tethered to our streptavidin-coated surfaces. biotinylation of the extracellular glycosylation of abcc allowed direct immobilization with inside-up orientation and subsequent assembly of a lipid bilayer. outside-up orientation was achieved by exploiting the c-terminal histidin tag of recombinant abcc for immobilization via ni + and biotinnitrilotriacetate. all systems were thoroughly characterized by quartz crystal microbalance, atomic force microscopy and surface plasmon resonance techniques with respect to monitoring abcc mediated substrate transport in real time. because of its role in the apoptotic pathway, conformational transitions of cytochrome c (cyt c) have gain on interest. in native cyt c, met and his residues serve as heme axial ligands. cyt c interaction with the membrane causes the disruption of the iron-met bond. this allows the binding of others endogenous ligands forming alternative low spin species , or induces peroxidase activity through the formation of a five coordinated high spin iron specie. acquisition of this peroxydase activity by cyt c has been shown to be a key stage before its release from the mitochondria . in order to study these non native low spin species by checking the possible amino acids able to bind the human cyt c heme, different mutants have been designed and produced: h q, h n, and the double one h q/h n. sds in countries where seafood is an integrate part of the diet, fish are among the most common food allergen sources. the major fish allergen parvalbumins are abundant in the white muscle of many fish species. parvalbumin belongs to the family of ef-hand proteins and has a globular shape containing six helical parts. high pressure is known to unfold proteins. we performed high pressure ftir experiments, to explore the p-t phase diagram of cod parvalbumin, gad m , to test the possibility of its inactivation by high pressure treatment. the infrared spectrum of parvalbumin is characteristic for the helical conformation, in agreement with the crystal structure. a marked transition in the structure of the parvalbumin was observed with the central point of . gpa (at room temperature). the amide i position shifts to a wavenumber which is between the helical and the unfolded position. we assign this change to a native-molten globule transition. it was reversible as seen from the infrared spectra. it has been proven in the past that reflectometric interference spectroscopy (rifs) is a powerful measurement system for the detection of protein-protein interactions . we present here the development of a reflectometric sensor which allows for the detection of membrane-protein interactions in the micromolar regime. in this study we employ two different instrumental assemblies. the first installation enables direct detection and quantification of the interaction of membrane proteins with solid supported lipid bilayers. in the second assembly the original instrument is combined with an upright fluorescence microscope. the advantages of this installation are the direct optical control of the experiment as well as a smaller sensing area. the set-up allows for the detection of interactions on lipid-patches of just several micrometers in diameter. the aim of this work is an experimental system that enables the measurement of transport processes through lipid membranes. we attempt to achieve this by covering a closed porous substrate with a lipid membrane. the first steps to reach this goal were done by spanning membranes over anodized aluminum oxide substrates. initiation of actin polimerization in cells requires nucleation factors. a pointed-end-binding protein of f-actin -the lei-omodin -acts as a strong filament nucleator in muscle cells. the dynamical, structural and kinetic properties of a protein can provide important information to understand the intramolecular events underlying its function. we are interested in how does the leiomodin regulate the actin polimerization. our aim is to determine the dissociation constant of the actin-leiomodin complex, and study a possible side-binding effect of the leiomodin . the cardiac leiomodin of rattus norvegicus is a kda molecular weight protein, which contains a kda n-terminal, a kda leucin reach repeat (lrr) and a kda c-terminal region. the n-term and the lrr regions are together tropomodulin homologues. we expressed the wild type the n-term+lrr the lrr+c-term and the c-term protein fragments by using a ptyb vector that contains an amplicillin resistance gene. the expression of the proteins was carried out with the twin-cn (ne bio-labs) kit, which is a chitin-intein self-cleavage and purification system. the nucleation activity of leiomodin and the polimerization speed of actin in the presence of tropomyosin and leiomodin were studied by using pyrene-actin polimerization assay. we measured the stoichiometric, conformational and kinetic properties of the leiomodin-actin complexes with co-sedimentation assay, fluorescence spectroscopic and rapid-kinetic methods. the results showed that the rate of actin polimerization depended on the leiomodin concentration. the nucleator activity of leiomo-din was ionic strength dependent. the data also confirmed that leiomodin is a side-binding and pointed-end binding protein of f-actin. the binding of leiomodin to the sides of the actin filaments was slower than to the pointed-end of the f-actin. the structure of f-actin was changed by the sidebound leiomodin . these observations will contribute to the better understanding of the development and function of thin filaments in cardiac and other muscle tissues. leukemias are one of the most common malignancy worldwide. there is a substantial need for new chemotherapeutic drugs effective against these diseases. doxorubicin (dox), used for treatment of leukemias and solid tumors is poorly efficacious when administered systemically at conventional doses. therefore, in our study, to overcome these limitations, we used transferrin (trf) as a drug carrier. we compared the effect of dox and doxorubicin-transferrin conjugate (dox-trf) on human leukemic lymphoblasts (ccrf-cem). the in vitro growth-inhibition test, xtt assay, indicated that dox-trf was more cytotoxic for leukemia cells than dox alone. in our researches we also evaluated the alternations of mitochondrial transmembrane potential (dw m) , and production of reactive oxygen species (ros). we monitored the dw m using dye jc- ( , ', , '-tetrachloro- , ', , '-tetraethylbenzimidazolcarbocyanine) . the level of ros was studied using the fluorescent probe dcfh -da ( ', 'dichlorodihydrofluorescein diacetate). the results demonstrate that dox-trf induced, decrease of mitochondrial membrane potential and significantly higher production of ros compared with dox treated cells. moreover, all these results seem to be correlated with dna fragmentation analyzed by dna ladder. the tested processes were partially inhibited by the antioxidant, n -acetylocysteine (nac). the changes induced by dox-trf conjugate and free drug were suggest the different mechanism of action of dox alone and conjugated with transferrin. time-resolved detection of protein-protein interaction masahide terazima kyoto university, kyoto, - , japan revealing molecular mechanism of a protein reaction has been a central issue in biophysics. for that purpose, a variety of time-resolved spectroscopic methods have been developed. however, most of them can monitor only dynamics associated with an optical transition and it has been very difficult to trace processes without optical transition. we used the pulsed laser induced transient grating (tg) method to study spectrally silent reactions of various proteins in time-domain. here we will show studies on pixd. pixd is a kda short protein which consists of the bluf domain and additional short helices, and is involved in phototactic movement. the photochemical reaction studied by absorption spectroscopy revealed that this protein exhibits typical photochemistry of the bluf proteins. the red-shifted intermediate is generated within a ps. the spectrum does not change after this initial reaction, and returns back to the dark state with a time constant of s at room temperature. we studied the reaction of this protein by our method and found that the proteinprotein interaction is drastically changed during the reaction. the details and the biological meaning will be presented. human ileal bile acid-binding protein (i-babp) plays a key role in the enterohepatic circulation of bile salts. previously we have shown that the protein has two binding sites and triand dihydroxy bile salts bind with strong and moderate positive cooperativity, respectively. positive cooperativity is thought to be related to a slow conformational change in the protein. our current study is directed at the structural and dynamic aspects of molecular recognition in human i-babp using nmr spectroscopy and other biophysical techniques. as a first step in the investigation, n relaxation nmr experiments have been employed to characterize the backbone motion in the apo and holo protein on a wide range of timescales. our results show a moderately decreased ps-ns flexibility in the ligated protein, with most significant ordering near the portal region. in addition, the measurements indicate a slow ls-ms fluctuation at four distinct segments in the apo protein, a motion not observed in the doubly-ligated form at room temperature. our studies support the hypothesis of an allosteric mechanism of binding cooperativity in human i-babp. to shed more light on the molecular details of the binding mechanism, a site-directed mutagenesis study is in progress. cationic porphyrin-peptide conjugates were recently shown to enhance the delivery of peptide moiety to the close vicinity of nucleic acids but their interaction with dna is not yet studied. we synthesized two cationic porphyrin-peptide conjugates: tetra-peptides were linked to the tri-cationic meso-tri( -n-methylpyridyl)-mono-( -carboxyphenyl)porphyrin and bi-cationic meso- , -bis( -n-methylpyridyl)- , di-( -carboxyphenyl)porphyrin. dna binding of porphyrins, and their peptide conjugates was investigated with comprehensive spectroscopic methods. evidences provided by the decomposition of absorption spectra, fluorescence decay components, fluorescence energy transfer and cd signals reveal that peptide conjugates of di-and tricationic porphyrins bind to dna by two distinct binding modes which can be identified as intercalation and external binding. the peptide moiety does not oppose the interaction between the dna and cationic porphyrins. we compared the effect of complexation on structural stability of dna and nucleoprotein complex : hela nucleosomes and t phage. uv and cd melting studies revealed that the porphyrin binding increases the melting temperature of dna and destabilizes the dna protein interaction in the nucleosomes but not in the t phage. the wide nucleotide specificity of -phosphoglycerate kinase (pgk) allows its contribution to the effective phosphorylation (activation) of nucleotide-based pro-drugs against hiv. here the structural basis of the nucleotide-pgk interaction is characterised in comparison to other kinases, namely pyruvate kinase (pk) and creatine kinase (ck) by enzyme kinetic and structural modelling studies. the results evidenced favouring the purine vs. pyrimidine base containing nucleotides for pgk rather than for pk or ck. this is due to the exceptional ability of pgk in forming the hydrophobic contacts of the nucleotide rings that assures the appropriate positioning of the connected phosphate chain for catalysis. the unnatural l-configurations of the nucleotides (both purine and pyrimidine) are better accepted by pgk than either by pk or ck. further, for the l-forms the absence of the ribose oh-groups with pgk is better tolerated for the nucleotides with purine rather than pyrimidine base. on the other hand, positioning the phosphate chain of both purines and pyrimidines with l-configuration is even more important for pgk, as deduced from the kinetic studies with various nucleotide-site mutants. these characteristics of the kinase-nucleotide interactions can provide a guideline in drug-design. the role of the enzyme types atp-ases in the muscle contraction g. vincze-tiszay , j. vincze , e. balogh hheif, budapest, hungary, nové zá mky, slovakia the myofibrilla assuring muscle contraction gains energy to the slipping in mechanisms and the degree of efficiency of this process will decisively be determined by the velocity of recombination of the atp molecule. in this there play a particular part the na + -k + -atp-ase and mg ++ -atp-ase enzymes. chemical reactions taking place in the living organism are catalyzed by enzymes, so the recombination from adp to atp, too. this transport process can be modelled from the energetic point of view on the basis of the general transport theorem through the following formula: grad a x dx dt: from the point of view of muscle contraction it is of interest that, dependent from the type of the motions whether the length of time is very short, some seconds, or we can speak about a long lasting process. in the first case one can compare the decomposition of the atp with the avalanche effect while in the spot. its degree of efficiency is determined by the migration and linkage velocity of the ions. conclusion: the degree of efficiency of the muscle contraction is determined by the quantities of the two enzymes (na + -k + -atp-ase and mg ++ -atp-ase) as related to each other. [ ] . experiments with deletion mutants have shown that the aminoterminal domain contains a beta sheet with an ordered array of acidic residues, which mediates the attachment to basic calcium phosphates [ , ] . the inhibition is based on the formation of nanometer-sized, spherical mineral-fetuin-a colloids, denoted as calciprotein particles (cpps) [ , ] . the initially formed cpps show hydrodynamic radii in the range of nm and are only transiently stable. after a distinct lag time, they are subject to a morphological change towards larger prolate ellipsoids with hydrodynamic radii of - nm [ ] . in this context, we studied the role of fetuin-a in the formation and ripening of cpps. on the one hand, dynamic light scattering (dls) was used to study the influence of temperature, fetuin-a concentration and mineral ion supersaturation on the kinetics of cpp ripening [ ] . on the other hand, the protein fetuin-a was investigated by means of small angle x-ray scattering (saxs) and fluorescence correlation spectroscopy (fcs degradation of the mrna cap (mrna ' end) by dcps (decapping scavenger) enzyme is an important process of the gene expression regulation, but little is known about its mechanism. the biological role of dcps and its potential therapeutic applications, e.g. as a novel therapeutic target for spinal muscular atrophy, make it an interesting object for biophysical investigations. the ability of dcps to act on various short capped mrnas will be presented. we have examined the substrate specificity and binding affinity of the enzyme in a quantitative manner, employing experimental physics' resources, such as atomic force microscopy (afm) and fluorescence spectroscopy for enzyme kinetics and timesynchronized-titration method (tst ) . in this study we extended the application of mqd-ihc to investigate potential biomarkers associated with prostate cancer (pca) invasiveness and lethal progression. objectives: to establish a mqd-ihc protocol using qd light-emitting nanoparticles ) to detect the expression/activation of critical cell signaling proteins at the single cell level; ) to image the plasticity and lethal progression of human pca with specific emphasis on emt and c-met signaling; and ) to examine the utility of mqd-ihc in clinical pca specimens to determine its invasion ability and predict its metastatic capability. results: we analyzed the co-expression and activation of osteomimicry associated biomarkers: b -microlgobulin (b -m), phosphorylated cyclic amp responsive element binding protein (pcreb) and androgen receptor (ar) in , cells from localized pca tissue areas (gleason and ) of patients with known metastatic status. the overall median % triple positive for b -m + /pcreb + /ar + cells was . %. the median triple positive for the samples with metastatic potential was % compared with those without metastatic potential (median = %); p = . by a wilcoxon rank sum test. the results were confirmed in pca bone metastatic specimens. we also investigated the c-met signaling in castration-resistant human pca model or crpc xenografts and the clinical pca specimens and found that the downstream signal components including pakt and mcl- were activated. conclusion: to validate our findings, additional clinical specimens with confirmed survival data will be analyzed and the cell-signaling-network-based mqd-ihc will be automated by vectra image analysis system in a high throughput manner with the hope to predict the lethal progression of pca prior to clinical manifestation of distant metastases. protein ligand binding is an important field of biopharmaceutical research. there are many techniques for quantitative determination of the ligand binding. the combination of isothermal titration calorimetry (itc) and thermal shift assay provides a robust estimate of the binding constant. many binding reactions are coupled to the absorption or release of protons by the protein or the ligand, conformational changes of the protein and other processes. to correlate the structural features of binding with the energetics of binding one needs to carry out a detailed thermodynamic study of the binding reaction and to determine dependencies such as ph, buffer and temperature. here we present a detailed thermodynamic description of radicicol binding to human heat shock protein hsp and determined proton linkage contributions to observed binding thermodynamics. we calculated the pk a of the group responsible for proton linkage, the protonation enthalpy of this group and intrinsic thermodynamic parameters for radicicol binding. the intrinsic enthalpy of radicicol binding to hsp is one of the largest enthalpies observed for any protein -ligand binding. the structural features responsible for such large binding enthalpy and very favorable intrinsic binding gibbs free energy are discussed. neuronal systems and modelling o- optogenetic electrophysiology walther akemann, amelie perron, hiroki mutoh, and thomas knö pfel laboratory for neuronal circuit dynamics, riken brain science institute, japan the combination of optical imaging methods with targeted expression of protein-based fluorescent probes enables the functional analysis of selected cell populations within intact neuronal circuitries. we previously demonstrated optogenetic monitoring of electrical activity in isolated cells, brain slices and living animals using voltage-sensitive fluorescent proteins (vsfps) generated by fusing fluorescent proteins with a membrane-integrated voltage sensor domain. however, several properties of these voltage reporters remained suboptimal, limiting the spatiotemporal resolution of vsfpbased voltage imaging. a major limitation of vsfps had been a reduced signal-to-noise ratio arising from intracellular aggregation and poor membrane targeting upon long-term expression in vivo. to address this limitation, we generated a series of enhanced genetically-encoded sensors for membrane voltage (named vsfp-butterflies) based on a novel molecular design that combines the advantageous features of vsfp s and vsfp s with molecular trafficking strategies. the new sensors exhibit faster response kinetics at subthreshold membrane potentials and enhanced localization to neuronal plasma membranes after long-term expression in vivo, enabling the optical recording of action potentials from individual neurons in single sweeps. vsfp-butterflies provide optical readouts of population activity such as sensoryevoked responses and neocortical slow-wave oscillations with signal amplitudes exceeding % dr/r in anesthetized mice. vsfp-butterflies will empower optogenetic electrophysiology by enabling new type of experiments bridging cellular and systems neuroscience and illuminating the function of neural circuits across multiple scales. opsin molecules are a burgeoning new tool for temporally precise neuronal stimulation or inhibition. opsin properties are commonly characterized in cell culture or acute brain slice preparations using whole cell patch clamp techniques, where neuronal membrane voltage is fixed at the resting potential. however, in vivo, where neurons are firing action potentials, opsins are exposed to large fluctuations in membrane voltage and transmembrane ionic concentrations which can influence opsin function. in the case of implanted light delivery devices, stimulation light power varies as a function of brain tissue volume. we therefore investigated the stability of opsin properties across a variety of in vivo-like stimulation conditions. we find that off-kinetics of excitatory opsins vary significantly with holding membrane potential; channelrhodopsin (chr ) slowing with depolarisation and chief (chr /chr hybrid) in contrast, accelerating. new chr point mutation variants demonstrate stability across all membrane potentials. we additionally explore responses to initial and subsequent light pulses and find that chief has the unique property of accelerating kinetics after the first light stimulation. inhibitory opsins vary in sensitivity to light in a manner which correlates with their off-kinetics. slower opsins, such as mac (leptosphaeria maculans), have higher sensitivity at low light power densities, saturating early relative to fast inhibitory opsins such as arch (archaeorhodopsin) and nphr (halorhodopsin). we discuss the relative merits of stability versus versatility of opsins under variable stimulation conditions. it has been previously shown that overexpression of ndm ncrna in a sknbe -derived neuroblastoma (nb) cell line leads to cell differentiation, with a decrease of malignant potential. here we use the patch-recording technique to characterize the ionic channel apparatus of nb cells expressing ndm at its basal level (mock cells) or at . fold higher levels (s cells). the two cell lines shared very similar pools of functional k channels, but s cells displayed larger ttx-sensitive na currents and were able to generate action potentials, while mock cells were not. in addition, while mock cells barely express functional gaba receptors, in the majority of s rapid application of gaba elicited a current with a ec = . lm; this current was antagonized by bicuculline ( lm) and potentiated by zaleplon (ec = nm). in mock cells, real time pcr evidenced a high level of gaba a a subunit, while in s cells a significant expression of a and a was detected, whereas a mrna was downregulated by %, confirming the development of functional gaba a receptors. in the same cell lines, the presence of specific markers and the secretion of specific cytokines confirmed that ndm expression leads to a differentiation process toward a neuron-like, rather than glial-like, phenotype. was planned therefore to reconstitute a model of brain tumors in rats by orthotopic implantation of xenogenic transformed human cells. iron is important element used for chemical reaction catalysis and physiological cell functions. the reason of iron deposition is still unknown. under conditions prevailing in human brain it is expected the formation of an amorphous or minute crystalline phase. we used light, scanning (sem) and transmission electron microscopy (tem), energy-dispersive microanalysis, electron diffraction and electron paramagnetic resonance (epr) for investigation of iron deposits in globus pallidus of human brain. sem revealed iron rich particles with na, si, p, s, cl, ca and cu around glial cells. tem revealed bumpy, solid particles of platy and sometimes rounded shape with the size of lm to lm. these ones were identified as hematite. epr measurements showed the presence of fe(iii) and cu(ii), but little amount of fe(ii) can not be excluded. we consider low-temperature process of hematite formation in human globus pallidus in aqueous environment influenced by organic and inorganic factors. chemical processes leading to nanoparticles formation can be associated with neurodegenerations such as alzheimer or parkinson disease. over the past years our understanding of the basic biophysical mechanisms governing the spatio-temporal dynamics of neuronal membrane potentials and synaptic efficacies has significantly expanded and improved. much research has focussed on how ionic currents contribute to the generation and propagation of action potentials, how subthreshold signals propagate along dendritic trees, how the active properties of dendrites shape the integration of incoming signals in a neuron, and how pre-and postsynaptic activities â and potential heterosynaptic effects a determine the way synaptic efficacies change on the short-and long-term. yet, despite these advances, there have been no systematic efforts to relate the basic dynamical repertoire of neurons to the computational challenges neural circuits face, and in particular to explain systematically how the biophysical properties of neurons are adapted to process information efficiently under the constraints of noise and uncertainty in the nervous system. as an initial step in this direction, i will show how various biophysical properties of neurons, in particular short-term synaptic plasticity and dendritic non-linearities, can be seen as adaptations to resolve an important bottleneck in neuronal information processing: the loss of information entailed by the conversion of analogue membrane potentials to digital spike trains. the optogenetic toolbox has greatly expanded since the first demonstration of genetically-targeted optical manipulation of neural activity. in addition to the cation channel channelrhodopsin- (chr ), the panel of excitatory opsins now includes an array of chr variants with mutations in critical residues, in addition to other, related cation channels, and channel hybrids. the inhibitory opsin panel has similarly expanded beyond the first-described halorhodopsin (nphr), a chloride pump, to include trafficking-enhanced versions of nphr as well as the proton pumps mac and arch. while the expansion of available opsins offers researchers an increasingly powerful and diverse selection of tools, it has also made it increasingly difficult to select the optimal tool for a given experiment. one cannot extract a comparison of opsins from the current literature, since studies differ across multiple variables known to contribute to opsin performance (e.g. expression method, light power density, stimulation protocols, etc.). here, we provide the first empirical comparison of both excitatory and inhibitory opsins under standardized conditions. furthermore, we identify the set of parameters that describe the properties of an opsin in a way that is maximally relevant for biological application. o- subcellular compartment-specific distribution of voltage-gated ion channels zoltan nusser institute of experimental medicine, hungarian academy of sciences, budapest, hungary voltage-gated na + (nav) channels are essential for generating the output signal of nerve cells, the action potential (ap). in most nerve cells, aps are initiated in the axon initial segment (ais). in vitro electrophysiological and imaging studies have demonstrated that dendritic nav channels support active backpropagation of aps into the dendrites, but the subunit composition of these channels remained elusive. here, i will present evidence for the somato-dendritic location of nav channels in hippocampal pyramidal cells (pcs). using a highly sensitive electron microscopic immunogold localization technique, we revealed the presence of the nav . subunit in pc proximal and distal dendrites, where their density is -fold lower than that found in aiss. a gradual decrease in nav . density along the proximo-distal axis of the dendritic tree was also detected. we have also investigated the subcellular distribution of kv . voltage-gated k + channel subunit and found a somato-dendritic localization. in contrast to that of nav . channels, the density of kv . first increases then decreases as a function of distance from the somata of pcs. such subcellular compartment-specific distribution of voltage-gated ion channels increases the computational power of nerve cells. keywords: memory, extra cellular matrix, random walk we expose first a biological model of memory based on one hand of the mechanical oscillations of axons during action potential and on the other hand on the changes in the extra cellular matrix composition when a mechanical strain is applied on it. due to these changes, the stiffness of the extra cellular matrix along the most excited neurons will increase close to these neurons due to the growth of astrocytes around them and to the elastoplastic behavior of collagen. this will create preferential paths linked to a memory effect. in a second part, we expose a physical model based on random walk of the action potential on the array composed of dendrites and axons. this last model shows that repetition of the same event leads to long time memory of this event and that paradoxical sleep leads to the linking of different events put into memory. myelinated nerve fibres were studied with fluorescent microscopy and laser interference microscopy. ca + redistribution during prolonged stimulation, changes in morphology and rearrangement of cytoplasmic structures were compared in normal conditions and after membrane modification by lysolecithin and methyl-b-cyclodextrin. lysolecithin is a detergent known to provoke demyelination, and methyl-b-cyclodextrin extracts cholesterol from membranes. cholesterol extraction could lead to disruption of membrane caveolae-like microdomains or ''rafts'' and solubilisation of different proteins connected to them. our data suggest that methyl-b-cyclodextrin and lysolecithin lead to different changes in morphology and distribution of cytoplasmic structures. the effect was different for different regions of the nerve (node of ranvier, paranodal and internodal regions). the agents also altered the kinetics of ca + response to stimulation in myelinated fibres. extracellular carbonic anhydrase contributes to the regulation of ca + homeostasis and salivation in submandibular salivary gland nataliya fedirko, olga kopach, nana, voitenko lviv national university, human and animal physiology, lviv, ukraine the maintenance of ph in the oral cavity is important for the oral heath since even a minor drop in ph can result in dental caries and damage to the teeth. submandibular salivary gland (smg) is main source of fluid and electrolytes enriched saliva therefore its core for oral ph homeostasis. smg secretion is activated by acetylcholine (ach) in [ca + ] idependend manner and accompanied with oral ph acidic shifts. ph shifts could be due to changes in buffering capacity that is regulated by carbonic anhydrase (ca). despite the expression of different subtypes of ca in smg the role of ca in the regulation of smg function is unclear yet. we found that ca inhibition by benzolamide (bz) decreased of fluid secretion in vivo extracellular na + concentration in situ. the latter confirm the ability of ca to modify both primarily and final saliva secretion. we also found correlation between the secretion and ca + -homeostasis since bz-induced decrease of: in striated muscle ca + release from the sarcoplasmic reticulum (sr) occurs when ryanodine receptors (ryr-s) open either spontaneously or upon the stimulation from dihydropyridine receptors that are located in the adjacent transverse-tubular membrane and change their conformation when the cell is depolarized. recent observations demonstrated that muscles from animal models of ptdinsp phosphatase deficiency suffer from altered ca + homeostasis and excitation-contraction coupling, raising the possibility that ptdinsp-s could modulate voltage-activated sr ca + release in mammalian muscle. the openings of a single or a cluster of ryr-s can be detected as ca + release events on images recorded from fibres loaded with fluorescent ca + indicators. to elucidate the effects of ptdinsp-s on ca + release events, images were recorded from skeletal muscle fibers enzymatically isolated from the m. flexor digitorum breavis of mice utilizing a super-fast scanning technique. a wavelet-based detection method was used to automatically identify the events on the images. three different ptdinsp-s (ptdins p, ptdins p, and ptdins( , )p) were tested. all these ptdinsp compounds decreased the frequency of spontaneous ca + release events. supported by the hungarian national science fund (otka ), té t. calcium sparks elicited by mmol/l caffeine and by a depolarization to - mv were recorded at high time resolution on both x-y ( frames/s) and line-scan images ( lines/ ms) on intact skeletal muscle fibers of the frog. while a typical spark appeared in one frame only, . and . % of spark positions overlapped on consecutive frames following caffeine treatment or depolarization, respectively. while both caffeine and depolarization increased the frequency of sparks, as estimated from x-y images, the morphology of sparks was different under the two conditions. both the amplitude (in df/f ; . ± . vs. . ± . ; n = vs. ; mean ± sem, p \ . ) and the full width at half maximum (in lm; parallel with fiber axis: . ± . vs. . ± . ; perpendicular to fiber axis: . ± . vs. . ± . ) of sparks was significantly greater after caffeine treatment than on depolarized cells. these observations were confirmed on sparks identified in line-scan images. in addition, x-t images were used to analyze the time course of these events. calcium sparks had significantly slower rising phase under both conditions as compared to the control. on the other hand, while the rate of rise of signal mass was decreased after depolarization, it increased in the presence of caffeine. prolonged depolarisation of skeletal muscle cells induces entry of extracellular calcium into muscle cells, an event referred to as excitation-coupled calcium entry. skeletal muscle excitation-coupled calcium entry relies on the interaction between the , dihydropyridine receptor on the sarcolemma and the ryanodine receptor on the sarcoplasmic reticulum membrane. in this study we exploited tirf microscopy to monitor with high spatial resolution excitationcoupled calcium entry (ecce) in primary coltures of human skeletal muscle cells harbouring mutation in the ryr gene linked to malignant hyperthermia and central core disease. we found that excitation-coupled calcium entry is strongly enhanced in cells from patients with central core disease compared to individuals with malignant hyperthermia and controls. in addition, excitation-coupled calcium entry induces generation of reactive nitrogen species and causes the nuclear translocation of nfatc . the activation of nfatc dependent genes is consistent with an increase of the il secretion from primary coltures human myotubes from ccd patients and with fibre type predominance of skeletal muscle of ccd patients. membrane lipids, microdomains & signalling p- ftir and calorimetric investigation of the effects of trehalose and multivalent cations on lipid structure sawsan abu sharkh, jana oertel, and karim fahmy division of biophysics, institute of radiochemistry, helmholtz-zentrum dresden-rossendorf, germany e-mail: s.sharkh@hzdr.de the structure of membrane lipids is of fundamental importance for the integrity of cell and organelle membranes in living organisms. membrane lipids are typically hydrated and their headgroup charges counter-balanced by solvated ions. consequently, water loss can induce severe structural changes in lipid packing (lyotropic transitions) and can lead to the damage of lipid membranes even after rehydration. this can be one out of several factors that affect the viability of organisms undergoing desiccation. many organisms, however, are resistant to even extreme water loss. some of them synthesize trehalose which has been shown to be associated with survival of desiccation in phylogenetically diverse organisms (yeast, nematodes, brine shrimp, insect larvae, resurrection plants, and others). here we have studied hydration sensitive transitions in model lipids to determine the effect of trehalose and electrostatics on lipid order. hydration pulse-induced time-resolved fourier-transform infrared (ftir) difference spectroscopy was used to address hydration-dependent lipid structure as a function of trehalose. in combination with differential scanning calorimetry and studies of langmuir-blodget films we arrive at a structural and energetically consistent picture of how trehalose can affects lipidic phase behaviour and support a native lipid structure under water loss. experiments were performed on model lipids with different headgroups and native lipids from desiccation-tolerant organisms. controlled self-assembly and membrane organization of lipophilic nucleosides and nucleic acids: perspectives for applications martin loew , paula pescador , matthias schade , julian appelfeller , jü rgen liebscher , oliver seitz , daniel huster , andreas herrmann , and anna arbuzova humboldt universitä t zu berlin, institute of biology/ biophysics, berlin, germany, humboldt universitä t zu berlin, institute of chemistry, berlin, germany, universitä t leipzig, department of medical physics and biophysics, leipzig, germany lipophilic conjugates of nucleosides and nucleic acids such as dna, rna, and peptide nucleic acid (pna) -combining assembly properties of amphiphiles and specific molecular recognition properties of nucleic acids -allow numerous applications in medicine and biotechnology. we recently observed self-assembly of microtubes, stable cylindrical structures with outer diameters of nm and - lm and a length of - lm, from a cholesterol-modified nucleoside and phospholipids. morphology and properties of these microtubes and functionalization with lipophilic dna will be characterized. we also observed that lipophilic nucleic acids, pna and dna differing in their lipophilic moieties, partition into different lipid domains in model and biological membranes as visualized by hybridization with respective complementary fluorescently-labeled dna strands. upon heating, domains vanished and both lipophilic nucleic acid structures intermixed with each other. reformation of the lipid domains by cooling led again to separation of membrane-anchored nucleic acids. by linking specific functions to complementary strands, this approach offers a reversible tool for triggering interactions among the structures and for the arrangement of reactions and signaling cascades on biomimetic surfaces. conformational dependent trafficking of p-glycoprotein with rafts zsuzsanna gutayné tó th, orsolya bá rsony, katalin goda, gá bor szabó and zsolt bacsó university of debrecen, mhsc, department of biophysics and cell biology, debrecen, hungary p-glycoprotein (pgp), an abc-transporter playing a prominent role in multidrug resistance, demonstrate conformationdependent endocytosis on the surface of t -mdr cells. these cell surface transporters have a uic conformationsensitive-antibody-recognizable subpopulation, which is about one-third of the rest persisting long on the cell surface and perform fast internalization via rafts. we have identified that the rapid internalization is followed by quick exocytosis, in which the other subpopulation is not or only slightly involved. the exocytosis presents a cholesterol depletion dependent intensification, in contrast to the internalization, which is inhibited by cyclodextrin treatment. this continuous recycling examined by total internal reflection (tirf) microscopy increases the amount of the raft associated subpopulation of pgps in the plasma membrane, and it might have a role in restoring the cholesterol content of the membrane after cholesterol depletion. our presentation will summarize related endocytotic, exocytotic and recycling processes and that how does our data fit into our current notions regarding to the cholesterol and sphingomyelin trafficking. membrane nanodomains based on phase-segregating lipid mixtures have been emerged as a key organizing principle of the plasma membrane. they have been shown to play important roles in signal transduction and membrane trafficking. we have developed lipid-like probes carrying multivalent nitrilotriacetic acid (tris-nta) head groups for selective targeting of histagged proteins into liquid ordered or liquid disordered phases. in giant unilamellar vesicles strong partitioning of tris-nta lipids into different lipid phases was observed. for a saturated tris-nta lipid, at least -fold preference for the liquid ordered phase was found. in contrast, an unsaturated nta lipid shows a comparable preference for the liquid disordered phase. partitioning into submicroscopic membrane domains formed in solid supported membranes was confirmed by superresolution imaging. single molecule tracking of his-tagged proteins tethered to solid supported membranes revealed clear differences in the diffusion behavior of the different nta-lipids. by using bsa as a carrier, multivalent nta lipids were efficiently incorporated into the plasma membrane of live cells. based on this approach, we established versatile methods for probing and manipulating the spatiotemporal dynamics of membrane nano domains in live cells. il- is a multifunctional cytokine with pleiotropic effects on t cells. the il- r consists of the cytokine-specific a-subunit and the c c -chain shared with other cytokines, including il- and - , important regulators of t cells. we have previously shown the preassembly of the heterotrimeric il- and il- r, as well as their participation in common superclusters with mhc glycoproteins in lipid rafts of human t lymphoma cells. integrity of lipid rafts was shown to be important in il- signaling. we could hypothesize that other members of the c c cytokine receptor family, such as the il- r complex, may also fulfill their tasks in a similar environment, maybe in the same superclusters. co-localization of il- r with lipid rafts as well as with the il- r/mhc superclusters was determined by clsm. molecular scale interactions of il- ra with il- r and mhc molecules were determined by microscopic and flow cytometric fret experiments. the role of lipid rafts in il- r signaling was assessed by following the effect of cholesterol depletion on il- induced stat phosphorylation. our results suggest the possibility that preassembly of the receptor complexes in common membrane microdomains with mhc glycoproteins may be a general property of c c cytokines in t cells. to unravel the molecular processes leading to fas clustering in lipid rafts, a -mer peptide corresponding to the single transmembrane domain of the death receptor was reconstituted into model membranes that display liquid-disordered/ liquid-ordered phase coexistence, i.e. mimicking cells plasma membranes. using the intrinsic fluorescence of the peptide two tryptophans residues (trp and trp ), fas membrane lateral organization, conformation and dynamics was studied by steady-state and time-resolved fluorescence techniques. our results show that the fas has preferential localization to liquid disordered membrane regions, and that it undergoes a conformational change from a bilayer inserted state in liquid-disordered membranes to an interfacial state in liquidordered membranes. this is a result of the strong hydrophobic mismatch between the (hydrophobic) peptide length and the hydrophobic thickness of liquid-ordered membranes. in addition, we show that ceramide, a sphingolipid intimately involved in fas oligomerization and apoptosis triggering, does not affect fas membrane organization. overall, our results highlight ceramide role as an enhancer of fas oligomerization, and unravel the protective function of fas transmembrane domain against non-ligand induced fas apoptosis. organization and dynamics of membrane-bound bovine a-lactalbumin: a fluorescence approach arunima chaudhuri and amitabha chattopadhyay centre for cellular and molecular biology, hyderabad , india e-mail: amit@ccmb.res.in many soluble proteins are known to interact with membranes in partially disordered states, and the mechanism and relevance of such interactions in cellular processes are beginning to be understood. interestingly, apo-bovine alactalbumin (bla), a soluble protein, specifically interacts with negatively charged membranes and the membranebound protein exhibits a molten globule conformation. we have used the wavelength-selective fluorescence approach to monitor the molten globule conformation of bla upon binding to negatively charged membranes as compared to zwitterionic membranes. tryptophans in bla exhibit differential red edge excitation shift (rees) upon binding to negatively charged and zwitterionic membranes, implying differential rates of solvent relaxation around the tryptophan residues. our results utilizing fluorescence anisotropy, lifetime and depth analysis by the parallax approach of the tryptophans further support the differential organization and dynamics of the membrane-bound bla forms. in addition, dipole potential measurements and dye leakage assays are being used in our ongoing experiments to explore the mechanism of bla binding to membranes. these results assume significance in the light of antimicrobial and tumoricidal functions of a-lactalbumin. role of long-range effective protein-protein forces in the formation and stability of membrane protein nano-domains nicolas destainvill laboratoire de physique thé orique, université paul sabatier toulouse -cnrs, toulouse, france we discuss a realistic scenario, accounting for the existence of sub-micro-metric protein domains in plasma membranes. we propose that proteins embedded in bio-membranes can spontaneously self-organize into stable small clusters, due to the competition between short-range attractive and intermediate-range repulsive forces between proteins, specific to these systems. in addition, membrane domains are supposedly specialized, in order to perform a determined biological task, in the sense that they gather one or a few protein species out of the hundreds of different ones that a cell membrane may contain. by analyzing the balance between mixing entropy and protein affinities, we propose that protein sorting in distinct domains, leading to domain specialization, can be explained without appealing to preexisting lipidic micro-phase separations, as in the lipid raft scenario. we show that the proposed scenario is compatible with known physical interactions between membrane proteins, even if thousands of different species coexist. lipid rafts are cholesterol and sphingolipid-enriched functional microdomains present in biomembranes. rafts have been operationally defined as membrane fractions that are detergent insoluble at low temperature. here we have characterized drms from erythrocytes treated with the nonionic detergents brij and brij , at °c and °c, and compared them to drms obtained with triton x- (tx ). we have also investigated the effect of cholesterol depletion in drms formation. brij drms were enriched in cholesterol as well as tx drms. hptlc analysis showed a very similar distribution of phosphatidylcholine-pc, phosphatidylethanolamine-pe and sphingomyelin-sm in brij drms to that found in ghost membranes. sm-enriched drms were obtained only with tx while pe content was decreased in tx drms, in comparison to brij drms. immunoblot essays revealed that rafts markers (flotillin- and stomatin) were present in all drms. contrary to tx drms, analysis of electron paramagnetic resonance spectra (with -doxyl stearate spin label) revealed that brij drms are not in the liquid-ordered state, evincing the differential extraction of membrane lipids promoted by these detergents. supported by fapesp/cnpq (brazil). several biological membrane mimics have been built to investigate the topology of molecules in membranes. among them ''bicelles'', i.e., mixtures of long-chain and short-chain saturated phospholipids hydrated up to %, became very popular because they orient spontaneously in magnetic fields. disk-shaped systems of - nm diameter and - nm thickness have been measured by electron microscopy and solid state nmr and can be oriented by magnetic fields with the disc-plane normal perpendicular to the field. we have been developing recently lipids that contain in one of their chains a biphenyl group (tbbpc) affording an orientation parallel to the magnetic field, in the absence of lanthanides. a large number of hydrophobic molecules including membrane proteins have been successfully embedded and static nmr afforded finding the orientation of protein helices in membranes; mas nmr provided the d structure of peptides in bicelles. biphenyl bicelles keep their macroscopic orientation for days outside the field, thus leading to combined nmr and x-rays experiments. tbbpc allows also construction of lm vesicles showing a remarkable oblate deformation in magnetic fields (anisotropy of - ) and opens the way to applications for structural biology or drug delivery under mri. imaging membrane heterogeneities and domains by super-resolution sted nanoscopy christian eggeling, veronika mueller, alf honigmann, stefan w. hell department of nanobiophotonics, max planck institute for biophysical chemistry, am fassberg , gö ttingen, germany cholesterol-assisted lipid and protein interactions such as the integration into lipid nanodomains are considered to play a functional part in a whole range of membrane-associated processes, but their direct and non-invasive observation in living cells is impeded by the resolution limit of [ nm of a conventional far-field optical microscope. we report the detection of the membrane heterogeneities in nanosized areas in the plasma membrane of living cells using the superior spatial resolution of stimulated emission depletion (sted) far-field nanoscopy. by combining a (tunable) resolution of down to nm with tools such as fluorescence correlation spectroscopy (fcs), we obtain new details of molecular membrane dynamics. sphingolipids or other proteins are transiently (* ms) trapped on the nanoscale in cholesterol-mediated molecular complexes, while others diffuse freely or show a kind of hopping diffusion. the results are compared to sted experiments on model membranes, which highlight potential influences of the fluorescent tag. the novel observations shed new light on the role of lipid-protein interactions and nanodomains for membrane bioactivity. ca + controlled all-or-none like recruitment of synaptotagmin- c ab to membranes sune m. christensen, nicky ehrlich, dimitrios stamou bio-nanotechnology laboratory, department of neuroscience and pharmacology, nano-science center, lundbeck foundation center biomembranes in nanomedicine, university of copenhagen, copenhagen, denmark e-mail: ehrlich@nano.ku.dk & stamou@nano.ku.dk synaptotagmin- (syt) is the major ca + sensor that triggers the fast, synchronous fusion of synaptic vesicle with the presynaptic membrane upon ca +-mediated membrane recruitment of the cytosolic c ab domain. the ca +-dependent recruitment of syts c ab domain to membranes has so far been investigated by ensemble assays. here we revisited binding of wild type c ab and different c ab mutants of syt to lipid membranes using a recently developed single vesicle assay. the hallmark of the single vesicle approach is that it provides unique information on heterogeneous properties that would otherwise be hidden due to ensemble averaging. we found that c ab does not bind to all vesicles in a homogenous manner, but in an all-or-none like fashion to a fraction of the vesicles. the fraction of vesicles with bound c ab is regulated by the amount of negatively charged lipids in the membrane as well as by [ca +] . this ca + controlled all-ornone like recruitment of syt to membranes provides a possible explanation for the strongly heterogeneous behavior of the in vitro model system for neuronal membrane fusion. furthermore, heterogeneity in release probability among synaptic vesicles is a critical property in determining the output of a neuronal signaling event. new insights in the transport mechanism of ciprofloxacin revealed by fluorescence spectroscopy mariana ferreira, sílvia c. lopes, paula gameiro requimte, faculdade de ciê ncias da universidade do porto, porto, portugal keywords: fluoroquinolones; liposomes; proteoliposomes; ompf; ciprofloxacin; fluorescence; anisotropy. fluoroquinolones are antibiotics that have a large spectrum of action against gram negative and some gram positive bacteria. the interaction between these species and liposomes has been cited as a reference in the understanding of their diffusion through phospholipid bilayer and can be quantified by the determination of partition coefficients between a hydrophobic phase (liposomes) and an aqueous solution. it is also known that some porins of the bacterial membranes are involved in transport mechanism of many fluoroquinolones. ompf, a well characterized membrane protein characteristic of the outer membrane of gram negative bacteria assumes the conformation of homo-trimer, whose monomers have two tryptophan residues (one located at the interface of monomers and the other at the interface lipid/protein). thus, we proceeded to study the interaction of ciprofloxacin, a second generation fluoroquinolone, with unilamellar liposomal vesicles and ompf proteoliposomes of pope/popg, pope/popg/cardiolipin and e. coli total. partition coefficients (kp's) and the association with ompf proteoliposomes were determined by steady state fluorescence spectroscopy under physiological conditions (t= °c; ph . ). the membrane mimetic systems used were characterized by dls and fluorescence anisotropy. motivation is whether there exist differences in the patternforming capabilities of two adhesion molecules of different roles: cd , mediating ,,dynamic'' adhesion in cell rolling and icam- , mediating ,,static'' adhesion during the formation of immune-synapse. homo-and hetero-associations of cd , icam- and the mhci is investigated on the nm-and lmdistance levels on ls t colon carcinoma cells in two different conditions of lymphocyte homing: ( ) with ifnc and tnfa, both lymphokines up-regulating the expression level of mhci and icam- and down-regulating that of cd . ( ) crosslinking of cd and icam- representing receptor engagement. the observations are explained by assuming the existence of a kinase cascade-level crosstalk between the cd and icam- molecules which manifests in characteristic complementary changes in the properties of cell surface receptor patterns. for the characterisation of cluster morphology new colocalization approaches were developed: (i) ,,number of first neighbours'' distribution curves, (ii) ,,acceptor photobleaching fret-fluorescence intensity fluctuation product'' correlation diagrams, and (iii) ,,random gradient-kernel smoothing assisted decay'' of pearson-correlations, and (iv) k-function formalism. analyzing janus kinases in living cells with single-color fcs using confocal and tir-illumination thomas weidemann , hetvi gandhi , remigiusz worch , , robert weinmeister , christian bö kel , and petra schwille biophysics research group, technische universitä t dresden, germany, crtd, center for regenerative therapies dresden, technische universitä t dresden, germany, institute of physics, polish academy of science, warsaw, poland cytokine receptors of the hematopoietic superfamily transduce their signal through non-covalently bound janus kinases. there are only such kinases in humans (jak , , and tyk ), which associate to different cytokine receptor chains. here we study the dynamics of gfp-tagged jak and jak in epithelial cells with fluorescence correlation spectroscopy (fcs). jak and jak behave differently in various aspects: in the absence of receptors, jak still binds the membrane, whereas jak diffuses homogeneously in the cytoplasm. we used fcs under total internal reflection illumination (tir-fcs) and determined the membrane binding affinity of jak to be ± nm. the association of jak with the common gamma chain (c c ) is very tight as shown by fluorescence recovery after photobleaching (frap). molecular brightness analysis of single-point fcs shows that jak diffuses as a monomer in the rather small cytoplasmic pool, whereas jak diffuses as dimers, which undergo a defined oligomerization. the degree of oligomerization decays at higher concentrations, indicating that some unknown, saturable scaffold is involved. characterizing the binding and mobility schemes of the janus kinases may be important to further elucidate their specific and redundant effects in signal transduction. plasma membrane (pm)-enriched fraction obtained through subcellular fractioning protocols are commonly used in studies investigating the ability of a compound to bind to a receptor. however, the presence of mitochondria membranes (mi) in the pm-enriched fraction may compromise several experimental results because mi may also contain the interest binding proteins. aiming to analyze the subcellular fractioning quality of a standard sucrose density based protocol, we investigated (a) the na + k + -atpase (pm marker) and succinate dehydrogenase -sd (mi marker) activities; (b) the immunocontent of the adenine nucleotide translocator (ant -mi membrane marker) in both pm-and mi-enriched fractions. since several binding protocols may require long incubation period, we verified the quality of both fractions after hours of incubation in adequate buffer. our results show that pmand mi-enriched fractions exhibit contamination with mi or pm, respectively. we did not observe any effect of incubation on na + k + -atpase activity and ant content in both fractions. surprisingly, sd activity was preserved in the pm-but not in mi-enriched fraction after incubation. these data suggest the need of more careful use of pm-enriched fraction preparation in studies involving pm proteins characterization. human neutrophil peptide (hnp ) is a human cationic defensin that present microbial activity against various bacteria (both gram-positive and negative), fungi and viruses. hnp is stored in the cytoplasmic azurophilic granules of neutrophils and epithelial cells. in order to elucidate the mode of action of this antimicrobial peptide (amp), studies based on its lipid selectivity were carried out. large unilamellar vesicles (luv) with different lipid compositions were used as biomembranes model systems (mammal, fungal and bacterial models). changes on the intrinsic fluorescence of the tryptophan residues present in hnp upon membrane binding/insertion were followed, showing that hnp have quite distinct preferences for mammalian and fungal membrane model systems. hnp showed low interaction with glucosylceramide rich membranes, but high sterol selectivity: it has a high partition for ergosterol-containing membranes (as fungal membranes) and low interaction with cholesterolcontaining membranes (as in mammalian cells). these results reveal that lipid selectivity is the first step after interaction with the membrane. further insights on the hnp membrane interaction process were given by fluorescence quenching measurements using acrylamide, -doxylstearic acid ( ns) or ( ns). nanoparticles (np) are currently used in many industrial or research applications (paints, cosmetics, drug delivery materials…). recent papers demonstrate clearly their activity with biological membranes (nanoscale holes, membrane thinning, disruption). different studies of the np-membrane interaction suggest that parameters are particularly important, such as the np size, their surface properties or their aggregation state. composition of biological membranes being particularly complex, supported lipid bilayers (slb) composed of a restricted number of lipids are usually used as simplified membrane model. moreover, these two-dimensional systems are convenient for surface analysis techniques, such as atomic force microscopy (afm), giving information on the morphology of the slb and its mechanical properties. in this work, we study the behaviour of slbs made of lipids representative of the membrane fluid phase (popc) or of the raft phase (sphingomyelin). these slbs are deposited on planar surfaces (mica or glass) previously recovered with silica beads ( or nm in diameter) in order to mimick the np-membrane interaction. we will present in this work our first results obtained by afm and fluorescence microscopy. it is well known that the eukaryotic nuclei are the sphere of lipids active metabolism. the investigations demonstrated the existence of numerous enzymes in nuclei which modulate the changes of nuclear lipids during different cellular processes. although the nuclear membrane is accepted as the main place of the lipids localization, nearly % of nuclear lipids are discovered in chromatin fraction. the ability of chromatin phospolipids to regulate dna replication and transcription was already demonstrated. the chromatin phospolipids seems to play an important role in cell proliferation and differentiation as well as in apoptosis. it seems also possible that chromatin phospholipids may be participated in realization of cisplatin antitumor effects. the -hour in vivo effect of cisplatin on rat liver chromatin phospholipids was investigated. the phospholipids of rat liver chromatin were fractionated by microtlc technique. the quantitative estimation of fractionated phospholipids was carried out by computer program fugifilm science lab. image gauge v . . the alteration of total phospholipids content as well as the quantitative changes among the individual phospholipids fractions in rat liver chromatin after in vivo action of cisplatin was established. the total content of chromatin phospholipids was significantly decreased after the cisplatin action. four from five individual phospholipids fractions were markedly changed after the drug action. two cholin-content phospholipids, particularly phosphatidylcholine and sphingomyelin exhibit diversity in sensitivity to this drug: the increase of sphingomyelin content accompanied by quantitative decrease of phosphatidylcholine. the quantity of cardiolipin was markedly increased while the amount of phosphatidylinositol was decreased after the cisplatin treatment. the phosphatidylethanolamin content remained unchanged after the drug action. it seems that high sensitivity of chromatin phospholipids exhibited to cisplatin action may play an important role in antitumor effects of this drug. membrane lipids and drug resistance in staphylococci r. d. harvey institute of pharmaceutical science, king's college london, stanford street, london se nh, uk staphylococci express numerous resistance mechanisms against common antimicrobials, including peptide components of the innate immune system which have been trumpeted as being likely candidates to replace our increasingly ineffective antibiotics. the membrane phospholipid lysylphosphatidylglycerol (l-pg) appears to play a key role in staphylococcal drug resistance, since its absence in mutant bacteria renders them susceptible to a range of cationic antimicrobials. the current assumption about the role l-pg plays in drug resistance is that of facilitating charge neutralisation of the plasma membrane, leading to loss of affinity towards cationic moieties. we have investigated this phenomenon using a range of model membrane systems composed of both synthetic lipids and reconstituted natural lipid extracts, using such techniques as stopped-flow fluorescence, circular dichroism and neutron scattering. our conclusions indicate that the initial assumptions about the role of l-pg in drug resistance are over-simplistic and certainly do not tell the whole story of the physical and biological properties of this fascinating moderator or membrane behaviour. our findings show that l-pg does not inhibit antimicrobial drug action by charge dampening, hinting at a different protective mechanism. modulation of a-toxin binding by membrane composition m. schwiering, a. brack, c. beck, h. decker, n. hellmann institute for molecular biophysics, university of mainz, mainz, germany although the alpha-toxin from s. aureus was the first poreforming toxin identified, its mode of interaction with membranes is still not fully understood. the toxin forms heptameric pores on cellular and artificial membranes. the present hypothesis is that the initial binding to the membrane occurs with low affinity, and that an efficient oligomerisation, relying on clusters of binding sites, is the reason for the overall high affinity of the binding process. in order to separate the effects of increasing concentration of binding sites from this topological effect, we investigated the oligomer formation based on pyrene-fluorescence for a series of lipid compositions, where the fraction of toxin binding lipids (egg phospatidylcholine (epc) or egg sphingomyelin (esm)) was varied while their concentration remained constant. the results indicate that an increased local density of toxin binding sites occurring due to phase separation facilitates oligomer formation. furthermore, the change in local environment (number of neighboring cholesterol molecules) upon domain formation also enhances oligomer formation.we thank the dfg (sfb ) for financial support, s. bhakdi and a. valeva for production of the toxin and helpful discussions. we explored quercetin effects on lipid bilayers containing cholesterol using a spectrofluorimetric approach. we used the fluorescent probe laurdan which is able to detect changes in membrane phase properties. when incorporated in lipid bilayers, laurdan emits from two different excited states, a non-relaxed one when the bilayer packing is tight and a relaxed state when the bilayer packing is loose. this behavior is seen in recorded spectra as a shift of maximum emission fluorescence from nm at temperatures below lipids phase transition to nm at temperatures above lipids phase transition values. emission spectra of laurdan were analyzed as a sum of two gaussian bands, centered on the two emission wavelengths allowing a good evaluation of the relative presence of each population. our results show that both laurdan emission states are present with different shares in a wide temperature range for dmpc liposomes with cholesterol. quercetin leads to a decrease in the phase transition temperature of liposomes, acting in the same time as a quencher on laurdan fluorescence. this paper is supported by the sectorial operational programme human resources development, financed from the caveolins are essential membrane proteins found in caveolae. the caveolin scaffolding domain of caveolin- includes a short sequence containing a crac motif (v tkywfyr ) at its c-terminal end. to investigate the role of this motif in the caveolin-membrane interaction at the atomic level, we performed a detailed structural and dynamics characterization of a cav- (v -l ) nonapeptide encompassing this motif and including the first residue of cav- hydrophobic domain (l ), in dodecylphosphocholine (dpc) micelles and in dmpc/dhpc bicelles, as membrane mimics. nmr data revealed that this peptide folded as an amphipathic helix located in the polar head group region. the two tyrosine sidechains, flanked by arginine and lysine residues, are situated on one face of this helix, whereas the phenylalanine and tryptophan side-chains are located on the opposite face (le lan c. et al., , eur. biophys. j., , - ). to investigate the interactions between the crac motif and the lipids, we performed molecular dynamics simulations in two different environment: a dpc micelle and a popc bilayer. the results obtained are in good agreement with nmr data and the comparison between both systems provided insight into the orientation of the crac motif at the membrane interface and into its interactions with lipids. this work was partially supported by the strategic grant posdru/ / . our study suggests that conformation and positioning of hydroxyl groups significantly affects thermotropic properties of sphingolipids and sterol interaction. the polymorphism of a new series of bolaamphiphile molecules based on n-( -betainylamino-dodecane)-octyl b-d-glucofuranosiduronamide chloride is investigated. the length of the main bridging chain is varied in order to modify the hydrophilic/lipophilic balance. the other chemical modification was to introduce a diacetylenic unit in the middle of the bridging chain to study the influence of the pp stacking on the supramolecular organisation of these molecules. dry bolaamphiphiles self-organize in supramolecular structures such as lamellar crystalline structure, lamellar fluid structure and lamellar gel structure. the thermal dependence of these structures, as well as the phase transition is followed by smallangle and wide-angle x-ray scattering. once the thermal cycle is accomplished, the system remains in the kinetically stabilized undercooled high-temperature phase at temperature of °c. subsequently, the time dependence of the relaxation to the thermodynamically stable phase is followed and very slow relaxation on the order of hours or days is observed. the study of polymorphism and the stability of various phases of this new series of bolaamphiphiles is interesting for potential application in health, cosmetics or food industry, is undertaken in this work. alkylphospholipids have shown promising results in several clinical studies and among them perifosine (opp) is promising for breast cancer therapy. antitumor effect was much better in estrogen receptor negative (er-) than in estrogen receptor positive (er+) tumors in vivo. it is believed that apl do not target dna, but they insert in the plasma membrane and ultimately lead to cell death. liposomes made of opp and different amount of cholesterol (ch) showed diminished hemolytic activity as compared to micellar opp, but in most cases cytotoxic activity was lower. in order to find optimal liposomal composition and to understand better the difference in the response of er+ and er-cells the interaction opp liposomes with er+ and er-cells was studied. for liposomes with high amount of ch both cell types showed slow release of the liposome entrapped spin probe into the cytoplasm. liposmes with low amount of ch interact better with cells but the release is faster for er-as for er+ cells at °c. experiments with nitroxide-labeled opp (sl-opp) liposomes suggest that the exchange of sl-opp between liposomes and cellular membranes is fast. however, translocation of sl-opp across the plasma membrane is slow, but seems to be faster for opp resistent, er+ cells as for er-cells at °c. estimation of a membrane pore size based on the law of conservation of mass krystian kubica , artur wrona and olga hrydziuszko institute of biomedical engineering and instrumentation, wroclaw university of technology, wroclaw, poland, centre for systems biology, university of birmingham, birmingham, uk the size of biomembrane pores determines which solutes or active compounds may enter the cell. here, using a mathematical model of a lipid bilayer and the law of conservation of mass, we calculate the radius of a membrane pore created by rearranging the lipid molecules (the pore wall was formed out of the lipid heads taken from the membrane regions situated directly above and below the pore, prior its formation). assuming a constant number of lipid molecules per bilayer (with or without the pore) and based on the literature data ( % decrease in the area per chain for a fluid-to-gel transition and a matching change of one chain volume not exceeding %) we have shown that the pore radius can measure up to . nm (for a nm thick lipid bilayer) without the lipid molecules undergoing a phase transition. a further assumption of area per chain being modified as a consequence of the lipids conformational changes has resulted in an increase of the calculated radius up to . nm. finally, a comparison of the pore volume with the corresponding volume of the lipid bilayer has led to a conclusion that for the system under consideration the membrane pore can only be created with the lipids undergoing fluidto-gel conformational changes. the key signaling pathway involves tyrosine phosphorylation of signal transducers and activators of transcription (stat and stat ) by receptor-associated janus kinases. we aim to unveil of the very early events of signal activation including ligand-induced receptor assembly and the recruitment of the cytoplasmic effector proteins stat and stat in living cells. to this end, we have explored the spatiotemporal dynamics of stat recruitment at the membrane on a single molecule level. highly transient interaction of stats to membrane-proximal sites was detected by tirf-microscopy, allowing for localizing and tracking individual molecules beyond the diffraction limit. thus, we obtained a pattern of the spatio-temporal recruitment of stat molecules to the plasma membrane revealing distinct submicroscopic structures and hotspots of stat interaction with overlapping recruitment sites for stat and stat . strikingly, these stat binding sites were independent on receptor localization and expression level. simultaneous superresolution imaging of the cytoskeleton revealed the organization of stat recruitment sites within the cortical actin skeleton. characterization of molecular dynamics on living cell membranes at the nanoscale level is fundamental to unravel the mechanisms of membrane organization andcompartmentalization. we have recently demonstr ated the feasibility of fluorescence correlation spectroscopy (fcs) based on the nanometric illumination of near-field scanning optical microscopy (nsom) probes on intact living cells [ ] . nsom-fcs was applied to study the diffusion of fluorescent lipid analogs on living cho cells. the experiments allowed to reveal details of the diffusion hidden by larger illumination areas and associated with nanoscale membrane compartmentalization. the technique also offers the unique advantages of straightforward implementation of multiple color excitation, opening the possibility to study nanoscale molecular cross-correlation. furthermore, the nsom probe evanescent axial illumination allows to extend diffusion study to the membrane-proximal cytosolic region. as such, nsom-fcs represents a novel powerful tool to characterize the details of many biological processes in which molecular diffusion plays a relevant role. the growing interest in supported lipid bilayers (slbs) on conductive substrates, such as gold, is due to the possibility of designing lipid-based biosensor interfaces with electrochemical transduction. due to the hydrophobicity of gold it is still a challenge to deposit planar and continuous bilayers without previous surface modification. most studies on gold concern single-phase slbs without cholesterol or gangliosides, two vital components of biomembranes. in this work the experimental conditions suitable for the formation of complex slbs with phase-separation directly on gold are exploited. the mixtures dopc/dppc/cholesterol ( : : ) with or mol % of ganglioside gm , which should yield lipid raft-like domains according to reported phase diagrams, were studied. slb with lipid rafts were successfully formed onto bare au ( ), although surface modification with -mercapto-undecanoic acid sam stabilized the slbs due to its charge and hydrophilicity. the different experimental conditions tested had an impact on nano/microdomains organization observed by atomic force microscopy in buffer solution. surface characterization through the combined use of ellipsometry, cyclic voltammetry and afm allowed to optimize the conditions for the formation of more planar and compact slbs. it is widely accepted that the conversion of the soluble, nontoxic amyloid b-protein (ab) monomer to aggregated toxic ab rich in b-sheet structures is central to the development of alzheimer's disease. however, the mechanism of the abnormal aggregation of ab in vivo is not well understood. we have proposed that ganglioside clusters in lipid rafts mediate the formation of amyloid fibrils by ab, the toxicity and physicochemical properties of which are different from those of ab amyloids formed in solution [ , ] . in this presentation, we report a detailed mechanism by which ab-( - ) fibrillizes in raft-like lipid bilayers composed of gm /cholesterol/sphingomyelin. at lower concentrations, ab formed an a helix-rich structure, which was cooperatively converted to a b sheet-rich structure above a threshold concentration. the structure was further changed to a seed-prone b structure at higher concentrations. the seed recruited ab in solution to form amyloid fibrils. [ hepatitis c virus (hcv) has a great impact on public health, affecting more than million people worldwide since it is the cause of liver-related diseases such as chronic hepatitis, cirrhosis and hepatocarcinoma. hcv entry into the host cell is achieved by the fusion of viral and cellular membranes, replicates its genome in a membrane associated replication complex, and morphogenesis has been suggested to take place in the endoplasmic reticulum (er) or modified er membranes. the variability of the hcv proteins gives the virus the ability to escape the host immune surveillance system and notably hampers the development of an efficient vaccine. hcv has a single-stranded genome which encode a polyprotein, cleaved by a combination of cellular and viral proteases to produce the mature structural proteins (core, e , e , and p ) and non-structural ones (ns , ns , ns a, ns b, ns a and ns b), the latter associated with the membrane originated from the er in the emerging virus. the ns b protein, a fundamental player in the hcv replicative process and the least characterized hcv protein, is a highly hydrophobic protein associated with er membranes. it has recently been shown that the c-terminal is palmitoylated and the n-terminal region has potent polymerization activity. the expression of ns b induces the formation of the so called membranous web, which has been postulated to be the hcv rna replication complex. thus, a function of ns b might be to induce a specific membrane alteration that serves as a scaffold for the formation of the hcv replication complex and therefore has critical role in the hcv cycle. due to the high hydrophobic nature of ns b, a detailed structure determination of this protein is very difficult. the ns b protein is an integral membrane protein with four or more transmembrane domains. the c-terminal region of ns b is constituted by two a helices, h (approximately from amino acid to ) and h (approximately from amino acid to ), which have been studied as potential targets for inhibiting hcv replication. previous studies from our group, based on the study of the effect of ns b peptide libraries on model membrane integrity, have allowed us to propose the location of different segments in this protein that would be implicated in lipid-protein interaction. additionally, the h region could be an essential constituent in the interaction between protein and membrane. in this study we show that peptides derived from the c-terminal domain of ns b protein of hcv are able to interact with high affinity to biomembranes, significantly destabilizing them and affecting their biophysical properties. there were also differences in the interaction of the peptide depending on the lipid composition of the membranes studied. we have also applied fluorescence spectroscopy, infrared spectroscopy and differential scanning calorimetry which have given as a detailed biophysical study of the interaction of the peptide with model biomembranes. this work was supported by grant bfu - -bmc (ministerio de ciencia y tecnología, spain) to j.v. a semi-quantitative theory describing the adhesion kinetics between soft objects as living cells or vesicles was developed. the nucleation-like mechanism has been described in the framework of a non-equilibrium fokker-planck approach accounting for the adhesion patch growth and dissolution (a. raudino, m. pannuzzo, j. chem. phys. , ( )). a well known puzzling effect is the dramatic enhancement of the adhesion/fusion rate of lipid membranes by water-soluble polymers that do not specifically interact with the membrane surface. we extend the previous approach by molecular dynamics simulations in the framework of a coarse-grained picture of the system (lipid+polymer+ions embedded in an explicit water medium) in order to test and support our previous analytical results. simulations show that the osmotic pressure due to the polymer exclusion from the inter-membrane spacing is partially balanced by an electrostatic pressure. however, we also evidenced an interesting coupling between osmotic forces and electrostatic effects. indeed, when charged membranes are considered, polymers of low dielectric permittivity are partially excluded from the inter-membrane space because of the increased local salt concentration. the increased salt concentration means also a larger density of divalent ions which form a bridge at the contact region (stronger adhesion). the overall effect is a smaller membrane repulsion. this effect disappears when neutral membranes are considered. the model could explain the fusion kinetics between lipid vesicles, provided the short-range adhesion transition is the rate-limiting step to the whole fusion process. the role of ceramide acyl chain length and unsaturation on membrane structure sandra n. ceramide fatty acid composition selectively regulates distinct cell processes by a yet unknown mechanism. however, evidence suggests that biophysical processes are important in the activation of signalling pathways. indeed, ceramide strongly affects membrane order, induces gel/fluid phase separation and forms highly-ordered gel domains. the impact of ceramide n-acyl chain in the biophysical properties of a fluid membrane was studied in popc membranes mixed with distinct ceramides. our results show that: i) saturated ceramide has a stronger impact on the fluid membrane, increasing its order and promoting gel/fluid phase separation, while their unsaturated counterparts have a lower (c : ceramide) or no (c : ceramide) ability to form gel domains at physiological temperature, ii) differences between distinct saturated species are smaller and are related mainly to domain morphology, and iii) very long chain ceramide induces the formation of tubular structures probably associated with interdigitation. these results suggest that generation of different ceramide species in cell membranes has a distinct biophysical impact with acyl chain saturation dictating membrane lateral organization, and chain asymmetry governing interdigitation and membrane morphology. extra high content of cholesterol (chol) in fiber-cell membranes in the eye lens leads to chol saturation and formation of cholesterol bilayer domains (cbds). it is hypothesized that high enrichment in cholesterol helps to maintain lens transparency and protect against cataractogenesis. in model studies, the cbd is formed in a phospholipid bilayer when cholesterol content exceeding the cholesterol solubility threshold, thus, the cbd is surrounded by phospholipid bilayer saturated with cholesterol. in the present study, we carried out molecular dynamics (md) simulation of two bilayers: a palmitoyloleoylphosphatidylcholine (popc) bilayer (reference) and a : popc-chol bilayer, to investigate the smoothing effect of the saturating amount of cholesterol on the bilayer. to our knowledge, this effect has not been studied so far so this study is certainly providing new results. our results indicate that saturation with cholesterol significantly narrows the distribution of vertical positions of the center-of-mass of the popc molecules and the popc atoms in the bilayer and smoothes the bilayer surface. we hypothesize that this smoothing effect decreases light-scattering and helps to maintain lens transparency. the phospholipid content of staphylococcus aureus membranes displays a high degree of variability ( - ). the major phospholipids found in s. aureus are phosphatidylglycerol (pg), cardiolipin (cl) and lysylphosphatidylglycerol (lpg), ( ) the concentrations of which are environment dependent and see fluctuations on exposure to high concentrations of positively charged moieties ( ) . up regulation of lpg has a suspected role in neutralisation of the plasma membrane in response to cationic threats. studies have been conducted to probe biomimetic models of this theory however our focus is to look at atomic details of membrane extracts in the presence of magaininf w. s. aureus lipid extracts from cells grown at ph . and . , studies by neutron diffraction with and without peptide at two contrasts. d-spacings were assessed by vogt area fitting and bragg's law. bilayer separation at low ph was * - Å less than ph . . with peptide, bilayer separations of ph . and . extracts were reduced by * Å and * Å , respectively. reduced pg content of low ph extracts is suggested to reduce d-spacing, however presence of peptide further reduces this, possibly by an anion neutralisation effect. abnormal d-spacing on increased humidity may be due to the breakdown of lpg. activation of neutrophils releasing hocl and apoptosis of vein endothelial cells are the events documented to occur in the course of atherosclerosis. as lipid chlorohydrins, which are the key products of the reaction between hocl and unsaturated fatty acid residues, were found in atherosclerotic plaques, we decided to check their biological activity in the context of their ability to act as the mediators of hoclinduced oxidative stress and apoptosis in the culture of immortalized human umbilical vein endothelial cells (hu-vec-st). the concentration of reactive oxygen species was found to be elevated after h cell incubation with phospahtidylcholine chlorohydrins. this effect was at least partially caused by the leakage of superoxide anion from mitochondria and followed by depletion of gsh and total thiols. the significant decrease of antioxidant capacity of cell extracts was also observed. the intracellular red-ox imbalance was accompanied by the increase of the ratio between phosphorylated and dephosphorylated forms of p map kinase. after longer incubation a significant number of apoptotic cells appeared. summing up, phosphatidylcholine chlorohydrins may be regarded as signaling molecules, able to initiate signalling pathways by induction of oxidative stress. giant unilamellar vesicles (guvs) are a valuable tool in the study of lateral distribution of biological membrane components. guv dimensions are comparable to typical cell plasma membranes and lipid phase separation can be observed through fluorescence microscopy. guv studies frequently require immobilization of the vesicles, and several methods are available for that effect. one of the most common methodologies for vesicle immobilization is the use of avidin/streptavidin coated surfaces and biotin labeled lipids at very low concentration in the vesicles. here, we analyze the effect of using this methodology on lipid domain distribution for different lipid compositions. we show that as a result of non-homogeneous distribution of biotin labeled lipids between liquid disordered, liquid ordered and gel phases, distribution of lipid domains inside guvs can be dramatically affected. monitoring membrane permeability: development of a sicm approach christoph saßen , and claudia steinem institute for organic and biomolecular chemistry, university of gö ttingen, tammannstraße , gö ttingen, germany, ggnb doctoral program: imprs, physics of biological and complex systems scanning ion conductance microscopy (sicm) utilises a nanopipette containing an electrode as a probe for surface investigations with resolutions of / of the inner pipette diameter. experiments are conducted under physiological conditions, in situ and without mechanical contact of probe and sample. hence, sicm serves as a well-suited technique for the investigation of soft objects such as cells or artificial lipid membranes. using pore-suspending membranes (psm) as a model system, interactions of melittin as an example for cell penetrating peptides (cpps) and lipid membranes are investigated by means of sicm. formation of a range of solvent free psm from lipid vesicles has been achieved as confirmed by means of fluorescence microscopy and sicm. application of melittin results in rupturing of the lipid bilayer. putative insights gained from this assay are critical concentrations of membrane permeabilising ccps and answers to mechanistic questions, e.g. whether ccps translocate only or form pores within the lipid bilayer. positioning of the z-ring in escherichia coli prior to cell division is regulated by intracellular pole-to-pole oscillation and membrane binding of min proteins, allowing assembly of ftsz filaments only at the center plane of the cell. in order to investigate the influence of membrane geometry on the dynamic behavior of membrane binding of min proteins, we combined concepts of synthetic biology and microfabrication technology. glass slides were patterned by a gold coating with microscopic windows of different geometries, and supported lipid bilayers (slb) were formed on these microstructures. on slbs, min-proteins organize into parallel waves. confinement of the artificial membranes determined the direction of propagation.min-waves could be guided along curved membrane stripes, in circles and even along slalom-geometries. in elongated membrane structures, the protein waves always propagate along the longest axis. coupling of protein waves across spatially separated membrane patches was observed, dependent on gap size and viscosity of the aqueous media above the bilayer. this indicates the existence of an inhomogeneous and dynamic protein gradient above the membrane. minimal systems for membrane associated cellular processes petra schwille bio technology center biotec, technical university of dresden, germany the strive for identifying minimal biological systems, particularly of subcellular structures or modules, has in the past years been very successful, and crucial in vitro experiments with reduced complexity can nowadays be performed, e.g., on reconstituted cytoskeleton and membrane systems. in this overview talk, i will first discuss the virtues of minimal membrane systems, such as guvs and supported membranes, in quantitatively understand protein-lipid interactions, in particular lipid domain formation and its relevance on protein function. membrane transformations, such as vesicle fusion and fission, but also vesicle splitting, can be reconstituted in these simple subsystems, due to the inherent physical properties of selfassembled lipids, and it is compelling question how simple a protein machinery may be that is still able to regulate these transformations. as an exciting example of the power of minimal systems, i show how the interplay between a membrane and only two antagonistic proteins from the bacterial cell division machinery can result in emergence of protein self-organization and pattern formation, and discuss the possibility of reconstituting a minimal divisome. quantitative microscopic analysis reveals cell confluence regulated divergence of pdgfr-initiated signaling pathways with logically streamlined cellular outputs Á rpá d szö } or, lá szló ujalky-nagy, já nos szö ll} osi, gyö rgy vereb university of debrecen, department of biophysics and cell biology, debrecen, hungary platelet derived growth factor receptors (pdgfr) play an important role in proliferation and survival of tumor cells. pdgf-bb stimulation caused a redistribution of pdgf receptors towards gm rich domains, which was more prominent in confluent monolayers. pdgf-bb stimulation significantly increased relative receptor phosphorylation of the ras / mapk pathway specific tyr residues and the pi -kinase / akt pathway specific tyr residues in nonconfluent cultures. tyr residues that serve as adaptors for ras-gap which inactivates the mapk pathway and tyr residues feeding into the plc-gamma / camk-pkc pathway were the docking sites significantly hyperphosphorylation following ligand stimulation in confluent cells. we found that p-akt facilitated cell survival and pmapk dependent proliferation is more activated in dispersed cells, while phospholipase c-gamma mediated calcium release and pkc-dependent rhoa activation are the prominent output features pdgf stimulus achieves in confluent cultures. these observations suggest that the same stimulus is able to promote distinctly relevant signaling outputs, namely, cell division and survival in sparse cultures and inhibition of proliferation joined with promotion of migration in confluent monolayers that appear contact inhibited. a thermodynamic approach to phase coexistence in ternary cholesterol-phospholipid mixtures jean wolff, carlos m. marques and fabrice thalmann institut charles sadron, université de strasbourg, cnrs upr, , rue du loess, strasbourg cedex, f- , france e-mail: thalmann@ics-cnrs.unistra.fr we present a simple and predictive model for describing the phase stability of ternary cholesterol-phospholipid mixtures. assuming that competition between the liquid and gel organizations of the phospholipids is the main driving force behind lipid segregation, we derive a phenomenological gibbs free-energy of mixing, based on the calorimetric properties of the lipids main transition. gibbs phase diagrams are numerically obtained that reproduces the most important experimental features of dppc-dopc-chol membranes, such as regions of triple coexistence and liquid orderedliquid disordered segregation. based on this approach, we present a scenario for the evolution of the phase diagram with temperature. results for other phospholipid species, such as popc or psm will also be presented. interleukin- and - receptors play a central role in the activation, survival and death of t lymphocytes. they form supramolecular clusters with mhc i and ii glycoproteins in t cells. in damaged or inflamed tissues the extracellular k+ concentration increases, which can depolarize the membrane. the common signaling beta and gamma chains of il- / r are phosphorylated upon cytokine binding and get a permanent dipole moment, thus their conformation, interactions, mobility and activity may be sensitive to the membrane potential. we induced depolarization on ft . t lymphoma cells by increasing the ec. k+ level or by blocking kv . voltage gated k+ channels with margatoxin. fcs measuremens showed that the lateral mobility of fab-labeled il- / r and mhc i and ii decreased upon depolarization, while that of gpi-linked cd did not change. fret efficiency measured between some elements of the il-receptor/mhc cluster increased, which may reflect an increase of cluster size. il- -induced receptor activity, as monitored by measuring stat -phosphorylation, increased upon depolarization, whereas il- induced phosphorylation did not change. our results may reveal a novel regulatory mechanism of receptor function by the membrane potential. cytokines play an important role in t cell activation and immunological memory, whereas mhcs are known for the role in antigen presentation. we applied rnai to silence the expression of mhc i in order to study its possible role in receptor assembly and function. fret data indicated that the association of il- r and il- r with mhc i as well as between il- r and il- r weakened. fcs indicated an increase of receptor mobility also suggesting the partial disassembly of the clusters. mhc i gene silencing lead to a remarkable increase of il- /il- induced phosphorylation of stat transcription factors. in search for the molecular background of this inhibition of signaling by mhc i we checked il- binding and the formation of the receptor complex (il- r alpha -il- r beta association), but we did not find a difference as compared to the control. our results suggest that mhc i plays an organizing role in maintaining supramolecular receptor clusters and inhibits il- r signaling, revealing a nonclassic new function of mhc i beyond its classical role in antigen presentation. interleukin- (il- ) is an important cytokine involved in adaptive immunity. il- binds with high affinity the singlepass transmembrane receptor il- ra. the occupied complex, il- /il- ra then engages either il- rc or il- ra , to form an activated type i or ii receptor, respectively. this formation of heterodimers is believed to trigger cross-activation of intracellular janus kinases. here we follow a fluorescently labeled ligand through various stages of receptor activation in hek t: using fluorescence correlation spectroscopy (fcs), we see that the receptor chains diffuse as monomers within the plasma membrane. using dual-color fccs provides direct evidence for ligand induced co-diffusion of occupied il- ra and il- ra . in contrast, type i complexes containing il- rc could not be observed. however, ectopic expression of gfp-tagged il- rc/jak induced stable fluorescent speckles in or close to the plasma membrane. we identified these structures as early sorting endosomes by colocalization of surface markers like eea and rab gtpases. the il- ra chain is continuously trafficking into these compartments. these observations suggest that the formation of a type i il- r heterodimer may require internalization and that early endosomes serve as a platform for il- signaling. among the membrane associated proteins, the ras family, which is lipid-anchored g protein, plays a key role in a large range of physiological processes and, more importantly, is deregulated in a large variety of cancer. in this context, plasma membrane heterogeneity appears as a central concept since it ultimately tunes the specification and regulation of ras-dependent signaling processes. therefore, to investigate the dynamic and complex membrane lateral organization in living cells, we have developed an original approach based on molecule diffusion measurements performed by fluorescence correlation spectroscopy at different spatial scales (spot variable fcs, svfcs) ( ). we have shown in a variety of cell types that lipidbased nanodomains are instrumental for cell membrane compartmentalization. we have also observed that thesenanodomains are critically involved in the activation of signaling pathways and are essential for physiological responses ( - ). more recently, weextend the application of svfcs to characterizethe dynamics of k-rasproteinat the plasma membrane. as major result, we demonstrated that the rate of k-ras association/dissociation from the membrane is fast but vary as a functional of the activation state of the molecule as well as of specific intracellular protein interactions. we have so demonstrated that an helical lid sub-domain in the sbd is essential for monomeric as binding, but not for the anti-aggregation activity of the chaperone, suggesting that hsp is able to interact with pre-fibrillar oligomeric species formed during as aggregation and that, then, the mechanism of binding for these species is different from that of the monomeric protein. aggregation of the acylphosphatase from sulfolobus solfataricus (sso acp) into amyloid-like protofibrils is induced by the establishment of an intermolecular interaction between a -residue unfolded segment at the n-terminus and the globular unit of another molecule. we have used data from hydrogen/deuterium exchange experiments, intermolecular paramagnetic relaxation enhancements and isothermal titration calorimetry measurements on an aggregation-resistant sso acp variant lacking the -residue n-terminus to characterize the initial steps of the aggregation reaction. under solution conditions that favour aggregation of the wild-type protein, the truncated protein was found to interact with a peptide corresponding to the n-terminal residues of the full length protein. this interaction involves the fourth strand of the main b-sheet structure of the protein and the loop following this region and induces a slight decrease in protein flexibility. we suggest that the amyloidogenic state populated by sso acp prior to aggregation does not present local unfolding but is characterized by increased dynamics throughout the sequence that allow the protein to establish new interactions, leading to the aggregation reaction. amyloid-like aggregates alter the membrane mobility of gm gangliosides martino calamai and francesco pavone university of florence, lens -european laboratory for non-linear spectroscopy, sesto fiorentino, florence, italy neuronal dysfunction in neurodegenerative pathologies such as alzheimer's disease is currently attributed to the interaction of amyloid aggregates with the plasma membrane. amongst the variety of toxic mechanisms proposed, one involves the binding of amyloid species to gm gangliosides. gm takes part into the formation of membrane rafts, and exerts antineurotoxic, neuroprotective, and neurorestorative effects on various central neurotransmitter systems. in this study, we investigated the effects of amyloid-like aggregates formed by the highly amyloidogenic structural motif of the yeast prion sup (sup nm) on the mobility of gm on the plasmamembrane of living cells. preformed sup nm aggregates were incubated with cells and gm molecules were subsequently labeled with biotinylated ctx-b and streptavidin quantum dots (qds). single qds bound to gm were then tracked. the mobility of gm was found to decrease dramatically in the presence of sup nm aggregates, switching from brownian to mainly confined motion. the considerable interference of amyloid-like aggregates with the lateral diffusion of gm might imply a consequent loss of function of gm , thus contributing to explain the toxic mechanism ascribed to this particular interaction. insights into the early stages of fibrillogenesis of insulin using mass spectrometry harriet l. insulin is a vital hormone in metabolic processes as it regulates the glucose levels in the body. insulin is stored in the b cells of the pancreas as a hexamer, however its biologically active form is the monomer. the formation of fibrillar aggregates of insulin rarely occurs in the body; however localised amyloidosis at the site of injection for diabetes patients and aggregation of pharmaceutical insulin stocks present problems. in the current study oligomers formed early in the process of fibril assembly in vitro are observed by mass spectrometry (ms). ms is the only technique which allows early species to be characterised as it can identify different oligomeric orders by mass to charge ratio and show protein abundance and aggregation propensity. on mobility ms is used to examine rotationally averaged collision cross sections of oligomers in the aggregating solution. a wide array of oligomers is observed and the stability of specific species is remarked. the presence of multiple conformations for the highly charged oligomers is particularly noted and their assignment confirmed using fourier transform ion cyclotron resonance ms and collision induced dissociation. molecular modelling has been used to further explore the conformational space the oligomers inhabit. amyloid fibrils consisting of different proteins have been recognized as an accompanying feature of several neurodegenerative diseases. many proteins without known connection to any diseases have been found to form amyloid fibrils in vitro, leading to suggestion that the ability to form fibrils is the inherent property of polypeptide chain. the observed common character of protein amyloid formation enables to seek further clues of fibrillation mechanism by studying generalized sequenceless polypeptide models, e.g. polylysine. we have studied conformational transitions of polylysine, with different chain length at various ph, ionic strength and temperature by means of novel approachviscometric method. this polypeptide undergoes alfa-helix to beta-sheet transition and forms amyloid fibrils in special conditions. temperature induced a-helix to b-sheet transitions occurs at ph interval form to . and with increasing chain length is slightly shifted to the lower ph. we have found narrow ph interval, in which the thermal transition is fully reversible, suggesting the high sensitivity of polypeptide conformation on subtle changes in charge on its side chains. this work was supported within the projects vega , and , cex sas nanofluid and by esf project no. . understanding the mechanisms of the conversion from the native state of a protein to the amyloidal state represents a fundamental step in improving the purification, storage and delivery of protein-based drugs and it is also of great relevance for developing strategies to prevent in vivo protein aggregation. amyloid fibrils have a structural arrangement of cross b-sheet but they can also experience different packing into three dimensional superstructures, i.e. polymorphism. it is well known that, among others, both the geometric confinement of the molecules and shear forces can affect the final morphology of the aggregates. importantly, due to the complexity and crowding of the cellular region, such parameters also play a crucial role in in vivo processes. we present an experimental approach to study in vitro amyloid aggregation in a controlled and uniform shear force field and within microscale environments. in particular we focus on the effect of these two parameters on the formation of spherical aggregates, known as spherulites. using micro channels of different cross-sections from to lm x lm and flow rates in the range of hundreds of ll/min, the number and diameter of spherulites within the channels have been characterized using crossed polarizers optical microscopy. inhibition of insulin amyloid fibrillization by albumin magnetic fluid k. insulin amyloid aggregation causes serious problems for patient with insulin dependent diabetes undergoing long-term treatment by injection, in production and storage of this drug and in application of insulin pumps. recent studies indicate that protein amyloid aggregation causes the cell impairment and death; however, the prevention of amyloid aggregation is beneficial. we have investigated ability of albumin magnetic fluid (amf) to inhibit insulin amyloid aggregation by spectroscopic and microscopic techniques. albumin magnetic fluid consists of magnetic fe o nanoparticles sterically stabilized by sodium oleate and functionalized with bovine serum albumin (bsa) at various weight ratios bsa/fe o . we have found the positive correlation between inhibiting activity of afm and nanoparticle diameter and zeta potential. the ability of amf to inhibit formation of amyloid fibrils exhibits concentration dependence with ic values comparable to insulin concentration. the observed features make amf of potential interest as agents effective in the solving of problems associated with insulin amyloid aggregation. (this work was supported within the projects vega , and , cex sas nanofluid, apvv- - , sk-ro- - and esf project ). amyloid formation of peptides causes diseases like alzheimer's and parkinson's disease. however, the conditions for the onset of the neurotoxic beta-sheet formation are poorly understood. we focus on aggregation triggers and their interplay: interactions with hydrophobic-hydrophilic interfaces, orientation of peptides in d, metal ion complexation and lipid layers. the tailor-made model peptides exhibit defined secondary structure propensities and metal ion binding sites. the interactions of the peptide with the air-water interface and with metal ions are studied using surface sensitive methods connected to film balance measurements. x-ray diffraction, x-ray reflection, infrared reflection-absorption spectroscopy and total reflection x-ray fluorescence were applied to reveal the layer structure, peptide conformations and metal ion binding at the interface. we found that amyloid formation in d is dominated by the hydrophobic-hydrophilic interface and not comparable to the bulk behaviour. the interface can enhance or inhibit betasheet formation. the effect of metal ion complexation depends on the arrangement of the binding sites in the peptide and the preferred metal complexation geometry. the two triggers interface and metal ion complexation, can oppose each other. effect of apoe isoform and lipidation status on proteolytic clearance of the amyloid-beta peptide hubin, e. alzheimer's disease (ad) is the most common type of dementia in the elderly. the most important genetic risk factor identified for ad is the isoform, e , e or e , of apolipoprotein e (apoe), a lipid-carrying protein. one hallmark of ad is the accumulation of amyloid-beta peptide (ab) in the brain which is thought to result from an imbalance between the production of ab and its clearance. previous studies report an important role for apoe in ab degradation. we sought to determine the effect of apoe isoform and lipidation status on the degradation of soluble ab by proteinases such as insulin-degrading enzyme and neprilysin. in this study an in vitro ab clearance assay based on the competition between ab and a fluorogenic peptide substrate is developed to quantify ab degradation. to elucidate the proteolytic clearance mechanism, the fragments resulting from cleavage are identified by mass spectrometry and further analyzed to identify the interacting stretch of the ab sequence with the different apoe isoforms. the results suggest that apoe influences the rate of ab degradation. the aggregation of proteins into fibrillar nanostructures is a general form of behaviour encountered for a range of different polypeptide chains. the formation of these structures is associated with pathological processes in the context of alzheimer's and parkinson's diseases but is also involved in biologically beneficial roles which include functional coatings and catalytical scaffolds. this talk focuses on recent work directed at understanding the kinetics of this process through the development and application of experimental biophysical methods and their combination with kinetic theories for linear growth phenomena. lbs contains not only as, but also other proteins including - - proteins. - - proteins exist mainly as a dimer and its exact functions are remain unclear. however, recent work has shown that the association of - - (eta) with as in lbs. herein we show how - - (eta) can modulate as in vitro aggregation behavior, by rerouting it toward the formation of stable non-fibrillar aggregates. we also show that the resulting populations of fibrillar and pre-fibrillar aggregates exhibit a modified toxicity in vivo with respect to the unperturbed aggregates. interestingly, - - (eta) does not show any binding affinity for monomeric as, nor for the mature fibrillar aggregates. we provide evidence that it acts on the oligomeric species which form during the amyloidogenesis process of aggregation. since - - (eta) can influence the toxicity of amyloidogenesis without perturbing the functional as monomers, we are convinced that once fully understood, its mode of action could represent a promising model to mimic with synthetic drugs and peptides. what makes an amyloid toxic: morphology, structure or interaction with membranes? more than human diseases are related to amyloids. in order to understand why some amyloids may become toxic to their host and some others are not, we first developed a genetical approach in the yeast saccharomyces cerevisiae. we have chosen the amyloid/prion protein het-s prion forming domain ( - ) from the fungi podospora anserina, which is not toxic in yeast. some toxic amyloids mutants were generated by random mutagenesis. in vitro the most toxic mutant called ''m '' displays very peculiar nanofibers, which polymerized mainly in amyloid antiparallel b-sheets whereas the non-toxic wt exhibits a parallel polymerization. we further established the dynamic of assembling of the m toxic amyloid, in comparison to the wt non-toxic amyloid, and showed the presence of specific oligomeric intermediates also organized in antiparallel b-sheet structures. a more global structure/toxicity study on more than mutants clearly identified an antiparallel b-sheet signature for all the toxic mutants. therefore size, intermediates and antiparallel structures may account for amyloid toxicity in yeast but we still wonder what their cellular targets are. recently, we established the first evidences that toxic mutants may specifically bind in vitro to lipids, particularly negatively charged. interconnected mechanisms in abeta ( - ) we present an experimental study on the fibril formation of ab( - ) peptide at ph . . the kinetics of this process is characterized by the occurrence of multiple transient species that give rise to final aggregates whose morphology and molecular structure are strongly affected by the growth conditions. to observe in details the aggregation pathway as a function of solution conditions, we have used different experimental techniques as light scattering, thioflavin t fluorescence, circular dichroism and two-photon fluorescence microscopy. this approach gives information on the time evolution of conformational changes at molecular level, on the aggregates/fibrils growth and on their morphologies. the selected experimental conditions allowed us to highlight the existence of at least three different aggregation mechanisms acting in competition. a first assembly stage, which implies conformational conversion of native peptides, leads to the formation of small ordered oligomers representing an activated conformation to proceed towards fibril growth. this process constitutes the rate limiting step for two distinct fibril nucleation mechanisms that probably implicate spatially heterogeneous mechanisms. the formation of amyloid fibrils of amylin - was studied by means of molecular dynamics (md) and energy partition on three peptide ß-sheet stack systems with the same amino acid composition: wild type amylin - (amyl - ), reverse amylin - (rev-amyl - ) and scrambled amylin version scr-amyl - . the results show that for amylin - peptides, amino acid composition determines the propensity of a peptide to form amyloid fibrils independent of their sequence. the sequence of amino acids defines the shape and the strength of amyloid protofibril, which conforms with the atom force microscope (afm) data [ ] . md show that the x revamyl- - stack has looser selfassembly than the x amyl- - stack, which conforms with the results of fourier transform infrared spectroscopy (ftir) measurements for the peptides studied [ ] . the results of md show that x amyl- - could have a turn, which consists with ftir data [ ] . data on ab aggregation kinetics have been characterized by a large spread between experiments on identical samples that contain so many molecules that stochastic behaviour is difficult to explain unless caused by uncontrolled amounts of impurities or interfaces. we have therefore spent considerable effort to eliminate sources of inhomogeneity and reached a level of reproducibility between identical samples and between experiments on separate occasions that we can now collect data that can lead to mechanistic insights into the aggregation process per se, and into the mechanism of action of inhibitors. data on ab aggregation will be shown that give insight into the influence of physical parameters like peptide concentration, shear and ionic strength, as well as the effect of inhibitory proteins, model membranes and the effects of sequence variations. monte carlo simulations of amyloid formation from model peptides corroborate the finding from experiments and underscore that the very high level of predictability and reproducibility comes from multiple parallel processes. negatively-charged membranes were reported to catalyze ''amyloid-like'' fiber formation by non-amyloidogenic proteins [ ] . our study aims to elucidate the factors that govern the formation of these amyloid-like fibers. lysozyme was selected as a model of non-amyloidogenic protein and was fluorescently-labeled with alexa fluor (a -lz). first, a -lz partition towards phosphatidylserine-containing liposomes was characterized quantitatively using fluorescence correlation spectroscopy (fcs), in order to calculate the protein coverage of liposomes. secondly, the interaction between a -lz and negatively-charged lipid membranes was studied using both steady-state and time-resolved fluorescence techniques. this interaction was found to switch from a peripheral binding to the anionic headgroups, at high lipid/ protein molar ratio (l/p), to a partial insertion of protein into the hydrophobic core of the membrane, at low l/p. finally, the lipidprotein supramolecular complexes formed at low l/p were characterized by fluorescence lifetime imaging microscopy (flim). the mean lifetime of a -lz in these supramolecular structures is much lower compared to the values obtained for the free and bound a -lz at high l/p. the fiber characterization will be complemented by fcs studies. [ the conversion of normal prp c to its pathological isoform prp sc is a key event in prion diseases and is proposed to occur at the cell surface or more probably in acidic late endosomes. a convergence of evidence strongly suggests that the early events leading to the structural conversion of the prp seem to be in relation with more or less stable soluble oligomers, which could mediate neurotoxicity. as commonly shared by other amyloidogenic proteins, membrane-bound monomers undergo a series of lipid-dependent conformational changes, leading to the formation of oligomers of varying toxicity rich in b-sheet structures (annular pores, amyloid fibrils). here, we have used a combination of biophysical techniques (dynamic and static light scattering, fluorescence techniques, and quartz crystral microbalance) to elucidate the interaction of native monomeric prp and that of purified b-rich oligomeric prp on model lipid membranes. under well established conditions, three b-sheet-rich oligomers were generated from the partial unfolding of the monomer in solution, which were found to form in parallel. from single mutation and/or truncation of the full length prp, the polymerization pathway is strongly affected, revealing the high conformational diversity of prp. in our previous work, we identify the minimal region of the prp protein leading to the same polymerization pattern of the full length prp. soluble -subunits and -subunits oligomers were obtained depending on the single mutation or truncation and purified. we compare their structural properties (ftir, cd) when associated with anionic lipid bilayers and study their propensities to permeabilize the membrane. fluorescence kinetics suggest different mechanisms of membrane perturbation for the monomer and the prp oligomers. deciphering this complex network of lipid interactions and conformational diversity of the prp protein will help for understanding of how amyloidogenic proteins induce neurotoxicity. the traditional view of the lipid bilayer described as a ''sea'' of lipids where proteins may float freely, is going to be inadequate to describe the increasingly large number of complex phenomena which are known to take place in biological membranes. membrane-assisted protein-protein interactions, formation of lipid clusters, protein-induced variation of the membrane shape, abnormal membrane permeabilities and conformational transitions of membrane-embedded proteins are only a few examples of the variegated ensemble of events whose tightly regulated cross-talk is essential for cell structure and function. experimental work on the above mentionated problems is very difficult and some time not accessible, especially when the studied systems have a fast dynamics. due to the large size of the systems usually involved in this multifaceted framework, a detailed molecular description of these phenomena is beyond the possibilities of conventional amyloid aggregation, a generic behavior of proteins, is related to incurable human pathologies -amyloid-related diseases, associated with formation of amyloid deposits in the body. all types of amyloid aggregates possess rich bsheet structural motif. the recent data confirm the toxic effect of aggregates on the cells, however, it was found that reduction of amyloid aggregates plays important role in prevention as well as therapy of amyloidosis. we have investigated effect of phytoalexin derivatives on amyloid fibrillization of two proteins, human insulin and chicken egg lysozyme, by tht and ans fluorescence assays. we have found that amyloid aggregation of both studied proteins was significantly inhibited by phytoalexin derivates cyclobrassinin and benzocamalexin. for most effective phytoalexins the estimated ic values were at low micromolar concentration. the observed inhibiting activity was confirmed by transmission electron microscopy. our data suggest the potential therapeutic use of the most effective phytoalexins in the reduction of amyloid aggregation. (this work was supported within the projects vega , , cex sas nanofluid, apvv- - , sk-ro- - and esf project ). the amyloid pore hypothesis suggests that interactions of oligomeric alpha-synuclein (as) with membranes play an important role in parkinson's disease. oligomers are thought to permeabilize membranes and interfere with ca + pathways. permeabilization by as requires the presence of negatively charged phospholipids. whether as can bind and permeabilize membranes with physiologically relevant lipid compositions has not been extensively explored. here we report on the binding of as to giant unilamellar vesicles (guvs) with physiologically relevant lipid compositions. comparing different protocols of oligomer preparation, leakage assays on both large unilamellar vesicles (calcein release) and guvs (hpts efflux assay) show that as is not able to permeabilize these membranes. the presence of cholesterol has a stabilizing effect on these membrane systems. in agreement with these findings, we do not observe concentration dependent as toxicity using in vivo mts assays. however, in the calcein release assay, different as preparations show differences in kinetics and as concentrations that cause % leakage. these results motivate us to critically reassess the amyloid pore hypothesis, and suggest that membrane permeabilization may be attributable only to a very specific as species. alpha-synuclein oligomers impair membrane integrity-a mechanistic view martin t. stö ckl, mireille m. a. e. claessens, vinod subramaniam nanobiophysics, mesa+ institute for nanotechnology, university of twente, enschede, the netherlands one of the most prevalent neurodegenerative diseases is parkinson's disease (pd), which is accompanied with the loss of dopaminergic neurons. although the mechanisms leading to the death of these cells are still unclear, the protein alpha-synuclein (as) is one of the pivotal factors. previous studies indicate that especially oligomeric forms of as show a detrimental effect on membrane integrity. as an intact membrane is crucial to many cellular processes, the impairment of the membrane integrity is a likely pathway for neuronal death. we use different phospholipid bilayer model systems to investigate the mechanisms underlying this process. atomic force microscopy in combination with suspended asymmetric phospholipid bilayers, which closely mimic the plasma membrane, allows the identification of the binding sites, the measurement of penetration depths of the as oligomers into the phospholipid bilayer, and the detection of membrane thinning or creation of membrane defects. using an approach based on phospholipid vesicles we were able to observe for the first time that as oligomers cause an enhanced lipid-flip flop. suggesting that the loss of lipid asymmetry is a novel mechanism which may contribute to or trigger neuronal death in pd. amyloid protein-membrane interactions and structuretoxicity relationship study h.p. ta (toxic) has a much higher and more specific effect on negatively charged phospholipids (dops, dopi and dopg) than in the case of wt (non-toxic). therefore the insertion of protein into phospholipid monolayers, which occurred similarly for both wt and m , is not a key factor for these effects (h.p. ta et al. langmuir, in press). we are now using unilamellar vesicles as a membrane model to investigate the amyloid protein (toxic and nontoxic) -phospholipid interactions. results confirmed the high specific and strong effects of m on negatively-charged membrane. in this project, we study the chemical, physical and biological properties of fibrillar networks. the formation and the network mechanics are investigated by combining droplet-based microfluidics with optical microscopy and small angle x-ray scattering (saxs). the chosen system, fibrin network formation, plays an important role in blood coagulation processes. crosslinking of fibrinogen induced by an enzymatic reaction with thrombin leads to d fibrin network formation. the fibrillar networks are formed within picoliter droplets of aqueous solutions in an continuous oil phase. droplets containing fibrinogen and thrombin can be produced of different sizes and stored for fibrin network formation. the formation of the fibrillar networks is imaged by fluorescence microscopy. to analyze the elastic properties of the networks, the droplets flow through a microchannel device of alternating width in order to squeeze and stretch the networks. additionally, saxs experiments will give structural information about the molecular dimensions of the networks. the amyloid beta peptide (ab), implicated in alzheimer's disease (ad), is released from the amyloid precursor protein (app) by secretase-induced cleavage. this process results in the release of a range of ab peptides varying in length. the brains of ad patients often contain longer ab peptides while the total concentration of ab is unaffected. longer peptides are more hydrophobic having far-reaching consequences for their toxicity and aggregation. as ab is necessary for normal neuronal function, research activities into ad therapeutic development currently explore the possibilities of modulating c-secretase activity to produce short ab peptides. whether such an approach effectively ameliorates the toxic effect of ab has not been explored yet. to answer this question, we studied the impact of heterogeneity in ab pools in an in vitro biophysical and in cellulo context using microelectrode array to assay the synaptic activity of primary neurons. we show that various lengths of the ab peptide and mixtures thereof aggregate with distinct kinetics and notoriously affect synaptotoxic and cytotoxic response. we also show that small amounts of less abundant peptides ab and ab induce aggregation and toxicity of ab while the behavior of ab is unaffected. one of the most important irreversible oxidative modifications of proteins is carbonylation, a process of introducing the carbonyl group in reaction with reactive oxygen species. importantly, carbonylation increases with the age of cells and is associated with the formation of intracellular protein aggregates and the pathogenesis of age-related disorders such as neurodegenerative diseases and cancer. however, it is still largely unclear how carbonylation affects protein structure, dynamics and aggregability on the atomic level. here, we use classical molecular dynamics simulations to study structure and dynamics of the carbonylated headpiece domain of villin, a key actin-organizing protein. we perform an exhaustive set of molecular dynamics simulations of native villin headpiece together with every single combination containing carbonylated versions of its seven lysine, arginine and proline residues, the quantitatively most important carbonylable amino acids. surprisingly, our results suggest that high levels of carbonylation, far above those associated with cell death in vivo, may be required to destabilize and unfold protein structure through the disruption of specific stabilizing elements, such as salt bridges or proline kinks, or tampering with the hydrophobic effect. on the other hand, by using thermodynamic integration and molecular hydrophobicity potential approaches, we quantitatively show that carbonylation of hydrophilic lysine and arginine residues is equivalent to introducing hydrophobic, chargeneutral mutations in their place, and, by comparison with experimental results, demonstrate that this by itself significantly increases intrinsic aggregation propensity of both structured, native proteins and their unfolded states. finally, our results provide a foundation for a novel experimental strategy to study the effects of carbonylation on protein structure, dynamics and aggregability using site-directed mutagenesis. septins are an evolutionarily conserved family of gtp-binding proteins involved in important cellular processes, such as cytokinesis and exocytosis, and have been implicated in neurological diseases, such as alzheimer's and parkinson's diseases. the focus of this study was two septins of schistosoma mansoni, (the causative agent of schistosomiasis in south america) named smsept and smsept , which were produced in a recombinant system. our objective was to verify if these septins from a simpler organism display similar characteristics to human septins. analysis of protein structure by circular dichroism showed that both recombinant smseptins produced were folded. the gtpase activity assay showed that smsept was able to hydrolyze gtp, whereas smsept was not. aggregation studies for amyloid fibril detection by right angle light scattering and thioflavin t fluorescence assay were performed. both proteins showed a temperature dependent increase in light scattering and fluorescence emission of tht probe. this indicated that s. mansoni septins tend to aggregate into amyloid-like fibers in high temperatures, with thresholds of °c for smsept and °c for smsept . these results are in accordance to that previously reported for human septins. in our work we investigated the response to standard chemotherapy of blood lymphocytes of patients suffering with melanoma. dna single and double strand breaks were determined using comet assay; intracellular levels of marker proteins were detected using immunocytochemistry. ultimately this set of parameters allows to characterize two mechanisms of dna repair (base excision repair, ber and mismatch repair, mmr) which together with apoptosis proneness underlie response of tumor cells to chemotherapy. cell death caused by o mg adducts is promoted by mmr system by inducing unrepaired double strand breaks in dna. there was a linear correlation between the level of dsdna breaks in lymphocytes after -st cycle of chemotherapy and mmr efficiency in them. the level of double strand breaks in dna after -st cycle of chemotherapy is predictive of clinical outcome. otherwise damage at the level of ssdna (ap-sites and single strand breaks) and ber mechanism associated with it couldn't be a good prognostic factor of chemotherapy. high level of double strand breaks in dna in blood lymphocytes of melanoma patients hours after -st cycle of chemotherapy appears to be a marker of a good prognosis. self-assembly and stability of g-quadruplex: counterions, pressure and temperature effects e. baldassarri jr., p. mariani, f. spinozzi, m. g. ortore saifet dept. & cnism, marche polytechnic university, ancona, italy the important role of g-quadruplex in biological systems is based on two main features: composition and stability of telomeres, and activity of telomerase. the g-quadruplex structures are formed by supramolecular organization of basic units called g-quartets that are planar rings constituted by four guanosines linked by hoogsten hydrogen bonds. gquadruplex requires the presence of monovalent cations playing a key role in stabilizing these structures, since they give rise to coordination bonds needed for the stacking of more tetrads. we performed x-ray diffraction experiments at different pressures (ranging from to bar), and small angle x-ray scattering (saxs) changing the temperature (between - °c retinoic acid receptor (rar) is a member of the nuclear receptor superfamily. this ligand-inducible transcription factor binds to dna as a heterodimer with the retinoid x receptor (rxr) in the nucleus. the nucleus is a dynamic compartment and live-cell imaging techniques make it possible to investigate transcription factor action in real-time. we studied the diffusion of egfp-rar by fluorescence correlation spectroscopy (fcs) in order to uncover the molecular interactions determining receptor mobility. in the absence of ligand we identified two distinct species with different mobilities. the fast component has a diffusion coefficient of d = . - lm /s corresponding to small oligomeric forms, whereas the slow component with d = . - . lm /s corresponds to interactions of rar with the chromatin or other large structures. the rar ligand binding domain fragment also has a slow component probably as a result of indirect dna-binding via rxr, with lower affinity than the intact rar:rxr complex. importantly, rar-agonist treatment shifts the equilibrium towards the slow population of the wild type receptor, but without significantly changing the mobility of either the fast or the slow population. by using a series of mutant forms of the receptor with altered dna-or coregulator-binding capacity we found that the slow component is probably related to chromatin binding, and that coregulator exchange, specifically the binding of the coactivator complex, is the main determinant contributing to the redistribution of rar during ligand activation. formation of inactive nuclear with high level of dna compaction in sperm cells is accompanied by a substitution of linker histones h by a number of other proteins. among them sperm-specific histones (ssh), which are characterized by elongated arginine-rich polypeptide chain compared to the somatic h . the secondary and tertiary structure of the ssh and their interactions with dna were studied using spectroscopic and thermodynamic approaches. the histones were isolated from sperm of marine invertebrates and rat thymus. all studied ssh demonstrate no considerable compaction of dna in solutions of low ionic strength. however, at physiological conditions, ssh h from s.intermedius and a.japonica compact dna more intensively than other ssh. the somatic h from rat thymus revealed a minimal ability to compact the dna. we suggest that the ssh h are able to interact with dna not only in the major groove but also in the minor groove of the double helix inducing considerable structural changes in dna and facilitating the formation of the supercompact sperm chromatin. the authors are grateful for the financial support from the russian foundation for basic research (grants § - - and - - ) and from administration of saint-petersburg. ionizing radiation causes modification and destruction of nitrogenous bases in dna molecule. there are also local breakages of hydrogen bonds both in the lesion sites mentioned above and in other sites of the macromolecule. to reveal the amount of some of these damages we applied cd and uv absorption spectroscopy. radiation-induced changes in dna structure influence its uv absorption spectrum in different ways: partial denaturation causes hyperchromic effect, while destruction of the bases results in hypochromic shift. at the same time both of them result in the same changes in dna cd spectra: the decrease in intensity. we attempted to segregate the described damages in dna structure and studied the influence of dna ionic surroundings on the radiation effect. it is shown that the radiation efficiency of base destruction and partial denaturation increases with decreasing concentration of nacl in irradiated solution. udu (ugly duckling) has been first identified from a zebrafish mutant and shown to play an essential role during erythroid development; however, its roles in other cellular processes remain largely unexplored. facs analysis showed that the loss of udu function resulted in defective cell cycle progression and comet assay indicated the presence of increased dna damage in udu mutants. we further showed that the extensive p -dependent apoptosis found in udu mutants is a consequence of activation in the atm-chk pathway. udu appears not to be required for dna repair, because both wild-type and udu embryos similarly respond to and recover from uv treatment. yeast two-hybrid and coimmunoprecipitation data demonstrated that pah-l repeats and sant-l domain of udu interacts with mcm and mcm . furthermore, udu was colocalized with brdu and heterochromatin during dna replication, suggesting a role in maintaining genome integrity. recently, we started to work on the second zebrafish homolog, udu , and its mammalian counterpart, gon l. preliminary data showed that udu and gon l mrna injection can rescue zebrafish udu mutant phenotypes. furthermore, pah-l and sant-l domains of udu and gon l can bind to mcm and mcm and they are localized in the nucleus. these data suggest that udu and gon l are functionally equivalent to zebrafish udu. their molecular mechanism leading to udu phenotypes is currently under investigation. chromatin condensation: general polyelectrolyte association and histone-tail specific folding nikolay korolev, nikolay berezhnoy, abdollah allahverdi, renliang yang, chuan-fa liu, james p. tam, lars nordenskiö ld school of biological sciences, nanyang technological university, nanyang drive, , singapore the major component of chromatin, dna, is a densely charged polyanion. electrostatic interactions between dna and dnapackaging proteins contribute decisively to formation of its elementary unit, the nucleosome, and are also important in chromatin folding into higher-order structures. we investigate condensation of dna and chromatin and find that electrostatics and polyelectrolyte character of dna play dominant role in both dna and chromatin condensation. by comprehensive experimental studies and using novel oligocationic ligands, we suggest simple universal equation describing dna condensation as a function of oligocation, dna and monovalent salt concentrations and including the ligand-dna binding constant. we found that similar dependence was also observed in condensation of the nucleosome arrays. next, we studied how general electrostatic and specific structural alterations caused by lysine acetylations in the histone tails influence formation of -nm chromatin fibre and intermolecular nucleosome array association. for the first time, a structural model is suggested which explains critical dependence of chromatin fibre folding on acetylation of the single lysine at position of the histone h . exceptional importance of the h lys acetylation in general and gene specific transcriptional activation has been known for many years but no structural basis for this effect has yet been proposed. detection of specific dna sequences is central to modern molecular diagnostic. ultrasensitive raman measurements of nucleic acids are possible through exploiting the effect of surface-enhanced raman scattering (sers). in this work, the sers spectra of genomic dnas from leaves of different apple trees grown in the field, have been analyzed [ ] . a detailed comparative analysis of sers signatures of genomic dnas is given. sers wavenumbers (cm - ) are reported here for all types of vibrations of plant genomic dnas, including bands assigned to localized vibrations of the purine and pyrimidine residues, localized vibrations of the deoxyribosephosphate moiety, etc. proposed band assignments are given. a strong dependence of the sers spectra on dna concentration and on time have been observed. in biochemical fields, nucleic acids might be used to explore the interaction between dna and small molecules, which is important in connection with probing the accurate local structure of dna and with understanding the natural dnamediated biological mechanisms [ ] . the ph-dependent structure of dna studied by fourier transform infrared spectroscopy the region of the infrared spectrum studied covered the wave number range from cm - to cm - . ir spectra show that in ph . - . interval carbonyl (c=o) band at - cm (assigned to guanine) is reduced in intensity and slightly shifted to lower frequencies. at the ph . significantly decreases band intensity at cm - due to unbounded c =o of thymine and shifts to lower frequencies, indicating at the transition of this group in bounded form, supposedly by means of excess polarized hydroxyl ions. together this, in basic region a new intense absorption band has been observed in - cm - frequency interval, corresponding to o-h group in-plane bending vibration ( - cm - ). as for acidic conditions, it was observed that under the extreme ph (* . ) value carbonyl absorption region shifts to higher frequencies and absorption intensity significantly increased, indicated at releasing of c=o groups from h-bonding between base pairs. moreover, bands intensity at cm - and cm - corresponding to out-of-plan deformation of nh groups increased due to rupture of connections between the dna strands. during the last decade it was found that in many cases specific structural organization of multi-molecular protein and dna-protein complexes determines their functioning in living cells. although these functioning structures are usually unique, it is often possible to identify their common structural elements. one of the interesting examples of such universal elements are hmgb domains: structurally conservative functional domains of non-histone proteins hmgb / also identified in many nuclear proteins. using afm, thermodynamic approaches, circular dichroism and molecular absorption spectroscopy in far-uv and mid-ir regions we have studied structural organization of the complexes between dna and different proteins, including hmgb , hmgb-domain recombinant proteins and linker histone h . we have demonstrated, that interaction with dna leads to increasing both a-helicity of the proteins and thermal stability of dna. also, this interaction may result in formation of highly ordered supramolecular complexes facilitated by hmgb-domains. the c-terminal sequence of hmgb / regulates affinity of the proteins to dna and can be ''inactivated'' by interaction with histone h . based on the data obtained a model of the interaction of multy-domain hmgb-proteins with dna is suggested. darmstadt, germany, lmu biozentrum; munich, germany *these authors contributed equally to this work chromatin in living cells displays considerable mobility on a local scale. this movement is consistent with a constrained diffusion model, in that individual loci execute multiple, random jumps. to investigate the connection between local chromatin diffusion (lcd) and the changes in nuclear organization, we established a stable hela cell line expressing gfp-pcna. this protein, a core component of the replication machinery, serves as a cell-cycle marker and allows us to visualize sites of ongoing dna synthesis within the nucleus. to monitor lcd, we labeled discrete genomic loci through incorporation of cy -dutp. this experimental system, in conjunction with particle tracking analysis, has enabled us to quantitatively measure chromatin mobility throughout the cell cycle. our results demonstrate that lcd is significantly decreased in s-phase. to explore the connection between dna replication and reduced chromatin movement, we undertook a more detailed examination of lcd in s-phase nuclei, correlating chromatin mobility with sites of replication. our results demonstrate that labeled chromatin in close proximity to gfp-pcna foci exhibit significantly decreased mobility. we therefore conclude that presence of active replication forks constrains the movement of adjacent chromatin. single-molecule studies of dna replication antoine m. van oijen zernike institute for advanced materials, groningen university, nijenborgh . ag groningen, the netherlands e-mail: a.m.van.oijen@rug.nl advances in optical imaging and molecular manipulation techniques have made it possible to observe individual enzymes and record molecular movies that provide new insight into their dynamics and reaction mechanisms. in a biological context, most of these enzymes function in concert with other enzymes in multi-protein complexes, so an important future direction will be the utilization of single-molecule techniques to unravel the orchestration of large macromolecular assemblies. we are applying a single-molecule approach to study dna replication. i will present recent results of single-molecule studies of replication in bacterial and eukaryotic systems. by combining the stretching of individual dna molecules with the fluorescence observation of individual proteins, we visualize the dynamic interaction of replication factors with the fork. in the bacteriophage t replication system, we show that dna polymerases dynamically associate with and dissociate from the fork during replication. further, i will present new data from single-molecule replication studies in x. laevis oocyte extracts. we have developed a novel imaging scheme that permits single-molecule fluorescence experiments at concentrations of labeled protein that were hitherto inaccessible. using this method, we visualize, in real time, origin firing and fork movement. in force-extension diagrams of reference models of naked dna (freely jointed chain, wormlike chain) as well as extensionrotation diagrams of naked dna have been successfully recovered. of note, plectonemic structures are most efficiently simulated thanks to ode's collision detection code. new insights into nucleosome and chromatin fiber structure and dynamics will be presented. the study of the pkm. plasmid effect on the repair of dna j. vincze , i, francia , g. vincze-tiszay hheif, budapest, hungary, univ. debrecen, debrecen, hungary in our experiments was studied the effect of pkm. plasmid on repair of single strand breaks in dna induced by cogamma irradiation in e.coli k ab (wild type) and its different rec mutant cells. the pkm. resistant-factor in case of the control decreases the sensitivity of radiation, which as we suppose, is reached by the help of dna conformation change. it can be supposed from the well known effect of radiation biology that by the effect of pkm. , the ratio of dna radiation sensitive volumes by appearing its new conformation decreases. the pkm. r-factor in rec mutants in case of gamma irradiation shows effects in two ways. one is the ''chemical'' connection between the r-factor and dna, though the other relate to positive and negative ''induced'' radiation resistance from the local type effect of the connection of an r-factor and a rec mutant, and the two radiation resistant effects are added algebraically. as a result from the view of biology we have to categorize the radiation resistance and the connected repair processes as two different classes according to the change either in the chemical or in the induced radiation resistant effect. recent studies have indicated that two trimethylated peptides (k , k ), derived from the parental hybrid peptide ca( - )m( - ), strongly interact with a bacterial membrane model (mixture of zwitterionic and negatively charged lipids), but not with a membrane model of mammalian erythrocytes (zwitterionic lipids) [ ] . a reduction of the cytotoxicity effect and an improvement of the therapeutic index have also been reported for the derivatives when compared with the parental ca( - )m( - ) [ ] . in this work, with the aim of providing insight on the interaction phenomena of the indicated peptides with zwitterionic and negatively charged membrane models, a systematic molecular dynamics study was carried out. full hydrated bilayers of dmpc:dmpg ( : ) and pope:popg( : ) were studied in the presence of each peptide, and results analyzed in terms of peptide structure and membrane composition. lipid-water and lipid-lipid interactions at the membrane/water interface play important role in maintaining the bilayer structure, however, this region is not easily available for experimental studies. we performed molecular dynamics simulations of two bilayers composed of two different types of lipids: ( ) dioleoylphosphatidylcholine (dopc); ( ) galactolipid monogalactosyldiacylglycerol (mgdg). to investigate the properties of the membrane/water interface region, we performed analysis of lipid-lipid interactions: direct, via charge pairs (dopc) and hydrogen bonds (mgdg) as well as indirect, via water bridges. we also examined water-lipid interactions. existence of well-defined entities (lipids) linked by different types of interactions (hydrogen bonds, charge pairs, water bridges) makes the analysis of the membrane/ water interface region a suitable for a graph theoretical description. we applied a network analysis approach for comparative analysis of simulated systems. we note a marked difference between the organization as well as the dynamics of the interfacial region of the two bilayers. l-nucleoside analogues form an important class of antiviral and anticancer drug candidates. to be pharmacologically active, they need to be phosphorylated in multiple steps by cellular kinases. human phosphoglycerate kinase (hpgk) was shown to exhibit low specificity for nucleotide diphosphate analogues and its catalytic efficiency in phosphorylation was also affected. to elucidate the effect of ligand chirality on dynamics and catalytic efficiency, molecular dynamics simulations were performed on four different nucleotides (d-/l-adp and d-/l-cdp) in complex with hpgk and , -bisphospho-d-glycerate (bpg). the simulation results confirm high affinity for the natural substrate (d-adp), while l-adp shows only moderate affinity for hpgk. the observed short residence time of both cdp enantiomers at the active site suggests very weak binding affinity which may result in poor catalytic efficiency shown for hpgk with d-/l-cdp. analysis of the simulations unravels important dynamic conditions for efficient phosphorylation replacing the single requirement of a tight binding. using the van der waals density functional based on the semilocal exchange functional pw together with a longrange component of the correlation energy [ ] implemented in the siesta program code, we have calculated the band structure of the double stranded dna. the unit cell was built by taking together gc (or at) homogenous base pairs and we have considered the translational symmetry as the periodic boundary condition. the results obtained are compared with the oligomer calculations taking up to seven base pairs. the band structure obtained with this van der waals density functional is also compared with results obtained with other exchange-correlation functionals as well as with band structure obtained by the hartree-fock crystal-orbital method taking into account the helical symmetry of the double stranded dna. the role of different parts of dna (base pairs, sugar-phosphate backbone, na ions) is also presented. transmembrane (tm) proteins comprise some % to % of the proteome but owing to technical difficulties, relatively few of these structures have been determined experimentally. computational modeling techniques can be used to provide the essential structural data needed to shed light on structurefunction relationships in tm proteins. low-resolution electrondensity maps, obtained from cryo-electron microscopy (cryo-em) or preliminary x-ray diffraction studies, can be used to restrict the search in conformational space. at the right resolution, the locations of tm helices can be roughly determined even when the amino acids are not visible. when these data are combined with physicochemical characteristics of amino acids (such as their hydrophobicity) and with evolutionary conservation analysis of the protein family, the location of the amino acids can be modeled. the modelstructure may provide molecular interpretations of the effects of mutations. moreover, it can be used to suggest molecular mechanisms and to design new mutations to examine them. the overall approach will be demonstrated using two human proteins: copper transporter (ctr ), which is the main copper transporter in the human cell, and the kda translocator protein (tspo) of the outer mitochondria. modelstructures of these proteins and their functional implications in health and disease will be discussed. calcium channels play a crucial role in many physiological functions and their selectivity mechanism is still an unresolved question and a subject of debate. a physical model of selective ''ion binding'' in the l-type calcium channel is constructed, and consequences of the model are compared with experimental data. this reduced model treats only ions and the carboxylate oxygens of the eeee locus explicitly and restricts interactions to hard-core repulsion and ion-ion and ion-dielectric electrostatic forces. according to the charge/space competition mechanism, the charge of structural ions attracts cations into the filter, while excluded volume effects are trying to keep them out. this is a competition between energy and entropy, where the balance of these terms minimizes free energy and determines selectivity. experimental conditions involving binary mixtures of alkali and/or alkaline earth metal ions are computed. the model pore rejects alkali metal ions in the presence of biological concentrations of ca + and predicts the blockade of alkali metal ion currents by micromolar ca +. conductance patterns observed in varied mixtures containing na+ and li+, or ba + and ca +, are predicted. ca + is substantially more potent in blocking na+ current than ba +. in apparent contrast to experiments sing buffered ca + solutions, the predicted potency of ca + in blocking alkali metal ion currents depends on the species and concentration of the alkali metal ion, as is expected if these ions compete with ca + for the pore. these experiments depend on the problematic estimation of ca + activity in solutions buffered for ca + and ph in a varying background of ulk salt. equilibrium binding affinity (expressed as the occupancy of the selectivity filter by various ions) is computed by equilibrium grand canonical monte carlo (gcmc) simulations. the conductivity of the channel is estimated from the equilibrium concentration profiles using the integrated nernst-planck equation. our simulations show that the selectivity of l-type calcium channels can arise from an interplay of electrostatic and hard-core repulsion forces among ions and a few crucial channel atoms. the reduced system selects for the cation that delivers the largest charge in the smallest ion volume. we have also performed dynamic monte carlo (dmc) simulations for a model ca channel to simulate current directly and present our results for the dynamical selectivity (expressed as the flux carried by various ions). we show that the binding affinity of ca + versus na+ is always larger than the dynamical selectivity because ca + ions are tightly bound to the binding site of the selectivity filter of the channel and, at the same time, their mobility and drift velocity is smaller in this region. carotenoids are used in light-harvesting complexes with the twofold aim to extend the spectral range of the antenna and to avoid radiation damages. the effect of the polarity and conformation of the environment is supposed to be responsible for the tuning of the electronic, optical and vibrational properties of peridinin carotenoid both in solution and in protein matrix. we investigate the effect of vibrational properties of peridinin in different solvents by means of vibrational spectroscopies and qm/mm molecular dynamics simulations . the shift of vibrational fingerprints in the - cm - frequency region, due to the solvent polarity and proticity, is studied in three cases: cyclohexane (apolar/aprotic), deuterated acetonitrile (polar/aprotic) and methanol (polar/ protic). the frequencies and vibrational modes of the carbonyl, the allene, and the polyene chain were identified using effective normal mode analysis and compared with the present and previous experimental data . on the basis of our calculations and experiments in different solvents, we propose a classification of the four peridinins of the high-salt pcp form. the controlled self-assembly of functional molecular species on well defined surfaces is a promising approach toward the design of nanoscale architectures. by using this methodology, regular low-dimensional systems such as supramolecular clusters, chains, or nanoporous arrays can be fabricated. small biological molecules such as amino acids represent an important class of building blocks that are of interest for molecular architectonic on surfaces because they inherently qualify for molecular recognition and selfassembly [ ] . the interaction between amino acids and solid surfaces is decisive for the development of bioanalytical devices or biocompatible materials as well as for a fundamental understanding of protein-surface bonding. we investigate the adsorbtion mechanism of the cysteine on au( ) surfaces by means of the dft [ ] . our main concern is to describe the molecule-metal bonding mechanism. therefore we present a complex study, including the full determination of the density of states for the free and adsorbed molecule, the determination of molecule-surface bonding energy. the method of crystal orbital overlap populations is used in order to determine the contribution of specific atomic orbitals to the molecule-metal bond. it is now widely accepted that myoglobin (mb) is not simply an o storage/delivery system but, depending on oxygen concentration, it exerts other fundamental physiological roles. recent studies revealed a widespread expression and, in particular, an over-expression in response to hypoxia, in various non-muscle tissues, including tumor cells. in human five different mb isoforms are present. the two most expressed ([ %) differ only at the th position, k (mb-i) and e (mb-ii) respectively. since high-altitude natives from tibet are characterized by a higher mb concentration and locomotion efficiency, together with the observation that the mb overexpression is totally attributable to mb-ii, the idea that the latter might be one of the responses to high-altitude evolutionary adaptation, i.e. hypoxic environment, started to emerge. however, this is not yet supported by any structure/function investigation. we performed hundred nanoseconds md simulations on human mbs to investigate the structure and dynamics of both protein and surface water. important differences have been protein kinases play key roles in cell signaling and constitute crucial therapeutic targets. in normal cell, upon substrate binding, tyrosine kinase receptor kit undergoes extensive structural rearrangement leading to receptor dimerization and activation. this process is initiated by the departure of the juxta membrane region (jmr) from the active site, allowing the activation loop (aloop) deployment. the deregulation of kit activity is associated with various forms of cancer provoked by abnormalities in signal transduction pathways. mutations v g (jmr) and d h/v (a-loop) have been reported as oncogenic and/or drug-resistant. to contribute further in the understanding of kit activation/ deactivation mechanisms, we applied a multi-approach protocol combining molecular dynamics (md), normal modes analysis (nma) and pocket detection. disturbing structural effects, both local (a-loop) and long-range (jmr), were evidenced for kit d h/v in the inactive state. nma showed that jmr is able to depart its position more easily in the mutants than in the wild type. pockets analysis revealed that this detachment is sufficient to open an access to the atp binding site. our results provided a plausible conception of mutant dimerization and a way to explore putative allosteric binding sites. transmembrane association of leukocyte integrin heterodimer might be mediated by a polar interaction choon-peng chng and suet-mien tan biophysics group, a*star institute of high performance computing, and, school of biological sciences, nanyang technological university, republic of singapore the lateral association of transmembrane (tm) helices is important to the folding of membrane proteins as well as a means for signaling across the cell membrane. for integrin, a hetero-dimeric protein important for cell adhesion and migration, the association of its a-and b-subunits' tm helices plays a key role in mediating bi-directional mechanical signaling across the membrane. we found evidence from experiment and simulation for a polar interaction (hydrogenbond) across leukocyte integrin alb tm that is absent in the better-studied platelet integrin aiibb [ ] . our coarse-grained molecular dynamics simulations of tm helix-helix selfassembly showed more native-like packing achieved by alb within the simulation timescale as compared to its 'lossof-function' b t g mutant or aiibb [ ] . association free energy profiles also showed a deeper minimum at a smaller helix-helix separation for alb , suggestive of tighter packing. the likely conservation of this polar interaction across the b integrin family further reinforces its importance to the proper functioning of leukocyte integrins. active extrusion of drugs through efflux pumps constitutes one of the main mechanisms of multidrug resistance in cells. in recent years, large efforts have been devoted to the biochemical and structural characterization of rnd efflux pumps in gram-negative bacteria, in particular the acrb/a-tolc system of e.coli. specific attention has been addressed to the active part of the efflux system, constituted by the acrb unit. despite the presence of several data, crucial questions concerning its functioning are still open. the understanding of the structure-dynamics-function relationship of mexb, the analogous transporter in p. aeruginosa, encounters even more difficulties, because of the lack of structural data of the transporter in complex with substrates. to shade some light on the activity of mexb, we performed computational studies on mexb interacting with two compounds, meropenem and imipenem, the first known to be a good substrate, and the second a modest one. several techniques were used in the present work, ranging from flexible docking [ ] to standard and targeted molecular dynamics (md) simulations. starting from the published crystal structure [ ] we identified the most probable poses of the two compounds in both the original experimental and in the md-equilibrated structures. we used information from acrb binding pocket in order to find relevant binding sites of the two compounds in the analogous binding pocket of mexb. meropenem frequently lies with appropriate orientation in a pocket similar to the one identified for doxorubicin in acrb [ ] , while the occurrence of imipenem poses in the same pocket is very scarce. additionally, when present in the pocket, imipenem is located in a position that renders very unlikely its extrusion toward the oprm docking domain during the simulation of the functional peristalsis. the analysis of the trajectories has provided a complete inventory of the transporter and antibiotic hot spots, which is key information in terms of screening and design of antibiotics and inhibitors. clathrins are three-legged proteins with the intriguing ability to self-assemble into a wide variety of polyhedral cages. the nucleation and growth of a clathrin lattice against the cytosolic face of a cell membrane enables the endocytosis of membrane proteins and various external molecules, by wrapping the membrane around the cargo to produce a coated transport vesicle within the cell. clathrins can also self-assemble, in slightly acidic solutions devoid of auxiliary proteins, into empty cages. our simulations of this process, using a highly coarsegrained model, indicate that the key to self-assembly is neither calthrin's characteristic puckered triskelion shape, nor the alignment of four legs along all cage edges, but an asymmetric distribution of interaction sites around the leg's circumference. based on the critical assembly concentration, the binding strength in these cages is estimated at to k b t per clathrin. the simulations also answer the long-standing conundrum of how flat patches of purely hexagonal clathrin lattices, which in cryo electron microscopy are frequently seen to decorate cell membranes, can produce highly curved cages containing twelve pentagonal faces interdispersed between hexagonal faces. we present experimental evidence supporting this pathway. in eukaryotic cells, the exchange of macromolecules between the cytoplasm and the nucleus is mediated by specialized transport factors. by binding to these transporters, cargo molecules, which are otherwise excluded from entering the nucleus, can traverse the nuclear pore complex efficiently. most of the proteins mediating nuclear import and export exhibit a characteristic _-solenoid fold, which provides them with an unusual intrinsic flexibility. crm is an essential nuclear export receptor, which recognizes a very broad range of export cargoes. crm -dependent nuclear export is ran-gtpase-driven, and recognition of rangtp and cargo is highly cooperative. however, recent crystal structures show that the binding sites for export cargos and rangtp are located at distant parts of crm [ ] [ ] [ ] . we have used a combined approach of all-atom molecular dynamics simulations and small-angle x-ray scattering to study rangtp and cargo binding to crm . we have found that the allosteric effect in crm -dependent nuclear export arises from a combination of subtle structural rearrangements and changes in the dynamic properties of crm . light-induced phototactic responds in green algae chlamydomonas reinhardtii are mediated via microbial-type rhodopsins, termed channelrhodopsin- (chr ) and channelrhodopsin- (chr ) , which carry the chromophore retinal covalently linked to lysin via a schiff base and were shown to be directly light-gated ion-channels . the n-terminal putative seven-transmembrane region of chr was shown to be responsible for the generation of photocurrents and exhibits sequence similarity to the well understood proton pump bacteriorhodopsin (br) and the sensory rhodopsin anabaena sensory rhodopsin (asr) . as for the majority of membrane proteins, there is no d-structural data for chr available yet. here we present homology models of chr using two high-resolution x-ray template structures of br ( qhj ) respectively asr ( xio ) in order to get structural and functional insights into chr . with both homology models we performed molecular dynamics (md) simulations in a native membrane/solvent environment using gromacs . . . comparison of energetic and structural results revealed obvious advantages of the br-based homology model of chr . here we show that the br-based homology model is a reliable model of chr exhibiting structural features already found experimen-tally . our br-based homology model of chr allows predictions of putative crucial residues within chr . so we proposed several mutations within the chr sequence which are already accomplished. electrophysiologic and spectroscopic studies of these mutations are underway in order to confirm the functional relevance of these residues and to contribute to an optimized usage of chr as a powerful tool in optogenetics. ( neuroglobin is a recently discovered globin protein predominantly expressed in brain. its biological function is still elusive. despite the fact that neuroglobin shares very little sequence homology to the well-known globins as mioglobin and hemoglobin, they all have a characteristic globin fold with heme molecule bound to the distal pocket. the structural investigations and co binding kinetics revealed existence of cavities and tunnels within the protein matrix, where small ligands can be stored even for hundreds of microseconds [ ] . in human neuroglobin there is one internal disulfide bond possible which existence is proved to have significant effect on ligand affinity [ ] . in this study effects of temperature, ph, distal histidine mutation and presence of disulfide bond on co rebinding to neuroglobin are investigated by flash photolysis experiments. in parallel, the molecular dynamics simulations are performed in corresponding conditions in order to investigate structural change of neuroglobin and especially changes in distribution of internal tunnels and cavities able to bind diatomic ligands in response to different physical conditions listed above. the thrombospondin family, being extracelluar proteins, is known to be implicated in various physiological processes such as wound healing, inflammation, angiogenesis and neoplasia. the signature domain of thrombospondins shows high sequence identities and thus allows us to transfer results obtained, studying this complex calcium reach part of the proteins, from one member of the family to the other. the domain is known to play a key role in hereditary diseases such as psach or med. in this part of thrombospondins lies a binding site to integrins, important for cell attachment. it is further known that the lectin like globe binds to cd- , a feature known to be important in cancer research. as the theoretical unit we are trying to resolve these problems by numerical means and are constantly challenged by the size, where thrombospondin can be a huge trimeric protein as one strand can measure kda, and the large variety of subdomains found in this proteins. we are thus facing a multiscale problem which can range from solving, by means of quantum mechanics a specific ion binding site, to large scale abstraction by continuum mechanics. in our talk we will show you our newest results that we obtained by simulating calcium rich c-terminal domain which is known to be conserved across the entire family, and give you an outlook into the future of our research. the process of swift heavy ions energy deposition while penetrating a solid or scattering on its surface can result in a strong and nonequilibrium excitation of matter. an extremely localized character of this excitation, meanwhile, can make possible both selective changes in chemistry of matter and its surface nanomodifications . since possible applications have been found in bio-and it-technologies (cancer curing and nanostructuring respectively) in the last decade, the heavy ion bombardment technique has attracted a lot of scientific interest , . the processes of fast energy deposition into the solid and its further dissipation, however, are essentially perturbed with highly excited and nonequilibrium state of both lattice and electron systems. at such conditions therefore, the precision in treatment of processes of electron thermalization, fast electron heat conduction, and phase transformation of the overheated solid becomes crucial. having several physical models to handle the mentioned processes, it is nevertheless difficult to describe all of them within a scale of a single computational approach. our work is aimed on elaboration of the atomistic-continuum model of heavy ion bombardment of solids. in particular, the model will be applied to study the formation of nanohillocks in the experiments on swift heavy ion xe + scattering on srtio surface . [ ] g. aquaporins are protein channels located across the cell membrane with the role of conducting water or other small sugar alcohol molecules (aquaglyceroporins). the presence of the human aquaporin (hsaqp ) in cells proximal to airinteracting surfaces (eyes, lacrimal glands, salivary glands, lungs, stomach etc.) suggest its potentially important role in ''wetting'' these surfaces. the high-resolution x-ray structure of the hsaqp tetramer (pdb code d s) exhibits two important features: (i) lack of the four fold symmetry, common in most of the aquaporins, and (ii) occlusion of the central pore by a phosphatidylserine lipid tail. in this study we investigate the importance of these two features on the transport properties of the human aqp by means of molecular dynamics simulations. we found that the asymmetry in the tetramer leads to a distribution of monomeric channel structures characterized by different free energy landscapes felt by the water molecules passing through the channel. furthermore, the structures' distribution is influenced both by the presence/absence of the lipid tail in the central pore, and by the lipid composition of the bilayer that solvates the hsaqp tetramer. elucidating the modular structure of the protein g c fragment and human igg fc domain binding site using computer simulations hiqmet kamberaj faculty of technical sciences, international balkan university, skopje, r. of macedonia protein-protein recognition plays an important role in most biological processes. although the structures of many protein-protein complexes have been solved in molecular detail, general rules describing affinity and selectivity of proteinprotein interactions break down when applied to a larger set of protein-protein complexes with extremely diverse nature of the interfaces. in this work, we will analyze the non-linear clustering of the residues at the interface between proteins. the boundaries between clusters are defined by clustering the mutual information of the protein-protein interface. we will show that the mutations in one module do not affect residues located in a neighboring module by studying the structural and energetic consequences of the mutation. to the contrary, within their module, we will show that the mutations cause complex energetic and structural consequences. in this study, this is shown on the interaction between protein g c fragment and human igg fc domain by combining molecular dynamics simulations and mutual information theory, and computational alanine scanning technique. the modular architecture of binding sites, which resembles human engineering design, greatly simplifies the design of new protein interactions and provides a feasible view of how these interactions evolved. the results test our understanding of the dominant contributions to the free energy of protein-protein interactions, can guide experiments aimed at the design of protein interaction inhibitors, and provide a stepping-stone to important applications such as interface redesign. membrane proteins can form large multimeric assemblies in native membranes that are implicated in a wide range of biological processes, from signal transduction to organelle structure. hydrophobic mismatch and membrane curvature are involved in long range forces largely contributing to such segregation. however, the existing assembly specificity is thought to be coded in the atomic details of protein surface and topology. these are best described in high resolution structures and atomistic molecular dynamics simulations. in order to explore more systematically such forces and energetics arising at intermediate time scales and resolution, we use coarse grained molecular dynamics simulations applied to membrane systems spanning over to us. as a first glimpse we study how proteins induce lipid perturbations using a previously developed conformational entropy estimator. we show that in the model membrane where hydrophobic mismatch is present, lipid perturbations extend up to * a from the protein surface. however, significant variations in perturbation profiles are seen. parameters such as protein shape, surface topology, and amino acid physicochemical properties are studied to discover the parameters governing such perturbations. crossing energy barriers with self-guided langevin dynamics gerhard kö nig, xiongwu wu, bernard brooks national institutes of health, national heart, lung and blood institute, laboratory of computational biology, rockville, md, usa even with modern computer power, the applicability of molecular dynamics simulations is restricted by their ability to sample the conformational space of biomolecules. often high energy barriers cause normal molecular dynamics simulations to stay trapped in local energy minima, leading to biased results. to address this problem, self-guided langevin dynamics (sgld) were developed. it enhances conformational transitions by accelerating the slow systematic motions in the system. this is achieved by calculating the the local average of velocities and adding a guiding force along the direction of systematic motions. thus, the efficiency of sgld is governed by three factors: a.) the friction constant involved in the langevin dynamics b.) the local averaging time and c.) the guiding factor that determines the guiding force. however, the guiding force also causes deviations from the original ensemble that have to be corrected by reweighting the data, thus decreasing the efficiency. here, we explore the three-dimensional parameter space of sgld for several benchmark systems with particularly rough energy surfaces. based on our data, we supply guidelines for the optimal selection of sgld parameters, to allow the extension of our method to other biological problems of interest. propagation of d v/h mutation effects across kit receptor e. laine, i. c. de beauchê ne, c. auclair and l. tchertanov lbpa, cnrs -ens de cachan, france receptor tyrosine kinases (rtks) regulate critical biological processes. constitutive activation of rtks provokes cancers, inflammatory diseases and neuronal disorders. biological data evidenced that oncogenic mutations of the rtk kit, located either in the juxtamembrane region (jmr) or in the activation loop (a-loop) -as is the case of d v/h, displace the equilibrium of conformational states toward the active form. we present a multi-approach study that combines molecular dynamics (md), normal modes (nm) and pocket detection to characterize and compare the impact of d v/h on the structure, dynamics and thermodynamics of kit. we have evidenced a local structural destabilization of a-loop induced by the mutation and a long-range effect on the structure and position of jmr. we have further correlated these observations with experimental data and decipher some details about the activation mechanisms of the mutants, involving leverage of the jmr negative regulation and release of an access to the catalytic site. through the identification of ''local dynamic domains'' and the recording of interactions within the protein, we propose a model of the mutational effects propagation, which highlights the importance of both structural distortion and local conformational fluctuations. investigation of biological active azolidinones and related heterocycles refer to one of the most successful scientific projects of dh lnmu. it is based on three strategic vectors: organic synthesis, pharmacological research, rational design of ''drug-like'' molecules (including in silico approaches). while applying the research strategy we succeeded in gaining a number of interesting results that make possible to extend the field, especially in the scope of ''drug-like'' molecules design, notably it has focused on the search of new anticancer agents. anticancer activity screening was carried out for more than compounds (us nci protocol (dtp) based on obtained directed library over new compounds, among them compounds showed high activity level. for the purpose of optimization and rational design directions of highly active molecules with optimal ''drug-like'' characteristics and discovering of possible mechanism of action sar, compare analysis, molecular docking and qsa(p)r were carried out. obtained results allowed to form main directions of possible anticancer activity mechanisms, which probable are apoptosis-related. nowadays the investigation of cellular and molecular aspects of anticancer effects is in progress. regulation of (bio)chemical reactions by mechanical force has been proposed to be fundamental to cellular functions [ , , ] . atomic force microscopy experiments have identified the effect of mechanical force on the reactivity of thiol/disulfide exchange, a biologically important reaction [ ] . in order to understand the influence of the force at an atomistic level, we have performed hybrid quantum mechanicsmolecular mechanics (qm/mm) transition path sampling simulation of the reaction under external forces. the results of the simulations supported the experimental findings and demonstrated that the location of the transition state on the free energy surface was shifted due to force [ ] . in contrast to our findings, however, a recent experimental study suggests only a weak coupling between the mechanical force and the reaction rate [ ] . in this study, the reactants were covalently linked to a stilbene molecule. in this system a force can be applied by photo-isomerization from the relaxed trans to the strained cis configuration. a drawback of this system is that one cannot easily determine the forces that acting on the reaction coordinate. therefore, we have developed a force distribution analysis method for quantum mechanical molecular dynamics simulations. the results of the analysis show how isomerization of stilbene alters the forces acting on the reacting atoms. the force distribution is essential for understanding how chemistry is controlled by external forces. [ conformational space modelling (csm) is a promising method for membrane protein structure determination. it is based on the concept of the side-chain conformational space (sccs), which is formed by the allowed side-chain conformations of a given residue. each sccs can be calculated from a d structure or measured via epr-sdsl experiments. for structure determination a set of singly spinlabelled mutants is needed. the final structure is obtained by altering an initial (possibly random) d structure until the best fit between the calculated and measured sccs for the whole set is found. such optimization is computationally intensive; therefore csm includes several empirical approximations. one of them describes the effect of the lipid tails on the sccs. the implementation is not trivial as lipids diffuse in the plane of the membrane and the lipid tails behave differently at different membrane depths. to unravel this relationship adaptive biasing force md simulations were used. an alanine peptide helix was made in silico, spin-labelled at the middle and inserted perpendicularly into a dmpc membrane. the free energy of the spin-label orientation at various membrane depths was calculated. a d free energy surface describing the membrane ''depth'' effect was obtained. it is known that b-cyclodextrins (bcds) are able to modify the cholesterol content of cell or model membranes. however the molecular mechanism of this process is still not resolved. using molecular dynamics simulations, we have been able to study the bcd-mediated cholesterol extraction from cholesterol monolayers and lipid-cholesterol monolayer mixtures. we have investigated many conditions that would affect this process (e.g. lipid-cholesterol ratio, lipid chain unsaturation level) our results can be summarized as follow: i) dimerization of bcds, ii) binding of the dimers to the membrane surface assuming either a tilted (parallel to the membrane normal) or untilted ( °respect to the normal of the membrane) configuration, iii) the latter configuration is suitable to extract cholesterol at a reasonable computational time scale ( - ns), however, this process may be affected by unfavorable energy barriers (from to kj mol - ), iv) desorption of the complex brings cholesterol in solution, v) the bcd-cholesterol complex is able to transfer cholesterol to other membranes. with a clearer understanding of the basic molecular mechanism of the bcd mediated process of cholesterol extraction, we can begin to rationalize the design of more efficient bcds in numerous applications. the mechanism of the complex formation of biopolymers with ligands including the solvent molecules is an actual problem of modern biophysical and biological science. polypeptides form a secondary structure and mimic the motifs of the protein architecture. the study of complexation between polypeptides and solvent molecules leads to deeper understanding of the basic interaction of proteins with environment at atomic level. besides polypeptides are promising for the development of applications which encompass some of the following desirable features: anti-fouling, biocompatibility, biodegradability, specific biomolecular sensitivity. on this account polypeptides have a great significance for a variety modern applications ranging from the nanoscale medicine devises up to food technology and others. we compare the results of calculations of complexes between helical polypeptides (polyglutamic acid in neutral form and poly-c-benzyl-l-glutamate) and water molecules at dft pbe level and the results of ftir-spectroscopic study of the film of wetted polypeptides. vibrational spectroscopy is one of the most useful experimental tools to study non-covalent bonded complexes, and calculated spectra in comparison with experimental data are reliable test for reality of simulated complexes. platelet aggregation at the site of vascular injury is vital to prevent bleeding. excessive platelet function, however, may lead to thrombus formation after surgery. therefore, an accurate measure and control of platelet aggregation is important. in vitro platelet aggregometry monitors aggregate formation in platelet rich plasma triggered by agonists such as adp, epinephrine or collagen. the fraction of aggregated platelets is plotted versus time, and platelet function is assessed by analyzing the plot's morphology. we propose new measures of platelet function based on a compartmental kinetic model of platelet aggregation induced by adp. our model includes three compartments: single, aggregated and deaggregated platelets. it is simpler than earlier models and agrees with experimental data. the model parameters were determined by non-linear least squares fitting of published data. we associated the kinetic parameters with the activity of the adp receptors p y and p y . to this end, we studied published data obtained in the presence and in the absence of specific antagonists of these receptors. comparison of kinetic parameters of healthy subjects with those of patients with myeloproliferative disorder (mpd) shows that the function of p y is significantly reduced in mpd. coarse-grained modeling of drug-membrane interactions manuel n. melo, durba sengupta, siewert-jan marrink groningen biomolecular sciences and biotechnology institute, university of groningen, groningen, the netherlands the martini coarse-grained (cg) forcefield was used to simulate the actions of the antimicrobial peptide alamethicin and of the anti-tumoral drug doxorubicin. both drugs were shown to interact strongly with a fluid phospholipid bilayer, and aggregate there, in agreement with experimental results. because doxorubicin may establish intermolecular h-bonding, and thus lower its dipole moment, the cg representation of a doxorubicin dimer was adjusted. this less polar dimer was then tested for translocation and/or pore formation. contrary to results of atomistic simulations, alamethicin aggregates did not spontaneously open pores. they did so, however, when the size of the water beads was decreased. several small independent pores could then be observed. the magnitude of the permeability of these pores is analyzed and compared to experimental values. the occurrence of multiple small pores could indicate that the different conductance levels experimentally observed for alamethicin might simply result from the association of different numbers of these small pores. polarization refers to the asymmetric changes in cellular organization in response to external or internal signals. neuronal polarization begins with the growth of a single neurite shortly after cell division, followed by the growth of a second neurite at the opposite pole. this early bipolar shape is critical for brain function, as it defines axis of migration and consequently proper three dimensional organization and nerve circuitry. however, it is not known if a direct relationship exists between the formation of the second, opposite, neurite and the mechanisms involved in the formation of the first. we tackled this issue through mathematical modeling, based on membrane traffic (exocytosis-endocytosis), and lateral diffusion. with this approach, we demonstrated that a single pole of molecular asymmetry is sufficient to induce a second one at the opposite side, upon induction of growth from the first pole. our work gives mathematical proof that the occurrence of a single asymmetry in a round cell is sufficient to warrant morphological bipolarism. trypsin is one of the best characterized serine proteases. enzyme acylation process is required for substrate degradation. there is a lot of information about how this process undergoes. however, in order to obtain a more detailed description of the catalytic triad mechanism, a qm/mm approach was used. we used the hybrid qm/mm potential implemented in amber package. in the qm calculations a dft hamiltonian was used. we develop an approach based on the adaptively biased md in order to obtain the free energy surface of the conformational space defined by the reaction coordinates. this approach presents some characteristics of steered md and umbrella sampling procedures. our results offer information about the lowest energy trajectory, the barrier profile of the reaction, and the geometry of the transition state. this method also provides a further insight into the role of specific residues in the reaction. substituting asp , member of the catalytic triad, for ala we were able to detect an increase of the barrier profile. this was due to the loss of the interaction of carbonyl group of asp with nd of his , which make ne of this residue a worst proton acceptor. this results show our approach as a valuable method in the study of enzymatic mechanisms. the intracellular media comprise a great variety of macromolecular species that are not individually concentrated, but being preset in the same compartment they exclude each other's volume and produce crowding. crowding has a profound impact on protein structure and determines conformational transitions and macromolecular association. we investigated macromolecular association on a % w/w bovine serum albumin (bsa) solution by time-domain terahertz (thz) spectroscopy and molecular modeling. molecular crowding was simulated by including two bsa molecules in the same water box. we generated * such dimeric models, computed their thz spectra by normal modes analysis and compared them with the experimental data. the best bsa dimer model was selected based on the agreement with the experiment in the lowest frequencies domain of up to thz. symmetry constraints improve accuracy of ion channels models. application to two-pore-domain channels adina l. ion channels are important drug targets. structural information required for structure-based drug design is often filled by homology models (hm). making accurate hm is challenging because few templates are available and these often have substantial structural differences. besides, in molecular dynamics (md) simulations channels deviate from ideal symmetry and accumulate thermal defects, which complicate hm refinement using md. we evaluate the ability of symmetry-constrained md simulations to improve hm accuracy, using an approach conceptually similar to casp competition: build hm of channels with known structure and evaluate the efficiency of various symmetry-constrained md methods in improving hms accuracy (measured as deviation from x-ray structure). results indicate that unrestrained md does not improve accuracy, instantaneous symmetrization improves accuracy but not stability during subsequent md, while gradually imposing symmetry constraints improves both accuracy (by - %) and stability. moreover, accuracy and stability are strongly correlated, making stability a reliable criterion in predicting hm accuracy. we further used this method to refine hm of trek channels. we also propose a gating mechanism for mechanosensitive channels that was further experimentally confirmed. nucleotide modifications and trna anticodon-mrna codon interactions on the ribosome olof allné r and lennart nilsson karolinska institutet, stockholm, sweden molecular dynamics simulations of the trna anticodon and mrna codon have been used to study the effect of the common trna modifications cmo u and m a . in trna val these modifications allow all four nucleotides to be successfully read at the wobble position in a codon. previous data suggest entropic effects are mainly responsible for the extended reading capabilities but detailed mechanisms have remained unknown. the aim of this paper is to elucidate the details of these mechanisms on an atomic level and quantify their effects. we have applied: extensive free energy perturbation coupled with umbrella sampling, entropic calculations of trna (free and bound to the ribosome) and thorough structural analysis of the ribosomal decoding center. human neuroserpin (hns) is a serine protease inhibitor (serpin) of a tissue-type plasminogen activator (tpa). the conformational flexibility and the metastable state of this proteins underlies to misfolding and to dysfunctional mutations causing a class of rare genetic diseases which share the same molecular basis. the conformational transition of the native form, triggered upon the cleavage at reactive center loop (rlc), releases a complex of the cleaved form bound to the inactivated target protease. without rlc cleavage a stable inactive latent form can be obtained by intra/inter molecular loop insertion leading to polymerization. this work concerns the study of the three above mentioned forms of hns by md simulations to investigate the relation between their conformational stability and. the starting native and cleaved configurations are based on the x-ray structure, while the latent form is here modelled. the results of the simulation reveal a striking conformational stability along with the intrinsic flexibility of selected regions of the fold.the analysis of the essential collective modes of the native hns shows that the initial opening of the b-sheet a coincides with several changes in the local pattern of salt bridges and of hydrogen bonds. regulation of ubiquitin-conjugating enzymes: a common mechanism based on a pattern of hydrophobic and acidic residues enzyme temperature adaptation generally involves a modulation of intramolecular interactions, affecting proteins dynamics, stability and activity [ ] [ ] . in this contribution, we discuss studies of different classes of extremophilic enzymes, focusing on cold-adapted variants, as well as their mesophilic-like mutants, performed by all-atom molecular dynamics simulations with particular attention to structural communication among residues within the threedimensional architecture [ ] [ ] . common adaptation strategies turned out to be based on improved local flexibility in the proximity of the functional sites, decrease in interconnected electrostatic interactions, and modulation of correlated motions and networks of communicating residues. specific differences related to the diverse protein folds can also be detected. bneurexins and neuroligins are cell adhesion molecules and play important role in synapse junction formation, maturation and signal transduction between neurons. mutations in genes coding these proteins occurs in patients with cognitive diseases like autism disorders, asperger syndrome and mental retardation [ ] . it has been found that the complex bneurexin-neuroligin has also an important role in angiogenesis [ ] . herein we will present molecular foundations of bneurexin-neuroligin interactions obtained from all-atom molecular dynamics simulations of bneurexin, neuroligin and their complex ( b q) [ ] . ns md trajectories (charmm force field) were analyzed and roles of ca + and n-actetyl-d-glucosamine posttranslational modifications in intermolecular interactions were scrutinized. advances in hardware and software have enabled increasingly long atomistic molecular dynamics simulations of biomolecules, allowing the exploration of processes occurring on timescales of hundreds of microseconds to a few milliseconds. increasing the length of simulations beyond the microsecond time scale has exposed a number of limitations in the accuracy of commonly employed force fields. such limitations become more severe as the size of the systems investigated and the length of the simulations increase. here i will describe the force field problems that we have encountered in our studies, how we identified and addressed them, and what we have learned in the process about the biophysics of the systems we are investigating. while the quest for a ''perfect'' force field is not over (and may never be), our work has greatly improved the accuracy and range of applicability of simple physics-based force fields, to the point that reliable predictions can now be obtained from millisecond-timescale simulations of biomolecules. local anesthetics (la) are pain-relief drugs, widely used in medicine and dentistry. the relatively short duration of analgesia still restricts their clinical use for the treatment of chronic pain. nowadays, intensive research is focused on anesthetics entrapped into liposomes to enhance their activity and pharmacokinetic properties [ ] . in this work, we investigated the encapsulation of prilocaine (plc), an aminoamide local anesthetic, into a small unilamellar liposome. on the line of our previous work [ ] , we have carried out molecular dynamics (md) simulations using a coarse grain model up the microsecond time scale. in this way, we compare the effects of the concentration of la at fisiological ph. we were able to capture important features of the plc-vesicle interactions. the behavior of plcs at fisiological ph is essentially a combination of high and low ph: we found that all neutral plc molecules rapidly diffuse into the hydrophobic region of the vesicle adopting an asymmetric bimodal density distribution. protonated plc molecules (pplc) initially placed in water were instead only found on the external monolayer, with a high rate of exchange with the water phase and no access to the inner part of the liposome in a concentration dependent way. we focus on applications of molecular and mesoscale simulation methodologies to the cellular transport process of endocytosis, i.e., active transport mechanisms characterized by vesicle nucleation and budding of the cell membrane orchestrated by protein-interaction networks, and functionalized carrier adhesion to cell surfaces. we discuss theoretical and computational methodologies for quantitatively describing how cell-membrane topologies are actively mediated and manipulated by intracellular protein assemblies. we also discuss methods for computing absolute binding free energies for carrier adhesion. we present rigorous validation of our models by comparing to diverse range of experiments. the importance of delta-opioid receptors as target of a large number of drugs is well recognized, but the molecular details of interaction and action of the compounds are largely unknown. in an effort to shade some light on this important issue we performed an extensive computational study on the interaction of two compounds, clozapine and desmetilclozapine, with a delta-opioid receptor. according to experiments, the lacking of a single methyl group in desmetilclozapine with respect of clozapine makes the former more active than the latter, providing a system well suited for a comparative study. we investigated stable configurations of the two drugs inside the receptor by simulating their escape routes by metadynamics, an algorithm that allows the simulation of events that are otherwise out of range for standard molecular dynamics simulations. our results point out that the action of the compound is related to the spatial distribution of the affinity sites it visits during its permanency. desmetilclozapine has a larger distribution of residues, which is interacting with, than clozapine. however, large conformational changes of the receptor were not observed in the presence of both compounds. thus, a more dynamical picture of ligand-receptor affinity is proposed on the basis of the results obtained, involving the competition among different stable states as well as the interaction with the solvents. such information might be useful to provide hints and insights that can be exploited in more structure-and-dynamics-oriented drug design. the coupling between the mechanical properties of enzymes and their biological activity is a well-established feature that has been the object of numerous experimental and theoretical works. in particular, recent experiments show that enzymatic function can be modulated anisotropically by mechanical stress. we study such phenomena using a method or investigating local flexibility on the residue scale, which combines a reduced protein representation with brownian dynamics simulations. we performed calculations on the enzyme guanylate kinase in order to study its mechanical response when submitted to anisotropic deformations. the resulting modifications of the protein's rigidity profile can be related to the changes in substrate binding affinities that were observed experimentally. further analysis of the principal components of motion of the trajectories shows how the application of a mechanical constraint on the protein can disrupt its dynamics, thus leading to a decrease of the enzyme's catalytic rate. eventually, a systematic probe of the protein surface led to the prediction of potential hotspots where the application of an external constraint would produce a large functional response both from the mechan- hiv- protease autocatalyses its own release from gag and gagpol precursor polyproteins into mature functional proteins. as it is functional in the dimeric form, whilst initially, only a single monomer is embedded within each gagpol chain, the question arises as to what cut's the cutter. two individual monomers in different gagpol chains are known to come together to form an embedded-dimer precursor protease. mature-like protease activity is concomitant with n-terminal intramolecular cleavage of this transient embedded-dimer precursor, but how this crucial maturation-initiating step is physically achieved, has remained unknown. here, we show via all-atom explicit solvent molecular dynamics simulation runs of ns each that the n-terminal of an immature-like protease, with the n-terminal initially unbound as in the gagpol polyprotein, can self-associate to the active site and therefore be cleaved under conditions of thermodynamic equilibrium, identifying possible binding pathways at atomic resolution, in agreement with previous indirect experimental evidence [ ] . the binding pathway predominantly makes use of the open conformation of the beta-hairpin flaps characterised by us previously [ ] , and the n-terminal binds across the entire active site in good agreement with crystal structures of a cleavage-site peptidebound protease. the n-terminus serves two roles, firstly in the maturation of the protease itself by self-associating to the protein and then as a stabilizing component of the dimer interface in a mature protease. targeting the prior mechanism could be the focus of a novel therapeutic strategy involving immature protease inhibition. [ knotted proteins are the object of an increasing number of experimental and theoretical studies, because of their ability to fold reversibly in the same topologically entangled conformation. the topological constraint significantly affects their folding landscape, thus providing new insight and challenges into the funnel folding theory [ ] . recently developed experimental methods to trap and detect knots have suggested that denaturated ensembles of the knotted proteins may be knotted [ ] . we present numerical simulations of the early stage of folding of the knotted proteins belonging to protein families mtase (methyltransferase) and sotcase (succinyl-ornithine transcarbamylase), and of their unknotted homologues [ ] . our results show that native interactions are not sufficient to generate the knot in the denaturated configurations. however, when non-native interactions are included we observe formation of knots only in the chains whose native state is knotted. in addition, we find that the knots are formed through a universal mechanism. such a knot formation mechanism correctly predicts the fraction of the knotted proteins found in nature and can be used to make qualitative prediction on their folding times. shape and motility and also for numerous signaling processes. adhesion is based on non-covalent interactions between transmembrane proteins and the extracellular matrix. cells are able to create two-dimensional assemblies of integrins, so called focal adhesions, which they use to stick to the substrate and collect information about the environmental properties. the goal of this work is a deeper understanding of the formation and the stability of these adhesion clusters. bond cluster formation and disintegration is dynamically modeled with the aid of monte carlo simulations. in the model, a membrane is attached to a flat surface via a variable number of adhesion bonds. the spacial configuration of these adhesion points subjected to an inhomogeneous stress field maps into a distribution of local membrane/ surface distances. we introduce a model which explicitly accounts for the membrane elasticity and demonstrate that such models are able to explain the spontaneous formation of adhesion bond clusters. structure based models are successful at conjugating the essence of the energy landscape theory of protein folding [ ] with an easy and efficient implementation. recently their realm expanded beyond single protein structure, been used profitably to widely study large conformational transitions [ ] [ ] . still, when dealing with conformational transition between two well-defined structures an unbiased and realistic description of the local backbone and sidechain interactions is necessary. the proposed model merges a precise description of these interactions with a structure-based long range potential that takes into account different conformers. we present the results of the activation of the catalytic domain of human csrc tyrosine kinase for which we reconstructed the transition free energy and the description of the activation loop flexibility. the excellent model performances in terms of speed and the satisfactory accuracy of the description of the system and its flexibility are promising for a more systematic study of the tyrosine kinase family activation mechanisms. [ we introduce a previously undescribed technique for modelling the kinetics of stochastic chemical systems. we apply richardson extrapolation, a sequence acceleration method for ordinary differential equations, to a fixed-step tau-leaping algorithm, to produce an extrapolated tau-leaping method which has weak order of accuracy two. we prove this mathematically for the case of linear propensity functions. we use four numerical examples, two linear and two nonlinear, to show the higher accuracy of our technique in practice. we illustrate this by using plots of absolute error for a fixed-step tau-leap and the extrapolated tau-leap. in all cases, the errors for our method are lower than for a fixedstep tau-leap; in most cases they are second order of accuracy. the major tripartite efflux pump acrab-tolc is responsible for the intrinsic and acquired multidrug resistance in escherichia coli. at heart of the extrusion machinery there is the homotrimeric transporter acrb, which is in charge of the selective binding of structurally and chemically different substrates and energy transduction. the effects of conformational changes, which have been proposed as the key features of the extrusion of drugs, are investigated at molecular level using different computational methods like targeted molecular dynamics. simulations, including almost half a million atoms, have been used to assess several hypotheses concerning the structure-dynamics-function relationship of the acrb protein. the results indicate that, upon induction of conformational changes, the substrate detaches from the binding pocket and approaches the gate to the central funnel. in addition, we provide evidence for the proposed peristaltic transport involving a zipper-like closure of the binding pocket, responsible for the displacement of the drug. using these atomistic simulations the role of specific amino acids during the transitions can be identified, providing an interpretation of sitedirected mutagenesis experiments. additionally, we discuss a possible role of water molecules in the extrusion process. virus inhibitory peptide (virip), a amino acid peptide, binds to the fusion peptide (fp) of human immunodeficiency virus type (hiv- ) gp and blocks viral entry. molecular dynamics (md) and molecular mechanics/poisson-boltzmann surface area (mm/pbsa) free energy calculations were executed to explore the binding interaction between several virip derivatives and gp fp. a promising correlation between antiviral activity and simulated binding free energy was established thanks to restriction of the flexibility of the peptides, inclusion of configurational entropy calculations and the use of multiple internal dielectric constants for the mm/pbsa calculations depending on the amino acids sequence. bases on these results, a virtual screening experiment was carried out to design enhanced virip analogues. a selection of peptides was tested for inhibitory activity and several improved virip derivatives were identified. these results demonstrate that computational modelling strategies using an improved mm/pbsa methodology can be used for the simulation of peptide complexes. as such, we were able to obtain enhanced hiv- entry inhibitor peptides via an mm/pbsa based virtual screening approach. an essential step during the hiv life cycle is the integration of the viral cdna into the human genome. hiv- integrase mediates integration in a tight complex with the cellular cofactor: ledgf/p [ ] . disruption of the interaction interferes with hiv replication and therefore provides an interesting new drug target for antiretroviral therapy [ , ] . here we present the structure based discovery and optimization of a series of small molecule inhibitors that bind to hiv- integrase and block the interaction with ledgf/p . the work flow was set up according to a funnel principle in which a series of virtual screening tools were applied in such way to discard at each step molecules unlikely to be active against the desired target (including d filtering, pharmacophore modelling and molecular docking) the activity and selectivity of the selected molecules were confirmed in an alpha screen based assay, that measure protein-protein interaction in vitro, and furthermore by in vivo experiments. active compounds proceeded towards crystallographic soaking into the receptor protein crystals. these crystal structures not only validated the binding mode and activity of the hit compounds, but furthermore were used as a platform for structure based drug design which resulted in the rational discovery of new hit compounds and optimized lead compounds. in vitro and in vivo experiments validated the mechanism of action of these compounds and show that they are a novel class of antiretroviral compounds with in vivo inhibitory activity by targeting the interaction between ledgf/p and hiv- integrase. cross resistance profiling indicate that these compounds are active against current resistant viral strains. [ ] currently the most potent inhibitors show an in vivo ic of nm. these compounds are promising for future pharmaceutical optimizations to be used in the clinic as new antiretroviral agents. crystallography was used to validate the binding mode of the discovered inhibitors. insights in the interaction of the ligand-protein complex allowed for rational design of optimized inhibitors. ligand-induced structural changes are small, thermal fluctuations can play a dominant role in determining allosteric signalling. in thermodynamic terms, the entropy change for subsequent binding is influenced by global vibrational modes being either damped or activated by an initial binding event. one advantage of such a mechanism is the possibility for long range allosteric signalling. here, changes to slow internal motion can be harnessed to provide signalling across long distances. this paper considers homotropic allostery in homodimeric proteins, and presents results from a theoretical approach designed to understand the mechanisms responsible for both cooperativity and anticooperativity. theoretical results are presented for the binding of camp to the catabolite activator protein (cap) [ ] , where it is shown that coupling strength within a dimer is of key importance in determining the nature of the allosteric response. results from theory are presented along side both atomistic simulations and simple coarse-grained models, designed to show how fluctuations can play a key role in allosteric signalling in homodimeric proteins. [ reversibly switchable fluorescent proteins (rsfps) can be switched between a fluorescent (on) and a nonfluorescent (off) state which is accompanied by a cis-trans isomerization of the chromophore on the molecular level , . this unique property has already provided new aspects to various microscopy techniques, such as high resolution microscopy, fcs or monochromatic multicolor microscopy - . despite of their established potential, rsfps still have a major drawback: the wavelength for fluorescence excitation is always one of the two switching wavelengths. the imaging process thus inevitably results in the switching of a small fraction of the rsfps, which might hinder or complicate some experiments. we developed a new reversibly switchable fluorescent protein which eliminates the problem of the coupling between switching and fluorescence excitation. this fluorescent protein follows an unusual and currently unknown mechanism of switching between a fluorescent and a nonfluorescent state. it is brightly fluorescent and exhibits an excellent signal to noise ratio. in parallel studies [ ] , qd-based ligands (egf, mabs) were targeted to egfr in gliomas. cell-cultures, animal models and ex vivo biopsies of human high-grade as well as low-grade gliomas showed high probe specificity.. the aim is to define more precisely the tumor boundaries at the time of resection. we used the programmable array microscope designed for sensitive, high-speed optical sectioning, particularly of living cells. the pam is based on structured illumination and conjugate detection using a digital micromirror device (dmd) [ ] located in a primary image plane. the unique feature is the rapid, (re)programmable adjustment of local excitation intensity, dynamic, on-the-fly optimization is thus achieved, e.g. multipoint frap [ ] , light exposure minimization and object tracking [ ] , or super-resolution strategies. the features and operation of the rd generation pam will be presented. contraction of muscle cells, motility of microorganisms, neuronal activity, and other fast cellular processes require microscopic imaging of a three-dimensional ( d) volume with a video-rate scanning. we present d video-rate investigations of structural dynamics in biological samples with the multicontrast third-and second-harmonic generation as well as fluorescence microscope. the multidepth scanning is achieved by two combined laser beams with staggered femtosecond pulses. each of the beams is equipped with a pair of deformable mirrors for dynamic wavefront manipulation enabling multidepth refocusing with simultaneous corrections for optical aberrations. combined, more than frames per second lateral scanning with fast refocusing enables the d video-rate imaging of dynamically moving structures. in addition, combination of two laser beams is accomplished at two perpendicular polarizations enabling live imaging of sample anisotropy, which is important for structural studies particularly with the second harmonic generation microscopy. investigations of beating chick embryo hearts with the d video-rate scanning microscope revealed multidirectional cardiomyocyte contraction dynamics in myocardial tissue. intricate synchronization of contractions between different layers of myocytes in the tissue will be presented. the video-rate d microscopy opens new possibilities of imaging fast biological processes in living organisms. confocal fluorescence microscopy is an invaluable tool to study biological systems at cellular level also thanks to the synthesis of always new specific fluorescent probes, e.g. multiprobe labelling enables complex system characterization. however, only the recent employment of narrowband tunable filters overcomes the problems due to the use of the broadband ones. the possibility of acquiring the emission spectra in a spatially resolved way extends simple image intensity studies into characterization of complex probeenvironment relationship through the sensitivity of fluorescence spectra to the local molecular environment differences. consequently, fluorescence microspectroscopy (fms) is able to provide the spectral information in a well defined spatial region allowing the researcher to simultaneously obtain spatial and spectroscopic information. our instrument has been specially built to study live cells and their interaction with nanomaterials, drug carriers and modified cell environment. other main characteristics are reducing the bleaching effect and employing a white light source that does not limit the use to specific probes. graphical tools, such as colour coded images, have also been introduced to provide explicit and straightforward visual information. high speed fpga based multi-tau correlation for singlephoton avalanche diode arrays jan buchholz , , jan krieger we demonstrate the use of fret-imaging (forster resonance energy transfer) as an assay to directly monitor the dynamics of cross-bridge conformational changes in single fibres of skeletal muscle. we measured nm-distances of several fret pairs located at strategic positions to sense myosin head conformational changes: we focused our attention on the essential light chain, elc (specifically labelling a modified elc and exchanging it with the natural elc of the fibre) and we investigated its interaction with the sh helix, with the nucleotide binding pocket and with actin. we characterized fret in single rigor muscle fibres, determining distances in agreement with those from the crystallographic data. the results demonstrate the viability of the approach in sensing different fret efficiencies over a few nanometres, an essential requirement to follow the expected small fret variations in contracting muscle fibres. we are now performing dynamic experiments on rigor and active fibres by applying small stretch/release cycles to alter the interaction distances (estimated time resolution of nearly ms/frame). in this configuration, it will be possible to measure functional changes, shedding light on the myosin head dynamics during contraction. focal stimulation of cultured neurons is crucial since it mimics physiological molecules release. indeed, the nervous system finely tunes the activity of each synapse by regulating the secretion of molecules spatially and temporally. currently used techniques have some drawbacks such as a poor spatial resolution or a low flexibility. we propose a novel approach based on optical tweezers (ot) [ ] to overcome these limitations. ot allow an ease manipulation with sub-micrometric precision of silica beads, which can be functionalized with any protein. for a proof-of-principle study we coated , lm large beads with brain-derived neurotrophic factor (bdnf) or bovine serum album (bsa) as control. we showed that a single bead was able to activate the bdnf receptor trkb, inducing its phosphorylation. moreover bdnf beads but not control beads were able to induce c-fos translocation into the nucleus [ ] , indicating that the whole pathway was activated. finally, we positioned vectors in proximity to the growth cones of cultured hippocampal neurons [ ] . control beads didn't affect the normal development of these structures while bdnf beads significantly did. these findings support the use of the ot technology for long-term, localized stimulation of specific subcellular neuronal compartments. a key role of its photoactivity, due to the singlet oxygen production, which has a very short lifetime (ns-ls, depending of hyp environment). hyp sub-cellular localization depends on its concentration in the medium, incubation time and used delivery system. variations in activity of protein kinase c, (anti-apoptotic pkca and pro-apoptotic pkcd) in correlation with the activity of bcl- protein, cytochrome c release from mitochondria and decreasing of mitochondrial membrane potential after photodynamic action were monitored. the study was performed for two different delivery modes of hyp to u- mg glioma cells-hyp alone (membrane diffusion) vs. hyp loaded in low density lipoprotein (ldl) (endocytosis). confocal fluorescence microscopy, flow-cytometry and specific fluorescence labeling were used as main experimental techniques. our results show that hyp photoaction strongly affects apoptotic response of the cells and that the dynamics of this action significantly depends on used delivery system. correlation analysis between the monitored parameters (see above) determined for both delivery system is presented and critically discussed. surface contamination by bacteria is a natural and spontaneous process occurring in any natural environment on biotic (mucosa, tissues…) and abiotic surfaces (medical equipments, food surfaces…). whatever the bacterial nature (gram-positive or -negative), the environmental fluid (air, water, blood…) and the receptor surface (implants, medical equipments, food surfaces…), the surface contamination initiated by the first adherent bacteria can evolve to a three dimensional structure named biofilm (cohesive bacteria assembly ensured by an autoproduced extracellular organic matrix). the mechanisms by which these biofilms offer protective environment to viral particles or hypertolerance to antimicrobial action are not yet elucidated. to reach a better understanding of biofilm reactivity, we reported for the first time successful applications of correlative time-resolved optical microscopy approach by time-lapse (tl), frap, fcs, flim, for real-time analysis of molecular mobility and reaction inside biofilms. by means of non-biological or biological (virus, biocides and antibiotics) reactive compounds, significant advances to understand the roles of the extracellular matrix and bacteria physiological properties were obtained, an important step to improve pathogenic biofilm inactivation. here we present a feasibility study to develop two-photon microscopy ( pm) into a standard diagnostic tool for noninvasive, skin cancer diagnosis. the goal is defining experimental parameters to maximize image quality of optical biopsies in human skin, while avoiding tissue damage. possible diagnostic indicators will be compared for healthy tissue, benign, and malignant melanocytic lesions. we report on preliminary results of a study on pm imaging of ''ex-vivo'' biopsy samples, were autofluorescence intensity and contrast between lesion and surrounding tissues were optimised varying excitation wavelength, detection band, and dispersion pre-compensation. moreover, we determined modulation functions for laser power and detector gain to compensate losses in deep tissue imaging. as the main process of photo-damage, thermo-mechanical modifications were quantified and damage threshold powers were determined. in order to image structural changes in ordered tissue like collagen fibres, second-harmonic generation signals were recorded and optimised. in-vivo two-photon imaging of the honeybee antennal we adapted a two-photon microscope for in-vivo imaging of the honeybee olfactory system, focusing on its primary centres, the antennal lobes. the setup allowed obtaining both d-tomographic measurements of the antennal lobe morphology and time-resolved in-vivo calcium imaging of its neuronal activity. the morphological data were used to precisely measure the glomerular volume in both sides of the brain, investigating the question of volumetric lateralization. functional calcium imaging allowed to record the characteristic glomerular response maps to external odour stimuli applied to the bees' antennae. compared to previous neural imaging experiments in the honeybee, this work enhanced spatial and temporal resolution, penetration depth, and it minimized photo-damage. final goal of this study is the extension of the existing functional atlases of the antennal lobe to d and into the temporal dimension by investigating the time-resolved activity pattern. the use of voltage sensitive fluorescent dyes (vsd) for noninvasive measurement of the action potential (ap) in blood perfused heart have been hindered by low interrogation depth, high absorption and auto-fluorescence of cardiac tissue. these limitations are diminished using new near infrared (nir) vsd (di- -anbdqbs). here we validated toxicity and photo-toxicity of these dyes in guinea pig and human cardiac muscle slabs. application of nir vsd showed no effect on cardiac muscle contraction force or relaxation. optical action potentials closely tracked kinetics of microelectrode recorded aps in both field and electrode stimulated preparations. for phototoxicity assessment dye ( lm) preloaded cardiac slabs were exposed to prolonged laser radiation of various power. microelectrode ap recordings show that exposure to prolonged laser radiation ( min.; mw/mm ) of dye loaded tissue had no statistically significant effect on apd or conduction velocity, thus indicating no or weak photo-toxicity on the nir vsd. in contrast, exposure to min. laser radiation of phototoxic dyes (mitotracker deep-red) preloaded tissue caused significant reduction in apd (by %) and conduction velocity ( %). thus, due to the low photo-toxicity, nir vsd are well suited for in vivo cardiac imaging. streptomycesetes are filamentous gram-positive soil bacteria well known for their complex morphological development and secondary metabolite production. during their life cycle spores germinate to form a network of hyphae, which later develops into aerial mycelium when cross-walls are generated and spores are formed. we have examined and compared the last stage of the differentiation process in a wild-type s. coelicolor (m ) and its dcabb mutant lacking a calmodulin-like calcium binding protein. the strains were grown on four kinds of media: smms, smms with % saccharose, r and r with reduced calcium in order to study the effect of environment and osmotic stress on the sporulation of the two strains to assess the function of cabb protein. from the cultures pictures were taken at hours and after days using phase contrast, atomic force and confocal laser scanning microscope and the sizes of spores were measured. our results showed that the dcabb mutant made smaller spores, its differentiation and stress response were slower. we could conclude from it that the aberrant protein slows the metabolism, the signal transduction and has effects on sporulation, septation and air-mycelium formation. based on it we can tell that the cabb has a significant role in normal development. the mobility and reaction parameters of molecules inside living cells can be conveniently measured using fluorescent probes. typically fluorescence correlation spectroscopy (fcs) based on confocal microscopy is used for such measurements. this implies high time-resolution but only for a single spot at a time. in order to achieve a high time-resolution at multiple spots, we built a single plane illumination microscope (spim) equipped with high-speed image acquisition devices and a high-na detection optics. this allows us to do parallel fcs measurements in a thin plane (width * - lm) of the sample. our setup is equipped with a fast emccd camera (full frame time resolution ls) and a pixel array of spads. the spad array has a full frame time resolution of - ls, which is even fast enough to resolve the typical motion time-scale of small molecules (like egfp) inside living cells. the performance of the system is characterized by diffusion measurements of water-soluble fluorescent beads, as well as fcs measurements in living cells. our data acquisition system uses programmable hardware for some tasks and is fast enough to allow real-time correlation of pixels, as well as saving the complete dataset for later evaluation. electron cryo-microscopy (cryo-em) covers a larger size range than any other technique in structural biology, from atomic resolution structures of membrane proteins, to large noncrystalline single molecules, entire organelles or even cells. electron crystallography of two-dimensional ( d) crystals makes it possible to examine membrane proteins in the quasi-native environment of a lipid bilayer at high to moderately high resolution ( . - Å ). recently, we have used electron crystallography to investigate functionally important conformational changes in membrane transport proteins such as the sodium/proton antiporters nhaa and nhap, or the structure of channelrhodopsin. ''single particle'' cryo-em is well suited to study the structure of large macromolecular assemblies in the . to Å resolution range. a recent example is our Å map of a mitochondrial respiratory chain supercomplex consisting of one copy of complex , two copies of complex iii, and one of complex iv. the fit of the x-ray structures to our map indicates short pathways for efficient electron shuttling between complex i and iii by ubiquinol, and between complex iii and iv by cytochrome c. electron cryo-tomography can visualize large protein complexes in their cellular context at - Å resolution, and thus bridges the gap between protein crystallography and light microscopy. cryo-et is particularly suitable for studying biological membranes and large membrane protein complexes in situ. we found that long rows of atp synthase dimers along ridges of inner membrane cristae are an ubiquitous feature of mitochondria from all species we investigated ( mammals, fungi, plant). the proton pumps of the respiratory chain appear to be confined to the flat membrane regions on either side of the ridges. this highly conserved pattern suggests a fundamental role of the mitochondrial cristae as proton traps for efficient atp synthesis. single-particle analysis: advanced fluorescence imaging, including subdiffraction microscopy, relies on fluorophores with controllable emission properties. chief among these fluorophores are the on/off photoswitchable and green-to-red photoconvertible fluorescent proteins. irisfp was recently reported as the first fluorescent protein to combine irreversible photoconversion from a green-emitting to a red-emitting state with reversible on/off photoswitching in both the green and red states. the introduction of this protein resulted in new applications such as super-resolution pulse-chase imaging, but the properties of irisfp are far from being optimal from a spectroscopic point of view and its tetrameric organization complicates its use as a fusion tag. we have demonstrated how four-state optical highlighting can be rationally introduced into photoconvertible fluorescent proteins by developing and characterizing a new set of such enhanced optical highlighters derived from meo-sfp and dendra . one of these, which we called nijifp, was identified as a promising new multi-photoactivatable fluorescent protein with optical properties that make it ideal for advanced fluorescence-based imaging applications. introducing to medicine and biology concept of optical markers in tremendous way has changed the recent status of these two important disciplines. this was mainly due to strong development in imaging techniques which recently allow us to investigate both static as well dynamic properties of living cells, their components and their interactions with external factors. recently used molecular markers including organic dyes, fluorescent proteins or chelates containing lanthanide ions have several significant limitations. one of the alternatives for molecular markers are inorganic quantum dots (ie. cdse, cds) which are recently commonly used in many academic works. however, even if they are much better from physicochemical point of view, from the application point of view at this moment they are rather useless mainly because of their high risk of toxicity. one of the solution combining advantages of both concepts is to make nontoxic inorganic nanocrystals doped by lanthanide ions. in this work, we will present optical results obtained for nayf nanocrystals doped by different lanthanide ions. the aim of this work was to design and to synthesize these markers and to understand physical processes responsible for their emission and to optimize these processes to the physical limits. intravital microscopy has fostered a full blown of publication regarding the behavior of cells in different tissues and physiological conditions. however, a few papers describe how the motility parameters can be used to understand whether an interaction is occurring and, on balance, the distinction between interacting and non interacting cells is performed visually on the image time stack. here we describe a multi-parameter approach that allows to discern among different cell behaviors on an objective ground and we demonstrate its effectiveness valuating the mutual fate of natural killer (nk) and dendritic (dc) cells at the draining lymph-nodes in inflammatory and stationary conditions. the method is time saving and allows a wide scale characterization of the lymphocyte tracks and to build up statistics of the cell-cell interaction duration. this has allowed the development of a numerical model of the nk-dc interaction, based on a molecular-stochastic dynamic approach, whose output can be directly compared to the data. hemozoin is formed during malaria infection of red blood cells, the malaria parasite cleaves hemoglobin, leaving free heme which is then toxic to the parasite. the free heme is then bio-crystalized to form hemozoin which allows the parasite to remain viable. the hemozoin is released during the breakdown of the red blood cells, is small and can be difficult to resolve spatially. since it contains an abundance of heme protein, which has a strong absorbance at nm, it can be readily detected and tracked by using resonant raman scattering spectroscopy. here we use slit-scanning confocal raman imaging to detect the hemozoin and resolve it against the background molecules. inside a red blood cell, hemoglobin is the strongest background signal since it also contains large amounts of heme. nevertheless, the discrimination is possible, and the time-resolved observation of hemozoin is an important tool to understand effects of malaria since the hemozoin can trigger the immune response and cause inflammation in tissue. muscle performance at the molecular level is determined by the elementary displacement (working stroke) produced by the motor protein myosin ii, and its dependence on load. we developed a laser trap assay (the optical leash) capable of applying controlled loads to a single myosin head before the working stroke is initiated and probing actin-myosin interaction on the microsecond time scale. we found that the working stroke size depends both on load and on the detachment pathway followed by myosin. in a first pathway, myosin detaches very rapidly from actin (\ ms) without producing any movement. in a second pathway, myosin steps and remains bound to actin for a time inversely proportional to atp concentration; the working stroke remains constant (* nm) as the load is increased, until it suddenly vanishes as the isometric force is reached ( . ± . pn). a third dissociation pathway becomes more populated as the force is increased, due to premature unbinding of myosin from actin, resulting in a working stroke that decreases with load. taken together, these results give a new insight into the molecular mechanism of load dependence of the myosin working stroke, which is a primary determinant of skeletal muscle performance and efficiency. previously we have deleted either or both of these terminal helices genetically. surprisingly, all mutants rotated in the correct direction, showing that the shaft portion is dispensable. here we inquire if the rest of the c rotor, the globular protrusion that accounts for * % of the c residues, is also dispensable. keeping the n-and c-terminal helices that constitute the shaft, we have replaced the middle * residues with a short helix-turn-helix motif borrowed from a different protein. the protrusion-less mutant thus made retained a high atpase activity and rotated fast in the correct direction. this may not be unexpected because, in crystal structures, most of the removed residues do not contact with the a b ring. combined with the previous results, however, the present results indicate that none of the c residues are needed for rotation. the rotary mechanism of a molecular engine, the vacuolar proton-atpase, working in a biomembrane csilla ferencz , pá l petrovszki , zoltá n kó ta the rotary mechanism of vacuolar proton-atpase (v-atpase) couples atp hydrolysis and trans-membrane proton translocation. we tested the effect of oscillating electric (ac) field on v-atpase activity in yeast vacuoles. the ac technique has several advantages over direct observations: it can be applied on native membranes, there are no labels and attachments involved, and the target protein is in its natural environment. this was/is the first of its kind of experiment on v-atpase, and we got strikingly different results from previous studies on other proteins: both low and high frequency ac field reduces atpase activity in a wide frequency range. a sharp resonance is seen at . hz, where the atpase activity reaches or exceeds the control (no ac) level. we think that the resonance happens at that of the degrees rotor steps, meaning that the rotation rate of the rotor is around hz, under the given conditions. synchronisation of individual atpases by slow or matching, but not fast ac is likely via a hold-and-release mechanism. we can explain the above observations by assuming that the ac field interacts with the proton movements, and if we consider the estimated geometry of the hydrophilic proton channels and the proton binding cites on the rotor. [ ] . the ttss constitutes a continuous protein transport channel of constant length through the bacterial envelope [ ] . the needle of the type three secretion system is made of a single small protein (protomer). we analyzed the assembly and the structure of the ttss needle using different biophysical methods including fourier transform-infrared spectroscopy, nmr spectroscopy and x-ray crystallography. we show that the ttss needle protomer refolds spontaneously to extend the needle from the distal end. the protomer refolds from a-helix into b-strand conformation to form the ttss needle [ ] . regulated secretion of virulence factors requires the presence of additional protein at the ttss needle tip. x-ray crystal structure analysis of the tip complex revealed major conformational changes in both the needle and the tip proteins during assembly of the s. typhimurium ttss. our structural analysis provides the first detailed insight into both the open state of the ttss needle tip and the conformational changes occurring at the pathogen-host interface [ ] . [ the membrane-bound component f o of the atp synthase works as a rotary motor and plays the central role of driving the f component to transform chemi-osmotic energy into atp synthesis. we have conducted molecular dynamics simulations of the membrane-bound f o sector with explicit lipid bilayer, in which the particular interest was to observe the onset of helix motion in the c ring upon the change of the protonation state of asp of the c subunit, which is the essential element of the boyer's binding-change mechanism. to investigate the influence of transmembrane potential and ph gradient, i.e., the proton motive force, on the structure and dynamics of the a-c complex, different electric fields have been applied along the membrane normal. correlation map analysis indicated that the correlated motions of residues in the interface of the a-c complex were significantly reduced by external electric fields. the deuterium order parameter (s cd ) profile calculated by averaging all the lipids in the f o -bound bilayer was not very different from that of the pure bilayer system, which agrees with recent h solid-state nmr experiments. however, by delineating the lipid properties according to their vicinity to f o , we found that the s cd profiles of different lipid shells are prominently different. lipids close to f o formed a more ordered structure. similarly, the lateral diffusion of lipids on the membrane surface also followed a shell-dependent behavior. the lipids in the proximity of f o exhibited very significantly reduced diffusional motion. the numerical value of s cd was anti-correlated with that of the diffusion coefficient, i.e., the more ordered lipid structures led to slower lipid diffusion. our findings will not only help for elucidating the dynamics of f o depending on the protonation states and electric fields, but may also shed some light to the interactions between the motor f o and its surrounding lipids under physiological condition, which could help to rationalize its extraordinary energy conversion efficiency. this work has been published in march , and it was selected as one of the two featured articles of that issue. the research project is directed toward the construction of a synthetic bio-inorganic machine that consists in a single actin filament that interacts with a linear array of myosin ii motors regularly disposed on a nano-structured device. the motor array is intended to simulate the unique properties of the ensemble of motor proteins in the half-sarcomere of the muscle by providing the condition for developing steady force and shortening by cyclic interactions with the actin filament. the mechanical outputs in the range . - pn force and - , nm shortening will be measured and controlled the bacterial flagellar motor is a membrane-embedded molecular machine that rotates filaments, providing a propulsive force for bacteria to swim. the molecular mechanism of torque (turning force) generation is being investigated through the study of the properties and three-dimensional structure of the motor's stator unit. we are taking both topdown and bottom-up approaches, combining data from the molecular genetics studies, cross-linking, x-ray protein crystallography and molecular dynamics simulations. we have recently determined the first crystal structure of the protein domain that anchors the proton-motive-force-generating mechanism of the flagellar motor to the cell wall, and formulated a model of how the stator attaches to peptidoglycan. the work presented at the meeting will inform the audience on our latest work that establishes the relationship between the structure, dynamics and function of a key component of the bacterial flagellar motor, the motility protein b (motb). this work will be put in the perspective of the mechanism of rotation, stator assembly, anchoring to peptidoglycan and interaction with the rotor, and discussed in the light of the elementary events composing the cycle of electrochemical-to-mechanical energy conversion that drives flagellar rotation. members of the conserved kinesin- family fulfill essential roles in mitotic spindle morphogenesis and dynamics and were thought to be slow, processive microtubule (mt)-plusend directed motors. the mechanisms that regulate kinesin- function are still not well understood. we have examined in vitro and in vivo functions of the saccharomyces cerevisiae kinesin- cin using single-molecule motility assays and single-molecule fluorescence microscopy and found that cin motility is exceptional in the kinesin- family. in vitro, individual cin motors could be switched by ionic conditions from rapid (up to lm/min) and processive minus-end, to bidirectional, to slow plus-end motion. deletion of the uniquely large insert of amino acids in loop of cin induced bias towards minus-end motility and strongly affected the directional switching of cin both in vivo and in vitro. we further found that deletion of the functionally overlapping kinesin- kip and of the spindle-organizing protein ase affected cin velocity and processivity, but directionality was not affected. the entirely unexpected finding of switching of cin directionality in vivo and in vitro demonstrates that the ''gear box'' of kinesins is much more complex and versatile than thought. many biological motor molecules move within cells using stepsizes predictable from their structures. myosin-vi, however, has much larger and more broadly-distributed stepsizes than those predicted from its short lever arms. we explain the discrepancy by monitoring qdots and gold nano-particles attached to the myosin vi motor domains using high-sensitivity nano-imaging. the large stepsizes were attributed to an extended and relatively rigid lever arm; their variability to two stepsizes, one large ( nm) and one small ( nm). these results suggest there exist two tilt-angles during myosin-vi stepping, which correspond to the pre-and post-powerstrokes states and regulate the leading head. the large steps are consistent with the previously reported hand-over-hand mechanism, while the small steps follow an inchworm-like mechanism and increase in frequency with adp. switching between these two mechanisms in a strain sensitive, adpdependent manner allows myosin-vi to fulfill its multiple cellular tasks including vesicle transport and membrane anchoring. http://www.fbs.osaka-u.ac.jp/labs/yanagida/, http:// www.qbic.riken.jp/. ferritin deposits iron in oxyhydroxide iron core surrounded by protein shell. the iron core structure may vary in different ferritins in both normal and pathological cases. to study iron core variations the mö ssbauer spectroscopy with a high velocity resolution was applied for comparative analysis of normal and leukemia chicken liver and spleen tissues, human liver ferritin and commercial pharmaceutical products imferon, maltoferÒ and ferrum lek as ferritin models. mö ssbauer spectra of these samples measured with a high velocity resolution at room temperature were fitted using two models: homogenous iron core (one quadrupole doublet) and heterogeneous iron core (several quadrupole doublets). the results of both fits demonstrated small variations of mö ssbauer hyperfine parameters related to structural variations of the iron cores. these small structural variations may be a result of different degree of crystallinity, the way of iron package, nonequivalent iron position, etc. obtained small differences for normal and leukemia tissues may be useful for distinguishing ferritins from normal and pathological cases. this work was supported in part by the russian foundation for basic research (grant # - - -a). invasion of epithelial cells by salmonella enterica is mediated by bacterial ''effector'' proteins that are delivered into the host cell by a type iii secretion system (ttss). the collaborative action of these translocated effectors modulates a variety of cellular processes leading to bacterial uptake into mammalian cells. type iii effectors require the presence in the bacterial cytosol of specific tts chaperones. effectors are known to interact with their chaperone via a chaperone binding domain (cbd) situated at their n-terminus. this work focus on sopb, an effector with phosphoinositide phosphatase activity and particularly its interaction with the specific chaperone sige by biochemical, biophysical and structural approaches. we have co-expressed sopb with its specific chaperone sige and purified the complex, determined the limits of the cbd and purified the sopb cbd /sige complex. the structure of sige has been solved previously but no crystals could be obtained for structure determination of both complexes. we used saxs experiment combined with biophysical approach to analyse the interaction between sopb and its chaperone as well as the quaternary structure on the complex that will be described in this presentation. guanylate monophosphate kinase (gmpk) is a cytosolic enzyme involved in nucleotide metabolic pathways. one of the physiological roles of gmpks is the reversible phosphoryl group transfer from atp to gmp (its specific ligand), yielding adp and gdp. the gmpk from haemophilus influenzae is a small protein, with a number of amino acids in the primary structure. in order to determine the secondary structure changes of this enzyme, as well as some physical characteristics of its complexes with gmp and atp ligands, circular dichroism (cd) and atr -ftir studies were performed. the enzyme and its ligands were dissolved in tris -hcl buffer, at ph . and °c. from the cd spectra the content of the secondary structure elements of gmpk and gmpk/gmp, gmpk/atp (with and without mg + ) were determined. the major secondary structure elements of gmpk from haemophilus influenzae were a-helix (* %) and b-sheet (* %). atr -ftir experiments show that the amide i and amide ii bands of the gmpk are typical for a protein with great a-helix content. from the second derivative spectra, the content of the secondary structure elements were estimated. these data were in agreement with those obtained by cd. assembly of the mature human immunodeficiency virus type capsid involves the oligomerization of the capsid protein, ca. the c-terminal domain of ca, ctd, participates both in the formation of ca hexamers, and in the joining of hexamers through homodimerization. intact ca and the isolated ctd are able to homodimerize in solution with similar affinity (dissociation constant in the order of lm); ctd homodimerization involves mainly an a-helical region. in this work, we show that peptides derived from the dimerization helix (which keep residues energetically important for dimerization and with higher helical propensities than the wild-type sequence) are able to self-associate with affinities similar to that of the whole ctd. moreover, the peptides have a higher helicity than that of the wild-type sequence, although is not as high as the theoretically predicted. interesting enough, the peptides bind to ctd, but binding in all peptides, but one, does not occur at the dimerization interface of ctd (helix ). rather, binding occurs at the last helical region of ctd, even for the wild-type peptide, as shown by hsqc-nmr. as a consequence, all peptides, but one, are unable to inhibit capsid assembly of the whole ca in vitro. the peptide whose binding occurs at the ctd dimerization helix has an val?arg mutation in position , which has been involved in dimer-dimer contacts. these findings suggest that event keeping the energetically important residues to attain ctd dimerization within a more largely populated helical structure is not enough to hamper dimerization of ctd. putp is an integral membrane protein located in the cytoplasmic membrane of escherichia coli, being responsible for the coupled transport of na + and proline. it belongs to the family of sodium solute symporters (sss). structural data for putp is not available, but secondary structure predictions together with biochemical and biophysical analyses suggest a transmembrane motif. from a recent homology model based on the x-ray structure of the related na + /galactose symporter vsglt, previously published electron paramagnetic resonance (epr) studies, and recent crystallographic and epr studies on the cognate bacterial homolog of a neurotransmitter:na + symporter, leut, it has been proposed that helices viii and ix as well as the interconnecting ''loop '' region determine the accessibility of the periplasmic cavities which bind sodium and proline. we performed site-directed spin labeling of ''loop '' in combination with epr spectroscopy to investigate the structural features of this region and possible conformational changes induced by sodium and proline. analyses of spin label mobility and polarity as well as accessibility to paramagnetic quenchers allow us to refine this region in the present homology model. furthermore, our data suggest conformational changes in this region upon substrate binding including an overall motion of a helical segment. fatty acid-binding proteins (fabp) are a family of low molecular weight proteins that share structural homology and the ability to bind fatty acids. the common structural feature is a b-barrel of antiparallel b-strands forming a large inner cavity that accommodates nonpolar ligands, capped by a portal region, comprising two short a-helices. b-fabp exhibits high affinity for the docosahexaenoic (dha) and oleic acid (oa). it is also postulated that b-fabp may interact with nuclear receptors from ppar family. in the present work, we used molecular biology and spectroscopic techniques to correlate structure, dynamics and function. site-directed mutagenesis was used to produce mutants of b-fabp with a nitroxide spin probe (mtsl) selectively attached to residues located at the portal region. esr spectra of the labeled b-fabp mutants were sensitive to the location of the mutation and were able to monitor interactions in three cases: ( ) shsp are ubiquitous proteins involved in cellular resistance to various stress (oxidative, heat, osmotic…). they are able to prevent aggregation of non-native proteins through the ability of forming large soluble complexes and preventing their nonspecific and insoluble aggregation. in consequence to this molecular chaperone function, they can regulate many processes (resistance to chemotherapy, modulation of the cellular adhesion and invasion, inflammatory response in skin), and the modulation of their expression has been found to be a molecular marker in cancers, spermatogenesis, or cartilage degeneration. furthermore, they are involved in several pathologies: myopathies, neuropathies, cancers, cataracts. among the human members (hspb - ), this study focused on hspb (hsp involved in some cancers), hspb (lens specific), hspb (lens, muscle, heart, lung) and hspb -r g (responsible for a desmin-related myopathy and a cataract). as shsp form large, soluble (but polydisperse in mammals) hetero-oligomers, molecular biology, biochemistry, biophysics and bioinformatics were successfully combined to compare the functional/dysfunctional assemblies in order to understand the critical parameters between shsp members depending upon their tissue and cellular localization. ionizing radiation is a type of radiation that contains enough energy to displace electrons and break chemical bonds. this can promote the removal of at least one electron from an atom or molecule, creating radical species, namely, reactive oxygen species (ros) [ , ] . these are often associated with damages at cellular level, such as, dna mutations, cell cycle modifications and, in animal cells, cancer. to overcome this problem, organisms developed different protection/repair mechanisms that enable them to survive to these threats. dna glycosylases are enzymes that are part of base excision repair (ber) system, mainly responsible for dna repair. they can recognize a dna lesion and, in some cases, are able to remove the mutated base. here we propose to study one of those enzymes, endonuclease iii, which contains a [ fe- s] cluster [ , ] . samples were exposed to different doses of uv-c radiation and the effects were studied by electrophoretic and spectroscopic methods. [ na,k-atpase is an integral protein present in the plasma membrane of animal cells, and consists of two main subunits: the a and b. cholesterol is an essential constituent of the animal membrane cells. in order to study the interaction between na,k-atpase and cholesterol, we have used the dsc technique, and a proteoliposome system composed by the enzyme and dppc:dppe, with different percentage in mol of cholesterol. the heat capacity of purified na,k-atpase profile exhibits three transitions with , and kcal/mol at , and °c. multiple components in the unfolding transition could be attributed either to different steps in the pathway or to independent unfolding of different domains. this denaturation of na,k-atpase is an irreversible process. for the proteoliposome, we also observed three peaks, with , and kcal/mol and , and °c. this increase in dh indicates that the lipids stabilize the protein. when cholesterol was used (from to mol %), the first transition was shifted to a lower temperature value around °c. these results confirm that cholesterol has an influence on the packing and fluidity of lipid bilayer and changes in lipid microenvironment alter the thermostability as well as the activity of na,k-atpase. financial support: fapesp. we have undertaken to study the structure and function of peroxisomal multifunctional enzyme type (mfe- ) from different organisms. mfe- is a key enzyme in long-and branched-chain fatty acids breakdown in peroxisomes. it contains two enzymes in the same polypeptide and also consists of differing amount of domains depending on the species. crystal structure and enzyme kinetics of drosophila melanogaster mfe- has revealed the domain assembly and raised a question about existence of a substrate channeling mechanism. small-angle x-ray scattering studies have further resolved the assembly of domains in the human mfe- . mutations in the mfe- -coding gene in humans may cause d-bifunctional protein deficiency -a metabolic disease characterized by accumulation of fatty acyl-coa intermediates due to inactive or residually active mfe- protein. we have also studied the structure, stability and dynamics of such mutant proteins both experimentally and in silico. the latest results on all these studies will be presented. ftsz is a protein that plays a key role in bacterial division, forming a protein ring directly related to the constriction of the membrane. this process has been observed to occur without the help of molecular motors. nonetheless, the details of the self-assembly and subsequent force generation of the septal ring are still obscure. afm observations allows the study of the behaviour of ftsz solutions on a substrate with unprecedented resolution, permitting the identification of individual protein filaments. the different resulting structures can be compared to monte carlo models for a d lattice accounting for the essential interactions between monomers. these include a strong longitudinal bond that allows a limited flexibility (i.e., curvature of the filaments) and a weaker lateral interaction. the work we present follows this approach, focussing on the latest experiments with ftsz mutants. by using these mutants, it is possible to choose the specific region of the monomer that will anchor to the substrate, thus generating new structures that provide an insight into monomer-monomer interactions. in this way, we explore the anisotropy of the lateral bond in ftsz, a factor that has not been taken into account before but may prove to be of importance in fstz behaviour in vivo. modeling protein structures and their complexes with limited experimental data dominik gront university of warsaw, pasteura , - warsaw, poland conventional methods for protein structure determination require collecting huge amounts of high-quality experimental data. in many cases the data (possibly fragmentary and/or ambiguous) on itself cannot discriminate between alternative conformations and a unique structure cannot be determined. small angle xray scattering is an example of such a ''weak'' experiment. the spectrum encodes only several independent degrees of freedom that provide a global description of a molecular geometry in a very synthetic way. in this contribution we utilized both local information obtained from nmr measurements and global description of a macromolecule as given by saxs profile combined with a knowledge-based bimolecular force field to determine tertiary and quaternary structure of model protein systems. saxs curve as well as various kinds of local nmr data such as isotropic chemical shifts and their tensors, j-couplings, rdc, backbone noe and redor from nmr in solid phase are parsed with the ''experimental'' module of bioshell toolkit and utilized by rosetta modeling suite to generate plausible conformations. obtained results show the new protocol is capable to deliver very accurate models. noenet-use of noe networks for nmr resonance assignment of proteins with known d structure dirk stratmann, carine van heijenoort and eric guittet structural genomics programs today yield an increasing number of protein structures, obtained by x-ray diffraction, whose functions remain to be elucidated. nmr plays here a crucial role through its ability to readily identify binding sites in their complexes or to map dynamic features on the structure. an important nmr limiting step is the often fastidious assignment of the nmr spectra. for proteins whose d structures are already known, the matching of experimental and back-calculated data allows a straightforward assignment of the nmr spectra. we developed noenet, a structure-based assignment approach. it is based on a complete search algorithm, robust against assignment errors, even for sparse input data. it allows functional studies, like modeling of protein-complexes or protein dynamics studies for proteins as large as kd. almost any type of additional restraints can be used as filters to speed up the procedure or restrict the assignment ensemble. noenet being mainly based on nmr data (noes) orthogonal to those used in triple resonance experiments (jcouplings), its combination even with a low number of ambiguous j-coupling based sequential connectivities yields a high precision assignment ensemble. we observed, that t. thermophilus isopropylmalate dehydrogenase (ipmdh) has higher rigidity and lower enzyme activity at room temperature than its mesophilic counterpart (e. coli), while the enzymes have nearly identical flexibilities under their respective physiological working conditions, suggesting that evolutionary adaptation tends to maintain optimum activity by adjusting a ''corresponding state'' regarding conformational flexibility. in order to reveal the nature of the conformational flexibility change related to enzymatic activity, we designed a series of mutations involving non conserved prolines specific to thermophilic ipmdh. proline to glycine mutations substantially increased conformational flexibility and decreased conformational stability. the mutant enzyme variants did not show enhanced catalytic activity, but the non arrhenius temperature dependence of enzyme activity of the wild type was abolished. this phenomenon underlines the fact that the delicate balance between flexibility, stability and activity which is required for the environmental adaptation of enzymes can be easily disrupted with mutations even distant from the active site, providing further evidence that optimization of proper functional motions are also a selective force in the evolution of enzymes. the kinetoplastids trypanosoma brucei, t. cruzi and leishmania major are responsible for causing great morbidity and mortality in developing countries. the all a-helical dimeric dutpases from these organisms represent promising drug targets due to their essential nature and markedly different structural and biochemical properties compared to the trimeric human enzyme. to aid in the development of dutpase inhibitors we have been structurally characterizing the enzymes from these species. here we present the structure of the t. brucei enzyme in open and closed conformations, completing the view of the enzymes from the kinetoplastids. furthermore, we sought to probe the reaction mechanism for this family of enzymes as a mechanism has been proposed based on previous structural work but has not received any further verification. the proposed scheme is similar to that of the trimeric enzyme but differs in detail. using tryptophan fluorescence quenching in the presence of the transition state mimic alf we have been able to identify which species is the likely transition state in the reaction. the crystal structure of t. brucei in complex with this transition state analogue confirms the nature of the nucleophilic attack clearly showing how it differs from trimeric enzymes. the structure of factor h-c d complex explains regulation of immune complement alternative pathway circular dichroism (cd) spectroscopy is a widely used technique for studying the secondary structure (ss) of proteins. numerous algorithms have been developed for the estimation of the ss composition from the cd spectra. although, these methods give more or less accurate estimation for proteins rich in a-helical structure, they often fail to provide acceptable results on mixed or b-rich proteins. the problem arises from the diversity of b-structures, which is thought to be an intrinsic limitation of the technique. the worst predictions are provided for proteins of unusual b-structures and for amyloid fibrils. our aim was to develop a new algorithm for the more accurate estimation of ss contents for a broader range of protein folds with special interest to amyloid fibrils. using synchrotron radiation cd (srcd), we were able to collect high quality spectra of amyloid fibrils with good s/n ratios down to nm. the novel reference dataset with spectra that significantly differ from present reference sets, extends the information content for ss determination. our algorithm takes into account the diverse twist of the b-sheets that has a great influence on the spectral features. for the first time, we can reliably distinguish parallel and antiparallel b-structure using cd spectroscopy. monitoring the assembly of the membrane protein insertase alexej kedrov , marko sustarsic , arnold j.m. driessen groningen biomolecular sciences and bioengineering institute, university of groningen, the netherlands, university of oxford, uk molecular forces that govern membrane protein integration and folding remain a major question in current molecular biology and biophysics. each nascent polypeptide chain should acquire its unique three-dimensional folded state within a complex environment formed by the anisotropic lipid membrane and the membrane-water interface. secyeg translocase and members of a recently described yidc/oxa /alb chaperone family are recognized as primary players in the membrane protein genesis. these proteins, so called insertases, serve as membraneembedded molecular pores where the newly synthesized protein is loaded prior its release into the bilayer. here we apply fluorescence correlation spectroscopy to monitor the assembly of the insertase:ribosome:nascent polypeptide chain complexes in solution and reconstituted into nanodiscs and model membranes. results provide insights on molecular mechanisms and dynamics of the insertase functioning. conformational changes during gtpase activity induced self-assembly of human guanylate binding protein revealed by epr spectroscopy correct assembly and regulation of multi-component molecular machines is essential for many biological activities. the type iii secretion system (t ss) is a complex molecular machine that is a key virulence determinant for important gram-negative pathogens including shigella, yersinia and salmonella species [ ] [ ] . the t ss consists of multiple copies of * different proteins (totalling * mda), spans both bacterial membranes and drives insertion of a contiguous pore into the host-cell membrane. virulence factors are secreted via this apparatus directly into the host cell. in all t ss various levels of regulation occur with switching between secretion off and on states overlaid on control of which substrates are secreted. genes involved in a variety of these switches have been identified but the molecular mechanisms underlying these functions is poorly understood. we are studying the t ss of shigella flexneri, the causative agent of dysentery and will present the structure of the so-called ''gatekeeper protein'' mxia. the diacylglycerolacyltransferase (dgat ) is an integral protein from the reticulum endoplasmic membrane that plays an essential role in triacylglyceride synthesis. in cattle, this enzyme is associated to the fat content regulation on milk and meat. in this study, synthetic peptides corresponding to both dgat binding sites (sit and sit ) were designed, purified and employed to investigate the enzyme interaction with substrates and membrane models. different binding specificities in the interaction with phospholipid vesicles and micelles were noted. sit showed to bind more strongly in nonpolar membrane models, while sit was electrostatically attracted to negative phospholipid surfaces. the binding of both peptides was followed by significant conformational changes (like unordered to helix transition) in circular dichroism spectra and a nm blue shift in fluorescence emission. the binding of sit and sit peptides to negative liposome gave dissociation constants (k d ) of and . lm, respectively, and a leakage action -fold higher to sit . the difference in specificity is related to the features of the putative substrates (acyl-coas and diacylglycerol) and can be attributed to the distinct role of each dgat binding site during lipid synthesis. supported by fapesp. ftsz, the bacterial homologue of tubulin, assembles into polymers in the bacterial division ring. the interfaces between monomers include a gtp molecule, although the relationship between polymerization and gtpase activity is still controversial. a set of short ftsz polymers were modelled and the formation of active gtpase structures was monitored using molecular dynamics. only the interfaces nearest the polymer ends exhibited geometry adequate for gtp hydrolysis. conversion of a mid-polymer interface to a close-to-end interface resulted in its spontaneous rearrangement from an inactive to an active conformation. fluorescent proteins (fps) have become extremely valuable tools in the life sciences. due to the latest advances in the light microscopy, there is a steady need for fps with improved spectral properties. mirisfp is a monomeric fp that can be switched reversibly between a bright green fluorescent and a dark state by illumination with light of specific wavelengths [ ] . structurally, this photo-switching is based on a cis-trans isomerization of the chromophore. upon illumination with violet light, mirisfp can be irreversibly photoconverted from the green-emitting to a red-emitting form. the red form can again be switched reversibly between a fluorescent and a dark state. to elucidate the mechanistic details of the photoinduced reactions, we have generated mirisgfp . this variant can still undergo reversible photoswitching, but lacks the ability to photoconvert to the red state so that the photoinduced transitions of the green form can be studied without 'artifacts' due to green-to-red photoconversion. using uv/visible spectroscopy, we have characterized the on-and off-switching processes in great detail. several light-activated reaction pathways have been identified. they are highly intertwined so that the net effect achieved with light of a particular wavelength depends on the relative probabilities to photoinduce the various processes. phosducin (pd) is a g t bc-binding protein that is highly expressed in photoreceptors. pd is phosphorylated in dark-adapted retina and is dephosphorylated in response to light. dephosphorylated pd binds g t protein bc-heterodimer with high affinity and inhibits its interaction with g t a or other effectors, whereas phosphorylated pd does not. therefore pd down-regulates the light response in photoreceptors. phosphorylation of pd at s and s leads to the binding of the - - protein. the - - proteins function as scaffolds modulating the activity of their binding partners and their role in pd regulation is still unclear. the - - protein binding may serve to sequester phosphorylated pd from g t bc or to decrease the rate of pd dephosphorylation and degradation. we performed several biophysical studies of the - - :pd complex. analytical ultracentrifugation was used to determine the complex stoichiometry and dissociation constant. conformational changes of pd induced both by the phosphorylation itself and by - - binding were studied using the time-resolved fluorescence spectroscopy techniques. mö ssbauer spectroscopy with a high velocity resolution was used for comparative study of various oxyhemoglobins for analysis of the heme iron electronic structure and protein structure-function relationship. samples of pig, rabbit and normal human oxyhemoglobins and oxyhemoglobins from patients with chronic myeloleukemia and multiple myeloma were measured using mö ssbauer spectrometric system with a high velocity resolution at k. mö ssbauer spectra were fitted using two models: one quadrupole doublet (model of equivalent iron electronic structure in a-and b-subunits of hemoglobins) and superposition of two quadrupole doublets (model of non-equivalent iron electronic structure in a-and b-subunits of hemoglobins). in both models small variations of mö ssbauer hyperfine parameters (quadrupole splitting and isomer shift) were observed for normal human, rabbit and pig oxyhemoglobins and related to different heme iron stereochemistry and oxygen affinity. small variations of mö ssbauer hyperfine parameters for oxyhemoglobins from patients were related to possible variations in the heme iron stereochemistry and function. the different types of silk produced by orb-weaving spiders display various mechanical properties to fulfill diverse functions. for example, the dragline silk produced by the major ampulate glands exhibits high toughness that comes from a good trade-off between stiffness and extensibility. on the other hand, flagelliform silk of the capture spirals of the web is highly elastic due to the presence of proline and glycine residues. these properties are completely dictated by the structural organization of the fiber (crystallinity, degree of molecular orientation, secondary structure, microstructure), which in turn results from the protein primary structure and the mechanism of spinning. although the spinning process of dragline silk begins to be understood, the molecular events occurring in the secretory glands and leading to the formation of other silk fibers are unknown, mainly due to a lack of information regarding their initial and final structures. taking advantage of the efficiency of raman spectromicroscopy to investigate micrometer-sized biological samples, we have determined the conformation of proteins in the complete set of glands of the orb-weaving spider nephila clavipes as well as in the fibers that are spun from these glands. domain of rgs at . Å resolution was solved. the stoichiometry of - - f /rgs protein complex was elucidated using the analytical ultracentrifugation. to map the interaction between - - f and rgs protein we performed a wide range of biophysical measurements: h/d exchange and cross link experiments coupled to mass spectrometry, fret (fö rster resonance energy transfer) time-resolved fluorescence experiments, time-resolved tryptophan fluorescence spectroscopy and saxs (small angle x-ray scattering) measurements. based on all these results we build d model of - - f /rgs protein complex. our model revealed new details on architecture of complex formed by - - proteins. to date all known structure of - - proteins complexes suggests that the ligand is docked in the central channel of - - protein. our results indicate that the rgs domain of rgs protein is located outside the central channel of - - f protein interacting with less-conserved residues of - - f. the receptor for advanced glycation end-products (rage) is a multiligand cell surface receptor involved in various human diseases. the major alternative splice product of rage comprises its extracellular region that occurs as a soluble protein (srage). although the structures of srage domains were available, their assembly into the functional full length protein remained unknown. here we employed synchrotron small-angle x-ray scattering to characterize the solution structure of human srage. the protein revealed concentration-dependent oligomerization behaviour, which was also mediated by the presence of ca + ions. rigid body models of monomeric and dimeric srage were obtained from the scattering data recorded at different solvent conditions. the monomer displays a j-like shape while the dimer is formed through the association of the two n-terminal domains and has an elongated structure. the results provided insight into the assembly of i) the heterodimer srage:rage, which is responsible for blockage of the receptor signalling, and ii) rage homodimer, which is necessary for signal transduction, paving the way for the design of therapeutical strategies for a large number of different pathologies. clpb is a hexameric aaa? atpase that extracts unfolded polypeptides from aggregates by threading them through its central pore. the contribution of coiled-coil m domains is fundamental for the functional mechanism of this chaperone, and its location within the protein structure in previous structural models is contradictory. we present cryo-electron microscopy structural analysis of clpb from e. coli in several nucleotide states. the study reveals a novel architecture for clpb and shows that m domains form an internal scaffold located in the central chamber of clpb hexamers. this inner structure transmits local signals due to atp binding and hydrolysis by aaa? domains. surprisingly, coiled-coil m domains are seen to bend significantly around a hinge region that separates two structural motifs. our results present a new framework to understand clpb-mediated protein disaggregation. streptomyces clavuligerus isoenzymes involved in clavulanic acid biosynthesis: a structural approach clavulanic acid (ca) is a potent b-lactamase inhibitor produced by streptomyces clavuligerus. n -( -carboxyethyl) arginine synthase (ceas) and proclavaminate amidino hydrolase (pah) catalyze the initial steps in the biosynthesis of ca. recently ceas and pah genes (paralogous of ceas and pah) were related to the ca biosynthesis but their products have not been studied yet. here we present the initial structural analysis of ceas and pah using biophysical techniques. pah and ceas genes were isolated from the genomic dna of s. clavuligerus and overexpressed in e. coli. the recombinant proteins were purified by affinity chromatography and analyzed by size exclusion chromatography, non-denaturing page, dynamic light scattering, far-uv circular dichroism (cd) and fluorescence spectroscopy. our results showed that pah and ceas were obtained as hexamer and dimer respectively. both proteins showed an a/b folding, being stable up to °c. above this temperature protein unfolding was observed but the complete unfolding was not observed, even at °c. moreover ceas and pah showed to be stable over a wide ph range (ph . - . ). we are currently working on improving ceas crystals which are a promising step towards the elucidation of the ceas structure. supported by fapesp. • synchrotron radiation circular dichroism • mass spectrometry following vuv photoionisation • fluorescence imaging with lifetime and spectral measurements here we will present the srcd experiment. high photon flux of photons / sec, improved detector performances as well as user-orientated software developments have proven to be the garants for successful data collections,which considerably increased the information content obtained. the exploration into the charge transfer region of the peptide bonds is adding specifically new insights. low sample volumes of as little as ll per spectra as well as convenient sample chamber handling allow for economic and efficient data collections. typical spectra acquisition from to nm, last for min for three scans with a nm step size. prior to high resolution based techniques, srcd spectrawill answer questions about folding states of macromolecules including dna, rna and sugar macromolecules as well as their complexes with proteins and specially membrane proteins. sporulation in bacillus subtilis begins with an asymmetric cell division producing a smaller cell called the forespore, which initially lies side-by-side with the larger mother cell. in a phagocytosis-like event, the mother cell engulfs the forespore so that the latter is internalised as a cell-within-a-cell. engulfment involves the migration of the mother cell membrane around the forespore until the leading edges of this engulfing membrane meet and fuse. this releases the forespore, now surrounded by a double membrane, into the mother cell cytoplasm. membrane migration during engulfment is facilitated by the interacting proteins spoiiq and spoiiiah that are membrane-associated and expressed in the forespore and the mother cell respectively . they interact in the intercellular space and function initially as a molecular zipper and later they participate in a more elaborate complex in which spoiiq and spoiiiah are integral components of an intercellular channel. this channel is a topic of much current interest, having initially been proposed as a conduit for the passage from the mother cell to the forespore of a specific, but putative, regulator of the rna polymerase sigma factor, r g and later as a gap junction-like feeding tube through which the mother cell supplies molecules for the biosynthetic needs of the forespore. here we present data on the structure and interactions of spoiiq and spoiiiah gleaned from biophysical methods and protein crystallography. these data lead to a plausible model for the intercellular channel. the glycine receptor (glyr) is a chloride permeable ligand gated ion channel and that can mediate synaptic inhibition. due to a possible involvement in the pathophysiology of temporal lobe epilepsy, the different properties of glyrs containing alpha l and k subunit isoforms are currently investigated. previous characterizations of homomeric receptors consisting of these isoforms have shown a difference in their electrophysiological properties and their membrane distribution as observed by diffraction-limited fluorescence microscopy. we studied these isoforms, when separately expressed in transfected hek t cells, by using single molecule tracking (smt) in living cells and direct stochastic optical reconstruction microscopy (dstorm) on fixated cells. for both techniques the glyrs are stained using a primary antibody directly labeled with alexa fluor dyes. the dstorm experiments support the observation that alpha l glyr are clustered, while the alpha k glyrs are more uniformly spread. the analysis of the short range diffusion coefficients obtained by smt reveals the presence of heterogeneous motion for both isoforms. the k-isoform has a higher fraction of fast diffusion. in contrast, the l-isoform is more associated with slow diffusion and appears to undergo hindered diffusion. since nanoparticles are suitable for tumor therapy due to their passive targeting to cancer cells by enhanced permeability and retention effect [ ] , it is important to understand mechanisms of their delivery into the living cancer cells. in this respect we have developed a modular spectral imaging system based on a white light spinning disk confocal fluorescence microscope and a narrow tunable emission filter. firstly, interaction of polymer nanoparticles and cells labeled with spectrally overlapping probes was examined. the use of fluorescence microspectroscopy (fms) allowed co-localization, which showed that the size of polymer nanoparticles strongly influences their transfer across the cell plasma membrane. next, the delivery of liposomes (composed of cancerostatic alkylphospholipid (opp) and cholesterol) labeled with environment-sensitive fluorescent probe was monitored. we were able to detect a very small shift in emission spectra of cholesterol-poor opp liposomes inside and outside the cells, which would not be possible without the use of fms. this shift implies that the delivery of these liposomes into cancer cells is based on fusion with the cell membrane [ ] . [ high-resolution optical imaging techniques make now accessible the detection of nanofeatures in bio-and soft-matter by non-ionizing visible radiation. however, high-resolution imaging is critically dependent by the fluorescent probes used for reporting on the nano-environment. on account of our long-standing interest in the development of fluorescent probes, we set out to design and engineer new fluorescent systems for nanoscale imaging and sensing of biological specimens and soft-matter. these fluorophores report on fast subtle changes of their nanoscale environment at excited state and are meant to fulfill these requirements: a) optical responses (intensity, wavelength-shift, lifetime, anisotropy) predictably related to the environmental polarity, viscosity, macromolecular structure, b) high brightness allowing for single-molecule detection, c) easily conjugable to biomolecules or macromolecules of interest. notably, we aim at conjugating these properties with the capability of nanoscopy imaging based on stimulated emission depletion or stochastic reconstruction optical microscopy. in this lecture the main features and applications of the engineered probes will be reviewed and future developments in this exciting field will be discussed. foamy virus (fv) is an atypical retrovirus which shares similarities with hiv and hepatitis b viruses. despite numerous biochemical studies, its entry pathway remains unclear, namely membrane fusion or endocytosis. to tackle this issue, dual color fluorescent viruses were engineered with a gfp labeled capsid and a mcherry labeled envelope. using high resolution d imaging and d single virus tracing, we followed the entry of the fluorescent viruses in living cells with a precision of nm in the plane and nm along the optical axis. to distinguish between the two possible pathways, we developed a novel colocalization analysis method for determining the moment along every single trace where the colors separate, i.e. the fusion event. the combination of this dynamical colocalization information with the instantaneous velocity of the particle and its position within the reconstructed d cell shape allows us to determine whether the separation of capsid and envelope happens at the cell membrane or from endosomes. we then compared two types of fv and demonstrated, consistently with previous ph-dependency studies, that the prototype fv can enter the cell by endocytosis and membrane fusion, whereas the simian fv was only observed to fuse after endocytosis. phosphatidylinositol , -bisphosphate (pi( , )p ) is a minor component of the plasma membrane known to a critical agent in the regulation of synaptic transmission. clustering of pi( , )p in synaptic active zones is important for synaptic transmission. however, pi( , )p does not spontaneously segregate in fluid lipid membranes and another mechanism must be responsible for the lateral segregation of this lipid in active zones. clustering of pi( , )p is expected to be associated with lipid-protein interactions and possibly partition towards lipid rafts in the plasma membrane. here we analyze the influence of protein palmitoylation on the formation of pi( , )p clusters and on synaptic protein-pi( , )p interaction by means of fö rster resonance energy transfer measurements by fluorescence lifetime imaging (fret-flim) and fret confocal microscopy. . during sporulation an entire chromosome is transferred into the forespore. this process starts by the formation of an asymmetrically located division septum that leads to the formation of two unequally sized compartments: a large mothercell and a smaller forespore. the septum traps about % of the chromosome to be transferred into the forespore. the remaining (* mbp) are then translocated from the mother cell into the forespore by an active mechanism involving the spo-iiie dna translocase. the mechanisms of translocation, particularly the control of the directionality, still remains unknown and various models have been proposed so far. since each model predicts very different distribution of spoiiie proteins at the sporulation septum, we used palm microscopy (photoactivated localization microscopy) to investigate proteins localization in live-sporulating bacteria. using this technique, we showed that spoiiie proteins are forming a single tight focus at the septum with a characteristic size of around nm. more surprising, the focus is usually localized in the mother cell compartment and the mean distance between the spoiiie focus and the septum is nm. our data suggest that during the translocation process, spoiiie proteins are only forming stable complexes on the mother cell side, allowing then for a control of the chromosome translocation from the mother cell to the forespore. morphogenetic gradients determine cell identity in a concentration-dependent manner and do so in a way that is both incredibly precise and remarkably robust. in order to understand how they achieve this feat, one needs to establish the sequence of molecular mechanisms starting with morphogen gradient formation and leading to the expression of downstream target genes. in fruit flies, the transcription factor bicoid (bcd) is a crucial morphogen that forms an exponential concentration gradient along the embryo ap axis and turns on cascades of target genes in distinct anterior domains. we measured bcd-egfp mobility in live d. melanogaster embryos using fluorescence correlation spectroscopy and fluorescence recovery after photobleaching. we found that bcd-egfp molecules had a diffusion coefficient on the order of * lm /s during nuclear cycles - , both in the cytoplasm and in nuclei. this value is large enough to explain the stable establishment of the bcd gradient simply by diffusion. on the other hand, in the context of the extremely precise orchestration of the transcription of the hunchback bcd target gene, it is too slow to explain how a precise reading of bicoid concentration could be achieved at each interphase without the existence of a memorization process. single molecule studies of key processes during the initiation of innate and adaptive immune response the two pillars of the vertebrate immune system are the innate and adaptive immune response, which confer resistance to pathogens and play a role in numerous diseases. here we exploit single molecule fluorescence imaging on live cells to study the key molecular processes that underpin these responses. the first project looks at the changes in the organisation of toll-like receptor (tlr ) on the cell surface of macrophages upon activation via lipopolysaccharide (lps), as it is currently not known whether a higher level of tlr organisation is required for the signalling process. macrophages natively express tlr at a low level which allows for oligomerisation to be analysed in live cells by dynamic single molecule colocalisation (dysco) using data obtained by totalinternal reflection fluorescence (tirf) microscopy. the experiments of the second project aim at determining the critical initial events in t-cell triggering by labelling key proteins like the tcr receptor and cd on the surface of live t-cells and following how their spatial distribution changes following the binding of the t-cell to a surface. this enables us to distinguish between the different models of t-cell triggering which are based on aggregation, segregation or a conformational change of the tcr. the study of cells using scanning force microscopy the motility of unicellular parasites in mammalians seems very interesting, yet very complex. in a world, were inertia cannot be used for propulsion, in a world at low reynolds numbers, most of our everyday strategies of self-propulsion do not work. one class of parasites that know their way around, the flagellate trypansome, manage not only to survive in the blood stream, which is a lot faster than its own propulsion velocity and where the trypanosome is constantly attacked by its host's immune response, but also to penetrate the bloodbrain-barrier, which actually should be to tight to enter. even though trypanosomes are known for more than years, their motility behaviour is not completely elucidated yet. now, using high-speed darkfield-microscopy in combination with optical tweezers in microfluidic devices and analyzing the recorded data, new light has been shed on the motility of these parasites. astonishing results show that trypanosomes are very well adapted to their hosts environment, they even can abuse red blood cells for their self-propulsion and use the bloodstream itself to drag antibodies bound to their surface to their cell mouth, where the antibodies are endocytosed and digested. the first part of the presentation will discuss nanoparticle (quantum dot, qd) biosensors and nanoactuators that exploit novel and unusual fret phenomena in the induction/detection of protein aggregation [ ] , reversible on-off qd photoswitching [ ] , and ph sensing [ ] . the second part of the presentation will feature the application of an integrated chemical biological fret approach for the in situ (in/on living cells) detection of conformational changes in the ectodomain of a receptor tyrosine kinase (the receptor for the growth factor egf) induced by ligand binding [ ] . the measurements were conducted with a two-photon scanning microscope equipped with tcspc detection; novel methods for lifetime analysis ad interpretation were employed to confirm the concerted domain rearrangements predicted from x-ray crystallography. the study of protein-protein interactions in vivo is often hindered by the limited acquisition speed of typical instrumentation used, for instance, for lifetime imaging microscopy. anisotropy polarization is altered by the occurrence of foerster resonance energy transfer (fret) and anisotropy imaging was shown to be comparatively fast and simple to implement. here, we present the adaptation of a spinning disc confocal microscope for fluorescence anisotropy imaging that allowed to achieve in vivo imaging at high spatial and temporal resolution. we demonstrate the capabilities of this system and in-house developed analysis software by imaging living caenorhabditis elegans expressing constitutive dimeric and monomeric proteins that are tagged with gfp. measuring intracellular viscosity: from molecular rotors to photodynamic therapy of cancer marina k. kuimova department of chemistry, imperial college london, exhibition road, sw az, uk viscosity is one of the main factors which influence diffusion in condensed media and is crucial for cell function. however, mapping viscosity on a single-cell scale is a challenge. we have imaged viscosity inside live cells using fluorescent probes, called molecular rotors, in which the speed of rotation about a sterically hindered bond is viscosity-dependent [ ] [ ] [ ] . this approach enabled us to demonstrate that viscosity distribution in a cell is highly heterogeneous and that the local microviscosity in hydrophobic cell domains can be up to higher than that of water. we demonstrated that the intracellular viscosity increases dramatically during light activated cancer treatment, called photodynamic therapy (pdt) [ ] . we also demonstrated that the ability of a fluorophore to induce apoptosis in cells during pdt [ ] , or to act as a benign molecular rotor, measuring viscosity, can be controlled by carefully selecting the excitation wavelength in viscous medium [ ] . in the field of biophysics and nanomedicine, the cellular reaction and the kinetics of gene expression after transfection of live cells with plasmid dna or gene-silencing sirna is of great interest. in a previous study on the transfection kinetics of non-viral gene-transfer [ ] we realised that the development of single-cell arrays would be a great step towards easy-to-analyse, high-throughput transfection studies. the regular arrangement of single cells would overcome the limitations in image-analysis that arise from whole populations of cells. in addition to that, the analysis of expression kinetics at the single-cell can help to identify the cell-to-cell variability within a cell population. in order to develop suitable single-cell arrays, we are currently adjusting the different parameters of such a microenvironment (e.g. size, shape, surface-functionalisation) in order to end up with a defined surrounding for single-cell transfection studies. in addition to that, we try to find the optimal uptake pathway for each of the different applications. the neurodegenerative disorder alzheimer's disease (ad) causes cognitive impairment such as loss of episodic memory with ultimately fatal consequences. accumulation and aggregation of two proteins in the brain -amyloid beta and tau -is a characteristic feature. these soluble proteins aggregate during the course of the disease and assemble into amyloid-like filaments. recently it was found that the toxicity of soluble amyloid beta oligomers must also be taken into account for the pathogenesis of cognitive failure in ad. if oligomers are the predominant toxic species it would be pertinent to determine how they disrupt and impair neuronal function. the prion protein (prp) receptor has been proposed to mediate amyloid beta binding to neuronal cells. we have characterised the interaction of the amyloid beta and the prp receptor expressed on hippocampal and neuroblastoma cells at the single-molecule level. we do not detect any colocalisation of either the or amino acids variants with the prp receptor. bacterial biofilms are of the utmost importance in the study of environmental bioremediation and the design of materials for medical applications. the understanding of the mechanisms that govern cell adhesion must be analysed from the physics point of view in order to obtain quantitative descriptors. the genus rhodococcus is widely spread in natural environments. the species are metabolically diverse and thus they can degrade a wide range of pollutants. due to their high hydrophobicity, these cells are very resistant to harsh conditions, are able to degrade hydrophobic substances (e.g. oil) and attach to high-contact angle surfaces. the hydrophobicity of several strains of rhodococcus is measured and mapped using chemical force microscopy (cfm) in the present study. cfm relies on the functionalisation of scanning force microscopy (sfm) tips using hydrophobic or hydrophilic groups. in cfm, the microscope is operated in the forcevolume mode, which combines adhesion data with topographic images. the careful control of the tip chemistry permits the study of interactions between the functional groups on the tip and the bacterial surface, thus allowing the assessment of hydrophobicity. in order to perform a cfm study, the cells need to be firmly anchored to a substrate under physiological conditions (i.e. under a nutrient media or a saline buffer). to this end, several adhesive surfaces have been tested in order to find the one that gives the best results. optical microscopy is arguably the most important technique for the study of living systems because it allows d imaging of cells and tissues under physiological conditions under minimally invasive conditions. conventional far-field microscopy is diffraction-limited; only structures larger than * nm can be resolved, which is insufficient for many applications. recently, techniques featuring image resolutions down to * nm have been introduced such as localization microscopy (palm, storm) and reversible saturable optical fluorescent transition microscopy (resolft, sted). these methods are well suited for live-cell imaging and narrow the resolution gap between light and electron microscopy significantly. we have used palm imaging to study the formation and disassembly of focal adhesions of live hela cells in a high resolution pulse-chase experiment using monomeric irisfp [ ] . mirisfp is a photoactivatable fluorescent protein that combines irreversible photoconversion from a green-to a red-emitting form with reversible photoswitching between a fluorescent and a nonfluorescent state in both forms. in our experiments a subpopulation of mirisfp molecules is photoconverted to the red form by irradiating a specified region of the cell with a pulse of violet light. migration of tagged proteins out of the conversion region can be studied by subsequently localizing the proteins in other regions of the cell by palm imaging, now using the photoswitching capability of the red species. real-time image reconstruction developed in our lab [ ] allowed instant control imaging parameters. live cell imaging of cancer cells is often used for in-vitro studies in connection with photodynamic diagnostic and therapy (pdd and pdt). especially in presence of a photosensitizer, this live cell imaging can only be performed over relatively short duration (at most hour). this restriction comes from the light-induced cell damages (photodamages) that result from rapid fluorescence photobleaching of photosensitizer. while these studies reveal exciting results, it takes several hours to discover the detailed effects of the photosensitizer on cell damage. up to our knowledge, however, there is no general guideline for modification of excitation light dose to achieve that. in this paper, the relation between excitation light doses, photobleaching of photosensitizer (pvp-hyperycin) and cell vitality are investigated using human lung epithelial carcinoma cells (a ). the strategy of this paper is to reduce the excitation light dose by using a low-power pulsed blue led such that the structures are visible in time-lapse images. fluorescence signals and image quality are improved by labelling the cells with an additional non-toxic marker called carboxyfluorescein-diacetate-succinimidyl-ester (cfse). in total we collected time-lapse images (time intervals min) of dual-marked a cells under three different light intensities ( . , . and . mw/cm ) and a variety of pulse lengths ( . , . , , . and ms) over five hours. we have found that there is a nonlinear relationship between the amount of excitation light dose and cell vitality. cells are healthy, i.e. they commence and complete mitosis, when exposed to low light intensities and brief pulses of light. light intensities higher than . mw/cm together with pulse durations longer than ms often cause cell vesiculation, blebbing and apoptosis. in all other cases, however, we found no cell death. in the future, this striking nonlinearity will be studied in more detail. progressive advances in scanning ion conductance microscopy (sicm) [ ] enabled us to convert ordinary scanning probe microscope (spm) in to versatile multifunctional technique. as an imaging tool, ion conductance microscopy is capable to deliver highest possible topographical resolution on living cell membranes among any other microscopy techniques [ ] . also, it can visualize surfaces complexity of those makes them impossible to image by other spms [ ] . ion conductance microscopy combined with a battery of powerful methods such as fluorescence resonance energy transfer (fret) [ ] , patch-clamp, force mapping, localized drug delivery, nano-deposition and nano-sensing is unique among current imaging techniques. the rich combination of ion conductance imaging with other imaging techniques such as laser confocal and electrochemical [ ] will facilitate the study of living cells and tissues at nanoscale. ) and coleoptiles of wheat (triticum aestivum l.) seedlings, which were growing in light and dark conditions, were used to determine fluorescence of whole cells. fluorescence emission spectrum was monitored by fluorescent microscopy using the spectrometer usb . fluorescence intensity f , f , f and f was determined and data was statistically analyzed in annova. we observed that bgf, rf and frf intensity increased in the first leaves with the age of the seedlings. in the coleoptiles was observed great bgf intensity increase with the age of the seedlings. in the coleoptiles decreased rf intensity of the and hours old seedlings, and bgf intensity decreased of hours old seedlings. it was found that emission spectrum and fluorescence intensity changes are induced by the lack of light and salt (nacl) stress. analysis of fluorescence spectrum can quickly and accurately indicate the outset of light and salt stress in plants. there are analogical changes in fluorescence emission spectrum of plant cells in senescence and stress conditions. it was assumed that environmental stress and senescence have common mechanisms in plants. this changes can be monitored by fluorescent microscopy. triple-colour super-resolution imaging in living cells markus staufenbiel, stephan wilmes, domenik lisse, friedrich roder, oliver beutel, christian richter and jacob piehler universitä t osnabrü ck, fachbereich biologie, barbarastraße , , germany, markus.staufenbiel@biologie.uniosnabrü ck.de super-resolution fluorescence imaging techniques based on single molecule localisation has opened tremendous insight into the sub-micrometre organisation of the cell. live cell imaging techniques such as fluorescence photoactivation localization microscopy (fpalm) are currently limited to dual-colour detection due to the restricted availability of red-fluorescent photoswitchable proteins. we employed photoswitching of the oxazine dye atto under reducing conditions for super-resolution imaging in the cytoplasm of living cells. for efficient and specific covalent labelling of target proteins, we have made use of the halotag system. atto was coupled to the halotag ligand (htl) and fast reaction of htl-atto with the halotag enyzme was confirmed in vitro by solid phase binding assays. efficient labelling of the membrane cytoskeleton using lifeact fused to the halotag was observed and super-resolution imaging was readily achieved. based on this approach, we managed to follow the nanoscale dynamics of the actin cytoskeleton as well as clathrincoated pits using clathrin light chain fused to the halotag. we combined this technique with fpalm for triplecolour super-resolution imaging of the spatial distribution of membrane receptors in context of the membrane skeleton. the erbb family of receptor tyrosine kinases consists of four transmembrane proteins that transduce signals across the membrane to control cell fate. growth factor binding results in homo-and hetero-interactions between these receptors at the membrane. erbb receptors are implicated in many cancers, making them a target for therapeutic drugs. to date, studies of erbb interactions have been limited to individual family members or specific pairs, giving an incomplete picture of the highly complex behaviour controlling positive and negative feedback loops and signalling outcomes. to investigate erbb receptor interactions, we have developed tirf-based single molecule fluorescence microscopes capable of simultaneously imaging three, and soon five, fluorescence probes in live cells. we have also developed a catalogue of extrinsic fluorescent probes for : labelling of both endogenous and transfected erbb family members in mammalian cells, plus a bayesian approach to the analysis of single molecule data. this allows us to track active and inactive erbb family members at the basal surface of a model breast cancer cell line that expresses physiological levels of all four receptors. we present here initial characterisation of the entire erbb family together in the cell membrane. the human genome contains more than g proteincoupled receptors (gpcrs); overall, - % of the mammalian genome encodes these molecules. processes controlled by gpcrs include neurotransmission, cellular metabolism, secretion, and immune responses. however it is the stoichiometry of these receptors that is the most controversial. the starting point for understanding gpcr function was the idea that these receptors are monomeric. on the other hand a lot of recent studies favour the concept that gpcr form dimers and are not capable of signalling as independent monomers. recent single molecule studies try to solve this dilemma by suggesting that gpcrs form transient dimers with a lifetime of * ms. however questions remain about the physiological relevance of the preparations necessary for these studies, since they have not been performed on endogenous receptors. here, we directly image individual endogenous receptors using an equimolar mixture of two colour fluorescent fab fragments. we can then determine the receptors stoichiometry by quantifying its dynamic single molecule colocalisation (dy-sco) recorded by total-internal reflection fluorescence (tirf) microscopy. we have recently investigated the domain dynamics of pgk ( ) . structural analysis by small angle neutron scattering revealed that the structure of the holoprotein in solution is more compact as compared to the crystal structure, but would not allow the functionally important phosphoryl transfer between the substrates, if the protein would be static. brownian large scale domain fluctuations on a timescale of ns was revealed by neutron spin echo spectroscopy. the observed dynamics shows that the protein has the flexibility to allow fluctuations and displacements that seem to enable function. [ many physiological and pathological processes involve insertion and translocation of soluble proteins into and across biological membranes. however, the molecular mechanisms of protein membrane insertion and translocation remain poorly understood. here, we describe the ph-dependent membrane insertion of the diphtheria toxin t domain in lipid bilayers by specular neutron reflectometry and solid-state nmr spectroscopy. we gained unprecedented structural resolution using contrast-variation techniques that allow us to propose a sequential model of the membrane-insertion process at angstrom resolution along the perpendicular axis of the membrane. at ph , the native tertiary structure of the t domain unfolds, allowing its binding to the membrane. the membrane-bound state is characterized by a localization of the c-terminal hydrophobic helices within the outer third of the cis fatty acyl-chain region, and these helices are oriented predominantly parallel to the plane of the membrane. in contrast, the amphiphilic n-terminal helices remain in the buffer, above the polar headgroups due to repulsive electrostatic interactions. at ph , repulsive interactions vanish; the n-terminal helices penetrate the headgroup region and are oriented parallel to the plane of the membrane. the c-terminal helices penetrate deeper into the bilayer and occupy about two thirds of the acyl-chain region. these helices do not adopt a transmembrane orientation. interestingly, the t domain induces disorder in the surrounding phospholipids and creates a continuum of water molecules spanning the membrane. we propose that this local destabilization permeabilizes the lipid bilayer and facilitates the translocation of the catalytic domain across the membrane. the limited stability in vitro of mps motivates the search of new surfactants ( ) ( ) ( ) ( ) . fss with a polymeric hydrophilic head proved to be mild towards mps ( ) .new fss were designed with chemically defined polar heads for structural applications. lac-derivative was efficient in keeping several mps water soluble and active but formed elongated rods ( ) . the glu-family was synthesized, characterized in by sans and auc and for its biochemical interest. the formation of rods is related to the low volumetric ratio between the polar head and hydrophobic tail. the surfactant bearing two glucose moieties is the most promising one, leading to both homogeneous and stable complexes for both br and the b f. it was also shown be of particular interest for the structural investigation of membrane proteins using sans ( ). by combining elastic and quasi-elastic neutron scattering data, and by applying theory originally developed to investigate dynamics in glassy polymers, we have shown that in lyophilised apoferritin above t * k the dynamic response observed in the pico-to nano-second time regime is driven by ch dynamics alone, where the methyl species exhibit a distribution of activation energies. our results suggest that over the temporal and spatial range studied the main apoferritin peptide chain remains rigid. interestingly, similar results are reported for other smaller, more flexible lyophilised bio-materials. we believe this work elucidates fundamental aspects of the dynamic landscape in apoferritin which will aid development of complex molecular dynamic model simulations of super-molecules. a detailed appreciation of the relationships between dynamics and biological function will require analysis based on such models that realize the full complexity of macromolecular material. biological systems must often be stored for extended periods of time. this is done by lyophilisation in the presence of lyoprotectants, such as sugars, which results in stable products at ambient conditions. [ ] in an effort to understand the mechanism of preservation and stabilization, the interactions between sugars and liposome vesicles, which serve as a simple membrane model, have been studied extensively. amongst the common sugars, trehalose has superior preservative effects [ ] and accumulates to high concentrations in many anhydrobiotic organisms. despite many experimental and numerical studies three mechanisms are proposed: vitrification [ ] , preferential exclusion [ ] and water replacement [ ] . to gain more insight into the stabilization mechanism we have recently investigated the effect of trehalose on the bending elasticity of fully hydrated unilamellar vesicles of , -dipalmitoyl-phosphatidylcholine (dppc) in d o at temperatures below and above the lipid melting transition (tm) using neutron spin-echo. the data was analyzed using the zilman-granek theory. at all temperatures measured, trehalose stiffens the bilayer suggesting strong interactions between trehalose and the lipid. trehalose appears to broaden the melting transition but does not change the tm. this agrees with observations using differential scanning calorimetry. influence of macromolecular crowding on protein stability sté phane longeville, clé mence le coeur laboratoire lé on brillouin, gif-sur-yvette, france cell interior is a complex environment filled with a variety of different objects with respect to shape and size. macromolecules are present at a total concentration up to several hundred grams per litre and the overall occupied volume fraction can reach ö & . - . . under crowding environment protein-protein interaction play a fundamental role. the crowding environment can affect some physical, chemical, and biological properties of biological macromolecules [ , ] . traditionally, protein folding is studied in vitro at very low concentration of proteins. under such conditions, small globular single chain proteins can unfold and refold quite rapidly depending mainly to the nature of the solvent. such processes have been very intensively studied, since folding of proteins into their native structure is the mechanism, which transforms polypeptide into its biologically active structure. protein misfolding is involved in a very high number of diseases [ ] (e.g. alzheimer, parkinson, and kreuzfeld-jacob diseases, type ii diabetes, …). theoretically, the problem was studied by the introduction of the concept of excluded volume [ ] . in recent papers [ , ] , minton uses statistical thermodynamic models to address the question. he predicted that inert cosolutes stabilize the native state of proteins against unfolding state mainly by destabilizing the unfolded state and that the dimension of the unfolded state decreases with increasing the concentration of cosolute in a measurable way. small angle neutron scattering (sans) is a technique of choice for such study because, by using appropriate mixtures of light and heavy water, it is possible to match the scattering length density of the solvent to the one of the cosolute and thus to measure the conformation of a molecule at low concentration in a presence of a high concentration of another one. we will present a complete experimental study of the mechanism that leads to protein stabilization by macromolecular crowding [ , ] . coupled dynamics of protein and hydration water studied by inelastic neutron scattering and molecular dynamics simulation hiroshi nakagawa , , mikio kataoka , japan atomic energy agency, tokai, japan, juelich centre for neutron science, forschungszentrum juelich gmbh, nara institute of science and technology, ikoma, japan proteins work in an aqueous environment at ambient temperature. it is widely accepted that the proteins are flexible and mobile. the flexibility and mobility, that is, protein dynamics are essential for protein functions. neutron incoherent scattering is one of the most powerful techniques to observe protein dynamics quantitatively. here i will talk about dynamics of protein and its hydration water. the structure of a soluble protein thermally fluctuates in the solvated environment of a living cell. understanding the effects of hydration water on protein dynamics is essential to determine the molecular basis of life. however, the precise relationship between hydration water and protein dynamics is still unknown because hydration water is ubiquitously configured on the protein surface. we found that hydration level dependence of the onset of the protein dynamical transition is correlated with the hydration water network. hydration water dynamics change above the threshold hydration level, and water dynamics control protein dynamics. these findings lead to the conclusion that the hydration water network formation is an essential property that activates the anharmonic motions of a protein, which are responsible for protein function. thermal motions and stability of hemoglobin of three endotherms (platypus -ornithorhynchus anatinus, domestic chicken -gallus gallus domesticus and human -homo sapiens) and an ectotherm (salt water crocodile -crocodylus porosus) were investigated using neutron scattering and circular dichroism. the results revealed a direct correlation between the dynamic parameters, melting -, and body temperatures. on one hand, a certain flexibility of the protein is mandatory for biological function and activity. on the other hand, intramolecular forces must be strong enough to stabilize the structure of the protein and to prevent unfolding. our study presents experimental evidence which support the hypothesis that the specific amino acid composition of hb has a significant influence on thermal fluctuations of the protein. the amino acid sequence of hb seems to have evolved to permit an optimal flexibility of the protein at body temperature. macromolecular resilience was found to increase with body and melting temperatures, thus regulating hb dynamics. where k is the trap spring constant, a is the subdiffusion exponent and e a is the mittag-leffler function. the parameters obtained by fitting this equation to the experimental msds are summarized in table . at short lag times we have not found any difference between the two cell types, contrarily to the previous results obtained by afm . for both cell lines the subdiffusion exponent, a was found close to , the value predicted by the theory of semiflexible polymers. but the crossover frequency x , was found smaller for the cancerous cells for all datasets. it corresponds to passage to the confined regime at longer times. we attribute it to the bigger impact of molecular motors. we study the spatiotemporal evolution of fibrous protein networks present in the intracellular and extracellular matrices. here, we focus on the in vitro actin network dynamics and evolution. in order to study the hierarchical self-assembly of the network formation in a confined environment and provide external stimuli affecting the system minimally, we introduce a new microfluidic design. the microfluidic setup consists of a controlling channel to which microchambers of different shapes and sizes are connected through narrow channels. this design results in having mainly convective transport in the controlling channel and diffusive transport into the microchamber. rhodamine labeled actin monomers diffuse into the chamber. after polymerization is induced, they form a confined entangled network. cross-linking proteins can then be added to increase the network complexity. moreover, we can generate gradients of reactants across the microchambers and vasodilator-stimulated phosphoprotein (vasp) is a crucial regulator of actin dynamics. it is important in cellular processes such as axon guidance and migration, promoting assembly of filopodia and lamellipodia. vasp's multiple domain structure increases the range of interactions it has with actin monomers, filaments, and other proteins and it displays multiple binding modes both in vitro and in vivo, including barbed end elongation and filament bundling. however, it is not fully understood how vasp affects the structural and mechanical properties of actin networks. we characterize vasp-mediated bundling of actin networks in a simplified in vitro system using confocal microscopy and quantify mechanical properties with rheology measurements. we show that the network properties differ from other actin bundling proteins and reflect vasp's multiple domain structure, displaying a complex bundling phase space that depends upon solution conditions. we observe the formation of large bundle aggregates accompanied by a reduction in network elasticity at high protein ratios. in addition, we change vasp's actin binding mode and eliminate bundling by introducing free actin monomers. finally, we show preliminary results from a biomimetic system that extends the range of actin-vasp interaction. cell migration or proliferation? the go or grow hypothesis in cancer cell cultures tamá s garay, É va juhá sz, jó zsef tímá r, balá zs heged} us nd department of pathology, semmelweis university, budapest, hungary background: cancer related death is constantly growing in the past decades. the mortality of solid tumors is mostly due to the metastatic potential of tumor cells which requires a fine adjustment between cell migration and cell proliferation. as the metabolic processes in the cell provide a limited amount of available energy (i.e. atp) the various biological processes like cell motility or dna synthesis compete for the atp available. the go or grow hypothesis postulates that tumor cells show either high migration or proliferation potential. in our study we investigate on a large series of tumor cell lines whether this assumption stands for malignant cells. materials and methods: twenty tumor cell lines derived from malignant mesothelioma (mesodermal origin) and malignant melanoma (neuroectodermal origin) were subjected to three-days-long time-lapse videomicroscopic recordings. cell motility and proliferation were characterized by the probability of cell division within hours and the -hour migration distance of the cells. results: we found a wide range in both the cell migratory activity and the proliferation capacity in our series. the -hour migration distance ranged from to micron and from to micron in mesothelioma and melanoma cells, respectively. the lowest -hour cell division probability was found to be . in both the melanoma and mesothelioma series while the highest proliferation activity reached . and . in melanoma and mesothelioma, respectively. interestingly, in the melanoma cell lines we found a significant positive correlation (r= , ; p= , ) between cell proliferation and cell migration. in contrast our mesothelioma cell lines displayed no correlation between these two cellular processes. conclusions: in summary our findings demonstrate that the investigated tumor cells do not defer cell proliferation for cell migration. important to note the tumor cells derived from various organ systems may differ in terms of regulation of cell migration and cell proliferation. furthermore our observation is in line with the general observation of pathologists that the highly proliferative tumors often display significant invasion of the surrounding normal tissue. many cell types are sensitive to mechanical signals. one striking example is the modulation of cell proliferation, morphology, motility, and protein expression in response to substrate stiffness. changing the elastic moduli of substrates alters the formation of focal adhesions, the formation of actin filament bundles, and the stability of intermediate filaments. the range of stiffness over which different primary cell types respond can vary over a wide range and generally reflects the elastic modulus of the tissue from which these cells were isolated. mechanosensing also depends on the type of adhesion receptor by which the cell binds, and therefore on the molecular composition of the specific extracellular matrix. the viscoelastic properties of different extracellular matrices and cytoskeletal elements also influence the response of cells to mechanical signals, and the unusual non-linear elasticity of many biopolymer gels, characterized by strain-stiffening leads to novel mechanisms by which cells alter their stiffness by engagement of molecular motors that produce internal stresses. the molecular mechanisms by which cells detect substrate stiffness are largely uncharacterized, but simultaneous control of substrate stiffness and adhesive patterns suggests that stiffness sensing occurs on a length scale much larger than single molecular linkages and that the time needed for mechanosensing is on the order of a few seconds. to explore potential role of cytoskeletal component in cardiomyocyte for adaptation to extreme conditions was carried out the comparative study of expression of cytoskeletal sarcomeric protein titin in myocardium of ground squirrels during hibernation and gerbils after spaceflight. we have revealed a two-fold increase in content of long n ba titin isoform as compared to short n b titin isoform in different heart chambers of hibernating ground squirrels. the prevalence of the long titin isoform is known to determine the larger extensibility of heart muscle that promotes, according to frank-starling law, the increase in force of heart contractility for pumping higher viscous blood during torpor and adapting the myocardium to greater mechanical loads during awakening. moreover, titin mrna level showed seasonal downregulation in which all hibernating stages differed significantly from summer active level. it is possible that the decline of mrna and protein synthesis during hibernation may be regarded as the accommodation for minimization of energetic expenditures. we have not revealed differences in titin mrna levels between control gerbils and gerbils after spaceflight. but we have also observed the two-fold growth in the amount of n ba titin isoform in left ventricle of gerbils after spaceflight that is likely to be directed to the restoration of the reduced heart contractility at zero-gravity. these results suggest that the increase of the content of the long n ba titin isoform may serve as universal adaptive mechanism for regulating of heart function in response to the extreme conditions. nuclear migration is a general term for a non-random movement of the nucleus toward specific sites in the cell. this phenomenon has been described throughout the eukaryotes from yeast to mammals. the process is however still poorly understood in mammalian cells. by using microcontact printing we are able to regulate the geometry and spreading of cultured cells. adhesive micropatterns of fibronectin provide an attachment surface for the cells whereas the passivation of the surface by pll-peg prevents protein, thus cell adhesion. live cell imaging by time-lapse microscopy has shown that under these conditions cells gain a bipolar shape, and more interestingly, the nuclei of the cells showed auto-reversed motion. our research tries to understand the molecular cues and mechanisms behind the observed cellular and nuclear movement. we have already shown that the cytoskeleton plays an important role in this phenomena but the exact players and the detailed mechanism remain to be clarified. in order to identify the most important components and their relationship have drug treatments and sirna experiments have been applied. although our research focuses mainly on the motility of the nucleus, it may also help to get a better understanding of the general theme of cell migration. cell motility involves a number of strategies that cells use to move in their environments in order to seek nutrients, escape danger and fulfil morphogenetic roles. when these processes become uncontrolled, pathological behaviours, like cancer or metastasis of cancerous cells, can occur. here we present a new method for the contextual quantification of cellular motility, membrane fluidity and intracellular redox state, by using the ratiometric, redox-sensitive protein rxyfp and the ratiometric fluidity-sensitive probe laurdan. we provide evidence that dynamic redox and fluidity changes are correlated with signaling processes involved in cellular motility. these findings may pave the way to novel approaches for the pharmacological control of cell invasiveness and metastasis. manipulation of cellular mechanics anna pietuch, andreas janshoff georg-august university, tammanstraße , gö ttingen, germany, e-mail: anna.pietuch@chemie.unigoettingen.de rheological properties of cells determined by the underlying cytoskeleton (cortex) are key features in cellular processes like cell migration, cell division, and cell morphology. today it is possible to investigate local cellular elastic properties under almost physiological conditions using the afm. by performing force indentation curves on local areas on a cell surface the use of contact mechanic models provides the young's modulus, comprising information about the elastic properties of cells. the administration of cytoskeleton modifying substances into the cell is achieved by microinjection. we are also investigating morphological changes and rearrangements of the cytoskeleton in time resolved impedance measurements. electric cell-substrate impedance sensing is a label-free and minimal invasive technique which allows monitoring morphological changes of cells in real time. readout of the impedance is sensitive to changes in cell-substrate contacts as well as density of cell-cell contacts yielding important information about the integrity of the cell layer and changes in the properties of the cell membrane. we are studying the cellular response to modification of the cytoskeleton e.g. by introducing proteins which affect directly the organization of the actin structure like ezrin. mechanical characterization of actin gels by a magnetic colloids technique thomas pujol, olivia du roure, julien heuvingh pmmh, espci-cnrs-umpc-p . paris, france the actin polymer is central in cell biology: it is a major component of cytoskeleton and it plays a fundamental role in motility, division, mechanotransduction…. its polymerization just beneath the cell membrane generates forces responsible for cell movement. actin filaments form a network whose architecture is defined by the nature of the binding proteins and depends on the location in the cell. for example, in the structure which leads cell migration, the lamellipodium, the gel is branched due to its interaction with arp / protein complex. determining the mechanical properties of such actin network is a crucial interest to understand how forces are generated and transmitted in living cells. we grow a branched actin network from the surface of colloids using the arp / machinery. the particles are super paramagnetic and they attract each other via dipole-dipole interaction to form chain. by increasing the magnetic field we apply an increasing force to the gel and we optically measure the resulting deformation. from those measurements we deduce a young modulus for a large amount of data. we are characterizing different networks by varying the concentration of the capping and branching proteins and we show how mechanics can be regulated by the different proteins. microtubules (mts) are central to the organization of the eukaryotic intracellular space and are involved in the control of cell morphology. in fission yeasts cells mts transport polarity factors to poles where growth is located, thus ensuring the establishment and maintenance of the characteristic spherocylindrical shape. for this purpose, mt polymerization dynamics is tightly regulated. using automated image analysis software, we investigated the spatial dependence of mt dynamics in interphase fission yeast cells. we evidenced that compressive forces generated by mts growing against the cell pole locally reduce mt growth velocities and enhance catastrophe frequencies. in addition, our systematic and quantitative analysis (in combination with genetic modifications) provides a tool to study the role of ?tips (plus-end tracking proteins) such as mal and tip in the spatial regulation of mt dynamics. we further use this system to decipher how the linear transport by mt interferes with the feedback circuitry that assures the correct spatial distribution of tea , the main polarity factor in fission yeast cells. the dynamics of the cytoskeleton are largely driven by cytoskeletal motor proteins. complex cellular functions, such as mitotsis, need a high degree of control of these motors. the versatility and sophistication of biological nanomachines still challenges our understanding. kinesin- motors fulfill essential roles in mitotic spindle morphogenesis and dynamics and were thought to be slow, processive microtubule (mt)-plus-end directed motors. here we have examined in vitro and in vivo functions of the saccharomyces cerevisiae kinesin- cin using single-molecule motility assays and single-molecule fluorescence microscopy. in vivo, the majority of cin motors moved slowly towards mt plus-ends, but we also observed occasional minus-end directed motility episodes. in vitro, individual cin motors could be switched by ionic conditions from rapid (up to lm/min) and processive minus-end, to bidirectional, to slow plus-end motion. deletion of the uniquely large insert in loop of cin induced bias towards minus-end motility and strongly affected the directional switching of cin both in vivo and in vitro. the entirely unexpected in vivo and in vitro switching of cin directionality and speed demonstrate that kinesins are much more complex than thought. these results will force us to rethink molecular models of motor function and will move the regulation of motors into the limelight as pivotal for understanding cytoskeleton-based machineries. morphological and dynamical changes during tgf-b induced epithelial-to-mesenchymal transition david schneider , marco tarantola , and andreas janshoff institute of physical chemistry, georg-august-university, gö ttingen, germany, max planck institute for dynamics and self-organization, laboratory for fluid dynamics, pattern formation and nanobiocomplexity (lfpn), goettingen, germany the epithelial-to-mesenchymal transition (emt) is a program of cellular development associated with loss of cell-cell contacts, a decreased cell adhesion and substantial morphological changes. besides its importance for numerous developmental processes like embryogenesis, emt has also been held responsible for the development and progression of tumors and formation of metastases. the influence of the cytokine transforming growth factor (tgf-b ) induced emt on structure, migration, cytoskeletal dynamics and long-term correlations of the mammalian epithelial cell lines nmumg, a and mda-mb was investigated by time-resolved impedance analysis and atomic force microscopy (afm) performing force-indentation measurements. the three cell lines display important differences in cellular morphology mirrored in changes of their elastic response (young modulus), as well as their dynamics upon tgf-b treatment. impedance based measurements of micromotility reveal a complex dynamic response to tgf-b exposure which leads to a transient increase in fluctuation amplitude and long-term correlation. additionally, the investigation of cellular elasticity via afm depicts the different cytoskeletal alterations depending on the metastatic potential of the used cell type. physics of cellular mechanosensitivity studied with biomimetic actin-filled liposomes bjö rn stuhrmann, feng-ching tsai, guido mul, gijsje koenderink fom institute amolf, amsterdam, the netherlands biological cells actively probe the mechanical properties of their tissue environment by exerting contractile forces on it, and use this information to decide whether to grow, migrate, or proliferate. the physical basis for cell contractility is the actin cytoskeleton, which transmits motor generated stresses to mechanosensitive adhesions sites that anchor the cell to the tissue. the origins of mechanosensing are far from understood due to the complex interplay of mechanical effects and biochemical signaling that occurs far from equilibrium. we use a quantitative biophysical approach based on biomimetic constructs to elucidate physical principles that underlie active mechanosensing in biological cells. we have built realistic in vitro models of contractile cells by encapsulating cross-linked actin networks together with myosin motors in cell-sized membranous containers (liposomes). our method has several advantages over prior methods, including high liposome yield, compatibility with physiological buffers, and chemical control over protein/lipid coupling. i will show contour fluctuation spectra of constructs and first data on mechanical response obtained by laser tweezers microrheology. our work will yield novel insights into stress generation and stiffness sensing of cells. setting up a system to reconstitute cytoskeleton-based protein delivery and patterning in vitro nú ria taberner, liedewij laan, marileen dogterom fom institute amolf, amsterdam, the netherlands keywords: microtubules, fission yeast, cell polarity, protein patterning, plus end binding proteins. many different cell types, from mobile fibroblasts [ ] to fission yeast cells [ ] , display non-homogenous protein patterns on their cell cortex. in fission yeast the cell-end marker protein tea that among others is responsible for recruiting the actin dependent cell-growth machinery, is specifically located at the growing cell ends. tea travels at the tips of growing microtubules and is delivered to the cell ends [ ] . we aim to in vitro reconstitute a minimal microtubule plus-end tracking system that leads to cortical protein patterning in functionalized microfabricated chambers. our model will allow us to perturb microtubule-based transport and diffusion independently and evaluate the resulting protein patterns. [ the tropomyosins (tm) are dimeric actin-binding proteins that form longitudinal polymers along the actin filament groove. there is a great variety of isoforms, but the division of labour between the individual tms and their significance is poorly understood. as in most cell types, also in the neurons several isoforms are present, whose spatio-temporal localisation is differentially regulated. the neuron-specific brain isoform (tmbr- ) can be found in the axon of the mature cells. we aimed to clone, express and charaterise this protein in terms of its effects on the kinetic parameters of the actin filament. using a pet a construct we purified native, tag-free protein, and examined if it influences the rate of actin polymerisation or the stability of the filaments in the presence of either gelsolin or latrunculin-a, two depolymerising agents. in cosedimentation experiments the affinity of tmbr- to actin was* lm, about six times that of skeletal muscle tropomyosin. the net rate of actin polymerisation was reduced by % in the presence of tmbr- . the depolymerisation induced by gelsolin or latrunculin-a was inhibited in a concentration-dependent manner. tmbr- seems to stabilise actin filaments against disassembly without significant effect on the net polymerisation. cell mobility and metastatic spreading: a study on human neoplastic cells using optical tweezers f. tavano the primary causes of death in cancer patients are local invasion and metastasis but their mechanisms are not yet completely understood. metastatization is accompanied by alterations of the cytoskeleton and membrane structure leading to changes in their biomechanical properties [ ] . in this study we analyzed by means of optical tweezers the mechanical properties of two different breast adenocarcinoma cell lines corresponding to different metastatic potential. ot were used to grab the plasma membrane by a , um silica bead and form a plasma membrane tether. we measured the force exerted by the cell membrane on the bead and drew the force-elongation curves. fitting data in the kelvin body model [ ] we found out the values for the viscoelastic parameters influencing the pulling of the membrane tethers. the first cell line analyzed, mcf- , associated to a low metastatic potential showed tether stiffness of pn/um in average. the second cell line, mda-mb , poorly differentiated with a high metastatic potential had a tether stiffness of pn/um in average, that is a four times lower value. these results seems to confirm the hypotesis that metastasis prone cells are softer than less aggressive cancer cells, and support the use of ot for these measurements for its sub-pn force resolution and because cells are manipulated without damage. [ tubulin polymerization promoting protein (tppp/p ) is a brain-specific protein that primary targets the microtubule network modulating its dynamics and stability. tppp/p is a disordered protein with extended unstructured segments localized at the n-and c-terminals straddling a flexible region. tppp/p is primarily expressed in oligodendrocytes where its multifunctional features such as tubulin polymerization promoting and microtubule bundling activities are crucial for the development of the projections in the course of oligodendrocyte differentiation enabling the ensheathment of axons with a myelin sheath that is indispensable for the normal function of the central nervous system. microtubule network, a major constituent of the cytoskeleton, displays multiple physiological and pathological functions in eukaryotic cells. the distinct functions of the microtubular structures are attained by static and dynamic associations of macromolecules and ligands as well as by post-translational modifications. tppp/p is actively involved in the regulation of microtubule dynamics not exclusively by its bundling activity, but also by its tubulin acetylation-promoting activity. atypical histone deacetylases, such as nad-dependent sirt and histone deacetylase- , function outside of the nucleus and control the acetylation level of cytosolic proteins, such as tubulin. acetylation-driven regulation of the microtubule network during cellular differentiation is an ambiguous issue. tppp/p has been recently identified as an interacting partner and inhibitor of these deacetylases and their interaction decreased the growth velocity of the microtubule plus ends and the motility of the cells. we have established cell models for the quantification of the acetylation degree of microtubule network in correlation with its dynamics and stability as well as in relation to aggresome formation, that mimics the pathological inclusion formation. the intracellular level of tppp/p is controlled at posttranscription level by microrna and at protein level by the proteosome machinery. under pathological circumstances this disordered protein displays additional moonlighting function that is independent of its association with microtubule system or deacetylases; it enters aberrant proteinprotein interaction with a-synuclein forming toxic aggregates within the neuronal and glial cells leading to the formation of inclusions characteristic for parkinson's disease and multiple system atrophy, respectively. the cell membrane separates the intracellular from the extracellular environment while intimately interacting with the cytoskeleton in numerous cellular functions, including cell division and motility. cell shape changes are for a large part mediated by the contractile actomyosin network forming the cortex underneath the cell membrane. to uncover molecular mechanisms of cell shape control based on actin-membrane interactions, we built a novel biomimetic model system: a cell-sized liposome encapsulating an actively contracting actin-myosin network. our fabrication method is inspired by a recent report of liposome preparation by swelling of lipid layers in agarose hydrogel films. we extensively characterize important liposomal properties, finding diameters between and lm, unilamellarity, and excellent and uniform encapsulation efficiency. we further demonstrate chemical control of actin network anchoring to the membrane. the resulting liposomes allow quantitative tests of physical models of cell shape generation and mechanics. in the cohesive structure of the cytoskeleton functionally distinct actin arrays orchestrate fundamental cell functions in a spatiotemporally controlled manner. emerging evidences emphasize that protein isoforms are essential for the functional polymorphism of the actin cytoskeleton. the generation of diverse actin networks is catalyzed by different nucleation factors, like formins and arp / complex. these actin arrays also exhibit qualitative and quantitative differences in the associated tropomyosin (tm) isoforms. how the molecular composition and the function of actin networks are coupled is not completely understood. we investigated the effects of different tm isoforms (skeletal muscle, cytoskeletal nm and br ) on the activity of mdia formin and arp / complex using fluorescence spectroscopic approaches. the results show that the studied tm isoforms have different effects on the mdia -, and arp / complexmediated actin assembly. the activity of the arp / complex is inhibited by sktm and tm nm , while tmbr does not have any effect. all three tm isoforms inhibited the activity of mdia . these results contribute to the understanding of the mechanisms by which tropomyosin isoforms regulate the functional diversity of the actin cytoskeleton. chronic thromboembolic pulmonary hypertension (cteph) is a dual pulmonary vascular disorder, which combines major vascular remodelling with small-vessel arteriopathy. the presence of fibroblasts in the clot, occluding the pulmonary arteries, and its composition create a microenvironment with increased collagen level, which might affect the local endothelial function. in this study, human pulmonary artery endothelial cells (hpaec) were exposed to collagen type i to address the effect of the thrombotic microenvironment on the vessel wall forming cells. the hpaecs, cultured under standard conditions were treated with , and lg/ml of collagen type i for h and h. the changes in the endothelial cell barrier function were investigated by performing permeability and migration test as well as ve-cadherin staining. collagen type i treatment led to a decrease in ve-cadherin signal in hpaec. the loosening of cell-cell contacts could be proven with a significant increase in permeability after h of collagen treatment with different concentrations. besides the loosening of the cell-cell contacts, the hpaec migration was also dose dependently retarded by collagen application over time. our data show that collagen-rich microenvironment leads to a disruption of the junctional proteins in hpaecs, indicating an environmental induced possible alteration in the function of endothelial cells in the clots of cteph patients. the implementation of miniaturisation and high throughput screening has quickened the pace of protein structure determination. however, for most proteins the process still requires milligram quantities of protein with purity [ %. these amounts are required as a result of unavoidable losses during purification and for the extensive screening of crystallisation space. for integral membrane proteins (imps), one of the initial steps in the structure determination procedure is still a major bottleneck -the over-expression of the target protein in the milligram quantity range. with a view of developing guidelines for over-expression of human imps, a systematic approach using the three most common laboratory expression systems (e. coli, s. cerevisiae, sf insect cells) was implemented. initial expression levels were determined by either partial purification using ni ? -nta (e. coli), green fluorescence protein (gfp) fluorescence using a c-terminal gfp tagged protein (s. cerevisiae) or flag tagged partial purification (sf cells). the results show that e. coli is suitable for the over-expression of human imps in the required quantity range however protein size and complexity is an important factor. the yeast system is fast and affordable but, for the group of human imps tested, the expression levels were borderline. finally for the insect cell system, the timelines are slower and it is in comparison costly to run, however, it can produce relatively large quantities of human imps. the cu?-atpase copb of enterococcus hirae is a bacterial p-type atpase involved in resistance to high levels of environmental copper by expelling excess copper. the membrane protein copb was purified from an over-expressing strain and solubilized in dodecyl-maltoside. by uv circular dichroism the secondary structure is predicted to contain - % a-helices and - % b-sheets in agreement with estimates based on homology with the ca atpase serca . we present cd-spectroscopic data on thermal unfolding of the protein to address the influence of the binding of the atp analogs atpcs and the fluorescent analog mant-atp on the protein stability. such analogs are used to mimic functional states of the atpase but undergo different interactions with the binding site that are not well characterized. we propose a competition-based assay for nucleotide binding using cdspectroscopy to deduce the occupancy of the nucleotidebinding site by non-fluorescent nucleotides. alternatively, the change of intrinsic fluorescence of mant-atp upon binding to the atpase is exploited in these assays. finally, we show how the simultaneous measurement of protein cd and nucleotide fluorescence in thermal denaturation experiments may help to determine the stability of several functional conformational states of copb. showing the steady-state distribution of electric potential, ionic concentrations are obtained efficiently. channel current, a summation of drift and diffusive currents, can be further computed from the flux of ionic concentrations. the influence of finite size effect will be also addressed. effect of cholesterol and cytoskeleton on k v . membrane distribution jimé nez-garduño am , , pardo la , ortega a , stü hmer w unam, mexico-city, mexico, mpi-em, gö ttingen, germany the potassium channel k v . is expressed nearly exclusively in the central nervous system. besides its function as an ion channel, k v . has also been associated with non-canonical signaling functions. various membrane proteins associated with cholesterol-sphingolipids enriched microdomains are involved in signaling pathways. in this work we studied the membrane distribution of k v . in highly purified brain-tissue plasma membranes as a function of cholesterol content versus cytoskeletal proteins. the results show that one fraction of kv . associates to cholesterol-rich domains or detergent resistant membranes (drm) and another fraction to non-drm domains. the kv . fraction inserted in drm is dependent on cholesterol as well as on cytoskeleton proteins. depletion of cholesterol leads to a doubling of k v . current density. we suggest that k v . coexists in two different populations: one where the transmembrane domain fits cholesterol enriched membranes and another able to fit into a less packed lipid bilayer. the importance of this distribution on signaling processes needs to be further investigated. we use the reduced model of an axis-symmetric water-filled channel whose protein wall has a single charged site. the channel length, radius and fixed charge are selected to match experimental data for gramicidin a. the ion current, occupancy and escape rate are simulated by the d self-consistent bd technique with account taken of the electrostatic ion-ion interaction. the bath with non-zero ion concentration on one side of the channel is modelled via the smoluchowski arrival rate. it is shown that: a) the occupancy saturates with michaelis-menten kinetics. b) the escape rate starts from the kramers value at small concentrations and then increases with concentration due to the electrostatic amplification of charge fluctuations. the resulting dynamics of the current can be described by modified reaction rate theory accounting for ionic escape over the fluctuating barrier [ ] . many membrane-protein functions are amenable to biophysical and biochemical investigation only after the protein of interest has been reconstituted from a detergent-solubilised state into artificial lipid bilayers. unfortunately, functional reconstitution has remained one of the main bottlenecks in the handling of numerous membrane proteins. in particular, gauging the success of reconstitution experiments has thus far been limited to trial-and-error approaches. to address this problem, we have established high-sensitivity isothermal titration calorimetry (itc) as a powerful method for monitoring the reconstitution of membrane proteins into liposomes. itc has previously been employed for characterising liposome solubilisation and reconstitution in the absence of protein. here we show that itc is also excellently suited for tracking the complex process of membrane-protein reconstitution in a non-invasive and fully automated manner. the approach is exemplified for the prokaryotic potassium channel kcsa, which we first purified in detergent micelles and then reconstituted into stable proteoliposomes at very high protein densities. electrophysiological experiments performed in planar lipid membranes confirmed that kcsa regained its functional activity upon itc-guided reconstitution. gating currents of low-voltage-activated t-type calcium channels family má ria karmažínová , Ľ ubica lacinová institute of molecular physiology and genetics, sav, bratislava, slovak republic t-type calcium channels are distinguished by relatively low voltage threshold for an activation and steep voltage dependence of activation and inactivation kinetics just above the activation threshold. kinetics and voltage dependence of macroscopic inward calcium current through ca v channels was described in a detail. in contrast, very little information is available on gating current of these channels. therefore we compared gating currents measured from all three ca v . , ca v . and ca v . channels. voltage dependencies of charge movement differ dramatically from those for macroscopic current. first, their slope factors are several-fold bigger that slope factors of macroscopic current activation. second, activation mid-point for ca v . channels on-gating is shifted to more positive membrane potentials by about mv compare to ca v . and ca v . channels, whose activation mid-points are similar. the same is truth for off-gating voltage dependences. kinetics of both onand off-gating is remarkably faster for ca v . and ca v . channels compare to ca v . channels. further, more charge is moved per unit of macroscopic current amplitude in ca v . channels compare to ca v . and ca v . channels. supported by apvv- - and vega / / . the local anaesthetic lidocaine (lid) is generally believed to reach its binding site in the intracellular vestibule of the voltage-gated sodium channel via the cell membrane. qx (qx) is a permanently charged, quaternary amine analogue of lid, that can access this binding site via a hydrophilic route across the channel protein. the mutation i e of the adult rat muscle-type sodium channel (rna v . ) opens such a hydrophilic pathway. when bound to the internal vestibule, lid stabilizes both fast and slow inactivated states. we wondered whether qx, once bound to the internal vestibule, exerts a similar modulatory action on inactivated states as lid. the construct i e was expressed in tsa cells and studied by means of the patch-clamp technique. when applied from the extracellular side lm qx stabilized the slow but not the fast inactivated state in i e. when applied internally, qx entered the channel, but stabilization of inactivated states could not be observed. these results suggest that binding site for use-dependent block is in the inner vestibule of the channel, fast inactivation is modulated only by the hydrophobic form of lid, and the binding site for modulation of slow inactivation by qx is only accessible from the extracellular side of the channel. we observed that lipophilicity (quantified by the logarithm of the calculated water-octanol partition coefficient, logp) is important in determining both kr and ki, but had a greater effect on ki. distribution coefficients (logd) discriminated better between kr and ki than partition coefficients (logp). the ratio of positively charged/neutral forms (quantified by the acidic dissociation constant, pka) was a significant determinant of resting affinity: predominantly charged compounds tended to be more potent against resting channels, while neutral compounds tended to be more state-dependent. aromaticity was more important for inactivated state affinity. the acidification of intracellular compartments is critical for a wide range of cellular processes. a recent candidate for ph regulation within early endosomes and tgn is the highly conserved intracellular na ? /h ? exchanger isoform (nhe ), whose mutation leads to neurological syndromes in human patients. however, due to its intracellular localization, nhe biochemical features are still poorly characterized and its biological function remains elusive. we have developed somatic cell genetic techniques that enable the selection of variant cells able to resist h ? killing through plasma membrane expression of h ? extruders. this enabled us to obtain stable cell lines with forced plasma membrane expression of nhe . we used them to measure its functional and pharmacological parameters with high accuracy, using fast transport kinetics. to summarize, this exchanger displays unique features within the nhe family, especially with respect to its affinity for its substrates, lithium, sodium and protons and for its guanidine-derived inhibitors. taken together with our results on the subcellular localization of the native nhe , these unique biochemical features provide new insights on the biological function and pathological implications of this intracellular na ? /h ? exchanger. analysis of the collective behaviour of ion channels j. miśkiewicz, z. trela, s. przestalski wrocław university of environmental and life sciences, physics and biophysics department, ul. norwida , - wrocław, poland a novel approach to the analysis of the ion current recordings is proposed. the main goal of the standard patch clamp technique is to measure single channel activity (however the whole cell configuration is also used in various researches). in the presented study the ion channels time series recordings were several (up to four) ion channels were present are analysed and the collective behaviour of ion channels is investigated. the time ion current time series are converted into dwelltime series and the channel activity is analysed. the hypothesis of collective ion channels behaviour is verified and the influence of organolead compounds (met pbcl) on collective ion channel activity is measured. the analysis is performed on the sv cation channels of vacuolar membrane of beta vulgaris. the aim of our computed study was to examine the possible binding site of primaquine (pq) using a combined homology protein modeling, automated docking and experimental approach. the target models of wild-type and mutant-types of the voltage-dependent sodium channel in rat skeletal muscle (rna v . ) were based on previous work by tikhonov and zhorov. docking was carried out on the p-loop into the structure model of rna v . channel, in the open state configuration, to identify those amino acidic residues important for primaquine binding. the threedimensional models of the p-loop segment of wild types and mutant types (w . w c, w c and w a at the outer tryptophan-rich lip, as well as d c, e c, k c and a c of the deka motif) helped us to identify residues playing a key role in aminoquinoline binding. in good agreement with experimental results, a -fold inhibition loss was observed, tryptophan is crucial for the reversible blocking effects of pq. as a result, w c abolished the blocking effect of primaquine in voltage-clamp assays. hydrogen bond formation accelerates channel opening of the bacterial mechanosensitive channel mscl yasuyuki sawada and masahiro sokabe department of physiology, nagoya university graduate school of medicine, nagoya, japan the bacterial mechanosensitive channel mscl is constituted of homopentamer of a subunit with two transmembrane inner and outer (tm , tm ) a-helices, and its d structure of the closed state has been resolved. the major issue of mscl is to understand the gating mechanism driven by tension in the membrane. to address this question, we performed molecular dynamics (md) simulations for the opening of mscl embedded in the lipid bilayer. in the closed state of mscl, neighboring tm inner helices are crossed each other near the cytoplasmic side and leu and val in the constricted part form a stable hydrophobic environment called gate. upon membrane stretch, phe in tm outer helices was dragged by lipids, leading to an opening of mscl. thus phe was concluded to be the major tension sensor. during opening, tm inner helices were also dragged and tilted, accompanied by the outward sliding of the crossings. this led to a slight expansion of the gate associated with an exposure of oxygen atoms of the backbone to the inner surface of the gate. this allows water penetration in the gate and formation of hydrogen bonds between water and the exposed oxygen, which in turn weakened the hydrophobic interaction at the crossings, causing a further opening of the gate and water permeation. mitochondrial bk ca , mitobk ca has been proposed to be cardioprotective and formed by proteins of * to * kda. thus, we investigated the molecular characteristics of this channel in isolated mitochondria from murine heart. labeling of adult mouse cardiomyocytes with plasmalemma bk ca antibodies, mitotracker, and wheat germ agglutinin yielded remarkable mitochondrial but not plasma membrane localization. nanoscale fluorescence microscopy (stimulated emission depletion) revealed to of * - nm bk ca clusters per mitochondria. further, western blot analysis of purified mitochondria showed the presence of a full length * kda protein. systematic rt-pcr exon scanning of isolated cardiomyocyte mrnas were consistent with a full length * kda alpha-subunit protein and revealed the expression of three splice inserts. insertless-bk ca robustly localized to the plasma membrane of cho cells but when a c-terminal splice insert was present bk ca was readily targeted to the mitochondria (protein proximity index was * . indicating % colocalization). hence, cardiac mitobk ca is composed by full-length bk ca protein but with splice inserts which may facilitate its targeting to mitochondria. supported by nih and aha. patch-clamp technique was used to examine effect of trimethyl-lead and -tin on the sv channel activity in the red beet (beta vulgaris l.) taproot vacuoles. it was found that the addition of both investigated compounds to the bath solution inhibit, in a concentration-dependent manner, sv currents. when single channel properties were analyzed, only little channel activity can be recorded in the presence of lm of organometal. compounds investigated decreased significantly (by about one order of magnitude) the open probability of single channels. the recordings of single channel activity obtained in the presence and in the absence of organometal showed that compounds only slightly (by ca. %) decreased the unitary conductance of single channels. it was also found that organometal diminished significantly the number of sv channel openings, whereas it did not change the opening times of the channels. taken together, these results suggest that organometal binding site is located outside the channel's selectivity filter and that the inhibitory effect of both compounds investigated on sv channel activity probably results from organometal-induced disorder in compatibility between membrane lipids and membrane proteins. the research was financed by polish ministry of science and higher education by grant no. nn . electrophysiological investigation of the hvdac ion channel in pore-spanning membranes conrad weichbrodt, claudia steinem georg-august-universitä t gö ttingen, iobc, tammannstr. , d- gö ttingen, germany, e-mail: cweichb@gwdg.de the human voltage-dependent anion channel (hvdac ) plays an important role in cell life and apoptosis since it is the main porin of the outer mitochondrial membrane. as hvdac is believed to play a pivotal role in apoptosis-related diseases such as stroke, alzheimer, parkinson and cancer, the alterations of its electrophysiological properties under different conditions are of great value. to perform different investigations, refolded hvdac is reconstituted in artificial membranes which typically consist of dphpc with a cholesterol fraction of - %. they are prepared via the mü ller-rudin-technique on a functionalized porous alumina substrate containing pores with a diameter of nm. the quality of these so-called nano-black-lipid-membranes (nano-blms) is verified via electrochemical impedance spectroscopy (eis), hvdac is reconstituted and single channel recordings are made. membranes are also prepared by spreading proteoliposomes on hydrophobized porous silicon nitride with pores of lm diameter. information about altered gating-characteristics and related conductivities is gained by application of holding potentials up to ± mv and evaluation of the resulting currents. the hvdac was a kind gift of prof. c. griesinger, mpibpc, gö ttingen. pharmacological inhibition of cardiac herg k ? channels is associated with increased risk of arrhythmias. many drugs bind directly to the channel, thereby blocking ion conduction. ala-scanning mutagenesis identified residues important for drug block. two aromatic residues y and f were found to be crucial for block of most compounds. surprisingly, some cavity blocking drugs are only weakly affected by mutation y a. in this study we provide a structural interpretation for this observation. md simulations on the y a mutant suggest side chain rearrangements of f located one helical turn below y . loss of p-p stacking induces reorientation of f from a cavity facing to a cavity lining conformation, thereby substantially altering the shape of the binding site. docking studies reveal that due to their rigid shape and compactness y insensitive drugs can still favorably interact with the reoriented f aryl groups, while molecules with more extended geometries cannot. the ankyrin transient receptor potential channel trpa is a transmembrane protein that plays a key role in the transduction of noxious chemical and thermal stimuli in nociceptors. in addition to chemical activation, trpa can be activated by highly depolarizing voltages but the molecular basis of this regulation is unclear. the transmembrane part of the tetrameric trpa is structurally related to the voltagegated k ? channels in which the conserved charged residues within the fourth transmembrane region (s ) constitute part of a voltage sensor. compared to these channels, the voltage-dependence of trpa is very weak (apparent number of gating charges * . versus in k ? channels) and its putative voltage-sensing domain most likely lies outside the s because trpa completely lacks positively charged residues in this region. in the present study we used homology modelling and molecular dynamics to create models of the transmembrane part and the proximal cytoplasmic c terminus of trpa . in combination with electrophysiological data obtained from whole cell patchclamp measurements we were able to point out several positively charged residues which mutation strongly alter the voltage sensitivity of trpa channel. these may be candidates for as yet unrecognized voltage sensor. photosynthesis p- action of double stress on photosystem aliyeva samira a., gasanov ralphreed a. institute of botany, azerbaijan national academy of sciences, baku, azerbaijan the simultaneous effect of photoinhibitory illumination and toxic action of heavy metals ions (cd ? and co ? ) on activity of ps in vitro measuring by millisecond delayed fluorescence (ms-df) of chlorophyll a was studied. during action on chloroplasts only of cd ? ( - m) the fast component of ms-df, which originates via radiative recombination of reaction center with the camn -cluster or y z on donor side of ps , is inhibited stronger than at action of only co ? . the steadystate level at cd ? treatment is remain stable, while at co ? action it is increased. simultaneous action of cd ? and photoinhibitory illumination ( lmol photons m - s - ) have shown that fast component of ms-df was inhibited faster with time than in case of action of co ? and excess light. result indicates that damage sites of action cd ? and co ? are donor and acceptor side of ps , accordingly. we assume that binding site of cd ? is y z or camn -cluster, one of the recombination partners with p ? on the donor side of ps . thereby, action of cd ions on donor side of ps leads mainly to development of mechanism of donor-side photoinhibition. field instrument for determination of the photosynthetic capacity of intact photosynthetic bacteria e. asztalos , z. gingl and p. maró ti department of medical physics and informatics, university of szeged, hungary, department of experimental physics, university of szeged, hungary a combined pump and probe fluorometer and spectrophotometer with high power laser diodes has been constructed to measure fast induction and relaxation of the bacteriochlorophyll fluorescence yield and light-induced absorption changes in intact cells of photosynthetic bacteria. the construction is the upgraded version of our previous set up with better time resolution ( ls). the compact design of the mechanics, optics, electronics and data processing makes the device easy to use as outdoor instrument or to integrate into larger measuring systems. the versatility and excellent performance of the apparatus will be demonstrated on different fields: ) organization and redox state of the photosynthetic apparatus of the whole cells under different growth conditions deduced from fluorescence characteristics including the lag phase, the amplitude and the rise time of the variable fluorescence, ) electron transfer in the reaction center, cytochrome bc complex and in between obtained from relaxation of the fluorescence and ) re-reduction kinetics of the oxidized primary donor of the reaction center and energetization and relaxation of the intracytoplasmic membrane tracked by absorption changes at and nm, respectively. previous work has established that the iron stress induced protein a (isia) synthesized by cyanobacteria under stress conditions, has at least two functions: light harvesting [ ] and photoprotection [ ] . under prolonged iron starvation isia becomes the main chlorophyll-binding protein in the cell and occurs without a photosystem association. these isia aggregates have a strong ability to dissipate light energy and there is evidence of carotenoid participation in the quenching mechanism via downhill energy transfer from chlorophyll to the s state of a carotenoid [ ] . in the present work we have measured the temperature dependence of the fluorescence of carotenoid depleted mutants (echinenone and/or zeaxanthin) and isia monomers in order to investigate the role of carotenoid and aggregation in the quenching process. pigment analysis confirms the absence of the carotenoid mutated in its biosynthesis but shows that it is mainly replaced. the monomers are lacking two carotenoids, echinenone and one of the two ß-carotenes found previously in isia aggregates. temperature dependent fluorescence shows that quenching properties are affected in the monomers and the mutants lacking zeaxanthin. soon exhausted oil resources and global climate change have stimulated research aiming at production of alternative fuels, ideally driven by solar energy. production of solar fuels needs to involve the splitting of water into protons, energized electrons and dioxygen. in photosynthetic organisms, solar-energy conversion and catalysis of water splitting (or water oxidation) proceed in an impressive cofactor-protein complex denoted as photosystem ii (psii). the heart of biological water-oxidation is a protein-bound manganese-calcium complex working at technically unmatched efficiency. in an attempt to learn from nature, the natural paragon is intensely studied using advanced biophysical methods. structural studies by x-ray spectroscopy with synchrotron radiation play a prominent role in this endeavor. time-resolved methods provide insights in the formation of intermediate states of the reaction cycle. an overview is presented focusing on (i) the efficiency of solar energy usage in psii, (ii) the interrelation between electron transfer and proton relocations, and (iii) the mechanism of water oxidation. as an outlook, new results on water oxidation by biomimetic manganese and cobalt oxides, which may become a key element in future solar-fuel systems, are presented. the peridinin-chlorophyll-protein (pcp) is a light-harvesting complex (lhc) that works as antenna in the photosynthetic process of dinoflagellates. the protein contains both chlorophylls and carotenoids molecules, the latter being responsible to extend the spectral range of captured light to regions where chlorophylls are transparent. pcp crystal structures [ ] reveal that each chlorophyll is surrounded by or molecules of the carotenoid peridinin, located in non-equivalent positions. the different protein environment of the sites might be responsible of a spectral shift of the pigments with the functional role to extend the absorption spectra of the complex and enhance its light harvesting capabilities. high resolution x-ray diffraction data on a reconstructed pcp, the refolded peridinin-chlorophyll a-protein (rfpcp) [ ] , and on the less common high salt-pcp (hspcp) opened the way to the mechanistic understanding of peridinin spectral tuning, peridinin chlorophyll energy transfer and photoprotective mechanism [ ] . the two pcp forms differ in various features: spectral properties, molar mass, amino acid sequence and, above all, pigment stoichiometry, the peridinin:chlorophyll ratio being : for the rfpcp and : for the hspcp [ ] . in the present work we perform classical molecular dynamics simulations of the rfpcp and the hspcp in explicit water solution. we analyse the structure and dynamics of the proteins and of their pigments to characterize the different peridinin sites in both pcp forms in terms of quantities that can affect the chromophore spectra, such as distorsion, fluctuations and nature of the protein environment. the comparison between the data suggests correspondences between the pigments of the two forms. quantum and mixed quantum/ classical molecular dynamics simulations are also under progress to investigate the effect of the protein environment on the electronic and optical properties of the pcp pigments. peculiar applications, like in optoelectronics, biosensors, photovoltaics. among the existing carrier matrices conductive metal oxides (e.g. indium tin oxide, ito), carbon nanotubes, graphenes, silicon (si) are the most frequently used materials because of their unique characteristics such as good conductivity, good optical properties and excellent adhesion to substrates. in our work we combined purified photosynthetic reaction center protein (rc) and porous silicon (psi) investigating the morphology and optoelectronic properties of the bio-nanocomposite material. ftir spectroscopy, scanning electron microscopy and energydispersive x-ray spectroscopy indicated the binding of the protein to the psi. specular reflectance spectra showed a red shift in the characteristic interference peak of the psi microcavity which was saturated at higher concentration of the protein. the binding was more effective if the functionalization was done by the si-specific oligopeptide compared to the classical covalent binding via aminopropyl-triethoxysilane (aptes). excitation by single saturation flashes indicated that the rc still exhibited photochemical turnover after the binding. the role of reactive oxygen species (ros) in plant stress, both as damaging agent and as potential signal molecule is often assessed in experiments using photosensitized elicitor dyes. for these studies, it is essential to know how efficiently these chemicals generate ros, whether they are specific ros sources, as well as their cellular localization and additional side effects. the present study addresses these issues using a variety of dyes known and traditionally applied as singlet oxygen ( o ) sources. rose bengal (rb), methylene violet (mvi), methylene blue (mb), neutral red (nr) and indigo carmine (ic) were studied as putative ros sources in tobacco leaves. ros products of photosensitized dyes were measured in vitro, using spin trapping epr spectroscopy. dye concentrations and irradiation concentration leading to equal absorbed excitation quanta were determined spectrophotometrically. in vivo studies were carried out using tobacco leaves infiltrated with water solutions of the putative o sources. cellular localizations were identified on the basis of the dyes' fluorescence. rb, nr and mvi reached into mesophyll cells and were used to study the effects of these dyes on photosynthesis. photochemical yields and quenching processes were compared before and after photosensitization of the elicitor dyes inside the leaf samples. chlorophyll-chlorophyll charge transfer quenching is the main mechanism of non-photochemical quenching in higher plants alfred r. holzwarth max-planck-institut fü r bioanorganischechemie, stiftstraße - , d- mü lheim a.d.ruhr, germany non-photochemical quenching (npq) in plants protects against photochemical destruction of the photosynthetic apparatus under excess light conditions.while one location of the npq process has been shown to be centered on the major light harvesting complex ii (lhcii) (q type or qequenching), an additional quenching center responsible for qi type (identical to q center) quenchinghas been suggested to be located on the minor light-harvesting complexes upon accumulation ofzeaxanthin (zx), in particularon cp and cp we have performed femtosecond transient absorption and time-resolved fluorescence measurements of npq quenching in intact leaves of higher plants, on isolated light harvesting complexes in the minor (non-aggregated)light harvesting complex cp reconstituted with violaxanthin (vx) or zx, and in the isolated major lhc ii complex in the aggregated state. in all of these situations we find the formation of chl-chl charge transfer (ct) states to be the dominant quenching mechanism. the yield of formation of carotenoid cation states and/or carotenoid s state is either extremely low or absent, thus excluding their involvement in npq quenching as a major quenching mechanism. single-molecule spectroscopy (sms) is a powerful technique that allows investigation of fluorescence properties from single fluorescing systems. this technique enabled us to investigate the dynamics of the fluorescence intensity and spectral profiles of single, isolated light harvesting complex (lhc) on timescales of milliseconds to seconds, during continuous laser illumination. we were able to observe how each complex can rapidly switch between different emission states [ , ] and to characterise the intensity and the spectral dynamics of major and minor antenna complexes from plants, in two different environments, mimicking the light harvesting and the light dissipating state, respectively. the results will be discussed with respect to the current models for nonphotochemical quenching (npq) mechanisms [ , , ] , a vital photoprotection mechanism during which the lhcs of plants switch between a state of very efficient light utilisation and one in which excess absorbed excitation energy is harmlessly dissipated as heat. phaeodactylum tricornutum is one of the most utilized model organisms in diatom photosynthesis research mainly due to availability of its genome sequence ( ). it's photosynthetic antennae are the fucoxanthin chlorophyll a/c binding proteins (fcps) which share a high degree of homology with lhcs of higher plants and green algae ( ) . for detailed investigation of the antenna system of p.tricornutum, a transgenic strain expressing recombinant his-tagged fcpa protein was created which simplified the purification of a specific stable trimeric fcp complex consisting of fcpa and fcpe proteins. excitation energy coupling between fucoxanthin and chlorophyll a was intact and the existence of a chlorophyll a/fucoxanthin excitonic dimer was demonstrated ( ). we investigated in detail the existence of specific antenna system for psi and psii in p.tricornutum as in case of higher plants. our studies indicated that at least the main light harvesting proteins fcpa and fcpe are most probably shared as a common antenna by both psi and psii. harvesting complex ii (lhciib) from spinach or in native thylakoid membranes by picosecond time-resolved fluorescence. the domain size was estimated by monitoring the efficiency of added exogeneous singlet excitation quenchers -phenyl-p-benzoquinone (ppq) and dinitrobenzene (dnb). the fluorescence decay kinetics of the systems under study were registered without quenchers and with quenchers added in a range of concentrations. stern-volmer constants, k sv and k sv -for aggregates (membranes) and detergentsolubilized complexes, respectively, were determined from the concentration dependence of the ratio of the mean fluorescence lifetimes without/with quencher (s , s). the ratio k k sv • s /s was suggested as a measure of the functional domain size. values in the range of - were found for lhcii macroaggregates and - for native thylakoid membranes, corresponding to domain sizes of - chlorophylls. although substantial, the determined functional domain size is still orders of magnitude smaller than the number of physically connected pigment-protein complexes; thus our results imply that the physical size of an antenna system beyond these numbers has little or no effect on improving the light-harvesting efficiency. the interaction between photosynthetic reaction center proteins (rcs) purified from purple bacterium rhodobacter sphaeroides r- and functionalized and non-functionalized (single (swnt) and multiple (mwnt) walled) carbon nanotubes (cnt-s) has been investigated. both structural (afm, tem and sem microscopy) and functional (flash photolysis and conductivity) techniques showed that rcs can be bound effectively to different cnts. both physical sorption and binding through -nh or -cooh groups gave similar results. however, it appeared that by physical sorption some sections of the cnts were covered by multiple layers of rcs. after the binding the rcs kept their photochemical activity for a long time (at least for three months, even in dried form) and there is a redox interaction between the cnt and rcs. the attachment of rc to cnts results in an accumulation of positive and negative charges followed by slow reorganization of the protein structure after excitation. in the absence of cnt the secondary quinone activity decays quickly as a function of time after drying the rc onto a glass surface. the special electronic properties of the swnt/protein complexes open the possibility for several applications, e.g. in microelectronics, analytics or energy conversion and storage. the decay of the high fluorescence state generated by actinic illumination of different durations was measured in whole cells of various strains and mutants of photosynthetic purple bacteria. although similar method is used in higher plants, its application in photosynthetic bacteria is novel and highly challanging. the available data are restricted and only the re-oxidation of the reduced primary quinone (q a -? q a ) is usually blamed for the decay kinetics. here, we will analyse the complexity of the kinetics over a very broad time range (from ls to s) and show that the dark relaxation of the bacteriochlorophyll fluorescence reflects the overlap of several processes attributed to the intra-and interproteinous electron transfer processes of the reaction center (rc) and cytochrome bc complex of the bacterium. in the shorter (\ ms) time scale, the dominating effect is the re-reduction of the oxidized primary donor (p ? ? p) that is followed by the re-oxidation of the acceptor complex of the rc by the cytochrome bc complex. as the life times and amplitudes of the components depend on the physiological state of the photosynthetic apparatus, the relaxation of the fluorescence can be used to monitor the photosynthetic capacity of the photosynthetic bacteria in vivo. circular dichroism (cd) spectroscopy is an indispensable tool to probe molecular architecture. at the molecular level chirality results in intrinsic cd, pigment-pigment interactions in protein complexes give rise to excitonic cd, whereas ''psitype'' cd originates from large densely packed chiral aggregates. it has been well established that the anisotropic cd (acd), measured on samples with defined orientation, carries specific information on the architecture of molecules. however, acd can easily be distorted by linear dichroism of the sample or instrumental imperfections -which might be the reason why it is rarely studied in photosynthesis research. here we present acd spectra of isolated intact and washed, unstacked thylakoid membranes, photosystem ii membranes (bby), and tightly-stacked lamellar macroaggregates of the main light-harvesting complex ii (lhcii). we show that the acd spectra of face-and edge-aligned stacked thylakoid membranes and lhcii lamellae exhibit profound differences in their psi-type cd bands. marked differences are also seen in the excitonic cd of bby and washed thylakoid membranes. thus acd provides an additional dimension to the light induced conformation changes of quinone depleted photosynthetic reaction centers (rcs) purified from the carotenoid-less rhodobacter sphaeroides r- were investigated by transient absorption (ta) and grating (tg) methods. surprisingly, the decay of the ta signal measured at nm was divided into a ns and a ls components. the latter coincides with the life time of the tg signal, which was assigned earlier [nagy et al. ( ) febs lett. , - ] to spectrally silent conformational changes. the nature of the ls phase was investigated further. although, the probability of chlorophyll triplet formation under our measuring conditions was small, possible contribution of the triplet states was also studied. the presence of carotenoid in the wild type rcs eliminated the ls component indicating the role of carotenoid in the energy transfer within the rcs. there was no significant effect of the molecular oxygen on the ta. this fact may be explained if the chlorophyll triplets inside the protein have reduced accessibility to molecular oxygen. a differential effect of osmotic potential and viscosity on conformation changes accompanying the primary charge separation was measured by the effect of ficoll, glucose and glycerol as compared the ta to the tg signals. variable chlorophyll fluorescence: in part a yield change due to light-induced conformational change gert schansker , szilvia z. tó th , lá szló ková cs , alfred r. holzwarth and gy} oz} o garab institute of plant biology, biological research center, hungarian academy of sciences, szeged, hungary, max-planck-institut fü r bioanorganische chemie, mü lheim an der ruhr, germany on a dark-to-light transition the chlorophyll fluorescence rises from a minimum intensity (f ) to a maximum intensity (f m ). conventionally, this rise is interpreted to arise from the reduction of the primary quinone acceptor, q a , of photosystem ii -although this cannot explain all presently available observations. in untreated leaves, at room temperature, the fluorescence rise follows the reduction of the electron transport chain (etc). once induced, * - % of the variable fluorescence intensity relaxes within ms in darkness and can be re-induced within ms as long as the etc remains reduced. analyzing the fluorescence relaxation kinetics, ?/-dcmu, * % of the amplitude cannot be explained by q a -re-oxidation. special properties of this phase determined on dcmu-inhibited samples: at cryogenic temperatures (below - °c), where the q a -/s recombination is blocked, it still relaxes and it exhibits a strong temperature dependence with an apparent e a & kj/mol, whereas the reduction of q a is nearly temperature insensitive. a fluorescence yield change, driven by light-induced conformational change in the reaction center complex, can explain all these observations. tuning function in bacterial light-harvesting complexes katia duquesne , edward o'reilly , cecile blanchard , alexandra olaya-castro and james n. sturgis lism, cnrs and aix-marseille university, marseille, france, department of physics, university college, london, uk purple photosynthetic bacteria are able to synthesize a variety of different light-harvesting complexes, sometimes referred to as lh , lh and lh . here we have investigated the structural origins of these different forms and the manner in which the sequence tunes the absorption spectrum of the light-harvesting system. we then consider the functional consequences of this tuning for the organization of the light harvesting system and on the ecology of the organisms. specifically by spectroscopic techniques, in particular circular dichroism and resonance raman spectroscopy, we have been able to obtain information on the organization of the bacteriochlorophyll binding sites in the unusual lh of roseobacter denitrificans. this provides a picture of how different peripheral light-harvesting complexes are able to modulate the absorption spectrum. the structure and organization of this complex is the put in the context of the the recently published variability of the light-harvesting complexes. in particular the observation of their ability to form mixed complexes containing different polypeptides. we examine quantitatively the possible reasons for maintaining such variability by considering the transport properties of membranes containing either pure or mixed complexes and show that mixed complexes can permit light-harvesting to continue during adaptation. we then consider the different constraints that may be behind this type of adaptation in different bacteria and the conditions under which different types of antenna system might be optimal finally we integrate this into the evolutionary context of adaptation to variable light intensity and the ecological niches where such organisms are found. interaction between photosynthetic reaction centers and ito t. szabo , g. bencsik , g. kozak , cs. visy , z. gingl , k. hernadi , k. nagy , gy. varo and l. nagy departments of medical physics and informatics, physical chemistry and materials science, technical informatics and of, applied and environmental chemistry, university of szeged, hungary, institute of biophysics, has biological research center, szeged, hungary photosynthetic reaction center proteins (rc) purified from rhodobacter sphaeroides purple bacterium were deposited on the surface of indium-tin-oxide (ito), a transparent conductive oxide, and the photochemical/-physical properties of the composite was investigated. the kinetics of the light induced absorption change indicated that the rc was still active in the composite and there was an interaction between the protein cofactors and the ito. the electrochromic response of the bacteriopheophytine absorption at nm showed an increased electric field perturbation around this chromophore on the surface of ito compared to the one measured in solution. this absorption change is associated with the chargecompensating relaxation events inside the protein. similar life time, but smaller magnitude of this absorption change was measured on the surface of borosilicate glass. the light induced change in the conductivity of the composite as a function of the concentration showed the typical sigmoid saturation characteristics unlike if the chlorophyll was layered on the ito which compound is photochemically inactive. in this later case the light induced change in the conductivity was oppositely proportional to the chlorophyll concentration due to the thermal dissipation of the excitation energy. the supramolecular organization of photosystem ii in vivo studied by circular dichroism spectroscopy tü nde tó th , , herbert van amerongen , , gy} oz} o garab , lá szló ková cs institute of plant biology, biological research center, hungarian academy of sciences, hungary, wageningen university, laboratory of biophysics, wageningen, the netherlands, microspectroscopy centre, wageningen university, wageningen, the netherlands the light reactions of photosynthesis in higher plants take place in granal chloroplast thylakoid membranes, which contain chirally organized macrodomains composed of photosystem ii (psii) supercomplexes associated with light harvesting antenna complexes (lhciis). the physiological relevance of this hierarchic organization, which often manifest itself in semicrystalline assemblies, has not been elucidated but the diversity of the supramolecular structures and their reorganizations under different conditions indicates its regulatory role. the present work focuses on the structural and functional roles of different components of lhcii-psii supercomplexes. we used various growth conditions, influencing the protein composition, and different arabidopsis mutants (kocp , kocp , kopsbw, kopsbx, dgd ), with altered organization of the membranes, and measured their circular dichroism (cd) spectra as well as their chlorophyll fluorescence kinetics to characterize the chiral macro-organization of the chromophores and the functional parameters of the membranes, respectively. we show that the formation of chiral macrodomains require the presence of supercomplexes. our data also reveal specific functions of some of the protein or lipid compounds in the light adaptation processes of plants. excitation energy transfer and non-photochemical quenching in photosynthesis rienk van grondelle department of physics, vu university, de boelelaan , hv, amsterdam, the netherlands the success of photosynthesis relies on two ultrafast processes: excitation energy transfer in the light-harvesting antenna followed by charge separation in the reaction center. lhcii, the peripheral light-harvesting complex of photosystem ii, plays a major role. at the same time, the same light-harvesting system can be 'switched' into a quenching state, which effectively protects the reaction center of photosystem ii from over-excitation and photodamage. in this talk i will demonstrate how lhcii collects and transfers excitation energy. using single molecule spectroscopy we have discovered how lhcii can switch between this light-harvesting state, a quenched state and a red-shifted state. we show that the switching properties between the light-harvesting state and the quenched state depend strongly on the environmental conditions, where the quenched state is favoured under 'npq-like' conditions. it is argued that this is the mechanism of non-photochemical quenching in plants. photobiology in the soil: arrested chlorophyll biosynthesis in pea epicotyl sections beá ta vitá nyi, katalin solymosi, annamá ria kó sa, bé la bö ddi eö tvö s university, institute of biology, department of plant anatomy, pá zmá ny p. s. /c, h- budapest, hungary the key regulatory step of chlorophyll (chl) biosynthesis is the nadph:protochlorophyllide oxidoreductase (por) catalyzed reduction of protochlorophyllide (pchlide)which is light activated in angiosperms. this process is usually described on artificially dark-grown plants. in this work, we studied epicotyl segments developed under the soil surface, which were dissected from pea plants grown under natural light conditions. using k fluorescence spectroscopy, pigment analyses, electron microscopy and fluorescence microscopy, we found that upper segments showed transitional developmental stages, i.e. chl appeared besides pchl(ide) and etio-chloroplasts were typical. in regions under cm depth, however, the characteristics of the segments were similar to those of plants germinated artificially in complete darkness, i.e. only pchl(ide) and etioplasts were present. the results of this work prove that these latter symptoms may occur in shaded tissues of fully developed, photosynthetically active plants grown under natural conditions. in this overview talk it will be shown how atomistic computations can complement experimental measurements in our quest to understand biological electron and proton transfer reactions. at first the molecular simulation methods for calculation of important electron transfer parameter such as reorganization free energy, electronic coupling matrix elements and reduction potentials will be explained. then three applications will be discussed where such computations help interpret experimental data on a molecular level. the first example concerns electron tunneling between heme a and heme a in the proton pump cytochrome c oxidase. this reaction is very fast, occurring on the nano-second time scale, and it is unclear if this is due to an unusually low reorganization free energy or due to high electronic coupling. carrying out large-scale all-atom molecular dynamics simulation of oxidase embedded in a membrane, we do not find evidence for unusually small values of reorganization energy as proposed previously, implying that the nanosecond tunneling rate between heme a and a is supported by very efficient electronic coupling. the second example is on electron transport in a deca-heme 'wire'-like protein, used by certain anaerobic bacteria to transport electrons from the inside of the cell to extracellular substrates. the crystal structure of such a protein has been solved recently for the first time. however, it is unclear if and in which direction the wire structure supports electron transport. here we present results of heme reduction potential calculations that help us reveal the possible electron flow in this protein. in a third example we explain how quantum mechanical/molecular mechanical methods (qm/mm) recently helped us understand why the catalase from h. pylori is prone to undergo an undesired protein radical migration reaction during catalysis. proton pumping activity of purple and brown membranes regenerated with retinal analogues k. bryl , k.yoshihara university of warmia and mazury, department of physics and biophysics, olsztyn, poland, suntory institute for bioorganic research, wakayamadai, osaka , japan the retinal protein bacteriorhodopsin (br) acts as a light-driven proton pump in the purple membrane (pm) of halobacterium salinarium (h.s.). the aim of these studies was to clarify whether the specific crystalline structure of protein and protein-substrate interactions are significant for h ? transfer into the aqueous bulk phase. two membrane systems were prepared: purple membranes (br arranged in a two-dimensional hexagonal lattice) and brown membranes (br not arranged in a crystalline lattice) were regenerated with -fuororetinals. light-induced proton release and reuptake as well as surface potential changes inherent in the regenerated systems reaction cycles were measured. signals of optical ph indicators residing in the aqueous bulk phase were compared with signals of ph indicator covalently linked to the extracellular surface of proteins and with surface potential changes detected by the potentiometric probe. the energies of activation of proton transfer have been calculated. experimental results and thermodynamic parameters (energies of activation) suggest the different mechanism of proton transfer into the aqueous bulk phase in these two systems. the implications for models of localized-delocalized energy coupling by proton gradients will be discussed. iron regulation is a vital process in organisms and in most of them it is accomplished through the metal solubilisation and storage by ferroxidase enzymes of the ferritin family, which have the ability of sequester, oxidize and mineralize ferrous ions using oxygen or hydrogen peroxide as substrate. dnabinding proteins from starved cells (dps) belong to this ferritin's family. dps belongs to the sub-type designated as miniferritins and, besides iron storage and release capability, is responsible for hydrogen peroxide resistance showing the ability to form stable complexes with dna. the preferable cosubstrate of this enzyme is h o although the reaction can occur in the presence of oxygen with lower rate [ , ] . in this work, the electrochemical behaviour of the recombinant dps from pseudomonas nautica, was assessed as a function of metal content in anaerobic environment with h o as co-substrate. the obtained electrochemical results together with spectroscopic studies allowed inferring some new hypothesis on the dps iron uptake mechanism. for myoglobin electrostatically immobilized on au-deposited mixed self-assembled monolayers (sams) of the composition: -s-(ch ) -cooh/-s-(ch ) -oh. our approach allows for a soft switching of the haem group charge state and accurate probing of the accompanying reorganizational dynamics of conformational (quasi-diffusional) and quantum (e.g. protonrelated) modes. the electron transfer rate constants were determined with h o or d o as solvent, under the variable temperature ( - k) or pressure ( - mpa) conditions, revealing the overall reorganization free energy of . ± . ev, activation volume of - . ± . cm mol - and inverse solvent kinetic isotope effect of . ± . ( °c). on the grounds of an extended charge-transfer theory, we propose specific protoncoupled et mechanism additionally coupled to the slow conformational dynamics of the protein matrix accompanied by translocation(s) of haem-adjacent water molecule(s). proton gradients across pore spanning membranes: towards on-chip energy conversion daniel frese, claudia steinem institute for organic and biomolecular chemistry, georg-august-university goettingen, germany in cell organelles, chemiosmotic potentials resulting from proton gradients across membranes are widely used to fix chemical energy in forms of atp. the high efficiency of this protein-mediated energy conversion raises interest for artificial proton gradient setups. to investigate proton transport across artificial membranes, we prepared pore spanning membranes (psms) on porous silicon substrates via painting technique. this allowed us to trap aqueous content of well-defined composition and volume inside the substrates microcavities. nigericin, a peptide that acts as an h ? -k ? -antiporter and bacteriorhodopsin, a transmembrane protein which is well known to be a light driven proton pump, were reconstituted into the pre-formed psms to achieve proton transport from one aqueous compartment to the other. changes of proton concentrations inside the pores were monitored by means of confocal laser scanning microscopy (clsm). therefore, pores were filled with pyranine, a ph-sensitive fluorescence dye and variations in intensity were measured to analyze proton translocation. we were able to show that both, nigericin and bacteriorhodopsin are capable of building up a proton gradient across psms and plan to co-reconstitute atp synthases for on-chip energy conversion by formation of atp. application of the gibbs free energy profiles to sequential biochemical reactions pé ter farkas, tamá s kiss and eugene hamori department of biological physics, eö tvö s ló rá nd tudomá ny egyetem, budapest, hungary, and department of biochemistry, tulane university, new orleans, la., usa the full understanding of the energetic details of complex metabolic reaction sequences requires a step-by-step analysis of the gibbs free energy (g) changes of the ''parasystem'' (i.e., a collection of atoms comprising all the molecules participating in a given reaction) as it gradually changes from its initialreactants state to its final-products state along the reaction pathway. knowing the respective equilibrium constants of each of the participating reaction steps and also the actual in vivo concentrations of the metabolites involved, a free-energy profile can be constructed that will reveal important information about the progress of the reaction as driven by thermodynamic forces. this approach will be illustrated on some biochemical reactions including the glycolytic/gluconeogenetic pathways. furthermore, the often misleading text-book representation of enzymatic catalysis will be reexamined and explained in thermodynamic terms using the free-energy profiles of both the non-catalyzed and the enzyme-catalyzed reactions. redox-active proteins can be diversely functionalized at metaldeposited self-assembled monolayers (sams) of a widely variable composition and thickness. the voltammetric methodology in combination with the advanced data processing procedures allow for comprehensive kinetic data within the congruent series of nano-devices and the subsequent calculation of the key physical parameters, such as the medium reorganization energy of et, the donor-acceptor electronic coupling, effective relaxation time (related to the protein's and environment's fluctuational dynamics), etc. in our studies the ''model'' redox protein, cytochrome c (cytc), was either freely diffusing to sam terminal groups (mode ), or attached to sams through the electrostatic interaction (mode ), or specific ''wiring'' (mode ). another redox-active protein, azurin, was confined at terminal sam groups through the hydrophobic interaction (mode ). diverse experimental strategies including the variation of sam thickness, solution viscosity temperature and hydrostatic pressure, allowed for a severe demonstration of the full adiabatic and nonadiabatic control (thinner and thicker sams, respectively), and the intermediary regime, in a nice agreement with the major theoretical predictions. proton transfers in a light-driven proton pump j.k. lanyi dept. physiology & biophysics, university of california, irvine, usa illumination of bacteriorhodopsin causes isomerization of alltrans retinal to -cis, -anti and a cyclic reaction ensues, in which the protein and the chromophore undergo conformational changes with an overall ten millisecond turn-over time and a proton is transported from one membrane side to the other. with crystal structures of six trapped intermediate states and plausible structural models for the remaining two intermediates, structures are now available for the initial bacteriorhodopsin state and all intermediates. they reveal the molecular events that underlie the light-induced transport: protonation of the retinal schiff base by asp , proton release to the extracellular membrane surface, a switch event that allows reprotonation of the schiff base from the cytoplasmic side, side-chain and main-chain motions initiated in the cytoplasmic region, formation of a single-file chain of hydrogen-bonded water molecules that conducts the proton of asp to the schiff base, and reprotonation of asp from the cytoplasmic surface. the observed changes can be summarized as a detailed atomic-level movie in which gradual relaxation of the distorted retinal causes a cascade of displacements of water and protein atoms results in vectorial proton transfers to and from the schiff base. electron transfer (et) processes are fundamental in photosynthesis, respiration and enzyme catalysis. the relative importance of superexchange and sequential mechanisms in biological et is still a matter of debate. the identification of any ''stepping stones'' necessary for electron hopping is a key point in the understanding of long range et. hence, the study of a single event in the sequence of reactions occurring in these phenomena is a fundamental but formidable task. muon spin relaxation (lsr) has been shown to be sensitive to charge transport on a molecular lengthscale. the muon is a very sensitive probe of electron transport, as any changes to the electronic density sampled by the muon can change its spin polarization, which can easily be measured. in this context, a very useful tool is the detection of the so-called avoided level crossing (alc) resonances [ ] . the enhancement in the loss of polarization of the muon's spin on these resonances dramatically increases sensitivity. we show that a laser pump -lsr probe technique can measure et processes at particular, and most importantly known, sites within the amino acids chain, and therefore track the time evolution of the electron over the molecule. keywords: photosynthesis, reaction center, electron transfer, proton transfer, fourier transform infrared, l dn, isotopic labeling, band assignment, histidine, mechanism in photosynthesis, the central step in transforming light energy into chemical energy is the coupling of light-induced electron transfer to proton uptake. in the photosynthetic reaction center (rc) of rhodobacter sphaeroides, fast formation of the charge separated state p ? q a is followed by a slower electron transfer from the primary quinone q a to the secondary quinone q b and the uptake of a proton from the cytoplasm to q b . previous fourier transform infrared (ftir) measurements on rc suggested an intermediate x in the q a -q b to q a q b transition. mutation of the amino acid aspl to asn (l dn mutant) slows down proton uptake and oxidation of q a -. using time-resolved ftir spectroscopy we characterized this rc mutant and proposed specific ir bands that belong to the intermediate x. to study the role of the iron-histidine complex located between q a and q b , we performed fast-scan ftir experiments on the l dn mutant marked with isotopically labelled histidine. we assigned ir bands of the intermediate x between cm - and cm - to histidine vibrations. these bands show the protonation of a histidine, most likely hisl , during the q a -q b to q a q b transition. based on these results we propose a new mechanism of the coupling of electron and proton transfer in photosynthesis. complex i of respiratory chains is an energy transducing enzyme present in most bacteria and in all mitochondria. it is the least understood complex of the aerobic respiratory chain, even though the crystallographic a-helical structures of bacterial and mitochondrial complexes have been recently determined [ ] [ ] . this complex catalyses the oxidation of nadh and the reduction of quinone, coupled to cation translocation across the membrane. rhodothermus marinus complex i, our main model system, is a nadh:menaquinone oxidoreductase and has been extensively characterized. we have made an exhaustive study in order to identify all the subunits present in the complex [ ] . the nature of the coupling charge of r. marinus complex i was investigated using inside-out membrane vesicles, which were active with respect to nadh oxidation and capable of creating and maintain an nadh-driven membrane potential (dw) positive inside. it was observed that this bacterial complex i is able of h ? and na ? transport, although to opposite directions. the coupling ion of the system was shown to be the h ? being transported to the periplasm, contributing in this way to the establishment of the electrochemical potential difference, while na ? is translocated to the cytoplasm [ ] . the sodium ion extrusion from the membrane vesicles was due to the activity of complex i, since it was sensitive to its inhibitor rotenone, and it was still observed when the complex i segment of the respiratory chain was isolated by the simultaneous presence of cyanide and external quinones. additional studies have shown that although neither the catalytic reaction nor the establishment of the dph requires the presence of na ? , the presence of this ion increased the proton transport. combining all these results, a model for the coupling mechanism of complex i was proposed, suggesting the presence of two different energy coupling sites, one that works only as a proton pump (na ? independent), and the other functioning as a na ? /h ? antiporter (na ? dependent) [ ] . this model was reinforced by further studies performed in the presence of the na ? /h ? antiporter inhibitor, -(n-ethyl-n-isopropyl)-amiloride (eipa) [ ] . a deeper inside into the coupling mechanism of this enzyme was provided by studying the influence of sodium ions on energy transduction by complexes i from escherichia coli and paracoccus denitrificans. it was observed that the na ? / h ? antiporter activity is not exclusive of r. marinus complex i, since the e. coli enzyme is also capable of such a transport, but is not a general property given that the p. denitrificans enzyme does not performed sodium translocation [ ] . due to the fact that r. marinus and e. coli enzymes reduce menaquinone while p. denitrificans complex i reduce ubiquinone, it is suggested that the na ? /h ? antiporter activity may be correlated with the type of quinone used as substrate. under anaerobic conditions some bacteria can use nitrate instead of oxygen in a process called denitrification. during denitrification, the reduction of no to n o is catalyzed by a membrane-bound enzyme nitric oxide reductase (nor). this enzyme is an important step in the evolution of a respiratory system: nor belongs to the superfamily of o -reducing heme-copper oxidases and is assumed to be the evolutionary ancestor of cytochrome c oxidase. the understanding of nor functioning was limited by the lack of structural information, but recently the first structures (cnor type from ps. aerug. and qnor type from g. stearoth.) were solved [ ] [ ] . we will present results of the first computational studies of nor (both cnor and qnor types) [ ] [ ] . the studies include: (i) large-scale all-atom md simulations of proteins in their natural environment (i.e. embedded in membrane and solvent), which were performed to describe water dynamics inside the protein and to identify potential proton transfer pathways, and (ii) free-energy calculations by the empirical valence bond (evb) method [ ] for the explicit proton translocations along the pathways established by md. among important findings are new proton pathways, which were not predicted from the x-ray structure and could be identified only by means of computer simulations. simulations also reveal that, despite a high structural similarity between cnor and qnor, these enzymes utilize strikingly different proton uptake mechanisms. our results provide insights into the functional conversion between no and o reductases, and into evolution proton transfer mechanisms and respiratory enzymes in general. the genome of the bacterium geobacter sulfurreducens (gs) encodes for c-type cytochromes ( ). genetic studies using cytochrome deficient gs strains and proteomic studies identified cytochromes that were produced under specific growth conditions ( ) ( ) ( ) . a putative outer-membrane cytochrome, omcf, is crucial for fe ? and u ? reduction and also for microbial electricity production ( ). omcf is a monoheme c-type cytochrome with sequence similarity to soluble cytochromes c of photosynthetic algae and cyanobacteria ( ). the structure of oxidized omcf was determined ( ) being the first example of a cytochrome c -like structure from a nonphotosynthetic organism. the structural features of omcf hinted a different function compared to cytochromes c from photosynthetic organisms, being an excellent example of how structurally related proteins are specifically design by nature to perform different physiological functions. in order to elucidate omcf structural-functional mechanism, isotopic labeled protein ( n and c) was produced and its structure in the reduced form determined by nmr. single point-mutations at key residues were produced by site-directed mutagenesis and their impact on the structural and functional properties of omcf will be presented. in the early s, the search for the source of nitrogen monoxide (no) production in mammals led to the discovery of three major isoforms of no-synthases (nos): the neuronal nos (nnos), the inducible nos (inos), and the endothelial nos (enos) ( ( )). years later, based on genomic analysis, numerous nos-like proteins have been identified in the genome other organism and in particular of several bacteria (bacillus anthrax, staphylococcus aureus… ( )). in spite of superimposable d structures and the ability to catalyse no production, all these enzymes are carrying different (if not opposite) physiological activities including cgmp signalling, cytotoxic activities, anti-oxidant defence, metabolism… moreover, noss become increasingly associated to oxidative-stress related pathologies ranging from neurodegenerative disorders, cardiovascular and inflammatory diseases, diabetes, cancers ( )…. this apparent paradox seems related to the belief that the strong similarity of sequence and structure of no-synthases must lead to a unique and identical functioning (no production) for all isoforms. this is blatant for bacterial nos-like proteins that are lacking the essential components required for no biosynthesis but remain considered as genuine no synthases. this approach might remain an obstacle to the understanding of nos actual biological role and could prevent the design of efficient nos-targeted therapeutic strategies. to elucidate this ''nos paradox'' our group has initiated a multidisciplinary approach that aims to relate the wide diversity of nos biological activities to variations in the catalytic mechanism of noss, to modifications of their regulation patterns and to adaptations to their physiological environment. in this context we have been investigating the mechanism of bacillus subtilis nos-like proteins with a special focus on the features that are specific to nos mechanism: i) electron and proton transfers and the role of the substrate and the pterin cofactor ii) oxygen activation and the role of the proximal ligand iii) the very molecular mechanism and the variations in the nature of reaction intermediates. for that matter we have been using a combination of radiolytical techniques (cryoreduction with co y-irradiation, pulse-radiolysis with the elyse electron accelerator), stateof-the-art spectroscopies (epr, atr-ftir and resonance raman, and picoseconds uv-visible absorption spectrosocopies), organic synthesis (synthesized substrates and cofactors analogues) biochemistry and molecular biology (site-directed mutagenesis). we will present our results on the coupling of electron and proton transfers, on the tuning of the proximal ''push effect'' and we will discuss the conditions that favour for each nos isoform no production versus other reactive nitrogen and oxygen species. photosynthetic iron oxidation (pio) is an ancient form of photosynthesis with relevant consequences in the shaping of the planet. this form of metabolism may have been involved in the deposition of geological structures known as banded iron formations, which hold key information regarding the co-evolution of photosynthesis and earth. rhodopseudomonas palustris tie- and rhodobacter ferroxidans sw both use ferrous iron as an electron donor to support photosynthetic growth (i.e. photoferrotrophy). the sw foxeyz operon can stimulate light-dependent iron oxidation by other bacteria. it codes a two-heme cytochrome, a pyrroloquinoline quinone protein and an inner membrane transporter, respectively. in tie- the pioabc operon is required for photoferrotrophy. it codes for a ten-heme cytochrome, an outer membrane beta-barrel and a high potential iron-sulfur protein (hipip) respectively. here we present functional and structural characterization of proteins involved in pio. this molecular characterization is essential for understanding this mode of bioenergetic metabolism, and may one day aid the development of biotechnological applications like microbial fuel cells and bioremediation. alongside classical, cytochrome respiratory pathway, phycomyces blakesleeanus possess alternative, cyanideresistant respiration (crr) facilitated by alternative oxidase (aox). in order to study role of oxygen in regulation of crr, the effects of cyanide on respiration of h old mycelia in aerated (control), hypoxic and anoxic conditions were measured. mycelium was incubated in these conditions for . h, h and h. after . h, aox activity was increased only in specimens incubated in anoxic conditions ( . %). after h, increase in aox activity was significant in both hypoxic and anoxic specimens ( . % and . %, respectively), with even greater increase after h, . % for hypoxic and . % for specimens in anoxic conditions. mycelia treated for h was then oxygenated for minutes. this induced decrease in aox activity of % in anoxic and even . % in hypoxic mycelia. aox is recognized as one of the mechanisms for maintaining low levels of reduced ubiquione which can function in conditions in which cytochrome chain is disabled, such as anoxia. this is in concordance with results obtained on p. blakesleeanus, where aox levels rise in hypoxic and anoxic conditions and decrease close to control level shortly after introduction of oxygen into the system. influence of escherichia coli f f -atpase on hydrogenase activity during glycerol fermentation k. trchounian , , g. sawers , a. trchounian department of biophysics, yerevan state university, yerevan, armenia, institute for microbiology, martin-luther university halle-wittenberg, haale, germany e. coli encodes four hydrogenases (hyd); only three of these, hyd- , hyd- and hyd- have been well characterized. hyd- has been shown recently to reversibly evolve hydrogen during glycerol fermentation at ph . [ ] . proton reduction was inhibited by n,n'dicyclohexylcarbodiimide suggesting a link with the proton-translocating f f -atpase. indeed, at ph . in an e. coli mutant (dk ) lacking f f overall hyd-activity was reduced to approximately % of the wild type activity; hyd- , but not hyd- , was detected in an in-gel activity assay. f f is therefore suggested to be required for hyd- activity. at ph . in glycerol medium hyd-activity in dk was * % of wild type activity and hyd- and hyd- exhibited only weak activity. this indicated a significant f f contribution towards hyd-activity as ph decreased. furthermore, at ph . hydactivity was negligible and only a very weak activity band corresponding to hyd- could be observed. these results suggest that f f -atpase is essential for hydrogenase activity during glycerol fermentation at ph . . taken together, the results suggest an interdependence between hyd- , hyd- and f f -atpase activity. [ ion channels and transporters control many facets of cancer cell biology and blocking the activity of these impairs tumor cell growth in vitro and in vivo. this new paradigm has opened new opportunities for pharmaceutical research in oncology , . we have contributed to this field showing that k v . (herg ) channels are aberrantly expressed in several human cancers where they control different aspects of the neoplastic cell biology such as proliferation and apoptosis, invasiveness and angiogenesis, the latter through the regulation of vegf secretion (reviewed in ). the herg dependent effects were shown in vitro and, more recently, in vivo. in preclinical models of both leukemia and colorectal cancer , herg overexpression confers a higher malignancy to neoplastic cells. moreover, herg blockers have therapeutic potential, since preclinical tests showed that treatment with specific herg blockers overcame chemoresistance in acute leukemias as well as reduced gi cancer growth, angiogenesis and metastatic spread . the overall message emerging from our data is that the herg protein represents a novel biomarker and drug target in oncology. up to now, herg was considered an ''antitarget'' due to the cardiac side effects that many (but not all) herg blockers produce and that result in lengthening the electrocardiographic qt interval. we report here recent studies on known and newly developed herg blockers that exhibit no cardiotoxicity and are more specific for the herg channels expressed in cancer cells. we reported previously that the increase of the cholesterol content of the cell membrane (in vitro) modified the biophysical parameters of the gating of kvl. k ? ion channels in human t-lymphocytes. in our present study we aimed to determine the effect of hypercholesterolemia on the biophysical parameters of kv . gating and the proliferation of t cells. t-lymphocytes were isolated from the peripheral blood of patients with cholesterol level considered normal (\ . mmol/l,control group) and patients with hypercholesterinaemia (hc). whole-cell k? currents were measured in patchclamped t cells and the kinetic (activation and inactivation kinetics) and equilibrium parameters (voltage-dependence of steady-state activation) of kv . gating were determined. lymphocyte proliferation was measured using cfse staining with and without anti-cd and anti-cd stimulation. our results indicate that the biophysical parameters of kv . gating are similar in the control group and in the 'hc' samples. the cfse-based assay showed that hypercholesterolemic t cells had higher spontenaous activation rate compared to control group. however, t cells from high cholesterol level patients challenged by anti-cd and anti-cd exhibited lower proliferation rate than control cells. generalized epilepsy with febrile seizures plus (gefs?, omim ) is a childhood genetic epilepsy syndrome correlated to mutations in the ancillary b-subunit of neuronal voltage-gated sodium channels (nachs). b -subunit is non-covalently associated with nach a-subunits, serving as modulator of channel activity, regulator of channel cell surface expression and as cell adhesion molecule. the first and best characterized gefs? mutation is the c w. this mutation changed a conserved cysteine residue into a tryptophan, disrupting a putative disulphide bridge that should normally maintain an extracellular immunoglobulinlike fold. in this study, we investigated the presence of this putative disulphide bond using -d-diagonal-sds-page, where the proteins were separated in the first dimension in absence of a reduction agent and in presence of it in the second dimension. this method allows to visualize the protein above the diagonal experimentally confirming that the disulphide bond is intramolecular. duchenne muscular dystrophy (dmd) is associated with severe cardiac complications. recent research suggests that impaired voltage-gated ion channels in dystrophic cardiomyocytes accompany cardiac pathology. it is, however, unknown if ion channel defects are primary effects of dystrophic gene mutations, or secondary effects of the developing cardiomyopathy. here, we studied na and ca channel impairments in dystrophic neonatal cardiomyocytes, derived from dmd mouse models, prior to cardiomyopathy development. dystrophin-deficiency reduced na current density. in addition, extra utrophin-deficiency altered na channel gating. moreover, also ca channel inactivation was reduced, suggesting that ion channel abnormalities are universal primary effects of dystrophic gene mutations. to assess developmental changes, we also studied na channel impairments in dystrophic adult cardiomyocytes, and found a stronger na current reduction than in neonatal ones. the described na channel impairments slowed the action potential upstroke in adult cardiomyocytes, and only in dystrophic adult mice, the qrs interval of the ecg was prolonged. ion channel impairments precede pathology development in the dystrophic heart, and may be considered cardiomyopathy triggers. supported by austrian fwf (p ). it has been over years since sensory neuron-specific sodium channel na v . was identified. since then na v . has been shown to play a crucial role in pain pathways, and it became a prominent drug target for novel pain killers. in contrast to myelinated neurons, mechanisms that target voltagegated sodium channels to the unmyelinated c-fibre axons are largely unknown. we investigated the localisation of na v . in unmyelinated primary sensory neurons. na v . was found to be clustered in lipid rafts on unmyelinated axons. when the lipid rafts are disrupted, remarkable reduction in both the conduction velocity and the number of cells responsive to mechanical stimuli to the unmyelinated axons were seen. using a compartment culture system, we also found that disruption of rafts in the middle region of the sensory axons caused a significant reduction in the responsiveness of the neurons to chemical stimuli to nerve endings. this is due to the failure of action potential propagation through the axons. these data suggest that clustering of na v . in lipid rafts of unmyelinated fibres is a key factor for the functional properties of the channel, which may due to a change in the voltage threshold. disruption of the na v . cluster and modifying lipid rafts in primary sensory neurons may be a useful new approach to control the excitability of nociceptive neurons. ion currents in are crucially important for activation of t-lymphocytes. our aim was to investigate how the blockage of various ion channels in isolation or in combination affects mitogen-dependent activation and proliferation of t-cells. we activated human peripheral blood lymphocytes using monoclonal antibodies against the tcr-cd complex and cd . we applied specific channel blockers inhibiting the major ion channels of the t-cell: either the kv . (tea or anuroctoxin), ikca (tram- ), or the crac channel ( -apb) alone or in combination. five days after the stimulus we measured the change in cell size and cellular granulation with flow cytometry along with the proportion of dividing cells, using cfse (carboxyfluorescein succinimidyl ester) dilution assay. our measurements indicated that the ion channel blockers suppressed the proportion of dividing cells in a dosedependent manner. increasing the strength of the stimulation reduced the potency of the blockers to inhibit cell proliferation and eventually the blockers were ineffective in decreasing lymphocyte proliferation. we experienced the greatest amount of inhibition using the combination of the blockers, which indicates synergy in the regulation pathway of the various ion channels. recently, sodium dependent phosphate transporter napi b was revealed as potential marker for breast, thyroid and ovarian cancer. in vivo, napi b is involved in maintenance of phosphate homeostasis and mutations or aberrant expressions of its gene (slc a ) are associated with several diseases including cancer. however, data about napi b mrna expression in different types of cancer and correspondent normal tissues are controversial and restricted. we investigated slc a gene expression level in normal ovarian tissues and different histomorphological types of ovarian tumors using real-time pcr analysis. it was found that slc a gene was highly expressed in well-differentiated endometrioid and papillary serous tumors, but was not expressed in low-differentiated tumors, benign tumors and in most normal tissues. mrna expression of slc a in serouse and endometrioid ovarian tumors accurately correlated with protein expression that was detected in these tumor samples by western blot analysis and immunohistochemistry in our previous investigation. upregulation of slc a gene expression in well-differentiated tumors may reflect cell differentiation processes during ovarian cancerogenesis and could serve as potential marker for ovarian cancer diagnosis and prognosis. in the present contribution a procedure for molecular motion characterization based on the evaluation of the mean square displacement (msd), through the self-distribution function (sdf), is presented. it is shown how msd, which represents an important observable for the characterization of dynamical properties, can be decomposed into different partial contributions associated to system dynamical processes within a specific spatial scale. it is shown how the sdf procedure allows us to evaluate both total msd and partial msds through total and partial sdfs. as a result, total msd is the weighed sum of partial msds in which the weights are obtained by the fitting procedure of measured elastic incoherent neutron scattering (eins) intensity. we apply sdf procedure to data collected, by in , in and in spectrometers (institute laue langevin), on aqueous mixtures of two homologous disaccharides (sucrose and trehalose) and on dry and hydrated (h o and d o) lysozyme with and without disaccharides. the nature of the dynamical transition is highlighted and it is shown that it occurs when the system relaxation time becomes shorter than the instrumental energy time. finally, the bioprotectants effect on protein dynamics and the amplitude of vibrations in lysozyme are presented. we evaluated q for genotoxicity in mcf- breast cancer cells in the presence or absence of doxorubicin (dox), docetaxel (dtx) and paclitaxel (ptx), anticancer drugs commonly used in chemotherapy of different solid tumors. damage to dna was determinated by comet assay. the cells, after treatment with investigated compounds, were washed up and cultured in fresh medium for , , , and hours. we have found that q by itself caused significant dna damage. moreover flavonol enhanced genotoxic effect of anticancer drugs. the highest amount of dna in the comet tail was observed h after treatment with combination of q with dox. similar changes were found in cells incubated with combination of q with taxanes -ptx or dtx. however, damage to dna in this case was considerably lower than damage caused by combination of q with dtx. our results confirmed anticancer and genotoxic activity of quercetin, which makes it a promising candidate for a potential use as a modulator of cytotoxicity and anticancer activity of anthracycline and taxane chemotherapeutics. although the health effects of low-frequency and intensity electromagnetic fields (lfi-emfs) are controversial, increasing evidence suggests that lfi-emfs are capable for initiating various healing processes. many (bio)physical ideas were suggested to explain the influence of lfi-emfs in living systems but the main effect of lfi-emfs on cell functions remains vague. however, some effects of lfi-emfs may be explained by redox and membrane processes. during diseases, cells not only demonstrate altered biochemical processes but also produce altered non-linear bioelectromagnetic complex patterns. thus, it is reasonable to use non-linear bioelectric and bioelectromagnetic signals from cells of the body for potential therapeutic applications that may be more effective than the artificial lfi-emfs signals. our novel emost (electromagnetic-own-signal-treatment) method is based on the utilization of the non-linear, bioelectric and bioelectromagnetic signals of the patients without any electromagnetic wave modulation and inversion of recorded output signals of subjects. here, we report our some restorative results after emost application. we also suggest that the possible effects of the emost may be achieved via redox-related processes. background or leak potassium conductances are a major determinant of resting potential and input resistance, two key components of cell excitability. these currents are not passive but finely tuned and adapted to cell specific functions. k p channels producing these currents are tightly regulated by a variety of chemical and physical stimuli including temperature, membrane stretch, free fatty acids, ph, ca ? , neurotransmitters and hormones as well as protein partners. these different stimuli converge on gating mechanisms that show remarkable conservation between intracellular k p channels (twik channels) and k p channels located at the plasma membrane (trek / channels). living at the edge: volume the conduction system of interfacial forces into the alveolar type ii cell in previous investigations, using new microscopic approaches, we found that the presence of an a li leads to a paradoxical situation: it is a potential threat that may cause cell injury, but also a important stimulus: at ii cells respond promptly, and show sustained ca ?signals that activate exocytosis. exocytosed surfactant, in turn, clearly prolonged the time to irreversible cell damage, and may be an adaptive and evolutionary defense mechanism against the harmful nature of surface forces. recently we published that at ii cells are sensing the a li but how this stimulus is conducted and converted into the cell, is still obscure. currently we are searching for potential calcium sources and it seems that the cells signal ca ? by extracellular ca ? entry probably through mechanosensitive channels. specialdesigned gene chips allowed a whole genome profiling of a li exposed single cells. these cells react with rapid changes on the transcriptional level: cellular pathways that are involved include e.g. defense response and lipid metabolism, and we identified genes associated with several lung diseases and injuries. we summarize, interface forces are strong, they are acting on the cells and triggering cellular events that are closely related with classical concepts in mechanotransduction, it is very plausible that those forces play a crucial role in the lung surfactant homeostasis. calorimetric method and instrumentation were worked out and applied for investigation aqueous solutions of proteins. thermal effects were analyzed by own construction heat-fluxtype dsc cell designed for temperature range from the boiling point of water down to k. the achieved sensitivity of heat flow rate (hf) of the instrument is better than %w tested using ll of mm aqueous solution of nacl. from the integral value of hf -the total enthalpy change dh total , the enthalpies of transitions were separated from the heat capacities. using the method, several types of proteins (bsa, erd , ubq, a-, b-casein, and wt, a t, a-synuclein mutants) were investigated in that temperature range. the results are shown in detail as an illustrative example. potential applications are outlined, which include (i) the distinction between the solvent accessible surfaces of globular and intrinsically disordered proteins, (ii) the distinction between protein mutants, and (iii) the identification of monomer and polymer protein states. this method provides a possibility to study the polymerization process (amyloid formation) and to investigate in-situ the reason and circumstances of that. morphometry due to self gravity in living organism is an integral part in understanding self gravitation bio. computational studies of biological system, especially in ab-initio embryo as self gravitating mass, floating over amniotic fluid, as if maintaining anti -gravitation (extrinsic) mechanism, would be an interesting one to explore biomechanics of intrinsic gravity. since the work would be of exploratory nature, both from the point of view of identifying appropriate stage of development of embryological morulla and available computational logics, it was contemplated to initiate functional study with a few reported ultrasound evidential works on small animals like mice, grasshopper including compact human embryo as mathematical structure that may allow the formal definition of concepts such as convergence, mapping and continuity. as many of the finite-dimensional function analysis in topological vector spaces are available, we initiated our simulation and studies on concentrating locally compact banach spaces. institute for x-ray physics, university of gö ttingen, germany we report on hard x-ray phase contrast imaging of black lipid membranes (blms), which are freely suspended over a micro machined aperture in an aqueous solution. this new way of membrane structure analysis allows investigating bio molecular and organic substances in aqueous environments by parallel and divergent beam propagation imaging, using partially coherent multi-kev x-ray radiation. the width of the thinning film is significantly smaller than the detector pixel size, but can be resolved from quantitative analysis of the intensity fringes in the fresnel diffraction regime down to its native thickness of about nm. to our knowledge this is the first time that such small features of a very weak phase object have been visualized by direct x-ray imaging techniques. we have put forward a simplified but extendable model, which enables the theoretical description of image formation and characterization of membrane thickness and its decrease during the thinning process from a bulk to a bimolecular film. on the basis of recent experiments, future investigations will be performed to study the interactions of membranes, as they are for example known from synaptic fusion, with high spatial resolution. shown that the acid incorporates mainly in the exterior part of the erythrocyte membrane, inducing creation of echinocytes. this suggests that it interacts predominantly with the outer part of the lipid layer of erythrocytes and liposomes. it was also shown that the cga decreases the packing order of the hydrophobic part of the membranes, without changing the anisotropic fluorescence of the hydrophobic part. one of the unique features of single molecule absorption/ emission is their anisotropy due to the well-defined transition dipoles for both processes allowing the determination of the molecule's d orientation. therefore, several techniques have been proposed in order to determine the full d orientation of dipole emitters on a single molecule level. we recently demonstrated a technique that combines emission distribution and polarization detection [ , , ] . as the method is an intensity distribution technique and based on single photon detection in principle, one can extend the d orientation determination to fluorescence correlation spectroscopy (fcs) as well as dynamical anisotropy measurements. this allows for the determination of the dynamics in d orientation of single molecules down to a nanosecond timescales. the d orientation is particularly interesting in non-isotropic environments. a lipid membrane is such a non-isotropic environment of enormous importance in biological systems. we therefore use giant unilamellar vesicle (guv) labeled with dyes like dio as a model system. due to the defined curvature of such vesicles all possible dipole orientations can be achieved. this allows us to show the capabilities of our method on different timescales and to quantify the error in determination of d orientation dynamics in lipid membranes. [ the aim of the studies was to determine the changes that occur in a biological membrane and the model lipid membrane as a result of interaction with strawberry leaf extract. numerous studies conducted all over the world have documented a beneficial effect of polyphenolic compounds on the human organism. however, the mechanism of the interaction on the molecular and cell level is not yet known. in the work presented, the effect of strawberry leaf extract on the erythrocyte and black lipid membranes has been investigated. the applied methods -spectroscopic, fluorimetric and electric -allowed to determine the hemolytic and antioxidant activity, and the packing order in the erythrocyte membrane as well as the electric capacity of blms. the results obtained indicate that the extract is efficient in protecting membrane lipids against oxidation, does not induce hemolysis, increases osmotic resistance and decreases packing order in the hydrophilic region of the erythrocyte membrane. moreover, it increases stability and life-time of flat lipid membranes, without altering their specific capacity. supported lipid bilayers are an abundant research platform for understanding the behavior of cell membranes as they allow for additional mechanical stability and enable characterization techniques not reachable otherwise. however, in computer simulations these systems have been studied only rarely up to now. we present systematic studies on different length scales of the changes that a support inflicts on a phospholipid bilayer using molecular modeling. we characterize the density and pressure profiles as well as the density imbalance induced by the support. it turns out that the changes in pressure profile are strong enough that protein function should be impacted leading to a previously neglected mechanism of transmembrane protein malfunction in supported bilayers. we determine the diffusion coefficients and characterize the influence of corrugation of the support. we also measure the free energy of transfer of phospholipids between leaflets using the coarsegrained martini model. it turns out that there is at equilibrium about a - % higher density in the proximal leaflet. these results are in agreement with data obtained by very large scale modeling using a water free model where flip-flop can be observed directly. we are additionally characterizing the intermediate states which determine the barrier height and therefore the rate of translocation. we also study the influence of surface roughness and curvature on the behavior. simulations in atomistic detail are performed for selected systems in order to confirm the findings. [ the inverse bar (i-bar) domain is part of the superfamily of the membrane-deforming protein is bin-amphiphysin-rvs (bar) proteins which induce either positive or negative membrane curvature both in vitro and in cells. generation of membrane curvature by these membrane deforming proteins often works together with actin dynamics. i-bar shares its function between actin bundling and membrane binding but it is still obscured what molecular mechanisms are responsible for these functions. the aim of our project is to investigate the detailed membrane binding properties of the i-bar of irsp and its relations to the actin cytoskeleton. in vitro fret experiments and fluorescence quenching studies were carried out between the i-bar and liposomes made up from different lipid constructs. we have found that the i-bar has preference to bind to the negatively charged lipids however it can also bind to the uncharged lipids. the fluorescence quenching studies reflected that the accessibility of the i-bar surface was higher toward the negatively charged lipids than for the uncharged ones. tns fluorescence assay reflected that the i-bar domain binds to the surface of the micells rather than penetrating into its core. lipid bilayers present a well-known order-disorder chain transition at ambient temperatures. this transition may become anomalous if the lipid head-group presents ionic dissociation at low ionic strength, as detected by several experimental techniques: between the gel and the liquid phases an intermediate phase appears as a shoulder in the specific heat, a deep in turbulence or a maximum in conductivity. we propose a statistical model which allows ionic dissociation of the polar group on the membrane surface and thus introduces competition between the hydrophobic interaction of hydrocarbonic chains, which favours the gel phase, at low temperatures, and the electrostatic interaction of charged head-groups, which favours the fluid phase, at higher temperatures. the model presents an intermediate fluid phase with higher dissociation and charge ordering on the membrane surface, beyond a sharp gel-fluid transition. the model presents increasing temperature of the main transition by addition of salt, as well as the shrinking of the anomalous region as chain length increases. model thermodynamic behavior is compared to results for pgs, phospholipids with a glycerol head-group. the well-programmed membrane fusion systems, operating in a weakly acidic environment, have attracted attention in the fields of biochemistry, biophysics, and pharmacy because these acidic conditions are generally observed in endosomal membranes or tumor tissues. we have reported a selective liposomal membrane fusion system toward a sugar-like cyclic cis-diol structure on the target liposome. this system consists of a lapidated phenyl boronic acid derivative as membrane -bound fusogen and phosphatidylinositol as target. here we report the preparation of a boronic acid / ph-responsive polypeptide conjugate as a novel membrane fusion device and the development of a target selective liposomal membrane fusion system with endosomal ph-responsiveness. during the course of lipid-mixing, inner-leaflet lipid-mixing, and contents-mixing assays to characterize membrane fusion behavior, we clearly observed a liposomal membrane fusion phenomenon when the ph of the experimental system was changed from . (physiological) to . (endosomal). our highly effective methods, which include a target selective liposomal membrane fusion, can be useful in the area of nanomedicine such as hybridoma technology and liposomebased drug or gene delivery. complete and reversible chemical denaturation of an a-helical membrane protein jana broecker, sebastian fiedler, sandro keller molecular biophysics, university of kaiserslautern, erwin-schrö dinger-str. , kaiserslautern, germany the question of how an unordered polypeptide chain assumes its native, biologically active conformation is one of the greatest challenges in molecular biophysics and cell biology. this is particularly true for membrane proteins. chemical denaturants such as urea have been used successfully for in vitro un-and refolding studies of soluble proteins and b-barrel membrane proteins. in stark contrast with these two protein classes, in vitro unfolding of a-helical membrane proteins by urea is often irreversible, and alternative denaturation assays using the harsh detergent sodium dodecyl sulphate suffer from a lack of a common reference state. here we present the complete and reversible chemical denaturation of the bacterial a-helical membrane protein mistic out of different micellar environments by urea. we applied multidimensional spectroscopy and techniques typically used in b-barrel membrane protein unfolding. mistic unfolds reversibly following a two-state equilibrium that exhibits the same unfolded reference state. this allows for a direct comparison of the folding energetics in different membrane-mimetic systems and contributes to our understanding of how a-helical membrane proteins fold as compared with both b-barrel membrane proteins and water-soluble proteins. in recent years, buckwheat has been of great interest in the world markets of healthy food, due to its high energy value, the content of unsaturated fatty acids, mineral constituents and vitamins. its seeds contain flavonoids which are natural, efficient antioxidants. the aim of the present studies was to investigate the effect of buckwheat extracts on the properties of biological membrane, which is the main site of the interaction between the substances buckwheat contains and the organism. the research was conducted on red blood cells and their isolated membranes, using the spectrophotometric, microscopic and fluorimetric methods. from the results obtained it follows that the compounds contained in buckwheat extracts increase the osmotic resistance of erythrocytes, making them less sensitive to the medium's osmotic pressure, induce changes in cell shape, producing increased number of echinocytes, and decrease the packing order of the polar heads of membrane lipids. it can thus be inferred that the compounds contained in the extracts penetrate the hydrophobic region of the erythrocyte membrane and alter its properties. .due to its small size, symmetric structure, amphipathicity, proteolytic stability and testable mode of activity, the gs backbone is a convenient model system to examine the structure-activity relationship of individual amino acid substitutions. we have previously reported the structure analysis of two gs analogues in which either the val or leu residues on the hydrophobic face of the molecule were substituted by the aliphatic f-labeled amino acid f-phg. using f-ssnmr in oriented lipid bilayers, we observed a re-alignment of the peptide that is compatible with the formation of a putative pore [top.curr.chem. : ]. here, we present novel analogs of gs with different f-prolines in the b-turn region, and with cf -bpg in place of leu. based on these f-ssnmr results and supported by cd, dsc and activity tests, we could demonstrate that all analogues are structurally intact and antimicrobially active. we observe, however, differences in the re-alignment propensity when comparing these gs analogues in dlpc and dmpc bilayers. these differences can be rationalized in terms of molecular shape being changed upon incorporation of unnatural amino acids at various sites of the molecule. the beta-propiolactone (bpl) is an inactivating reagent commonly used to produce viral vaccine preparations (whole virions or split-virions). although bpl has been reported to inactivate nucleic acids, its mechanism of action on proteins and the outcome on viral infection remains ill-defined. in this work, h n /victoria/ / influenza virus strain has been submitted to various bpl inactivation conditions (from lm to mm). cell infection ability was progressively reduced and entirely abolished at mm bpl. to clarify the bpl effect, we focused on membrane fusion infection steps using kinetic fluorescence molecule leakage from liposome and lipid fret assays combined with cryo electron microscopy. membrane fusion measured at ph on gm liposomes was reduced in a dose-dependent manner. interestingly the fusion activity was partially restored using the proton-ionophore monensin as confirmed by cryoem images. in addition, a decrease of molecule leakage irrespective to bpl concentration was measured suggesting that the hemagglutinin affinity for gm was slightly modified even at low bpl concentration. altogether these results strongly suggest that bpl treatment impairs m protein activity likely by preventing proton transport and bring new light on the mechanism of action of bpl. cellular membranes have a heterogeneous lipid composition, potentially forming nano-domains or membrane rafts, believed to be platforms of altered fluidity involved in protein sorting and trafficking . an alternative mechanism, potentially leading to protein sorting, has recently been proposed, suggesting that the curvature of membranes can also actively regulate protein localization . recently we showed that a variety of protein anchoring motifs are membrane-curvature sensors and thus up concentrate in regions of high membrane curvature . furthermore the curvature sensing ability of the anchoring motifs persisted independently of their structural characteristics. this leads us to speculate that curvature sensing might be an inherent property of any curved membrane and as a consequence, the lipid composition of the bilayer could potentially regulate this recruitment by membrane curvature. thus there might be an intimate, yet unrecognized, link between the way raft-like membrane domains and membrane-curvature promotes the localization of membrane-anchored proteins. we examined how changing the lipid composition of liposomes influenced the recruitment by membrane curvature of a model amphiphilic protein-anchoring motif. employing our single liposome curvature assay, we tested lipid mixtures with different ratios of dopc, sphingomyelin and cholesterol, giving rise to liposome populations of different phase-states. we found an amplified recruitment by membrane curvature for all raft-like l o phase-state mixtures when compared to the l d phase-state counterparts. based on these findings we suggest a synergetic effect when combining a raft-like lipid phase-state and high membrane curvature, resulting in a highly potent mechanism for selective localization of membrane-anchored proteins. keywords: non-lamellar lipid structure, phase transition, minerval, -hydroxylated fatty acid. minerval ( -hydroxyoleic acid), a potent antitumoral drug, is known to modulate the lipid membrane structure by decreasing the lamellar-to-non-lamellar phase transition temperature (t h ). a series of -hydroxy fatty acid derivatives, varying in acyl chain length and degree of unsaturation, have been analyzed in terms of their ability to stabilize the inverted hexagonal (h ii ) phase in palmitoyl-oleoyl-phosphatidylethanolamine membranes. differential scanning calorimetry and p-nuclear magnetic resonance showed that mono-and polyunsaturated, but not saturated, -hydroxylated fatty acid molecules were able to decrease the t h . lipid vesicles mimicking the lipid composition of a cell membrane were solubilized at °c in the presence of triton x- . the results demonstrated that the amount of detergent-resistant membranes, which are related to liquid ordered (lo) structures, decreased in the presence of -hydroxylated fatty acids. the so-called lipid membrane therapy focuses on the reversion of cell disfunction through the modulation of the membrane structure, thus altering the activity of membrane-associated proteins. the ability to modify the biophysical properties of a lipid membrane makes the studied -hydroxylated fatty acid molecules be prospective candidates for the use in the lipid membrane therapy. ]. however, a significant downward divergence occurs above mol % of probe content, which might indicate deviations to ideal mixing in fluid phase. results for b-py-c -hpc, in mixtures of popc with and mol % pops were indistinguishable from those obtained with pure popc vesicles; however excimer formation in pure pops bilayers appears to be appreciably higher. we also compared the excimer formation findings with quenching of the same probes by low concentrations of doxyl quencher groups labeled acyl phospholipid chain at the same depth of the pyrenyl group. the results are also scrutinized by the same two-dimensional kinetic formalism and good correlation was also found. derek marsh max-planck-institut fü r biophysikalische chemie, gö ttingen, germany the amassing of comprehensive data on the lipid composition of biological membranes by lipidomics initiatives provides a potent challenge to the membrane biophysicist interested in lipid structure. this resolves itself essentially into two aspects. the first systematises the dependence of membrane biophysical parameters on lipid molecular structure. lipid volumes, membrane dimensions, chain-melting temperatures and enthalpies, nonlamellar phase formation and structure, critical micelle concentrations and thermodynamics of membrane formation, membrane-membrane interactions and lipid transfer are amongst the properties of central biophysical interest. the relevant structural parameters are lipid chain length, degree of unsaturation, chain branching and headgroup configuration. the second, more complex and less well developed, aspect concerns the lipidlipid interactions that determine the membrane properties of lipid mixtures. in part, these can be obtained from binary phase diagrams, and the more limited number of ternary phase diagrams -notably with cholesterol -that are available. extrapolation to higher order mixtures lies in the future. i shall attempt to summarise some of the progress in these directions. the immediate aim is a second edition of my handbook of lipid bilayers, which, in addition to a vastly expanded database, will include interpretative features and will be available in the early part of next year. lateral diffusion dynamics in phosphatidylcholine/cholesterol bilayers has been mostly accessed by means of epr, nmr and fcs spectroscopic techniques. reliable stady-state florescence quenching analysis of diffusion-controlled processes has been ampered by the lack of a self-consistent kinetic formalism for the two-dimensional ( d) counterpart of the classical stern-volmer analysis for three-dimensional ( d) solvents. we studied the excimer formation of phospholipid-labeled pyrenyl probes (proportion of mol %) in mixed popc/cholestrol mlv liposomes by combined steady-state and timeresolved fluorescence the findings are in very good agreement with the theoretical predictions of the kinetic formalism specific for fluorescence quenching processes occurring in the small hydrophobic head group, the closely packed acyl chains, and the capability of interfacial hydrogen bonding have been suggested to govern the characteristic membrane behavior of long chain saturated ceramides: the self-segregation, and the formation of hexagonal phases and highly ordered gel phases. while it has been shown that structural alterations of the ceramide acyl chains induce position dependent effects on their behavior, we wanted to study the effect of interfacial properties, including hydrogen bonding, on ceramide membrane properties. the h-bond donor functions of nh and oh in the sphingosine backbone of palmitoylceramide were disrupted either separately or simultaneously by replacing the hydrogen with a methyl-group. when the lateral phase behavior of mixed bilayers containing cholesterol/sphingomyelin-rich domains was studied in the presence of the ceramide analogs, the o-methylated ceramide appeared to form a thermally stable, sterol-excluding gel phase with sphingomyelin, whereas the o-methylated ceramide failed in both thermal stabilization and sterol displacement. the doubly methylated analog was the poorest ceramide mimic. together with the possible steric effects induced by the methylations, the lack of nh h-bond donor function impaired ceramide membrane behavior to a greater degree than the lack of oh h-bond donor function. sugar-based surfactants are made from renewable resources using the ''green chemistry'' methods, are easily biodegradable and used in washing agents, cosmetics, and drug carriers. besides, there are attempts to use them as nonviral vectors in gene therapy. we studied the influence of new x-(alkyldimethylamonium)alkylaldonamide bromides (c n gab) with different chain lengths (n = , , , ) on the thermotropic phase behavior of dppc, and dppc/chol bilayers by means of differential scanning calorimetry. the surfactants were added either to the water phase or directly to the lipid phase (a mixed film was formed). we analyzed the changes in the temperatures, enthalpies and shapes of the main phase transitions as a function of concentration. molecular modeling methods were also used. cytotoxicicity of the c n gabs was determined in the cell line l and a . for cytotoxicity test, the cells were seeded in -well plates ml of Á cells/ml in the culture medium eagle'a or dulbecco with % calf serum, penicillin and streptomycin was deposited into each plates. the cells were treated with various doses of surfactants and incubated. the minimal concentration which was toxic to approximately % of cells was taken as tccd . this work was supported by grant n n . membrane fusion is ubiquitous in life requiring remodeling of two phospholipid bilayers. as supported by many experimental results and theoretical analyses, merging of membranes seems to proceed via similar sequential intermediates. contacting membranes form a stalk between the proximal leaflets which expand radially into a hemifusion diaphragm (hd) and subsequently open to a fusion pore. direct experimental verification of the hd is difficult due to its transient nature. using confocal fluorescence microscopy we have investigated the fusion of giant unilamellar vesicles (guvs) containing fluorescent membrane protein anchors and fluorescent lipid analogues in the presence of divalent cations. time resolved imaging revealed that fusion was preceded by displacement of peptides and lipid analogues from the guv-guv contact region being of several lm in size. a detailed analysis showed that this structure is consistent with the formation of an hd. a quantitative model of the hemifusion equilibrium and kinetics of the growing hd was developed. bilayer tension could be shown to drive hd expansion and interleaflet tension was found to act as a counterforce, because the outer leaflets are compressed upon hd growth. the model and its predictions fit nicely with observations above. concentration effects of trehalose in the equivalent polarity of fluid popc bilayers c. nobre , d. arrais , j. martins , ibb -cbme, faro, portugal, dcbb -fct, universidade do algarve, faro, portugal trehalose is an important disaccharide, formed by two units of glucose linked by a a- , glicosidic bond. it is capable of replacing water molecules in the hydration shell of the phospholipid headgroups, in cases of extreme dehydration, by establishing hydrogen bonds with their -co and -po groups, preserving this way the membrane structure. the polarity gradient is a significant feature of lipid bilayers and is influenced by the amounts of water within this medium. it is therefore important to understand the effects of different concentrations of trehalose in simple model membranes. using the pyrene empirical polarity scale, we monitorized changes in the polarity values when varying trehalose concentration in the bounding aqueous phase. for lower concentrations (until . m), we observed a decrease in polarity, comparing with popc bilayers in pure water. for higher trehalose concentrations (above . m), the polarity values are indistinguishable from those popc in water. using the freeze and thaw technique we obtained the same results, except for the lower trehalose concentrations. general anesthetics are indispensible tools of daily surgery. yet, their molecular mode of action remains elusive. while one school favors specific (direct) interactions with proteins of the central nervous system, another school adheres to a nonspecific modulation of biophysical membrane properties. one of the strongest arguments against lipid theories is the absence of stereo-specific effects in model membranes, as opposed to their detection by electrophysiological measurements on ionchannels. we have combined x-ray scattering and molecular dynamics simulations on palmitoyl-oleoyl-phosphatidylcholine bilayers with fluorescence microscopy on live cells to study the effects of the stereoisomers of ketamine on membrane properties. we find significant effects of both enantiomers on the distribution of lateral pressures at clinically relevant concentrations, being more pronounced for s-(?)-ketamine. we further calculated the effect of the lateral pressure profile changes on the opening probability of an ion-channel using crystallographic information. the observed channel inhibition compares remarkably well with clinically observed effects of the enantiomers. we thus provide first evidence for a stereo-specific, but indirect effect of general anesthetics on ion-channels. dependence of gramicidin a channel lifetime on membrane structure obtained from x-ray scattering measurements horia i. petrache department of physics, indiana university purdue university indianapolis, in , usa the activity of ion channels, in particular the lifetime of their conducting (open) state depends on the physical properties of lipid bilayers [ , ] which in turn depend on lipid headgroup and acyl chain composition. in order to investigate this dependence, we have performed measurements of gramicidin a (ga) channel lifetimes in three different lipid series. in each series, the lipid headgroups were phosphatidylcholine (pc), phosphatidylethanolamine (pe), and phosphatidylserine (ps), while the acyl chains consisted of symmetric monounsaturated di( : ), mixed ( : )( : ), and methylated di( : - me). in order to minimize the effect of headgroup electrostatics, measurements where performed in m kcl salts. we show how ga lifetimes depend on headgroup and acyl chain composition and on structural parameters determined by x-ray scattering. for the lipids considered, ga lifetimes cover a range from . seconds in the dope lipid to seconds in dphps. in this range, we find a gaussian dependence of ga lifetime on bilayer thickness, consistent with hydrophobic matching models. we discuss different aspects of channel-lipid interactions and to what extent measurements of ga lifetime in binary mixtures are consistent with measurements in pure lipid systems. [ the aim of the studies was to determine the effect of chlorogenic acid (cga), which is the main constituent of plant extracts, on properties of the model membranes. its effect was studied on temperature of the main phase transition of various lipids with and without presence of cholesterol, using the differential scanning calorimetry (dsc) method and the fluorimetric method. in particular, the degree of packing order of the hydrophilic phase of liposomes was determined using the laurdan and prodan probes, and fluorescence anisotropy of the hydrophobic phase with the probes dph and tma-dph. it had also been studied the effect of chlorogenic acid on the structure and capacity of black lipid membranes (blms), formed of egg lecithin and lipids extracted from erythrocytes. the results obtained indicate that cga lowers the main phase transition temperature slightly, without changing the fluorescence anisotropy in the hydrophobic part of the bilyer, and causes a decease in the packing order of the hydrophilic phase. by monitoring the capacity during blm formation we have found that the presence of chlorogenic acid accelerates the process of lipid self-organization into a bilayer, and increases stability and life of the blms. however, the was no effect of cga on specific capacity of the membranes, and thus on thickness of the liposome membrane hydrophobic layer. this work was sponsored by the ministry of science and education, scientific project no. n n and n n . many examples have recently been found where biological processes in the lipid bilayer are affected by the changes in the physicochemical properties of the membrane, e.g. the local curvature, the membrane tension and certainly the membrane structure. it has been shown that the activity of polyene antibiotics is strongly correlated to the phase diagram in a membrane composed of a mixture of popc and ergosterol or cholesterol (j membrane biol, : - , ( )). it is known that polyene action is quite sensitive to the type of sterol in the membrane, which enables its medical use, mainly as antifungals. it has been proposed that this selectivity of the drug to fungi is related to structure modulation by the sterols (see for example, biophys. j., , , ( )) and therefore the correlation found could be due to structural differences between popc/ergosterol and popc/ cholesterol along the corresponding phase diagrams. to investigate this, molecular dynamics simulations of the above mixtures along their phase diagrams were performed. it was found that there are indeed marked differences in structure along the phase diagrams, but for the sterol-sterol distribution function. an analysis of the behavior of this observable and the implications on polyene action is discussed. acyl transfer from lipids without enzyme catalysis: a new paradigm for membrane protein ageing? john sanderson, catherine pridmore, jackie mosely, paul yeo durham university, department of chemistry, durham, uk membrane proteins are recycled in cellulo with half-lives ranging from minutes to days. in other systems, such as enveloped viruses, proteins may equally remain membranebound for periods of days. it is therefore of interest to examine the behaviour of proteins in model membranes over extended periods in order to determine the long-term stability of the mixed systems, both in kinetic terms (attainment of equilibrium states) and chemical terms (reactivity). the reactivity of proteins towards membranes has been examined using the peptide melittin as a model for membrane proteins. acyl transfer from phospholipids to the peptide was found to occur over a period of several days, in the absence of any enzyme catalysis. transfer was detectable after days and reached % conversion in days. using tandem mass spectrometry approaches, the sites of melittin modification were localised. these sites included the side chain of lysine, opening the possibility that this residue may be modified in any membrane protein where this residue has an appropriate disposition. these observations challenge preconceptions concerning the membrane as an inert medium and highlight potential new mechanisms for membrane protein ageing. interaction of poly(l-arginine) with negatively charged bilayers studied by ft-ir spectroscopy christian schwieger, alfred blume martin-luther-university, halle-wittenberg, institute of chemistry, van-dankelman-platz , halle / saale, germany e-mail: christian.schwieger@chemie.uni-halle.de oligoarginine residues attached to macromolecules are known to facilitate the transport through lipid membranes. since the mechanism of this transport is still unclear, the effect is often called ''arginine magic''. we studied the interaction of poly(larginine) (pla) of different molecular weight with negatively charged lipid bilayers. we have shown by calorimetric and monolayer techniques that the interaction is due to a combination of electrostatic and hydrophobic forces. now we present an ft-ir spectroscopic study to reveal the effect of pla binding on membrane organisation and peptide conformation. we will show that pla binding reduces the lipid miscibility of negatively charged (pg or pa) and zwitterionic (pc) lipids within the bilayer. from the shift of the c=o stretching vibration we deduce that arginine side chains penetrate into the hydrophobic/ hydrophilic interface and replace hydration water molecules. the binding reduces the rotational freedom of the lipid molecules, as could be shown by an analysis of the ch -streching vibrations. pla binds in a b-sheet conformation to pg or pa gel phase membranes whereas its structure in bulk is random coil. the shift of the guanidyl vibration frequencies shows that also hydrogen bonds contribute to the pla -lipid interactions. neutron scattering studies of model membrane as a function of hydration and temperature federica sebastiani , , alessandra filabozzi and giovanna fragneto dipartimento di fisica, università degli studi di roma ''tor vergata'', roma, italy, institut laue-langevin, grenoble, france cell membranes carry out highly specialised functions in living materials. the composition of bacterial membranes is essential to understand the mechanism of action of antimicrobial peptides. in order to understand the role of the various components contributing to the overall behaviour, we have reproduced the membrane of bacillus subtilis and carried out neutron diffraction studies on d (small momentum-transfer diffractometer) and d (reflectometer used as a diffractometer), at ill. an ordered and homogeneous sample has been obtained by using the widely studied dmpc. the measured d-spacing of dmpc as a function of the relative humidity (rh) is related to the physical and chemical conditions affecting the sample. consequently the reliability of the humidity chamber, which has been previously upgraded, has been stated. moreover, the most suitable preparation technique has been set up. in order to investigate the component roles within bacillus subtilis membrane, three samples of phospholipids were prepared (with pope, popg and cardiolipin). neutron diffraction measurements, performed at controlled rh and temperature, suggested the presence of interesting phase transitions or coexistence of phases. the rupture of membrane vesicles near solid surfaces annamá ria taká ts-nyeste, imre deré nyi department of biological physics, eö tvö s university, h- budapest pazmany p. stny. /a, hungary the behavior of lipid membranes near solid surfaces has a great significance both in medicine and in technology. in spite of the widespread use and study of such membrane phenomena, their theoretical analysis is rather scarce. our main goal here is to understand the process during which membrane vesicles first adhere to solid surfaces, then rupture (or go through a series of transient ruptures) due to the mechanical tension induced by the adhesion, and finally spread along the surface forming a supported lipid bilayer. in our theoretical description we simultaneously consider the dynamics of spontaneous pore opening and closing; volume loss via leakage through the pores; and the advancement of the adhesion front. all these processes are supposed to follow an overdamped dynamics and coupled to each other through membrane tension. our numerical simulations reveal that the rupture process consists of three well distinguishable phases: a fast initial volume loss; followed by a slow volume loss; ending with a final burst and surface spreading. the second phase can be skipped if either the first phase advances far enough or the third phase sets in early enough. the smaller the vesicle, the further the first phase can advance. the third phase can start earlier if either the surface is smooth enough, or the adhesion energy is large enough, or the line tension is small enough. when the second phase is not skipped the time needed for the rupture process can take very long with a large variance. in the realistic range of the material properties (line tension, bending rigidity) the process is qualitatively always the same, so the most decisive parameter remains the size of the vesicle: the smaller the vesicle the faster and easier it ruptures. ( )). we chose four plant-derived polyphenols (flavonoids and stilbenes) of documented biological activity to study their influence on lipid domain number, area, shape, and borderlength. we found that resveratrol elevated the number of domains per vesicle, decreased their area and markedly increased the total length of domain border without affecting domains' circular shape. surprisingly, no such effect was observed for piceatannol differing from resveratrol by one hydroxyl group only. neither genistein nor -prenylnaringenin changed the morphology of lipid domains significantly. the possible mechanism of resveratrol-induced effect on lipid domains' morphology could be its selective accumulation in the interfacial regions between liquid ordered and liquid disordered domains. putative cholesterol recognition amino acid consensus (crac) motif in hiv coreceptors cxcr and ccr mikhail a. zhukovsky, albrecht ott biological experimental physics department, saarland university, saarbruecken, germany we identified a cholesterol recognition amino acid consensus (crac) motif in transmembrane domain (tmd ) of two g protein-coupled receptors (gpcrs), human chemokine receptors cxcr and ccr , coreceptors of human immunodeficiency virus (hiv). we suggest that residues belonging to this crac motif are involved in cholesterol binding to cxcr and ccr that is responsible for cholesterol requirement for cxcr and ccr conformation and function and for the role that cell cholesterol plays in the cell entry of cxcr -using and ccr -using hiv strains. putative crac sequences involve residues v /l -y -k in cxcr and l /v /v -y -k in ccr . in cxcr , crac motif is highly conserved across chordata species, whereas in ccr , crac motif is less conserved. t curve describe quantitatively the interfacial landscape around the protein molecules and can be used for the distinction between the globular and idp states. the behavior of the t and t data showed that there are two reorientation types present for every protein solutions below °c, irrespective for the nature of the protein or the solvent composition. local field fluctuation and the bpp models were applied, which failed for the buffered protein solutions and for the idps dissolved in water. a main cause of the failure is the changing h in the analyzed temperature range. this case is valid for the solutions of idps and for buffered solutions of both protein types. another cause can be the active relaxation channels other than dipolar when ions of quadrupolar nuclei are present. ligand-induced disorder-to-order transition plays a key role in the biological functions of many proteins that contain intrinsically disordered regions. this trait is exhibited by rtx (repeat in toxin) motifs found in more than virulence factors secreted by gram-negative pathogenic bacteria. we investigated several cyaa rtx polypeptides of different lengths ranging from to residues. we showed that the rtx proteins exhibit the hallmarks of intrinsically disordered proteins in the absence of calcium: they adopt premolten globule conformations and exhibit a strong timeaveraged apparent hydration, due in part to the internal electrostatic repulsions between negatively charged residues, as revealed by the high mean net charge. calcium binding triggers a strong reduction of the mean net charge, dehydration and compaction, folding and stabilization of secondary and tertiary structures of the rtx proteins. we propose that the intrinsically disordered character of the rtx proteins may facilitate the uptake and secretion of virulence factors through the bacterial secretion machinery. these results support the hypothesis that the folding reaction is achieved upon protein secretion and, in the case of proteins containing rtx motifs, could be finely regulated by the calcium gradient across bacterial cell wall. occupational exposure to heavy metals has been recognized to be a risk factor for parkinson's disease via metaltriggered deposition of alpha-synuclein (as) , . in the present work, al ? induced conformational change and instant oligomerization of as have been studied using fret and fcs as main techniques. donor and acceptor were labeled in the c-terminal at positions a c and a c. the average lifetime of donor in the presence of acceptor increases with the increase of al ? concentration, indicating as adopts a more extended conformation upon al ? binding. the intrinsic tyr fluorescence rises sharply within the mixing dead time, reflecting an enhanced hydrophobicity of the tyr environment and a fast conformational change of as. al ? also induces an immediate oligomerization of as as monitored by fcs. the diffusion coefficient of as changes from ± lm /s as monomer state to ± lm /s as oligomer state. the oligomerization is supposed to be induced by the ligand bridging of trivalent al ions. nearly % of human genes encode protein-kinases (pk), enzymes involved in cellular signaling and several other vital biochemical functions, which transfer phosphate groups from atp to specific target molecules, modifying their activity. [ ] deregulated pk have been linked to numerous diseases including cancer and diabetes, making them attractive targets for drug design. [ ] conformational transitions play a central role in regulating the phosphorylation activity. pk adopt an on state that is maximally active and one or more inactive states that shows minimal activity. [ ] the similarity of the relatively rigid and largely conserved atp binding site makes the design of selective inhibitors binding to the active state very difficult. indeed some of the best cancer therapies available are based on inhibitors, as imatinib, that bind to inactive states peculiar to a small subset of pk (abl, c-kit and pdgfr in the case on imatinib). thus, understanding the atomic details of the active to inactive transitions in kinases has a great importance. here we study a particular active-toinactive transition of c-src, a fundamental proto-oncogene involved in cancer and metastasis, by using multi-microsecond long fully solvated molecular dynamics simulations, metadynamics and ptmetad calculations [ , ] . the results, validated by mutagenesis, x-ray crystallography and binding kinetics, are suggestive of a functional role for the conformational transition. moreover, we were able to single out the most important residues affecting the conformational transition and to show that even a very conservative amino-acid substitution can have a dramatic effect on the conformational free energy landscape. the time-scales of protein folding events range over many orders of magnitude. in order to understand the complex folding mechanisms, peptides with well-defined secondary structure are often used as model systems as they may be regarded as smallest folding units of proteins. the formation of secondary structure elements occur on the nanosecond to low microsecond time scale. thus, stopped-flow techniques are too slow whereas pulsed laser techniques are capable to trigger folding processes in nanoseconds and to analyze faster folding events. we study ns-to-ls peptide dynamics by temperature-jump infrared spectroscopy. after initiation of a nanosecond temperature jump, the spectral response is monitored at single wavelengths in the amide i region reflecting the dynamics of the peptide backbone. relaxation rates are obtained. the helix-to-coil relaxation of polyglutamic acid is a multi-step process and requires more complex models than two-state kinetics. however, there are kinetic steps that are well described by single-exponential behavior and a two-state model. we demonstrate how equilibrium and time-resolved infrared spectroscopic data can be combined to deduce folding rates. unfolding and refolding studies using chemical denaturants have contributed tremendously to our understanding of the thermodynamics and kinetics of protein folding and stability. however, a major limitation of this approach lies in the large uncertainty inherent in the extrapolation of the free energy of unfolding in the absence of denaturant from free energy values measured at finite denaturant concentrations. here we show that this limitation can be overcome by combining multiple spectroscopic signals-including fluorescence, circular dichroism, and absorbance-recorded in a quasi-simultaneous and fully automated way at different wavelengths. we have optimised the number of wavelength values used, the integration time per data point, the increment in the denaturant concentration, and the weighting scheme applied for global data fitting. compared with the traditional approach based on the use of a single or a few wavelengths, we could thus improve the precision of the free energy value by an order of magnitude. we exemplify and validate this novel approach using representative, well-studied globular proteins and explain how it can be exploited to quantify subtle changes in membrane-protein stability which have thus far remained elusive. the rates of protein conformational changes are usually not only limited by external but also internal friction, however, the origin and significance of this latter phenomenon is poorly understood. it is often found experimentally that a linear fit to the reciprocal of the reaction rate as a function of the viscosity of the external medium has a non-zero , the physical basis of pressure unfolding is still largely unknown. we report here a specific study of cavities contributions to the volume difference between unfolded and folded states (dv u ), using four single point mutants of staphylococcus nuclease (snase). each mutation is localised in a strategic position on the protein structure and was designed to change a large buried hydrophobic side chain into alanine, thus opening tunable cavities in the snase d structure. measuring hsqcs peaks intensities up to bar monitored the equilibrium high pressure unfolding and leads us to precise estimations of dv u for more than two-thirds of the residues of each mutant. so-fast hmqc experiments were also performed to measure folding and unfolding rates from bar pressure jumps. high-pressure fluorescence experiments were done on six additional alanine mutants to complement the nmr study, allowing a more complete exploration of the local pressure sensitivity along the protein d structure. all these highly reliable measurements shed light on the real signification of the thermodynamic parameter dvu, and bring an unprecedented complex and heterogeneous picture at a residue level of the apparent two-state folding process of snase. determination of contributing factors to the volume change magnitude between unfolded and folded states (dv u ) is a longstanding question in the high-pressure field. we provide here new experimental and computational data using two wellcharacterized model proteins: notch ankyrin repeat domain (nank) and staphylococcal nuclease (snase). the repetitive nature of the nank protein was used to study influence of the protein size on dv u in a systematic way with a set of deletion mutants. high-pressure fluorescence data provided new evidences that neither peptide bonds hydration nor side chains differential hydration could be considered as major contributor to the measured dv u value. additional molecular dynamics (md) simulations rather suggested that the heterogeneous distribution of void volume in the folded states structures could explain the dv u variations among the nank deletion mutants. the specific issue of the void volume contribution to dv u values was studied using cavity mutants in snase, allowing a large structural mapping of the alanine mutations on this globular protein. combination of x-ray crystallography, highpressure fluorescence, high-pressure nmr and md simulations provided a first clear determination of the void volume contribution to the dv u values. these results also bring an unprecedented complex and heterogeneous picture at a residue level of the apparent two-state folding process of snase. we expressed an ig domain (i ) and a -residue-long fragment of the pevk domain in order to investigate the effect of temperature and pressure on their conformation. ftir spectroscopy is a useful method for investigating the secondary structure of proteins. we analyzed the amide i band to obtain information on protein structure. fluorescence labeling was also used in some experiments. to generate high pressures, a diamond anvil cell was employed. the ftir and fluorescence spectra of the protein fragments were recorded across the pressure and temperature ranges of - gpa and - °c, respectively. moderate changes were observed in the conformation of the pevk fragments in the explored range of the t-p plane, suggesting that the domain is a highly flexible, random-coil across the entire studied t-p range. by contrast, the i domain showed quite stable secondary structure. intrinsically disordered proteins participate in important regulatory functions in the cell, including regulation of transcription, translation, the cell cycle, and numerous signal transduction events. disordered proteins often undergo coupled folding and binding transitions upon interaction with their cellular targets. the lack of stable globular structure can confer numerous functional advantages, including, paradoxically, both binding promiscuity and high specificity in target interactions. nmr is unique in being able to provide detailed insights into the intrinsic conformational preferences and dynamics of unfolded and partly folded proteins, and into the mechanism of coupled folding and binding. the function of intrinsically disordered protein domains in transcriptional regulation and signaling will be described, with particular reference to the general transcriptional coactivators cbp and p , the tumor suppressor p , and the adenovirus e a oncoprotein. the globular domains of cbp/p are targets for coupled folding and binding of disordered transactivation motifs of numerous transcription factors and viral oncogenes, which compete for binding to limiting amounts of cbp/p . many intrinsically disordered proteins contain multipartite interaction motifs that perform an essential function in the integration of complex signaling networks. the role of multipartite binding motifs and post translational modifications in regulation of p -mediated signaling pathways will be discussed. the early vascular network is one of the simplest functioning organs in the embryo. its formation involves only one cell type and it can be readily observed and manipulated in avian embryos or in vitro explants. the early vascular network of warm-blooded vertebrates self-organizes by the collective motility of cell streams, or multicellular ''sprouts''. the elongation of these future vascular network segments depends on a continuous supply of cells, moving along the sprout towards its tip. to understand the observed self-organization process, we investigate computational models containing interactions between adherent, polarized and self-propelled cells. by comparing the simulations with data from in vivo or simplistic in vitro experiments, we explore the role of active migration, leader cells, invasion of the ecm, and cell guidance by micromechanical properties of adjacent cell surfaces. boron neutron capture therapy (bnct) is a promising method for treating the highly fatal brain tumor; glioblastoma multiform. it is a binary modality; in which use is made of two components simultaneously; viz. thermal neutrons and boron- . the biophysics of bnct is very complicated; primarily due to the complexity of element composition of the brain. moreover; numerous components contributes to the over all radiation dose both to normal brain and to tumor. simple algebraic summation cannot be applied to these dose components, since each component should at first be weighed by its relative biological effectiveness (rbe) value. unfortunately, there is no worldwide agreement on these rbe values. thermal neutrons were formerly employed for bnct, but they failed to prove therapeutic efficacy. later on; epithermal neutrons were suggested proposing that they would be enough thermalized while transporting in the brain tissues. however; debate aroused regarding the optimum source neutrons energy for treating brain tumors located at different depths in brain. insufficient knowledge regarding the rbe values of different bnct dose components was a major obstacle. a new concept was adopted for estimating the optimum source neutrons energy appropriate for different circumstances of bnct. four postulations on the optimum source neutrons energy were worked out, almost entirely independent of the rbe values of the different dose components. four corresponding condition on the optimum source neutrons energy were deduced. an energy escalation study was carried out investigating different source neutron energies, between . ev and . mev. mcnp b monte_carlo neutron transport code was utilized to study the behavior of these neutrons in the brain. the deduced four conditions were applied to the results. a source neutron energy range of few electron volts (ev) to about kev was estimated to be optimum for bnct of brain tumors located at different depths in brain. simulation of mutation induction by inhaled radon progenies in the bronchial epithelium balá zs g. madas, Á rpá d farkas and imre balá shá zy hungarian academy of sciences kfki atomic energy research institute, konkoly-thege mikló s ú t - ., budapest, h- , hungary radon is considered as the second most important cause of lung cancer after smoking. to understand the mechanisms leading from radon exposure to cancer formation is of crucial importance. this study focuses on the description of mutation induction by radon progenies in the bronchial epithelium. computational fluid and particle dynamics approach was applied to determine the radio-aerosol deposition distribution in the central airways. a numerical replica of a small fragment of the bronchial epithelium was prepared based on experimental data. microdosimetric computations were performed to quantify the cellular radiation burdens at the very site of deposition accumulation. a mutagenesis model was applied supposing that radiation induces dna damages and enhances the cell turnover rate. the results show that both considered mutagenic effects of densely ionising radiation contribute significantly to mutation induction and mutation rate depends non-linearly on exposure rate. furthermore, simulations suggest that the local maintenance capacity of the bronchial epithelium can be exhausted by chronic exposure to radon progenies with activity concentration characteristic of some uranium mines. the present work demonstrates possible applications of numerical modelling in radon related carcinogenesis studies. the neural crest is a group of cells found in all vertebrate embryos. it forms in the neural folds at the border of the neural plate and gives rise to a huge variety of cells, tissues and organs. one of the astonishing characteristic of neural crest cells is that they are able to migrate very long distances in the embryo. the neural crest has been called the ''explorer of the embryo'' as it is one of the embryonic cell types that migrate most during development, eventually colonizing almost every tissue. in this talk i will discuss our recent finding about neural crest migration. we have shown that neural crest cells, classically described as mesenchymal cells, migrate in large clusters cytokinesis relies on tight regulation of the mechanical properties of the cell cortex, a thin acto-myosin network lying under the plasma membrane. although most studies of cytokinetic mechanics focus on force generation at the equatorial acto-myosin ring, a contractile cortex remains at the poles of dividing cells throughout cytokinesis. whether polar forces influence cytokinetic cell shape is poorly understood. combining cell biology and biophysics, we demonstrate that the polar cortex makes cytokinesis inherently unstable and that any imbalance in contractile forces between the poles compromises furrow positioning. we show that limited asymmetric polar contractions occur during normal cytokinesis, and that perturbing the polar cortex leads to cell shape oscillations and division failure. a theoretical model based on a competition between cortex turnover and contraction dynamics accurately accounts for the oscillations. we further propose that blebs, membrane protrusions that commonly form at the poles of dividing cells, stabilise the position of the cleavage furrow by acting as valves releasing cortical contractility. taken together, our findings show that the physical properties of the entire cell are integrated into a finetuned mechanical system ensuring successful cytokinesis. collective motion of individual cells marks the onset of the transition to multicellularity in many microorganisms. this transition is often mediated by intercellular communication signals between cells. here, we show, in contrast, that the transition from single cell to collective motion in an ensemble of gliding bacterial cells can be understood as a dynamical selfassembly process of self-propelled rods. experiments were carried out with a mutant of the bacterium myxococcus xanthus moving by means of the a-motility system only and without undergoing reversals. the collective motion phase is confined to a monolayer and is characterized by the organization of cells into larger moving clusters. a transition to collective motion is detected in experiments by image analysis, that reveals a qualitative change of the cluster-size distribution at a critical cell packing fraction around %. this transition is characterized by a scale-free power-law cluster size distribution with an exponent . . we provide a theoretical model for cluster formation of self-propelled rods that reproduces the experimental findings for the cluster size distribution. our findings suggest that the interplay of selfpropulsion of bacteria and volume exclusion effects of the rodshaped cell bodies is sufficient to explain the onset of collective motion and the related changes in the cluster statistics. despite much speculation on the existence of structurally distinct oligomeric species associated with the conversion of certain monomeric proteins into amyloid fibrils, it has not previously been possible to observe them directly or to relate them to any key mechanistic steps involved in the interconversion process. we have developed a novel application of singlemolecule intermolecular fret to investigate in unprecedented detail the aggregation and disaggregation of alpha-synuclein, the protein whose pathogenic deposition as intracellular lewy bodies is a characteristic feature of parkinson's disease. our study reveals that a range of oligomers of different size and structure are formed, even at physiologically relevant concentrations. interestingly, the resistance to degradation of the aggregated state of alpha-synuclein, which is a well- we focused on the structure-dynamics interplay and showed how the fractal-like properties of proteins lead to such anomalous dynamics. we used diffusion, a method sensitive to the structural features of the protein fold and them alone, in order to probe protein structure. conducting a large scale study of diffusion on over pdb structures we found it to be anomalous, an indication of a fractal-like structure. taking advantage of known and newly derived relations between vibrational dynamics and diffusion, we demonstrated the equivalence of our findings to the existence of structurally originated anomalies in the vibrational dynamics of proteins. more specifically, the time dependent vibrational mean square displacement (msd) of an amino acid is predicted to be subdiffusive. the thermal variance in the instantaneous distance between amino acids is shown to grow as a power law of the equilibrium distance. the autocorrelation function in time of the instantaneous distance between amino acids is shown to decay anomalously. our analysis offers a practical tool that may aid in the identification of amino acid pairs involved in large conformational changes. more recently, we studied the effect of the hydrodynamic interaction between amino acids using a zimm-type model. we computed the time-dependent msd of an amino acid and the time-dependent autocorrelation function of the distance between two amino acids, and showed that these dynamic quantities evolve anomalously, similar to the rouse-type behavior, yet with modified dynamic exponents. we also studied the dynamic structure factor s(k,t) of proteins at large wavenumbers k, kr g [ [ , with r g the gyration radius, that are sensitive to the protein internal dynamics. we showed that the decay of s(k,t) is dominated by the spatially averaged msd of an amino acid. as a result, s(k,t) effectively decays as a stretched exponential. we compared our theory with recent neutron spin-echo studies of myoglobin and hemoglobin for the rouse and zimm models of hydrodynamic friction. in addition, i will mention two other projects currently underway: (i) a new elastic network model that accounts for the tensorial aspects of protein elasticity and is a combination of stretch-compress springs and bond-bending energies. (ii) the unfolding of a protein under the exertion of a large pulling force. allosteric regulation of enzymatic activity is crucial for controlling a multitude of fundamental cellular processes. yet the molecular level details underlying regulation often remain poorly understood. here we employed single molecule activity studies to dissect the mechanistic origin of enzymatic activity regulation. as a model system we employed a lipase and measured its activity as a function of accessibility to surface tethered liposomes ( ), which are known regulators of its activity. our results surprisingly revealed that the lipase oscillates between states of different activity. we accurately quantified for the first time both the interconversion rates between activity states and the inherent activity of these states. based on these we calculated the energetic landscape of the entire reaction pathway and identified that regulatory interactions redistributed the probability to reside on preexisting enzymatic activity states but did not alter the activity of these states. our findings provide the missing link between conformational and activity substates supporting and represent the first direct validation of the textbook hypothesis of conformational selection for regulation of enzymatic activity to identify the potential targets of cgmp in arabidopsis plants we adopted a proteomic approach to isolate possible cgmp-binding proteins. purification of soluble cgmp-binding proteins was performed using cgmp-agarose-based affinity chromatography procedure. next eluted proteins were analyzed by sds-page which revealed ten bands. we focused the subsequent analysis on low-molecular peptides of , and kda which were bound cgmp more intensively. after d-ief-page of the proteins isolated by cgmp-agaroseaffinity chromatography eight most abundant protein spots in the low-molecular area were visualized. these spots of interest were excised from the gel and in gel digested by trypsin. then tryptic peptides were analyzed by maldi-tof mass spectrometry and identified as isoforms of nucleoside diphosphate kinase (ndpk) from arabidopsis. thus, our data suggest that ndpk is a potential target of cgmp signaling in arabidopsis. dual-color fluorescence-burst analysis (dcfba) was applied to measure the quaternary structure and high affinity binding of the bacterial motor protein seca to the protein-conducting channel secyeg reconstituted into lipid vesicles. dcfba is an equilibrium technique that enables the direct observation and quantification of protein-protein interactions at the single molecule level. seca binds to secyeg as a dimer with a nucleotideand preprotein-dependent dissociation constant. one of the seca protomers binds secyeg in a salt-resistant manner, while binding of the second protomer is salt-sensitive. since protein translocation is salt-sensitive we conclude that the dimeric state of seca is required for protein translocation. a structural model for the dimeric assembly of seca while bound to secyeg is proposed based on the crystal structures of the thermatoga maritima seca-secyeg and the escherichia coli seca dimer. • dcfba is a flurorescence based single molecule technique that allows assessment of the stoichiometry of ligands bound to membrane receptors • dimeric seca binds asymmetrically to the protein-conducting membrane channel secyeg • monomeric seca binds secyeg but dimeric seca is required for protein translocation • protein translocation depends on receptor cycling of the dimeric seca if the dna charge is sufficiently neutralized by counter-ions, electrostatic interactions between helical charge patterns can cause attraction [ ] . helix specific interactions also cause tilt, in one direction, between two dna fragments [ ] . in braids and supercoils, this impetus to tilt breaks positive-negative supercoil symmetry. we show that these effects may cause spontaneous braiding of two molecules, lowering the dna pairing energy [ ] . the pairing is more energetically favourable for homologues (same base pair text) than for nonhomologous pairs. this might explain pairing between only homologues observed in nacl solution [ ] . also, we construct a simple model for a closed loop supercoil, including chiral electrostatic interactions. there are very interesting effects, for sufficient charge neutralization and groove localization of counter-ions. i.) positive super-coils are more energetically favourable than negative ones. ii.) a transition between loosely and tightly wound supercoils as one moves from negative to positive values of the supercoiling density. iii.) in positive super-coils the chiral interaction underwinds dna. [ von willebrand factor (vwf) is a large multimeric protein that is crucial for the force sensing cascade triggering primary hemostasis. it mediates binding of activated thrombocytes to injured epithelial tissue and serves as a transporter for coagulation factor viii. while it was shown that the hemostatic activity of vwf is affected by shear stress [ ] , the exact impact that shear forces have on the inflammatory cascade remains unclear. it is assumed that hydrodynamic forces lead to partial unfolding of vwf, which in consequence exposes more binding sites. in order to observe shear-induced changes of the protein's functionality, we measure conformational changes of vwf under flow with fluorescence correlation spectroscopy (fcs). we aim to measure the degree of uncoiling of vwf multimers under various buffer conditions, e.g. in the presence of colloids, vesicles or platelets. as only large multimers show significant hemostatic activity we intend to monitor the molecular weight distribution of vwf. shifts in this distribution indicate various pathological conditions making our multimer analysis a fast diagnostical tool for vwf-related diseases. this will serve as a basis for studies of vwf binding to collagen, fviii, gpib, vesicles and membranecoated nanoparticles under shear flow. [ data on mechanical properties of medically important proteins located in neural junctions are very limited. contactins (cntn) and paranodin proteins, located in extracellular part of ranvier nodes, are important for proper brain wiring. here we study a new series of fniii modules from human cntn- and - using a single molecule afm force spectroscopy and advanced, all-atom steered molecular dynamics (smd) computer simulations. mutations in cntns are responsible for numerous brain disorders including autism or pathological development of odor maps. perhaps mechanical properties of individual fniii mutated protein modules are compromised, thus we address this problem. a comparison of our afm force spectra with those of reference proteins will be presented [ ] [ ] , and the molecular level interpretation fniii nanomechanics, based on our smd data will be given. we believe that these data should help to understand a role of cntn in regulation of sodium ion channels in both normal and autistic subjects. supported in part by polish ministry of education and science, grant no. n , the computational center task in gdansk and license for accelrys software. recent achievements in rational dna-motors engineering demonstrate the possibility to design nano-motors and nanorobots capable of performing externally controlled or programmed tasks. a major obstacle in developing such a complex molecular machine is the difficulty in characterizing the intermediates, the final products and their activity. typically, non in-situ gel and afm and in-situ bulk fluorescence methods are used. i will present two dna-motors recently developed and studied using in-situ single-molecule fluorescence resonance energy transfer (smfret), alternating laser excitation (alex) and total internal reflection fluorescence spectroscopy (tirf), and will demonstrate that these methods can improve the way we design, construct, measure and understand highly complex dna-based machines. a motor made of bipedal dna-walker, which walks on a dna track embedded on a dna-origami, capable of long walking distance and maintaining structural stability, will be presented. the motor is non-autonomous; it receives ss-dna fuel/anti-fuel commands from outside (as in shin & pierce, jacs, ). the motors assembly stages and singlemotor's walking steps are monitored using smfret. the second motor is based on published bipedal autonomous dna-motor (seeman, science ). it is characterized by coordinated activity between the different motor domains leading to processive, linear and synchronized movement along a directionally polar track. to prove that the motor indeed walks, the authors chemically froze the motor at each step and use a complicated radioactive gel assay. i will demonstrate that using single-molecule approach, we are able to directly and in-situ measure single-motor's movements in few simple experimental steps, and measure its structural dynamics and kinetics. translation by a single eukaryotic ribosome using single molecule total internal reflection fluorescence microscopy, we observed translation of a short messenger rna (mrna) strand by single eukaryotic ribosomes. the ribosome-mrnas complexes are fixed to a microscope coverslip through the mrna, and mrnas are located through fluorescently labelled oligonucleotides hybridized to it downstream start codon. because of the ribosome helicase activity, the double strand formed by the oligonucleotide and the mrna is opened while the ribosome translates this region of the mrna. thus, the loss of the fluorescence signal allows us to measure the distribution of translation speed of single ribosomes. careful attention was given to photobleaching for the data analysis. this experiment opens the door to the study of eukaryotic translation at the single molecule level. erythrocyte hyperaggregation, a cardiovascular risk factor, has been associated to high plasma concentrations of fibrinogen. using atomic force microscopy (afm)-based force spectroscopy measurements, we have recently identified the erythrocyte membrane receptor for fibrinogen, an integrin with a a or a -like subunit [ ] . after this, we extended the study to the influence of erythrocyte aging on fibrinogen binding [ ] . force spectroscopy measurements showed that upon erythrocyte aging, there is a decrease of the binding to fibrinogen by decreasing the frequency of its occurrence (from . % to . %) but not its force. this observation is reinforced by zeta-potential and fluorescence spectroscopy measurements. knowing that younger erythrocytes bind more to fibrinogen, we could presume that this population is the main contributor to the cardiovascular diseases associated with increased fibrinogen blood content, which disturbs the blood flow. our data also show that sialic acid residues on the erythrocyte membrane contribute for the interaction with fibrinogen, possibly by facilitating the binding to its receptor. antimicrobial peptides are usually polycationic and amphiphilic with high affinity for bacterial membranes. in order to characterize their therapeutic potential it is crucial to disclose which properties of the peptide/lipids are important for target selectivity, and to examine the peptide structure and its association with lipid bilayers. in this work, first experiments have been carried out on a promising peptide called sb , which might represent the basis for developing a novel class of antibiotics. with the goal of enhancing the activity of a new semi-synthetic sequence, two identical peptides (wkkirvrlsa) were assembled via a lysine-linker, carrying also an octanoyl-lipid anchor. a highly active compound was obtained, but its structure and mode-of-action remain unexplored. this dendrimeric peptide and its linear deca-peptide counterpart are being studied in parallel to highlight the relevant properties and differences between dendrimeric structure and the sequence. monolayer intercalation is investigated with microtensiometry, fluorescence spectroscopy is applied to study thermodynamics and kinetics of the binding process. circular dichroism, nmr and md simulations are employed with the aim of elucidating the d structure in the membrane-bound state. the capability of proteins to build structures via self-organization is fascinating biophysicists since decades. with the advent of single-molecule methods, namely fluorescence correlation spectroscopy (fcs) and fluorescence resonance energy transfer (fret), the process of complex formation is becoming accessible to direct observation. coronaviruses (cov) are enveloped positive-stranded rna viruses. for sars-cov, it was shown that coronaviruses encode a rna-dependent rna-polymerase (rdrp) build from non-structural protein (nsp ) and non-structural protein (nsp ). this hexadecameric nsp -nsp complex is a hollow, cylinder-like structure assembled from eight copies of nsp and held together by eight nsp molecules [ , ] . we are aiming at understanding the assembly process and conformational changes of the complex for the related feline coronavirus. first results implicate that nsp alone forms a dimer, where interchain fret is more efficient than intrachain fret. for the complex the results indicate that nsp -nsp form a heterodimer which is different from sars-cov. our experiments highlight the potential of single-molecule fret for the study of protein complex formation. diffracted x-ray tracking (dxt) has been considered as a powerful technique for detecting subtle dynamic motion of the target protein at single molecular level. in dxt, the dynamics of a single protein can be monitored through trajectory of the laue spot from the nanocrystal which was labeled on the objective protein. in this study, dxt was applied to the group ii chaperonin, a protein machinery that captures an unfolded protein and refolds it to the correct conformation in an atp dependent manner. a mutant group ii chaperonin from thermococcus strain ks- with a cys residue at the tip of the helical protrusion was immobilized on the gold substrate surface and was labeled with a gold nanocrystal. we monitored diffracted spots from the nanocrystal as dynamic motion of the chaperonin, and found that the torsional motion of the chaperonin in the presence of atp condition was times larger than that in the absence of atp condition. and uv-light triggered dxt study using caged atp revealed that the chaperonin twisted counterclockwisely (from the top view of chaperonin) when the chaperonin closed its chamber, and the angular velocity from open to closed state was % faster than that from closed to open state. peptides or proteins may convert (under some conditions) from their soluble forms into highly ordered fibrillar aggregates. in vivo such transitions can lead to neurodegenerative disorders such as alzheimer's disease. alzheimer's disease is characterised by the extracellular deposition of abeta peptide in amyloid plaques, and the intracellular formation of neurofibrillary tangle (nft) deposits within neurons, the latter correlating well with disease severity. the major constituent of nft deposits are paired helical filaments (phf) composed of a microtubule-associated protein known as tau. studying the process by which tau forms these large aggregates may be an essential step in understanding the molecular basis of alzheimer's disease and other tauopathies. we have applied a two-colour single molecule fluorescence technique, and single molecule intermolecular fret measurements to study the soluble oligomers of tau which are formed during the aggregation and disaggregation of phf's. the neuronal protein alpha-synuclein is considered to play a critical role in the onset and progression of parkinson's disease. fibrillar aggregates of alpha-synuclein are the main constituents of the lewy bodies that are found in the brains of parkinson patients. however, there is growing evidence suggesting that oligomeric aggregates are significantly more toxic to cells than fibrillar aggregates. very little is known about the structure and composition of these oligomeric aggregates. we present results using single-molecule photobleaching approaches to determine the number of monomeric subunits constituting the oligomers. our results show that the oligomers have a narrow size distribution, consisting of * - monomers per oligomer. fluorescence correlation spectroscopy data confirm the narrow size distribution and additionally indicate a very loose packing of the oligomers. in combination with bulk fluorescence spectroscopy results of tryptophan containing mutants of alpha-synuclein, we present a structural model for the alpha-synuclein oligomer. gold colloids are widely used for in vitro and in vivo imaging. compared to the traditional optical tags sers-coded nanoparticles show a narrow emission bandwidth with structured spectra typical of the molecule used, a wider excitation bandwidth, higher emission intensity, a better photo-stability, and a lower toxicity. this is why in cancer therapy, besides being considered good tools for the delivery of anti-tumor drugs, aunp can be also good optical tags for the analyses of both np localization by laser scanning microscopy and the process of drug release inside the cells by raman. in our work we used nm diameter aunp loaded with rhodamine g, a molecule with a high raman and fluorescence efficiency, and with a chemical structure similar to doxorubicin, the antitumoral drug used in our system. the data showed that aunp are internalized by cells and sers can be performed. nm and nm diameter aunp loaded with doxorubicin were incubated at different time points with a cell line (human adenocarcinomic alveolar basal epithelial cells). only nm aunp showed intense raman emission typical of the doxorubicin phonon transitions. in recent years biomedical applications of diamond nanoparticles have become of significant interest, which rises questions of their biocompatibility and mechanisms of interactions with cells. the aim of this study was to compare the effect of nonmodified diamond nanoparticles (dnps) and dnps modified by the fenton reaction on human endothelial cells. dnps (\ nm particle size, sigma) were modified by the fenton reaction introducing surface -oh groups. immortalized human endothelial cells (huvec-st) were incubated with - lg/ml dnps in the optimem medium. diamond nanoparticles modified by the fenton reaction had smaller hydrodynamic diameter estimated by dynamic light scattering and the surface potential (zeta potential) measured using laser-doppler electrophoresis. they were more cytotoxic as evaluated by the mtt reduction assay. dnps augmented generation of reactive oxygen species in the cells, estimated by oxidation of ', '-dichlorofluorescin, the effect being higher for the fenton-modified dnps after -h incubation. cellular production of nitric oxide, estimated with daf-fm, was also affected by dnps; after h, fentonmodified oh, in contrast to non-modified diamond, decreased no production. diamond nanoparticles affected also the cellular level of glutathione and activities of main antioxidant enzymes (superoxide dismutase, catalase, glutathione peroxidase, glutathione reductase and glutathione s-transferase). we aim at investigating how photoreactions of proteins can be controlled by means of intense thz radiation tuned in resonance to specific vibrational modes, much in analogy to coherent control experiments conducted by fs nir laser pulses [ ] . for this we will combine a time-resolved ir difference spectroscopic setup with uniquely intense, tunable narrow bandwidth thz radiation ( - lm) at the ps beamline of the thz free electron laser felbe. these experiments will be performed on bacteriorhodopsin (br) which is the sole protein of the purple membrane of the archaebacterium halobacterium salinarum [ ] . upon illumination, the chromophore retinal isomerizes around the c -c double bond [ ] and br pumps a proton from the cytoplasmic to the extracellular side. this proton gradient is used by the bacterium to drive photosynthetic atp production under low oxygen tension [ ] . in our experiment, the photoreaction is initiated by a visible laser pulse as in standard experiments, but then the sample will be irradiated by a thz pulse from the free electron laser tuned into resonance with low-energy vibrational modes which is supposed to influence the photoreaction [ ] . such vibrational control will be monitored by time-resolved ftir spectroscopy using the step-scan technique [ ] . liposomes are increasingly studied as nanoscale drug delivery systems and biomembrane models. however the exact structure dynamics and mechanical behavior of liposomes is little known. atomic force microscopy (afm) is a powerful tool to characterize nanoscale morphology and enables the mechanical manipulation of submicron-sized vesicles. a drawback of afm, however, is that liposomes may flatten and rupture on substrates to form patches or supported planar bilayers (spb). our aim was to obtain better understanding of factors affecting liposomes on substrates and find experimental conditions at which liposomes preserve their structural integrity. in the presence of divalent cations dppc liposomes formed spb on mica. vesicles sedimented subsequently preserved their integrity and showed stronger attachment to spb. in addition to cross-bridging lipid head groups, divalent cations influence the surface charge of liposomes, thereby modulating liposome-substrate and liposome-liposome interfacial interactions. preserved vesicles stabilized by divalent cations may provide a unique experimental system for studying membrane-protein interactions. the influence of e.g. ph and ionic strength on various chromatographic bead -biomass combinations. to analyze the force curves, possible elastic contributions, e.g. from deforming cellular membranes, have to be decoupled from the interaction forces. then, bead-biomass interactions will be modeled using (extended) dlvo theory and resulting data can also be compared to real-life eba processes. the project aims for a better understanding of the interaction forces in chromatography and might help to improve the process quality of eba. long-term non-invasive in vivo monitoring of the survival, migration, homing and fate of transplanted cells is of key importance for the success of cell therapy and regenerative medicine. tools for in vivo magnetic resonance (mr) imaging of labeled cells are therefore being developed. we have prepared superparamagnetic iron oxide nanoparticles by the coprecipitation of fe(ii) and fe(iii) salts and oxidation. to stabilize the particles and to facilitate their internalization by the cells, the nanoparticles were coated with several novel low-and highmolecular weight compounds including d-mannose, poly(llysin), poly(n,n-dimethylacrylamide) and dopamine-hyaluronate conjugate. the surface-modified magnetic nanoparticles were thoroughly characterized by a range of physico-chemical methods, which proved the presence of the coating on the particles. the particles were then investigated in stem cell experiments in terms of real time cell proliferation analysis, viability, labeling efficiency and differentiation. the iron oxide concentration of the labeled cells was assessed using mr relaxometry. the advantages/disadvantages of particular iron oxide coatings will be discussed and the optimal coating suggested. excellent contrast was achieved by labeling the cells with dopamine-hyaluronate-coated nanoparticles. support of the as cr (no. kan ) is acknowledged. in the last decades the interest towards the fabrication of innovative bio-sensors with improved sensitivity and reliability for medical-diagnostics applications has been constantly risen. among the different techniques, microfluidic systems are playing a major role. in order to detect extremely low concentrations of biomolecules (pm and fm), attention should be placed on the controlled, selective functionalization of micro-and nano-channels. in this work we propose a new approach to functionalize gold patches inside fluidic channels. we start from self-assembled monolayers (sams) of thiolated molecules on a gold electrode deposited inside the channel. then, by using an electrochemical approach [ , ] we remove molecules from the sam at selected locations, by applying a negative voltage to the electrode. the newly exposed gold surface can be re-functionalized by using a thiolated biomolecule (i.e an antibody) capable to bind specific proteins flowing inside the channel. the cycle can be applied to other electrodes in the microfluidic system, creating a multiplexing device which, as we will show, can differentially measure ionic current flows in different channels. optically actuated micromanipulation and micro-probing of biological samples are increasingly important methods in the today's laboratory. microbeads as probes are the most commonly used tools in this field, although the only manipulative motion they allow is translation. we present polymerized d microstructures which can also be used for optical micromanipulation with more degree of freedom than microbeads. the two-photon polymerization (tpp), based on focused fs laser beam into appropriate photopolymers is a powerful method to build structures of arbitrary complexity with submicrometer resolution. the presented tools have the advantage of being capable of twisting and rotational manipulative motion, and also that the position of biological manipulation and optical trapping is spatially separated. different manipulative interfaces, the positioning stability and surface activation of the manipulators will be discussed. the potential application of multiplexed quantum dot labeling, mqdl, in clinical detection, prognosis and monitoring therapeutic response has attracted high interests from bioengineers, pathologists and cancer biologists. mqdl is superior comparing with conventional organic dye staining in its narrow emission bandwidths, wide signal dynamic ranges, high detection sensitivity, and low noise to signal ratios. however, the majority of the mqdl application has been limited to identification of specific cell type or cancer subtype and the improvement in labeling methodology. in this study, we focused on simultaneous detection and analysis of proteins in the c-met activation pathway, i.e. rankl, vegf, nrpln- , p-c-met and mcl- , which are known to be associated with human prostate cancer progression and metastasis. two experimental systems were analyzed: ) fixed xenograft tissues from an established ltl castration resistance human prostate cancer or crpc model; and ) clinical prostate tissue specimens from localized cancer and bone metastasis. in the presentation we will report our experience in ) the mqdl protocol optimization for the sequential reactions of individual primary antibody, the biotinylated secondary antibody and streptavidin-coated qd conjugate with nuclear dapi staining; and ) the multiplexed image catching, image unmixing, and subsequent per cell base quantification. for future multi-specimen analyses and validation, we will introduce a high throughput vectra image analysis system. carbon nanotubes (cnts) are already quite popular among many scientific and technological disciplines . in recent years they have been targeted for biotechnological and medical applications. in this work we have investigated the nanostructural self assemblies of biological lipid molecules in presence of cnts. advantage of using highly aligned cnts for this purpose being the possibility of studying the interactions of lipid molecules on the macromolecular surface as well as in the confinement of aligned cnts. we have observed various lyotropic nanostructures that are found for corresponding lipids in the bulk under dry and hydrated conditions. nanostructural studies were mainly performed using small and wide angle x-ray scattering techniques. this work is crucial for designing the nano-micro-fluidic architectures and supported model membranes where both -functionalization of cnts and nanostructural assembling of lipids, could be employed simultaneously. alternative of the commonly used measuring tools in protein research and medical diagnostics. thazolidone derivatives are novel synthetic compounds possessing various biological activities. we selected three such compounds les- , , which passed national cancer institute in vitro tests. annexin v/pi and dapi staining, dna electrophoresis in agarose gel, and western-blot analysis using specific antibodies against cellular proteins involved in apoptosis were applied to study molecular mechanisms of tumor cell death induced by these compounds. it was found that molecular targets of thiazolidones in target cells strongly depend on structure of their side groups: les- containing isatine fragment, activated caspase- involved in receptor-mediated apoptosis, while les- possessing benzthiazol residue, induced mitochondrial apoptosis mediated by caspase- , and les- which has a unique chlorine atom in side chain, also led to mitochondrial apoptosis mediated by aif (apoptosis-inducing factor). to increase anticancer potential of these molecules, in silico study was performed and most active groups of les- and les- were combined into one molecule. in vitro studies showed that such hybrid molecule called les- possessed times higher anticancer potential (ic = lm) comparing with initial compounds. we report on the synthesis and characterization of vaterite microcontainers for controlled drug release. moreover, we present experiments on possible release strategies of encapsulated substances via recrystallization, ph controlled, or by desorption methods. vaterite spherical particles were fabricated with controllable average sizes from ± nm till ± hm. we considered two ways of functionalization of the containers: encapsulation of the substances during the vaterite synthesis or their adsorption onto the prepared particles. as model experiments, vaterite containers, encapsulating rhodamine g, were imaged by two-photon microscopy, showing dye release into the aqueous medium due to recrystallization to calcite within days. differently, in ethanol only small amounts of the encapsulated markers were diffusion released after one week. the release mechanisms can be further controlled by covering the microcontainers with additional polymer layers to increase diffusion and recrystallization time. a change of the ph from neutral to acid conditions leads to the destruction of the vaterite matrix followed by a quick release of the encapsulated materials. these flexible control mechanisms make this system an interesting candidate for pharmaceutical applications. magnetic nanoparticles (np) in combination with therapeutic molecules represent one of most promising methods for targeted drug delivery. one of major current limitations of magnetic drug targeting is to achieve efficient concentration of magnetic carrier-drug complexes at the targeted sites due to poor mobility of nanoparticles in tissue structures. interstitial delivery is hindered by microscopic extracellular matrix, which represents a major barrier for nanoparticles motilities. in order to achieve efficient magnetic drug targeting it is crucial to know particle mobility in a given in vivo environment as well as to apply magnetic field having appropriate field gradient which drags magnetic nps. we used gel magnetophoresis in order to measure motilities of different magnetic nps (co-ferrite, c-fe o ) in agarose gel. numerical modeling using fem method was used to determine appropriate settings of magnets, which generate sufficient magnetic field gradient. further, we used the numerical modeling to evaluate the magnetic force on the nps for different geometries. we obtained that one of crucial factors which determines final mobility in tissue is formation of larger aggregates of nanoparticles under physiological conditions and interaction of nanoparticles with surrounding matrix. defining the forces required to gate mechanosensitive channels in mammalian sensory neurons kate poole and gary lewin department of neuroscience, max delbrueck center for molecular medicine, robert-roessle str , , berlin-buch, germany our sense of touch and mechanical pain is based on mechano-electrical transduction (met) at the terminal endings of subsets of dorsal root ganglion (drg) neurons innervating the skin. to quantify the stimulus strengths required to gate mechanosensitive channels in these subsets of neurons, we developed an approach using microstructured surfaces. the drg neurons are grown on laminin-coated pdms pillar arrays, mechanical stimuli are applied by deflecting individual pili and the deflection is monitored using light microscopy. as the pili behave as light-guides, the center of each pilus can be determined from a fit of the intensity values, allowing detection of movements of a few nanometers. the response to such stimuli is monitored using whole-cell patch-clamp. pili deflections of nm can gate the rapidly adapting-current in mechanoreceptor cells, while deflections above nm are required for gating of slowly adapting-currents in nociceptors. smaller stimuli are required to generate currents via pili deflection ( nm) vs neurite indentation ( - nm), suggesting that gating occurs at the cell-substrate interface. we have also characterized the met currents present in n a cells which we show are modulated by the substrate to which the cells are attached. enhanced stimulation of toll-like receptor via immunostimulatory nanoparticles jan rother, anna pietuch, andreas janshoff georg-august university gö ttingen, institute of physical chemistry, tammanstr. , d- gö ttingen, germany e-mail: jrother@gwdg.de among the toll-like receptor family (tlrs), the tlr has been subject of intensive research because of its predominant localization in the lysosomes of immune cells and its ligand rendering it a potential candidate for immunotherapy of autoimmune diseases and cancer. additionally, a use as an adjuvant in vaccination is aimed using synthetic cpg-oligodeoxynucleotides (cpg-odn's). albeit, immunostimulatory cpg-odns already showed promising results in animal experiments and clinical trials, several groups found that tlr is also expressed by tumor cells. first experiments show that activation of tlr displayed by cancerous cells leads to a decreased apoptosis rate and proliferation posing unpredictable threat to tumor patients exposed to cpg-odns. therefore, detailed knowledge about the impact of cpg-odns on cancer cells is inevitable for a save use in pharmaceutics. herein, we describe a sophisticated way to address tlr in cancer cells using cpg-odn functionalized ''superparamagnetic'' mno-and c-fe o -nanoparticles (nps) to stimulate tlr in a cells. analysis of impedimetric measurements revealed a cytotoxic effect of the mno-nps. cells treated with immunostimulatory fe o -nps showed an increased micromotility as well as a higher long-term correleation of the impedance signal. biomedical diagnostics like high-sensitivity, single-molecule study, easy sample preparation. furthermore, sers allows to conduct non-invasive studies of conformations of the molecules without destruction of living cells, i.e. in vivo [ ] . this work presents a sers study of cytosolic hemoglobin (hb c ) using silver nanoparticles (agnps). the hb c was isolated from cytoplasm of red blood cells taken from rat erythrocytes and diluted. agnps were prepared by developing leopold and lendl method [ ] . three types of colloids were prepared at various temperatures ( , and °c). the resulted agnps were characterized by uv-vis-, ftirspectroscopy, dls and tem. reduction of ag ions leads to the formation of predominantly spherical agnps but also silver nanorods, faceted and aggregated agnps in small quantities with a surface plasmon resonance band in the range of - nm. for agnps synthesized at °c, for example, a bimodal size distribution was observed (about and nm medium sizes, respectively). sers measurements were optimized for each type of agnps. it was demonstrated that agnps gave strong raman enhancement from hb c and types of sers spectra differ from each other. nano-zno is characterized by unique properties, low toxicity and high biocompatibility instead of a lot of others nanomaterials. for this fact nanoparticles of zno have great potential for applications in biosystems, for example biolabeling, biosensoring, delivery systems and others, which can be used in genetics, pathology, criminology, safety of food and many other industries. for these bioapplications are necessary surface modifications, which can made to the nanostructures to better suit their integration with biological systems, leading to such interesting properties as enhanced aqueous solubility, bio-recognition or applicability for biological systems. for synthesis of zno nanoparticles in aqueous solution we used -mercapto-undecanoic acid (mua) as stabilizing agent. the coating of nanoparticles with mua could allow their solubility in the water and the binding through carboxyl groups present in its structure. we defined the optimal ph for mua modificated nano-zno solubility and their ability interaction with positive charges. we studied the optical properties of pure and surface modificated nanoparticles and their conjugates with cytochrome c and also the effect of ph on the interaction between nano-mua and horse cytochrome c. the permeation of water soluble molecules across cell membranes is controlled by channel forming proteins and particularly the channel surface determines the selectivity. an adequate method to study properties of these channels is electrophysiology and in particular analyzing the ion current fluctuation in the presence of permeating solutes provides information on possible interactions with the channel surface. as the binding of antibiotic molecules in the channels of interest is significantly weaker than that of preferentially diffusing nutrients in substrate-specific pores, the resolution of conductance measurements has to be significantly increased to be able to resolve the events in all cases. due to the limited time resolution, fast permeation events are not visible. here we demonstrate that miniaturization of the lipid bilayer; varying the temperature or changing the solvent may enhance the resolution. although electrophysiology is considered as a single molecule technique, it does not provide atomic resolution. molecular details of solute permeation can be revealed by combining electrophysiology and all atom computer modeling. [ novel functionalized nanocomposites (nc) were designed and synthesized on the basis of polymeric surface-active oligoelectrolytes. the developed technology permits controlling: ) quality and quantity of structural blocks of nc, and size unimodality; ) branching at specific sites in polymer chain of nc; ) providing nc with reactive chemical groups; ) covalent conjugation of specific bio-targeting molecules. provided bioactive elements were: a) specific anticancer drugs, antibiotics, alkaloids; b) dna and sirna; c) immunoglobulins and lectins; d) lipids and amino acids; e) polyethylene glycol. fluorescent, luminescent, super-paramagnetic, or x-ray detectable compounds were also incorporated in nc to make them detectable and measurable. biocompatible nc possessing low toxicity towards mammalian cells in vitro and in vivo (mice) were created. they were effective in delivery of: ) drugs (doxorubicine and antibiotics) for chemotherapy in vitro and in vivo; ) dna for transfection of mammalian, yeast and bacterial cells; ) protein antigens for animal immunization and specific lectins for targeting apoptotic cells. these and other approaches in application of developed nc and nanobiotechnologies are considered. this work was supported by stcu grants # , # , # . in the anti-cancer drug delivery domain, nanotechnologies are a promising tool, providing a good tissue distribution and a low toxicity. drug delivery vehicles relying on solid nanoparticles have been proposed, among which diamond nanoparticle (size\ nm) is a very promising candidate [ ] . we have investigated the delivery of sirna by nanodiamonds (nd) into cells in culture, in the context of the treatment of a rare child bone cancer (ewing sarcoma), by such a gene therapy. sirna was bound to nds after nds coating by cationic polymers, so that the interaction is strong enough to pass the cell membrane without loss of the drug and does not prevent its subsequent release. the cellular studies showed a specific inhibition of the gene expression at the mrna and protein level by the nd vectorized sirna. we also uses the fluorescence of color center created in the nanodiamonds [ ] to monitor the release of fluorescently-labeled sirna in the intracellular medium. this technique brings a quantitative insight in the efficiency of sirna to stop cell proliferation. considering the success of the cell model we recently started the drug delivery in tumor xenografted on nude mice. silica nanoparticles are stable aqueous suspension of condensed siloxane nanocomposites, having an average diameter between and nm. particles containing organic functional groups on their surface are called organically modified silica nanoparticles (ormosil). due to the various chemical and physical properties of the surface groups, ormosil nanoparticles may have an enormous variety of biological applications, such as in vivo bioimaging, non-viral gene delivery or targeted drug delivery. our aim was to synthesize both void and fluorescent dye doped amino functionalized ormosil nanoparticles through the microemulsion method and use them for gene delivery. the obtained nanoparticles have been characterized by transmission electron microscopy and dynamic light scattering. furthermore, the nanoparticles have been investigated to exploit their transfection efficiency and the possible toxicity caused by surfactants used in the synthesis. the transfection efficiency was tested on various cell cultures. our further aim is the in vivo transfection of salivary glands using ormosil nanoparticles. our work has shown that the nanomedicine approach, with nanoparticles acting as a dna-delivery tool is a promising direction for targeted gene therapy. in vivo amperoetric cells for detection of fast diffusing, physiologically important small molecules lívia nagy, bernadett bareith, tü nde angyal, erika pinté r, gé za nagy university of pé cs, pé cs, hungary h s is a naturally occurring gas that is toxic in high concentration. it exists also in different tissues of living animals sometimes in concentrations as high as lm. it is generally accepted, that h s has important roles in modulating different, physiologically important biochemical processes similarly to other, fast diffusing molecules like no, co and h o . for investigation of the physiological effects of these species their local concentration in the studied biological media is important to know. this means methods needed for measuring the instantaneous concentration with high spatial resolution in living tissues without major invasion. electrometric micro, and ultramicro sensors are often gain application in experimental life sciences for measurement of local ion concentration or following neurotransmitter species in vivo measurements. in our work efforts are being carried out to improve the applicability of selective electrometric sensors in life science experiments. as a result of these work an improved h s measuring cell and improved electrode and method was developed for measurement of electroactive small molecules like no or h o . in the poster to be presented the structure, the working principles and the performances of the different sensors mentioned will be described. bacteriorhodopsin (br) is the only protein in the purple membrane of the halophilic organism halobacterium salinarium. it is a light-driven proton pump converting light into a transmembrane proton gradient through isomerization of the covently bound retinal chromophore. its stability, as well as its photoactivity in dried films, has made br an attractive material for biomolecular devices. such studies, however, have used br within the membrane, on relatively large surfaces. here, conducting-probe atomic force microscopy (c-afm) analysis was performed after isolating the protein from its native membrane environment while keeping its basic trimeric structure, and demonstrated that the molecular conductance of br can be reversibly photoswitched with predictable wavelength sensitivity. intimate and robust coupling to gold electrodes was achieved by using a strategically engineered cysteine mutant located on the intracellular side of the protein which, combined with a % delipidation, generated protein trimers homogenously orientated on the surface. c-afm proximal probe analysis showed a reproducible fold drop of br mean resistance over * cycles of interspersed illuminations at the same gold-br-gold junction when k[ nm, while no shift was observed with other wavelengths. capture of circulating tumor cells with a highly efficient nanostructured silicon substrates with integrated chaotic micromixers shutao wang ). this core technology shows significantly improved sensitivity in detecting rare ctcs from whole blood, thus provides an alternative for monitoring cancer progression. by assembling a capture-agent-coated nanostructured substrate with a microfluidic chaotic mixer, this integrated microchip can be applied to isolate ctcs from whole blood with superb efficiency. ultimately, the application of this approach will open up opportunities for early detection of cancer metastasis and for isolation of rare populations of cells that cannot feasibly be done using existing technologies. this technology helped to find a needle in a haystack and will open up the opportunity for single cell genomic and epigenetic sequencing and gene expression profiling. results from further development of this technology will assist the physicians in follow-up patients and testing vigorously the concept of personalized oncology with individualized therapy. this novel technology has recently been reviewed and highlighted by nature medicine the growing crisis in organ transplantation and the aging population have driven a search for new and alternative therapies by using advanced bioengineering methods. the formation of organized and functional tissues is a very complex task: the cellular environment requires suitable physiological conditions that, presently, can be achieved and maintained by using properly-designed bioreactors reproducing all specific functions and bioactive factors that assure viability/regeneration of cells cultured in an appropriate scaffold. the creation of biomimetic environment requires the use of biomaterials such as membranes with specific physico-chemical, morphological and transport properties on the basis of the targeted tissue or organ. tailor-made membranes (organic, functionalized with specific biomolecules, in hollow-fiber configuration), designed and operated according to well-defined engineering criteria are able to sustain specific biotransformations, to provide adequate transport of oxygen, nutrients and catabolites throughout the cellular compartment, and to supply appropriate biomechanical stimuli of the developing tissue. in this talk the author will show the development of membrane engineered constructs focusing on liver and neuronal systems. the role of membrane surface and transport properties in providing instructive signals to the cells for the guiding of proliferation and differentiation will be discussed. membrane bioreactors, which through the fluid dynamics modulation may simulate the in vivo complex physiological environment ensuring an adequate mass transfer of nutrients and metabolites and the molecular and mechanical regulatory signals, will be presented. here we present a novel but simple system for cell-based assays enabling simultaneous testing of multiple samples on a same tissue without cross-contamination between neighbouring assays, as well as sequenced or repeated assays at the same tissue location. the principle of this method lies in the spatially-controlled diffusion of test compounds through a porous matrix to the target cells. a simple microfabrication technology was used to define areas where diffusion processes are allowed or inhibited. we performed proof-of-principle experiments on madin-darby canine kidney (mdck) epithelial cells using hoechst nuclear staining and calcein-am cell viability assay. fluorescent staining superimposed properly on membrane pattern with a dose-dependent response, indicating that both compounds specifically and selectively diffused to the target cells. mdck cells similarly treated with cytochalasin b showed their actin network rapidly altered, thus demonstrating the suitability of this system for drug screening applications. such a well-less cell-based screening system enabling multiple compounds testing on a same tissue and requiring very small volumes of test samples appears interesting for studying potential combined effects of different biochemicals applied separately or sequentially. it is generally believed that all-optical data processing is the most promising direction to achieve serious improvements both in capacity and speed of internet data traffic. one of the bottlenecks of the state-of-the-art photonic integration technology is to find the proper nonlinear optical (nlo) materials that are supposed to serve as cladding media in waveguidebased integrated optical circuits performing light-controlled active functions. recently, the unique chromoprotein bacteriorhodopsin (br) has been proposed to be used as an active, programmable nlo material in all-optical integrated circuits. in integrated optical applications of br, its light-induced refractive index change is utilized. in this paper we exploit the refractive index changes of a dried br film accompanying the ultrafast transitions to intermediates i and k, which allows even sub-ps switching, leading beyond tbit/s communication rate. in the experiments direct pulses of a femtosecond laser system at nm were used along with synchronized ultrafast laser pulses at nm. we believe that the results may be the basis for the future realization of a protein-based integrated optical device, and represent the first steps to a conceptual paradigm change in optical communication technologies. last years, such autoantibodies attract an increasing attention of researchers as potential cancer biomarkers. since the sera of cancer patients typically contain a unique set of antibodies that reflect the tumor-associated antigens expressed in a particular malignant tissue, diagnosing and predicting the outcome of disease such as breast cancer based on serum autoantibody profiling is an attractive concept. to create a representative panel of antigens for detecting of breast cancer autoantibody profile we selected breast cancer associated antigens. these antigens were identified by screening of tumor cdna libraries with autologous sera using serex (serological investigation of recombinantly expressed clones) approach. all antigens were cloned, expressed, purified in bacteria and tested with sera of breast cancer patients and healthy donors in large-scale allogenic screening using elisa. the utility of selected tumor associated antigens for detecting of autoantibody profile in different types of breast cancer was evaluated. (controlled by architectural software) is carried out according to a design template, consistent with the geometry and composition of the desired organ module. structure formation occurs by the post-printing fusion of the discrete bio-ink units. when the bio-ink units contain more than one cell type, fusion is accompanied by sorting of the cells into the physiologically relevant pattern. thus structure formation takes place through self-assembly processes akin to those utilized in early embryonic morphogenesis. we demonstrate the technology by detailing the construction of vascular and nerve grafts. spherical and cylindrical bio-ink units have been employed to build fully biological linear and branching vascular tubular conduits and multiluminal nerve grafts. upon perfusion in a bioreactor the constructs achieved desirable biomechanical and biochemical properties that allowed implantation into animal models. our results show that the printing of conveniently prepared cellular units is feasible and may represent a promising tissue and organ engineering technology. femtosecond lasers have become important tools for noncontact microprocessing of biological specimens. due to the short pulse length and intensity-dependent nature of the multiphoton ionization process, fs-laser pulses affect only a small volume of a treated cell, providing a high degree of spatial localization. we employed fs-laser to address topical bioengineering and biomedical problems such as cell fusion and embryo biopsy respectively. a tightly focused laser beam (cr:f seed oscillator and a regenerative amplifier, nm, fs, hz) was used for a fusion of blastomeres of two-cell mouse embryos and for a polar body (pb) biopsy. in order to fuse blastomeres the contact border of cells was perforated by a single laser pulse. the fusion process usually completed within * min. in order to perform a noncontact laser based pb biopsy we initially drilled an opening in the zona pellucida with a set of laser pulses, and then extracted the pb out of zygote by means of optical tweezer (cw laser, nm). the energy of laser pulses was thoroughly optimized to prevent cell damage and increase the fusion and biopsy rates. the proposed techniques demonstrate high efficiency and selectivity and show a great potential for using fs lasers as a microsurgical tool. new insights into mechanisms of electric field mediated gene delivery maš a kanduš er and mojca pavlin university of ljubljana, faculty of electrical engineering, si- ljubljana, slovenia gene electrotransfer is widely used for transfer of genetic material in biological cells by local application of electric pulses and is currently the most promising non-viral delivery method for gene therapy for a series of diseases as well as for dna vaccination. current description of the process defines several steps: electropermeabilization, dna-membrane interaction, translocation, trafficking to nucleus and into nucleus. but the mechanisms of electrotransfer are still not fully understood. we present results of the systematic in vitro analysis using pegfp of all steps involved in electrotransfection from electropermeabilization, analysis of different pulsing protocols, theoretical analysis of plasmid mobility to visualization of the processes of dna-membrane interaction. we demonstrate that in order to translate in vitro results to tissue level sub-optimal plasmid concentrations have to be used. furthermore, sofar the method of dna entry into cytoplasm was only speculated. our results suggest that it is crucial that first, membrane is electropermeabilized, then sufficient electrophoretic force is crucial for insertion of dna into destabilized lipid bilayer followed by dna translocation into cytoplasm via a slow process. efficiency of electrotransfer depends also on the stage of of cell culture -cells in dividing phase are easer to electrotransfect. gentamicin interaction with b f cell membrane studied by dielectrophoresis dielectrophoresis (dep) is the translational motion of polarizable particles due to an electric field gradient. positive-dep and negative-dep correspond to particle movement forward or backward the region of high field intensity, respectively. our study reveals some of the cell membrane modifications induced by gentamicin (gt), as they are reflected in the crossover frequency f co of b f murine cells incubated with gt for different concentrations and durations. f co is the ac frequency when cells turn from positive-dep to negative-dep. gentamicin is a positively charged aminoglycosidic antibiotic, with concentration-dependent killing action; it is widely used because of its low cost and reliable bactericidal activity. gt drawbacks consist in high toxicity for renal and hearing cells; the molecular mechanisms of this toxicity are still unclear. for low external medium conductivities (& . s/m), f co of control and gt-cells was found to range from to khz. f co shifts to higher frequencies with the increase of gt concentration and incubation time. cells dielectrophoretic behavior is discussed using the cell singleshell based model. extracellular matrix (ecm) is a major obstacle for succesful delivery of genes. chitosan is a versatile and biocompatible polysaccharide derived from chitin and is a promising gene carrier. chitosan-dna interactions, and hence dna polyplexation and release can be controlled through chitosan de-acetylation degree, molecular weight and functionalization of chitosan cationic groups. grafting of poly(ethylene glycol) peg to gene delivery vectors increases circulation time of gene delivery systems in blood vessels and reduces polyplexes charge. diffusion and unpacking of pegylated and non-pegylated chitosan-dna polyplexes through articial ecms based on collagen and collagen-hyaluronic acid (ha) gels were compared using fluorescence correlation microscopy, confocal microscopy and colocalization analysis. non-pegylated polyplexes were immobilized in the gels whereas pegylated polyplexes were diffusing. the smaller charge of pegylated polyplexes seems to decrease interactions between polyplexes and ecm components. furthermore, ha might also screen collagen fibers-pegylated polyplexes interactions. pegylated polyplexes also showed a higher degree of unpacking in gels, probably due to a looser compaction of dna by pegylated chitosan compared to non-pegylated chitosan. fabrication of vesicles, a close membrane made of an amphiphile bilayer, has great potentiality for encapsulation and controlled release in chemical, food or biomedical industries but also from a more fundamental point of view for the design of biomimetic objects. methods based on lipid film hydratation , inverse emulsion techniques and more recently microfluidic techniques such as double emulsion or jetting method are limited either by a low yield, a low reproducibility, a poor control on the size, or by the presence of remaining solvent or defects. we propose a fast and robust method easy to implement: continuous droplet interface crossing encapsulation (cdice), that allows the production of defect-free vesicles at high-yield with a control in size and content. the vesicles have controlled bilayer composition with a polydispersity in size lower than %. we have shown that solutions as diverse as actin, cells, micrometric colloids, protein and high ionic strength solutions can easily be encapsulated using this process. by adjusting the parameters of our set-up, we are able to produce vesicles in the range - lm in diameter, stable for weeks. we believe this method open new perspectives for the design of biomimetic systems and even artificial tissues. under appropriate conditions. the extremely variable d domain of flagellin subunits, comprising residues - , protrudes at the outer surface of flagellar filaments. the d domain has no significant role in the construction of the filament structure. thus, replacement of d may offer a promising approach for insertion of heterologous proteins or domains without disturbing the self-assembly of flagellin subunits. our work aims at the construction of flagellinbased fusion proteins which preserve the polymerization ability of flagellin and maintain the functional properties of the fusion partner as well. in this work a fusion construct of flagellin and the superfolder mutant of green fluorescent protein (gfp) was created. the obtained gfp variant was highly fluorescent and capable of forming filamentous assemblies. our results imply that other proteins (enzymes, binding domains etc.) can also be endowed by polymerization ability in a similar way. this approach opens up the way for construction of multifunctional filamentous nanostructures. generation polyamidoamine (pamam) dendrimer has been shown to be highly efficient nonviral carriers in gene delivery. however, their toxicity limits their applications. in this study, to improve their characteristics as gene delivery carriers, g pamam dendrimer was modified with anti-tag nanobody through hetrobifunctional peg, then complexed with t-bid coding pdna, yielding pamam-peg-anti-tag nanobody/pdna nanoparticles (nps). nuclear magnetic resonance (nmr) spectroscopy, zeta sizing and gel retardation assay results provided evidence that the nanovector was successfully constructed. the transfection efficiency of vector/pdna complexes were evaluated in vitro. real time pcr results also demonstrated that anti-tag nanobody modified nps are more efficient in t-bid killer gene expressing in colon cancer cell line than the unmodified nps. in conclusion, pamam-peg-anti-tag nanobody showed great potential to be applied in designing tumour-targeting gene delivery system. dept of chemistry, faculty of science, national university of singapore, singapore, dept of biochemistry, yong loo lin school of medicine, national university of singapore, singapore, division of bioengineering, faculty of engineering, national university of singapore, singapore macromolecular crowding (mmc) is a biophysical tool which has been used extensively to enhance chemical reactions and biological processes by means of the excluded volume effect (eve). the in vivo stem cell microenvironment contains macromolecules which are crucial for stem cell selfrenewal and cell fate determination. in order to mimic this physiological microenvironment, crowders are included in cell culture medium. we have observed that the ex vivo differentiation of human mesenchymal stem cells (hmscs) into the adipogenic lineage is significantly amplified when a crowder mixture comprising ficoll and ficoll is added to the culture medium. stem cell differentiation is modulated by soluble chemical substances as well as interactions between cells and the extracellular matrix (ecm), and both these external influences may be affected by mmc. measurements we have performed by fluorescence correlation spectroscopy (fcs) show that ficoll additives cause anomalous subdiffusion within a crowder concentration range of to mg/ml. the diffusion of fluorophorelabelled molecules in artificial lipid bilayers and membranes of living cells is not changed by crowders, suggesting that these crowders do not directly alter membrane properties and cell surface signalling. however, we have data to suggest that crowders increase actin polymerization reaction rates in vitro. we have also observed that crowders are taken up by stem cells and that they localize to specific compartments. based upon our observations, we hypothesize that crowders can influence stem cell differentiation by influencing molecular kinetics. lignocellulose-based composites are becoming extremely important and perspective sustainable and renewable natural materials. fibre modification enhancing their existing properties can be obtained to broaden the application areas. in response to shortcomings of traditional chemical and physical methods, enzymes and chemo-enzymatic methods have emerged as eco-friendly catalysts working under mild conditions and enable tailoring of the material surface properties by substrate specificity and regional selectivity. recently, binding of different functional molecules to lignin-rich fibres by using an oxidative enzyme (e.g. laccase) has been reported leading to their functionalisation through free radical reactions. by the application of electron paramagnetic resonance spectroscopy (epr) laccase action was inspected. consumption of substrates was investigated and their polymerization traced. stable radical intermediates were detected with epr when substrate molecules were in contact with active enzymes. secondly, oxidation of mediators like nitroxides was determined via epr spectroscopy of stable water-soluble nitroxide radicals. finally, the generation of short-lived radicals as well as their reduction was measured via epr spin trapping using dmpo as sensitive water soluble spin trap. mammalian ovary hormone stimulation (ohs) is known to be an inalienable stage of reproductive biotechnology as well as human infertility treatment. the basic aim of the ohs is to receive a stock of valuable oocytes and early embryos for subsequent utilization in the reproductive technology, experimental work et al. however, it is known that ohs itself affects the character of ovulation and oocyte quality, which in its turn affects the development of embryos and even has distant consequences. the wideness of cell parameters and appropriate methods for investigation of gamete/embryo quality are very important. the aim of this study is determination of specific electric conductivity of mouse oocytes and early embryos which have been received after ohs in comparison with the ones that have been received in natural animal sex cycle. using techniques of electroporation the dependence of specific electric conductivity of mouse oocytes, zygotes, -cell and -cell embryos on the external electric field intensity has been studied. it is shown that the whole pool of oocytes that were obtained in the result of ohs consists of two groups of oocytes that don't differ from each other morphologically, but differ by their electric parameters and resistance to electric breakdown. at the zygote stage, dividing of embryos into two groups is preserved, but is less expressed. at the stage of -cell and -cell dividing of embryos into two groups on their electric conductivity disappeared but certain scattering of the parameters due to individual embryo peculiarities is observed. the obtained data show that ohs may lead to latent changes of oocyte state that in their turn affect embryo quality. many microbes synthesize and accumulate granules of polyhydroxyalkanoates (pha, biodegradable storage materials alternative to traditional plastics), which help them survive under stresses. in particular, the plant-growth-promoting rhizobacterium azospirillum brasilense, that is under investigation worldwide owing to its agricultural and biotechnological significance, can produce poly- hydroxybutyrate (phb) [ ] . in our work, phb synthesis in a. brasilense cells was studied under various stresses using diffuse reflectance ftir spectroscopy. phb in cells was determined from the band intensity ratio of the polyester m(c=o) at * cm - to that of cell proteins (amide ii band at * cm - ), showing a. brasilense to be able to produce phb up to over % of cells' dry weight. stresses induced phb accumulation, enhancing ir absorption in phb specific regions. analysis of a few structure-sensitive phb vibration bands revealed changes in the degree of intracellular phb crystallinity (related to its enzymatic digestion rate) at different stages of bacterial growth, reflecting a novel trait of the bacterial adaptability to an enhancing stress, which is of great importance to agricultural biotechnology. the aim of this work is to furnish enzymes with polymerization ability by creating fusion constructs with the polymerizable protein, flagellin, the main component of bacterial flagellar filaments. the d domain of flagellin, exposed on the surface of flagellar filaments, is formed by the hypervariable central portion of the polypeptide chain. d is not essential for filament formation. the concept in this project is to replace the d domain with suitable monomeric enzymes without adversely affecting polymerization ability, and to assemble these chimeric flagellins into tubular nanostructures. to test the feasibility of this approach, xylanase a (xyna) from b. subtilis was chosen as a model enzyme for insertion. with the help of genetic engineering, a fusion construct was created in which the d domain was replaced by xyna. the flic(xyna) chimera exhibited catalytic activity as well as polymerization ability. these results demonstrate that polymerization ability can be introduced into various proteins, and building blocks for rationally designed assembly of filamentous nanostructures can be created ( table ) . the support of the hungarian national office for research and technology and the hungarian scientific research fund (otka) (grants ck , nk , nanoflag) is acknowledged. cluster phases of membrane proteins an alternative scenario for the formation of specialized protein nano-domains (cluster phases) in biomembranes dna fragmentation induced in human fibroblasts by accelerated fe ions of differing energies swift heavy ion irradiation of srtio under grazing incidence'' proc. natl. acad. sci. usa forespore engulfment mediated by a ratchet-like mechanism a channel connecting the mother cell and forespore during bacterial endospore formation a feeding tube model for activation of a cell-specific transcription factor during sporulation in bacillus subtilis the scanning ion-conductance microscope imaging proteins in membranes of living cells by high-resolution scanning ion conductance microscopy nanoscale live-cell imaging using hopping probe ion conductance microscopy beta -adrenergic receptor redistribution in heart failure changes camp compartmentation simultaneous noncontact topography and electrochemical imaging by secm/sicm featuring ion current feedback regulation timasheff in protein-solvent interactions the roles of water in foods on motor and electrical oscillations in urinary tract: computer evaluation daniele martin , , viktor foltin , , erich gornik , rumen stainov , , tanya zlateva nü rnberg. icsd e.v. postfach (pob) , d- mü nchen method: parameters: motor patterns (guinea-pig) -frequency/f, amplitudes/a (% init. length, isot. & intracell. rec.) of spontaneous phasic/spc & tonic/stc contractions, also electrical spikes/s, bursts/b, burst plateaus/bp (neu et al. biophys.j. / a/jan stretch ( - mn), k-/ca-influence induced specific changes in motor/electrical parameters. special computer programme reflects exactly biophysical parameters. conclusion: acc. to earlier/recent results mechano-sensitive ca ?? -activated k ? -channels participate in electrical oscillations of detrusor/ureteral myocytes. further experiments/evaluations incl effect of hydrophobic mismatch on the light-induced structural changes in bacterial reaction centers s. s. deshmukh, h. akhavein, and lá szló ká lmá n department of physics mechanism of proton transfer in nitric oxide reductase: computational study andrei pisliakov riken advanced science institute, wako-shi proteins , . acknowledgments this work was supported by project grant ptdc/qui/ / (cas) and doctoral grants sfrh th anniversary conference aicr this work was supported by the italian association for cancer research (airc), the istituto toscano tumori and the associazione noi per voi differential hydration, void volume: which factor provides the main contribution to dv u ? inserm umr fr; roumestand@cbs.cnrs.fr introduction: globalization needs new organizational models also for biophysics. reports on necessity of int. institutes for biophysics (iib) c/o int. universities (proposed by british nobel laureate b.russell) are given conception: proposals for ebsa-discussion: . enlargement of executive committee by a. honorary & presidents (permanent - : moral support & - fixed term), b. interdisciplinary commission: scientists from biology, medicine, physics, etc. (feps/iups, iuphar, iupab, etc). . implication of interdisciplinary topics to esba/iupab congressprogrammes, . also for biophysical journals. . organization of common interdisciplinary sessions not only to biophysical, but also to other congresses. . co-operation between esba/iupab with int. interdisciplinary organisations (waas, icsd/ias, eur. academies) for creation of iib by network of national ones: successive common personnel, possibility for whole life work, etc. conclusion: realization of proposals .- . could increase scientific/political authority of ebsa/iupab, leading to model for renewal of scientific organizations collective migration of neural crest cells: a balance between repulsion and attraction roberto mayor university college london goodilin , , olga v single-molecule cut-and-paste surface assembly (smcp) has been also used to build up a biotin scaffold that streptavidin utilizing specific molecular interactions, for example between dna-binding proteins and dna or antibodies and antigens, this technique is capable of providing a scaffold for the controlled self-assembly of functional complexes. furthermore, this allows for the introduction of smcp into protein science. we aim to employ dna-binding zinc-finger variants and gfp-binding nanobodies as shuttle-tags fused to the proteins of interest. thus a fully expressible system that can be used for the step-wise assembly of individual building blocks to form single-molecule cut-and-paste surface assembly optically monitoring the mechanical assembly of single molecules nanoparticle self-assembly on a dna-scaffold written by single-molecule cut-and-paste torsional motion analysis of group ii chaperonin using diffracted x-ray tracking nanomechanical manipulation of mason-pfizer monkey retroviral rna fragment with optical tweezers melinda simon , zsolt má rtonfalvi , pasquale bianco , beá ta vé rtessy , mikló s kellermayer micro-viscosimeter generated and manipulated by light andrá s buzá s , lá szló oroszi , ló rá nd kelemen , pá l ormos temesvá ri krt proc. natl. acad. sci. usa neural signal recordings with a novel multisite silicon probe gergely má rton , anita pongrá cz , lá szló grand , , É va vá zsonyi pé ter pá zmá ny catholic university, faculty of information technology, h- , /a prá ter st multiscale pattern fabrication for life-science applications francesco valle , beatrice chelli , michele bianchi , eva bystrenova , marianna barbalinardo , arian shehu , tobias cramer , mauro murgia , giulia foschi miroslava kuricova , jana tulinska , aurelia liskova , eva neubauerova , maria dusinska , , ladislava wsolova acceleration neuronal precursors differentiation induced by substrate nanotopography gianluca grenci , jelena ban , elisabetta ruaro , massimo tormen , marco lazzarino , and vincent torre light-induced structural changes are reported near the primary electron donor of bacterial reaction centers (brc) dispersed in detergent micelles and in liposomes from lipids with different fatty acid chain lengths. in this study we present evidence for the correlation between the light-induced increase of the local dielectric constant, determined by the analysis of the electrochromic absorption changes, and the lifetime of the charge-separated state at physiologically relevant temperatures. the increase of the local dielectric constant induced a significant decrease of the oxidation potential of the primary electron donor and a slow proton release, which appears to be the rate limiting step in the overall process. systematic selection of the head group charges of detergents and lipids, as well as the thickness of the fatty acid chains of the liposome forming lipids can increase the lifetime of the charge-separated state by up to orders of magnitude. such extensions of the lifetime of the charge-separated state were reported earlier only at cryogenic temperatures and can provide new opportunities to utilize the brc in energy storage. ontogenesis of photosynthetic bacteria tracked by absorption and fluorescence kinetics m. kis, e. asztalos, p. maró ti department of medical physics and informatics, university of szeged, hungarythe development of photosynthetic membrane of rhodobacter sphaeroides was studied by absorption spectroscopy and fast induction of bacteriochlorophyll fluorescence in different phases of the growth, under various growing conditions (oxygen content, light intensity etc.) and in synchronous cell population. the results are: ) the newly synthesized components of the membranes were imbedded immediately into the proteinous scaffold independently on the age of the cell (no ,,transient'' membranes were observed). ) under aerobic conditions, the pigments were bleached and under anaerobic conditions the pigment systems showed greening. the relative variable fluorescence (f v /f max ) had small age-dependent (but not cellcycle-related) changes. the fluorescence induction kinetics was sensitive marker of the aerobiosis: the f v /f max ratio dropped from . to . and the photochemical rate constant from Á s - to Á s - with an apparent halftime of about - hours after change from anaerobic to aerobic atmosphere. ) the electrogenic signal (absorption change at nm) reflected the energetization of the membrane which showed cell-cycle dependent changes. that included periodic production and arrangement of protein-lipid components of the membrane synchronized to the cell division. interfacial water in b-casein molecular surfaces: wide-line nmr, relaxation and dsc characterization t. verebé lyi , m. bokor , p. kamasa , p. tompa , k. tompa research institute for solid state physics and optics, hungarian academy of sciences, pob. , budapest,hungary, institute of enzymology, biological research center, hungarian academy of sciences, pob. , budapest, hungarywide-line proton nmr fid, echoes, spin-lattice and spin-spin relaxation times were measured at . mhz frequency in the - °c to ? °c temperature range, in lyophilized bcasein and aqueous and buffered solutions, and dsc method were also applied. the motivation for the selection of b-casein is the uncertainty of structural order/disorder. naturally, the nmr and thermal characteristics were also evaluated. the melting of hydration water could be detected well below °c and the quantity of mobile water molecules (hydration) was measured. the hydration vs. melting temperature curve has informed us on the bonding character between the protein surfaces and water molecules. the generally used local field fluctuation model and the bpp theory were applied in the interpretation, and the limits of the models were concluded. the torsional properties of dna play an important role in cellular processes such as transcription, replication, and repair. to access these properties, a number of single-molecule techniques such as magnetic tweezers have been developed to apply torque to dna and coil it. i will briefly refer to investigations of dna-protein interactions using these techniques, and describe what has been learnt. i will then focus on the development of novel magnetic techniques that go beyond standard magnetic tweezers, such as the magnetic torque tweezers and the freely-orbiting magnetic tweezers . these approaches allow one to quantify conjugate variables such as twist and torque. for example, the magnetic torque tweezers rely on high-resolution tracking of the position and rotation angle of magnetic particles in a low stiffness angular clamp. we demonstrate the experimental implementation of this technique and the resolution of the angular tracking. subsequently, we employ this technique to measure the torsional stiffness c of both dsdna molecules and reca heteroduplex filaments. lastly, i will describe novel applications of the optical torque wrench , . the optical torque wrench is a laser trapping technique developed at cornell capable of applying and directly measuring torque on microscopic birefringent particles via spin momentum transfer. we have focused on the angular dynamics of the trapped birefringent particle , demonstrating its excitability in the vicinity of a critical point. this links the optical torque wrench to non-linear dynamical systems such as neuronal and cardiovascular tissues, non-linear optics and chemical reactions, which all display an excitable binary ('all-or-none') response to input perturbations. based on this dynamical feature, we devise a conceptually novel sensing technique capable of detecting single perturbation events with high signal-to-noise ratio and continuously adjustable sensitivity.for the first time we report a comparative approach based on surface enhanced raman spectroscopy (sers) and raman spectroscopy to study different types of haemoglobin molecules in living erythrocytes. in erythrocytes there are two fractions of haemoglobin: cytosolic (hb c ) and membranebound (hb m ). the concentration of hb m is less then , % and therefore it is impossible to study hb m with traditional optical techniques. modifications of cellular membrane can affect conformation of hb m . therefore, it can be used as a sensitive marker of pathologies. firstly, we investigated enhancement of sers signal of hb m depending on ag nanoparticles' size. we found that the intensity of sers spectra of hb m and enhancement factor increase with the decrease in ag nanoparticles' size. secondly, we investigated the dependence of haemoporphyrin conformation in both hb c and hb m ion ph values. we observed different sensitivity of hb c and hb m to the ph and found that conformational movements of haemoporphyrin (vibrations of pyrrol rings and side radicals) in hb m are sterically hampered comparably with hb c . our observation is an evidence of a benefit of application of surface enhanced raman spectroscopy to investigate properties of the hb m in erythrocytes and provide new information about conformational changes and functional properties of hb m . rna nanotechnology is an emerging field with high potential for nanomedicine applications. however, the prediction of rna three-dimensional nanostructure assembly is still a challenging task that requires a thorough understanding the rules that govern molecular folding on a rough energy landscape. in this work, we present a comprehensive analysis of the free energy landscape of the human mitochondrial trna lys , which possesses two different folded states in addition to the unfolded one. we have quantitatively analyzed the degree of rna tertiary structure stabilization, firstly, for different types of cations, and, secondly, for several naturally-occurring nucleotide modifications in the structural core of the trna lys . , thus, notable variations in the rna binding specificity was observed for the divalent ions of mg ? , ca ? and mn ? , that can be attributed to their sizes and coordination properties to specific ligands. furthermore, we observed that the presence of m g modification together with the principal stabilizing m a modification facilitates the rna folding into the biologically functional cloverleaf shape to a larger extent than the sum of individual contributions of these modifications. in order to elucidate the mechanism of the recognition, we used diffracted x-ray tracking method (dxt) that monitors real-time movements of individual proteins in solution at the single-molecule level. we found that peptides move distinctly from i-a k , and the rotational motions of peptides correlate with the type b t cell activation. in the case of diabetogenic i-a g , immediately after peptide exchange, all the peptides moves magnificently but the motion ceased in a week, then new ordered motion appears; the rotational motion of peptides correlate to t cell activation, which is analogous to the peptide in i-a k . the rotational motion of peptides may create transient conformation of peptide/mhc that recognized by a population of t cells. dxt measurement of peptide/mhc complex well correlated to other biological phenomenon too. our finding is the first observation that fluctuations at the level of brownian motion affect to the functions of proteins.mason-pfizer monkey virus (mpmv) is an excellent model for the analysis of retrovirus assembly and maturation. however, neither the structure of the viral rna, nor its modulation by capsid-protein binding are exactly known. to explore the structure of the mpmv genome, here we manipulated individual molecules of its packaging signal sequence with optical tweezers. the -base-long segment of mpmv rna corresponding to the packaging signal, extended on each side with -base-long indifferent gene segments for use as molecular handles, was cloned into a pet b vector. rna was synthesized in an in vitro transcription system. rna/ dna handles were obtained by hybridization in a pcr with complementary dna initiated with primers labeled with either digoxigenin or biotin. the complex was manipulated in repetitive stretch and relaxation cycles across a force range of - pn. during stretch, transition occurred which increased the rna chain length and likely corresponds to unfolding. the length gain associated with the unfolding steps distributed across three main peaks at * , , nm, corresponding to * , , bases, respectively. often reverse transitions were observed during mechanical relaxation, indicating that refolding against force proceeds in a quasi-equilibrium process. structural investigation of gpcr transmembrane signaling by use of nanobodies jan steyaert , structural biology brussels, vrije universiteit brussel, pleinlaan , brussel, belgium, department of structural biology, vib, pleinlaan , brussel, belgiumin , scientists at the vrije universiteit brussel discovered the occurence of bona fide antibodies devoid of light chains in camelidae. the small and rigid recombinant antigen binding fragments ( kd) of these heavy chain only antibodies -known as vhhs or nanobodies -proved to be unique research tools in structural biology. by rigidifying flexible regions and obscuring aggregative surfaces, nanobody complexes warrant conformationally uniform samples that are key to protein structure determination by x-ray crystallography:• nanobodies bind cryptic epitopes and lock proteins in unique native conformations • nbs increase the stability of soluble proteins and solubilized membrane proteins • nbs reduce the conformational complexity of soluble proteins and solubilized membrane proteins • nbs increase the polar surface enabling the growth of diffracting crystals • nbs allow to affinity-trap active protein i will focus my talk on the use of nbs for the structural investigation of gpcr transmembrane signaling to illustrate the power of the nanobody platform for generating diffracting quality crystals of the most challenging targets including gpcrs and their complexes with downstream signaling partners. dynamics of the type i interferon receptor assembly in the plasma membrane stephan wilmes, sara lö chte, oliver beutel, changjiang you, christian paolo richter and jacob piehler university of osnabrü ck, division of biophysics, barbarastrasse , osnabrü ck, germanytype i interferons (ifn) are key cytokines in the innate immune response and play a critical role in host defense. all ifns bind to a shared cell surface receptor comprised of two subunits, ifnar and ifnar . detailed structure-function analysis of ifns has established that the ifn-receptor interaction dynamics plays a critical role for signalling specificities.here we have explored the dynamics of receptor diffusion and ifn assembly in living cells. by using highly specific orthogonal posttranslational labelling approaches combined with tirf-microscopy we probed the spatio-temporal dynamics of receptor diffusion and interaction in the plasma membrane of live cells on the single molecule level. for this purpose, we employed posttranslational labelling with photostable organic fluorophores. this allowed us to map diffusion and lateral distribution of ifnar and ifnar with very high spatial and temporal resolution by using single particle tracking (spt) and single molecule localization imaging. observed events of ''transient confinement'' and co-localization with the membrane-proximal actin-meshwork suggest partitioning of ifnar / in specialized microcompartments. this will be investigated in terms influence on receptor assembly and recruitment of cytoplasmic effector proteins. cytoplasmic dynein moves through uncoordinated action of the aaa ring domains ahmet yildiz department of physics, and department of molecular and cell biology, university of california, berkeley, ca usa cytoplasmic dynein is a homodimeric aaa? motor that moves processively toward the microtubule minus end. the mechanism by which the two catalytic head domains interact and move relative to each other remains unresolved. by tracking the positions of both heads at nanometer resolution, we found that the heads remain widely separated and move independently along the microtubule, a mechanism different from that of kinesin and myosin. the direction and size of steps vary as a function of interhead separation. dynein processivity is maintained with only one active head, which drags its inactive partner head forward. these results challenge established views of motor processivity and show that dynein is a unique motor that moves without strictly coordinating the mechanochemical cycles of its two heads.o- self-controlled monofunctionalization of quantum dots and their applicaitons in studying protein-protein interaction in live cells changjiang you, stephan wilmes, sara loechte, oliver beutel, domenik lisse, christian paolo richter, and jacob piehler universitä t osnabrü ck, fachbereich biologie, barbarastrasse , osnabrü ck, germanyindividual proteins labeled with semiconductor nanocrystals (quantum dots, qd) greatly facilitate studying protein-protein interactions with ultrahigh spatial and temporal resolution. multiplex single molecule tracking and imaging require monovalent quantum dots (mvqd) capable of orthogonally labeling proteins with high yield. for this purpose, we prepared monovalance qd-trisnta by a chemical conjugation method. our results indicated that monovalent qd-trisnta was obtained in high yield by restricting the coupling by means of electrostatic repulsion. monovalent functionalization of the qd-trisnta was confirmed by assays in vitro and in vivo. two-color qd tracking of interferon receptors ifnar and ifnar based on mvqd-trisnta were realized on live cell [ ] .to broaden the multiplex toolbox of mvqds, we extended the electrostatic-repulsion induced self-control concept for mono-functionalizing quantum dots with different affinity moieties. as a first instance, we used negatively-charged biotin peptide to produce qd with biotin mono-functionalization. we confirmed our approach was a general method to rend qd monovalent by single molecule assays based on stepwise photobleaching. these mvqds facilitate obtaining spatiotemporal information of ifnars' organization in live cells. by orthogonal labeling u a cells stably expressing ifnar at low level with biotin mvqd and mvqd-trisnta-ifn, we verified colocalization and colocomotion of individual ifn and ifnar at minute scale. combined with super-resolution imaging of ifnars' cytosolic effecter stat , we observed the dynamic coming-and-going contact between the microcompartments of ifnar and stat .a micron sized viscometer was fabricated using the couette type geometry that is capable of measuring the complex viscosity of fluids. the viscometer was produced by two photon polymerization of su photopolymer using a femtosecond laser system, a high na objective and a piezo translator stage. the viscometer was manipulated by holographic optical tweezers and operated in the . - hz frequency range. video analysis algorithm was used to evaluate our measurements. we tested the viscometer with water-glycerol solutions. one of the main reasons for lack of reliability in protein analysis for disease diagnostics or monitoring is a lack of test sensitivity. this is because, for many tests, to be reliable, they need to be performed on a homogeneous, and therefore very small, sample. current in-vitro techniques fail in accurately identifying small differences in protein content, function and interactions starting from samples constituted of few or even single cells. a nanotechnology approach may overcome the current limits in low abundance protein detection. we aim at designing a microwell device for the trapping (in native environment) and the parallel characterization of rare cells (e.g. adult stem cells). such versatile device, based on soft and nanolithography, will promote cell adhesion and viability on differently functionalized bio-compatible materials, allowing for the morphological characterization of the cells, at a single cell level. in parallel, by facing our microwell device with a protein nanoarray, produced via atomic force microscopy nanolithography, we can run proteomic studies at a single/few cells level. moreover, we could foresee the possibility to deliver different stimuli to each cell, correlating the changes in chemistry/ morphology with the protein profile at a single cell level. using an electrophysiological assay the activity of nhaa was tested in a wide ph range from ph . to . . forward and reverse transport directions were investigated at zero membrane potential using preparations with inside out and right side out oriented transporters with na ? or h ? gradients as the driving force. under symmetrical ph conditions with a na ? gradient for activation, both the wt and the ph-shifted g s variant exhibit highly symmetrical transport activity with bell shaped ph dependencies, but the optimal ph was shifted . ph units to the acidic range in the variant. in both strains the ph dependence was associated with a systematic increase of the k m for na ? at acidic ph. under symmetrical na ? concentration with a ph gradient for nhaa activation an unexpected novel characteristic of the antiporter was revealed; rather than being down regulated it remained active even at ph as low as . these data allowed to advance a transport mechanism based on competing na ? and h ? binding to a common transport site and to develop a kinetic model quantitatively explaining the experimental results. in support of these results both alkaline ph and na ? induce the conformational change of nhaa associated with nhaa cation translocation as demonstrated here by trypsin digestion. furthermore, na ? translocation was found to be associated with the displacement of a negative charge. in conclusion, the electrophysiological assay allowed to reveal the mechanism of nhaa antiport and sheds new light on the concept of nhaa ph regulation. swimming motility is widespread among bacteria. however, in confined or structured habitats bacteria often come in contact with solid surfaces which has an effect on the swimming characteristics. we used microfabrication technology to quantitatively study the interaction of swimming cells with solid boundaries. we tracked bacteria near surfaces with various engineered topologies, including flat and curved shapes. we were able to study several surface related phenomena such as hydrodynamic trapping and correlated motion. we think that our results may help to understand how physical effects play a role in surface related biological processes involving bacteria such as biofilm formation. cell labeling efficiency of oppositely charged magnetic iron oxide nanoparticles-a comparative study raimo hartmann , christoph schweiger , feng zhang , wolfgang. j. parak , thomas kissel ,# , pilar rivera_gil ,# biophotonics, institute of physics, philipps university of marburg, pharmaceutical technology, institute of pharmacy, philipps university of marburg e-mail: kissel@staff.uni-marburg.de; pilar.riveragil@physik.uni-marburg.dethe interaction of nanomaterials with cells is a key factor when considering their translocation into clinical applications. especially an effective accumulation of nanoparticles inside certain tissues is beneficial for a great number of applications. predominantly size, shape and surface charge of nanoparticles influence their cellular internalization and distribution. to investigate this, two series of maghemite (c-fe o ) nanoparticles were synthesized either via aqueous coprecipitation or via thermal decomposition of organometallic precursor molecules. size and the spherical shape of both nanoparticle types were kept constant whereas the charge was changed by modifying the surface of the nanoparticles with polymers of opposite charge, in detail poly(ethylene imine) (pei) and a polymaleic anhydride derivative (pma). the positively and negatively charged c-fe o nanoparticles were characterized with respect to size, zeta potential, colloidal stability and magnetic properties. furthermore, the uptake rate and localization of both formulations into a carcinoma cells after fluorescent labeling of the carriers as well as the resulting alteration in mr-relaxation times were evaluated. membrane proteins are the target of more than % of all drugs and are encoded by about % of the human genome. electrophysiological techniques, like patch-clamp, unravelled many functional aspects of membrane proteins but usually suffer from poor structural sensitivity. we have developed surface enhanced infrared difference absorption spectroscopy (seidas) , to probe potential-induced structural changes of a protein on the level of a monolayer. a novel concept is introduced to incorporate membrane proteins into solid supported lipid bilayers in an orientated manner via the affinity of the his-tag to the ni-nta terminated gold surface . full functionality of surface-tethered cytochrome c oxidase is demonstrated by cyclic voltammetry after binding of the natural electron donor cytochrome c. general applicability of the methodological approach is shown by tethering photosystem ii to the gold surface . in conjunction with hydrogenase, the basis is set towards a biomimetic system for h -production. recently, we succeeded to record ir difference spectra of a monolayer of sensory rhodopsin ii under voltage-clamp conditions . this approach opens an avenue towards mechanistic studies of voltage-gated ion channels with unprecedented structural and temporal sensitivity. initial vibrational studies on the novel light-gated channelrhodopsin- will be presented . probing biomass-chromatographic bead interactions by afm force spectroscopy gesa helms, marcelo ferná ndez-lahore, rami reddy vennapusa, and jü rgen fritz school of engineering and science, jacobs university bremen, bremen, germany e-mail: g.helms@jacobs-university.dein expanded bed adsorption (eba), bioproducts are purified from an unclarified fermentation broth by their adsorption on chromatographic beads in a fluidized bed. the unspecific deposition of biomass onto the adsorbent matrix can severely affect the process performance, leading to a poor system hydrodynamics which then decreases the success of this unit operation. to quantify the bead-biomass interactions different chromatographic beads are attached to afm cantilevers, and force spectroscopy experiments are performed with these colloidal probes on model surfaces and cells in solution. the experiments are conducted under varying conditions to study uncovering physiological processes at the cellular level is essential in order to study complex brain mechanisms. using multisite signal recording techniques in the extracellular space, functional connectivity between different brain areas can be revealed. a novel microfabrication process flow, based on the combination of wet chemical etching methods was developed, which yields highly reproducible and mechanically robust silicon-based multielectrode devices. the fabricated shaft of the probe is lm wide, lm thick, has rounded edges and ends in a yacht-bow like, sharp tip. its unique shape provides decreased invasivity. the sensor contains platinum recording sites at precisely defined locations. murine in vivo experiments showed that the probes could easily penetrate the meninges. high quality signals, providing local field potential, multi-and single unit activities, were recorded. the interfaces between the tissue and the platinum contacts were further improved by electrochemical etching and carbon nanotube coating of the metal sites. the integrated optical mach-zehnder interferometer is a highly sensitive device, considered a powerful lab-on-a-chip tool for specific detection of various chemical and biochemical reactions. despite its advantages, there is no commercially available biosensor based on this technique. the main reason is the inherent instability of the device due to slight changes of environmental parameters. in this paper we offer a solution to this problem that enables the optimal adjustment of the working point of the sensor prior to the measurement. the key feature is a control unit made of a thin film of the lightsensitive chromoprotein bacteriorhodopsin deposited on the reference arm of the interferometer. after showing the transfer characteristics of such a device, we demonstrate its applicability to sensing of specific protein-protein interactions. we expect our method to become a rapid and cost-efficientthe combination of unconventional fabrication technology and biomaterials allows both to realize state-of-the-art devices with highly controlled lateral features and performances and to study the main properties of the biomolecules themselves by operating at a scale level comparable with the one crucial for their activity. soft lithography and microfluidic devices offer a tool-box both to study biomolecules under highly confined environments [ ] and to fabricate in an easy way topographic features with locally controlled mechanical and chemical surface properties, thus leading to a finer control of the interplay of mechanics and chemistry. i will present an application of this technology to the control of cell fate that is becoming a key issue in regenerative medicine in the perspective of generating novel artificial tissues. patterns of extracellular matrix (ecm) proteins have been fabricated, by a modified lithographically controlled wetting (lcw), on the highly antifouling surface of teflon-af to guide the adhesion, growth and differentiation of neural cells (shsy y, n , ne- c) achieving an extremely accurate guidance [ ] . local surface topography is also known to influence the cell fate [ ] , thus, integrating this parameter in the substrate fabrication could increase the complexity of the signals supplied to the cells. in this perspective we have developed a novel fabrication technique, named lithographically controlled etching (lce), allowing, in one step, to engrave and to functionalize the substrate surface over different lengthscales and with different functionalities. i will conclude showing how we have been developing ultrathin film organic field effect transistors (ofets) as label-free biological transducers and sensors of biological systems. ofets are low dimensional devices where ordered conjugated molecules act as charge transport material. unconventional patterning techniques and microfluidics have been adapted to proteins and nucleic acids to dose the molecules on the ofet channel with a high control of the concentration. in another set of experiments, we have also been addressing the signalling from neural cells and networks grown on pentacene ultra-thin film transistosr [ , ] .advances in nanotechnology are beginning to exert a significant impact in medicine. increasing use of nanomaterials in treatment of diseases has raised concerns about their potential risks to human health. in our study, the effect of poly(lactic-co-glycolic acid) (plga) and titanium dioxide (tio ) nanoparticles (nps) on function of b-and t-lymphocytes was investigated in vitro. human blood cultures were treated with plga and tio nps in concentrations: . ; and lg/cm for h. lymphocyte transformation assay was used to assess the effect of nps on lymphocyte function. lymphocytes were stimulated with mitogens: concanavalin a, phytohaemmagglutinin (t-cell response) and pokeweed mitogen (b-cell response). our findings indicate immunomodulatory effect of plga nps. proliferative response of t-and b-lymphocytes exposed in vitro to the highest dose of plga for h was suppressed significantly (p. , p. ). on the other hand, we observed stimulative effect of exposure to middle dose of plga nps on b-lymphocyte proliferation (p. ). no alteration was found in lymphocyte proliferation treated in vitro with tio nps for h. in conclusion, proliferation of lymphocytes in vitro might be one of the relevant tests for evaluation of nps immunotoxicity.embryonic stem (es) cell differentiation in specific cell lineage is still a major challenge in regenerative medicine. differentiation is usually achieved by using biochemical factors (bf) which concentration and sides effects are not completely understood. therefore, we produced patterns in polydimethylsiloxane (pdms) consisting of groove and pillar arrays of sub-micrometric lateral resolution as substrates for cell cultures. we analyzed the effect of different nanostructures on differentiation of es-derived neuronal precursors into neuronal lineage without adding biochemical factors. neuronal precursors adhere on pdms more effectively than on glass coverslips but the elastomeric material itself doesn't enhance neuronal differentiation. nano-pillars increase both precursors differentiation and survival with respect to grooves. we demonstrated that neuronal yield was enhanced by increasing pillars height from to nm. on higher pillar neuronal differentiation reaches * % hours after plating and the largest differentiation enhancement of pillars over flat pdms was observed during the first hours of culture. we conclude that pdms nanopillars accelerate and increase neuronal differentiation. key: cord- -lj us dq authors: flower, darren r.; davies, matthew n.; doytchinova, irini a. title: identification of candidate vaccine antigens in silico date: - - journal: immunomic discovery of adjuvants and candidate subunit vaccines doi: . / - - - - _ sha: doc_id: cord_uid: lj us dq the identification of immunogenic whole-protein antigens is fundamental to the successful discovery of candidate subunit vaccines and their rapid, effective, and efficient transformation into clinically useful, commercially successful vaccine formulations. in the wider context of the experimental discovery of vaccine antigens, with particular reference to reverse vaccinology, this chapter adumbrates the principal computational approaches currently deployed in the hunt for novel antigens: genome-level prediction of antigens, antigen identification through the use of protein sequence alignment-based approaches, antigen detection through the use of subcellular location prediction, and the use of alignment-independent approaches to antigen discovery. reference is also made to the recent emergence of various expert systems for protein antigen identification. of smallpox were reported annually from across the globe, leading to about million deaths a year. yet, today, the disease has been completely eradicated. in the last years, there have been no known cases. poliomyelitis or polio is the other largescale disease which has come closest to eradication. its success too has been formidable: in , the pan american health organization effectively eradicated polio from the western hemisphere, since when the global polio eradication programme has significantly decreased the overall incidence of poliomyelitis through the rest of the world. in , there were approximately , cases spread through countries; in the past years, global figures amounted to less than , annually. yet, in spite of such remarkable success, death from vaccine-preventable diseases remains unacceptably high [ ] . there are over common infectious diseases responsible for one in four deaths globally. rotavirus and pneumococcus are pathogens causing diarrhoea and pneumonia, the leading causes of infant deaths in underdeveloped countries. in the next decade, effective, widespread vaccination programs against such pathogenic microbes could save the lives of . million children under years of age. hepatitis b causes , deaths in adults and children aged over . seasonal, non-pandemic influenza kills upwards of half a million globally each year. for those aged under in particular, a series of diseases causes an extraordinary and largely preventable death toll. for example, tetanus accounts every year for , deaths, pertussis is responsible for over , deaths, hib gives rise to in excess of , deaths, diphtheria accounts for , deaths, and yellow fever over , deaths. arguably, the most regrettable, the most lamentable situation is that of measles. measles accounts for the unneeded deaths of , under-fives and over , adults and older children. despite this, the situation is by no means bleak. by the close of , approximately million had been vaccinated against hib and million children against hepatitis b. during its first decade, vaccinations against polio, hep b, hib, measles, pertussis, and yellow fever funded by gavi had prevented the unnecessary loss of over million lives. there are approximately vaccines licensed for use in humans, around half of these are widely prescribed. yet, most of these vaccines target the prevention of common childhood infections, with the remainder addressing tropical diseases encountered by travellers to the tropics; only a relatively minor proportion combat endemic disease in under-developed countries. balancing the persisting need against the proven success and anticipated potential, vaccines remain an area of remarkable opportunity for medical advance, leading directly to unprecedented levels of saved and improved lives. from a commercial perspective, the vaccine arena has long been neglected, in part because of the quite astonishing success limned above; today, and in comparative terms at least, activity within vaccine discovery is feverish [ , ] . during the last years, tens of vaccines and vaccine candidates have moved successfully through clinical trials, and vaccines in late development number in the hundreds. in stark contrast to antibiotics, vaccine resistance is negligible and nugatory. despite the egregious and outrageous success enjoyed by vaccines, many major issues persist. the world health organisation long ago identified tuberculosis (tb), hiv, and malaria as the three most significant life-threatening infectious diseases globally. no vaccine has been licensed for malaria or hiv, and there seems little realistic hope for such vaccines appearing in the immediate future. bacille calmette guérin (bcg), the key anti-tb vaccine, is of limited efficacy [ ] . levels of morbidity and mortality generated by diseases already targeted by vaccines remain high. influenza is the key example, with a global annual estimated death toll in the region of half a million. in the twenty-first century, the world continues to be threatened by infectious and contagious diseases of many kinds: visceral leishmaniasis, marburg's disease, west nile, dengue, as well as sars potentially pandemic h n influenza, and over human and emerging zoonotic infections, as well as the persisting threat from hiv, tb, and malaria mentioned above. all this is further compounded by the additional risk arising from antibiotic-resistant bacteria and bioterrorism, not to mention major quasi-incidental issues, such climate change, an accelerating growth in the world's population, increased travel, and the overcrowding seen within the burgeoning populations concentrated into major cities [ ] . for reasons we shall touch on below, the discovery of vaccines is both more urgent and more difficult than it has ever been. in an era where conventional drug discovery has been seen to fail-or at least as seen by cupiditous investors, for whom the current model of pharmaceutical drug discovery is broken-vaccines are one of a number of biologically derived therapies upon which the future economic health of the pharmaceutical industry is thought to rest. the medical need, as stated above, is clear. set against this is the unfortunate realisation that vaccines exist for most easily targeted diseases, those mediated by neutralising antibodies, and so outstanding vaccine-targets are those of more intractable diseases mediated primarily by cellular immunity. to address those properly requires what all discoveries required: hard work and investment; but they also need new ideas, new thinking, and new vaccine discovery technology. amongst, these are computational techniques, the most promising of which are those targeting the discovery of novel vaccine antigens: the candidate subunit vaccines of tomorrow see fig. . . vaccines are agents-either molecular (epitope-or antigen-based vaccines) or supramolecular (attenuated or inactivated whole pathogen vaccines)-which are able to create protective immunity against specific pathogenic infectious microorganisms and any diseases to which they might give rise. protective immunity can be characterised as an enhanced but highly specific response to consequent re-infection-or infection by an evolutionarily closely related micro-organismsmade by the adaptive immune system. such increased or enhanced immunity is facilitated by the quantitative and qualitative augmentation of immune memory, which is able to militate against the pernicious effects of infectious disease. vaccines synergise with the herd immunity they help engender, leading to reduced transmission rates as well as prophylaxis against infection. the term "vaccine" derives from vacca (latin for cow). the words vaccine and vaccination were coined specifically for anti-smallpox immunization by the discoverer of the technique, edward jenner ( - ). these terms were later extended by louis pasteur ( - ) to include a far more extensive orbit or remit, including the entire notion of immunisation against any disease [ , , ] . several fundamentally distinct varieties of vaccine exist. these include inter alia inactivated or attenuated whole pathogen-based vaccines; subunit vaccines are based on one or more protein antigens, vaccines based upon one or more individual epitopes, carbohydrate-based vaccines, and combinations thereof. hitherto, the best-used and, thus, the most successful types of vaccine were built from attenuated-"weakened" or non-infective or otherwise inactivated-pathogenic whole organisms, be they bacterial or viral in nature. well-known examples include the following: the bcg vaccine which acts prophylactically against tuberculosis and albert sabin's anti-poliomyelitis vaccine based on attenuated poliovirus. the vast majority of subunit vaccines are immunogenic protein molecules, and are typically discovered using a somewhat haphazard search process. concerns over the safety of whole-organism vaccines long ago prompted the development of other kinds of vaccine strategy, including those based upon antigens as the innate or immanent active biological constituent of either single or composite vaccines. the vaccine which targets hepatitis b is a good exemplar of a so-called subunit vaccine as it is based on a protein antigen: the viral envelope hepatitis b surface antigen. other types of as-yet-unproven vaccines include those based on epitopes and others based on antigen-presenting cells; many have entered clinical trials, but none have fulfilled their medical or commercial potential. whole antigen discovery. when looking at a reverse vaccinology process, the discovery of candidate subunit vaccines begins with a microbial genome, perhaps newly sequence, progresses through an extensive computational stage, ultimately to deliver a shortlist of antigens which can be validated through subsequent laboratory examination. the computational stage can be empirical in nature; this is typified by the statistical approach embodied in vaxijen [ ] . or this stage can be bioinformatic; this involves predicting subcellular location and expression levels and the like. or, this stage can take the form of a complex mathematical model which uses immunoinformatic models combined with mathematical methods, such as metabolic control theory [ ] , to predict cell-surface epitope populations it is often difficult to capture the proper scientific meaning and use of recondite terms, often borrowed from common usage or archaic language. so, let us be more specific. an immunogen-a molecular moiety exhibiting the property of immunogenicity-is any material or substance capable of eliciting a specific immune response. an antigen, on the other hand, is a molecular moiety exhibiting the property of antigenicity. it is a substance or material recognised by a primed immune system. such a persisting state of immune readiness may be mediated by humoral immunity (principally via the action of soluble antibodies) or by cellular immunity (as mediated by t-cells, antigen presenting cells (apcs), or other phagocytic cells), or a combination of both, in what is often referred to as a "recall" response. immunogenicity is vital: it is the signature characteristic or property that prompts a certain molecular moiety to evoke a significant immune response. here, we shall strictly limit use of "immunogen" and "antigen" to a sole meaning. here, an "antigen" or an "immunogen" will mean a protein that is capable of educing some kind of discernible response from the host immune system. specifically, and for practical reasons, we will almost exclusively be referring to proteins derived from a pathogenic micro-organism. at present, the prophylaxis engendered by all current effective vaccines-all except bcg-is primarily mediated by the humoral immune system, via soluble antibodies. however, the disease mechanisms of most serious diseases for which vaccines are not available are usually mediated by cellular immunity. thus, for untreated disease, we seek to identify immunogenicity generated principally by cellular responses or by a combination of cellular and humoral responses, rather than by humoral immunity alone. to some extent, subunit vaccines can be thought to represent something of a compromise between vaccines based on attenuated or otherwise inactivated wholeorganisms and the many more recent and more innovative vaccine strategies typified by epitope or poly-epitope vaccines. vaccines based around whole pathogens have long engendered safety concerns [ ] [ ] [ ] . from the lubeck disaster and the cutter incident [ ] [ ] [ ] to the recent mmr debacle, issues over safety, real or imagined, have always dogged the development of vaccines [ , ] . indeed, during the eighteenth century the pre-vaccination practice of variolation against smallpox prefigured much of the current debate over the perceived danger of vaccines [ ] . while the case for vaccines is unanswerable, we should not be complacent. any live vaccine, however extensively attenuated, can revert to a pathogenic, diseaseinducing form. this is currently an on-going issue for polio vaccination [ ] . other issues, particularly the chemical or biological contamination of vaccines during manufacture, remain enduring and persistent problems. undesired immunogenicity, the type leading to severe and pathological immune responses, rather than enduring immune memory, is a concern for both whole-organism and subunitbased vaccines, as well as putative biologics [ ] . immunologists and vaccinologists have thus long sought alternatives to the use of whole organisms as vaccines. subunit vaccines and conjugate vaccines are one such. vaccines based on epitopes, singly or in combination, are another. the diversity of innovations in vaccine design holds much potential for success, but, thus far at least, has proved spectacularly unsuccessful in a clinical context. logically, a vaccine that relies solely on, at most, a few well-chosen epitopes, should be effective, efficacious, and, above-all, safe. epitopes, as peptides, may be cytotoxic and might possibly prompt some kind of inopportune immune response but cannot be infective or revert to infectivity. in many ways, epitopes are closer in size and share many properties with synthetic small molecules; possibly dealing with their pharmacokinetics as such may be better than thinking of them as biologic drugs. in practice, of course, epitope-based vaccines, like subunit vaccines, suffer from poor immunogenicity, necessitating the use of a complex combination of adjuvants and complicated delivery systems. for diverse reasons, including immunogenicity, stimulating protective immune responses against intracellular pathogens remains problematic when using nonreplicating vaccines. why should this be? first, the immune response is very complex, involving both the innate and adaptive immunity, and significant interaction between them. in all probability, and particularly when viewed in the context of the whole population, many epitopes and danger signals are involved; likewise, the many different immune actors, be they acting at the cellular or molecular levels, interact with each other and are subject to complex mechanisms of genetic, epigenetic, and system-level control and regulation. it may be that only the large and complex organism-sized vaccines can induce the range of immune responses necessary across the population to induce protection, since they comprise a potential host of immunogenic molecular moieties, not just a single immunodominant epitope see fig. in that which follows, we shall seek to explore the availability and accessibility of informatic techniques and informatic tools used to identify candidate subunit vaccines of microbial origin. yet, we shall start by adding context with an examination of experimental approaches to antigen discovery: so-called reverse vaccinology. reverse vaccinology already relies on informatics, but, in a sense at least, what we would like to do using informatics is to reproduce as much as is possible the steps inherent in successful reverse vaccinology in silico rather than in vitro. reverse vaccinology, and the necessary computational support, is a much more prevalent means of identifying subunit vaccines [ ] . see fig. . . even today, many experimentalists retain a deep and atavistic distrust of all computation. experimentalists seldom trust the reliability and dependability of computational methodology, choosing to trust instead in what they believe to be infallible, if actually rather elusive, empirical reliability of observations, experiments, and the whole paraphernalia of laboratory experimentation. yet, things are in the process of changing, and this change is likely to accelerate as we move forward into a future that looks more parsimonious and uncertain by the day. vaccines have come a long way from the days when they were prepared directly from the fluids of smallpox pustules or extracts of infected spinal cords. yet vaccine discovery and development remains firmly empirical. many modern vaccines still comprise entire inactivated pathogens. while vaccines targeting papillomavirus, tetanus, hepatitis b, and diphtheria are subunit vaccines, few are recombinant proteins devoid of contaminants. some would argue that the only molecular vaccines are glycoconjugates: oligosaccharides conjugated to immunogenic carrier proteins. conventional empirical, experimental, laboratory-based microbiological ways to identify putative candidate antigens require cultivation of target pathogenic micro-organisms, followed by teasing out their component proteins, analysis in a series of in-vitro and in-vivo assays, animal models and with the ultimate objective of isolating one or two proteins displaying protective immunity. unfortunately, in reality, the process is more complex, and more confusing, and much more confounding as this brief synopsis might suggest. cultivating pathogens outside the environment offered by their host organism can be difficult, even impossible. not every protein is readily expressed in adequate quantities in vitro, and many proteins are only expressed in an intermittent basis during the time course of infection. thus, a considerable number of potential, putative, and possible vaccine candidate antigens could be missed by conventional experimental approaches. reverse vaccinology [ ] [ ] [ ] [ ] has the potential to analyse genomes for potential antigens, initially scanning "open reading frames" (orfs), then selecting proteins because they are open to surveillance by the host immune system. this usually involves some complex combination of informatic-based prediction methodologies. recombinant expression of the resulting set of identified molecules can overcome their reduced natural abundance, which has often prevented us recognising their true potential. by enlarging the repertoire of native antigens, this technology can help to foster the development of a new cohort of vaccines. reverse vaccinology was originally established and has been established by studying neisseria meningitidis, which is responsible for meningococcal meningitis and sepsis. vaccines are currently available for all serotypes, except that serogroup b. n. meningitidis orfs were found initially [ , ] ; proteins were then identified, expressed in vitro and found to be surface exposed. seven proteins elicited immunity over many strains. the culmination of this work was a "universal" vaccine for serogroup b based on five antigens [ ] . this protovaccine, when used with alum as adjuvant, induced murine bactericidal antibodies versus % of meningococcal strains drawn from the world population of n. meningitidis. strain coverage increases to over % when used with cpg or mf as adjuvant. another key illustration is porphyromonas gingivalis, an anaerobic gramnegative bacterium found in the chronic adult inflammatory gum disease periodontitis. initially, orfs were identified [ ] ; of these, protein sequences were open to immune surveillance and were positive for several sera. two antigens were found to be protective in mice. yet another fascinating instance is provided by streptococcus pneumoniae, a prime cause of meningitis, pneumonia, and sepsis [ , ] . in this study, potential orfs were initially identified, with of these proteins being readily expressed. finally, six proteins were seen to induce protection against the pathogen. more recently, other and more advanced experimental techniques, such as microarrays, are beginning to come on-stream, opening up a gallimaufry of possible technologies to the new but maturing field of reverse vaccinology. the following gives but a taste of what is to come. using ribosome display to undertake in-vitro protein selection, weichart et al. [ ] identified within the methicillin-resistant col strain of the virulent human pathogen staphylococcus aureus genes, the majority of which were secreted or surface-localized proteins; of these, % had cell envelope function, % were transporter proteins, and % were virulence factors or toxins. using an ingenious combination of advanced proteomics techniques and in-vitro assays, giefing et al. [ ] identified novel vaccine candidates which prevented infections in children and in the elderly caused by a variety of pneumococcus serotypes; four demonstrating major protection versus sepsis in animals. two leads-stkp (a serine/threonine protein kinase) and pcsb (a structural protein with a role in cell wall separation of group b streptococcus)-showed clear cross-protection as potential candidate vaccines against four separate pneumococcal serotypes. using a whole proteome microarray, and in order to identify protein antigens, eyles et al. [ ] probed serum from balb/c mice previously immunized with a vaccine comprising: killed francisella tularensis and two immunomodulatory adjuvants. eleven out of the top twelve immunogenic antigens were known already as immunoreactive, although further proteins were discovered using this experimental approach. in further work from this consortium, titball and co-workers [ ] constructed a protein microarray of , burkholderia pseudomallei proteins, treated it with patient samples, identifying antigens. this smaller set was treated with a further distinct sera from groups of patients, identifying putative candidate antigens. this survey, brief though it is, helps to highlight the potential power of reverse vaccinology for vaccine discovery. however, since the number of antigens is high, given all the potential difficulties in characterising and expressing them, it is important to note that both computational and experimental techniques and methodologies will doubtlessly omit important and interesting proteins from further analysis, though not necessarily for the same or similar reasons. thus, with the burgeoning discipline of reverse vaccinology, both computational and experimental techniques are in need of constant development and improvement. compared to its role to drug discovery, genomics, and a host of other bioscience sub-disciplines, bioinformatics support for the preclinical discovery and development of vaccine is in its infancy; yet, as interest in vaccine discovery increases, the situation changes. there are two key types of bioinformatics support for vaccine design, discovery, and development. at the technical level, the first of these cannot be properly or meaningfully distinguished from general support for target discovery. it includes the annotation of pathogen genomes, more conventional host genome annotation, and the statistical analysis of immunological microarray experiments. the second form of support concentrates on immunoinformatics, that is, the informatics analysis of immunological problems, principally epitope prediction. b-cell epitope prediction remains defiantly basic or is largely dependent on a sometimes unavailable knowledge of three-dimensional protein structure. both structure- [ ] and data-driven [ ] prediction of antibody-mediated epitopes evince poor results. however, methods developed to predict t-cell epitopes now possess considerable algorithmic sophistication. moreover, they continue to develop and evolve, as well as extend their scope and remit to address new and ever larger and more challenging epitope prediction problems. presently, accurate and reliable t-cell epitope prediction is restricted to predicting the binding of peptides to the major histocompatibility complex (mhc). class i peptide-mhc prediction can be reasonably accurate, or is for properly characterised, wellunderstood alleles [ ] . yet a number of key studies have demonstrated that class ii mhc binding prediction is almost universally inaccurate, and is thus erratic and unreliable [ ] [ ] [ ] . a similar situation persists for structure-driven prediction of mhc epitopes [ , ] . irrespective of poor predictive performance, several other problems exist for epitope prediction. for t cell prediction in particular, a prime concern is with the availability or rather lack of availability of relevant data. it is now known that immunogenic t cell epitopes, thought previously to be peptides no more than amino acids in length, can be or more residues long. longmer epitopes now greatly expand the number of possible peptides open to inspection by t cells [ ] [ ] [ ] [ ] . the inadequate results generated by b cell epitope prediction algorithms may indicate that a fundamental reinterpretation of extant b cell epitope data is necessary before improved methods become feasible. these factors, when taken together, are consistent with the notion that methods relying only on the possession of certain epitopes will not be fully effective when tasked with antigen or immunogen identification. this is supported by information indicating a lack of correspondence between selected antigens and experimentally verified protective proteins. there are many means of identifying antigenic proteins. most focus on the properties of protein sequence and structure, but arguably one of the most insightful is instead to examine properties, both local and global, of the underlying nucleic acid. one notable way is to look for evidence of the horizontal or lateral transfer of so-called pathogenicity islands or pais. horizontal transfer, such as transformation, conjugation, or transduction, is distinct from the vertical transfer of genetic material from an ancestor within its lineage. it typically involves an organism incorporating genetic material from an evolutionarily distant organism without being its offspring. pais are a specific type of genomic island; that is, part of a genome acquired through direct transfer between microbes. a genomic island can occur in distantly related species and may be mono-or multi-functional; there are many sub-classes classified by function. other examples include antibiotic resistance islands, metal resistance, and secretion system islands. the gene products of pais are crucial to the propagation of disease pathogenesis, much as the pais themselves are key to the evolution of pathogenesis. pathogen-associated type iii and type iv secretion systems are, for example, often found together in the same pai. detecting such large (> kb) and discrete clusters of genes clusters, habitually possessing a characteristically atypical g/c content, at least when compared with the remainder of the genome, leads, in turn, to the individual identification within clusters of virulence-associated protein antigens. prokaryotic pais are frequently associated with trna-encoding genes, many are flanked by repeat structures, and many contain fragments of mobile genetic elements such as plasmids and phages. pais can be identified by combining analysis of nucleotide composition and phylogeny, amongst others. composition-based approaches rely on the natural variation between genome sequences from different species. regions of the genome with abnormal composition, as demonstrated by nucleotide or codon bias, may be potentially transferred horizontally. such methods are prone to inaccuracies; these result from inherent genomic sequence variation, such as is seen in highly expressed genes, and the observation that over time the sequences of genomic islands alter to mirror the composition of host genomes. evolution-based approaches seek regions that may have been transferred horizontally by comparing related species. put at its simplest: a putative genomic island present in one species, but absent from several related species, is consistent with horizontal transfer. of course, the island may have been present in the last common ancestor shared by the species compared and subsequently been lost from the other species. a less likely explanation would be that the island arose by mutation and selection in this species and no other. to decide, a body of extra evidence would need to be explored, such as the size of the pai, the mechanistic ease of deletion, the consistent presence of the island in more distantly related species, the relative pathogenicity of island-less species, and the divergence of the genome relative to that of other related species. many methods, which seek to quantify and leverage these somewhat vague notions, are now available [ ] [ ] [ ] . such analysis at the nucleic acid level shares many features in common with approaches used to identify cpg islands in eukaryotic genomes [ ] [ ] [ ] [ ] . recently, langille et al. tested six sequence-composition genomic island prediction methods and found that islandpath-dimob and sigi-hmm had the greatest overall accuracy [ ] . island path was designed to help identify prokaryotic pais, through the visualisation of common pai characteristics such as mobile element-associated genes or atypical sequence composition [ ] . sigi-hmm is a very accurate sequence composition-based genomic island predictor, which combines a hidden markov model (hmm) and codon usage measurement to identify genomic islands [ ] . in another work, yoon et al. coupled heuristic sequence searching methods, which aimed simultaneously to identify pais and individual virulence genes, with composition and codon-usage bias [ ] . exploiting a machine learning approach, vernikos and parkhill sampled the structural features of genomic islands using a hypothesis-free, bottom-up search, with the objective of explicitly quantifying the contribution made by each feature to the overall structure of different genomic islands [ ] . arvey et al. sought to identify large chromosomal regions with atypical features using a general divergence measureable to quantify the compositional difference between genomic segments [ ] . islandpick is a comparative genomic island predictor, rather than a composition-based approach, that can identify very probable genomic islands and very probable non-genomic islands within investigated genomes but does require that several phylogentically related genomes are available [ ] . observing pais as having a g + c composition closer to their host genome, wang et al. used so-called genomic barcodes to identify pais. these barcodes are based on the fact that the frequencies of -mers to -mers, and their reverse complement, are very stable across a whole genome when using a window size of over , bps and that this constituted a characteristic signature for genomes [ ] . the ready detection of pais, as a tool in computational reverse vaccinology, has been greatly aided by the deployment of several web-based resources. a key example of a server that successfully integrates several accurate genomic island predictors is islandviewer [ ] , which combines the methods: islandpick [ ] , islandpath [ ] , and sigi-hmm [ ] and is available at the url: http://www. pathogenomics.sfu.ca/islandviewer/query.php. the gui facilitates the visualisation of genomic islands and downloading of data at the gene and chromosome levels in a variety of formats. another important, web-accessible resource is paidb or the pai database. this is a wide-ranging database of pais, containing distinct pais and genbank accessions present in strains of pathogenic bacteria [ ] . paidb may be accessed via the url: http://www.gem.re.kr/paidb. thus, alternative techniques and methodologies are required in order to select and to rank proteins likely to be protective antigens and thus candidate vaccines. below, we shall explore three key approaches: subcellular location prediction, alignment-dependent sequence similarity searching, and alignment-independent empirical statistical approaches. in this section, we consider, perhaps, the clearest and cleanest way to identify potential new antigens in any microbial genome to alignment-dependent sequence similarity searching. there are two complimentary but distinct ways of identifying the immunogenicity of a protein from its sequence. one is to look for significant similarity to proteins of known immunogenicity. this idea seems so straightforward as to be almost facile. the other approach is somewhat less obvious conceptually but almost as straightforward logistically and involves seeking to identify antigens as proteins without discernible sequence similarity to any host protein. let us turn to the first of these two alternatives. let us begin by stating or rather reiterating the obvious. if we know the sequence of an existing antigen or antigens, we can use sequence searching to find similar sequences in the target genome [ , ] . any candidate antigens selected by this process can then be selected for further verification and validation. the same old, familiar caveats apply here: are chosen thresholds appropriate? are high-scoring matches an artefact or are they real and meaningful? the litany of such conditions is all too familiar to anyone well versed in sequence similarity searching. clearly, when a sequence search is run, using blast or fasta , for example, an enormously long list of nearly identical proteins might ensue, or one that does not get any hits at all, or almost any intervening result might be obtained. as reflective practitioners, we must judge which result can be classified as useful and which cannot, and in so doing, identify sets of suitable thresholds, above which we expect usefulness and below which we might anticipate little or no utility. thresholds are contingent upon the sequence family studied, as well as being dependent solely on the problem investigated. thus heuristically identified cut-offs are desirable, but much thinking and empirical investigation are required to select appropriate values. of course, the process adumbrated above presupposes that sufficient antigenic protein sequences are known. compilation of this data is the role of the database. recently, extensive literature mining, coupled with factory-scale experimentation, has created many functional immunology databases, although databases, such as syfpeithi [ , ] , focussing on cellular immunology-primarily mhc processing, presentation, and t cell recognition-have existed for - years. arguably, the best extant database is the hiv molecular immunology database [ ] , although clearly the depth of the database is at the expense of generality and breadth. other recent databases include mhcbn [ , ] and epimhc [ ] , amongst many others. two databases, warrant particular attention: antijen [ ] , formerly known as jenpep [ , ] ; and iedb [ ] . implemented as a relational postgresql database, antijen integrates a wideranging set of data items, much of which is not stored by other databases. in addition to the kind of cellular immunological information familiar from syfpeithi, such as mhc binding and t cell data, antijen additionally archives b cell epitopes and also includes a significant stockpile of quantitative data: kinetic, thermodynamic, as well as functional, including measurements of immunological peptide-protein and protein-protein interactions. the iedb database is considerably more extensive than other equivalent database systems, benefiting from the input of dedicated epitope sequencing projects. iedb has come to eclipse other work in this area. although both antijen and iedb are full of epitope-focussed information of many flavours, they remain incomplete concerning immunogenic antigens. fortuitously, specific antigen-orientated-rather than epitope-focusseddatabases are starting to be available. arguably, the most obvious and most unambiguous example of an antigen is virulence factor (vf): proteins, such as toxins, able to induce disease directly by attacking a host. analysis of known pathogens has allowed recurring vf systems of + distinct proteins. often, sets of vfs exist as discrete, distinct genome-encoded pais, as well as being more widely spread through the genome. clearly, antigens do not need to be vfs in order to be immunogenic and thus candidates for subunit vaccines. instead, they need only be accessible to the immune system. they do not need to directly or indirectly mediate infection. thus, other databases are needed which capture, collate, and archive the burgeoning plethora of antigen-orientated data. recently, we have helped developed a very different database: antigendb [ ] . it contains over antigens collated from the primary scientific literature, as well as other sources. another related database system has been christened violin (vaccine investigation and online information network) [ ] , which allows straightforward curation and the analysis and comparison of research data across diverse pathogens in the context of human medicine, animal models, laboratory model systems, and natural hosts. as we outline above, in addition to identifying sequence similarity to known antigens, another idea gaining ground is that the immunogenicity of an antigen is solely determined by the absence of similarity to host proteins. some think this is the prime determinant of potential protein immunogenicity [ , ] . such ideas are supported by the belief that immune systems are actively educated to lack reactivity to self-proteins [ ] , a process-often termed "immune tolerance"-which is generated via epitope-specific mechanisms [ , ] . what we really want is a meaningful measure of the "foreignness" of a protein correlating with its immunogenicity. usually, "evolutionary distance" substitutes for "foreignness." clearly, such an evolutionary distance must be specified in terms of biomacromolecular structures or sequences. but, is this practically useful for selecting candidate vaccines? another way to formulate this idea is to say that the probability that a protein is immunogenic is exclusively a product of its dissimilarity, at the whole-sequence or sequence-fragment level, to each and every protein contained within the host proteome. most search software is well matched to this problem. in terms of fragment length, the typical length of an epitope might seem logical, since the epitope is the molecular moiety typically recognised during the initial phase of an immune response. yet, even at the epitope level-say a peptide of - amino acid residues-even a single conservative mutation or mismatch in an otherwise identical match might prove significant. single sequence alterations may totally abrogate or significantly enhance neutralising antibodies binding or recognition by the machinery of cellular immunology. we have attempted to benchmark sequence similarity and correlate it with immunogenicity in order to explore the potential of this idea in a quantitative fashion. to that end, we examined the differences between sets of antigens and non-antigen using sequence similarity scores. we looked specifically at sets of known non-antigenic and antigenic protein sequences from six sources: bacteria, viruses, fungi, and parasites, as well as allergens and tumours [ ] [ ] [ ] , comparing pathogen sequence to those from humans and mice using blast [ ] . most non-antigenic and antigenic sequences were non-redundant; implying a lack of homologues between pathogens and host proteomes, although certain parasite antigens, such as catalases and heat shock proteins, had a much greater level of similarity. we were not able to determine a suitable and appropriate threshold based on the hypothesis of non-redundancy to the host's proteome, suggesting that this is not a viable solution to vaccine antigen identification. however, rather than looking at nucleic acid sequences, or at protein sequences using an alignment-based approach, a new set of techniques, based upon alignmentfree techniques, has been and is being developed; as this approach begins to show significant potential, we shall examine it next. proteins accessible to immune system surveillance are assumed to lie external to the microbial organism or be attached to its surface rather than being sequestered and sequestrated within the cell. for bacteria, this means being located on-or in-the outer membrane surface or being secreted. thus, being able to accurately predict the physical location of a putative antigen can provide considerable insight into the likelihood that a particular protein will prove to be an immunogenic and possibly protective. there are two basic kinds of prediction method for identifying subcellular location: manual rule construction and the application of data-driven machine learning methods. data used to discriminate between compartments include sequence-derived features of the protein, such as hydrophobic regions; the amino acid composition of the whole protein; the presence of certain specific motifs; or a combination thereof. accuracy differs significantly between different methods and different compartments, mostly resulting from the deficiency and inconsistency of data used to derive models. gross overall sequence similarity is unable to predict protein sub-cellular location reliably or accurately. even nearly identical protein sequences may be found in distinct locations, while there are many proteins which exist simultaneously at several distinct locations within the cell, often having equally distinct functions at these different sites [ ] . eukaryotes and prokaryotes have quite distinct subcellular compartments. the number of such compartments used in prediction studies varies. a common schema reduces prokaryotic to three compartments (cytoplasmic, periplasmic, and extracellular) and eukaryotic cells to four compartments (nuclear, cytoplasmic, mitochondrial, and extracellular). other structural classifications evince in excess ten eukaryotic compartments. ten compartments maybe a conservative estimate, such is the complex richness of sub-cellular structure. any prediction method must account for permanent, transient, and multiple locations, and, in addition, multi-protein complexes and membrane-bound organelles as possible sites. numerous signal sequences exist. several methods predict lipoproteins. the prediction of proteins translocated via the tat-dependent pathway is important but has yet to be addressed properly. however, amongst binary, single-outcome approaches, signalp is probably the most accurate and reliable method available. it uses neural networks to predict the presence and probable cleavage sites of type ii or n-terminal spase-i-cleaved secretion signal peptides [ ] [ ] [ ] . this signal is common to both prokaryotic and eukaryotic organisms. signalp has recently been enhanced with a hmm intended to discriminate cleaved from uncleaved signal anchors. a limitation of signalp is its proclivity to over-predict: it cannot properly discriminate reliably between a number of very similar yet functionally different signal sequences, regularly predicting lipoproteins and integral membrane proteins as type ii signals. many methods have been devised capable of dividing a genome or virtualproteome between the various subcellular locations of a eukaryotic or prokaryotic cell. psort is a good example; it is a multicategory prediction procedure, comprising many different programmes [ ] [ ] [ ] [ ] . psort i predicts subcellular compartments, while psort ii predicts ten different locations. ipsort deals with several compartments: chloroplast, mitochondrial, and proteins secreted from the cell, while psort-b focuses solely on predicting bacterial sub-cellular locations. another effective programme is hensbc [ ] . hensbc can assign gene products to one of four different types (nuclear, mitochondrial, cytoplasmic, or extracellular) with an accuracy of about eight out of ten for gram-negative bacteria. another programme, subloc [ ] , predicts prokaryotic subcellular location divided between three compartments. another programme is gpos-ploc [ ] , which integrates several basic classifiers. other methods include phobius [ ] , lipop . [ ] , and tatp . [ ] . a comparison of several such programmes, using mycobacterial proteins as a gold standard [ ] , showed subcellular localisation prediction and possessed high predictive specificity. we have developed a set of methods which predict bacterial subcellular location. using a set of methods for lipoprotein, tat secretion, and membrane protein prediction [ ] [ ] [ ] [ ] [ ] [ ] [ ] , three different bayesian network architectures were implemented as software pipelines able to predict specific subcellular locations, and two serial implementations using a hierarchical decision structure, and a parallel implementation with a confidence-level-based decision engine [ ] . the soluble-rooted serial pipeline performed better than the membrane-rooted predictor. the parallel pipeline outperformed the serial pipeline but was significantly less efficient. genomic test sets proved more ambiguous: the serial implementation identified more of the proteins of known location yet more accurate predictions are made overall by the parallel implementation. the implications of this work are clear. the complexity of subcellular structures must be integrated fully into sub-cellular location prediction. in extant studies, many important cellular organelles are not considered; different routes by which proteins can reach the same compartment are ignored; and proteins existing simultaneously at several locations are likewise discounted. clearly, combining high specificity predictors for each compartment appropriately must be the way forward [ ] . many difficulties, problems, and quandaries persist; the most keenly felt is the lack of high-quality, verified, and validated datasets which unambiguously established the location of well-characterised proteins. this dearth is particularly serious for certain types of secreted protein, such as type iii secretion. in a similar manner, considerably more work is required to accurately predict the locations for proteins of viral origin; while certain studies are encouraging [ , ] , the complexity of viral interaction with host organisms continues to confound attempts at analysis. predicting antigens in silico typically utilise bioinformatics tools. such tools can identify signal peptides or membrane proteins or lipoproteins successfully, yet the majority of algorithms tend to depend on motifs characteristic of antigens or, more generally, sequence alignment as the principal arbiter of definitive and meaningful sequence relationships. this is potentially a problem of some magnitude, particularly given the wide range of evolutionary rates and mechanisms amongst microbial proteins. certain protein families do not, however, show obvious or significant sequence similarity, despite having common biological properties, functions, and three-dimensional structures [ , ] . thus alignment-based approaches may not always produce useful and unequivocal results, since they assume a direct sequence relationship that can be identified by simple sequence search techniques. immunogenicity, as a signature characteristic, may be encrypted within the structure and/or sequence instead. this may be encoded so cryptically or so subtlety as to completely confound or at least mislead conventional sequence alignment protocols. discovery of utterly novel and previously unknown antigens will be totally stymied by the absence of similarity to known antigenic proteins. alignment-dependent methods tend to dominate bioinformatics and, by extension, immunoinformatics. several authors have chosen to look at alternative strategies, implementing so-called alignment-independent or alignment-free techniques. the first authors to do so were mayer et al., who reported that protective antigens had a different amino acid composition compared to control groups of nonantigens [ ] . such a result is unsurprising since it has long been known that the structure and sequence composition of proteins adapted to the different redox environments of different sub-cellular compartments [ ] . mayer's analysis was formulated primarily in terms of univariate comparisons of antigens versus controls for different properties. subsequently, we explored bivariate comparison in terms of easily comprehensible scatter-plots. see fig. . for representative examples. what their results ably demonstrate is the potential for the discrimination of antigens and non-antigens by the appropriate selection of orthogonal descriptors. the challenge, of course, is to identify a robust choice of descriptors which are capable of extrapolating as well interpolating when used predictively. progressing beyond this type of analysis, and synergising with our other work on alignment-independent representation [ ] [ ] [ ] [ ] [ ] , we have initiated the development of new methods to differentiate antigens-and thus potential vaccine candidates-and non-antigens, using more sophisticated alignment-free approach to sequence representation [ , ] . rather than focus on epitope versus nonepitope, our approach utilises data on protective antigens derived from diverse pathogens to create statistical models capable of predicting whole-protein antigenicity. our alignment-independent method for antigen identification uses the auto cross covariance (acc) transformation originally devised by wold et al. [ , ] to transform protein sequences into uniform vectors. the acc transform has found much application in peptide prediction and protein classification [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] . in our method, amino acid residues are represented by the well-known and well-used z descriptors [ ] [ ] [ ] , which characterise the hydrophobicity, molecular size, and polarity of residues. our method also accounts for the absence of complete independence between distinct sequence positions. we initially applied our approach to groups of known viral, bacterial, and tumour antigens, developing models capable of identifying antigen. extra models were subsequently added for fungal and parasite antigens. for bacterial, viral, and tumour antigens, models had prediction accuracies in the - % range [ , , ] . for the parasite and fungal antigens, models had good predictive ability with - % accuracy. these models were incorporated into a server for protective antigen prediction called vaxijen [ ] (url: http://www.darrenflower.info/ vaxijen). vaxijen is an imperfect but encouraging start; future research will yield significantly more insight as well-characterised protective antigens increase significantly in number [ ] . as we have said, a number of bioinformatics problems are unique to the discipline of immunology: the greatest of these is the accurate quantitative prediction of immunogenicity. this chapter has in its totality been suffused and pervaded by the idea of immunogenicity and the challenge of predicting this property in silico. such an endeavour is confounding, yet exciting, and, as a key instrument in developing better, safer, more effective vaccines, is also of undisputed practical utility. successful immunogenicity prediction is at its simplest made manifest through the identification of b cell or t cell epitopes. epitope recognition, when seen as a chemical event, may be understood in terms of the relationships between apparent biological function or activity and basic physicochemical properties. delineating structure-activity or property-activity relationships of this kind is a key concern of immunoinformatics. at the other end of the spectrum, immunogenicity can be viewed is a cohesive, integrated, system property: a property of the entire and complete immune system and not a series of individual and isolated molecular recognition events. thus, the task of predicting systems-level immunogenicity is in all likelihood manifold more demanding than predicting peptide-binding say. the clinical manifestation of vaccine immunogenicity arises from the complex amalgam of many contributing extrinsic and intrinsic factors, which includes pathogen-side and host-side properties, as well as those just coming directly from proteins themselves. see fig. . . protein-side properties include the aggregation state of candidate vaccines and the possession of pamps. pathogen-side properties are clearly properties intrinsic to the pathogen, including expression levels of the antigen, the time-course of this expression, as well as its subcellular location. socalled host-side properties are innate recognition properties of host immunity, and most obviously include t cell epitopes or b cell epitopes. a bona fide candidate antigen should be available for immune surveillance and thus highly expressed, constitutively or transiently, as well as having several epitopes. a protein without immunogenicity would logically lack all or some of these characteristics. as a prediction problem, this is, to say the least, not uncomplicated; clearly consisting of a great variety of difficult-to-compute stages. in terms of mechanism, many of these stages are poorly understood. yet, each can be addressed using standard computational and statistical tools. they can all be predicted, however, presupposing, of course, the presence of relevant data in sufficient quantity. one of the strongest messages to emerge from this review is that immunogenicity is a strongly multi-factorial property: some protein antigens are immunogenic for one reason, or set of reasons, and other immunogenic proteins will be so for another possibly tangential reason or set of reasons. each such causal manifold is itself complex and potentially confusing. thus, the prediction of immunogenicity is a problem in multi-factorial prediction, and the search for new antigens is a search through a multi-factorial landscape of contingent causes and discombobulating decoys. some of the evidence will be highly precise and quantitative. the kind provided by predictive immunoinformatics, for example. this typically yields exact values for, say, the binding affinity of a peptide to a protein component of the immune system, or an unequivocal yes or no answer to the question: is this peptide sequence an epitope? however, for each such exact prediction, we have some notional associated probability concerning how reliable we regard this result. different methods evince a range of accuracy, which, in practice, equate to probabilities of reliability: we naturally have more confidence and assume a greater reliability for a highly accurate prediction versus one of average predictability, though it can still give wrong predictions and generally inaccurate predictors may work well for a specific subset of the data. other types of forms of evidence will have a distinctly more anecdotal flavour. take, for example, the case of bacterial exotoxins. together with endotoxins, such as lps, and so-called superantigens, exotoxins form the principal varieties of toxin secreted by pathogenic bacteria. exotoxins have evolved to be the most toxic substances known to science: in terms of the median lethal dose, botulinum toxin-the active ingredient of botox and causative agent of botulism, amongst others-is about ten times as lethal as radioactive isotope polonium- and a million times more deadly than mainline poisons, such as arsenic or potassium cyanide. virtually, all such potent bacterial exotoxins comprise two functionally distinct subunits, either separate proteins or distinct domains, usually denoted a and b. the a subunit is habitually an enzyme, such as a protease, which modifies specific protein targets, thus disrupting key cellular processes with host cells. the b subunit is a protein which binds to host cell surface lipids or proteins, enabling the toxin to be internalised efficiently. the high specificity of this dual action lends exotoxins much of their remarkable lethality. exotoxins are also extremely immunogenic, inducing the immune systems to produce high-affinity neutralising antibodies against them, and thus make excellent targets for vaccinology. a toxoid-a toxin which has been treated or inactivated, often by formaldehyde-is in essence a form of subunit vaccine and, as such, requires adjuvant to induce adequate immune responses. vaccines targeting tetanus and diphtheria, which usually need boosting every decade, are based on toxoids, albeit typically combined with pertussis toxin acting as an adjuvant. poisoning by exotoxins, on the other hand, requires treatment with antitoxin comprising preformed antibodies. however, and say that we were offered a newly sequenced pathogen genome, is such a classification for ab toxins helpful when trying to identify a potential exotoxins? the answer is neither yes nor is it no, but lies somewhere between these extremes. assuming we had extant knowledge or a reliable method predicting the presence of structural and functionally distinct domains, this very simple ruleof-thumb would become a useful tool for eliminating large numbers of possible toxin molecules. it would not directly identify an antigen but would enormously reduce the workload inherent in their discovery. as well as needing more and more reliable predictors, we also need a way of combining the information we gather from any set of reliable predictors to which we have access. thus, when analysing a pathogen genome, what we seem to need, at least in order to identify immunogenic proteins, is both a set of reliable and robust tools and a cohesive expert system within which to embed them. such systems, albeit still at a relatively crude and faltering level, do exist. because there is an implicit hierarchy of one prediction being based on others, there is a need to balance and judge different pieces of probabilistic evidence. an effective expert system should be capable of such a feat. to a first approximation, an expert system is a computer programme that undertakes tasks that might otherwise be prosecuted by a human expert ostensively by simulating the apparent judgement and behaviour of an individual or organization with expertise and experience within a particular discipline. an expert system might make financial forecasts, or play chess; it might diagnose human illnesses or schedule the routes of delivery vehicles. to create an expert system, one first needs to analyse human experts and how they make decisions, before translating this into rules that a computer can follow. such a system leverages both a knowledge base of accumulated expertise and a set of rules for applying such distilled knowledge to particular situations in order to solve problems. sophisticated expert systems can be updated with new knowledge and rules and can also learn from the success of its prediction, again mirroring the behaviour of properly performing experts. at the heart then of an expert system is the need to combine evidence in order to reach decisions. combining evidence, and reaching a decision based on that combined evidence, is no easier in the laboratory, be that virtual or actual, than it is in the court room. the problem of combining evidence is encountered across the disciplines, and various solutions have arisen in these different areas. within bioinformatic prediction, a particular variety of evidence combination, so-called meta-prediction, is a now a well-established strategy [ , ] . this approach seeks to amalgamate the output of various predictors, typically internet servers, in an intelligent way so that the combined result is more accurate than any of those coming from a single predictor. indeed, combining results from multiple prediction tools does often increase overall accuracy. a consensus strategy was first proposed by mallios [ ] , who combined syfpeithi [ , , ] , propred [ , ] , and the iterative stepwise discriminant analysis meta-algorithm [ ] [ ] [ ] . multipred [ ] integrates hmms and artificial neural networks (ann). six mhc class ii predictors were combined by dai and co-workers [ ] [ ] [ ] basing its overall prediction on the probability distributions of the different scores. trost et al. have used a heuristic method to address class i peptide-mhc binding [ ] . wang et al. [ ] applied a consensus method to calculate the median rank of the top three predictive methods for each mhc class ii protein initially evaluated so as to rank all possible -, -, and -mers from one protein. this rank was used to identify the top % of peptides from each protein. in probabilistic reasoning, or reasoning with uncertainty, there are many ways to represent espoused beliefs-or, in our domain, predictions-that effectively encode the uncertainty of propositions. these include fuzzy logic and the evidential method, among many others. for quantitative data, information fusion, in its various guises [ ] , is one robust route to effective combination. another requires us to enter the world of bayesian statistics, or, at least, a special thread within it. bayes theory, and the ever-expanding strand of statistics devolving from it, is concerned primarily with updating or revising belief in the light of new evidence, while so-called dempster-shafer theory [ ] is concerned not with the conditional probabilities of bayesian statistics but with the direct combination of evidence. it extends the bayesian theory of subjective probability, by replacing bayesian probabilities with belief functions that describe degrees of belief for one question in terms of probabilities for another and then combines these using dempster's rule for merging degrees of belief when based on independent lines of evidence. such belief functions may or may not have the mathematical properties of probabilities but are seemingly able to combine the rigor of probability theory with the flexibility of rule-based approaches. several expert systems of different flavours and hues have now become available within the vaccinology arena. sundaresh et al. developed a specialist software package for the analysis of microarray experiments that could easily be classified as an expert system and used it in the area of reverse vaccinology. this package, which was written in the open-source statistical package r, was used to help analyse a variety of complex microarray experiments on the bacteria f. tularensis, a category a bio-defense pathogen [ ] . this programme implements a two-stage process for diagnostic analysis: selection of antigens based on significant immune responses coupled with differential expression analysis, followed by classification of measured antigen responses using a combination of k-means clustering, support vector machines, and k-nearest neighbours. we have already discussed vaxijen [ , , ] , and the related server epijen [ ] , which combines various methods for identifying epitopes within extant proteins. these two servers can also be classified as vaccine-related expert systems. nerve is another expert system, which has been developed to help automate aspects of reverse vaccinology [ ] . using nerve, the prioritisation of potential candidate antigens consists of several stages: prediction of subcellular localisation; is the antigen an adhesion?; identification of membrane-crossing domains; and comparison to pathogen and human proteomes. candidates are filtered then ranked and putative antigens graded by provenance and its predicted immunogenicity. the web-based expert system, dynavacs [ ] , was developed to facilitate the efficient design of dna vaccines and is available in the url: http://miracle. igib.res.in/dynavac. it takes a structured approach for vaccine design, leveraging various key design parameters, including the choice of appropriate expression vectors, safeguarding efficient expression through codon optimization, ensuring high levels of translation by adding specific sequence signals, and engineering of cpg motifs as adjuvant mechanisms exacerbating immune responses. it also allows restriction enzyme mapping, the design of primers, and lists vectors in use for known dna vaccines. vaxign is another expert system developed to help facilitate vaccine design [ ] . vaxign undertakes dynamic vaccine target prediction from sequence. methodologically, it combines protein subcellular location prediction with prediction of transmembrane helices and adhesins, analysis of the conservation to human and/or mouse proteins with sequence exclusion from the genomes of nonpathogenic strains, and prediction of peptide binding to class i and class ii mhc. as a test, vaxign has been used to predict vaccine candidates against uropathogenic escherichia coli. however, nerve and its various and varied siblings are tasked with such a confounding and difficult undertaking that they are obliged to fall somewhat short of what is required. an obvious first step in tackling the greater problem is to address first subcellular location prediction. then, we can look at antigen presentation, modelling for each component step, before building these into a fully functional model. we can also develop empirical approaches-such as vaxijen [ , , ] . we must also factor in antibody-mediated issues, properly address pamps, post translational danger signals, expression levels, the role of aggregation, and the capacity of molecular adjuvants to enhance the innate immunogenicity to usable levels. see fig. . . the value of vaccines is not yet unchallenged. however, most reasonable people would, in all probability, agree that they are a good thing, albeit with a few minor provisos. the idea underlying all vaccines is a strong and robust one: it is in the reification-that is, the realisation, manifestation, and instantiation-of this abstract concept that the trouble lies, if indeed trouble there is. existing vaccines are by no means perfect; again, most sensible and well-informed people would no doubt acknowledge this also. one might argue that their intrinsic complexity, and the highly empirical nature of their discovery over decades, and the fraught nature of their manufacture, has much to answer in this regard. why should this be? in part, it is due to the extreme complexity of immune response to an administered vaccine, which is largely specific to each individual or at least is different in different sub-groups within the totality of the vaccinated population. the immune responses is comprised, at least for whole-pathogen vaccines, of the adaptive immune response to multiple b cell and t cell epitopes as well as the responses made by the innate immune responses to diverse molecular structures, principally pamps. when one considers also the degree to which such a repertoire of responses is augmented and modified by the action of additives, be they designed to increase the durability and stability of vaccines or be they adjuvants, which are intended to raise the level of immune reactions. add in stochastic and coincidental phenomena, such as reversion to pathogenicity, and we can see immediately that navigating our way through the vaccine minefield is no easy task. all such problems engendered by this intrinsic complexity are themselves compounded by our comparatively weak understanding of immunological mechanisms, since, if we understood the mechanism of responses well enough, we could and would have designed our vaccines to circumvent these issues. part of the answer to this cacophony of conflicting and confounding quandaries is the newly emergent discipline of vaccinomics. a proper understanding of the relationships between gene variants and vaccine-specific immune responses may help us to design the next generation of personalised vaccines. vaccinomics addresses this issue directly. it seeks to identify genetic factors mediating or moderating vaccine-induced immune responses, which are known to be extremely variable within population. much data indicate that host genetic polymorphisms are key determinants of innate and adaptive response to vaccination. hla genes, non-hla genes, and genes of the innate immunity all contribute, and do so in many ways, to the variation observed between individuals for immune responses to microbial vaccines. vaccinomics offers many techniques that can help illuminate these diverse phenomena. principal amongst these are population-based gene/snp association studies between allele or snp variation and specific responses, supplemented by the application of next-generation sequencing technology and microarray approaches. yet, and for all this nay-saying and gainsaying, vaccines and vaccination have demonstrated their worth time after time; yet, to justify the continuing faith we invest in them, new and better ways of making safer and more focussed vaccines must be found. most current vaccines work via antibody-mediated mechanisms; and most target viruses and the diseases they cause. unfortunately, the stock of such disease targets is dwindling. low-hanging fruit has long since been cut down. only fruit that is well out of reach remains. vaccines based on apcs and peptides are new but unproven strategies; most modern vaccine development relies instead on effective searches for vaccine antigens. one of the clearest points to emerge from such work is that there are many competing concepts, thoughts, and ideas that may confound or help efficient identification of immune reactive proteins. certain such ideas we have outlined. some are indisputably persuasive, even compelling, yet many strategies-and the technical approaches upon which they are based-have singly failed to deliver on their promise. long ago, and based on his lifetime's experience of all things immunological, professor peter cl beverley sketched out a paradigm for protein-focussed vaccine development, which we have formalised further, and which schema is summarised in fig. . . some of his factors overlap with the factors from fig. . . he identified many of the factors that potentially contribute to the immunogenicity of proteins, be they of pathogen origin or another source entirely, and also other features which might make proteins particularly suitable for becoming candidate vaccines. of these, some are as-yet beyond prediction, such as the attractiveness for apcs or the inability to down-regulate immune responses. the status of proteins as evasins is currently only possibly addressable through sequence similarity-based approaches and likewise for the attractiveness for uptake by apcs is again, though possible there exist motifs, structural or sequence, which could be identified. currently, the dearth of relevant data precludes prediction of such properties; and, while it is possible to predict some of these properties with some assurance of success, and others are predictable but only incidentally, overall, we are still some way from realising the dream embodied in fig. failure occurs for simple reasons: we deal with simplified abstractions and cannot hope to capture all that which is required for prediction by looking superficially at a single factor. protein immunogenicity comes instead from the dynamic combination of innumerable contributing factors. this is by no means a facile or easily solved informatics conundrum. a vaccine candidate should have epitopes that the host recognises, be available for immune surveillance, and be highly expressed. factors mediating protein immunogenicity are many; possession of b or t cell epitopes, post-translational danger signals, sub-cellular location, protein expression levels, and aggregation state amongst them. predicting such diverse, complex, confounding properties is-and remains-a challenge. vaccine antigens, once discovered, should, ultimately, and with appropriate manipulation, together with an apt, apposite, and appropriate delivery system and the right choice of adjuvant, become first a candidate for clinical trials, before, hopefully, progressing to regulatory approval. we require an integrative, systemsbiology approach to solve this problem. no single approach can be applied universally and with success; what we crave is the full integration of numerous equally wakefield's article linking mmr vaccine and autism was fraudulent computer-aided biotechnology: from immuno-informatics to reverse vaccinology harnessing bioinformatics to discover new vaccines new vaccines against tuberculosis bioinformatics for vaccinology lessons learned concerning vaccine safety vaccines: the real issues in vaccine safety target the fence-sitters an american tragedy'. the cutter incident and its implications for the salk polio vaccine in new zealand - the cutter incident, years later poliomyelitis following formaldehyde-inactivated poliovirus vaccination in the united states during the spring of . ii. relationship of poliomyelitis to cutter vaccine bioinformatics for vaccinology vaccine-derived poliovirus (vdpv): impact on poliomyelitis eradication advances in predicting and manipulating the immunogenicity of biotherapeutics and vaccines the use of genomics in microbial vaccine development post-genomic vaccine development microbial genomes and vaccine design: refinements to the classical reverse vaccinology approach biotechnology and vaccines: application of functional genomics to neisseria meningitidis and other bacterial pathogens complete genome sequence of neisseria meningitidis serogroup b strain mc identification of vaccine candidates against serogroup b meningococcus by whole-genome sequencing a universal vaccine for serogroup b meningococcus identification of vaccine candidate antigens from a genomic analysis of porphyromonas gingivalis use of a whole genome approach to identify vaccine molecules affording protection against streptococcus pneumoniae infection identification of a universal group b streptococcus vaccine by multiple genome screen functional selection of vaccine candidate peptides from staphylococcus aureus whole-genome expression libraries in vitro discovery of a novel class of highly conserved vaccine antigens using genomic scale antigenic fingerprinting of pneumococcus with human antibodies immunodominant francisella tularensis antigens identified using proteome microarray a burkholderia pseudomallei protein microarray reveals serodiagnostic and cross-reactive antigens antibody-protein interactions: benchmark datasets and prediction tools evaluation benchmarking b cell epitope prediction: underperformance of existing methods prediction of mhc-peptide binding: a systematic and comprehensive overview in silico tools for predicting peptides binding to hlaclass ii molecules: more confusion than conclusion on evaluating mhc-ii binding peptide prediction methods evaluation of mhc-ii peptide binding prediction servers: applications for vaccine research a critical cross-validation of high throughput structural binding prediction methods for pmhc limitations of ab initio predictions of peptide binding to mhc class ii molecules t cell receptor recognition of a 'super-bulged' major histocompatibility complex class i-bound peptide high resolution structures of highly bulged viral epitopes bound to major histocompatibility complex class i. implications for t-cell receptor engagement and t-cell immunodominance have we cut ourselves too short in mapping ctl epitopes? a long, naturally presented immunodominant epitope from ny-eso- tumor antigen: implications for cancer vaccine design identification and characterization of pathogenicity and other genomic islands using base composition analyses a novel strategy for the identification of genomic islands by comparative analysis of the contents and contexts of trna sites in closely related bacteria mobilomefinder: web-based tools for in silico and experimental discovery of bacterial genomic islands cpgcluster: a distance-based algorithm for cpg-island detection cpgif: an algorithm for the identification of cpg islands identifying cpg islands by different computational techniques cpg_mi: a novel approach for identifying functional cpg islands in mammalian genomes evaluation of genomic island predictors using a comparative genomics approach islandpath: aiding detection of genomic islands in prokaryotes score-based prediction of genomic islands in prokaryotic genomes using hidden markov models a computational approach for identifying pathogenicity islands in prokaryotic genomes resolving the structural features of genomic islands: a machine learning approach detection of genomic islands via segmental genome heterogeneity prediction of pathogenicity islands in enterohemorrhagic escherichia coli o :h using genomic barcodes islandviewer: an integrated interface for computational identification and visualization of genomic islands towards pathogenomics: a web-based resource for pathogenicity islands identification and characterization of a novel family of pneumococcal proteins that are protective against sepsis functional genomics of pathogenic bacteria syfpeithi: database for searching and tcell epitope prediction syfpeithi: database for mhc ligands and peptide motifs hiv sequence databases mhcbn . : a database of mhc/tap binding peptides and t-cell epitopes mhcbn: a comprehensive database of mhc binding and non-binding peptides epimhc: a curated database of mhcbinding peptides for customized computational vaccinology antijen: a quantitative immunology database integrating functional, thermodynamic, kinetic, biophysical, and cellular data jenpep: a novel computational information resource for immunobiology and vaccinology jenpep: a database of quantitative functional peptide data for immunology the immune epitope database . antigendb: an immunoinformatics database of pathogen antigens violin: vaccine investigation and online information network epitopic peptides with low similarity to the host proteome: towards biological therapies without side effects peptimmunology: immunogenic peptides and sequence redundancy primer: mechanisms of immunologic tolerance recent advances in immune modulation cutting edge: contributions of apoptosis and anergy to systemic t cell tolerance discriminating antigen and non-antigen using proteome dissimilarity iii: tumour and parasite antigens discriminating antigen and non-antigen using proteome dissimilarity ii: viral and fungal antigens discriminating antigen and non-antigen using proteome dissimilarity: bacterial antigens gapped blast and psi-blast: a new generation of protein database search programs single proteins might have dual but related functions in intracellular and extracellular microenvironments locating proteins in the cell using targetp, signalp and related tools improved prediction of signal peptides: signalp . a comprehensive assessment of n-terminal signal peptides prediction methods wolf psort: protein localization predictor secreted protein prediction system combining cj-sphmm, tmhmm, and psort psort-b: improving protein subcellular localization prediction for gram-negative bacteria psort: a program for detecting sorting signals in proteins and predicting their subcellular localization predicting protein subcellular locations using hierarchical ensemble of bayesian classifiers based on markov chains subloc: a server/client suite for protein subcellular location based on soap gpos-ploc: an ensemble classifier for predicting subcellular localization of gram-positive bacterial proteins advantages of combined transmembrane topology and signal peptide prediction-the phobius web server prediction of lipoprotein signal peptides in gram-negative bacteria prediction of twin-arginine signal peptides validating subcellular localization prediction tools with mycobacterial proteins toward bacterial protein sub-cellular location prediction: single-class discrimminant models for all gram-and gram+ compartments multi-class subcellular location prediction for bacterial proteins alpha helical trans-membrane proteins: enhanced prediction using a bayesian approach beta barrel trans-membrane proteins: enhanced prediction using a bayesian approach a predictor of membrane class: discriminating alpha-helical and beta-barrel membrane proteins from non-membranous proteins tatpred: a bayesian method for the identification of twin arginine translocation pathway signal sequences lippred: a web server for accurate prediction of lipoprotein signal sequences and cleavage sites combining algorithms to predict bacterial protein sub-cellular location: parallel versus concurrent implementations predicting the subcellular localization of viral proteins within a mammalian host cell virus-ploc: a fusion classifier for predicting the subcellular localization of viral proteins within host and virus-infected cells structure and sequence relationships in the lipocalins and related proteins structural relationship of streptavidin to the calycin protein superfamily analysis of known bacterial protein vaccine antigens reveals biased physical properties and amino acid composition adaptation of protein surfaces to subcellular location hierarchical classification of g-protein-coupled receptors with data-driven selection of attributes and classifiers gpcrtree: online hierarchical classification of gpcr function optimizing amino acid groupings for gpcr classification on the hierarchical classification of g protein-coupled receptors proteomic applications of automated gpcr classification vaxijen: a server for prediction of protective antigens, tumour antigens and subunit vaccines identifying candidate subunit vaccines using an alignment-independent method based on principal amino acid properties dna and peptide sequences and chemical processes multivariately modeled by principal component analysis and partial least-squares projections to latent structures principal property-values for nonnatural amino-acids and their application to a structure activity relationship for oxytocin peptide analogs peptide binding to the hla-drb supertype: a proteochemometrics analysis proteochemometrics mapping of the interaction space for retroviral proteases and their substrates proteochemometrics analysis of substrate interactions with dengue virus ns proteases generalized modeling of enzyme-ligand interactions using proteochemometrics and local protein substructures rough set-based proteochemometrics modeling of g-protein-coupled receptor-ligand interactions improved approach for proteochemometrics modeling: application to organic compound-amine g protein-coupled receptor interactions melanocortin receptors: ligands and proteochemometrics modeling proteochemometrics modeling of the interaction of amine g-protein coupled receptors with a diverse set of ligands peptide quantitative structureactivity-relationships, a multivariate approach multivariate parametrization of coded and non-coded amino-acids new chemical descriptors relevant for the design of biologically active peptides. a multivariate characterization of amino acids bioinformatic approach for identifying parasite and fungal candidate subunit vaccines jafa: a protein function annotation meta-server metamqap: a meta-server for the quality assessment of protein models a consensus strategy for combining hla-dr binding algorithms prediction of hla-a -restricted ctl epitope specific to hcc by syfpeithi combined with polynomial method propred analysis and experimental evaluation of promiscuous t-cell epitopes of three major secreted antigens of mycobacterium tuberculosis propred: prediction of hla-dr binding sites predicting class ii mhc/peptide multi-level binding with an iterative stepwise discriminant analysis meta-algorithm class ii mhc quantitative binding motifs derived from a large molecular database with a versatile iterative stepwise discriminant analysis meta-algorithm iterative stepwise discriminant analysis: a meta-algorithm for detecting quantitative sequence motifs neural models for predicting viral vaccine targets building a meta-predictor for mhc class ii-binding peptides a probabilistic meta-predictor for the mhc class ii binding peptides a meta-predictor for mhc class ii binding peptides based on naive bayesian approach strength in numbers: achieving greater accuracy in mhc-i binding prediction by combining the results from multiple prediction tools a systematic assessment of mhc class ii peptide binding predictions and evaluation of a consensus approach combination of fingerprint-based similarity coefficients using data fusion connectionist-based dempster-shafer evidential reasoning for data fusion from protein microarrays to diagnostic antigen discovery: a study of the pathogen francisella tularensis epijen: a server for multistep t cell epitope prediction nerve: new enhanced reverse vaccinology environment dynavacs: an integrative tool for optimized dna vaccine design vaxign: the first web-based vaccine design program for reverse vaccinology and applications for vaccine development enzymes, metabolites and fluxes key: cord- -o r ktie authors: kokoszka, malgorzata e.; kay, brian k. title: mapping protein–protein interactions with phage-displayed combinatorial peptide libraries and alanine scanning date: - - journal: peptide libraries doi: . / - - - - _ sha: doc_id: cord_uid: o r ktie one avenue for inferring the function of a protein is to learn what proteins it may bind to in the cell. among the various methodologies, one way for doing so is to affinity select peptide ligands from a phage-displayed combinatorial peptide library and then to examine if the proteins that carry such peptide sequences interact with the target protein in the cell. with the protocols described in this chapter, a laboratory with skills in microbiology, molecular biology, and protein biochemistry can readily identify peptides in the library that bind selectively, and with micromolar affinity, to a given target protein on the time scale of months. to illustrate this approach, we use a library of bacteriophage m particles, which display -mer combinatorial peptides, to affinity select different peptide ligands for two different targets, the sh domain of the human lyn protein tyrosine kinase and a segment of the yeast serine/threonine protein kinase cbk . the binding properties of the selected peptide ligands are then dissected by sequence alignment, kunkel mutagenesis, and alanine scanning. finally, the peptide ligands can be used to predict cellular interacting proteins and serve as the starting point for drug discovery. very often in research projects, there is interest in mapping the protein-protein interactions of a protein of interest as a way of understanding its function in the cell or virus. while a variety of techniques exist for this purpose, such as yeast two-hybrid screening, mass spectrometry, and protein complementation assays, another approach is the use of phage-displayed combinatorial peptide libraries. in such an approach, one takes a purifi ed, recombinant protein and isolates peptide ligands to it through affi nity selection (fig. ) . interestingly, the phage-displayed peptides bind at "hot spots" for molecular interaction and very often share the same primary structure as short regions within cellular interacting proteins. several examples where this approach has proven useful include such targets as protein interacting domains [ , ] . to demonstrate the utility of this approach, we describe its application to two targets, human protein tyrosine kinase, lyn, and the yeast serine/threonine-protein kinase cbk . lyn was fi rst discovered as a viral oncogene [ ] and later appreciated to be a proto-oncogene in humans. it is a non-receptor protein tyrosine fig. a general workfl ow diagram for isolating and characterizing the peptide ligands to protein domains or fragments using phage display methods. in principle, coding regions of any protein domain of interest can be converted into recombinant dna and expressed as a fusion protein to a partner such as glutathione s-transferase (gst). with a soluble protein in hand, one can perform affi nity selections with phage-displayed combinatorial peptide libraries to identify its peptide ligands. the binding properties of those ligands can be then further characterized through affi nity and specifi city measurements. to assess the importance of each residue in the peptide ligand, a mutagenic analysis known as alanine scanning can be performed on the recombinant dna. with that knowledge, potential interacting partners of the protein of interest can be predicted [ ] . finally, improving affi nity and specifi city of selected ligands through directed evolution may lead to the development of antagonists, which could be used as inhibitors of specifi c cellular interaction for proof-of-principle experiments of drug development kinase, which is a member of the src family of proteins. it has a modular architecture: a src homology (sh ) domain, a src homology (sh ) domain, several linker regions, and a catalytic domain that phosphorylate tyrosines in proteins [ ] . the sh domain plays a role in mediating protein-protein interactions and has been described to bind proline-rich motifs in proteins. the cbk belongs to a large family of ndr/lats protein kinases, which is conserved across eukaryotes and includes members such as myotonic dystrophy kinase [ ] . cbk plays a role in controlling cell separation after cytokinesis, cell integrity, and polarized growth in saccharomyces cerevisiae [ ] . to date, only a few substrates of cbk have been reported. one of them, ace , a transcription factor that is activated by cbk via three phosphorylation sites [ ] , is responsible for transcriptional control of enzymes required for septum degradation after cytokinesis [ ] . the other, ssd , is an rnabinding protein that suppresses translation of certain mrnas and which loses activity when phosphorylated by cbk [ ] . we chose these two proteins as targets in parallel affi nity selection experiments to illustrate that the same phage-displayed combinatorial peptide library can yield very different peptide ligands to different targets. the lyn sh domain has previously been used in affi nity selection experiments, and so it served as a positive control. the cbk protein had not been used previously in affi nity selection of combinatorial peptide libraries. we also show that the kunkel mutagenesis in combination with alanine scanning and elisa are very simple and expedient methods for evaluating the contribution of certain amino acids in the peptide ligands for binding to their targets. prepare using deionized water (dih o). autoclave or fi lter sterilize and store at room temperature, unless indicated otherwise. single-stranded dna samples were isolated with the qiaprep spin m purifi cation kit (qiagen). . covalently closed, circular double-stranded m dna, synthesized via kunkel mutagenesis [ , ] , was purifi ed with qiaquick pcr purifi cation kit (qiagen). . concentrations of dna samples were determined using nanodrop nd- uv spectrophotometer (thermo fisher scientifi c, inc.) and measuring the optical absorbance at nm. . the following steps: phosphorylation of oligonucleotides, annealing to the template, and synthesis of covalently closed circular double-stranded dna (cccdna) were performed using dna engine dyad ® thermal cycler (bio-rad). in described methods, affi nity selections with phage-displayed combinatorial peptide libraries (figs. and ) were employed to identify peptide ligands that bind to human lyn sh domain and yeast cbk protein kinase. in general, once peptide ligands have been identifi ed for a protein target, an interacting motif can be deduced via sequence alignment (logo, fig. ) of the primary structures of the displayed binding peptides. however, to assess the functionality of this motif, amino acid replacements of the critical residues are generally necessary. while this is possible through the chemical synthesis of variant peptides, it is more expedient to use mutagenesis techniques, such as alanine scanning (fig. , [ ] ). for the purpose of this selection protocol, the target domain was overexpressed in escherichia coli in the form of a glutathione s-transferase (gst) fusion protein. recombinant protein can be prepared via commercially available pgex vectors (ge healthcare). figure presents a schematic diagram of the selection protocol utilizing glutathione-conjugated magnetic beads. to maintain optimum sterility and eliminate carryover, only fi ltered tips should be used throughout the selection procedure. . add μl of glutathione magnetic bead slurry ( μl of settled beads) into ml eppendorf tube, and wash twice with μl of ice-cold wash buffer # (pbs) on magnetic separator. remove the supernatant. . remove the supernatant. resuspend in μl of blocking buffer # . add μg of gst protein to remove potential gst-binding phage clones and library equivalents (e.g., if the library size is clones and the titer pfu/ml use μl). bring the volume to μl with ice-cold wash buffer # . incubate at °c for h on the rotating shaker. . remove the supernatant. wash three times with μl of icecold wash buffer # (pbst). remove supernatant, add μl of ice-cold wash buffer # , and transfer into a new ml eppendorf tube to eliminate any plastic-bound phage. . repeat the washing step two more times with wash buffer # . remove the supernatant. . elute the phage with μl of elution buffer. incubate for min at room temperature, and gently mix the content every couple minutes. . place the tube on magnetic separator, and transfer the eluted phage supernatant into a new . ml eppendorf tube containing μl of neutralization buffer, and mix well. fig. c ) revealed a known y/fxfp docking motif to cbk [ ] . also, it should be noted that sequence (fig. c ) contains the fkfp motif, which is present in ssd , a known cbk substrate [ , ] . our fi ndings clearly indicate that potential binding partners of certain targets can be deduced via selections with phage-displayed peptide libraries. as only three sequences were used for alignment, any additional amino acid preference analysis could be biased. ( c ) further characterization of the isolated y/fxfp motif showed preference to positively charged residues, r and k, at the "x" position. other allowed residues at that position included v, m, t, q, when at least one positively charged residue was observed outside the motif, or c, when a second cysteine was present following the motif. all sequences incorporated in the logo ( isolates, data not shown) were isolated by screening phage-displayed focused library. the library was synthesized via kunkel mutagenesis (as described in subheading . . if blue-white screening can be performed, add μl of mm iptg and μl of % x-gal. . immediately before plating, add ml of × yt top agar ( . %), gently swirl, and pour over prewarmed × yt agar plates. incubate overnight at °c. . add anti-m hrp-conjugated antibody diluted : , in pbst (same per well volume as the target protein), and incubate min to h at room temperature. . wash the plates with μl of pbst three more times. . add chromogenic substrate solution (subheading . , item ) (same per well volume as the target protein), and incubate for - min. . quantify the results by measuring the optical absorbance at nm, using a microtiter plate spectrophotometer. all binding phage clones isolated via elisa are amplifi ed and their binding regions subsequently identifi ed by dna sequencing ( see note ). if the selection generates enough unique isolates, the binding motif of the target can be predicted by sequence alignment known as logo plot (fig. ) . with that knowledge, one can attempt to better defi ne the known interactions and to predict new substrates of the target. to further assess the importance of each residue in the motif, mutagenic analysis known as alanine scanning can be performed (fig. , [ ] ). in this method, each residue is replaced, one by one, by alanine (or by glycine if originally occupied by alanine). one way to generate all the variants is to incorporate them into m phage genome via kunkel mutagenesis (described in subheading . . ) [ , , ] . once the phage-displayed mutant pool is generated, the effect of the substitutions on their binding to the target is evaluated by phage elisa (described in subheading . ). for the purpose of this protocol, a modifi ed m phage vector containing an amber stop codon has to be fi rst obtained [ ] . in the vector constructed by scholle et al., an amber stop codon, tag, has been placed at the n-terminus of the coding region of gene iii, following the signal sequence of protein iii (piii). that modifi cation eliminates the need for generation of uracil-containing single-stranded dna (u-ssdna) template, using e. coli strain cj ( dut − ung − ). a mutagenic oligonucleotide, containing both complementary and exogenous dna fragments, is then annealed to the ssdna template, with the exogenous fragment looping over the tag-containing vector region. once the double-stranded dna (dsdna) is synthesized from the ssdna template, it is electroporated into non-suppressor e. coli strain (e.g., ss ). if the tag triplet-containing region has not been replaced by the exogenous fragment, the phage will not be propagated as the minor coat protein, piii, cannot be translated. the wild-type phage can be propagated via the suppressor e. coli strain such as tg cells. the amber stop codon is then translated into glutamine. the use of tag-containing template facilitates nearly % recombination effi ciency [ ] . . if low quantities of target are available, a small amount can be used to coat the beads (e.g., μg of protein and μl of slurry, respectively). also, decreasing the amount of used target can facilitate isolation of higher affi nity clones. . for convenience, target-bound beads can be prepared for the entire selection procedure and stored at °c in the blocking buffer # (pbs containing % bsa (w/v)) for up to a week. . as long as no contamination is introduced, phage particles can be stored in a culture medium at °c for many years. . since the phage titer, recovered after fi nal round of selection, depends on many controllable and uncontrollable factors such as stringency of the selections (i.e., number of washes, amount of target used) or type and structure of the target, from our experience, we recommend to cover a wide dilution range from at least − - − . . if fi ltered tips are used for phage transfer, it is convenient to briefl y rinse and remove the tips with multichannel pipette. . in order to identify dna sequences of potential "binders," the phage is fi rst amplifi ed by infecting μl of tg cells (from a mid-log phase culture) with μl of phage supernatant (or single plaque) and incubated overnight in a shaking incubator ( rpm), in ml fi nal volume. the pelleted cells are then used for isolation of dsdna that consists of replicative form of phage dna. subsequently, the samples are analyzed via sequencing. remaining phage supernatant can be stored at °c ( see note ). . in order to isolate high quantity of ssdna template, large amount of phage particles has to be fi rst collected. amplifi cation of phage by infecting cells with just a single plaque may result in low quantity of template. thus, we suggest a two-step preparation process, where the peg-precipitated concentrated virions are used to infect the fresh mid-log cells. . to remove any polyethylene glycol (peg) residues from the surface of the tube prior to phage reconstitution, it is recommended to fi rst briefl y rinse the side of the tube with pbs, avoiding the phage pellet. usually - ml of room temperature pbs is used to resuspend the pelleted virions. one of the major advantages of affi nity selection of phage-displayed combinatorial peptide libraries is the potential for rapid discovery of peptide ligands to a target protein. it is relatively straightforward to use homemade or commercially purchased libraries to isolate peptide ligands to a given target and then deduce a binding motif. a peptide with a consensus motif can then be used to study the biology of the target (i.e., predict cellular interacting partner, solve the three-dimensional structure of the peptide and target in a complex, and to inhibit the target in cells). however, when only one or a few peptide ligands are isolated to a target, it is diffi cult to know a priori what residues in the phage-displayed peptide ligand contribute to binding. while one can explore the binding parameters of the peptide sequence through chemical synthesis of peptides with systematic amino acid replacements across the primary structure, we present an alternative, faster, and easier approach involving kunkel mutagenesis, alanine scanning, and elisa. we fi nd that by coupling these three techniques, one can readily determine at fi rst pass many important aspects of the binding interaction. exploring protein -protein interactions using peptide libraries displayed on phage mapping intracellular protein networks the yesrelated cellular gene lyn encodes a possible tyrosine kinase similar to p lck src family kinases: regulation of their activities, levels and identifi cation of new pathways cbk p, a protein similar to the human myotonic dystrophy kinase, is essential for normal morphogenesis in saccharomyces cerevisiae the saccharomyces cerevisiae mob p-cbk p kinase complex promotes polarized growth and acts with the mitotic exit network to facilitate daughter cell-specifi c localization of ace p transcription factor the ndr/lats family kinase cbk directly controls transcriptional asymmetry cbk regulation of the rna-binding protein ssd integrates cell fate with translational control effi cient construction of a large collection of phage-displayed combinatorial peptide libraries effi cient site-directed mutagenesis using uracilcontaining dna highresolution epitope mapping of hgh-receptor interactions by alanine-scanning mutagenesis improvements to the kunkel mutagenesis protocol for constructing primary and secondary phage-display libraries rapid and effi cient sitespecifi c mutagenesis without phenotypic selection convergent evolution with combinatorial peptides can we infer peptide recognition specifi city mediated by sh domains? distinct ligand preferences of src homology domains from src, yes, abl, cortactin, p bp , plcy, crk, and grb proteome-wide discovery of evolutionary conserved sequences in disordered regions this work was funded by the chicago biomedical consortium, with support from the searle funds at the chicago community trust. we would like to thank mr. kyle schneider, dr. brian yeh, and dr. eric weiss (nu) for providing the gst-cbk dna constructs and purifi ed gst-cbk fusion protein. we are grateful to dr. michael kierny and dr. renhua huang (uic) for their helpful comments on this manuscript. key: cord- - evbeijx authors: pandey, rajan kumar; bhatt, tarun kumar; prajapati, vijay kumar title: novel immunoinformatics approaches to design multi-epitope subunit vaccine for malaria by investigating anopheles salivary protein date: - - journal: sci rep doi: . /s - - - sha: doc_id: cord_uid: evbeijx malaria fever has been pervasive for quite a while in tropical developing regions causing high morbidity and mortality. the causal organism is a protozoan parasite of genus plasmodium which spreads to the human host by the bite of hitherto infected female anopheles mosquito. in the course of biting, a salivary protein of anopheles helps in blood feeding behavior and having the ability to elicit the host immune response. this study represents a series of immunoinformatics approaches to design multi-epitope subunit vaccine using anopheles mosquito salivary proteins. designed subunit vaccine was evaluated for its immunogenicity, allergenicity and physiochemical parameters. to enhance the stability of vaccine protein, disulfide engineering was performed in a region of high mobility. codon adaptation and in silico cloning was also performed to ensure the higher expression of designed subunit vaccine in e. coli k expression system. finally, molecular docking and simulation study was performed for the vaccine protein and tlr- receptor, to determine the binding free energy and complex stability. moreover, the designed subunit vaccine was found to induce anti-salivary immunity which may have the ability to prevent the entry of plasmodium sporozoites into the human host. increase the drug efficacy and kills the remaining parasites. the most widely used combination therapies are artemether-lumefantrine (commercial name: coartem) and amodiaquine-artesunate (commercial name: coarsucam) . while the recently approved combination therapies are artesunate-pyronaridine (commercial name: pyramax) and dihydroartemisinin-piperaquine (euartesim) . the recent emerging resistance against artemisinin urges to develop some new strategy to prevent the malaria diseases condition . therefore, in this study, we applied a novel immunoinformatics approach to design multi-epitope based subunit vaccine that may prevent the disease by maintaining the host hemostasis by the inhibition of anticoagulant and anti-inflammatory proteins present in mosquito saliva. it will also inhibit the entry of parasite within the host body by a similar mechanism. apart from this, if any way parasite enters into the host body, vaccine candidate will stop the salivary protein-mediated induction of parasitic growth. among different anopheline vectors, anopheles stephensi is a sub-tropical species that most abundantly present in the indian subcontinent and also distributed across the middle east and south asia region . a. stephensi transmit the malarial infection by injecting various plasmodium species to the human host typically via bites. salivary gland of mosquitos implement various functions for the survival of the vector and complement their feeding behavior by producing a large array of a biochemically active molecule that has immunomodulatory, anti-coagulant and anti-inflammatory properties that disable the host hemostatic response for successful blood feeding , . salivary proteins are antigenic and immunogenic in nature which helps the infectivity of parasite , . d protein and salivary apyrase are two different salivary proteins that help in binding, and inhibition of the platelets aggregation, respectively. salivary peroxidase helps in heme binding and peroxidase activity while putative til domain polypeptide functions as trypsin inhibitor . recently, it was reported that hamadarin is a kda protein present in anopheles stephensi saliva which inhibits the activation of plasma contact system and ultimately blood coagulation. another protein from anopheles gambie saliva namely gsg plays essential blood feeding , . recently, vijay s. et al. reported that salivary proteins might be utilized to develop novel antimalarial control strategies via innate immune protection against malaria . these proteins could likewise evoke a host igg response in natural conditions , . mosquito salivary gland surface (sgs) proteins are the prevalent immunogenic component present in saliva having ability to induce immunogenic responses . this is the reason why we chose the a. stephensi salivary proteins from the national center for biotechnology information (ncbi) and subjected to design multi-epitope subunit vaccine. allergenicity, antigenicity and physiochemical properties were also obtained for the vaccine protein. moreover, tertiary structure prediction followed by refinement was performed to get a refined d model having a higher number of residues in the favored region of ramachandran plot. molecular docking and molecular dynamics simulation of vaccine constructs with tlr were also performed to check the binding energy and complex stability. finally, disulfide engineering and in silico cloning was performed to increase the stability of vaccine construct and ensuring its effective expression in the microbial system, respectively. this study finally represents a novel approach to develop malaria vaccine using salivary protein instead of parasitic protein, which could be helpful to prevent the plasmodium infection to human host. sequence retrieval of salivary protein and assurance of antigenic conduct. in order to design an immunogenic multi-epitope subunit vaccine, the sum of a. stephensi salivary protein sequences was retrieved from the ncbi protein database. major proteins name is salivary lysozyme, a salivary protein precursor, salivary galectin, salivary lipase, anti-thrombin anopheline, salivary protein sg , salivary apyrase, salivary secreted serine protease inhibitor, salivary defensin and salivary cecropin. among salivary protein sequences, only proteins were found to be antigenic as predicted by antigenpro. these sequences were selected based on their score obtained for the probability of antigenicity and all these proteins having a score of ≥ . obtained score for antigenicity probability clearly denoting the antigenic nature of selected protein sequences which can be used for the subunit vaccine designing . subset of t-cell responses to kill those target cells having intracellular viral, bacterial or protozoan infection . during infection, whenever they encounter to the mhc-i mounted antigen specific to their receptor, they enter the cell cycle and perform several mitotic divisions followed differentiation into the effector cells . here, we tried to predict the ctl receptor specific immunogenic epitopes using the netctl . server and total ctl epitopes of mer length were obtained for the input of salivary protein sequences . in the next step, the immunogenicity of epitopes was determined and as per the instruction of iedb , higher score indicate greater probability to elicit an immune response; therefore total ctl epitopes with high immunogenicity score were selected and subjected to the vaccine designing (table ) . helper t-lymphocyte is the key player of both humoral and cell-mediated immune response . therefore, htl receptor specific epitopes are probably going to be a crucial part of the prophylactic and immunotherapeutic vaccine . all salivary protein sequences were subjected to iedb mhc-ii epitope prediction module and epitopes of mer length were obtained. in order to become highest immunogenic epitopes, they must have a lower percentile rank and ic value . only epitopes with lowest percentile rank ranging from . - . were selected for the vaccine designing (table ). their ic value ranging from - denoting that out of epitopes with lowest percentile rank, epitopes have intermediate affinity while remaining to have low affinity for the htl epitopes. on the other side, all epitopes were found to have ifn-γ inducing capability that was obtained from their positive score on the ifnepitope server output , (supplementary table ). all these epitopes were used for the vaccine construction. construction of multi-epitope subunit vaccine. a final vaccine construct of amino acid residues was designed using ctl and htl epitopes as described elsewhere , (supplementary figure ) . in order to attain maximum immune response tlr- agonist (rs ) was used as an adjuvant at the n-terminal site of the vaccine construct . each joint was occupied by the suitable linkers as described by nezafat et al. , for example, adjuvant and ctl epitopes were combined together by eaaak linker, intra-ctl and intra-htl epitopes joint by aay and gpgpg linker, respectively. finally, vaccine construct was obtained having adjuvant, linker, ctl, and htl epitopes in a sequence moving from n-terminal to c-terminal. as this designed subunit vaccine consisting of immunogenic ctl and ttl epitopes along with suitable adjuvant and linker, it may have the ability to inhibit the entry of malaria parasite within the human host body , . b-cell epitope mapping. b-cells are a key player of humoral immunity. an epitope corresponding to the b-cell receptor plays an important role in vaccine design following antibody production . therefore, bcpreds server was used to reliably predict the linear b-cell epitopes where bcpred was the selected prediction method . total b-cell epitopes of mer length were predicted among the primary input sequence of final vaccine construct. among them, only epitopes were selected and finalized because of their high score of . ( fig. a ). due to the selection of highest scoring b-cell epitope among designed subunit vaccine, our vaccine may have the ability to enhance humoral immunity as well as cell mediated immunity . discontinuous epitopes of amino acids long were also predicted from the final d model of vaccine construct with the probability scoring of . (fig. b) . the obtained probability score also confirming the immunogenic behavior of the designed subunit vaccine . antigenicity and allergenicity prediction of designed vaccine. a vaccine given to human host must be immunogenic in nature and capable to trigger significant humoral immune response which ultimately leads to the memory cell formation against the pathogenic epitopes. the antigenicity of designed vaccine construct was determined by using an alignment-free antigenpro server and found that it has the antigenicity probability of . , which represent the antigenic nature of vaccine construct . the antigenicity score obtained for this vaccine construct is comparable with the antigenicity of subunit vaccine reported elsewhere . allergy is an overreaction by our immune system to the previously encountered, ordinarily harmless substance that results in sneezing, wheezing, skin rash, and swelling of the mucous membrane . allergenicity of predicted vaccine construct was determined using allertop online server and found that the vaccine protein is nonallergic in nature and safe for the human use , . physiochemical properties assessment. the physiochemical properties of vaccine construct were characterized by using protparam server and evaluated for seven parameters. the molecular weight of vaccine protein was found to be kda which will favor the antigenicity of the vaccine construct . the theoretical pi was found to be . showing its slightly acidic nature while the total numbers of negative and positive charge residues were and , respectively . the estimated half-life in mammalian reticulocytes was . hours, in vitro; while and hours in yeast and e. coli, in vivo. the extinction coefficient was found to be m- cm- , at nm measured in water, assuming that all cysteine residues are reduced. the score obtained for instability index was . , showing the stable nature of vaccine construct. the value of the aliphatic index and grand average of hydropathicity (gravy) was . and − . , respectively. the estimated value of aliphatic index represents the thermostable nature of designed subunit vaccine because higher the value of aliphatic index, greater will be the thermo stability . while, negative value of gravy for the input subunit vaccine represents the hydrophilic nature of vaccine . conclusively, the designed vaccine is immunogenic, thermostable and hydrophilic in nature. tertiary structure prediction, refinement, and validation. the tertiary structure was predicted by using the raptorx server and d model was obtained as described elsewhere (fig. a) . the best template used for the homology modeling was crystal structure of a legionella phosphoinositide phosphatase (pdb id: fye). total amino acid residues were modeled as a single domain with % disorder. secondary structure information resulting in the presence of % helix, % beta sheet, and % coiled structure. p-value is a parameter of homology modeling where low p-value defines the good quality of modeled structure . the p-value obtained for the modeled structure was . e- which is low and significant. further, protein refinement using galaxyrefine leads to the increase in a number of residues in the favored region . initially, % of residues were in the rama-favored region while after refinement the number of residues in the rama-favored region reached to . %. the refinement output was also validated by plotting ramachandran plot and found the same that . % residue in rama-favored region, . % residues in allowed region and only . % residues in outlier region (fig. b ). disulfide engineering for vaccine stability. in order to stabilize the modeled structure of final vaccine constructs disulfide engineering was performed using disulfide by design v . and found that there are total pairs of residues that can be used for the purpose of disulfide engineering. but after evaluation on other parameters like energy and chi value, only four pairs of residues were finalized because their value comes under the allowed range i.e. the value of energy should be less than . and chi should be in between − and + degree . therefore, total mutations were created at the residues pairs namely ala -gln , tyr -gly , trp -glu , and gly -phe (fig. ) . codon adaptation and in silico cloning. the main purpose of in silico cloning was to express the vaccine protein epitope of anopheles mosquito origin into e. coli expression system . therefore, it was necessary to adapt the codon respective to subunit vaccine construct as per the codon usage of e. coli expression system. we adapted the codons as per e. coli k strain using jcat server and found that the gc-content of the improved sequence was . % while the value of codon adaptive index was . which is near to . that was satisfactory . later on, xhoi and ndei restriction sites were created and cloned into the pet a(+) vector (fig. ) . the target sequence in the clone is represented in blue color in between aforementioned restriction sites . the target sequence is also enclosed between -histidine residues on both ends that will be helpful to the purification purposes. the total length of the clone was . kbp. and tlr- receptor was performed using the cluspro . and total models were generated . among them, only that model was selected which properly occupied the receptor and having lowest energy score and found that model number . fulfill the desired criteria that's why selected as the best-docked complex (fig. ). the energy score obtained for the model . was found to be − which is lowest among all other predicted docked complex showing highest binding affinity. formed using gromacs . . using a gromos a force field . the potential energy obtained for the complex was − . kj/mol, while the value of temperature, pressure, and density was obtained as . k, . bar and . kg/m ( supplementary figure a-c) . the radius of gyration obtained for the docked complex showing that the distance in rotating complex from the center of mass is . nanometers that decreases up to . nanometers at the time duration of nanoseconds (supplementary figure d) . the rmsd value of protein backbone was . nanometers (fig. a) while rmsf score obtained for the protein side chain was found to be . nanometers (fig. b ). both these scores are satisfactory showing strong complex stability , . malaria is severe infection characterized by the high fever with irregularity but may also lead to brain injury and coma. it affects million people from countries, worldwide. lack of effective vaccine and emergence of resistance against artemisinin created a disastrous condition among the people living in the endemic zone. therefore, it's the need of time to search for the new options to tackle this severe problem. the vector for malaria is the anopheles stephensi in the indian subcontinent leads to the transfer of malaria parasite. literature survey reveals that salivary proteins of anopheles mosquito not only supports the pathogenesis but are also immunogenic in nature. therefore, this study was designed to reach a step ahead in the path of vaccine development. we used the primary amino acid sequence of anopheles mosquito salivary protein to design a subunit vaccine construct. the constructed vaccine has ctl, htl and bcl epitopes of varying length. it has antigenic properties in the absence of allergenic properties. it was stable and having a good binding affinity for the tlr- receptor. collectively, this study applied a series of immunoinformatics tools in a sequential manner to find an effective vaccine that may fight against the malaria infection. however, this study needs experimental validation to prove this computational work. the experimental work may include the synthesis of designed subunit vaccine followed by the in vitro and in vivo analysis to determine the immunogenicity and safety concern of the same. anopheles stephensi mosquito is the vector of malaria transmission to the human being in the indian subcontinent. while the salivary gland proteins of anopheles mosquito was reported for their role in parasite pathogenesis and having the ability to induce igg response in the natural host , . therefore, total salivary proteins of a. stephensi were obtained from the national center for biotechnology information (ncbi) protein database (retrieval date / / ) and subjected to multi-epitope vaccine designing. as the main purpose of vaccination is to induce an immunogenic response within the host body, all the retrieved protein sequences were subjected to their antigenicity prediction using antigenpro. based on the antigenicity result, only proteins were found to have an antigenic probability of ≥ . were selected and used in the next step. cytotoxic t-lymphocyte were shown to inhibit malaria parasitic growth and development, inside the hepatocytes cells . to get an immunogenic ctl epitopes having the ability to elicit cell-mediated immunity and form the memory cells, all salivary protein sequences were subjected to the netctl . server . netctl . is an online web server intended for predicting ctl epitopes among input protein sequences based on the training dataset. netctl was selected to predict the ctl epitopes because of its higher prescient execution on all execution parameters as compared to the recently developed servers namely mhc-pathway, mappp, and epijen. all the salivary protein sequences were submitted in the fasta format to predict the ctl epitope at the threshold score of . (default). those epitopes having a combined score of greater than . were selected as ctl epitope and further subjected to the immune epitope database (iedb) mhc class i immunogenicity prediction module. predicted ctl epitopes for each salivary protein was used as an input sequence and the result was obtained in the form of the score, where higher score determines that greater will be the probability of eliciting an immune response. helper t-lymphocyte (htl) epitope prediction. helper t-cell response is the major part of cell-mediated immunity and helps in pathogen clearance by the help of various cytokines and immune cells , . they have the ability to induce both ctl and humoral immune response by the secretion of lymphokines like il- , il- , il , granulocyte-macrophage colony-stimulating factor (gm-csf), and ifnγ. in view of that, we can say that htl epitopes mainly of th type are most likely going to be a crucial part of the prophylactic and immunotherapeutic vaccine. therefore, iedb mhc-ii epitope prediction module was used to predict the htl epitopes for all anopheles salivary protein sequences . the available parameters were kept default except for allele selection where the nominated alleles were h -iab, h -iad, and h -ied. output epitopes were ranked based on their percentile rank score where lower percentile rank representing that greater will be the binding affinity for htl receptor. secondly, to prove our work that the predicted htl epitopes will have ability to activate th type immune response followed by the ifn-γ production, top predicted htl epitopes were subjected to the ifn epitope server using predict option. all epitopes were submitted in the fasta format followed by the approach selection and model of prediction. motif and svm hybrid was selected as the approach and ifn-gamma versus other cytokine as model of prediction. designing of multi-epitope subunit vaccine. so as to plan an appropriate vaccine candidate, it must have the ability to induce ctl and htl immune response. in other words subunit vaccine must contain both ctl and htl epitopes along with suitable linkers. keeping in mind the end goal to effectively activate both innate and adaptive immune response, subunit vaccine must consist of a strong immunostimulatory adjuvant. in the previous decades, there is a huge headway in the adjuvant engineering, for instance, toll-like receptor (tlr) agonists have made its contribution as a part of peptide-based subunit vaccine as a functional option for present-day immunotherapy . recently, junqueira et al. have shown that cpgs oligodeoxynucleotides (cpg odns) and glycoinositolphospholipids (gipl) gotten from trypanosome cruzi having the ability to activate tlr- and tlr leads to actuate potent pro-inflammatory reaction . secondly, proteo-glycolipid complex (p glc) derived from leishmania parasite has shown its affinity for the tlr- receptor and recognized as ligand . moreover, tlrs having the capability to recognize the plasmodium ligands, for example, plasmodium falciparum primes the human tlr- response towards high proinflammatory cytokine profile . shanmugam a. et at. has reported that synthetic tlr- agonist namely rs- (sequence: apphals) can be used as a novel class of adjuvant , therefore, it was added as an adjuvant and linked with epitopes (ctl and htl) by using eaaak linker . linkers assume an imperative part in simulating the vaccine construct to work as an independent immunogen and producing higher antibody titer than that of single immunogen . total three linkers namely eaaak, aay, and gpgpg, were used to construct the final vaccine. aay and gpgpg linkers were added at the intra-epitope position to link the ctl and htl epitopes, respectively. b-cell epitope prediction. b lymphocytes, a type of white blood cells, are the key player of humoral immunity by antibody production. the identification of b-cell epitopes is an essential part in vaccine designing. bcpreds server was used to predict the linear b-cell epitopes of amino acids long. the amino acid sequence of final vaccine construct was used as an input sequence in plain format followed by the selection of fixed length epitope prediction method and length of the epitope. bcpreds (default method) was selected as the prediction method for the epitope of amino acids long . the specificity threshold was set to be by default at % to get the result in a user-friendly format. while conformational epitopes were predicted using ellipro server for the input of tertiary protein structure of vaccine construct. antigenicity and allergenicity prediction of designed vaccine. antigenicity determines the ability of an antigen to binds with the b-and t-cell receptor that may lead to the immune response and memory cell formation. therefore, the antigenic nature of predicted vaccine construct was determined to ensure its ability to interact with the immune receptor. antigenpro is a sequence based, pathogen independent and alignment-free prediction method that was used to check the antigenic behavior of vaccine protein. it uses svm classifier to summarize the probable antigenic or non-antigenic nature of proteins. antigenpro uses the existing protein antigenicity microarray files of eight feature sets for five pathogens to construct two-stage architecture; among them, the first one is multiple representations of the primary protein sequence and the second one is five machine learning algorithms. allergenicity is the potential of a material to cause sensitization and allergic reactions associated with the ige antibody response. therefore, the predicted vaccine construct must be free from the allergenic nature. allertop v. . was used to check the allergenicity of the vaccine construct based on the method that uses auto cross-covariance (acc) transformation of protein sequences into uniform equal-length vectors. input protein sequence of vaccine protein was classified by the k-nearest neighbor algorithm (knn, k = ) which is based on the training set of known allergen from different species and nonallergen from similar species. physiochemical properties assessment. the main purpose of vaccination is to induce an immune response after injecting the vaccine into the body. therefore, it is necessary to define the physical and chemical parameters associated with the vaccine. protparam web server, a part of expert protein analysis system (expasy), was used to define various physicochemical properties of predicted vaccine construct. the primary protein sequence of the vaccine was used to predict the various parameters including molecular weight (kda), estimated half-life, theoretical pi, aliphatic index, grand average of hydropathy (gravy) and so on. tertiary structure prediction. protein molecule achieves maximum stability in its lowest energy state by proper bending and twisting to form a tertiary structure. it is the interaction between the amino acids side chain residue which is responsible to stabilize the protein structure. the -dimensional structure of predicted vaccine construct was obtained by utilizing raptorx structure prediction server. raptorx is a pure ab initio method that can be used to build a d model in a template-free manner. it is the degree of likeness between the target and available template structure that determines the quality of protein model structure created by contemporary protein structure prediction techniques , . therefore, it was necessary to improve the template based predicted model beyond the accuracy by utilizing the template information. to fulfill this thought, output model of raptorx server was subjected to the galaxyrefine web server , which is based on the casp tested refinement method. galaxyrefine performs rehashed structure perturbation followed by overall structural relaxation by performing molecular dynamics simulation. disulfide engineering for vaccine stability. before proceeding to the next step, it was necessary to improve the stability of refined protein model. disulfide bonds are covalent interactions that emulate the stabilizing molecular interaction and provide a considerable stability to protein model by confirming precise geometric conformations. disulfide engineering is a novel approach for creating disulfide bonds into the target protein structure. therefore, the refined model of final vaccine construct was subjected to the disulfide by design . to perform disulfide engineering. initially, the refined protein model was uploaded and run for the residue pair search that can be used for the disulfide engineering purpose. total residue pairs were selected to mutate them with cysteine residue using create mutate function of the disulfide by design . server. in silico cloning. codon adaptation is a way to attain major expression rate of foreign genes in the host when the codon usage of the host differs from that of the organism where the gene stems from. unadapted codon may lead to the minor expression rate in the host. therefore, the primary sequence of vaccine protein was submitted to the java codon adaptation tool (jcat) to adapt their codon usage to most sequenced prokaryotic organisms (e. coli k ) . cai value and gc content of the adapted sequence was also obtained. later on, the adapted nucleotide sequence corresponding to the designed vaccine construct was cloned into the e. coli pet a(+) vector by using the restriction cloning module of snapgene tool . used to predict the preferred orientation of ligand molecule to the receptor molecule in their stable complex form . it can be also used to predict the binding affinity between these two molecules in terms of scoring function. as mentioned in the previous section, tlrs having the capability to recognize the plasmodium ligands and p. falciparum primes the human tlr- response towards high proinflammatory cytokine profile . therefore, tlr- was selected as receptor and its pdb file (pdb id: g a) was obtained from rcsb-protein data bank while the refined model of vaccine protein was used as a ligand. protein-protein docking was performed using the cluspro . : protein-protein docking server, to check the binding affinity between them . widely accepted computational approach which is used to determine the stability of protein-ligand complex at the microscopic level . the protein-protein docked complex output of cluspro was used as an input to perform the molecular dynamics simulation using gromacs v . . . initially, the crystal water of complex was removed followed by the topology generation using a gromos a force field. in the next step, protein complex was centered in a cubic boundary box and filled by water molecule using simple point charge (spc) water model and chloride ion was used for the charge neutralization of complex. moreover, energy minimization followed by canonical equilibration (nvt ensemble) and isothermal-isobaric (npt ensemble) was performed for a time duration of ps. finally, molecular dynamics simulation was executed for the time duration of ns . the root mean square deviation (rmds) for backbone and root mean square fluctuation (rmsf) for side chain was determined. cerebral malaria; mechanisms of brain injury and strategies for improved neuro-cognitive outcome exploring dual inhibitory role of febrifugine analogues against plasmodium utilizing structure-based virtual screening and molecular dynamic simulation antimalarial drug discovery -approaches and progress towards new medicines pre-travel malaria chemoprophylaxis counselling in a public travel medicine clinic in são paulo recent advances in malaria drug discovery a large focus of naturally acquired plasmodium knowlesi infections in human beings advances and challenges in malaria vaccine development the global pipeline of new medicines for the control and elimination of malaria recent clinical and molecular insights into emerging artemisinin resistance in plasmodium falciparum the biology and control of malaria vectors in india from sialomes to the sialoverse: an insight into salivary potion of blood-feeding insects an insight into the sialome of blood-feeding nematocera salivary gland lysates from the sand fly lutzomyia longipalpis enhance leishmania infectivity mass spectrometry based proteomic analysis of salivary glands of urban malaria vector anopheles stephensi a mosquito salivary protein inhibits activation of the plasma contact system by binding to factor xii and high molecular weight kininogen the anopheles gambiae salivary protein gsg : an anopheline-specific protein with a blood-feeding role molecular and immunological toxic effects of nanoparticles natural human humoral response to salivary gland proteins of anopheles mosquitoes in thailand identification of antigenic proteins from salivary glands of female anopheles maculatus by proteomic analysis members of the salivary gland surface protein (sgs) family are major immunogenic components of mosquito saliva exploring leishmania secretory proteins to design b and t cell multi-epitope subunit vaccine using immunoinformatics approach exploring dengue genome to construct a multi-epitope based subunit vaccine by utilizing immunoinformatics approach to battle against dengue infection regulation of cd (+) t cell responses to infection with parasitic protozoa the great balancing act: regulation and fate of antiviral t-cell interactions discovery of t-cell driven subunit vaccines from zika virus genome: an immunoinformatics approach properties of mhc class i presented peptides that enhance immunogenicity a novel multi-epitope peptide vaccine against cancer: an in silico approach designing of interferon-gamma inducing mhc class-ii binders intranasal vaccination against hiv- with adenoviral vector-based nanocomplex using synthetic tlr- agonist peptide as adjuvant the role of b cells and humoral immunity in mycobacterium tuberculosis infection epitope-based vaccine target screening against highly pathogenic mers-cov: an in silico approach applied to emerging infectious diseases allertop -a server for in silico prediction of allergens disulfide by design . : a web-based tool for disulfide engineering in proteins a multi-subunit based, thermodynamically stable model vaccine using combined immunoinformatics and protein structure based approach developing imidazole analogues as potential inhibitor for leishmania donovani trypanothione reductase: virtual screening, molecular docking, dynamics and admet approach molecular docking and molecular dynamics simulation based approach to explore the dual inhibitor against hiv- reverse transcriptase and integrase t cells as mediators of protective immunity against liver stages of plasmodium large-scale validation of methods for cytotoxic t-lymphocyte epitope prediction differential expression of mirna regulates t cell differentiation and plasticity during visceral leishmaniasis infection perturbed microrna expression by mycobacterium tuberculosis promotes macrophage polarization leading to pro-survival foam cell peptide binding predictions for hla dr, dp and dq molecules advances in the design and delivery of peptide subunit vaccines with a focus on toll-like receptor agonists trypanosoma cruzi adjuvants potentiate t cell-mediated immunity induced by a ny-eso- based antitumor vaccine mechanisms of immune evasion in leishmaniasis plasmodium falciparum infection causes proinflammatory priming of human tlr responses synthetic toll like receptor- (tlr- ) agonist peptides as a novel class of adjuvants a potential protein adjuvant derived from mycobacterium tuberculosis rv enhances dendritic cells-based tumor immunotherapy new directions in nicotine vaccine design and use predicting linear b-cell epitopes using string kernels protein identification and analysis tools in the expasy server molecular modeling and virtual screening approach to discover potential antileishmanial inhibitors against ornithine decarboxylase protein structure refinement driven by side-chain repacking febrifugine analogues as leishmania donovani trypanothione reductase inhibitors: binding energy analysis assisted by molecular docking, admet and molecular dynamics simulation the cluspro web server for protein-protein docking high-throughput virtual screening and quantum mechanics approach to develop imipramine analogues as leads against trypanothione reductase of leishmania structure-based virtual screening, molecular docking, admet and molecular simulations to develop benzoxaborole analogs as potential inhibitor against leishmania donovani trypanothione reductase r.k.p. is thankful to department of science and technology for providing inspire fellowship. v.k.p. is thankful to the central university of rajasthan for providing computational facility. we acknowledge the scientific help of ms. rani soni from department of biotechnology, central university of rajasthan for this manuscript. protocol designed by r.k.p., t.k.b., v.k.p. methodology performed by r.k.p., v.k.p. manuscript was written by r.k.p., t.k.b.,v.k.p. supplementary information accompanies this paper at https://doi.org/ . /s - - - . the authors declare that they have no competing interests.publisher's note: springer nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.open access this article is licensed under a creative commons attribution . international license, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the creative commons license, and indicate if changes were made. the images or other third party material in this article are included in the article's creative commons license, unless indicated otherwise in a credit line to the material. if material is not included in the article's creative commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. to view a copy of this license, visit http://creativecommons.org/licenses/by/ . /. key: cord- -ura oci authors: hosseini, elahe seyed; zeinoddini, mehdi; saeedinia, ali reza; babaeipour, valiollah title: optimization and one-step purification of recombinant v antigen production from yersinia pestis date: - - journal: mol biotechnol doi: . /s - - -x sha: doc_id: cord_uid: ura oci the purpose of this study was to develop an efficient and inexpensive method for the useful production of recombinant protein v antigen, an important virulence factor for yersinia pestis. to this end, the synthetic gene encoding the v antigen was subcloned into the downstream of the intein (int) and chitin-binding domain (cbd) from the ptxb vector using specific primers. in the following, the produced new plasmid, ptx-v, was transformed into e. coli er( ) strain, and the expression accuracy was confirmed using electrophoresis and western blotting. in addition, the effects of medium, inducer, and temperature on the enhancement of protein production were studied using the taguchi method. finally, the v antigen was purified by a chitin affinity column using int and cbd tag. the expression was induced by . mm iptg at °c under optimal conditions including tb medium. it was observed that the expression of the v-int–cbd fusion protein was successfully increased to more than % of the total protein. the purity of v antigen was as high as %. this result indicates that v antigen can be produced at low cost and subjected to one-step purification using a self-cleaving int tag. yersinia pestis has evolved from gastrointestinal pathogens and its antibiotic resistance can cause a dangerous disease, i.e., plague. therefore, plague is still considered as a major threat to human health [ ] [ ] [ ] . the human plague vaccine (usp) is a killed whole-cell plague bacilli that prevents plague infections through subcutaneous injection [ ] . however, some reports have recently demonstrated the cytotoxic effects of whole-cell vaccines and their poor protection against virulent strains without capsules [ , ] . the low calcium response (lcr) of v antigen (lcrv) and the component (f ) capsular antigen are the two important virulence factors which have been considered as vaccine candidates tested for their efficacy on humans and primates. lcrv is known as the virulence and multifunctional protein. this crucial protein has been shown to act at the level of secretion control by binding to other proteins in order to modulate the host immune response by altering cytokine production [ , ] . genetic engineering can be used to produce recombinant vaccines using different parts of yersinia, such as v antigen and f [ , ] . to this purpose, the selection of effective methods to recombinant protein purification is a key factor of production. highly purified recombinant proteins with the lowest and shortest time are the major problems for biotechnology researchers. intein (int) is the internal part of the protein that is derived from immature proteins during protein splicing process [ ] [ ] [ ] . since the discovery of the int feature in , this fusion tag has been used in many biotechnology applications. one of its wonderful applications is in the int-tagged self-cleaving step to purify recombinant proteins [ , ] . the int tag is separated from the target protein by thiol induction, as well as by temperature and ph changes for one-step protein purification. the advantage of this technique over other protein purification procedures is that the protease phase is eliminated, making the method economically affordable [ ] [ ] [ ] . thus, in the commercial ptxb and ptwin vectors, intein is fused to the chitin-binding domain (cbd). the expressed fusion protein can be ligated to the chitin beads and then, ph, temperature changes, or thiol induction leads to cleavage and separation of the target protein from int tag and cbd. finally, the target protein can be purified from the column without any additional amino acid [ , ] . on the other hand, the traditional reaction optimization method involves the influence of one variable factor at a time. this method requires costly, time-consuming, and laborious experiments. statistical methods including response surface and taguchi methodologies are usually used to optimize the effective factors in increasing the production of recombinant proteins. in response surface method, the relationship between the independent factors and the effective response can be evaluated in designed experiments to predict, with more tests than taguchi approach [ , ] . in turn, in taguchi methodology, more variables and qualitative factors can be investigated [ , ] . generally, when the cost and time limitations make it difficult to perform more experiments in the optimization process and also when discrete or qualitative factors such as media culture types are investigated, taguchi design is preferred in bench-scale fermenter [ ] . in this work, the int tag was used to purify v antigen from y. pestis in order to develop a new purification strategy for this important protein. in addition, detailed studies were conducted to find optimal conditions of temperature, medium, inducer concentrations, and overexpressed v-int-cbd fusion protein using the taguchi method. to amplify the v antigen encoding sequence, specific primers were designed based on the v antigen gene sequence retrieved from gene bank (accession no. af . ). the ndei enzyme restriction site sequence was added to the forward primer ( ʹ-ggt ggt cat atg att agag cct ac gaac- ʹ) and sapi enzyme restriction site sequence was also added to the reverse primer ( ʹ-ggt ggt t gctct tccgc att tac cag acg tgt cat c- ʹ). the pet-v (pet a containing the v antigen encoding sequence) was amplified using the polymerase chain reaction (pcr). the pcr mixture ( μl) contained × pcr buffer, mm magnesium sulfate, mm of each dntp, pmol per primer, μl ( ng) v antigen in pet vector, and . unit pfu dna polymerase (fermentase). the amplification was performed using the techne thermocycler, with initial denaturation at °c for min, cycles at °c for s, s at °c, and s at °c, and the final extension was performed at °c for min. the pcr product was analyzed using % agarose gel electrophoresis. the pcr product was purified using a gel purification kit (bioneer). the pcr product and the ptxb vector (neb #n , biolab) were double-digested with sapi and ndei enzymes and then ligated together with t dna ligase. the cloning of v antigen in the ptxb vector was verified by restriction enzyme mapping. after confirming the ptx-v construct (fig. c) , the plasmid was transformed into the competent er strain of e. coli, prepared by the calcium chloride method [ ] . the transformed bacteria were grown in a -l lysogeny broth (lb) containing μg/ml ampicillin at °c in an air shaker ( rpm) until the optical density (od ) at nm reached . - . . then, the t promoter was induced by . mm isopropyl β-d- -thiogalactopyranoside (iptg) at °c for h (to increase the soluble protein). the expression of v antigen was assessed by sds-page and western blotting at different times using specific antichitin-binding domain serum antibodies (neb #s , new england biolabs inc.). the cells were harvested and resuspended in ml of column buffer ( mm tris-hcl, ph . , . m nacl) and the crude cell extracts were prepared by sonication for cycles of s (hielscher ultrasonic, germany). the supernatant was separated from the cell debris by centrifugation at , ×g for min at °c and passed through a × cm column (bio-rad, hercules, ca) containing ml of chitin beads (neb #s ). the flow rate was . ml/min. after loading the supernatant on the column, the flow rate was increased to ml/min, and the column was thoroughly washed with the column buffer until the eluted non-specific protein content reached a minimum. thereafter, a column buffer containing mm dithiothreitol (dtt) was slowly passed through the column, the flow was stopped, and the column was incubated at room temperature for to h. each fraction ( ml) containing v antigen was obtained by eluting the column with the column buffer. all samples were analyzed by sds-page using % tris-glycine gel. the protein concentration was estimated using the bradford method. after purification, dtt was removed from the buffer and the protein was concentrated using a millipore centricon tube. the enhancement of recombinant protein production and subsequently, the purification of the target protein, the effects of temperature on the induction ( , , and °c), media (terrific broth or tb, contains tryptone . %, yeast extract . %, and glycerol . %; supper broth or sb which contains tryptone %, yeast extract %, mops %, and glucose %; and y which contains tryptone . %, yeast extract . %, and nacl . % in mm tris-hcl, ph . ), and inducer concentration ( . , . and . mm) were investigated using the taguchi statistical method [ ] ( table ) . the taguchi method is used to evaluate the factors considered. this method allows the simultaneous study of different effective factors and their interactions. for this purpose, the experiments were designed on the l orthogonal array ( table ) based on which each proposed test condition was repeated twice. then the results (final concentration of the recombinant protein) were analyzed using minitab- software. taguchi uses analysis techniques that do not depend on a model that relates the response to the factor effects. rather, it uses a calculated response average to identify factors and their levels that yield maximum quality or quantity of the considered response (in this research the amount of recombinant protein). to compute the average performance or mean effect of any factor, amount of recombinant protein of experiments related to any level of each parameter are added and divided by the number of such trials. after the pcr reaction, the product was confirmed by the emergence of a bp band on % agarose gel electrophoresis at about bp (fig. a) . to clone the v antigen in ptxb , the pcr product of the previous step and ptxb were digested by sapi and ndei restriction enzymes, and the product was ligated into the ptxb . two colonies were randomly selected and ptxb -v double digestion was performed using psti and ndei enzymes (we could not use sapi enzyme because this site was disrupted after cloning, and the restriction site of psti is present on ptxb but not in the sequence of the cloned v antigen). according to fig. b , the emergence of a bp band on % agarose gel electrophoresis confirmed the production of a new plasmid ptxb -v (fig. c) . the ptxb -v plasmid was transferred into e. coli er and induced by iptg. the protein expression was assessed by using sds-page and western blotting at and h postinduction. a protein size of about -kda confirmed the expression of v antigen together with int and cbd (fig. ) in which the molecular weight of int and cbd was about kda and the molecular weight of v antigen was about kda. the v-int-cbd fusion protein was purified using chitin beads affinity chromatography. to this end, the lysed cells were passed through a column packed with chitin beads. after washing and removal of non-specific proteins, most of the fusion precursors were attached to the resin due to the high affinity of cbd to chitin beads. a lysis buffer containing mm dithiothreitol (dtt) as a cleavage disulfide bond was added into the column and incubated at room temperature for h to release v antigen. in this step, int and cbd were bound to chitin column. finally, to remove int-cbd from the resin, a % sds stripping buffer was flowed through, and cbd and int ( kda) were eluted from the chitin resin. figure shows the purification step of v antigen with an int tag on sds-page. the results of recombinant protein optimization using the taguchi method are shown in table . table shows the analysis of variance (anova) for the responses of recombinant protein production carried out according to the factors' contribution ( ), the output after passing the cell lysis from column ( ), output of washing column with buffer ( ), initial output after induction by cleavage buffer ( ), purified protein after h ( , , and ), the chitin beads after cleavage step ( ), output after striping step ( ) by the taguchi method. in addition, the optimal conditions were obtained by analyzing the final concentration of the recombinant fusion protein and the variance of productivity (anova). the average effect of the investigated four factors has been plotted in fig. for visual inspection. the higher the slope of the graph of each factor in these figures indicates the amount of factor effect on the amount of produced protein. hence, it can be concluded that the type of medium had the greatest influence on the production of the recombinant fusion protein. also, the induction temperature had the greatest effect impact on the protein production, while iptg concentration had the least effect (fig. ) . furthermore, the results showed that test had the optimum conditions for the production of an int-v antigen fusion protein in e. coli. figure shows the effect of different conditions including ( , ) , test level one ( ), two ( ), three ( ), protein size marker ( ) , test level four ( ) , five ( ), six ( ) , seven ( ), eight ( ) and nine ( ) temperature, iptg concentration, and medium on the v antigen expression. after optimizing the production conditions, the cells were incubated in the cleavage buffer after incubation at °c for h to obtain a protein with a high purity of % and a suitable efficiency, as shown in fig. . there is no safe or effective plague vaccine. the world health organization classifies the plague as a recurrent disease and as a class a pathogen causing an epidemic disease [ ] . killed whole-cell vaccines can only prevent pests and live attenuated vaccines, making guinea pigs and mice immune to plague, although this vaccine has health problems for humans [ , ] . today, researchers emphasize subunit vaccine development. for decades, humans have been focusing on two recombinant proteins, yersinia pestis, f and v proteins, as vaccine candidates [ , ] . in order to facilitate the purification of recombinant proteins, soluble and affinity tags, such as schistosomiasis glutathione s-transferase (gst), escherichia coli maltose-binding protein (mbp), transcription termination anti-termination factor (nusa), and peptide tags, such as poly his and poly arg, are often used [ ] [ ] [ ] [ ] . since the affinity tag can affect the activity and/or structure of the target protein, it is typically cleaved from the target protein by a protease. therefore, the isolation of protease and target protein fusion markers requires some additional purification steps [ ] . in addition, proteases are generally inefficient and non-specific, resulting in partially cleaved products. in an intein (int)-based purification system, a modified int fusion that induces a cleavage reaction at one end is catalyzed between the chitin-binding domain (cbd) and the recombinant target protein [ ] . using the natural self-splicing mechanism of int, genetic engineers are able to produce commercial int vectors to obtain recombinant proteins that do not require enzymatic or cofactor removal labels [ ] . the most important feature of int tag in recombinant protein production is its self-cleavability with the addition of amino acids at the end of the target protein [ ] . furthermore, this method does not require proteases enzyme and time consuming steps for the removal of target proteins. wood and co-authors believed that this method combined with other methods not only reduces the time and cost of recombinant protein production but also improves the efficiency of the final product [ , , ] . co-expression of v antigen with a suitable carrier protein results in the protease being less susceptible to degradation during purification [ ] . for this purpose, different expression systems are used to increase the production of v antigen. v antigen with gst and int-cbd was produced in a relatively fused form in using three expression systems and after cleavage and purification of the target protein, its final concentration was (pgex- x- ), (pgex- p- ), and . mg/l [ ] . in the present study, we first cloned and optimized the expression and purification of v antigen in the ptxb vector containing the -kda-int tag and -kda-cbd. v antigen production was examined using the int tag to assess the self-cleavage properties of int and to optimize culture conditions to increase protein production. the results of the taguchi method showed that the yield increased to % ( . g/l). recent studies have also shown that the molecular weight of hybrid proteins increases by kd compared to the native protein, but it has no negative effect on protein expression. on the other hand, intthiol induction demonstrated suitable v antigen purification (fig. ) and this strategy was used for large-scale production of proteins and int self-cleavage can be performed at °c without dtt. in summary, the ptx-v expression system can be used as a suitable substitute for the industrial production of vaccines against sub-units of plague. the yersiniae: a model genus to study the rapid evolution of bacterial pathogens yersinia pestis-etiologic agent of plague genome sequence of yersinia pestis, the causative agent of plague vaccination against bubonic and pneumonic plague plague vaccine development: current research and future trends plague vaccines: current developments and future perspectives map of f and v antigens from yersinia pestis astride innate and adaptive immune response lcrv plague vaccine with altered immunomodulatory properties expression of a recombinant form of the v antigen of yersinia pestis, using three different expression systems recombinant v antigen protects mice against pneumonic and bubonic plague caused by f -capsule-positive and -negative strains of yersinia pestis protein splicing involving the saccharomyces cerevisiae vma iintein: the steps in the splicing pathway, side reactions leading to protein cleavage, and establishment of an in vitro splicing system protein splicing: its discovery and structural insight into novel chemical mechanisms inteins: structure, function, and evolution intein-mediated purification of a recombinantly expressed peptide single-column purification of free recombinant proteins using a self-cleavable affinity tag derived from a protein splicing element a genetic system yields self-cleaving inteins for bioseparations inteins and affinity resin substitutes for protein purification and scale up overview of tag protein fusions: from molecular and biochemical fundamentals to commercial systems utilizing the c-terminal cleavage activity of a protein splicing element to purify recombinant proteins in a single chromatographic step the cyclization and polymerization of bacterially expressed proteins using modified self-splicing inteins the taguchi methodology as a statistical tool for biotechnological applications: a critical appraisal a primer on the taguchi method laccase production by pleurotusostreatus : optimization of submerged culture conditions by taguchi doe methodology taguchi's experimental design for optimizing the production of novel thermostable polypeptide antibiotic from geobacillus pallidus sat enhanced production of insulin-like growth factor i protein in escherichia coli by optimization of five key factors molecular and cell biology aspects of plague yersinia pestis (plague) vaccines developing live vaccines against yersinia pestis enhancement of soluble protein expression through the use of fusion tags inteinmediated protein purification of fusion proteins expressed under high-cell density conditions in e. coli xhis-ni-nta chromatography as a superior technique in recombinant protein expression/purification expression and production of recombinant scorpine as a potassium channel blocker protein in escherichia coli gene fusion expression systems in escherichia coli intein as a novel strategy for protein purification cloning and expression of aequorin photoprotein using intein tag purification of antibacterial chap k protein using a self-cleaving fusion tag and its activity against methicillinresistant staphylococcus aureus the potential role of self-cleaving purification tags in commercial-scale processes new trends and affinity tag designs for recombinant protein purification active immunisation with recombinant v antigen from yersinia pestis protects mice against plague expression of a recombinant from of the v antigen of yersinia pestis, using three different expression systems publisher's note springer nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations we would like to thank the research council of malek-ashtar university of technology for the financial support of this investigation. key: cord- -ukqvhzws authors: themsakul, sirintra; suebwongsa, namfon; mayo, baltasar; panya, marutpong; lulitanond, viraphong title: secretion of m e:hbc fusion protein by lactobacillus casei using cwh signal peptide date: - - journal: fems microbiol lett doi: . /femsle/fnw sha: doc_id: cord_uid: ukqvhzws the ability to serve as a delivery vehicle for various interesting biomolecules makes lactic acid bacteria (lab) very useful in several applications. in the medical field, recombinant lab expressing pathogenic antigens at different cellular locations have been used to elicit both mucosal and systemic immune responses. expression–secretion vectors (esvs) with a signal peptide (sp) are pivotal for protein expression and secretion. in this study, the genome sequence of lactobacillus casei atcc was explored for new sps using bioinformatics tools. three new sps of the proteins cwh, sura and sp were identified and used to construct an esv based on our escherichia coli–l. casei shuttle vector, prceid-lc . . functional testing of these constructs with the green fluorescence protein (gfp) gene showed that they could secrete the gfp. the construct with cwhsp showed the highest gfp secretion. consequently, cwhsp was selected to develop an esv construct carrying a synthetic gene encoding the extracellular domain of the matrix protein fused with the hepatitis b core antigen (m e:hbc). this esv was shown to efficiently express and secrete the m e:hbc fusion protein. the identified sps and the developed esvs can be exploited for expression and secretion of homologous and heterologous proteins in l. casei. lactic acid bacteria (lab) are a heterogeneous group of bacteria that are used in various biotechnological applications. with respect to human and animal health, many lab are being used as mucosal delivery vehicles for therapeutic and prophylactic molecules because of their gras (generally regarded as safe) status (eijsink et al. ) , their probiotic properties and the ease with which they can be engineered to express heterologous proteins (ouwehand et al. ) to elicit an effective immune response (lee et al. ). up to now, a definite conclusion on the best cellular location for optimal immunization, i.e. intracellular, cell wall anchored or secreted, has yet to be reported. this seems to depend on parameters such as the bacterial type, the amount and localization of the antigen, the route of administration, etc. (wells and mercenier ) . many expression and secretion systems for lab to deliver protective antigens have already been developed (kruger et al. ; pusch et al. ) . such systems require the use of expression-secretion vectors (esvs) with a signal peptide (sp) to translocate the protein after synthesis across the bacterial cell membrane. several sps have been identified and incorporated into esvs (hols et al. ; hazebrouck, pothelune and azevedo ) . the secretion efficiency of different sps seems to be host specific. for this reason, homologous sps are thought to drive protein secretion more efficiently than heterologous sps (mathiesen et al. ) . the availability of the complete genome sequences of many lactobacillus casei strains (makarova et al. ) makes it easy to search for new and efficient sps that can be used in the construction of new esvs for this species and maybe other lactobacilli. in this study, putative sps were selected from the genome of l. casei atcc (accession no. cc . ) using bioinformatics analysis. the selected sps were tested for the secretion of a gene product encoding a green fluorescent protein (gfp) as a reporter protein and using the esv based on escherichia coli-l. casei shuttle vectors previously developed in our group (panya et al. ; suebwongsa et al. ) . the sp with the highest efficiency was then used to drive the secretion of a synthetic gene coding for an m e:hbc fusion protein. m e is a highly conserved protein of the influenza a virus that has been incorporated as an antigen into many influenza vaccine formulations. the immunogenicity of m e has been reported to be enhanced when fused with the hepatitis b core antigen (hbc) (de filette et al. ) . the final construct, plc-cwh:m e:hbc, successfully expressed and secreted the fused antigenic protein. the bacterial strains, plasmid vectors and oligonucleotide primers used in this study are listed in table . lactobacillus strains were grown statically in de man rogosa and sharpe (mrs) medium (labm, heywood, lancashire, uk) at • c. escherichia coli dh α was cultured in luria-bertani (lb) broth (labm) at • c with shaking. agarified media were prepared by adding g l − bacteriological agar to the corresponding broth. when required, appropriate antibiotics (sigma-aldrich, st louis, mo, usa) were added to the media as follows: erythromycin for l. casei at . μg ml − , and ampicillin for e.coli at μg ml − . all dna manipulation procedures were performed as described by sambrook and russell ( ) . plasmids from e. coli were isolated and purified using the hiyield plasmid mini kit (rbc bioscience, new taipei city, taiwan). plasmids from l. casei were extracted as described by o'sullivan and klaenhammer ( ) . genomic dna from l. casei was extracted using the genelute bacterial genomic dna kit (sigma-aldrich). pcr amplification was performed using taq dna polymerase (invitrogen, carlsbad, ca, usa). amplicons were purified from agarose gels using the hiyield gel pcr dna fragments extraction kit (rbc bioscience). the nucleotide sequences of cell surface-associated proteins of l. casei atcc were retrieved from the ncbi microbial genomes database (http://www.ncbi.nlm.nih.gov/genomes/ lproks.cgi). the signalp . program (http://www.cbs.dtu.dk/ services/signalp/) was used to predict the putative signal peptides. the likely location of the proteins was predicted with the programs psortb v . (http://www.psort.org/psortb/), subloc v . (http://www.bioinfo.tsinghua.edu.cn/subloc/pro predict.htm) and gpos-mploc (http://www.csbio.sjtu.edu.cn/ bioinf/gpos-multi/). dna sequences encoding sps of cell surface-associated proteins with a clear sp cleavage site and an extracellular location were selected for the construction of esvs. to determine whether the three selected sps (i.e. cwhsp, surasp and sp ) could drive heterologous protein secretion in l. casei, an esv based on each of these was constructed. sp strength was analyzed using the gfpuv-encoding gene as a reporter. sequences encoding the three sps were amplified from genomic dna of l. casei atcc under the following conditions: one cycle of • c for min; cycles of • c for s, • c for s, • c for s; and • c for min. the lengths of the amplicons, cwhsp, surasp and sp , were bp, bp and bp, respectively. the gfpuv gene fragment, recovered as a kpni/spei-digested fragment from pgfpuv, was cloned into pcwh, psura and psp to generate pcwhsp:gfpuv, psursp:gfpuv and psp :gfpuv, respectively. the cwh:gfpuv fusion gene was obtained by double digestion of the pcwh:gfpuv with hindiii/spei and cloned into hindiii/spei-digested pldh-pro , a vector containing the constitutive promoter (p ldh ) and ribosome binding site (rbs ldh ) of the lactate dehydrogenase gene (genbank accession no. d . ), resulting in pldh:cwh:gfpuv. using the same procedure, pldh:sur:gfpuv and pldh:sp :gfpuv were also obtained. the ldh:cwh:gfpuv dna fragment obtained as a aatii/spei-digested pldh:cwh:gfpuv was cloned into aatii/speidigested prceid-lc . resulting in plc-cwh:gfpuv. the same strategy was followed to obtain the recombinant esvs containing gfpuv with sursp and sp , designated as plc-sur:gfpuv and plc-sp :gfpuv. figure shows the construction diagram of plc-cwh:gfpuv and a similar procedure was used for the generation of the pldh:sur:gfpuv and pldh:sp :gfpuv. all three recombinant plasmids were verified by dna sequencing (data not shown) and were independently electrotransformed into l. casei rceid . the preparation of competent cells and the electroporation protocol were as described elsewhere (chassy and flickinger ) . to determine the secretion efficiency, the fluorescence intensity of culture supernatants derived from recombinant l. casei rceid harboring plc-cwh:gfp, plc-sur:gfp and plc-sp :gfp was determined in triplicate. cultures of the above three recombinant constructs were incubated at • c for h with shaking at rpm until an optical density of . at nm (od ) was reached. supernatants of the cultures were harvested and the fluorescence intensity was measured with a fluorometer (cary eclipse, victoria, australia), as previously described (wu and chung ) . as a negative control, the culture supernatant of l. casei rceid harboring the expression vector without signal peptide (plc-gfpuv) was used. transformants containing plc-cwh:m e:hbc were cultured at • c for h with shaking at rpm until an od of . was reached. the supernatant and cell pellet were collected. the supernatant was concentrated -fold using centricons with a molecular weight cut-off of kda (pall life sciences, ny, usa). the total cell extract and concentrated supernatant were electrophoresed in % sodium dodecyl sulphate-polyacrylamide gel electrophoresis (sds-page) followed by western blotting. the m e:hbc fusion protein on the membrane was detected with mouse anti-m monoclonal antibody (abcam, cambridge, uk) at a dilution of : . horseradish peroxidase-conjugated goat anti-mouse igg (abcam) was used as a secondary antibody at a dilution of : . the signal was developed with a chemiluminescent substrate reagent (thermo fisher scientific, rockford, il, usa) and recorded with a digital camera (image quant tm las , uppsala, sweden). total cell extracts from the transformants with plc-cwh:m e:hbc and those with plc-m e:hbc were used as positive controls for m e:hbc expression. supernatants from the transformants with the plc-m e:hbc construct and those with prceid-lc . were used as negative controls. seventy-seven deduced amino acid sequences of l. casei atcc genes encoding cell surface-associated proteins were retrieved from the ncbi database and analyzed with the sig-nalp . program to predict likely sp sequences (data not shown). the program generates a parameter called the d score, which is used to discriminate signal peptide from non-signal peptide sequences. it also indicates the presumptive location of the signal peptide cleavage site. six proteins with a d score above the cutoff value of . contained good predictions for sp sequences ( table ). the d score of these proteins and the likely cleavage sites are summarized in table . cellular location of each of the six proteins were predicted using three different programs. five, three and one of six proteins were predicted to be extracellular by gpos-mploc, psortb and subloc programs, respectively. based on this bioinformatics analysis, sps of the proteins encoded by the orfs yp˙ . , yp . and yp . were selected. these were designated as cwhsp, sursp and sp , respectively. to determine the secretion efficiency of the selected sequences, esvs based on each sp with the gfpuv gene as a reporter were constructed and designated plc-cwh:gfpuv, plc-sur:gfpuv and plc-sp :gfp, respectively. these esvs were independently transformed into l. casei rceid , resulting in the recombinant strains rceid :cwhsp, rceid :sursp and rceid :sp . the fluorescence intensity of culture supernatants of these strains, and that of l. casei rceid plc-gfpuv (used as a negative control), was determined. the highest fluorescence intensity value was shown by the cwhsp ( . ), followed by that of surasp ( . ) and sp ( . ) (fig. ) . the cwhsp was selected for constructing the esv containing the m e:hbc fusion gene, i.e. plc-cwh:m e:hbc. western blot analysis of the supernatant from l. casei cells containing plc-cwh:m e:hbc showed a prominent band with an apparent molecular weight of around . kda (fig. , lane ) , which presumably corresponded to the expected fusion protein. the supernatant from an intermediate construct lacking cwhsp showed no band (fig. , lane ) . the m e:hbc fusion protein was detected in total cell extracts of both l. casei carrying plc-m e:hbc (fig. lane ) and plc-cwh:m e:hbc (fig. lane ) . no protein band was detected in either total cell extracts (fig. , lane ) or supernatant (fig. , lane ) of the negative control strain. engineered lab strains constitute bacterial systems that may serve as alternative mucosal delivery vehicles for a variety of biomolecules (wells and mercenier ) . recombinant lab can be used as vaccine vehicles to elicit both secretory iga and systemic antibody responses against the delivered antigen (neutra and kozlowski ) . in this study we constructed recombinant l. casei cells expressing m e:hbc in a secreted form with the use of novel sps derived from the l. casei genome. several sps derived from cell surface-associated proteins and secreted proteins of lab have been identified and used to construct distinct esvs (wu and chung ; mathiesen et al. ) . of these, the sp of usp sp from lactococcus lactis (daniel et al. ) is the most widely used heterologous sp for protein secretion in lab species (dieye et al. ; mathiesen et al. ) . previous studies in gram-positive bacteria have shown that the secretion efficiency depends not only on the sp but also on the protein to be secreted and on bacterial host (dieye et al. ; mathiesen et al. ) . nonetheless, studies on the sp functionality for heterologous protein secretion in lactobacillus plantarum wcfs found that the homologous sps have similar or higher secretion efficiencies than those of heterologous origin (mathiesen et al. ) . aimed to express and secrete m e:hbc in l. casei, we screened for homologous sps using a bioinformatics approach. in our hands, the best sp predictor is signalp . (petersen et al. ) . this program has been successfully used to select efficient sps from the protein pool of l. plantarum wcsf (mathiesen et al. ) . the subloc, psortb and gpos-mploc programs were also used to predict the localization of the proteins produced. using these programs, three putative sps (cwhsp, surasp and sp ) were selected from cell surface-associated proteins of l. casei atcc and used for construction of esvs with gfp as reporter protein. the reason that the subloc program predicts cwhsp and sp as periplasmic might be due to the fact that this program uses a dataset based on that of reinhardt and hubbard ( ) . this dataset contains a small number of sequences of both extracellular and periplasmic groups for training the neural network, which may result in a less accurate prediction. based on these esv constructs, it was found that all three sps can function for gfp secretion in l. casei rceid , and that the construct using cwhsp provides the highest gfp secretion. previous studies found that an efficient signal peptide in gram-positive bacteria generally contains the consensus sequence vla-x-ala↓ala or ala-x-ala↓ala as the sp cleavage site (nielsen et al. ; mathiesen et al. ) . in this study, all three selected sps have the consensus motif val-x-ala at position − to − relative to the putative cleavage site. however, none of the selected sps contained ala at position + . in addition, the most efficient sp, cwhsp, harbors ser at position + , which indicates ala at this position is not required. cwhsp was further used to construct an esv containing m e:hbc. recombinant l. casei carrying this construct expressed and secreted m e:hbc successfully, demonstrating the usefulness of bioinformatics to predict and select novel sps for l. casei. however, this approach does not guarantee that the sps selected will function, since protein secretion depends on the sp itself, the genetic background of the host strain and the target protein. in addition, secretion efficiency may be affected by the extent to which protein production levels and rates are adapted to the capacity of the translocation machinery, the influence of sp sequence on mrna stability and its translation efficiency (mathiesen et al. ) . for these reasons, to maximize the production of a secreted protein in lactobacillus, and before optimization and balancing of other factors, it might be worthwhile selecting an optimal sp for each individual protein. besides conferring only short-term immunity, a major drawback of the current influenza virus a vaccines is that they confer only short-term subtype-specific protection. thus, keeping vaccines up-to-date with influenza a antigens requires constant monitoring of the subtypes of the circulating viruses (johansson and brett ) . this inconvenience has led to much discussion about the development of a universal influenza a vaccine, able to provide protection against all virus subtypes (du, zhou and jiang ) . such a vaccine would require identification of a viral component conserved through all subtypes of influenza a virus for eliciting a broad-spectrum immunity (stanekova and vareckova ) . the extracellular domain of the m protein (m e) of the influenza virus is one such conserved component (ebrahimi and tebianian ) . however, this is a short peptide of only amino-acid residues. in this study, the m e-encoding gene was fused with the gene coding for the hepatitis b core antigen (hbc) protein in order to increase its immunogenicity. in a previous study, strong immunogenicity and full protection were obtained in mice after either intraperitoneal or intranasal administration of the m e:hbc fusion protein (de filette et al. ) . considering their gras status, the use of l. casei as an antigen delivery vehicle can confer great advantage for the expression of the m e:hbc fusion protein. as a conclusion, in this study, we used a bioinformatics approach to select potential sps from the l. casei atcc genome. the three selected sps, which include those of cwh, sura and proteins, proved to direct secretion of the gfp in l. casei. cwhsp, which showed the highest secretion efficiency, was then used to develop an esv construct expressing and secreting the fused m e:hbc protein. this sp and the esv developed may further serve for the expression and secretion of a variety of homologous and heterologous proteins in l. casei and surely in other lab species. transformation of lactobacillus casei by electroporation recombinant lactic acid bacteria as mucosal biotherapeutic agents universal influenza a vaccine: optimization of m -based constructs design of a protein-targeting system for lactic acid bacteria research and development of universal influenza vaccines influenza a viruses: why focusing on m e-based universal vaccines production of class ii bacteriocins by lactic acid bacteria; an example of biological warfare and communication efficient production and secretion of bovine β-lactoglobulin by lactobacillus casei. microb efficient secretion of the model antigen m -gp e in lactobacillus plantarum ncimb changing perspective on immunization against influenza in situ delivery of passive immunity by lactobacilli producing single-chain antibodies mucosal immunization with surfacedisplayed severe acute respiratory syndrome coronavirus spike protein on lactobacillus casei induces neutralizing antibodies in mice comparative genomics of the lactic acid bacteria heterologous protein secretion by lactobacillus plantarum using homologous signal peptides mucosal vaccines: the promise and the challenge identification of prokaryotic and eukaryotic signal peptides and prediction of their cleavage sites high-and low-copy-number lactococcus shuttle cloning vectors with features for clone screening probiotics: an overview of beneficial effects sequencing and analysis of three plasmids from lactobacillus casei tistr and development of plasmid-derived escherichia coli-l. casei shuttle vectors signalp . :discriminating signal peptides from transmembrane regions bioengineering lactic acid bacteria to secrete the hiv- virucide cyanovirin using neural networks for prediction of the subcellular location of proteins molecular cloning: a laboratory manual conserved epitopes of influenza a virus inducing protective immunity and their prospects for universal vaccine development cloning and expression of a codon-optimized gene encoding the influenza a virus nucleocapsid protein in lactobacillus casei coli host strains significantly affect the quality of small scale plasmid dna preparations used for sequencing mucosal delivery of therapeutic and prophylactic molecules using lactic acid bacteria green fluorescent protein is a reliable reporter for screening signal peptides functional in lactobacillus reuteri we would like to acknowledge prof. david blair for editing the manuscript via the kku publication clinic. this study was supported by the higher education research promotion and national research university project of thailand, office of the higher education commission, thailand and by the invitation research grant (i ) from the faculty of medicine, khon kaen university. none declared. key: cord- -p v wi authors: bigot, yves; samain, sylvie; augé-gouillou, corinne; federici, brian a title: molecular evidence for the evolution of ichnoviruses from ascoviruses by symbiogenesis date: - - journal: bmc evol biol doi: . / - - - sha: doc_id: cord_uid: p v wi background: female endoparasitic ichneumonid wasps inject virus-like particles into their caterpillar hosts to suppress immunity. these particles are classified as ichnovirus virions and resemble ascovirus virions, which are also transmitted by parasitic wasps and attack caterpillars. ascoviruses replicate dna and produce virions. polydnavirus dna consists of wasp dna replicated by the wasp from its genome, which also directs particle synthesis. structural similarities between ascovirus and ichnovirus particles and the biology of their transmission suggest that ichnoviruses evolved from ascoviruses, although molecular evidence for this hypothesis is lacking. results: here we show that a family of unique pox-d ntpase proteins in the glypta fumiferanae ichnovirus are related to three diadromus pulchellus ascovirus proteins encoded by orfs , and . a new alignment technique also shows that two proteins from a related ichnovirus are orthologs of other ascovirus virion proteins. conclusion: our results provide molecular evidence supporting the origin of ichnoviruses from ascoviruses by lateral transfer of ascoviral genes into ichneumonid wasp genomes, perhaps the first example of symbiogenesis between large dna viruses and eukaryotic organisms. we also discuss the limits of this evidence through complementary studies, which revealed that passive lateral transfer of viral genes among polydnaviral, bacterial, and wasp genomes may have occurred repeatedly through an intimate coupling of both recombination and replication of viral genomes during evolution. the impact of passive lateral transfers on evolutionary relationships between polydnaviruses and viruses with large double-stranded genomes is considered in the context of the theory of symbiogenesis. approximately two-thirds of these wasps are endoparasites, meaning that the larval stages develop within the body cavity of their hosts, typically other insects. among the most successful of these endoparasitic wasps are those that use lepidopteran larvae as hosts. owing to the economic importance of these insects and the utility of their wasp parasites as biological control agents, the ability of these parasites to develop within lepidopteran hosts without triggering an intense immune response has been the subject of numerous studies over the past forty years. early studies of the mediterranean flour moth, ephestia kuhniella, parasitized by the ichnemonid, venturia canescens, showed that eggs of this species are coated with particles that resemble virions [ ] [ ] [ ] and contain surface proteins that mimic host proteins, thus keeping the eggs and larvae from being recognized as foreign material by their host. these particles lack dna, and thus are not considered virions [ ] . with respect to both species number and mechanisms that lead to successful parasitism, endoparasitic wasps are known to inject secretions at oviposition, but only a few lineages use viruses or virus-like particles (vlps) to evade or to suppress host defences. in the family ichneumonidae, for example, four types of host defence suppression mediated by the injection of fluids or suspensions are known that lead to successful parasitism. ) fluid injected with eggs bypasses host defences without the aid of viruses or vlps [ ] . ) wasps inject a virus that replicates in both the wasp and lepidopteran host. one example is the wasp diadromus pulchellus, which injects an ascovirus, dpav [ ] into host pupae to circumvent host defence response. ) the wasp injects vlps capable of molecular mimicry and/or direct defence suppression. ) the wasp injects polydnavirus particles that contain genes coding for proteins that interfere with host defence responses. the last mechanism is by far the best-studied type of direct immune suppression by ichneumonid wasps, and occurs in many species belonging to genera campoletis, hyposoter and tranosema (ichneumonidae, campopleginae), and glypta (ichneumonidae, banchinae) [ ] . in these cases, female wasps inject eggs along with ichnovirus particles into their hosts. similarly, in certain lineages of endoparasitic braconid wasps, other types of immunosuppressive particles containing dna occur in the fluid injected along with eggs [ [ ] ; for a review, [ ] ]. once in the host, ichneumonid and brachonid particles enter host nuclei and their dna is transcribed, producing proteins that selectively suppress various steps in the host defence response. as a result of this unusual biology, these particles were described as symbiotic viruses belonging to new viral family, polydnaviridae [ ] [ ] [ ] since the 's, it was assumed that the dna in the polydnavirus particles, as with all other viruses, encoded typical enzymes and proteins for viral replication and virion assembly and structure. however, several recent genomic studies have shown that only a small number of the genes vectored into lepidopteran hosts, less than %, have homologs in other viruses. most viral dna is noncoding, except that which codes for wasp proteins involved in suppression of immune pathways, such as phenoloxidase activation and the toll pathways [ , , ] . even before these genomic studies, it was suggested that these particles were more similar to organelles than viruses [ ] . the similarities between particle structure and virions of known types of complex dna insect viruses are striking, and suggest these immunosuppressive particles originated by symbiogenesis between viruses and endoparasitic wasps, the same evolutionary process by which mitochondria and plastids originated from symbiotic bacteria [ ] . for example, most braconid wasps produce enveloped bacilliform particles classified as bracoviruses, and these resemble baculovirus and nudivirus virions [ , ] . similarly, ichneumonid wasps produce enveloped spindle-shaped particles classified as ichnoviruses that resemble virions of ascoviruses, viruses lethal to lepidopterans, which, interestingly, are vectored by endoparasitic wasps [ ] . it must also be noted that ichnoviruses resemble other true virus particles that are structurally very similar to virions of ascoviruses, but which remain unclassified because the lack of information about their genomes [ ] [ ] [ ] [ ] [ ] . however, ascoviruses and ichnoviruses display very different genome properties; similar genomic differences occur between bracoviruses and baculoviruses or nudiviruses, suggesting that convergent evolution led to the origin the different polydnavirus types from at least two different types of viruses. in ascoviruses, the genome consists of a single circular dna molecule ranging from -to -kpb in size [ ] . phylogenetic analyses of several viral genes have revealed that ascoviruses are closely related to iridoviruses [ ] , and likely evolved from them. in contrast, the genome of ichnoviruses is composed of multiple circular dna molecules ( to ) representing a total size of to kbp, all of which are replicated from the wasp chromosomes. the ichnovirus proviral genome is specifically excised and amplified in several segments in the female calyx cells, the only wasp tissue in which ichnovirus virogenesis occurs. after assembly, these particles are secreted into the female genital tract. once injected into the host, the ichnovirus genome does not replicate, and does not lead to the production of a new virus generation. the third characteristic of ichnoviruses is that most of the genes borne by the particles are not related to viral genes. among the annotated ichnovirus gene families, there are four (rep, prrp, n, and trv) for which no homology with known eukaryotic (or prokaryotic) proteins has been detected and for which no function has been proposed. among the remaining three (cys, ank and inx), cys-motif proteins have no clear homologs among eukaryotic (or prokaryotic) proteins, although the "cysteine knot" that they form is a folding domain found in many proteins, but not one that is necessarily related to eukaryotic host immune systems [ , ] . however, some protein domains and their putative functions suggest that they might be related to regulatory components of eukaryotic host defence systems that are not sufficiently elucidated. although the resemblance of the polydnavirus virions to those of conventional insect viruses suggests that the former evolved from the latter, to date no molecular evidence supports this hypothesis. in the case of ascoviruses and ichnoviruses, well-conserved genes found among the three ascoviruses sequenced so far (sfav a [ ] , tnav c [ ] , and hvav e [ ] ) are not found in ichnovirus genomes. as noted above, the principal reason for this is that the genomes of the latter viruses appear to contain mainly wasp genes, not viral genes. this highlights the need for new and alternative types of sequence data obtained from pertinent biological systems. in this regard, dpav has features that could provide important insights. indeed, it is the only ascovirus known to replicate in both its wasp and caterpillar hosts. it is transmitted vertically from wasp to caterpillars to suppress the defence response of the latter host, thereby enabling parasite development [ , ] . moreover, in males and females of d. pulchellus, the dpav genome resides in the nuclei of all hosts cells, providing a possible example of what may have been an intermediate stage in the symbiogenesis that led to the evolutionary origin of ichnoviruses. we recently sequenced the dpav genome, and a combination of our analysis of this genome and recent data from new types of ichnoviruses, as well as new software programs that elucidate protein relationships based on structural analysis, have enabled us to detect phylogenetic relationships between proteins encoded by open reading frames of dpav and the glypta fumiferanae (gfiv) and campolitis sonorensis (csiv) ichnoviruses. in support of the symbiogenesis hypothesis for the origin of ichnoviruses, data and analyses suggest two independent symbiogenic events, in agreement with what was previously proposed [ ] . the first led to the ichnoviruses in banchinae lineage. this hypothesis is based on the occurrence of a gene cluster present in gfiv and dpav . the second symbiogenic event led to ichnoviruses in the campopleginae wasp lineage. this hypothesis is based on relationships of the major capsid proteins among csiv, ascoviruses and iridoviruses. extending our investigations to proteins encoded by open reading frames of certain ascoviruses and bracoviruses, hosts and bacteria, in the light of recent analyses about the involvement of the replication machinery of virus groups related to ascoviruses in lateral gene transfer [ ] , we discuss the robustness and the limits of the molecular evidence supporting an ascovirus origin for ichnovirus lineages. the dpav genome sequenced by genoscope (france) is , -bp in length. its organization, gene content and evolutionary characteristics will be detailed in a separate publication (manuscript in preparation; additional file ). however, blast results obtained with several orfs in the dpav genome provide evidence that certain ichnovirus orfs have their closest relatives in an ascovirus genome. specifically, we identified a -kbp region that contains a cluster of three genes ( fig. , orf , and ; additional files and ) that have close homologs in a gfiv gene family composed of seven members [ ] . all contain a domain similar to a conserved domain found in the pox-d family of ntpases. to date, this pox-d domain has been identified as a ntp binding domain of about amino acid residues found only in viral proteins encoded by poxvirus, iridovirus, ascovirus and mimivirus genomes. these genes seem to be specific to gfiv, as they are absent in the three sequenced genomes of other ichnoviruses, namely csiv, tranosema rostrales ichnovirus (triv), and hyposoter fugitivus ichnovirus (hfiv). more specifically, in dpav , orf encodes a protein of amino acid residues that is % similar from position to to a protein of amino acid residues encoded by the orf contained in the segment c in the gfiv genome (fig. ) . these two proteins can therefore be considered putative orthologs. the c-terminal residues of this dpav protein are also % similar to the cterminal domain of the protein homologs encoded by the orf of the d and d gfiv segments, % similar to the n-terminal and the c-terminal domains of the protein encoded by the orfs r and l of the iridovirus civ and lcdv, and % similar with those encoded by orfs , and in the ascovirus genomes of hvav e, sfav a and tnav c, respectively. overall, this indicates that this dpav protein is more closely related to that of gfiv than to those found in other ascovirus and iridovirus genomes currently available in databases. orf encodes a protein of amino acid residues similar only with the c-terminal domain of three proteins encoded by the orfs , and , contained, respectively, in gfiv segments d , d and d . in contrast, orf is closer to iridovirus and ascovirus genes than to gfiv genes. this protein of amino acid residues is % similar over all its length to civ orf r orthologs in all iridoviral and ascoviral genomes and is only % similar over amino acid residues to the c-terminal domain of the gfiv protein homologs encoded by the orf , , , , and in, respectively, the c , c , d , d , d and d segments of this virus. analysis of the genes surrounding the dpav orf- - - cluster confirms that this virus has an ascovirus origin since this region contains orfs that are close homologs of genes in iridovirus and ascovirus genomes. upstream from the orf- - - cluster, an orf encoding the dna-dependent rna polymerase subunit c is present, which is an ortholog of the iridoviral civ orf r and the ascoviral sfav a orf . downstream from this cluster, there are two genes, absent in known ascoviral genomes, but similar to the iridoviral civ orf l and civ orf l. these two genes encode, respectively, a chromosomal replication initiation protein and zinc finger protein. in between them, a gene encoding a small protein is present that is similar to that encoded by the orf l of the iridovirus civ, and which corresponds to the ali-like protein also found in entomopoxviruses [ ] . since the three dpav genes have relatives in all ascovirus and iridovirus genomes sequenced so far, their presence in the dpav genome cannot result from a lateral transfer that occurred from an ichnovirus genome related gfiv to dpav . thus, as these dpav genes are the closest relatives of the pox-d gene family present in gfiv identified so far, they could be considered a landmark of the symbiogenic ascovirus origin of the ichnovirus lineage to which this polydnavirus belongs. an alternative explanation is that the presence of dpav -like genes in the genome of gfiv resulted from a lateral transfer from viral genomes closely related to those of gfiv and dpav . indeed, this might have happened when a glypta wasp was infected by an ancestral virus related to dpav . nevertheless, the symbiogenic origin of gfiv from ascoviruses is also supported by morphological features of its virions [ ] , which, aside from similarities in shape, also show reticulations on their surface in negatively stained preparations, a characteristic of the virions of all ascovirus species examined to date [ ] . because ascovirus virions and ichnovirus particles display structural similarities, we developed an approach to search for homologs of virion structural proteins in ichnoviruses. these approaches were initiated in and recently finalized, but some of the conclusions have been published [ ] . to date, only two virion proteins from the campoletis sonorensis ichnovirus (csiv) have been characterized [ , ] . the first is the p (acc n° aad ), a structural protein that appears to be located as a layer between the out envelope and nucleocapsid, and the second, p , a capsid protein (acc n° af ). presently, there are more than one hundred ascoviral or iridoviral mcp sequences in databases. blast searches using these sequences failed to detect any similarities between csiv virion proteins and ascoviral or iridoviral mcps, or any other proteins [ ] . to evaluate the possibility that homology between ichnovirus and ascovirus virion proteins may simply not be detectable by conventional blastp searches, we used a different method, wapam (weighted automata pattern matching; [ ] ). the models were designed on the basis of a previous study [ ] demonstrating that mcp encoded by ascovirus, iridovirus, phycodnavirus and asfarvirus genomes are related, and all contain conserved domains separated by hinges of very variable size. we investigated these conserved domains further using hydrophobic cluster analysis (hca, [ ] ). this map of the -kbp region of the dpav genome (embl acc. n° cu and cu ) that contains the gene cluster with direct homologs in the genome of the glypta fumiferanae ichnovirus amino acid sequence comparison resulting from a blast search done with the dpav orf as a query, and the best hit corresponding to the protein encoded by the orf of the ichnovirus segment gfv-c (subject; genbank acc. n° yp_ ) figure amino acid sequence comparison resulting from a blast search done with the dpav orf as a query, and the best hit corresponding to the protein encoded by the orf of the ichnovirus segment gfv-c (subject; genbank acc. n° yp_ ). analysis revealed that most conservation occurred at the level of hydrophobic residues, as expected for structural proteins (additional file a and b). the size variability of the hinges between conserved domains and the conservation of hydrophobic residues might explain why blast searches using iridoviral and ascoviral mcp sequences have limited ability to detect mcp orthologs in phycodnavirus and asfarvirus genomes. we designed two syntactic models (see materials and methods), which together were able to specifically align all mcp sequences of the four virus families. importantly, wapam aligned the csiv ichnovirus p structural protein with both models. complementary structural and hca confirmed the presence of the seven conserved domains in this csiv structural protein ( fig. a and additional file c). in addition to the above analysis, ten syntactic models were developed using proteins conserved in the three sequenced ascovirus species (sfav a, tnav c, and hvav a) and twelve iridoviruses [ ] . none of these and , typed in black) , dpav (lanes and , typed in blue) and sfav a (lanes and , typed in purple) . conserved positions among the amino acid sequence of csiv and those of dpav and sfav a are highlighted in grey. secondary structures in the three sfav a orf orthologs were calculated with the network protein sequence analysis at http://npsa-pbil.ibcp.fr/ and the statistical relevance of the secondary structures were evaluated with psipred at http://bioinf.cs.ucl.ac.uk/psipred/. c, e and h in lanes to respectively indicated for each amino acid that it is involved in a coiled, b sheet or a helix structure. using default parameters of psipred, upper case letters indicate that the predicted secondary structure is statically significant in psipred results. significant secondary structures are highlighted in yellow. in (a), the comparisons were limited to three of the seven conserved domains (additional file a, b and c), the , and . indeed, classical in silico methods appeared to be inappropriate to predict statistically significant secondary structures in conserved structural protein rich in b strand such as iridovirus and ascovirus mcp. in contrast, a complete and coherent domain comparison was obtained by hca profiles (fig. s b, c) . , developed from small proteins encoded by the dpav orf , sfav a orf , hvav a orf , and tnav c orf in the ascovirus genomes, and iridovirus civ orf l and mimivirus miv orf r genomes, respectively. importantly, these proteins have orthologs in vertebrate iridoviruses, phycodnaviruses, and asfarvirus. in sfav a, the peptide encoded by orf is one of the virion components. in ascoviruses, iridoviruses, phycodnaviruses, and the asfarvirus, they have been annotated as thioredoxines, proteins that play a role in initiating viral infection [ ] [ ] [ ] . database mining with our model revealed four hits with csiv sequences (acc n°. m , s , af , af ) each a homolog orf of sfav a orf . in fact, these sequences correspond to several variants of a single region contained in the b segment of the csiv genome. to date, these have not been annotated in the final csiv genome, probably because they overlap a recombination site. hca analyses confirmed that the hydrophobic cores were conserved ( fig. b and additional file d and e). the chromosomal locations of genes encoding these two csiv proteins, i.e., p and p , were also consistent with the symbiogenesis hypothesis. in fact, the orf encoding p is not found in proviral dna. it is notable that no orfs encoding orthologs of p or other structural proteins such as mcps are found in any of the other three ichnovirus genomes sequenced -triv, gfiv, hfiv [ , ] . therefore, this indicates that the orthologs of ichnovirus mcps and other virion structural proteins are also probably located in the genomes of these wasps, i.e., not in proviral dna. in contrast to this, we found that the gene encoding the csiv ortholog of sfav a orf is located within the proviral dna. whether ortholog proteins are similarly involved in the triv, gfiv and hfiv biology, their genes are not found in proviral dna, since no matches were detected in their viral genomes. the phylogenetic analysis performed previously on p and the sfav a orf orthologs [ ] indicated that they have an ancestor close to that of the ascoviruses and iridoviruses. as in the case of genes encoding pox-d family of ntpases in all ascoviruses, iridoviruses, and gfiv, genes encoding virion proteins cannot result from a horizontal transfer from a campoplegine or banchine ichnovirus genome to all ascovirus, iridovirus, phycodnaviruses and asfarvirus genomes. as the ascovirus genes encoding the two virion proteins investigated here are the closest relatives of virion proteins in csiv, they can be considered a landmark reflecting the symbiogenic origin of the two ichnovirus lineages from ascoviruses closely related to dpav . in fact, the difficulty encountered in elucidating their sequence relationships can be explained by a combination of the marked transition from ascovirus to ichnovirus, and the significant selection constraints that resulted as the latter virus type evolved from the former. analysis of available ascovirus, iridovirus and ichnovirus genomes provides some of the first molecular support for the hypothesis that ichnoviruses evolved from ascoviruses by symbiogenesis. however, examining genes shared only by ascovirus, iridovirus and ichnovirus genomes likely limits the sources of genes that contributed to the evolution and complexity of these viruses, especially of the role of lateral gene transfer. relevant to this is the recent finding that an important part of the mimivirus and phycodnavirus genomes had a bacterial origin [ ] . obviously, this did not lead to the conclusion that these viruses had a bacterial origin. the cytoplasmic environment in which these viruses replicate is rich in bacterial dna because their amobae and unicellular algae hosts feed on bacteria that they digest in their cytoplasm. thus, it has been proposed [ ] that lateral transfers of bacterial dna within these viral genomes were driven by intimate coupling of recombination and viral genome replication. indeed, replication of these viruses is similar to that of bacteriophage t . this mode of replication has been called recombination-primed replication. it permits integration of dna molecules with sequence homology as short as -bp [ , ] . the replication machinery used by ascoviruses, iridoviruses, mimiviruses, phycodnaviruses, and other nucleocytoplasmic large dna viruses (ncldv) [ , ] is common to all of them, despite differences in the specifics of replication in each virus family. it can therefore be expected that recombination-primed replication occurred repeatedly during evolution of both these viruses and the genome of their eukaryotic hosts. in an eukaryotic cellular environment in which bacteria, chromosomes, ncldv viruses and non-ncldvs (such as baculoviruses) intimately cohabit temporarily or permanently, recombination-primed replication is able to allow reciprocal passive lateral transfers between viral genomes, host chromosomes, and bacterial dna. under these conditions, lateral transfers are considered passive since they just result from the intimate environment and not from an active mechanism dedicated to genetic exchanges. in ascoviruses and iridoviruses, the occurrence of such lateral transfers is supported by blastp searches that detected the presence of orfs whose closest relatives have their origin within eukaryotic genomes (e.g., for dpav , in additional data , orfs , , , , , ), bacterial genomes (e.g., for dpav , in additional data , orfs , , , , and ) or viruses belonging to other ncldv and non-ncldv families (e.g., for dpav , in additional data , orfs , , , ). the transmission of ascoviruses is unusual in that they are poorly infectious per os and appear to be transmitted among lepidopteran hosts by parasite wasp vectors at oviposition [ , ] . the genome of the ascoviruses can be replicated in presence of polydnavirus dna either within the reproductive tissues of female wasps or within the body of the parasitized hosts infected by both polydnavirus and ascovirus. consequently, integrated sequences of ascovirus origin can be expected within wasp and polydnavirus genomes. reciprocally, sequences of polydnavirus origin may have been integrated in ascovirus genomes, whatever the wasp origin, ichneumonid or braconid. one gene family related to a bacterial family of n-acetyl-l-glutamate -phosphotransferase (acc. n° of the closest bacterial relatives yp_ , cam , zp_ , zp_ ), identified only within the sfav a, hvav e and tnav c genomes, supports this conclusion. it has been found in the genome of a bracovirus, cotesia congregata bracovirus (ccbv [ ] ; fig. ). since this gene is absent in the genome of microplitis demolitor bv, a related bracovirus [ ] , it is difficult to infer the direction of the lateral transfer between the common ancestors of the three ascoviruses and of the wasp c. congregata. however, they unambiguously indicate that there was at least one lateral transfer for this gene between the common ancestor of ascoviruses and the parasitic wasp. since iridoviruses, like ascoviruses and other virus species [ , ] , are, in some cases, vectored by parasitic wasps, databases were mined using all the available ichnovirus virus proteins as queries. we found no significant relationships between csiv, hfiv and triv genomes and genomes of their putative closest relatives ncldv and non-ncldv relatives. this indicates that passive lateral gene transfers from virus to eukaryotes that are successfully spread and maintained in ichnovirus genomes remain rare events. one case of such lateral transfer was described in the ccbv genome. in this genome, aside from the presence of cardinal endogenous eukaryotic retrotranposon and polintons that transposed in the chromosomal dna of the proviral form of ccbv [ ] [ ] [ ] , two genes encoding acmnpv p -related proteins, which have their closest relatives among granuloviruses (xcgv), were found. this suggests that ccbv contained at least two cases of lateral transfers between non-ncldv and a bracovirus. our results provide another source of evidence that passive lateral gene transfers have occurred regularly during evolution from bacteria to viruses and eukaryotes, and between viruses and eukaryotes [ ] [ ] [ ] [ ] . even if the pox-d ntpase genes in the gfiv genome, and the mcp and sfav -like genes in the csiv genome, indicate that they have an ascovirus origin, they provide only limited evidence supporting an ascovirus origin of ichnoviruses. indeed, their sequence conservation and biological characteristics suggest that there were repeated lateral transfers during evolution between ascoviruses and wasp genomes, including the proviral ichnovirus loci. this raises an important issue about the role of lateral transfers during co-evolution of the ncldvs and non-ncldvs, ichnovirus, wasp and parasitized host. indeed, genetic materials of various origins have been exchanged and maintained during co-evolution. this therefore suggests that ichnoviruses might be chimeric entities partly resulting from sev- symbiogenesis was first proposed as an evolutionary mechanism when it became widely recognized that mitochondria and plastids originated from free-living prokaryotes [ ] . the genomes of the endosymbiotic cyanobacteria and proteobacteria, respectively, at the origin of chloroplasts and mirochondria have evolved by reduction of several orders of magnitude to the approximate size of plasmids. concurrently, nuclear genomes have been the recipients of plastid genomes. this relocation of the genes encoding most proteins of the endosymbiotic bacteria to the host nucleus is the ultimate step of this evolutionary process, so-called endosymbiogenesis [ , ] . recent studies of plants have revealed a constant deluge of dna from organelles to the nucleus since the origin of organelles [ ] . this allows the host cell to have the genetic control on its organelles, in a relationship that is closer to enslavement or domestication than to a symbiosis or a mutualism in which the organelles would recover benefits from their contribution to the eukaryotic cell well-being. to date, this deluge of dna is considered to correspond to passive lateral transfers that result from the interactions between the life cycle of the organelle and nuclear replication. numerous cases of symbiogenesis between endocellular bacteria and a wide variety of eukaryotic hosts have been characterized. however, recent work has demonstrated that this evolutionary process was not restricted to bacteria. it also occurred between endocellular eukaryotes such as unicellular algae and fungal endophyte in plants [ , ] . endosymbiogenesis was also proposed as the evolutionary mechanism that allowed some invertebrate viruses with a large double-stranded dna genome related to the nudiviruses and the ascoviruses [ ] , to have led, respectively, to the origin of bracoviruses and ichnoviruses, which are currently recognized as forming two genera within the family polydnaviridae. although presently there is no definitive evidence ruling out the hypothesis that the resemblance between ichnovirus and ascovirus virions is only an evolutionary convergence, the genomic differences between ascovirus and ichnoviruses are in good agreement with the symbiogenetic hypothesis. indeed, they match an evolutionary scenario of endosymbiogenesis during which, from a single integration event of symbiotic virus genome, viral genes were lost and/or translocated from the provirus to other chromosomal regions (fig. ). in parallel, host genes of interest for the wasp parasitoid were integrated and diversified by selection and gene duplication in the proviral dna. in this scenario, the more ancient symbiogenesis, the rarer the traces of genes from viral origin in the ichnovirus genome would be. this constitutes a constraint that dramatically limits the possibility to investigate the evolutionary links between ascovirus and ichnovirus. results of our analyses demonstrate that the situation is also complicated by the fact that lateral gene transfers unrelated to the origin of ichnoviruses cause important misleading background noise. moreover, the scenario in figure is close to a previously proposed version [ ] , but is not consistent with results presented here, nor with recently accumulated knowledge on dna transfer from organelles into the nucleus. since endocellular environments favour lateral transfers between virus and wasp nucleus, it can be proposed that genes of virus origin that are involved in the ichnovirus biology were passively integrated in one or several loci, step by step over time, alone or through transfers of gene clusters, or even the entire viral genome. since parasitoid wasps are able to vector different viruses [ , ] , this second scenario opens the exciting possibility that virus genes involved in the ichnovirus biology might correspond to a gene patchwork resulting from transfers from viruses belonging to different ncldv and non-nclvd families. because of the background noise due to lateral gene transfers found in these systems, elucidating the origins of ichnoviruses will be very time-consuming, requiring new accurate experimental approaches to generate more robust evidence. sequencing wasp genomes to identify proteins of viral origin that are components of virions and involved in the assembly of these may well contribute to our understanding of how ichnoviruses and bracoviruses evolved from other insect dna viruses. searches for similarities were mainly developed using facilities of blast programs at two websites http:// www.ncbi.nlm.nih.gov/blast/blast.cgi and http:genoweb.univ-rennes .fr/serveur-gpo/out ils.php ?id_rubrique= . for dpav genes having their origin within eukaryotic, bacterial or virus genomes belonging to ncldv and non-ncldv families, the closest gene was located using the distance trees supplied with each blast search at the ncbi website. construction of syntactic models: conserved amino acid blocks and positions described previously [ , ] and with new data sets were verified or determined using meme at http://meme.sdsc.edu/meme/meme.html. in the first step, we used motifs resulting from meme to make mast minings in databases at http:// meme.sdsc.edu/meme/mast.html. since meme motifs depend significantly on the data set use to calculate them, this approach did not enable an exhaustive detection of homologs among ascoviruses, iridoviruses, phycodnaviruses, mimiviruses and asfarviruses, and the detection sensitivity was ultimately very similar to that obtained with blast. to reach our detection objectives, we therefore constructed syntactic models that only included the most conserved positions and their variable spacing using wapam at the website. http://genoweb.univ-rennes .fr/ serveur-gpo/ outils_acces.php ?id_syndic= &lang=en. defining these models was obtained empirically until they allowed an exhaustive detection in refseq-protein and genbank databases of the homologs among ascoviruses, iridoviruses, phycodnaviruses, mimiviruses and asfarviruses. the procedures were done until we were only able to detect exact match with the syntactic model. whatever obtained with wapam, they required a confirmation with other approaches. here, we used psipred result comparison for regions with scores over and hca analyses for regions having scores lower than with psipred. this simplified the statistical treatment of the result obtained with wapam, since all exact matches have significance or a score of %. syntactic hypothetical mechanism for the integration and evolution of ascovirus genomes in endoparasitic wasps figure hypothetical mechanism for the integration and evolution of ascovirus genomes in endoparasitic wasps. schematic representation of the three-step process of symbiogenesis, and dna rearrangements that putatively occurred in the germ line of the wasp ancestors in the banchinae and campopleginae lineages, from the integration of an ascoviral genome to the proviral ichnoviral genome. sequences that originate from the ascovirus are in blue, those of the wasp host and its chromosomes are in pink. genes of ascoviral origin are surrounded by a thin black or white line, depending on their final chromosomal location. two solutions can account for the final chromosomal organisation of the proviral ichnovirus genome, monolocus or multilocus, since this question is not fully understood in either wasp lineage. more complex alternatives to this three-step process might also be proposed and would involve, for example, the complete de novo creation of a mono or multi locus proviral genome from the recruitment by recombination or transposition of ascoviral and host genes located elsewhere in the wasp chromosomes. this model for the chromosomal organization of proviral dna in polydnaviruses is consistent with data recently published [ ] . immune surface of eggs of a parasitic insect the resistance of insect parasitoids to the defense reactions of their hosts an insect glycoprotein: a study of the particles responsible for the resistance of a parasitoid's egg to the defence reactions of its insect host role of virus-like particles in parasitoid-host interaction of insects venom from the endoparasitic wasp pimpla hypochondriaca adversely affects the morphology, viability, and immune function of hemocytes from larvae of the tomato moth, lacanobia oleracea characteristics of pathogenic and mutualistic relationships of ascoviruses in field populations of parasitoid wasps polydnavirus genomes reflect their dual roles as mutualists and pathogens particles containing dna associated with the oocyte of an insect parasitoid family polydnaviridae. in virus taxonomy. eighth report of the international commitee on taxonomy of viruses edited by: fauquet cm virus in aparasitoid wasp: suppression of the cellular immune response in the parasitoid's host polydnaviridae -a proposed family of insect viruses with segmented, doublestranded, circular dna genomes genome sequence of a polydnavirus: insights into symbiotic virus evolution shared and species-specific features among ichnovirus genomes origin and evolution of polydnaviruses by symbiogenesis of insect dna viruses in endoparasitic wasps symbiosis in cell evolution hyenoptera: formicidae) from brazil the ultrastructure of microorganisms in the tissues of casenaria infesta (hymenoptera: ichneumonidae) apparent replication of an unusual viruslike particle in both parasitoid wasp and its host an unusual virus from the parasitic wasp cotesia melanoscela. virology viruslike particles in the ovaries of microctonus aethiopoides loan (hymenoptera: braconidae), a parasitoid of adult weevils (coleoptera: curculionidae) evidence for the evolution of ascoviruses from iridoviruses genomic sequence of spodoptera frugiperda ascovirus a, an enveloped, double-stranded dna insect virus that manipulates apoptosis for viral reproduction sequence and organization of the trichoplusia ni ascovirus c (ascoviridae) genome. virology sequenceand organization of the heliothis virescens ascovirus genome biological and molecular features of the relationships between diadromus pulchellus ascovirus, a parasitoid hymenopteran wasp (diadromus pulchellus) and its lepidopteran host, acrolepiopsis assectella dpav- , on thehemocytic encapsulation response and capsule melanization of the leek-moth pupa, acrolepiopsis assectella genomic and morphological features of a banchine polydnavirus: comparison with bracoviruses and ichnoviruses i am what i eat and i eat what i am: acquisition of bacterial genes by giant viruses the genome of melanoplus sanguinipes entomopoxvirus cloning and expression of a gene encoding a campoletis sonorensis polydnavirus structural protein a gene encoding a polydnavirus structural polypeptide is not encapsidated what does structure tell us about virus evolution? cluster of re-configurable nodes for scanning large genomic banks deciphering protein sequence information through hydrophobic cluster analysis (hca): current status and perspectives comparative genomic analysis of the family iridoviridae: reannotating and defining the core set of iridovirus genes the thioredoxin system in retroviral infection and apoptosis mimivirus giant particles incorporate a large fraction of anonymous and unique gene products cell entry by enveloped viruses: redox considerations for hiv and sars-coronavirus genetic recombination of the dna plant virus pbcv- in a chlorella alga common origin of four diverse families of large eukaryotic dna viruses evolutionary genomics of nucleo-cytoplasmic large dna viruses effects of the nonoccluded virus of spodoptera frugiperda (lepidoptera: noctuidae) on the development of a parasitoid parasitoid-mediated transmission of an iridescent virus non-poly-dna viruses, their parasitic wasp, and hosts the few virus-like genes of cotesia congragata self-synthesizing dna transposons in eukaryotes marvericks, a novel class of giant transposable elements widespread in eukaryotes and related to dna viruses evolution of viruses by acquisition of cellular rna or dna nucleotide sequences and genes: an introduction microbialgenes in the human genome: lateral transfer or gene loss? science are there bugs in our genome? science express genome-wide survey for genes horizontally transferred from cellular organisms to baculoviruses morphogenesis by symbiogenesis endosymbiotic gene transfer: organelle genomes forge eukaryotic chromosomes a cryptic intracellular green alga in ginkgo biloba: ribosomal dna markers reveal worldwide distribution forest succession suppressed by an introduced plant-fungal symbiosis unfolding the evolutionary story of polydnaviruses structure and evolution of a proviral locus of glyptapanteles indiensis bracovirus this research was funded by grants from the c.n.r.s. (pics n° ), the genoscope, the a.n.r. project in bioinformatics modulome, the ministère de l'education nationale, de yb is the leader of all aspects of the research on the biology, genomics, and evolution of dpav . ss coordinated the sequencing, assembly, and sequence quality control of the dpav genome. cag participated in the bioinformatics analysis of the dpav genome development of the manuscript. baf contributed original concepts regarding the evolutionary origins and role of polydnaviruses in endoparasitoid biology, provided virological expertise to optimize data interpretation, and participated in writing the manuscript. predicted orfs in dpav genome. key: cord- - mev otu authors: rathore, abhishek singh; sarker, animesh; gupta, rinkoo devi title: production and immunogenicity of fubc subunit protein redesigned from denv envelope protein date: - - journal: appl microbiol biotechnol doi: . /s - - -y sha: doc_id: cord_uid: mev otu dengue virus (denv) is a vector-borne human pathogen that usually causes dengue fever; however, sometime it leads to deadly complications such as dengue with warning signs (dws+) and severe dengue (sd). several studies have shown that fusion (fu) and bc loop of denv envelope domain ii are highly conserved and consist some of the most dominant antigenic epitopes. therefore, in this study, fu and bc loops were joined together to develop a short recombinant protein as an alternative of whole denv envelope protein, and its immunogenic potential as fusion peptide was estimated. for de novo designing of the antigen, fu and bc peptides were linked with an optimised linker so that the three dimensional conformation was maintained as it is in denv envelope protein. the redesigned fubc protein was expressed in e. coli and purified. subsequently, structural integrity of the purified protein was verified by cd spectroscopy. to characterise immune responses against recombinant fubc protein, balb/c mice were subcutaneously injected with emulsified antigen preparation. it was observed by elisa that fubc fusion protein elicited higher serum igg antibody response either in the presence or in absence of freund’s adjuvant in comparison to the immune response of fu and bc peptides separately. furthermore, the binding of fubc protein with mice antisera was validated by spr analysis. these results suggest that fu and bc epitope-based recombinant fusion protein could be a potential candidate towards the development of the effective subunit vaccine against denv. electronic supplementary material: the online version of this article ( . /s - - -y) contains supplementary material, which is available to authorized users. in the recent decades, dengue has emerged as one of the world's most dominant tropical diseases (guzman et al. ) with around million ( % credible interval to million) cases recorded per year, of which only million ( % credible interval to million) cases were manifested clinically (giraldo-garcía and castaño-osorio, ; bhatt et al. ). according to world health organization (who), an estimate of . billion people in countries are living in areas with a high risk of dengue infection (brady et al. ). the recently estimated annual million dengue cases reveals that the dengue disease burden has tripled as compared to previous predictions of to million reported cases without dengue warning signs. nonetheless, , to , patients were hospitalised due to dengue with warning signs (dws+) and severe dengue (sd), and the total annual cost of dengue burden was estimated globally around us$ . billion (giraldo-garcía and castaño-osorio, ; guzman et al. ) . in spite of having paucity of effective vaccines and drugs to expel dengue comprehensively, still it's enduring a big challenge to human (martina et al. ). therefore, next-generation vaccine strategies such as inactivated purified virus, dna or protein-based subunit vaccines, and their fusion chimeras are now going under investigation and some of them even under clinical trials (coller et al. ; danko et al. ; liu et al. ) . recently, dengvaxia (cyd-tdv), a tetravalent live attenuated vaccine has been approved for use in some of the highly dengue endemic areas where the sero-prevalence is higher than % (guy et al. ; pang et al. ). according to abhishek singh rathore and animesh sarker contributed equally to this work. electronic supplementary material the online version of this article (https://doi.org/ . /s - - -y) contains supplementary material, which is available to authorized users. who, it has shown significant efficacy and acceptable safety profile during clinical trials in seropositive individuals; however, it carries a risk of severe dengue infection in seronegative individuals, and also failed to confer subsequent protection in denv- positive people in areas with less exposure of dengue (arredondo-garcía et al. ; durbin et al. ; guy et al. ) . several other vaccine candidates were also formulated based on live-attenuated dengue viruses (e.g. tv , tv ) and inactivated purified virus (e.g. dpiv), of which some have already completed clinical trials and some are currently going under phase ii to iii clinical trials (whitehead et al. ; diaz et al. ). on the other hand, dna or proteinbased alternative subunit vaccines (e.g. tvdv, den- ) and dengue-flavivirus chimeras (e.g. ediii-p k, e-stf , ediii-hbsag) are going either under preclinical study or phase i clinical test (govindarajan et al. ; danko et al. ; castaño-osorio et al., ) . although, still most of them are seemed to be slow immunity booster and require a strong adjuvant and longer immunisation to achieve full protection against each of the dengue serotype, which all make them ill-suited for universal vaccine licence . the dengue envelope (e) protein is composed of three ecto-domains, a membrane-proximal stem and a transmembrane anchor (klein et al. ). various crystal structures have shown that the ecto-domains are arranged into three antiparallel dimer on virus surface in a icosahedral symmetry, where ecto-domain i (ei) is located at the centre holding domain ii (edii) and iii (ediii) in two opposite sites in each of the monomer of a dimeric unit (fibriansah et al. ; kuhn et al. ; modis et al. ). most of the protective antibodies against dengue virus are identified to domain iii, and initially were thought to be a potential subunit vaccine target (murphy and whitehead ; sukupolvi-petty et al. ) . although, recombinant ediii domain injected by plasmid dna or purified from bacterial expression systems was found to be poorly immunogenic, and has shown low protective efficacy in animal model (guzman et al. ). on the other hand, edii has been reported as the dimerisation domain and consists of the most conserved fusion (fu) and bc loop (li et al. ; zhang et al. ). several studies have also shown that the conserved fu loop is highly immunogenic (lai et al. ; cherrier et al. ; smith et al. ) , and induces cross-reactive antibodies, of which some are crosstalk with adjacent bc loop (gallichotte et al., ; cherrier et al. ). moreover, during endocytosis, both of the loops are found to involve in tertiary conformation of membrane fusion (nayak et al. ; sukupolvi-petty et al. ) . therefore, it has been anticipated that both of the loops might have integrated role for boosting cross-neutralizing immunity. in this study, we aim to develop an alternative vaccine by combining fu and bc loop together in a single orf for production of a fusion antigen. although, the recombinant antigen in isolation tends to be poorly immunogenic in vivo; the use of potent immunomodulating compounds, fusion partner or suitable delivery systems improve specific immune response (higgins et al. ; garçon et al. ) . herein, we have expressed the recombinant fubc antigenic protein, optimised its large-scale purification protocol and finally evaluated its protein-specific immune response in balb/c mice. prior to the animal challenge, its secondary structure was checked by cd spectroscopy, and binding specificity was cross-checked with a characterised anti-fusion loop scfv antibody ). all of the male and female balb/c mice were obtained from national institute of nutrition, telangana, hyderabad, india and were maintained in standard light: dark ( : ) cycle with the supplement of adequate standard food, and water was provided from ad libitum. all of the animals were acclimated and randomly distributed into different experimental groups. furthermore, all the in vivo experiments were performed in accordance with the committee for the purpose of control and supervision on experiments on animals (cpcsea) guidelines and were approved by the south asian university institutional animal ethics committee (iaec) that was responsible for the care and use of laboratory animals. the dna sequence of fubc gene was retrieved from fusion (fu) and bc loop of denv serotype envelope protein deposited in the protein data bank (pdb: oan). in order to construct stable and immunogenic protein, highly conserved fusion (fu) and bc loops and their neighbouring residues (amino acid residues to ) were selected by using antigen variability analyser (avana), pblast and multiple sequence alignment tool of ncbi. for convenient expression in bacteria, all of the oligos were codon optimised for e. coli, using codon optimisation tool of integrated dna technology. complete fubc gene was constructed by assembly pcr reactions using four nucleotide long overlapping oligonucleotides. for cloning into pet a expression vector, two unique restriction sites, ecori and xhoi, were also incorporated at the ′ and ′ ends respectively during the final amplification of fubc full-length gene using forward and reverse primers. initially, the full-length fubc gene was cloned into a ta cloning vector using instaclone kit (thermo scientific). positive clones were screened using x-gal bluewhite screening method and digested with ecori and xhoi restriction enzymes. the digested fubc gene was further sub-cloned into a pet a expression vector. the recombinant plasmid (fubc + pet a) was transformed into e. coli xl- gold for cloning, and subsequently into e. coli bl- rosetta (de ) for protein expression. the e. coli bl- (rosetta) cells carrying fubc gene in pet a vector were grown overnight at °c in ml lb broth (luria-bertani medium) containing μg/ml kanamycin (sigma, usa). overnight grown ml primary culture was used to inoculate kanamycin containing ml secondary culture and was further incubated at °c. the incubated secondary culture was induced by . mm isopropyl β-d- thiogalactopyranoside (iptg) when its od reached at around . , and the culture was grown for another h at °c. the cells were harvested by centrifugation at rpm for min. the resulting cell pellet was resuspended in lysis buffer containing mm tris-hcl ph . , mm cacl , with . % triton x- , lysozyme . mg/ml, mm edta and mm pmsf. then, the resuspended cells were kept on a rocker for an hour at room temperature, and sonication was done using % amplitude for five times s on/off pulse. the lysed sample was separated into supernatant and pellet by centrifugation at , rpm for min at °c. finally, fubc expression level in supernatant and pellet were checked on % sds-page. a major fraction of the fubc protein was observed in pellet after a number of efforts made to recover it in a soluble form. then, it was decided to recover soluble protein from pellet fraction. the resulting pellet protein was washed to times with te / ( mm tris ph . and mm edta ) buffer to remove impurities and extra salts. the remaining pellet was re-solubilised using mild denaturing agent such as . m urea and % n-propanol along with pbs buffer (ph . ) and was centrifuged at , rpm for min. the soluble protein fraction was initially purified by using ni-nta agarose beads and was confirmed by western blot using anti-his antibody. however, the purity and yield were not sufficient. therefore, soluble protein fraction was subjected to gel filtration in superose / column by using fast protein liquid chromatography (fplc) for largescale good quality protein production. the column was preequilibrated with pbs ( mm phosphate buffer ph . and mm nacl), and the protein sample was eluted with the same pbs buffer. . ml protein sample was injected in each run by using . ml loop at . ml/min flow rate. the elution profile of injected protein was followed by monitoring uv absorbance at nm on the akta fplc system with u -l uv monitor. different peaks greater than mau were collected in fraction collector and checked using a % sds page. multiple runs of fplc were carried out, and the fraction containing fubc protein was pooled together. finally, fubc was concentrated and desalted by using . ml millipore-amicone filter with a cut-off of kda. concentration of the purified fubc was also measured by using bca protein assay kit. in order to assure the proper folding of purified (> %) fubc protein, secondary structure was analysed by cd spectroscopy at far uv wavelength ranging from to nm. to achieve the best conformational reading, cd spectrum was obtained at the different protein and buffer concentrations, because the cd spectrum of a protein needs to be adequately intense for interpreting the data as the intensity of a cd spectrum directly relies on the protein and buffer concentration (kelly et al. ; miles and wallace ) . the cd spectra of pbs buffer and fubc protein samples at . mg/ml and . mg/ml concentrations (diluted in mm and mm pbs buffer, ph . ) were recorded at far uv spectra ranging from to nm with a step size of nm to bandwidth nm. the measurement was performed at room temperature ( °c), and the uv spectra were recorded for each sample with five scans. the baseline cd spectrum of the buffer was deducted from the spectrum containing the protein to yield the actual fubc protein cd spectrum. the mean residue ellipticity [θ] mrw at wavelength λ was quoted in units of degree cm / dmol, and was calculated as [θ] mrw, λ = mrw × θλ/ × d × c, where θ is the observed ellipticity (degrees) at wavelength λ, d is the path length (cm) and c is the concentration (g/ml). mrw is the mean residual weight for the peptide bond which is given as mrw = m/(n − ); where m is the molecular mass of the polypeptide chain (in da), and n is the number of amino acids in the chain; the number of peptide bonds is n − . complete and incomplete freund's adjuvant (sigma, usa) was used for primary (day ) and booster immunisation (days , and ) respectively. to prepare a primary dose of the immunogen, complete freund's adjuvant (fa) was mixed with equal volume of purified fubc ( μg) in pbs and emulsified by vigorous vortex. similarly, three booster doses of immunogen were prepared with an equal volume of fubc protein ( μg) and incomplete freund's adjuvant. all of the immunogens were prepared according to the protocol "immunization of mice" by maira-litrán, . finally, emulsified fubc immunogens were checked by observing stable droplet on the water surface. healthy balb/c mice ( weeks old, male and female) were randomly distributed into four different groups. each group was immunised with different antigen preparations. out of the four, two groups were immunised with fubc antigen: one group was injected with fubc protein with adjuvant and other was with fubc protein without adjuvant. rest of the two groups were immunised with fu and bc peptide: one group was injected with fu and bc peptides with adjuvant, and other was fu and bc peptide without adjuvant. each of the groups was also subdivided into male and female sets, and with each set, one adjuvant control was used replacing adjuvant with pbs buffer. the mice of each group were subcutaneously immunised with emulsified antigen preparations, first dose (day ) with cfa and three subsequent booster doses (at days , and ) with ifa (according to protocol maira-litrán ). after days of the final booster dose, blood samples were collected by retro-orbital cavity, and all the antisera isolated from blood were stored at − °c for further analysis. recombinant fubc protein-specific igg was measured by indirect elisa. for this, well elisa plates were coated by incubating overnight at °c with fubc protein ( μg/ well) in mm sodium carbonate-bicarbonate buffer ph . . blocking was done for h with % bsa in pbst (pbs plus . % tween ). serum collected from immunised and control mice were incubated for h at °c after making double dilutions in pbst starting from / μl. subsequently, the elisa plate was washed once with pbst, and μl of anti-mouse hrp-conjugated secondary antibody ( : dilution in pbst) was added and incubated for min at room temperature. then, the plate was washed three more times with pbst, and the binding reaction was developed by adding μl/ well opd substrate (prepared in a phosphate-citrate buffer, . m, ph − . plus . % h o ). the reaction was stopped by adding μl of . m h so just after observing an optimum colour, and the absorbance was measured at -nm wavelength using biotek synergy ht microplate reader (winooski, vt). mean absorbance was plotted against anti-sera dilutions, and two-way anova was performed to find a statistically significant difference between two groups. comparison of the immune response in different groups of antisera was made by converting each elisa curve to linear equations (y = mx + c), and computing serum dilutions for a fixed elisa absorbance. the data are expressed as mean ± standard deviation (sd). p values of < . were considered statistically significant. to validate the immune response generated by recombinant fubc, collected mice anti-sera were allowed for spr interaction experiment with a autolab esprit instrument. similar to elisa, here also recombinant fubc protein of denv envelope was immobilised on a gold plate of spr device. for coupling of fubc protein on gold disc, -ethyl- -( dimethylaminopropyl) carbodiimide (edac) and nhydroxysuccinimide (nhs) were applied for activation. nhs activates the carboxymethyl groups by creating a highly reactive succinimide ester on the disc surface, which reacts with amine and other nucleophilic groups on proteins that subsequently help to bind the target protein on activated disc surface. ethanolamine was added to block the remaining activated carboxymethyl group. for the qualitative assay, all of the serum samples (diluted in running buffer) were applied at a flow rate of μl/min over min. in between injections, the surface of the sensor chip was regenerated by injecting m nacl at the same flow rate for s. in addition, surface coupling was done by using buffer ( mm nacl, mm edta, . % surfactant p and mm hepes-naoh) at ph . , and the buffer containing mm phosphate and mm nacl at ph . was used for sample running. initially, a series of serum dilutions were applied as analytes to find an optimum refractive index. it was observed that the refractive index at : dilution was within the detection limit among the series of serial dilutions (fig. a) . therefore, : dilution of different experimental samples was applied as standard for further comparative analysis between fubc immune response with and without freund's adjuvant. serum control (the serum collected before immunisation) sample with the identical dilution was also applied to find the basal (non-specific) immune response of serum. finally, refractive index was then analysed from association and dissociation curve of the spr sensorgram. the fubc synthetic gene sequence was deposited in genbank database with accession number mn . generally, dengue envelope (e) folded into three distinct domains (designated by domain i, ii and iii), membrane proximal stem and a transmembrane anchor (klein et al. ) . throughout these structural element, four highly conserved regions have been identified by in silico sequence analysis, and two of them were found in the domain ii with a very less informational entropy . further studies have revealed that these conserved regions are the part of previously characterised flavivirus fusion (fu) and bc loop (kuhn et al. ) . during the process of endocytosis, this hydrophobic fusion loop remains buried at the dimer interface in the prefusion state and forms cluster into larger hydrophobic surface at one end to form trimer at later state that finally initiates membrane fusion (klein et al. ). in addition to this structural property, fu and bc regions consist some of the most potent antigenic epitopes that were identified by denv neutralising conformation sensitive anti-denv hmabs (goncalvez et al. ; costin et al. ; yamanaka et al. ) . therefore, these two conserved fu and bc loops have been selected to design a recombinant fubc antigenic protein for the development of dengue subunit vaccine. in this study, an alignment of the domain ii amino acid sequences of denv - envelope proteins spanning residues from to was used to find optimum conserved sequence for the development of fubc fusion protein (fig. a) . the structural details of truncated fubc protein compare to whole envelope in dengue and fu, and bc peptides were also analysed by modelling each of their three dimensional structure. it reveals that the presence of three anti-parallel β-strands and one disulphide linkage play a crucial role in preserving the three dimensional conformations of truncated fubc protein same as in original denv envelope protein (fig. b, c) . ironically, due to the absence of anti-parallel β-strands and a single protein frame of fu and bc loops, these two separately expressed peptides fail to form similar three-dimensional conformation as in original denv envelop (fig. c) . the electrostatic and solvent accessible areas of fubc truncated protein are also comparable with original dengue envelope. hence, it is speculated that the recombinant fubc protein would be an alternative vaccine target of whole dengue envelope. fu loop is shown in orange and bc loop is shown in yellow. b in denv envelope, the fu and bc loops are present at the end of domain ii and are linked by the di-sulphide linkage that holds both loops together to form a stable structure that can also work as an epitope. c when the structure of fubc is compared to original denv envelope, it is plausible that the presence of three anti-parallel β-strands present in fubc protein (same as denv protein) plays a crucial role in maintaining the three dimensional conformation of fubc protein as it is in denv protein. another major factor that plays in conformation of fu and bc loops is the disulphide linkage between the two highly conserved loop in the fubc protein that ensures the same conformation of denv envelope as it is shown in b and c, and d due to the lack of anti-parallel β-strands and same protein frame while these peptides are expressed separately, it is unlike to form di-sulphide linkage and fold in the same conformation as reside in denv envelope and fubc proteins for in vitro synthesis of short antigenic protein, a recombinant fubc gene was constructed by assembly pcr using overlapping oligonucleotides, designed from fu and bc loop encoding dna sequences of dengue envelope. therefore, full length of fubc gene was amplified by using and end primers flanked by ecori and xhoi restriction sites. the fubc gene was then cloned into a ta cloning vector, and the clones were screened by colony pcr using fubc gene specific end primers. two positive clones were then confirmed by restriction digestion using ecori and xhoi enzymes (fig. s a-d) . the digested and purified fubc insert gene was further sub-cloned into pet a expression vector, and the positive clones were confirmed by restriction digestion (fig. s b) and sanger sequencing. primarily, the recombinant fubc protein expression was observed only in pellet fraction; there was no such fubc equivalent protein band in supernatant fraction, while they were separated on sds page (fig. s c) . therefore, fubc expression level was checked further by lowering growth temperature and iptg concentration, but no significant change was noticed in supernatant fraction. chaperone-assisted folding system did not help remarkably to express the fubc recombinant protein in the soluble form. therefore, the insoluble pellet fraction of fubc protein was further utilised in mild solubilisation process to recover soluble fubc protein. initially, the recovered soluble fubc protein fraction was purified by using ni-nta agarose beads, and the purified protein band was confirmed by western blot analysis using anti-his tag monoclonal antibody (fig. s ) . although, ni-nta affinity chromatography has not revealed good purification quality and the yield was also not sufficient. furthermore, gel filtration chromatography was used to recover good quality large-scale recombinant protein, and single distinct peak greater than mau was observed at approximately . ml position for all of injected fubc protein samples (fig. a) . after pooling together, the peak fractions were separated on % sds-page, and single bright band was observed at molecular weight . kda (fig. b) . originally, the fu and bc loop of dengue envelope is composed of three anti-parallel β-sheets. and the formation of recombinant fubc secondary structure was characterised here by negative bending at nm and nm wavelength (kumagai et al. ) . the cd spectrum of recombinant fubc protein has showed significantly decreased spectral peaks around and nm that signify the formation of β-sheet (fig. ). in addition, it was noticed that the acquired cd spectrum of fubc at a protein concentration of . mg/ml in mm pbs buffer was adequately intense (fig. ) . therefore, it can be inferred from fig. (lower panel) that fubc at concentrations of . mg/ml and . mg/ml in mm pbs retained the best globular folded state as compared to other conditions. immune response to the purified fubc protein in balb/c mice from indirect elisa, it was observed that the serum igg levels in both male and female groups treated with fubc proteins were significantly higher than those treated with only adjuvant and pbs control (fig. a, b) . in addition, the response with only fubc recombinant protein (without adjuvant) in the female group was also observed higher than their male counterpart group (fig. b) . all of the elisa data were also found statistically significant by two-way anova (table s ) . similarly, by spr assay, high level immune response was also observed for serum injected with recombinant fubc protein along with adjuvant, and without any adjuvant, the response was also higher than the control (serum with pbs only) (fig. b ). in post-dengue infection, most of the circulating antibodies are non-neutralizing and found to be raised against dengue envelope e protein and the prm protein (wahala and de silva ) . due to the absence of highly specific neutralising antibodies in secondary infections, cross-reactive nonneutralising antibodies usually enhance the dengue severity (de alwis et al. ). according to the revised dengue case classification (denco), to % of secondary infections with another serotype causes life threating dengue with warning signs (dws+) and severe dengue (sd). it was reported that serotype-cross-reactive non-neutralising antibodies enhance the entry of dengue genome into fc receptor-bearing monocyte cells and promote disease severity by a process known as antibody-depended enhancement (ade) flipse et al. ) . that means nature of the antibody response to denv is most likely to play a major role in defining disease outcome. therefore, it is predictable that antibodies that recognise specific neutralizing epitopes help in virus clearance and reduce symptoms; however, antibodies that recognise non-neutralising epitopes lead to more severe forms of disease like dws+/sd. hence, there is an urgent need for an advanced vaccine which could generate highly specific and cross-neutralising antibody (costin et al. ). recently, several attempts have been made towards the development of a potent dengue vaccine. the most advanced candidate, dengvaxia (cyd-tdv), was licenced in some of the dengue endemic countries (imai and ferguson ) . however, it was revealed risky in children or naïve dengue patient with severe infection as they were vaccinated (arredondo-garcía et al. ) . therefore, considering safety issues, production of recombinant subunit vaccine with efficient immune protective properties is looking attractive (govindarajan et al. ) . meanwhile, an admixture of four live attenuated recombinant dengue vaccine tv /tv have completed phase iii clinical trial and licenced to several manufacturers including butantan, vabiotech and merk (whitehead et al. ) . moreover, some recombinant tetravalent vaccines (e.g. den- e, tvdv) expressing the prm and e genes of each of the four denv serotypes from plasmid dna, have already completed phase i clinical trial (govindarajan et al. ; danko et al. ). it has also been shown that the recombinant dengue envelope domain iii can inhibit dengue infectivity, and induce dengue-neutralising immunoglobulin in mice (hermida et al. ) . in addition, a number of antibodies which were raised in mice and fig. purification of recombinant fubc protein by size exclusion chromatography. a fplc chromatogram of fubc protein expressed in e. coli inclusion bodies. three distinct peaks at . ml, . ml and . ml position were collected separately. b gel image of unpurified and purified fubc protein sample separated on % sds-page. lanes and represent the un-purified sample and lanes and represent purified protein sample collected from peak at position . ml fraction chimpanzees against dengue domain ii fusion loop were found cross-reactive to other flaviviruses (goncalvez et al. ; stiasny et al. ) . although, the hmabs specific to dengue domain ii fusion loop were not found equally effective on other flaviviruses including wnv, yfv and other denv strain (costin et al. ) . therefore, it was conjectured that adjacent to the fusion loop, additional contact residues of original domain ii surface structure might be involved to raise cross-neutralizing antibodies. several studies have identified that adjacent to the fusion loop, another similar loop exists in most of the flaviviral (e.g. wnv, tbe, jev, yfv) envelope (fibriansah and lok, ; li et al. ) . our previous bioinformatics studies have also shown that the envelope of all denv serotype consists of four conserved regions (> %), and two of them were found in domain ii, in which one is fusion loop (fu) with more than % conservation and another is its nearby bc loop (fig. a ) . therefore, these two highly conserved loops were targeted in this study for the development of a fusion protein. by using reverse dna technology, we have previously shown the structural integrity ( fig. b-d) and binding specificity of fubc protein with an anti-fusion loop scfv antibody, derived from dengue and wnv-specific mab e (rodenhuis-zybert et al. ; rathore et al. ) . the very early challenge of subunit vaccine design is to produce large quantities of functional protein. hereby, we have successfully optimised the expression and purification methods for the production of large-scale high-quality recombinant fubc protein. the e. coli expression system was used here, and the recombinant protein was found to be expressed in higher quantity, though most of the protein was extracted initially in pellet fraction. no significant change was noticed by altering regular growth temperature and inducer concentration. therefore, we have utilised the mild-solubilisation methodology to recover soluble fubc protein from insoluble pellet protein. for purification, firstly, we have used convenient ni- nta affinity purification method, and by adjusting the lysis and elution buffer, we were able to purify recombinant fubc protein to some extent but the quality and quantity was not sufficient. finally, to scale up the quality protein production, size exclusion chromatography was used in akta fplc system. to confirm the recombinant protein expression, simple western blot was performed by using anti-his monoclonal antibody as primary. furthermore, in vitro structural integrity and functional authenticity of the experimental fubc protein were also verified by cd spectroscopy (fig. ) and indirect elisa (fig. ) . however, most of the hmabs identified earlier from dengue patient serum were predominantly cross-reactive and recognise epitopes containing highly conserved residues at the fu loop of domain ii (lai et al. ). however, having such sequence homology in fu loop of all flaviviruses, these hmabs are non-neutralizing against heterologous serotypes (lai et al. ; smith et al. ) and thus was found to be responsible for ade in animals (goncalvez et al. ). previously, it has also been stated that this scenario might happen due to the cryptic nature and poor accessibility of the fusion loop epitope on the surface of the mature virion (lai et al. ; cherrier et al. ). however, in addition to the partially exposed fusion loop on immature flaviviruses, some neutralizing hmabs were also found to bind with bc loop (costin et al. ) . therefore, it is suggestive that along with conserved fusion loop, adjacent less conserved linker sequence and bc loop might be required to be exposed as a whole to generate effective cross-neutralizing immunity. our current mouse immunisation experiment also supports this idea as it was observed that the antibody response to full length fubc protein was stronger than the response elicited by the fu and bc peptides in separate (figs. and ) . moreover, without freund's adjuvant, recombinant fubc protein was found immunogenic, and the response in female mice was stronger than in male (figs. and ) . these observations were also symmetrical with other studies where it was stated that female mice have a better immune response due to the higher number of leukocytes occupying the naive peritoneal and pleural cavities, and also have a number of t-and blymphocytes as well as macrophages (terres et al. ; weinstein et al. ; klein et al. ) . therefore, we have used female mice serum samples for further qualitative elisa and spr test to quantify immune response of recombinant fubc protein. it was noticed that igg antibody response to fu, and bc peptides were significantly lower than the response observed to fubc recombinant protein, and the immune response achieved to fu and bc peptide with freund's adjuvant was significantly higher as compared to the peptides without adjuvant and serum control (fig. ) . two-way anova was also performed to analyse the significance of igg immune responses between fubc + fa and fubc without fa; fu, bc peptide + fa and fu, bc peptide without fa. from f values and p values obtained by anova test, it can be interpreted that the difference between fubc protein with freund's adjuvant and fu, bc peptide with freund's adjuvant were significantly higher than protein and peptides injected without freund's adjuvant (table s ) . also, the igg response to recombinant fubc protein was recorded better than fu and bc peptides in both cases either with or without adjuvant. in addition, the p value of igg immune response between fubc protein with fa and without fa was found insignificant (p = . > . ) that suggests that fubc without any adjuvant is sufficient to elicit significant immune response (table s ) . consistently, it was observed by spr experiment that the refractive index of fubc + fa was significantly higher than fubc + without fa and serum control (fig. b) . since, the newly developed fubc recombinant protein expresses vastly in e. coli and induces significant immune response in mice, it might be a good agent of dengue subunit vaccine development. due to the presence of highly conserved fusion loop epitope, overlapping less conserved linker sequences and also bc epitope, this fusion protein might induce cross-neutralisation immunity against heterologous dengue serotypes. though, still a lot of investigations, like the evaluation of a proper adjuvant to induce robust immune response, the memory immune response generated against the fubc protein both by humoral and cell-mediated immunity, and whether this memory response will provide protection against a secondary encounter with denv, are required before going clinical trial. nevertheless, the production process and immune response of this fusion protein would provide new insight for the development of dengue subunit vaccine. four-year safety follow-up of the tetravalent dengue vaccine efficacy randomized controlled trials in asia and latin america the global distribution and burden of dengue refining the global spatial limits of dengue virus transmission by evidencebased consensus current status of vaccines against dengue virus. dengue fever -a resilient threat in the face of innovation intechopen book chapter structural basis for the preferential recognition of immature flaviviruses by a fusion-loop antibody the development of recombinant subunit envelope-based vaccines to protect against dengue virus induced disease mechanistic study of broadly neutralizing human monoclonal antibodies against dengue virus that target the fusion loop development of dengue dna vaccines safety and immunogenicity of a tetravalent dengue dna vaccine administered with a cationic lipid-based adjuvant in a phase clinical trial dengue viruses are enhanced by distinct populations of serotype cross-reactive antibodies in human immune sera phase i randomized study of a tetravalent dengue purified inactivated vaccine in healthy adults from puerto rico development and clinical evaluation of multiple investigational monovalent denv vaccines to identify components for inclusion in a live attenuated tetravalent denv vaccine dengue virus cryo-em structure of an antibody that neutralizes dengue virus type by locking e protein dimers the development of therapeutic antibodies against dengue virus molecular mechanisms involved in antibody-dependent enhancement of dengue virus infection in humans rs a new quaternary structure epitope on dengue virus serotype is the target of durable type-specific neutralizing antibodies development of a dengue vaccine and its use in pregnant women monoclonal antibody-mediated enhancement of dengue virus infection in vitro and in vivo and strategies for prevention epitope determinants of a chimpanzee fab antibody that efficiently cross-neutralizes dengue type and type viruses map to inside and in close proximity to fusion loop of the dengue type virus envelope glycoprotein preclinical development of a dengue tetravalent recombinant subunit vaccine: immunogenicity and protective efficacy in nonhuman primates from research to phase iii: preclinical, industrial and clinical development of the sanofi pasteur tetravalent dengue vaccine development of the sanofi pasteur tetravalent dengue vaccine: one more step forward dengue infection domain iii of the envelope protein as a dengue vaccine target a recombinant fusion protein containing the domain iii of the dengue- envelope protein is immunogenic and protective in nonhuman primates immunostimulatory dna as a vaccine adjuvant targeting vaccinations for the licensed dengue vaccine: considerations for serosurvey design how to study proteins by circular dichroism structure of a dengue virus envelope protein late-stage fusion intermediate sex-based differences in immune function and responses to vaccination structure of dengue virus: implications for flavivirus organization, maturation, and fusion going deep into protein secondary structure with synchrotron radiation circular dichroism spectroscopy antibodies to envelope glycoprotein of dengue virus during the natural course of infection are predominantly cross-reactive and recognize epitopes containing highly conserved residues at the fusion loop of domain ii potent neutralizing antibodies elicited by dengue vaccine in rhesus macaque target diverse epitopes immunogenicity and efficacy of flagellin-envelope fusion dengue vaccines in mice and monkeys immunization of mice dengue virus pathogenesis: an integrated view synchrotron radiation circular dichroism spectroscopy of proteins and applications in structural and functional genomics structure of the dengue virus envelope protein after membrane fusion immune response to dengue virus and prospects for a vaccine crystal structure of dengue virus type envelope protein in the postfusion conformation and its implications for membrane fusion dengue vaccination: a more balanced approach is needed designing antibody against highly conserved region of dengue envelope protein by in silico screening of scfv mutant library a fusionloop antibody enhances the infectious properties of immature flavivirus particles evaluation of scfv protein recovery from e. coli by in vitro refolding and mild solubilization process the potent and broadly neutralizing human dengue virus-specific monoclonal antibody c reveals a unique cross-reactive epitope on the bc loop of domain ii of the envelope protein cryptic properties of a cluster of dominant flavivirus cross-reactive antigenic sites structure and function analysis of therapeutic monoclonal antibodies against dengue virus type a quantitative difference in the immune response between male and female mice the human antibody response to dengue virus infection sex-associated differences in the regulation of immune responses controlled by the mhc of the mouse in a randomized trial, the live attenuated tetravalent dengue vaccine tv is well-tolerated and highly immunogenic insubjects with flavivirus exposure prior to vaccination a mouse monoclonal antibody against dengue virus type mochizuki strain targeting envelope protein domain ii and displaying strongly neutralizing but not enhancing activity publisher's note springer nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations key: cord- -qaogpxy authors: too, issac horng khit; bonne, isabelle; tan, eng lee; chu, justin jang hann; alonso, sylvie title: prohibitin plays a critical role in enterovirus neuropathogenesis date: - - journal: plos pathog doi: . /journal.ppat. sha: doc_id: cord_uid: qaogpxy a close relative of poliovirus, enterovirus (ev ) is regarded as an important neurotropic virus of serious public health concern. ev causes hand, foot and mouth disease and has been associated with neurological complications in young children. our limited understanding of the mechanisms involved in its neuropathogenesis has hampered the development of effective therapeutic options. here, using a two-dimensional proteomics approach combined with mass spectrometry, we have identified a unique panel of host proteins that were differentially and dynamically modulated during ev infection of motor-neuron nsc- cells, which are found at the neuromuscular junctions where ev is believed to enter the central nervous system. meta-analysis with previously published proteomics studies in neuroblastoma or muscle cell lines revealed minimal overlapping which suggests unique host-pathogen interactions in nsc- cells. among the candidate proteins, we focused our attention on prohibitin (phb), a protein that is involved in multiple cellular functions and the target of anti-cancer drug rocaglamide (roc-a). we demonstrated that cell surface-expressed phb is involved in ev entry into neuronal cells specifically, while membrane-bound mitochondrial phb associates with the virus replication complex and facilitates viral replication. furthermore, roc-a treatment of ev -infected neuronal cells reduced significantly virus yields. however, the inhibitory effect of roc-a on phb in nsc- cells was not through blocking the craf/mek/erk pathway as previously reported. instead, roc-a treated nsc- cells had lower mitochondria-associated phb and lower atp levels that correlated with impaired mitochondria integrity. in vivo, ev -infected mice treated with roc-a survived longer than the vehicle-treated animals and had significantly lower virus loads in their spinal cord and brain, whereas virus titers in their limb muscles were comparable to controls. together, this study uncovers phb as the first host factor that is specifically involved in ev neuropathogenesis and a potential drug target to limit neurological complications. a a a a a enterovirus (ev ) is a non-enveloped, positive-sense, single-stranded rna virus, and causes hand, foot and mouth disease (hfmd) in humans. being a close relative of poliovirus, ev is deemed as an important neurotropic virus worldwide [ ] . since its first isolation in california in , several major outbreaks have been reported in china, singapore, korea, and japan [ ] [ ] [ ] [ ] . although the clinical manifestations are generally mild and self-limiting, including hfmd and herpangina, severe neurological complications have been consistently reported with ev -associated infections, causing brainstem encephalitis, acute flaccid paralysis, pulmonary edema and cardiopulmonary failure [ , ] . in addition, some patients who have recovered from severe disease have been reported to develop long term neurologic and psychiatric disorders [ ] . there are currently no effective prophylactic or therapeutic agents against ev . although several vaccines have completed phase iii clinical trials [ ] , regulatory issues may limit their widespread utilization. in addition, as these vaccine candidates consist of inactivated virus from a single ev genotype (c ), cross-protection against other genotypes may be limited [ , ] . the increasing awareness of life-threatening ev infections has boosted research in recent years to further understand virus-host interactions and develop effective antiviral strategies [ ] [ ] [ ] [ ] [ ] [ ] . however, the neuropathogenesis of ev is still poorly understood. infection occurs when the virus enters the body upon ingestion and/or inhalation. the virus multiplies initially in the alimentary tract mucosa and rapidly reaches the deep cervical and mesenteric lymph nodes via the tonsils and peyer's patches [ ] . after a short transient systemic dissemination phase, the virus accumulates and actively replicates in muscles where it is believed to infect motor neurons at the neuromuscular junctions. experimental evidence supports that ev migrates to the brainstem via retrograde axonal transport as previously described for its close relative poliovirus [ , , [ ] [ ] [ ] . however, the molecular mechanisms involved in ev infection of motor neurons to access the central nervous system (cns) have not been studied. indeed in vitro studies aiming at studying ev neurovirulence have employed neuroblastoma cell lines that may not reflect accurately infection in motor neurons. to address this gap, we have recently reported a novel in vitro model of ev infection in the murine motor neuron cell line nsc- [ ] . nsc- cells originate from the fusion between murine neuroblastoma and spinal cord cells, and possess motor neuron-like properties, such as generation of action potentials and production of acetylcholine [ ] , therefore making it a relevant model to study the mechanism of ev neuropathogenesis. we demonstrated that nsc- cells are permissive to ev clinical isolates and found that, unlike any other mammalian cell types so far reported, ev -infected nsc- cells do not undergo apoptosis and lysis. instead we showed that the virus exits the cells via a non-lytic mode, a phenomenon that has also been previously described for poliovirus [ , , ] . these unique features thus suggested that the infection cycle of ev in nsc- cells involves host pathways and partners that are likely to be different from those previously identified in other mammalian cell types such as muscle cells and neuroblastoma cells. in this work, using a proteomics approach coupled with mass spectrometry, we have identified a panel of cellular proteins that were dynamically regulated during ev infection of nsc- cells. among the host protein candidates that were up-regulated, we focused our attention on prohibitin (phb) and characterized its role during ev infection in nsc- cells. we also demonstrated the importance of phb during ev infection in a symptomatic mouse model of ev infection. to identify the host proteins involved in ev infection cycle in nsc- cells, a de proteomic approach was undertaken. nsc- cells were infected with ev at m.o.i. , and the cell lysates were harvested at , , and hours for downstream proteomic analysis in which a range of - spots were resolved. by using pdquest -d analysis software (biorad), a total of protein spots (fig a) that displayed at least . -fold differential expression (p< . , two-tailed student's t-test) compared to uninfected controls, were excised for in-gel digestion and maldi-tof ms analysis. the peptide fingerprints were then searched against ncbinr mouse genome database for protein identification using mascot program (http://www.matrixscience.com/). the protein candidates were then categorized based on their primary functional class indicated in uni-protkb database (s table) . to illustrate the dynamic regulation of host proteins during the viral infection, a heat map was generated using multiexperiment viewer (mev), with the distance between proteins represented by euclidean average linkage clustering (fig b) . this clustering analysis revealed that proteins that were up-regulated (fig c) during infection are mainly involved in motility ( %) and catalytic processes ( %), while proteins that participate in rna processing ( %) and energy biosynthesis ( %) generally displayed a down-regulation trend during the course of infection (fig d) . functional interactions among the selected host proteins were analyzed by string (search tool for the retrieval of interacting genes/proteins). this platform allows establish proteinprotein interactions based on published literature, online databases, predicted functional associations using genomic information or observations made with other organisms [ ] . the protein network obtained was significantly enriched with the p value of less than . , suggesting that the interactions are highly associated and unbiased (fig ; s table) . furthermore, some of the selected host proteins appear to have strong associations among each other as indicated by the thickness of connecting lines which reflects the confidence level of the interactions [ ] . using go annotations for biological processes, molecular functions, cellular compartments and protein classes, the protein candidates were localized within the cytoplasm ( . %), organelles ( . %) and macromolecular complexes ( . %) (s a fig). in addition, they were found to be involved in various biological processes including mitochondrial biogenesis, proteolytic activity, cytoskeletal machinery and rna processing (s a fig) , consistent with the protein clusters observed in the string network (fig ) . finally, molecular function analysis indicates that majority of these proteins contribute to nucleic acid binding transcription factor activity ( . %), structural molecule activity ( %) or binding ( . %) (s a fig). a meta-analysis with other selected proteomic studies of ev -infected muscle and neuronal cells [ ] [ ] [ ] , [ ] [ ] [ ] revealed minimal overlap between nsc- , rd and other neuronal cell types with protein candidates only, namely actb, tubb, pdia and eno , suggesting that the host-pathogen interactions in nsc- cells are unique (s b fig). the limited overlap may also be partly explained by differential proteomics approaches. act and tubb are involved in maintaining cytoskeletal structure, and they are found highly modulated during viral infection to facilitate virus internalization and transportation [ ] [ ] [ ] . eno has been shown previously to interact with cytoskeletal proteins in intermediate filaments framework rearrangement [ ] . on the other hand, pdi functions in catalyzing reduction and oxidation processes and protein folding [ ] . it has also been demonstrated to be involved in humoral immune response [ ] or viral replication (denv) [ ] and entry (hiv) [ ] . not surprisingly, greater overlap was seen between motor-neuron nsc- and other neuronal cells ( shared hits) than between nsc- and rd cells ( shared hits). importantly, neuro-specific proteins such as prph and uchl were only identified from profiling studies in nsc- and other neuronal cells, thus validating our de proteomic approach. seven protein candidates namely, alpha-enolase (eno ), dep domain-containing mtorinteracting protein (deptor), peripherin (prph), phosphatidylethanolamine-binding protein (pebp ), prohibitin (phb), stomatin-like protein (stoml ) and protein disulfideisomerase a (pdia ) were selected for validation of the proteomic findings by gene knockdown. these host proteins have been previously shown to be associated with various steps in the life cycle of viruses, such as entry [ ] [ ] [ ] [ ] and replication [ , , ] , or to be involved in autophagy [ ] [ ] [ ] and axonal transport [ ] [ ] [ ] . silencing of each selected gene target was achieved by reverse transfecting the on-targetplus sirna smartpool into nsc- cells, prior to viral infection. sirna smartpools consist of four highly potent gene-specific sirna molecules which have been modified to minimize off-target activity and enhance gene specificity [ ] . cytotoxicity of the sirnas smartpools was first established. apart from the sirna pool targeting pdia , no significant cytotoxicity was observed with the other sirna pools at and nm with cell viabilities greater than the % viability threshold (fig a) . the pdia -specific sirna pool concentrations were lowered to and nm to avoid cytotoxicity (fig a) . virus titers in the culture supernatants of sirna-transfected cells were then determined at h.p.i. results indicated that silencing of stoml , prph, phb and deptor led to significantly lower virus titers, whereas pdia-, pebp -and eno -knocked down resulted in increased viral titers in the supernatants of ev -infected nsc- cells compared to control (fig b) . importantly, both trends were dose-dependent. therefore, these results validate the d-proteomics approach as a powerful way to identify host proteins that play a role during ev infection in nsc- cells. prohibitins belong to a highly conserved protein family present in unicellular and multicellular eukaryotes [ ] . prohibitin (phb; bap- ) and prohibitin (phb , rea, bap- ) are two highly homologous members of this family and are ubiquitously expressed in multiple cellular compartments including the mitochondria, nucleus, and the plasma membrane. prohibitins have been involved in multiple cellular functions including cell proliferation and maintenance of the functional integrity of the mitochondria [ ] . in addition, phb specifically has been previously reported to be involved in the entry step of alphavirus chikungunya (chikv) [ ] , and flaviviruses dengue (denv) [ ] and hepatitis c (hcv) [ ] , and to interact with envelope proteins from white spot syndrome virus to prevent infection [ ] . however, there has been no report so far on the role of phb during ev infection. first, modulation of phb expression during ev infection in nsc- cells was confirmed by western blot and showed an overall up-regulation of phb during the course of infection compared to uninfected control (s fig). next, the impact of phb gene silencing on virus production was further analyzed using a wider range of sirna pool concentrations including , , and nm. efficacy of the gene silencing was assessed by western blot and showed a dose-dependent decrease of phb expression in nsc- cell lysates (fig c) . this dose-dependent phb knockdown correlated well with a dose-dependent reduction in the viral titers measured in the culture supernatant (fig d) , therefore supporting the role for phb in ev infection cycle. to address the possibility of false positive or off-target effects of the sirna pool, phb gene silencing was performed with the individual sirnas species from the pool. western blot showed that each individual sirna was capable of silencing phb expression significantly (s a . the viral titers in the culture supernatants were determined by plaque assay at h.p.i.. cell viability of the transfected cells was assessed using alamarblue assay. statistical analysis was performed using two-tailed student's t-test ( Ãà p< . , ÃÃà p< . ). relative band quantification (below western blot) was determined by imagej, by normalizing to loading control, β-actin. error bars represent mean ± standard deviation. one representative of two biological repeats is shown. to determine if phb mediates entry of ev into nsc- cells, a competition assay was performed using commercially available anti-phb antibody. incubation of nsc- cells with anti-phb antibody prior to infection led to significant reduction of the viral titer in a dosedependent manner (fig a) . to demonstrate a physical interaction between cell surfaceexpressed phb and ev , a proximity ligation assay (pla) was performed. in this assay, phb and ev are recognized by specific primary antibodies raised in two different species, which are in turn recognized by species-specific secondary antibodies conjugated to a probe and a target, respectively. should ev and phb be in close proximity, the probe and the target ligate, amplify and result in emission of a fluorescent signal. scarb- which has been previously demonstrated as the main receptor for ev in rd cells [ ] was used as positive control and a red fluorescent signal was readily detected in ev -infected rd cells (fig b) . a positive signal was also detected with nsc- cells incubated with anti-phb and anti-ev antibodies thus supporting the close proximity between ev virus particles and surfaceexpressed phb (fig b) . in contrast, and expectedly, no signal was detected with mouse scarb- (mscarb ) and ev antibodies (fig b) , since we have shown previously that entry of ev into nsc- cells is not mediated by mscarb- [ ] . the physical interaction between ev and cell surface-expressed phb was further assessed by performing a co-immunoprecipitation experiment. nsc- cells were incubated with ev for hours at ˚c to allow viral adsorption onto the cell surface but no internalization. the total cell lysate was obtained and a pulldown was carried out using antibody specific to phb or an isotype igg antibody control. the immunoprecipitates were then analyzed by western blot using anti-ev primary antibody. a discrete band at the expected size was obtained when pulldown was performed with the anti-phb antibody whereas no band was seen when pulldown was done with the igg isotype (fig c) . these findings thus support that ev physically interacts with cell surface-expressed phb, and suggest that phb may serve as a receptor for ev entry into nsc- cells. in addition to its association to the plasma membrane at the cell surface, phb is also present intracellularly [ ] . to study the role of intracellular phb in ev infection cycle, phbknocked down nsc- cells were transfected with ev rna genome and the viral titers were determined at , , and hours post-transfection, in order to assess virus production within a single cell infection cycle. transfection of viral genome was meant to bypass the virus entry step which we have shown involves cell surface-expressed phb. no viral titer was obtained at h.p.t. in the phb-knocked down and control cells (fig a) . from h.p.t onwards the viral titers detected in the culture supernatant from phb-knocked down cells were consistently lower than those measured in sintc or non-treated cells (fig a) . this result thus demonstrates that phb plays a role in the intracellular virus infection cycle. to further confirm this hypothesis, a luciferase ev (lucev ) replicon transfection assay was performed. in this replicon, the viral structural genes have been replaced with a luciferaseencoding gene while the other parts of the viral genome are retained (fig b) . upon transfection, the replicon undergoes a single replication cycle with no production of virus progeny since it is deficient in viral structural proteins. the luminescence measured from the cell culture is proportional to the amount of luciferase produced inside the cell thereby reflecting the replication activity of the replicon. here, a significant reduction in the luminescence signal was observed in phb-knocked down nsc- cells compared to sintc-treated and nontreated cells (fig c) . thus, together the data support that intracellular phb is involved in ev viral replication. ) were incubated with ev at ˚c for hours to allow viral adsorption, prior to pla staining. scarb -stained nsc- and rd cells served as negative and positive controls, respectively. scale bar represents μm. a representative experiment of two independent repeats is shown. (c) co-immunoprecipitation of ev and surface-expressed phb. ev and nsc- cells were co-incubated for hours at ˚c. the cell lysate was pulled down with anti-phb or igg isotype control antibodies prior to immunoblotting using anti-vp primary antibody. mock-infected cells and igg isotype served as control. a representative experiment of two independent repeats is shown. ip, immunoprecipitation; ib, immunoblot. to further investigate the role of intracellular phb during ev infection cycle, immunostaining was performed on ev -infected nsc- cells probing for phb, ev capsid proteins vp /vp and the viral replication intermediate dsrna. co-localization between phb and dsrna, and between phb and ev capsid proteins was readily observed (fig a) , supporting that intracellular phb could be involved in the viral replication and/or assembly processes. to further study the role of intracellular phb in viral replication, co-immunoprecipitation was carried out using antibody against phb. pull down with anti-phb antibody followed by western blot using anti-ev d/ cd antibody led to the detection of a kd band that corresponds to the ev cd protein complex and a kda band ( d polymerase) which comigrated with igg heavy chain (fig b) . taken together, these data support a physical interaction between intracellular phb and ev non-structural proteins d and cd, indicating that intracellular phb is likely involved in viral replication. previous studies have shown that the main replication sites of picornavirus are located at the golgi apparatus and endoplasmic reticulum (er) [ ] . we have demonstrated that intracellular phb co-localizes with dsrna and is closely associated with the ev d polymerase. given that intracellular phb is abundantly and mainly expressed on mitochondria [ ] , we speculated that in nsc- cells mitochondria may be exploited by ev as replication site. consistently, co-localization of phb and ev with mitochondria was observed by ifa (fig a and b) . furthermore, co-localization of phb, dsrna and mitochondria was also readily detected, thus indicating that mitochondrial phb is associated with the viral replication (fig c) . to exclude the possibility that the replication complexes detected were actually associated to the er, which is in close proximity to mitochondria, the mitochondrial fraction was prepared from ev -infected nsc- cells and western blot analysis revealed the presence of viral capsid protein vp ( kd), d ( kd) and cd ( kd) proteins (fig d) . furthermore, the mitochondrial fraction was shown to be free of cytoplasmic contamination, as evidenced by the presence of mitochondrial marker (atpb, kd) and lack of er marker (calreticulin, kd). similar observation was made with the mitochondrial fraction prepared from ev -infected rd cells (fig d) , suggesting that ev is able to exploit various cellular organelles as replication scaffolds in various mammalian cell types from different species. this finding is consistent with a previous study where ev vp was found to be associated with mitochondria in hela cells [ ] . to further support the close proximity and likely interactions between the viral replication complexes and mitochondria, transmission electron microscopy was performed on ev -infected nsc- cells. clustering of mitochondria surrounding viral replication complexes could be seen (electron-dense like structures) in the infected cells, and examination at a higher magnification indicated a close association between mitochondria membrane and virus complex/virus particle (fig e) . collectively, the data strongly indicate that mitochondria in nsc- cells are exploited by ev as a replication scaffold and that mitochondria-associated phb plays a role in this process. recent studies have shown that phb activity is inhibited by a group of phytochemicals called rocaglamides, which are derived from the traditional chinese medicinal plants aglaia [ , [ ] [ ] [ ] . we therefore investigated whether roc-a could interfere with ev infection cycle by blocking phb activity in nsc- cells. incubation of roc-a with virus prior to nsc- cell infection (co-treatment) did not result in any significant reduction in viral titer (s a fig) . when cells were pre-treated with roc-a prior to ev infection (pre-treatment), less than log pfu/ml of decrease in viral titer was observed at the highest drug concentration only ( nm) (s b fig). in contrast, a dose-dependent decrease in the viral titer was seen when roc-a treatment was applied after infection (post-treatment) at concentrations ranging between - nm (fig a) . western blot analysis of the cell lysates further confirmed the dose-dependent reduction of intracellular viral capsid protein and phb (fig b) . taken together, the data suggest that the antiviral effect of roc-a on ev -infected nsc- cells targets the viral replication step and not the entry step, in contrast to a previous study with hcv [ ] . prior studies focusing on cancer have reported that the mechanism by which roc-a targets and inhibits phb activity involves blocking the craf/mek/erk pathway [ , ] , and this was also described in the hcv study [ ] . to investigate whether the craf/mek/erk to decipher the mode of action of roc-a in nsc- cells, immunostaining of roc-a treated nsc- cells was performed. decreased signals for phb and mitochondria were observed with increasing concentrations of roc-a (fig c) , suggesting that roc-a might affect mitochondrial integrity. we thus assessed the mitotoxicity and cytotoxicity of roc-a using the mitochondrial toxglo assay (promega). while cytotoxicity remained generally minimal over the range of roc-a concentrations tested, the intracellular atp levels were significantly reduced in a dose-dependent manner, thus indicating functional impairment of the mitochondria in roc-a-treated nsc- cells (fig d) . consistently, using the membrane-permeant jc- dye as an indicator of mitochondrial membrane potential, roc-a-treated nsc- cells displayed marked and dose-dependent mitochondrial depolarization as evidenced by the decrease of red fluorescent j-aggregates, compared to untreated cells (s fig). together, these observations suggest that roc-a treatment in nsc- cells results in reduced levels of phb, leading to mitochondrial destabilization and lower atp production. one could thus speculate that the lack of intact mitochondria and reduced intracellular atp levels might eventually impact negatively on ev replication efficacy. the role of phb in ev infection cycle was also studied in human muscle (rd) and neuronal (sk-n-sh) cell lines. as human (gi ) and murine (gi ) phb display high similarity in their amino acid composition, most of the anti-phb antibodies commercially available demonstrate good cross reactivity with cell lines of both species. we first showed by flow cytometry comparable levels of surface expression of phb on rd, sk-n-sh and nsc- cells (s fig). however, both phb gene silencing and phb receptor blocking experiments performed in human muscle rd cells did not impact the viral titer (s a and s b fig) . on the contrary, phb silencing in the human neuroblastoma cells sk-n-sh led to a significant dosedependent reduction in viral titer in the culture supernatant (fig a) . in addition, reduced virus titers were observed with sk-n-sh cells pre-treated with anti-phb antibodies, thus supporting that cell surface-expressed phb is involved in ev entry into this human neuroblastoma cell line (fig b) . to investigate if intracellular phb is also involved in viral replication in sk-n-sh cells, transfection of lucev replicon into phb-silenced sk-n-sh cells was performed. results showed a significant reduction in the luminescence signal compared to controls (fig c) . finally, the effectiveness of roc-a treatment in ev -infected sk-n-sh cells was also assessed. similar to our observations with nsc- cells, a significant dose-dependent decline in viral titers was observed (fig d) . taken together, these findings thus strongly indicate the specific involvement of phb in both viral entry and replication of ev in neuronal cells from both human and murine origins. concentrations of roc-a for hours before assessment of cytotoxicity (fluorescence) and mitotoxicity (luminescence) using mitochondrial toxglo assay. statistical analysis was performed using one-way anova with dunnett's post-test ( à , p< . ; Ãà , p< . ; ÃÃà , p< . ; ÃÃÃà , p< . ). one representative from two independent experiments is shown. https://doi.org/ . /journal.ppat. .g i. the culture supernatant was collected for viral titer determination by plaque assay, and the cell lysate was harvested for western blot analysis. statistical analysis was performed using one-way anova with dunnett's post-test ( Ãà , p< . ). relative band quantification (below western blot) was determined by imagej, by normalizing to loading control, β-actin. error bars represent mean ± standard deviation. cellular cytotoxicity was assessed using alamarblue assay. one representative from two independent experiments is shown. https://doi.org/ . /journal.ppat. .g the role of phb was further investigated in vivo, using the mouse model of ev infection that we established previously where -week old ag mice (deficient in type i&ii ifn pathways) infected with ev display progressive limb paralysis and spatio-temporal virus accumulation in the limb muscles, spinal cord and brainstem [ ] . here, immunohistochemical analysis showed that phb was readily detected in the limb muscles, brainstem and spinal cord at day p.i. (fig a) . furthermore, some co-localization with ev was observed (fig a) . next, the in vivo anti-ev efficacy of roc-a was assessed by treating therapeutically ev -infected mice with roc-a at day and p.i. the development of clinical manifestations was clearly delayed in the roc-a-treated mouse group which resulted in increased survival time compared to the untreated or vehicle-treated control groups (fig b) . in addition, the viral loads in limb muscles, spinal cord and brain in both roc-a-treated and vehicle-treated mice were determined. comparable viral loads were detected in the limb muscles from both groups (fig c) . in contrast, viral titers in the spinal cord and brain from the roc-a-treated animals were significantly lower compared to the vehicle-treated group (fig c) , thus supporting that roc-a treatment specifically impairs ev neuropathogenesis. these findings correlate well with our in vitro data showing that the role of phb during ev infection cycle is specific to neuronal cells. together, the in vivo data support that phb plays a critical role in ev neurovirulence, and that roc-a represents a potential therapeutic strategy to limit ev neuropathogenesis, thereby minimizing neurological manifestations and complications. the re-emergence of neurotropic enteroviruses in recent years has motivated investigations into ev transmission in the neuronal system. understanding the interplay between virus and host proteins is likely to result in the identification of potential novel drug targets and development of novel antiviral strategies. here, using a proteomics approach, we have identified a panel of host factors that displayed dynamic regulation during the course of ev infection in the motor neuron nsc- cells. the host protein candidates are mainly involved in cytoskeletal structure maintenance, rna processing and mitochondrial biogenesis. by employing a sirna gene silencing approach, we have shown that some of these host factors either facilitate or limit ev productive infection. among these host factors, phb was found to exert a pro-viral effect as evidenced by the reduced viral titers measured in the culture supernatant of nsc- cells when the expression of phb was down-regulated, and by an increased viral titer in phb over-expressing cells. phb is mainly localized on plasma membrane, mitochondria and nucleus, and has been involved in multiple signaling pathways regulated by growth factors, immune response, mitochondrial biogenesis, cell migration, proliferation and survival [ , ] . interestingly, knockdown in nsc- cells of stoml , which was shown to interact with phb and participate to mitochondria biogenesis [ ] , resulted in significant reduction of viral titer, similar to that seen with phb-knocked down cells. this further supports the involvement of mitochondrial proteins during ev infection cycle in nsc- cells. in addition, previous studies have reported the association of phb with internalization of several viruses, including hcv [ ] , chikv [ ] , denv [ ] , and coronavirus (sars-cov) [ ] . furthermore, phb was shown to promote hiv replication by interacting with the hiv- glycoprotein [ ] . using various experimental approaches, we have demonstrated that cell surface-expressed phb is physically associated with ev and is involved in the entry of the virus into nsc- cells. on the other hand, by employing a lucev replicon, we have shown that intracellular (mitochondrial) phb plays a role in ev replication activity. this was further supported by (a) two week-old ag mice (n = ) were infected i.p. with ev ( pfu) and the limb muscles, spinal cord and brainstem were harvested for immunohistochemical analysis at day p.i. scale bars denote μm ( × magnification) and μm ( × magnification). (b) ev -infected ag mice (n = ) were treated i.p. with roc-a at . mg/kg at day and p.i. and were monitored for survival and clinical manifestations. clinical scores were defined as follows: , healthy; , ruffled hair and hunched back; , limb weakness; , one limb paralysis; , both limbs paralysis at which point the animals were euthanized. statistical analysis of survival curve was performed using logrank (mantel-cox) test ( Ãà , p< . ). (c) limb muscles, spinal cord and brain were harvested at day p.i. for viral load determination by plaque assay (n = / ). dotted line represents the limit of detection. statistical analysis was performed using mann-whitney u test ( à , p< . ). error bars represent mean ± sem. one representative of two biological repeats is shown. https://doi.org/ . /journal.ppat. .g role of prohibitin in ev neuropathogenesis the observation that mitochondrial phb co-localizes with the replicating viral genome (dsrna) and the non-structural proteins d polymerase and cd complex. co-immunoprecipitation and tem approaches also confirmed the physical proximity and likely interactions between viral complexes/viral particles and mitochondria. these data thus led us to propose that mitochondria could serve as replication site for ev in nsc- cells. the association of ev with mitochondria was reported in a previous study where it was proposed that ev could potentially interact with some mitochondrial signaling proteins to evade host anti-viral innate immunity [ ] . previous studies have reported that membrane-bound phb binds to ras in a gtp-dependent manner, which in turn activates craf kinase and eventually triggers the mapk pathway [ ] . similar to other flavaglines, rocaglamide (roc-a) is a natural product that displays insecticidal, anti-fungal, anti-inflammatory and anti-cancer activities [ ] . mechanistically, roc-a was found to inhibit craf-phb interactions in tumor cells [ ] [ ] [ ] , and in an in vitro model of hcv infection [ ] . in ev -infected nsc- cells, incubation with nm concentrations of roc-a resulted in a dose-dependent reduction in virus titers. however, the mechanism by which roc-a exerts its antiviral effect against ev in nsc- cells does not seem to be mediated by blocking the craf/mek/erk pathway, given that phosphorylated erk proteins could not be detected in uninfected nsc- cells, suggesting that this pathway is not functional in these cells. instead, we found that roc-a-treated cells displayed reduced expression of mitochondrial phb and lower levels of intracellular atp, which suggests that mitochondria integrity/functionality is impaired in roc-a treated nsc- cells. since we showed that mitochondrial phb is involved in ev replication and that mitochondria serve as replication site for this virus, the impact of roc-a on phb expression and mitochondria integrity could represent the basis of its antiviral activity. further investigation is necessary to decipher the molecular mechanisms by which roc-a affects the expression of mitochondrial phb. interestingly, we found that the role of phb in ev entry and replication was limited to cells of neuronal origin, thus supporting a role of phb specifically in ev neuropathogenesis. this neuro-specific phenotype was also observed in vivo where roc-a-treatment resulted in reduced virus loads in the cns (spinal cord and brain) only but not in the limb muscles from infected mice, although phb was readily detected in the muscle cells as well. the cell typedependent involvement of phb during ev infection likely reflects differential intracellular events with different host factors being engaged during ev intracellular life cycle. since ev is known to be able to use multiple receptors to enter host cells, one could speculate that the host factors that are engaged during ev infection depend on the entry receptor that is being used by the virus. further study is necessary to explore this idea. in conclusion, our work has uncovered a novel host factor that is specifically involved in ev neurovirulence. in addition, our data support that roc-a, a previously established anticancer drug that targets phb, could represent a therapeutic approach to limit ev neuropathogenesis, and thus prevent or limit associated neurological complications. given the current attrition in effective antiviral drugs against ev , roc-a repurposing is worth considering seriously. all the animal experiments were carried out under the guidelines of the national advisory committee for laboratory animal research (naclar) in the aaalac-accredited nus animal facilities. the animal experiments described in this work were approved under the nus institutional animal care and use committee (iacuc) protocol number - . non-terminal procedures were performed under anesthesia, and all efforts were made to minimize suffering. murine motor neuron nsc- cells (cellutions biosystems, clu ), human rhabdomyosarcoma (rd) cells (atcc ccl- ) and human neuroblastoma sk-n-sh cells (atcc htb- ) were used in this study. all cell lines were cultured in dulbecco's modified eagle's medium (dmem) (gibco) containing % fetal bovine serum (fbs) (gibco) at ˚c with % co . non-mouse-adapted ev s ( /sin/ , accession no.: af ), kindly provided by prof. chow v. t. k. at national university of singapore, was isolated from the lymph node of a ev -infected patient who died of encephalitis and pulmonary edema in singapore [ ] . the virus stocks were made in rd cells and the viral titers were determined by plaque assay on rd cells. total proteins extract from infected cells was prepared using proteoextract complete mammalian proteome extraction kit (millipore). briefly, the cell pellet was thawed by resuspension in ice-cold resuspension buffer and the proteins were extracted with extraction buffer at room temperature (rt). benzonase and reducing agent were added during protein extraction to minimize nucleic acid contamination and to remove disulphide bonds, respectively. the solubilized protein suspension was subjected to centrifugation at , ×g for minutes at ˚c to remove the remaining insoluble material. the recovered cell extract was stored at - ˚c until further analysis. the protein samples ( μg, quantified using rc dc bradford assay, biorad) were loaded onto individual lanes of the isoelectronic focusing (ief) tray with pre-wetted electrode wicks. passive rehydration was performed for each protein sample using cm ph - readystrip ipg strips (biorad) for hours at rt with gel side down configuration. after rehydration, the protein sample was subjected to ief on protean ief cell i (biorad) according to the following conditions: v for minutes with linear ramp, , v for . hours with linear ramp and , v for , v-hours with rapid ramping. ipg strips equilibration was achieved by incubating the strips with pre-warmed dtt equilibration buffer i (biorad) followed by iodoacetamide-supplemented equilibration buffer ii (biorad) for minutes each on orbital shaker. equilibrated ipg strips were then transferred onto . % tris-hcl criterion gel (biorad) and overlaid with readyprep overlay agarose (biorad). electrophoresis was run at v for minutes. after electrophoresis, the gels were stained with instantblue (expedeon) for hour and submerged in milliq water overnight to remove background signal. gels were scanned using gs- calibrated densitometer (biorad). gel images were further processed using pdquest -d analysis software (biorad), whereby the different gel images from three independent experiments were matched and the intensities of detected spots were measured. protein spots that showed at least . -fold change in spot intensity (p< . , two-tailed student's t-test), compared to the uninfected control sample, were excised for maldi-tof ms. the fold change was calculated using the equation: mean of spot expression in infected samples for each time point. in-gel digestion and maldi-tof ms of the excised protein spots were done by protein and proteomics centre, national university of singapore (singapore). the data was search against the murine and viruses national centre for biotechnology information non-redundant (ncbinr) database using a mascot program (http://www.matrixscience.com). no threshold was applied to the ms/ms fragment ions intensities. data mining of the identified proteins was done by searching in panther (http://www. pantherdb.org/) and swiss-prot/trembl (http://www.uniprot.org/) databases. the enrichment analysis of protein-protein interactions was performed using string network analysis version (http://string-db.org/). hierarchical clustering and classification were performed using multiexperiment viewer version . (http://mev.tm .org/#/welcome). on rd cells ( cells/well) were seeded onto wells plate. culture supernatant from ev -infected samples was serially diluted ( -fold) with dmem containing % fbs prior to infection. the cell monolayer was incubated with μl of the diluted viral suspension for hour at ˚c. the cells were then washed twice with pbs and replaced with ml dmem containing % fbs and % carboxymethyl cellulose (cmc, sigma aldrich). after days incubation at ˚c, the infected monolayers were fixed and stained with % paraformaldehyde/ . % crystal violet solution (sigma aldrich). the number of plaques was scored visually and viral titers were expressed as plaque-forming units (pfu) per milliliter (pfu/ml). drug treated or sirna-transfected cells ( . × nsc- or × sk-n-sh cells/well) were washed twice with pbs and × alamarblue reagent (invitrogen) diluted with dmem containing % fbs was added. after hours incubation at ˚c, the fluorescence signals were determined using microplate reader (infinite , tecan) at ex nm and em nm . percentage of viable cells was calculated using non-treated cells as control. table) for mins at rt, the protein complexes were pulled down and subjected to western blot analysis (s table) . briefly, the protein extracts were incubated with the conjugated magnetic beads and further incubated for hour at ˚c with constant rotating, before eluting using laemmli buffer. isotype antibody pull-down and uninfected cells were used as controls. table) . ) . briefly, μl of × cytotoxicity reagent were added into each well and incubated at ˚c for minutes. cytotoxicity was measured using fluorescence at ex nm and em nm . after equilibrating the assay plate to rt for minutes, μl of atp detection reagent were added into each well and mitotoxicity was measured by luminescence. all readings were normalized against non-treated cells. sodium azide (s , sigma aldrich) and staurosporine (s , sigma aldrich) were used as mitochondrial toxin and cytotoxin positive controls, respectively. nsc- ( cells/well) cells were seeded onto -wells chamber slides (ibidi) and incubated overnight. the cells were treated with various concentrations of roc-a (diluted with % dmem) for hours. jc- dye (invitrogen) ( μg/ml in % dmem) was added into each well and incubated for another mins, prior to imaging. nsc- cells incubated with sodium azide nan at μm in % dmem (s , sigma aldrich) was used as positive control. cell lysate was prepared using m-per mammalian protein extraction reagent containing % of halt protease inhibitor cocktail and % of . m edta (thermoscientific). protein quantification was performed using quick start bradford protein assay (biorad). denatured proteins ( μg) were resolved in % sds-page gel and transferred electrophoretically onto nitrocellulose membrane using trans-blot turbo transfer system (biorad). after blocking with % (w/v) milk in tbst (tbs buffer with . % tween ) for hour at rt, the membrane was probed using specific primary antibodies and relevant secondary antibodies (s table) . the chemiluminescence signal was visualized using clarity western ecl substrate (biorad) on x-ray films. densitometric quantification was performed using imagej and the relative band intensity was normalized against β-actin. nsc table) . the nucleus was revealed using nucblue live readyprobes reagent (molecular probes). fluorescence images were captured using olympus ix microscope and further processed using imagej. limb muscles, spinal cord and brainstem from ev -infected ag mice were harvested at day p.i. after systemic perfusion. the organs were fixed in % pfa overnight prior immersing in % and % sucrose solution, and embedded in tissue-tek oct (vwr) solution. the organ samples were then frozen at - ˚, sectioned ( μm) using a cryostat (leica) and mounted on a glass slide prior to blocking and staining as described above. samples were fixed for h at rt with . % glutaraldehyde containing % tannic acid in . m cacodylate buffer (ph . ), then washed three times for min (each time) in . m cacodylate buffer and post-fixed for h at rt with % osmium tetroxide in the same buffer. samples were then dehydrated in a graded series of ethanol and embedded in spurr. thin sections were stained with % uranyl acetate and lead citrate and observations were performed by transmission electron microscopy using a fei tecnai spirit g at kv. image were taken using a fei eagle k ccd camera. nsc- , sk-n-sh ( × cells) and rd ( × cells) cells were seeded onto -well plate overnight. the cells were dislodged by incubating them in mm edta/pbs in ˚c for mins and spun down to collect the cell pellet. cells were then blocked with either human or murine fc-blocker ( or , bd pharmigen; : ) for min at rt, and subsequently stained with anti-phb antibody (ab , abcam; : ) and/or anti-rabbit af antibody (a , thermoscientific; : ) for min at ˚c. stained cells were then fixed with % pfa. flow cytometry analysis was carried out using the becton-dickinson fortessa flow cytometer and analysed using flowjo v . two-week old ag mice (b&k universal) were bred and housed under specific pathogenfree conditions in individual ventilated cages. infection was performed by injecting intraperitoneally (i.p.) pfu of s (in μl) per mouse. at day and post-infection (p.i.), mice were injected i.p. with . mg/kg of roc-a (in . % dmso in sterile olive oil). the control group was inoculated with . % dmso in olive oil. clinical manifestations were observed for a period of days. clinical score was graded as follows: , healthy; , ruffled hair and hunched back; , limb weakness; , one limb paralysis; , two limbs paralysis; and , death. two limbs paralysis was used as criterion for early euthanasia. for virus titer determination, infected mice were euthanized at day p.i. and perfused with ml of sterile pbs systemically. the fore and hind limb muscles, spinal cord and brain were harvested and weighed before mechanical homogenization in ml of serum-free dmem. the homogenates were spun down at , rpm for minutes at ˚c, and clarified using a . μm filter before serial dilution was carried out for plaque assay. viral titers were expressed as pfu per gram of tissue. supporting information s relative band quantification (below western blot) was determined by imagej, by normalizing to loading control, β-actin. statistical analysis was performed using two-tailed student's t-test ( Ãà , p< . ). one representative from two independent experiments is shown. (pdf) (a) for co-treatment assay, virus was incubated with roc-a for hour before adding the mixture onto the cells. after one hour of incubation on cell monolayer, the mixture was then removed and replaced with fresh % dmem. (b) for pre-treatment assay, the cells were pre-treated with roc-a for hours, prior to viral infection. culture supernatants were harvested at h.p.i. for viral titer determination by plaque assay. cell viability was assessed using alamarblue viability assay. statistical analysis was performed using one-way anova with dunnett's post-test ( Ãà , p< . ). one representative from two independent experiments is shown. (pdf) . viral titer in the culture supernatant was determined by plaque assay at h.p.i. statistical analysis was performed using two-tailed student's t-test ( à p< . , Ãà p< . , ÃÃà p< . , ÃÃÃà p< . ). error bars represent mean ± standard deviation. relative band quantification (below western blot) was determined by imagej, by normalizing to loading control, β-actin. non-targeting sirna (ntc) served as control. (b) rd cells were pre-incubated with anti-phb antibody or isotype control antibody for hour before infection. culture supernatant was harvested at h.p.i. for viral titer determination. cell viability was determined using alamarblue cytotoxicity assay. one representative of two biological repeats is shown. (pdf) neurotropic enterovirus infections in the central nervous system epidemiological analysis, detection, and comparison of space-time patterns of beijing hand-foot-mouth disease enterovirus infection with central nervous system involvement the epidemiology of hand, foot and mouth disease in asia: a systematic review and analysis virology, epidemiology, pathogenesis, and control of enterovirus . the lancet infectious diseases outbreaks of hand, foot, and mouth disease by enterovirus . high incidence of complication disorders of central nervous system ev vaccine, an invaluable gift for children ev vaccine: protection from a previously neglected disease status of research and development of vaccines for enterovirus proteomic analysis of extremely severe hand, foot and mouth disease infected by enterovirus proteomics analysis of ev -infected cells reveals the involvement of host protein nedd l in ev replication comparative proteome analyses of host protein expression in response to enterovirus and coxsackievirus a infections transcriptomic and proteomic analyses of rhabdomyosarcoma cells reveal differential cellular gene expression in response to enterovirus infection human genome-wide rnai screen reveals host factors required for enterovirus replication global quantitative proteomic analysis of human glioma cells profiled host protein expression in response to enterovirus type infection a practical guide to clinical virology retrograde axonal transport: a major transmission route of enterovirus in mice virus infections in the nervous system & feuer r. enterovirus infections of the central nervous system enterovirus infection of motor neuron-like nsc- cells undergoes a non-lytic exit pathway neuroblastoma x spinal cord (nsc) hybrid cell lines resemble developing motor neurons nonlytic viral spread enhanced by autophagy components topology of double-membraned vesicles and the opportunity for nonlytic release of cytoplasm string v : protein-protein interaction networks, integrated over the tree of life cellular proteome alterations in response to enterovirus and coxsackievirus a infections in neuronal and intestinal cell lines beta-actin variant is necessary for enterovirus replication proteomic analysis of human brain microvascular endothelial cells reveals differential protein expression in response to enterovirus infection requirement for an intact t-cell actin and tubulin cytoskeleton for efficient assembly and spread of human immunodeficiency virus type tubulin: a factor necessary for the synthesis of both sendai virus and vesicular stomatitis virus rnas subversion of the actin cytoskeleton during viral infection multifunctional alpha-enolase: its role in diseases protein disulfide isomerase in redox cell signaling and homeostasis comparing the primary and recall immune response induced by a new ev vaccine using systems biology approaches dengue virus infection induces upregulation of hn rnp-h and pdia for its multiplication in the host cell galectin- binding to cell surface protein disulfide isomerase regulates the redox environment to enhance t-cell migration and hiv entry identification and characterization of prohibitin as a receptor protein mediating denv- entry into insect cells a novel class of small molecule compounds that inhibit hepatitis c virus infection by targeting the prohibitin-craf pathway identification of prohibitin as a chikungunya virus receptor protein interaction of stomatin with hepatitis c virus rna polymerase stabilizes the viral rna replicase complexes on detergent-resistant membranes network based analysis of hepatitis c virus core and ns b protein interactions pebp , a raf kinase inhibitory protein, negatively regulates starvation-induced autophagy by direct interaction with lc epigenetic regulation of autophagy by the methyltransferase ezh through an mtordependent pathway knockdown of deptor induces apoptosis, increases chemosensitivity to doxorubicin and suppresses autophagy in rpmi- human multiple myeloma cells in vitro defective axonal transport of neurofilament proteins in neurons overexpressing peripherin intermediate filaments in peripheral nervous system: their expression, dysfunction and diseases neuronal subtype and satellite cell tropism are determinants of varicella-zoster virus virulence in human dorsal root ganglia xenografts in vivo prohibitins role in cellular survival through ras-raf-mek-erk pathway prohibitin interacts with envelope proteins of white spot syndrome virus and prevents infection in the red swamp crayfish, procambarus clarkii human scarb -dependent infection by coxsackievirus a , a , and a and enterovirus viral and host proteins involved in picornavirus life cycle the mitochondrial phb complex: roles in mitochondrial respiratory complex assembly, ageing and degenerative disease enterovirus protease apro targets mavs to inhibit anti-viral type i interferon responses inhibition of the craf/prohibitin interaction reverses craf-dependent resistance to vemurafenib the natural anticancer compounds rocaglamides inhibit the raf-mek-erk pathway by targeting prohibitin and prohibitin ligands in cell death and survival: mode of action and therapeutic potential a non-mouse-adapted enterovirus (ev ) strain exhibits neurotropism, causing neurological manifestations in a novel mouse model of ev infection stomatin-like protein binds cardiolipin and regulates mitochondrial biogenesis and function severe acute respiratory syndrome coronavirus nonstructural protein interacts with a host protein complex involved in mitochondrial biogenesis and intracellular signaling identification of the cellular prohibitin /prohibitin heterodimer as an interaction partner of the c-terminal cytoplasmic domain of the hiv- glycoprotein rocaglamide, silvestrol and structurally related bioactive compounds from aglaia species expression of a mutant prohibitin from the ap gene promoter leads to obesity-linked tumor development in insulin resistance-dependent manner & chellappan s. prohibitin induces the transcriptional activity of p and is exported from the nucleus upon apoptotic signaling prohibitin: a potential target for new therapeutics complete sequence analyses of enterovirus strains from fatal and non-fatal cases of the hand, foot and mouth disease outbreak in we would like to thank a/p vincent chow from the department of microbiology & immunology, nus, for sharing the s strain and the electron microscopy unit core facility from yong loo lin school of medicine, nus. conceptualization: issac horng khit too, sylvie alonso. formal analysis: issac horng khit too, isabelle bonne. key: cord- - lf j lo authors: anderson, kevin; bond, clifford w. title: structural and physiological properties of mengovirus: avirulent, hemagglutination-defective mutants express altered alpha ( d) proteins and are adsorption-defective date: journal: arch virol doi: . /bf sha: doc_id: cord_uid: lf j lo structural and physiological properties of two mutants of mengovirus, and , were compared to those of wild-type virus to understand the molecular basis of changes exhibited in their biological function. two dimensional gel electrophoresis of wild-type and mutant structural proteins revealed alterations in the isoelectric character of the alpha ( d) protein of both mutant and . these data suggest that alterations in the alpha ( d) protein may be responsible for the phenotypic changes by the mutants. a delay in detectable virus-specified protein synthesis was exhibited in mutant-infected cells in comparison to wild-type. the amount of rna synthesized in mutant- and revertant-infected cells was less than that synthesized in wild-type infected cells. changes in virus-specified macro-molecular synthesis in mutant and revertant-infected cells reflected a decrease in the ability of the viruses to attach to cells. biological properties of two mengovirus mutants, and , were compared with those of wild-type virus ( ) . these mutants were defective in their ability to agglutinate erythroeytes, produced smmler plaques in cell culture, were avirulent in mice, but were not temperature-sensitive. these data suggested that changes in the structure of tile mutant viruses may be responsible for the expression of altered phenotypie traits. in this communication, the structurm proteins of the wfld-ty]pe and mutant, viruses were compared by sds-page, peptide mapping, and twodimensional gel electrophoresis to determine which of the mutant structural proteins differed from that of wild-type and the extent of homology among analogous proteins. the structural proteins of two revertants of mutant were also examined by two-dimensional gel electrophoresis to relate changes in biological function to structural differences in the analogous proteins of the mutants and wild-type mengovirus. in addition, the progression of events associated with wild-type, mutant, and revertant virus infection were examined to identify-factors responsible for physiological changes associated with mutant and revertant virus-specified macromolecular s:ymthesis. the growth and assay of wild-type, mutant and revertant viruses using bhk- cells have been described previously ( ) . mutant and revertant virus stocks used in the experiments described below were from the second passage of virus isolated by two successive rounds of plaque purification. purified virions were radiolabeled in vivo with [hc(u)]-l-amino acid mixture ( ~ci/ ml) [new england nuclear (nen), nec- ] were prepared and analyzed by sds-polyacrylamide gel electrophoresis as described previously ( ) . surface tyrosine residues of purified virions were labeled with [ ~i]-sodium iodide [(nen), nez- hi using a modification of the method described by millar and smith ( ) . pellets containing purified virions were resuspended in tn buffer [ mm tris-hc (ph . ), m~ nac ] and quantitated by absorbanee at nm as described ( ) . suspensions of purified virus in tn buffer containing . absorbance units ( nm), ~ci of lesi, and ~g of iodogen (pierce) were mixed in a total volume of . ml and agitated for l minutes. an equal volume of . ~m nai, . ~m -mercaptoethanol was added to the mixture and layered onto sephadex g- columns ( × mm) equilibrated with tn buffer. the excluded volumes were collected and centrifuged in an sw rotor at , rpm for minutes at °c. pellets were prepared for preparative sds-polyacrylamide gel eleetrophoresis (sds-page). virus-specified intracellular proteins were labeled with l-[ s]-methionine, immunoprecipitated and analyzed by sds-page. cells were mock-infected or infected with virus in suspension ( × eells/ml) at an moi of , adsorbed, plated into plastic mm dishes ( . × ceils/dish), and incubated at ° c. cells were pulse-labeled for minutes with ~ci/ml ~s-methionine in . ml methionine-free medium. the cells were washed twice with serum free medium and lysed with . ml b [ mm tris-hcl (ph . ), mm mgci~, . percent np , . percent sds, percent aprotinin, txg/ml ribonuelease a, ~g/ ml deoxyribonuelease] for minutes on ice. cellular lysates were centrifuged at × g for minute at ° c and the supernatant fluid was colleeted and stored at - ° c. b [ m~ tris (ph . ), mm nacl, m~i edta, . percent sodium azide, . percent np , percent aprotinin, . percent bovine sernm albumin]. twelve txl of mouse anti-mengovirus hyperimmune aseitic fluid ( ) was added to the diluted lysates ( . ml volume) and incubated at ° c for minutes. immune complexes were precipitated with t~l of percent fixed staphylococcus aureus (cowan strain) ( ) by incubation at ° c for minutes and pelleted by centrifugation for minute at × g. pellets were washed four times with b buffer at ° c and resuspended in t~l mm dithiothreitol (dtt), percent sds. after minutes of incubation at room temperature, the immunoprecipitated proteins were eluted and reduced by heating at ° c for minutes. bacteria were removed by eentrifugation and the supernatant fluids were prepared for sds-page. virus-specified intracellular rna was labeled with [ , -'~h]-uridine [(nen), net- ]. cells were infected in suspension with virus ( × ceus/ml) at an moi of , plated in plastic mm dishes ( . × cells/dish) following adsorption at ° c for minutes, and incubated at ° c. actinomyein d ( l~g/ml final concentration) was added to each dish at various time points and the cells were incubated at ° c for minutes. the medium was aspirated, replaced with . ml dme containing ~ci/mt '~h-uridine, and the monolayers were incubated at ° c for minutes. at the end of the labeling period, the cells were lysed at ° c for minutes with net buffer [ mm tris-hc (ph . ), mm nac , mm edta] supplemented with percent np . the lysate was centrifuged at × g for i minute. duplicate samples containing i~t of the supernatant fluid were spotted onto whatman gfc glass fiber filters and precipitated in ice cold percent trichloroacetic acid (tca). filters were placed in vials with scintillation fluid [ g / l , diphenyloxazoie (pp ) in xylene], and counted in a packard lsc cd liquid scintillation counter using the pre-set tritium channel ( ). to prepare virus particles containing radiolabeled i~na, cells were infected with virus at an m i of and were labeled at . hpi with ~ci/ml [ , - h]-uridine [(nen), net- ] or ~ci/m p-orthophosphate [(nen), nex- ] in medium containing . rag/ l monosodium phosphate. virus-infected cells were allowed to incubate at, ° c until lysis and virions were purified as described above. a~s-methionine labeled virions were suspended in sample buffer containing . m urea (bio-l~ad), percent. np- (particle data laboratories), percent bio-lyte / (bio-rad), and m~[ dtt as described ( ) and incubated at ° c for minutes. isoelectrie focusing (ief) was performed as described ( ). following equilibration, the tube gels were placed onto percent polyaerylamide slab gels and subjected to sds-page. the ph gradients for ief and nephge were generated from cm serial gel sections equilibrated in mm nac and the ph values were lowered by . ph unit~s to correct for measurement in the presence of urea ( ) . the isoclectric point (pi) value for each protein species was estimated from these gradients. bands corresponding to the structural proteins of mcngovirus were identified by exposure of dried preparative sds-page gels to x-ray film, excised, rehydrated in mm ammonium bicarbonate, . percent sds, and electroeluted as described ( , ) . protein samples were lyophilized and sds was removed as described ( ) . the proteins were dried, oxidized for minutes at ° c with . mt of fresh performie acid and lyophitized three times in distilled water. proteins were anmyzed for purity by sds-page prior to digestion. purified, oxidized proteins were resuspended in . ml of percent ammonium bicarbonate (ph . ) for digestion with n-mpha-p-tosyl-l-lysine ehloromethyl ketonetreated chymotrypsin (tlck-chymotrypsin) at ° c as described ( ), enzyme treated samples were frozen and lyophilized before peptide analysis by thin layer chromatography (tlc). two-dimensional peptide mapping was performed as previously described (ll). lyophilized samples were dissolved in electrophoresis buffer [butanol-pyridine-acetie acid-water ( : : : )] and -- pl was spotted onto cellulose tlc plates (e. merck). separation of peptides in the first dimension was perfomed by eleetrophoresis for minutes at v ( to ma). the tlc plates were air-dried and the peptides were separated in the second dimension by ascending chromatography in n-butanol-pyridineacetic acid-water ( : : : ) until the solvent front reached cm from the top. dried plates were sprayed with en hanee (nen) and exposed to preflashed kodak nag- x-ray film at - ° c. the number and percentage of wild-type, mutant, and revertant virus particles adsorbing to bhk- cells were determined as follows. cells were infected with purified ~ep_ labeled virus at a multiplicity of particles/cell. the p-labeled virus suspensions were allowed to adsorb to bhk- cell monolayers in plastic dishes ( mm diameter) for minutes at ° c. the cells were lysed with . ml b for minutes on ice. duplicate t~ samples were spotted onto whatman glass fiber filters and precipitated in ice-cold percent tca. the filters were placed in vials with scintillation fluid and counted in a packard cd liquid scintillation counter using the pre-set ~ p channel ( ) . the fraction of cell-associated virus particles was calculated by dividing the number of cell associated counts per minute (cpm) by the cpm of the input virus. the average number of virus particles adsorbed per cell was calculated by multiplying the fraction of cell-associated virus by the multiplicity of infection ( particles/cell). the structural proteins of purified c-labeled wild-type and m u t a n t mengovhnases were analyzed by s d s -p a g e on , a n d p e r c e n t polya e r y l a m i d e slab gels. a n a l y s e s of the structural proteins resolved on percent gels are shown (fig. ) . five structural proteins were resolved for each of the viruses: epsilon (lab), kd; alpha ( d), kd; b e t a ( b), kd; g a m m a ( c), kd; and delta (t a), . kd. no changes in the migration of the m u t a n t structurm proteins were o b s e r v e d in c o m p a r i s o n to those of the wild-type virus. in addition, the migration of the structural proteins of several h a + r e v e r t a n t s of m u t a n t was also similar to t h a t of wild-type (data n o t shown). although some v a r i a t i o n in the a m o u n t of delta ( a) p r o t e i n of the viruses w a s o b s e r v e d (fig. ) , this was not a result found consistently t h r o u g h o u t our analyses. s-methionine-labeled structural proteins of wild-type and m u t a n t viruses and were analyzed following digestion with t p c k -t r y p s i n b y r e v e r s e -p h a s e h p l c to further e x a m i n e the structure of the proteins by separating peptides in a gradient on the basis of charge, however, the spectra of methionine-eontmning peptides of the delta ( a), beta ( b), gamma ( c), and alpha ( d) proteins from the different viruses were identical (data not shown). although not all of the peptides generated by tpck-trypsin digestion were likely to contain methionine, the results suggested that the peptide compositions of the four strueturm proteins of the wild-type and mutant viruses were remarkably similar. hplc analyses of the tryptie peptide compositions of the four structural proteins of the wildtype and mutant viruses uniformly labeled with c-amino acid mixture by labeled strueturm proteins was inconclusive because the resulting ehromatograms contained many unresolvable, overlapping peaks (data not shown). to determine whether the arrangement of the structural proteins on the surface of mutants and was similar to that of wild-type, purified virions were labeled with r i and analyzed by sds-page (fig. ) . most of the tyrosine residues on the surface of the wild-type and mutant viruses were on the alpha ( d) protein as demonstrated by the extensive labeling of this protein. the beta ( b) protein was labeled to a much lesser extent. there was no detectable difference in the pattern of surface protein labeling among the three viruses suggesting that the arrangement of the structural proteins on the surfaces of the virions of the wild-type and mutant viruses was similar. to examine the structure of the surface proteins of the three viruses in greater detail, two-dimensional tlck-ehymotryptie peptide ehromato- although not all of the surface peptides generated by tlck-chymotrypsin are likely to be labeled, numerous peptides were labeled. these data suggest that the arrangement of the structural proteins on the surface of the three viruses was remarkably similar. the structural proteins of the wfld-ts~e, mutant, and revertant strains were analyzed by ph gradient gel electrophoresis to determine whether any changes were apparent in migration of the mutant and revertant structural proteins relative to those of the wild-type virus. the proteins were analyzed by ief and nephge two-dimensional gel electrophoresis in order to resolve both acidic and basic proteins. the two-dinlensional electrophoretie migration of the structural proteins of wild-type mengovirus is shown in fig. , panel a. the epsflon ( ab) protein had a pi of . . the alpha ( d) protein was separated into four major protein species, each with a distinct pi ( . , . , . , and . ). the beta ( b) protein was separated into two major protein species with pi values of . and . and a heterogenous species ranging from . to . the gamma ( c) protein was not well resolved by this technique and appears to be slightly acidic or neutral in charge. the delta ( a) protein does not appear in any of the two-dimensional ief gels. the two-dimensional electrophoretic migration of the structural proteins of mutant is shown in fig. , panel b . the etectrophoretic migration of the proteins was identical to that of the wild-type strain with one exception. only three alpha ( d) protein species were resolved. one of the protein species resolved in separation of the wild-type proteins, p i = . , was absent. however, this protein species was resolved in the wild-type/ mixed sample separation (fig. , panel d) . these data suggest that the absence of this particular protein species represents a phenotypie change or mutation in the alpha ( d) protein of mutant relative to the wild-type virus. in addition, the two-dimensional electrophoretic patterns of the structural proteins of the revertent viruses, -a and -d , were identical to those of mutant (data not shown). the two-dimensional electrophoretie migration of the structural proteins of mutant is shown in fig. , panel c. the eleetrophoretic migration of the proteins was identiem to t h a t of the wild-type strain with one exception. the pi of one of the four alpha ( d) protein species (pi = . ) was slightly more acidic t h a n tile analogous wild-type isoelectric species ( p i = . ). this species a p p e a r e d b r o a d e r and slightly lower in molecular weight t h a n the corresponding wild-type protein species. the wild-type/ mixed sample separation (fig. , panel e) shows a broad protein species in the region of the gel corresponding to tile apparent, change in the pis of the wild-type and alpha ( d) protein species. the altered migration of the m u t a n t isoeleetrie species was found consistently in two dimensional profiles of independently derived virion protein samples. these data suggest that the change in form and migration of this protein species represents a phenotypic change or mutation in the alpha ( d) protein of mutant relative to the wild-type virus. two-dimensionm nephge analyses confirmed the results obtained by ief analysis of the wild-type, mutant, and revertant structural proteins. the delta ( a) proteins of the wild-type and mutant viruses were resolved by this technique and migrated similarly as a single acidic protein species at ph . (data not, shown). the specific activities (cpm/particle) of the wild-type and mutants differed when bhk- cells were intbcted and labeled with s-methionine or c-amino acids under similar conditions (ani)e~son and bo~-d, unpublished data). therefore, to determine whether differences in the kinetics of virus-specified macromotecular synthesis exist among the viruses, virusspecified intraeellular protein and t~na synthesis were examined. virusinfected cells were pulse-labeled at hour intervals from to ll hpi with s-methionine for minutes. cytoplasmic lysates were immunoprecipitated and analyzed by sds-page (fig. ) . eight virus-specified intracellular proteins were resolved in wild-type and mutant strain-infected cells: a ( - a), b ( ), c ( ), d ( cd), abc, d ( cd), alpha ( d), and gamma ( c). the number and molecular weights of the proteins specified by the wild-type and mutant strains were identical. however, virus-specified protein synthesis was detected initimly at hpi for the wild-type strain and at hpi for mutant strains and . to determine whether changes in the kinetics of lgna synthesis could explain the delay in detectable protein synthesis observed in mutantinfected cells, virus-infected cells were treated with aetinomyein d, pulselabeled with h-uridine, lysed, and counted. the results are shown in fig. . the peak of rna synthesis for each virus was from to hpi. however, the amount of rna synthesized by the wild-type virus was -fold greater than that of the mutants and -fold greater than that of the revertants. since the magnitude of virus-specified rna synthesized in the mutant virusinfected cells was -fold less than that of wild-type virus, the beginning of virus-specified protein synthesis, as detected by immunoprecipitation (fig. ) , would appear to be delayed due to the fact that less rna would be available for translation. since the magnitude of virus-specified i%na synthesis in mutant virusinfected cells was -fold less than that of the wild-type virus, mterations in uncoating or adsorption of the m u t a n t viruses to cells m a y account for this difference. however, surface peptide analysis suggested t h a t the arrangem e n t of the strueturm proteins of the wild-type and m u t a n t virions was r e m a r k a b l y similar. therefore, it is possible t h a t uncoating of the virions would occur at a similar rate in infected ceils due to their similarity in structure. the average n u m b e r of wild-type, mutant, and revertaa~t virus particles adsorbing to bhk- cells was examined tbllowing incubation at ° c for minutes. u n d e r these conditions, the a m o u n t of virus adsorption to cells would approximate saturation. the results are shown in table . the average n u m b e r of virus particles adsorbing per cell clearly differed among the wild-type, mutant,, and r e v e r t a n t viruses. therefore, it is a p p a r e n t t h a t the mechanisms of adsorption of the m u t a n t and r e v e r t a n t viruses to cells is modified from t h a t of wild-type. the diftbrenee in the cellular binding affinities among the wild-type, mutant, and revei'~ant viruses would be an i m p o r t a n t factor contributing to the differences in the magnitude of viral rna and protein synthesis observed fbr these viruses. these data suggest t h a t fewer cells would be infected by the m u t a n t and r e v e r t a n t viruses and therefore, fewer virus-specified macromoleeules would be synthesized. kevln anderson and clifford w. bond: cells were infected at an moi of particles/cell and incubated at ° c for minutes b the average number of particles adsorbed/cell ° the percentage of adsorption of the mutants and revertants in comparison to wild-type mengovirus biologiem and structural properties of two mengo~drus :mutants have been compared to those of the parental wild-type strain. ~e s e mutants, and , exhibited alterations in agglutination of erythrocytes, virulence in mice, and plaque morphology ( ) . in addition, biological characterization of several ha + revertants of mutant isolated from the brains of mice infected intraeranieally indicated that agglutination and virulence may be linked traits and that the size of plaques produced by a pa~ieular mengovirus isolate may reflect its bindhag affinity to cells and virulence in mice. analysis of s-methionine-labeled structural proteins of wild-type and mutant viruses by sds-page and hplc revealed that extensive homology exists among these viruses. to further examine these viruses for structural differences, surface labeling of intact wild-type and mutant virus particles was done to compare the arrangement of the structural proteins which form the xdral capsids. previous studies have shown that surface iodination of intact potiovirus and mengovirus labels primarily the d (vp or alpha) protein of the respective virus eapsids ( , ) and the b (vp or beta) protein is also labeled, but to a much lesser extent than the d protein. we obtained similar data tbr the surface iodination of intact wild-type and mutant virus particles. in addition, no apparent differences were evident from the analysis of ehymotryptie peptides of i-labeled wild-type and mutant alpha ( d) proteins. these data suggest that arrangement of structural proteins on the surface of mutant eapsids did not differ from that of wild-type. therefore, the mutations may result in the expression of altered determinants that reflect differences in biologicm function and may not result in the masking of otherwise funetionm determinants due to altered arrangement of the capsid proteins. we compared the isoeleetrie character of the viral capsid proteins by two-dimensional gel electrophoresis. multiple species of the alpha ( d) and beta ( b) proteins were reprodueibly deteeted by this technique in both the presence (fig. ) and absence (data not shown) of sds prior to isoeleetrie focusing. previous studies have demonstrated multiple isoelectrie species for the major eapsid proteins of poliovirus [vp ( d), vp ( b), and vp ( c)] ( , ) and emc virus [alpha ( d) and beta ( b)] ( ). vriasen et al. ( ) also excluded the possibility that the multiple species are derived from differential binding of sds or the accumulation of mutants in the original virus stoek. these authors suggested that the multiple species may result from heterogeneity in the cleavage of structural protein precursors; although the sequenees of the amino-and earboxyl-termini of the major eapsid polypeptides of mengovirus were determined without mention of sequenee variations ( ) . we suggest that the charge heterogeneity of these molecules may reflect differences in the interaction of these proteins with the ampholines. the different protein species observed may represent different forms of the viral proteins associated with either intraeellular partieles or free extraeellular particles or extraeellular particles bound to or eluted from membranes, assuming that these particles share the same buoyant density in equilibrium gradients. therefore, the heterogeneity observed may reflect differenees in the manner of isolating virus partieles, whether intraeellularly or from lysed cells. differenees were deteeted among the alpha ( d) structural proteins of the wild-type and mutant viruses. since the mutants exhibited different alterations in the alpha ( d) protein, yet similar ehanges in biological activity, it is possible that alteration of this protein resulted in the observed phenotypie ehanges. the differenees in the number and migration of the alpha ( d) protein isoelectrie species are likely to be due to changes in uneharged amino acid residues affeeting the interaction of the altered species with the ampholines. changes in eharged amino acid residues would result in altered migration of all four isoeleetrie species. since the ehromatograms of surface-and metabolieally-labeled peptides were similar, we speculate that the alterations in the mutant alpha ( d) proteins may be assoeiated with a particular determinant on this protein. this determinant may serve as or be part of a funetional attachment site on the surface of virus partieles that determines their affinity for various cells. atomic resolution of the structure of another pieornavirus, human rhinovirus , revealed a large cleft on the ieosahedral faee that has been postulated to serve as the host eell receptor binding site ( ) . the large cleft separates the major part of five d (alpha) subunits from the other viral protein subunits. sinee the pieornaviruses share similar struetural eharaeteristies, by analogy it seems reasonable to speculate that mutations in the d (alpha) protein could affect the strueture of the deft, and thus, alter the binding affinities of the mutant and revertant viruses. alteration of the mengovirus host cell receptor binding site may lead to changes in affinity for erythrocytes as well as other cell types which may explain the lack of virulence of the mutants in mice. previous work by morishima et al. ( ) demonstrated that differences in binding affinities of closely related strains of emc and mengovirus to various cells reflects their difference in pathogenicity for mice. ha + revertants were isolated from the brains of mice infected intracranially with mutant ( ) . in addition to regaining agglutination activity, the revertants were also virulent for mice and exhibited a slight increase in plaque size. however, revertants required . to -fold more p f u to kill mice than the wild-type virus. since the isoelectric character of t/he proteins of the ha + revertants were identicm to that of mutant , this partial phenotypic reversion may have resulted from a change in an uncharged amino acid associated with a determinant, located on the alpha ( d) protein, which partially restores its biological activities. the partial reversion of this mutation may involve modification of the altered surface determinant, increasing the binding affinity of the virus for cells and restoring virulence, but would not result in reappearance of the isoelectric species absent in mutant and revertant alpha ( d) capsid proteins. alternatively, mutant may express more than one mutation since complete reversion to w i l d -t~e virulence and plaque size did not occur and the isoelectric profile of the alpha ( d) protein was the same as that of mutant . however, revertants were not isolated from mice infected intracranially with mutant , which shared biological properties as well as alpha ( d) protein alterations with mutant . therefore, the mutation observed in the alpha ( d) structural protein of mutant is apparently more stable than that of mutant . however, unlike mutant , expression of the mutation of mutant resulted in a more subtle, yet reproducible change in the isoelectric point of one of the four alpha ( d) protein species. changes in virus-specified macromolecular synthesis in mutant and revertant virus-infected cells can be explained by a decrease in the ability of these viruses to attach to cells. these data suggest that a smaller proportion of cells were productively infected with the mutant and revertant viruses in comparison to wild-type. therefore, the effective moi for the mutant and revertant viruses would be less than that of wild-type. this difference can be explained by changes in the cellular binding affinities; and the higher particle : p f u ratios exhibited by the mutant and revertants ( ), assuming that a greater proportion of noninfectious particles would lower the probability of these viruses to infect cells. this interference phenomenon would explain why p f u values obtained under dilute conditions could not be used to accurately predict the fraction of infected cells in high particle : cell infections. since fewer cells would be productively infected with mutant and revertant viruses, less virus-specified rna would be synthesized, although the peak hour of rna synthesis would be the same as that of wild-type. therefore, less rna would be available for translation; and protein synthesis, as detected by immunoprecipitation, would appear to be delayed in ceils infected with the mutant viruses. changes in the metabolic rates of mutant, virus-specified gna synthesis would predict different results than those presented here if tile fraction of infected cells were similar to ~t d -t y p e infections. collectively, our data suggest that the phenotypie changes expressed by the mutants may be due to mutations toeated exclusively within the alpha ( d) coding region of the genome. previous work by a(~ol et al. ( ) has indicated that the neurovirulenee of poliovirus maps to the ' end of the genome which includes the coding region of the eapsid proteins. consistent with these results, we have identified two different mutations in the alpha ( d) capsid protein, which maps to the ' end of tiffs region, of two avirulent mengovirus mutants. in addition, ko~tara et al. ( ) have stated that a molecular recombinant of the sabin vaccine strain of poliovirus containing vp ( d) and most of vp ( c) of the neurovirulent mahoney strain is virulent, but not as virulent as the mahoney strain and retain the small plaque morphology of the sabin strain. our data also indicate that changes in the alpha ( d) protein result in altered virulence of a pieornavirus. since revertants of mutant shared similar characteristics with the poliovirus recombinant, the plaque-forming ability of a particular virus isolate may be a characteristic that reflects as well as contributes to its virulence. future experiments which compare the nueleotide sequences of the nmtants and revertant structural proteins to that of the parental wild-type virus will be useful in determining the genetic basis for the altered biological properties exhibited by the mutant and revertant viruses. construction and properties of intertypie poliovirus reeombinants: first approximation mapping of the major determinants of neurovirulence biological properties ofmengovirus: characterization of avirulent, hemagglutination-defective mutants iodination of poliovirus capsid proteins liquid scintillation counting: elimination of spurious results due to static electricity relatedness of virion and intracellular proteins of the murine coronaviruses jhm and a two-dimensional electrophoretic analysis of encephalomyocarditis viral proteins identification of the initiation site of poliovirus protein synthesis two-dimensional gel electrophoresis and computer analysis of proteins synthesized by clonal cell lines isoelectric points of polypeptides of standard poliovirus particles of different serological types and of empty capsids and dense particles of poliovirus type a micromethod for complete removal of dodecyl sulfate from proteins by ion-pair extraction characterization of t antigens in polyoma-infected and transformed cells structure of the mengo virion. distribution of the capsid polypeptides with respect to the surface of the virus particle rapid isolation of antigens from cells with a staphylococcal protein a-antibody absorbent: parameters of the interaction of antigen-antibody complexes with protein a in vitro phenotypic markers of a poliovirus recombinant constructed from infectious cdna clones of the neurovirulent mahoney strain and the attenuated sabin strain protein iodination using iodogen genomic and receptor attachment differences between mengovirus and encephalomyocarditis virus two-dimensional polyacrylamide gel electrophoretic fraetionation structure of a human common cold virus and functional relationship to other picornaviruses on the structure and morphogenesis of picornaviruses systematic nomenclature for picornavirus proteins poliovirus replication proteins: rna sequence encoding p - b and the sites of proteolytie processing isoelectric points and conformations of proteins. i. effect of urea on the behavior of some proteins in isoelectric focusing gesolution of the major poliovirus eapsid proteins into doublets structure of the mengo virion. iv. amino-and carboxylterminal analysis of the major capsid polypeptides we thank drs. sandra ewald and andreas luder for their hetpfhl discussions and interest in our work and dr. andrew king for helpful comments on the manuscript. support for this work was obtained from three geseareh creativity development grants awarded to k. a. by the department of graduate studies, montana state universi~-and a grant from the montana heart association, inc. awarded to c.w.b. p~eceived march , key: cord- -lsksys authors: goto, keiko; yamaoka, yutaro; khatun, hajera; miyakawa, kei; nishi, mayuko; nagata, noriko; yanaoka, toshikazu; kimura, hirokazu; ryo, akihide title: development of monoclonal antibodies and antigen-capture elisa for human parechovirus type date: - - journal: microorganisms doi: . /microorganisms sha: doc_id: cord_uid: lsksys human parechovirus type (hpev ) is an etiologic agent of respiratory diseases, meningitis, and sepsis-like illness in both infants and adults. monoclonal antibodies (mabs) can be a promising diagnostic tool for antigenic diseases such as virus infection, as they offer a high specificity toward a specific viral antigen. however, to date, there is no specific mab available for the diagnosis of hpev infection. in this study, we developed and characterized mabs specific for hpev capsid protein vp . we used cell-free, wheat germ-synthesized viral vp protein for immunizing balb/c mice to generate hybridomas. from the resultant hybridoma clones, we selected nine clones producing mabs reactive to the hpev -vp antigen, based on enzyme-linked immunosorbent assay (elisa). epitope mapping showed that these mabs recognized three distinct domains in hpev vp . six mabs recognized hpev specifically and the other three mabs showed cross-reactivity with other hpevs. using the hpev -specific mabs, we then developed an elisa for viral antigen detection that could be reliably used for laboratory diagnosis of hpev . this elisa system exhibited no cross-reactivity with other related viruses. our newly developed mabs would, thus, provide a useful set of tools for future research and ensure hpev -specific diagnosis. human parechoviruses (hpevs) belong to the parechovirus genus of the picornaviridae family [ , ] . the two serotypes (hpev and hpev ) of parechoviruses were initially isolated in from children with diarrhea, and were assigned to the enterovirus genus. however, it became evident that they were genetically distinct from the enteroviruses, and reclassified as the parechovirus genus in . at present, hpevs are reported and categorized, based on the nucleic acid sequences of the vp gene, not on the classification as enteroviruses serotype. hpev was first identified in japan. hpev was isolated from a stool sample provided by a -year-old infant who was experiencing fever, gastritis like symptoms, and transient lower extremity paralysis [ ] . hpev types to are the common identified strains; among them, hpev type , , and account for the majority of infectious strains worldwide [ ] . infection with hpevs is associated with a broad spectrum of clinical manifestations, ranging from respiratory symptoms and mild gastrointestinal illness to sepsis-like diseases, meningitis, and encephalitis in children [ ] . while most hpevs cause mild symptoms in children between and years of age, human parechovirus (hpev ) is clinically the most important genotype, owing to its association to severe diseases in younger infants under months of age [ ] [ ] [ ] . hpev infection in infants can trigger a sepsis-like dysregulated host response involving the central nervous system [ ] [ ] [ ] [ ] . in cases of acute meningitis or encephalitis, patients might develop abnormal white matter lesions and neurological sequelae, and even death might occur [ ] [ ] [ ] [ ] [ ] . apnea can occur in children regardless of encephalitis [ ] . hpev is known to cause myalgia and myositis in adult patients and a similar pattern is also sporadically seen among pediatric patients. [ , ] . an epidemic of hpev occurs every to years in japan. as respiratory disease or meningitis cases due to hpev are not subject to notifiable disease surveillance in japan, the actual number of the patients is not known [ , , [ ] [ ] [ ] [ ] . the appropriate diagnosis tool for hpev detection might be able to rule out infectious etiology and avoid unnecessary antibiotics use, which is a given because hpev leads to septic shock-like symptoms. for these reasons, the establishment of a method to detect hpev plays a vital role in healthcare fields. importantly, hpev s epidemic cycle occurs in summer time and is concurrent with enteroviral infection. thus, it is essential to develop a detection method that does not cross-react with enterovirus. hpev contains a small, non-enveloped, single-stranded positive-sense rna genome of approximately . -kb nucleotides [ , ] . the hpev virion is composed of copies of three structural proteins (vp , vp , and vp ) that fit together to form a -nm-diameter icosahedral shell around the viral genome [ ] . the genome encodes a single polyprotein that, during infection, is subsequently cleaved into all essential capsid components and non-structural proteins [ ] . vp is an important protein for stabilizing the surface of the viral capsid, and the assembly of hpev is controlled by multiple interactions of the genome with the capsid, through conserved amino acids in vp and vp [ ] . although rt-pcr-based diagnostic tests targeting -utr of the hpev genome were developed for hpev detection in clinical samples, there is currently no diagnostic method for detecting the viral antigens. recently, chen et al. generated polyclonal antibodies for hpev vp , and proposed an immunofluorescence-based diagnostic assay [ ] . however, this method requires virus isolation by cell culture and takes several weeks for the identification of viral genotype/serotype. abed et al. developed a serological enzyme-linked immunosorbent assay (elisa), using a synthetic peptide from the vp protein of hpevs [ ] . although it can provide a definitive diagnosis, serological test requires paired serum samples from acute and recovery phases, which makes it difficult to diagnose immediately as a point of care testing (poct). to develop a rapid and effective diagnostic strategy, there is an urgent need to produce highly specific monoclonal antibodies (mabs) toward hpev antigens. in this study, we sought to generate mabs specific to the capsid protein vp of hpev . we prepared the viral vp antigen using the wheat germ cell-free system, which has the advantage of producing properly folded functional proteins [ , ] , to immunize mice. as a result, we obtained nine mab clones for characterization, and thereafter, generated an elisa system that is specifically able to detect the hpev vp antigens. complementary dna encoding hpev -vp (genbank no. ab ) was used to generate the expression vector for antigen production with the wheat germ cell-free system. the hpev -vp open reading frame was amplified by pcr, using the corresponding primer pairs. the amplified fragment was cloned into vector peu-e -his-tev-mcs-n (cellfree sciences, yokohama, japan), using restriction enzymes xhoi and spei. in vitro wheat germ cell-free protein synthesis was carried out as previously described [ , ] . for cell-free protein synthesis, wepro h wheat extract (cellfree sciences, yokohama, japan) was used in the bilayer translation reaction, as previously described. synthesized proteins were confirmed by immunoblotting. the his-hpev -vp (full length, - ) protein was synthesized using a proteomist xe robotic protein synthesizer (cellfree sciences, yokohama, japan) for mouse immunization. the cell-free translation reaction mixture was separated into soluble and insoluble fractions by centrifugation at , × g for min. the soluble fraction was mixed with ni-sepharose high performance beads (ge healthcare, waukesha, wi, usa) in the presence of mm imidazole. the beads were washed thrice with a washing buffer [ mm tris-hcl (ph . ), mm nacl] containing mm imidazole. his-hpev -vp was then eluted in another washing buffer containing mm imidazole. amicon ultra centrifugal filters (millipore, bedford, ma, usa) were used to concentrate the purified his-hpev -vp . protein concentration was determined using the bradford method, with bovine serum albumin (bsa) as a protein standard. immunization of balb/c mice and generation of anti-hpev -vp mab-producing hybridomas were carried out as previously described [ , ] . briefly, his-tagged full-length hpev -vp protein was injected into the footpad of the balb/c mice, using keyhole limpet hemocyanin as an adjuvant. four weeks later, the spleen cells were isolated and fused to the myeloma cell line, sp /o, using polyethylene glycol (peg ). monoclonal antibodies in the hybridoma culture supernatant were tested using elisa with his-tagged recombinant hpev -vp protein. isotype determination was performed using isostrip mouse monoclonal antibody isotyping kit, following the manufacturer's instructions (roche diagnostics, basel, switzerland). vero cells were grown in dmem containing % fbs. hpev was provided by dr. masaki takahashi (iwate prefectural institute of public health). hpev was propagated in vero cells and quantified by qrt-pcr. the sequence information was as follows: -gtaacaswwgcctctgggs ccaaaag- (forward primer), -ggccccwgrtcagatccayagt- (reverse primer), and -vic-cctrygggtacctycwgggcatccttc-bhq- (probe). wheat germ-synthesized recombinant his-tagged hpev -vp proteins were separated by % sds-page in running buffer ( mm glycine, mm tris, . % sds). the separated proteins were transferred to a pvdf membrane (millipore). the membranes were washed with blotting buffer tbst (tbs containing . %-tween ), and then blocked for h at room temperature in % non-fat powdered milk in tbst. thereafter, the membranes were incubated overnight with generated hybridoma supernatant ( : dilution in tbst) at • c. next, after washing thrice in tbst, the membranes were incubated for h at room temperature with anti-mouse igg-hrp secondary antibody ( : dilution in tbst). finally, after washing thrice in tbst, the target protein was detected with the immobilon western chemiluminescence detection system (ge healthcare) using fluor chem fc (alpha innotech corp. tokyo, japan). for epitope mapping, we prepared deletion mutants of hpev -vp using pcr mutagenesis with template vector peu-his-hpev -vp , followed by wheat germ cell-free protein synthesis. the proteins were analyzed by immunoblotting using our generated mabs. to examine the amino acid variability among the vp proteins of other hpev genotypes, vp sequences for hpev - , , and - were accessed from the genbank and aligned with the hpev -vp sequence using the mega software. k d values were determined by bio-layer interferometry (bli) using octet red (fortebio, usa). anti-mouse igg capture biosensor tips (amc, fortebio) were loaded with µg/ml of mab # and # for min, in pbs containing . % bsa and . % tween . the association of recombinant vp at concentrations of , , , and nm for # mab, and , , . , and . nm for # mab, was measured for min, followed by a -min-long dissociation phase. all measurements were corrected for baseline drift by subtracting a reference well. the operating temperature was maintained at • c. data were analyzed using a : binding model with global fitting algorithms in the fortebio data analysis software. each mab was diluted in mm of carbonate buffer (ph . ) to a concentration of µg/ml, and then added to an elisa plate (agc techno glass, shizuoka, japan). to immobilize the antibodies, the plate was incubated overnight at • c. wells were blocked with pbs containing % (w/v) skim milk for h at room temperature (rt). after three washes with pbs containing . % (v/v) tween- (pbs-t), µl of antigen protein ( ng/ml) diluted with pbs-t or blank (pbs-t alone) was added, and the mixture was incubated for min at rt. after three washes with pbs-t, µl of each mab, conjugated with horseradish peroxidase (hrp), was added into each well and incubated for min at rt. antibody labeling was performed using the peroxidase labeling kit-nh (dojindo laboratories, kumamoto, japan). after three washes with pbs-t, µl of abts substrate solution (kirkegaard and perry laboratories, washington, dc, usa) was added and the mixture was incubated for min at rt. absorbance at / nm was measured on glomax discover system (promega), and the signal-to-noise ratio (s/n) was calculated. for generation of mabs, we produced n-terminal his-tagged full-length vp protein of hpev as an antigen. as a result, hpev -vp protein was produced with high aqueous solubility (figure a) . the protein was subsequently purified using ni-sepharose beads, followed by elution with imidazole. balb/c mice were then immunized with the purified protein. after weeks of immunization, the mice splenocytes were isolated, fused with myeloma cells, and hybridomas were produced. as a result, stable hybridomas were generated and designated as # to # . among these clones, nine were selected (# , # , # , # , # , # , # , # , and # ), based on the reactivity in elisa to the target antigens vp proteins derived from hpev and hpev (figure b) . isotype analysis revealed that # mab belongs to igg , kappa isotype, # and # mabs belong to igg a, kappa isotype, while the others belongs to igg b, kappa isotype ( figure c) . to demonstrate the antibody-binding sites within the antigen, we next performed epitope mapping. for epitope mapping, we produced five deletion mutants of vp and performed immunoblotting analysis. we found that our newly developed mabs recognized three distinct domains in hpev vp : # and # mabs bind to - amino acid (aa), # , # , and # mabs bind to - aa, and remaining four mabs (# , # , # , and # ) bind to - aa within c-terminal region of the vp protein of hpev (figure a) . we further created deletion mutants for a more precise epitope determination for these mabs, and found that # and # mabs bind to - aa, # , # , and # mabs bind to - aa, and # , # , # , and # mabs bind to - aa (figure b) . we next examined whether the antigenic epitopes were located on the surface of hpev -vp . the ucsf chimera software revealed that, except for # and # , the binding regions of all mabs were located on the molecular surface of vp protein (figure c) . the binding region of mabs # and # was relatively conserved among the analyzed hpevs (figure d ). to demonstrate the antibody-binding sites within the antigen, we next performed epitope mapping. for epitope mapping, we produced five deletion mutants of vp and performed immunoblotting analysis. we found that our newly developed mabs recognized three distinct domains in hpev vp : # and # mabs bind to - amino acid (aa), # , # , and # mabs bind to - aa, and remaining four mabs (# , # , # , and # ) bind to - aa within c-terminal region of the vp protein of hpev (figure a) . we further created deletion mutants for a more precise epitope determination for these mabs, and found that # and # mabs bind to - aa, # , # , and # mabs bind to - aa, and # , # , # , and # mabs bind to - aa (figure b ). we next examined whether the antigenic epitopes were located on the surface of hpev -vp . the ucsf chimera software revealed that, except for # and # , the binding regions of all mabs were located on the molecular surface of vp protein (figure c) . the binding region of mabs # and # was relatively conserved among the analyzed hpevs (figure d ). we next investigated the specificity of our newly developed mabs. for this, we created vp proteins encoded by the six different hpev genotypes and vp -vp proteins derived from enteroviruses and d . immunoblotting analysis showed that six mabs (# , # , # , # , # , and # ) react specifically with hpev vp , while three mabs (# , # , and # ) showed some crossreactivity to other hpevs (figure ) . no mabs showed cross-reactivity to enterovirus vp -vp proteins. we next investigated the specificity of our newly developed mabs. for this, we created vp proteins encoded by the six different hpev genotypes and vp -vp proteins derived from enteroviruses and d . immunoblotting analysis showed that six mabs (# , # , # , # , # , and # ) react specifically with hpev vp , while three mabs (# , # , and # ) showed some cross-reactivity to other hpevs (figure ) . no mabs showed cross-reactivity to enterovirus vp -vp proteins. sandwich elisa systems with highly specific matched antibody pairs are commonly used to detect and quantify viral antigens in immunoassay. hence, we next determined the optimal pair of mabs for antigen-capture using elisa, by evaluating all possible combinations of immobilized and labeled mabs. among the possible pair combinations, only the # and # mab pair represented a combination of antibodies specific for hpev -vp (figure a ). therefore, we selected this combination for further analysis. sandwich elisa systems with highly specific matched antibody pairs are commonly used to detect and quantify viral antigens in immunoassay. hence, we next determined the optimal pair of mabs for antigen-capture using elisa, by evaluating all possible combinations of immobilized and labeled mabs. among the possible pair combinations, only the # and # mab pair represented a combination of antibodies specific for hpev -vp (figure a ). therefore, we selected this combination for further analysis. to characterize the equilibrium dissociation constant (k d ) of the selected antibodies and the target vp antigen, we used the bli octet assay system. k d for antibody-antigen binding in mabs # and # was calculated as < × − and . × − , respectively, suggesting that both mabs showed high binding affinity to the hpev -vp (figure b) . we next performed antigen-capture elisa with recombinant vp protein and virions released into the cell-culture supernatant of the hpev -infected cells. using the optimal antibody pair (# and # mabs) identified above, we determined the detection threshold for antigen recognition by antigen-capture elisa. our results revealed that our system was highly sensitive to the recombinant antigen, capable of detecting the protein at a concentration of ng/ml (figure c, left) . in parallel, we investigated the detection limit of elisa for the hpev virion. this elisa system could detect heat-treated hpev , but not non-heated virions, and its detection limit of the system was × copies/ml (figure c, right) . we also found that this elisa system exhibited no cross-reactivity with enteroviruses ( figure d ) to characterize the equilibrium dissociation constant (kd) of the selected antibodies and the target vp antigen, we used the bli octet assay system. kd for antibody-antigen binding in mabs # and # was calculated as < × − and . × − , respectively, suggesting that both mabs showed high binding affinity to the hpev -vp (figure b) . we next performed antigen-capture elisa with recombinant vp protein and virions released into the cell-culture supernatant of the hpev -infected cells. using the optimal antibody pair (# and # mabs) identified above, we determined the detection threshold for antigen recognition by antigen-capture elisa. our results revealed that our system was highly sensitive to the recombinant antigen, capable of detecting the protein at a concentration of ng/ml (figure c, left) . in parallel, we investigated the detection limit of elisa for the hpev virion. this elisa system could detect heattreated hpev , but not non-heated virions, and its detection limit of the system was × copies/ml hpev is increasingly being highlighted as a potentially severe viral infection in neonates and young infants. therefore, there is an urgent need to develop assays for early diagnosis of hpev infection for reducing inappropriate antimicrobial use, unnecessary investigations, and prolonged hospitalization. it is also likely to lead to follow-up for potential complications in infants who are severely affected [ ] . in this context, a real-time pcr-based molecular test to detect virus from patients was recently developed [ ] . however, pcr tests are extremely sensitive and need extensive controls, whereas antigen detection by mabs has the advantage of the relative ease of sample handling and the use of less stringent procedures. specific mabs could then be used to develop a rapid test such as elisa. in this study, we sought to generate specific mabs and develop an elisa test for the detection of hpev vp antigen. hpev genome encodes for three structural proteins, namely vp , vp , and vp , which assemble to create the virus particle [ ] . among these, vp was identified as an antigenic determinant and it might be relatively more useful for diagnostic purposes, due to a higher level of sequence conservation [ ] . furthermore, it possesses high immunogenicity [ ] . therefore, the selection of vp as an antigen is both practical and reasonable. in addition, the mab quality was determined mostly by preparation of high-quality antigen. here, we used a wheat germ, cell-free protein production system for synthesizing recombinant vp proteins, as this system produces properly folded, soluble, and biologically active native proteins similar to those expressed in mammalian cells [ , , ] . in our current study, we newly produced nine different mabs that recognized hpev -vp as an antigen. we then performed epitope mapping of our generated mabs, as identification of the epitope is a key step in the characterization of monoclonal antibodies [ ] . based on the epitope analysis, the mabs were able to recognize three different areas of hpev -vp and specify - aa length epitopes within the hpev -vp . interestingly, two mab clones (# and # ) exhibiting cross-reactivity to vp proteins of other hpevs, bound a distinct epitope ( - aa), which partially overlapped with the recognition site for the polyclonal antibody ( - aa) created by chen et al. [ ] . nevertheless, we obtained mabs specific to hpev , which recognized sites - aa and the - aa site of vp , which was not reported earlier. moreover, we also developed an elisa for detecting hpev antigen using two of our newly generated hpev -specific mabs (# and # ), as mab-based elisa is highly specific and sensitive towards viral antigen detection [ ] . our octet assay suggested that both mabs show a high binding affinity to the full-length hpev -vp recombinant protein. vp protein can be folded into the correct native structure and is likely to form the capsid-like structure via oligomerization. in this situation, # mab could bind various sites on polymerized vp proteins, as a result of an allosteric effect or "avidity", owing to which the k d value of # mab was calculated to be much lower than that of # mab. our newly developed elisa system requires heat treatment of hpev to detect vp antigen. based on previous studies [ ] , the binding area of mab # was estimated to localize to the interface between vp and vp . on the other hand, three-dimensional modeling of viral particles revealed that the antigen-recognizing sites of mabs # and # in vp protein are proximally located. moreover, a previous study showed that glu (in the epitope region of # ) and ser (in the vp protein of hpev ) bind together by a hydrogen bond [ ] . thus, a possible explanation for our observation is that heat treatment helps antigen retrieval for the interface between vp and vp , resulting in efficient access of mabs to the epitopes. according to a previous study [ ] , an increase in antibody-binding capacity was exhibited when glycosylation of capsid protein was removed. based on this finding, we carried out pretreatment by resecting the connection between o-linked and n-linked glycosylation of hpev . however, there was no obvious effect on antibody recognition in our sandwich elisa assay (data not shown), indicating that glycosylation might not affect the antigenicity for mab recognition. the detection limits of our newly developed sandwich elisa were ng/ml for recombinant vp protein and × copies/ml for viral particles, respectively. assuming that one viral particle contains a single copy of a viral genome, there are copies of vp protein per viral particle. for a viral particle detection sensitivity of × copies/ml, the vp protein detection sensitivity is presumed to be ng/ml. therefore, we conclude that the detection sensitivity for the recombinant vp protein and the virus particles was almost equivalent, suggesting the feasibility of our method for use in actual clinical settings. other than the elisa method, several antigen-detecting tests using an antigen-antibody interaction are now available. for instance, the multi-array technology using electrochemiluminescence immunoassay (eclia) and chemiluminescent enzyme immunoassay (cleia) can provide several hundred times more sensitivities than the conventional elisa method [ ] . another example is an influenza-testing kit using a highly sensitive immunochromatographic detection method based on silver amplification [ ] . the limitation of sensitivity might be overcome by utilizing these sophisticated technologies combined with our monoclonal antibodies for hpev vp antigen. furthermore, we can potentially improve the detection sensitivity by altering the second antibody to recognize the poly-hrp complex [ ] or by adding other antibodies, which can target vp or vp proteins. in summary, we utilized the wheat germ cell-free protein production system to synthesize the hpev vp protein and produced mabs that could specifically detect hpev but not other hpevs. we further explored the feasibility of these mabs in terms of their utility in various immunological applications. to the best of our knowledge, the hpev vp antibody as well as the elisa-based viral detection system reported here is the first of its kind ever reported. with the implementation of more sophisticated applications, our newly developed mabs could be useful for further development of diagnostic methods for hpev infection. human parechovirus: an increasingly recognized cause of sepsis-like illness in young infants detection and characterization of a novel human parechovirus genotype in thailand isolation and identification of a novel human parechovirus identification and molecular characterization of the first complete genome sequence of human parechovirus type enterovirus and parechovirus infection in children: a brief overview sepsis-like disease in infants due to human parechovirus type during an outbreak in australia strain-dependent neutralization reveals antigenic variation of human parechovirus human parechovirus causing sepsis-like illness in children from midwestern united states human parechovirus and neonatal infections human parechovirus- infection: emerging pathogen in neonatal sepsis prevalence and characteristics of human parechovirus and enterovirus infection in febrile infants comparison of diagnostic clinical samples and environmental sampling for enterovirus and parechovirus surveillance in scotland infant deaths associated with human parechovirus infection in wisconsin human parechovirus causes encephalitis with white matter injury in neonates lyall, h. characteristics and outcomes of human parechovirus infection in infants human parechovirus type infection: an emerging infection in neonates and young infants epidemic myalgia and myositis associated with human parechovirus type infections occur not only in adults but also in children: findings in yamagata epidemic myalgia associated with human parechovirus type infection the human parechoviruses: an overview epidemic myalgia associated with human parechovirus type infection among adults occurs during an outbreak among children: findings from yamagata severe epidemic myalgia with an elevated level of serum interleukin- caused by human parechovirus type : a case report and brief review of the literature a . -angstrom-resolution cryo-electron microscopy structure of human parechovirus in complex with fab from a neutralizing antibody multiple capsid-stabilizing interactions revealed in a high-resolution structure of an emerging picornavirus causing neonatal sepsis genomic rna folding mediates assembly of human parechovirus parechovirus a detection by a comprehensive approach in a development of a serological assay based on a synthetic peptide selected from the vp capsid protein for detection of human parechoviruses development of monoclonal antibody and diagnostic test for middle east respiratory syndrome coronavirus using cell-free synthesized nucleocapsid antigen production and characterization of monoclonal antibodies specific for major capsid vp protein of trichodysplasia spinulosa-associated polyomavirus clinical relevance of positive human parechovirus type and pcr in stool samples strategies to improve detection and management of human parechovirus infection in young infants wheat germ cell-free system-based production of hemagglutinin-neuraminidase glycoprotein of human parainfluenza virus type for generation and characterization of monoclonal antibody monoclonal antibody-based capture elisa in the diagnosis of previous dengue infection identification of two n-linked glycosylation sites within the core of the simian immunodeficiency virus glycoprotein whose removal enhances sensitivity to soluble cd measurement of sirolimus concentrations in human blood using an automated electrochemiluminescence immunoassay (eclia): a multicenter evaluation development of a highly sensitive immunochromatographic detection kit for h influenza virus hemagglutinin using silver amplification improving the sensitivity of traditional western blotting via streptavidin containing poly-horseradish peroxidase (polyhrp) we thank naohito nozaki and satoko matsunaga for antibody production and technical assistance, and masaki takahashi (iwate prefectural institute of public health) for providing regents.conflicts of interest: y.y. is a current employee of kanto chemical co., inc. the remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest. key: cord- - d m n authors: nan title: biomedical vignette date: - - journal: j biomed sci doi: . /s - - - sha: doc_id: cord_uid: d m n nan connective tissue growth factor (ctgf) was initially discovered in as a secreted protein in the conditioned media of cultured human umbilical vascular endothelial cells [ ] . ctgf is a member of the cnn family of secreted, matrix-associated proteins encoded by immediate early genes that play various roles in angiogenesis and tumor growth [ ] . although papers and reviews on ctgf have been published, this review [ ] focuses on the functional role of ctgf in cancer progression. the influence of ctgf expression on the behavior and progression of various cancer cells, as well as its regulation on various types of protein signals and their mechanisms are highlighted. although ctgf expression seems to be associated with progression of many kinds of cancers, its expression may have tumor suppressive effects in a few cases such as lung adenocarcinoma cells, colorectal cancer cells and oral squamous carcinoma cells. cgcgh: a tool for molecular karyotyping using dna microarray-based comparative genomic hybridization (array-cgh) to analyze the rare events of single-copy dna aberrations, a matlab-based, array cgh analyzing program, chang gung comparative genomic hybridization (cgcgh) was employed to survey chromosomal amplifications and deletions in fetal aneuploidies or cancer tissues [ ] . the analyzed chromosomal data are displayed in a graphic interface, and cgcgh allows users to launch a corresponding g-banding ideogram. in karyotyped samples, the cgcgh program outperformed other programs and cgcgh supported the data generated from cdna microarrays, spotted oligonucleotide microarray and affymetrix human mapping arrays. a computational screen for c/d box snornas in the human genomic region associated with prader-willi and angelman syndromes to identify snornas (small nuclear rnas) in the deletion of human chromosomal region q -q related to prader-willi syndrome (pws) and angelman syndrome (as), computational scanning and screening and a novel hybridization energy test were used to identify all the snornas. three previously unknown methylation snor-nas targeting ribosomal s and s rnas and two snornas targeting serotonin receptor cmrna were identified [ ] . the application of the present method to the pws/as region of human genome identified snornas, three of which pass the hybridization test. these snornas require experimental confirmation. present hybridization test can be incorporated and automated with motif scanning for genome-wide studies. interactions between m protein and other structural proteins of severe acute respiratory syndrome coronavirus the study by hsieh et al. [ ] in the current issue focuses on the protein-protein interactions that regulate sars coronavirus assembly. this was performed by co-localization studies in cells that express structural viral proteins either individually or combined. changes in localization induced by the presence of other virus components indicate direct protein-protein interactions. these experiments indicate that the sars m protein plays a pivotal role in virus assembly, similar to findings with other coronaviruses (vennema et al., reference in the paper). a model is presented to explain how the m protein interacts with multiple other structural proteins (s, e and nc) to facilitate virion assembly. severe acute respiratory syndrome coronavirus nucleocapsid protein confers ability to efficiently produce virus-like particles when substituted for the human immunodeficiency virus nucleocapsid domain the assembly of virus particles is a complex, but illunderstood process. some viruses like the human immunodeficiency virus type (hiv- ) require only a single protein for the assembly of virus-like particles (vlps) (gheysen, reference in the paper). the situation is more complex for the sars coronavirus, which requires at least two structural proteins for vlp formation. the current study by wang et al. [ ] asked whether the sars nucleocapsid (n) protein and fragments thereof can functionally replace the nucleocapsid domain of the self-assembling hiv- gag protein. using this novel chimeric vlp approach, the authors were able to map more precisely the sars n domain that facilitates self-association and vlp formation. host factors, such as iron concentration at the site of infection, may influence the virulence of pseudomonas aeruginosa and hence the outcome of infection. here, mittal et al. [ ] reported that the depletion of iron in the growth medium led to increased adherence of p. aeruginosa to uroepithelial cells and decreased phagocytosis of the bacteria. furthermore, p. aeruginosa growth in irondepleted medium showed increased renal bacterial load and tissue pathology in the challenged mice. the current study may provide insight into the pathogenesis of p. aeruginosa and facilitate the development of a preventive approach against p. aeruginosa-induced urinary tract infections. in vertebrate retinas, glycinergic synapses regulate glutamate transmissions in both synaptic layers, the inner plexiform layer (ipl) and outer plexiform layer (opl). the function of glycinergic synapses in the inner retina has been known to inhibit glutamate transmissions, shaping light-evoked response in ganglion cells [ , ] . however, little is known about the function of glycinergic synapses in the outer retina. in this issue, shen et al. [ ] reported that glycine depolarized rods and activated voltage-gated ca + channels in the neurons, resulting in facilitation of glutamate release in photoreceptors and increase of the spontaneous excitatory postsynaptic currents in off-bipolar cells. furthermore, these authors reported that inhibition of a cluptake transporter nkcc effectively eliminated glycine-evoked depolarization in rods suggesting that nkcc maintains a high cllevel in rods, which is responsible for glycine-induced depolarization. this finding is quite significant since it reveals a new function of glycine in retinal synaptic transmission. dose-finding study with nicotine as an anti-antiseizure agent in ptz-induced seizure model in mice nicotine exerts agonistic effect on neuronal nicotinic acetylcholine receptors and is reported to enhance release of excitatory neurotransmitter glutamate into the synapses [ ] . pentylenetetrazole (ptz), on the other hand, produces seizures in small rodents [ ] and is believed to act by antagonizing the inhibitory gabaergic [ ] neurotransmission in central nervous system. in this issue, medhi et al. [ ] reported that subthreshold dose of nicotine pretreatment significantly decreased the cd value for ptz. sodium valproate but not topiramate, significantly inhibited ptz-induced seizure. nonconvulsive dose of nicotine significantly antagonized the protective efficacy of sodium valproate against ptz-induced seizures. these findings are quite significant since they bear clinical relevance particularly amongst epileptic smokers who may show failure of efficacy of antiseizure agents and present with breakthrough seizure attacks due to nicotine. benzyl alcohol inhibits n-methyl-d-aspartate receptor-mediated neurotoxicity and calcium accumulation in cultured rat cortical neurons benzyl alcohol has been used as a preservative in some small multiple-dose vials of bacteriostatic sodium chloride or water for injection. however, there is concern that excipients such as benzyl alcohol may cause adverse reactions to neurons in premature infants [ , ] . in this issue, takadera and ohyashiki [ ] reported that benzyl alcohol inhibited nmda-induced cytotoxicity. furthermore, these authors showed that the protective effect of benzyl alcohol on nmda-induced toxicity is due to its effect in reducing nmda receptor-mediated calcium accumulation, indicating that benzyl alcohol inhibits nmda receptor activity. this finding is significant since it shows potential beneficial effect of benzyl alcohol on mature neurons against glutamate-induced neurotoxicity although it may have adverse effect on immature neurons. bone morphogenetic proteins (bmps) are the most potent class of all osteoinductive proteins, and bmp- has already been clinically applied to accelerate bone regeneration in both fracture and spinal fusion [ ] . bone narrow-derived mesenchymal stem cells (bmmscs) have been shown to be able to differentiate in vitro toward osteogenic lineages, when treated with established lineage-specific factor [ ] . kim et al. [ ] studied if a combination of the undifferentiated bmmscs and bmp- delivered via heparin-conjugated plga nanoparticles (hcpn) would extensively regenerate bone in vivo. in the in vivo testing, the undifferentiated bmmscs with bmp- with bmp- loaded hcpn induced far more extensive formation, indicating the feasibility of entensive in vivo bone regeneration by transplantation of undifferentiated bmmscs and bmp- delivery via hcpn. previous studies indicated that t-box genes play an essential role in cell specification and morphogenesis [ , ] . in vertebrates, tbx plays a critical role in cardiac and upper limb development. expression of tbx at the appropriate dose, time and position is important for normal cardiac development. zebrafish appears to be a well-established model used in studying the development of vertebrates. the zebrafish is an ideal model and has been receiving attention as a human disease model, because the species fertilizes embryos externally, the fetus develops rapidly, and it manages to survive without cardiac function for days. thus, zebrafish was used as the model organism to facilitate our investigation on the causal relationship between tbx and cardiac myogenesis during cardiac looping. results demonstrated that in zebrafish, injection of tbx morpholino antisense rna caused changes of heart conformation, defect of heart looping, pericardium effusion, dropsy of ventral position and decreased heart rate. in conclusion, this study showed deficiency of tbx might perturb cardiac looping progress, as well as the formation of atrium and ventricle, possibly through down-regulating cardiacmyogenesis genes such as amhc, vmhc and cmlc [ ] . the discovery of the naturally occurring cardiac nonfunction (c) animal strain in ambystoma mexicanum (axolotl) provides a valuable animal model to study cardiomyocyte differentiation. it was shown that this recessive mutation, in homozygous animals, causes incomplete differentiation of the embryonic heart at heartbeat initiation stages [ ] due to a lack of organized myofibril formation in homozygous mutant hearts [ , ] . in this issue, jia et al. [ ] reported cloning of a peptide cdna (n ) from an anterior-endoderm-conditioned-medium rna library. furthermore, the authors have shown a dramatic decrease of expression of n mrna in mutant (c/c) embryos, indicating that the n gene is involved in heart development. these findings are quite significant since revealing the underlying molecular mechanisms of heart development will be an important step in finding cures for heart diseases. immunohistochemical assessment of cyclic guanosine monophosphate (cgmp) and soluble guanylate cyclase (sgc) within the rostral ventrolateral medulla the rostral ventrolateral medulla (rvlm) in the caudal medulla oblongata is a major site for generation of neurogenic vasomotor tone. nitric oxide (no) in the rvlm plays an important role in central cardiovascular regulation by regulating the sympathetic vasomotor outflow [ , ] , and cgmp-dependent no signaling in the rvlm is impaired in hypertension [ ] . to date, no studies have described the immunohistochemical localization of neurons capable of expressing cgmp within the rvlm. in this communication, powers-martin et al. [ ] sought to identify the cellular targets for no in the rvlm by visualizing anatomical relationship of cgmp with the tyrosine hydroxylase (th) or phenylethanolamine n-methyltransferase (pnmt) cell group. double label immunohistochemistry for cgmp-immunoreactivity (ir) and th or pnmt neurons failed to reveal cgmp-ir neurons in the rvlm of either normotensive wistar-kyoto rats or the spontaneously hypertensive rats. in addition, soluble guanylate cyclase (sgc)-ir was found throughout neurons of the rvlm, but did not co-localize with pnmt-or th-ir neurons. these results indicate that within the rvlm, cgmp is not detectable using immunohistochemistry in the basal state and this raises the hypothesis that functional network inputs, such as the sympathetic baroreflex pathway are required to drive a sgc/cgmp cascade in the rvlm. in vivo gm-csf promoter-based assay for drug screening in drug discovery research, in vitro cell-based screening systems are well established as methods for evaluation of candidate lead compounds. for example, in vitro assays of nf-jb [ ] and cox- [ ] , two examples of drug targets, are employed to develop therapeutic strategies to counter inflammation. since the regulation of immuno-modifiers is highly dependent on three-dimensional microenvironments, an in vivo assay can more accurately evaluate the effects of drugs on the expression of key cytokines. su et al. [ ] devised an in vivo, transgenic, human cytokine (e.g. gm-csf) gene promoter assay using defined epidermal skin cells as test tissue. test compounds were topically applied to mouse skin before or after gene gun transfection, using a cytokine gene promoter-driven luciferase reporter. croton oil, an inflammation inducer, induced six-folds transgenic gm-csf promoter activity in skin epidermis, and this effect was drastically inhibited by the phytocompound shikonin. these results demonstrate that this in vivo transgenic promoter assay system is cytokine gene-specific, and highly responsive to pro-inflammatory stimuli. arecoline and the - kda fraction of areca nut extract differentially regulate mtor and respectively induce apoptosis and autophagy-a pilot study in taiwan, chewing of betel quit causes oral cancer, which is the top sixth killer of all types of cancer. betel quit consists of areca nut (an), lime and influorescene of piper betel. it was demonstrated previously that extracts of an (ane), but not those of lime, inflorescence of piper betel, induced rounding of cell morphology and nuclear shrinkage in several carcinoma cell lines. in the present study [ ] , the mw of active principle was found to be between - kda, which induced nuclear shrinkage, clearance of the cytoplasm, cleavage of lc - q and appearance of autophagic vacuoles and acid vesicles. on the other hand, arecoline (are) triggered caspase- activation, peri-nuclear condensation and micronucleation. furthermore, ane - k but not are, inhibited the phosphorylation of rapamycin (m-tor)-ser . it indicates that different constituents of an induces apoptosis or autophage of an oral cancer cell line. to the present, utilization of high-resolution gas chromatography with high-resolution mass spectrometry (hrgc/ ms) is recognized as the most efficient way for determining dioxin compounds [ ] . however, the procedures of this detection system are not easy to carry out. the fluorescence resonance energy transfer (fret) technique can precisely evaluate interaction between two molecules in cells, and the use of cyan-fluorescent protein (cfp) as an energy donor and yellow-fluorescent protein (yfp) as an energy acceptor has been reported as the most suitable combination of fret signal detection [ ] . lin et al. [ ] therefore established a fret-based dioxin-detection assay. aryl hydrocarbon receptor (ahr) and ahr nuclear translocate (arnt) fused-cyan fluorescent protein cfp) and-yellow fluorescent protein (yfp) constructs were transiently cotransfected into rat hepatoma cell line h iiec cells. no fret signals were detected in ahr-cfp-and arnt-yfptransfected cells. however, dioxin treatment upregulated fret signals in these transfected cells in a dose-dependent manner, indicating the potential of fret technique in the detection of dioxin-like compounds. connective tissue growth factor: a cysteine-rich mitogen secreted by human vascular endothelial cells is related to the src-induced immediate early gene product cef- nov (nephroblastoma overexpressed) and the ccn family of genes: structural and functional issues connective tissue growth factor (ctgf) and cancer progression cgcgh: a tool for molecular karyotyping using dna microarray-based comparative genomic hybridization (array-cgh) a computational screen for c/d box snornas in the human genomic region associated with prader-willi and angelman syndromes interactions between m protein and other structural proteins of severe, acute respiratory syndrome-associated coronavirus severe acute respiratory syndrome coronavirus nucleocapsid protein confers ability to efficiently produce virus-like particles when substituted for the human immunodeficiency virus nucleocapsid domain iron dictates the virulence of pseudomonas aeruginosa in urinary tract infections synaptic mechanisms that shape visual signaling at the inner retina glycinergic synaptic inputs to bipolar cells in the salamander retina glycine input induces the synaptic facilitation in salamander rod photoreceptors nicotine enhancement of fast excitatory synaptic transmission in cns by presynaptic receptors laboratory evaluation of antiepileptic drugs. review of laboratory methods the gaba postsynaptic membrane receptorionophore complex dose-finding study with nicotine as a proconvulsant agent in ptz-induced seizure model in mice fatal benzyl alcohol poisoning in a neonatal intensive care unit the gasping syndrome and benzyl alcohol poisoning benzyl alcohol inhibits n-methyl-d-aspartate receptor-mediated neurotoxicity and calcium accumulation in cultured rat cortical neurons bmp- evaluation in surgery for tibial trauma-allgraft (bestt-all) study group ( ) recombinant human bmp- and allograft compared with autogenous bone graft for reconstruction of diaphyseal tibial fractures with cortical defects. a randomized, controlled trial epidermal growth factor as a candidate for ex vivo expansion of bone marrow-derived mesenchymal stem cells enhancement of ectopic bone formation by bone morphogenetic protein- delivery using heparin-conjugated plga nanoparticles with transplantation of bone marrow-derived mesenchymal stem cells t-box genes: what they do and how they do it t-targets: clues to understanding the functions of t-box proteins cascade effect of cardiac myogenesis gene expression during cardiac looping in tbx knockdown zebrafish embryos genetic and experimental studies on a mutant gene (c) determining absence of heart action in embryos of the mexican axolotl (ambystoma mexicanum) morphology of developing heart in cardiac lethal mutant mexican axolotls imunofluorescent confocal analysis of tropomyosin in developing hearts of normal and cardiac mutant axolotls a novel protein involved in heart development in ambystoma mexicanum is localized in endoplasmic reticulum differential cardiovascular responses to blockade of nnos or inos in rostral ventrolateral medulla of the rat the nnos/cgmp signal transducing system is involved in the cardiovascular responses induced by activation of nmda receptors in the rostral ventrolateral medulla of cats pressor and sympathoexcitatory effects of nitric oxide in the rostral ventrolateral medulla immunohistochemical assessment of cyclic guanosine monophosphate (cgmp) and soluble guanylate cyclase (sgc) within the rostral ventrolateral medulla nf-jb: a key role in inflammatory diseases development and use of a gene promoter-based screen to identify novel inhibitors of cyclooxygenase- transcription immunomodulatory effects of phytocompounds characterized by in vivo transgenic human gm-csf promoter activity in skin tissues arecoline and the - kda fraction of areca nut extract differentially regulate mtor and respectively induce apoptosis and autophagy: a pilot study a hybrid hrgc/ms/ms method for the characterization of tetrachlorinated-p-dioxins in environmental samples reducing the environmental sensitivity of yellow fluorescent protein. mechanism and applications establishment of a fluorescence resonance energy transfer-based bioassay for detecting dioxin-like compounds key: cord- -s qf lj authors: spiridonova, v. a. title: molecular recognition elements: dna/rna-aptamers to proteins date: - - journal: biochem mosc suppl b biomed chem doi: . /s sha: doc_id: cord_uid: s qf lj the review summarizes data on dna/rna aptamers, a novel class of molecular recognition elements. special attention is paid to the aptamers to proteins involved into pathogenesis of wide spread human diseases. these include aptamers to serine proteases, cytokines, influenza viral proteins, immune deficiency virus protein and nucleic acid binding proteins. high affinity and specific binding of aptamers to particular protein targets make them attractive as direct protein inhibitors. they can inhibit pathogenic proteins and data presented here demonstrate that the idea that nucleic acid aptamers can regulate (inhibit) activity of protein targets has been transformed from the stage of basic developments into the stage of realization of practical tasks. during the last decade a significant break has been achieved in the use of basic knowledge on dna in applied studies. the development of highly technolog ical analytic methods employing immobilized dna is one of rapidly developing directions. the major achievement of microarray technology (i.e. dna chips) consists in possibility of the use of various dna libraries amplified by polymerase chain reaction (pcr) for the development of sets of dna sequences. using hybridization these sets can rapidly analyze and compare sequences of thousands genes, their mutant forms, dna polymorphism and to discover new genes. the second direction consists in the develop ment of irrational design of nucleic acids for studies of nucleic acid protein recognition. in , two labs (the gold and szoztak laboratories; usa) indepen dently developed the selex method (systematic evolution of ligands by exponential enrichment) [ , ] . using this method it is possible to isolate targeted nucleic acid molecules (aptamers) from the large set of individual molecules (more than ) known as the combinatorial library. aptamers are small single stranded dna or rna molecules of ⎯ nucle otide residues in length with rather complex three dimensional structure. such complex structure deter mines aptamer ability to bind various molecules including proteins. thus, such complex process of biosynthesis of protein recognizing elements, antibod ies, which nature has been naturally creating for thou sands years, is now modeled in vitro. selection begins with generation of a large rna library with fixed ' and ' ends and a degenerated region of - nucleotides in length (fig. ) . such library contains - variants of rna molecules, which are folded in complex d structure. the library is incubated with a protein and rna molecules bound to the protein target are separated from unbound rna molecules. the bound rna molecules are separated from proteins and then amplified by means of reverse transcriptase and pcr to obtain a new pool of mole cules with increased affinity. the procedure is usually repeated - times until maximal number of aptamers exhibiting affinity to the target will be clearly detectable in the enriched fraction. aptamers are then cloned (usually into a bacterial vector) and sequenced. in the case of dna the selection also begins with dna library in which the randomized region is flanked by fixed sequences at ' and ' ends. to pro duce single stranded dna molecules either asymmet ric pcr or one of primers carries a biotinylated tag, which helps to separate one dna strand from another on streptavidin columns are used. many reviews on detailed description of all steps of this method have been published to date [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] . now, the idea that nucleic acid aptamers can regulate (inhibit) activity of protein targets has been transformed from the stage of basic developments into the stage of realization of practical tasks. in this review the major attention is paid to the aptamers to proteins involved into patho genesis of wide spread human diseases. tuer and gold proposed to use combinatorial rna libraries for creation of rna ligands selectively bound to t dna polymerase [ ] . rna ligands were named as aptamers by ellington and szoztak [ ] . they also introduced a name of this method, selex. now convincing evidence exists that aptamers are a new effective group of therapeutics, which may represent the scheme illustrating the selex method for preparation of dna and rna aptamers. an initial randomized dna library is transformed into single stranded dna (ssdna) and then introduced into binding reaction with a protein, pre immo bilized onto a column. rna is obtained by transcription of the initial library; the latter carries introduced promoter of t rna polymerase. rna molecules are also exposed for binding with the protein immobilized onto the column. after removal of unbound molecules, dna/rna molecules that bind to the immobilized protein are separated from this protein by phenol chlo roform treatment and then subjected to alcohol sedimentation. rna molecules are used for cdna preparation by means of reverse transcriptase. all resultant molecules are then transformed into double stranded dna (dsdna). in the case of dna ssdna prepared using asymmetric pcr is used again for repeated binding. rna molecules are subjected for transcription and resultant molecules are used for binding with the immobilized protein. the selection includes ⎯ rounds and this yields an enriched fraction of aptamers. the next step includes cloning into a plasmid vector and sequencing of resultant sequences. an important tool against many diseases. the drug macugen has already been approved by us fda (food and drug administration) for the treatment of age related macular degeneration (amd). some aptamers are now in different stages of pre and clini cal trials [ , ] . highly specific (antibody like) recognition and binding of aptamers to their protein targets make them attractive therapeutics. aptamers (as well as antibod ies) are folded into complex three dimensional struc tures and form hairpins and loops. the range of disso ciation constants characterizing binding of dna and rna aptamers to their protein targets varies from nanomolar to subnanomolar levels. aptamers can dis criminate related proteins consisting of the same structural domains [ ] [ ] [ ] [ ] . it should be noted that the use of fold excess of aptamer doses in animal models employed in preclinical trials and in therapeu tic applications in humans did not cause allergic reac tions [ , ] . studies on biocompatibility and phar macokinetics of aptamers and investigations of various modifications of aptamer structures have been per formed for their further applications as drugs [ ] . nuclease degradation is the major problem that complicates manipulations with oligonucleotides. protection against nonspecific action of nucleases during selection includes modification of pyrimidine nucleotides at ribose c ' (amino and fluoro deriva tives) [ ] [ ] [ ] , use of liposomes as carriers [ ] , post selection ribose c ' hydroxyl modifications by intro ducing methyl, allyl, amino groups, etc. [ ] [ ] [ ] . the molecular mass of short polynucleotides is - kda; this corresponds to - nucleotides. such a small size facilitates rapid renal filtration within a few minutes. aptamer modification by conjugation to polyethylene glycol or other agents and their attach ment to the liposome surface prolonged the period of aptamer action [ , ] . usually, aptamers exhibit high specificity towards their targets and this should be taken into consider ation in preclinical trials on animal models. however, protein orthologs may decrease efficacy of such com pounds. nevertheless, an improved selection process named as "toggle selex" seems to overcome this problem. toggle selex has been proposed for rna libraries incubated with the same protein from human plasma. during the follwing rounds of selection it was also exposed to binding with the animal ortholog used in subsequent preclinical trials [ ] . studies on pharmacokinetics of aptamers subjected to modifications should be accompanied by analysis of their excretion from the body. this is important because in addition to the main disease patients may have multiple dysfunctions including renal insuffi ciency. for regulation of aptamer activity rusconi pro posed the antidote proof of concept (the method of rational design) [ ] . using watson crick base pairing he designed an antidote structure as a complementary sequence that bound to the aptamer, altered its struc the mechanism of antidote action in the pair aptamer antidote (modified from [ ] ). the antidote is an oligonucleotide of nucleotides in length; it forms an inactive complex with the aptamer to factor ixa involved into the blood coagulation cas cade. possessing a sequence complementary to the aptamer, the antidote forms a complex with the aptamer and thus alters its d structure. antidote inactive form ture and, thus, prevented its complex formation with the protein target (fig. ) . the proposed concept gave a unique possibility of aptamer regulation because it allows to control administration of an aptamer based drug in any clinical use. considering aptamers as direct protein inhibitors it is very attractive to use them for studies of inhibitory mechanisms and also for therapeutic application. pos sibility of aptamer preparation for any class of biomol ecules allows to evaluate importance of their use and increases areas of their applications. advantage of aptamers maintaining their activity in multicellular organisms significantly facilitates preclinical trials, saves time and reduces costs required for antibody preparation on animal models. finally, design of an antidote that control aptamer activity possibly repre sents the most important contribution to the develop ment of nuclei acid therapy because it can control drug dosage and, thus, determine required safety. it should be noted that in addition to biomedical studies aptam ers are also used as a recognizing elements in microar ray based biosensors; this is another important and logic continuation of the development of the dna chip technology. this review summarizes current knowledge on aptamers to various proteins, their affinity to protein targets; it describes inhibitory properties of aptamers and information about preclinical and clinical trials. . . thrombin thrombin is a key protein in blood clotting process. this serine protease is generated during a cascade of proteolytic reactions initiated by epithelial damage. thrombin is produced from prothrombin by factor xa. active thrombin catalyzes the reaction of fibrinogen conversion into fibrin, which forms a fibrin matrix for the thrombus by "capturing" blood cells [ ] . throm bin also activates platelets via interaction with their par receptors and regulates expression of some sub strates and activation molecules such as p selectin [ , ] . problem of hemostasis requires creation of such thrombin inhibitor, which would be specific for blood clotting process, does not cause allergic reaction and effectively regulates this process. an anti thrombin aptamer was one of the first therapeutic aptamers obtained by the selex method. the single stranded dna aptamer was isolated from a pool of ~ oligo nucleotide sequences containing a nucleotide ran domized region [ ] . the round selection resulted in identification of aptamers forming a complex with thrombin, which was characterized by the k d values ranged from to nm. these aptamers were based on the nucleotide sequence: dggttggtgtg gttgg ( tba). the aptamer increased time required for clot formation from to s in vitro and from to s in human blood plasma. the tba structure was investigated by means of nmr analysis, which included the study of tba alone, in the complex with thrombin and the study of aptamer binding with the anion binding site of thrombin, exosite [ ] [ ] [ ] [ ] . tba forms a complex compact tertiary structure known as g quadruplex (fig. ). other laboratories also used the selex method to perform aptamer selection to thrombin [ , ] . tba also influenced platelet aggregation stimulated by thrombin. thrombin also caused proteolytic activa tion of platelet par receptor and its aptamer inhib ited this activation in a dose dependent manner [ ] . the anticoagulant activity of the aptamer was tested on monkeys. the prothrombin time (pt) increased by . fold in min and returned to base line min after aptamer administration. tba also inhibited platelet aggregation and prolonged platelet activation induced by thrombin [ ] . the aptamer was also investigated using an anticoagulation model of extracorporeal circulation in sheep. the pt values reached ⎯ s (versus . s of the baseline level), whereas control pt remained close to the baseline. in the other experiment tba was investigated using a the structure of the tba aptamer to thrombin, which inhibits fibrinogen hydrolyzing activity. tba has the oligonucleotide sequence dggttggtgtggt tgg. the g quadruplex structure is a structure forming element for dna. eight of nine guanines form two planar g quartets with three loops; the loop tgt located in the center and two symmetrical loops tt. the presence of octa coordinating calcium ion and stacking interaction between g quartets of the duplex determine maintenance of the g quadruplex. calcium ion is located between par allel planes of the g quadruplex. cardiopulmonary bypass (cpb) model. the study included examination of anticoagulation activity, pharmacokinetics and renal clearance of the aptamer [ ] . animals were subdivided into two groups: one group received injections of heparin ( u/kg) and protamine in boluses and was used as control of activ ity by cpb. the second group of animals received aptamer infusion ( . - . mg/kg per min). these animals were characterized by increased pt, activated partial thromboplastin time (aptt), and activated clotting time (act), which then returned to the base line after infusion [ ] [ ] [ ] . in the pharmacokinetic studies using the cpb model, the elimination half life of the aptamer was . min; however, during the min infusion this parameter increased up to . min. these results sug gested that the aptamer would function in the animal model and that unmodified dna aptamers were rap idly eliminated from the body. now this dna aptamer is under preclinical trials by archemix corp. for subsequent trials in humans. anti thrombin rna aptamers were obtained using the library with a nucleotide randomized sequence [ ] . the anti thrombin aptamers were iso lated after rounds of selection. the enriched frac tion was then cloned in the plasmid puc and sequenced; this yielded two classes of aptamers. the conservative motif in clones of the class i rnas was represented by the sequence uccggaucgaag uuaguaggcgga inside a variable zone. one of the best anti thrombin aptamers was characterized by the k d value of . ± . nm. members of the second class exhibited lower affinity (the k d value of ± . nm). competitive analysis with heparin and hirudin dem onstrated that heparin but not hirudin displaced the rna aptamer from its complex with thrombin. this suggests affinity of this aptamer to thrombin exosite ii. however, tests on functional activity in animal models have not been performed. in preclinical trials highly specific aptamers to human proteins may demonstrate lowered affinity to protein orthologs in animal models. to overcome this problem so called "toggle" approach has been pro posed: ' fluoro rna aptamers were incubated with a mixture of human and porcine thrombin during the first round and then porcine and human thrombin were alternatively used in subsequent rounds of selection [ ] . after rounds of selection clones with the con servative sequence gggaacaaagcugaaguac uuaccc have been found; they exhibited cross reac tivity with porcine and human thrombin. the complex with human thrombin was characterized by the disso ciation constant k d of . ± . nm and the complex with porcine thrombin had the k d value of ± pm. the aptamer increased clotting time (thrombin con centration was nm) of blood plasma from . ± . to . ± . s. in porcine plasma tog increase clot ting time from . ± . to . ± . s. improvement of thrombin dependent platelet aggregation by the aptamer occurred in the dose dependent manner. the higher effect was achieved using porcine platelets: a fold excess of tog inhibited thrombin activity by % [ , ] . factor viia (fviia) is a trypsin like protease involved into the coagulation cascade. in combination with the tissue factor (tf) fviia plays a critical role in thrombin formation and thus promotes active clot for mation. aptamers to fviia have been isolated from an rna library using the selex method. these aptam ers inhibited activation of factor x (inactive precursor in the coagulation cascade) by fviia [ ] . after rounds of selection from the ' amino modified rna library the isolated aptamers formed a complex with fviia characterized by the k d value of . ± . nm. specificity of some aptamers was investigated in bind ing reactions with other protein factors (fxia and fxa). the micromolar range of k d values determined for these complexes suggested nonspecific binding of these aptamers with protein factors. addition of the anti fviia aptamer inhibited an initial rate of fx acti vation by about %. experiments on dilution of the reaction mixture revealed a dose dependent mode of inhibition. the aptamer prolonged clotting time up to % in the pt test. factor ixa (fixa) is a serine protease that plays an important role in formation of critical mass of throm bin required for coagulation. the complex tf/fviia performs proteolytic cleavage of the protein factor fix into its active form fixa; the latter binds to fviiia on the platelet surface and activates factor fx to fxa, which catalyzes conversion of prothrombin into thrombin [ ] . rusconi et al. performed rna selection to fixa; after eight rounds of selection they found an aptamer, which bound to fixa with the k d value of . ± . nm and exhibited fold higher affinity to fixa compared with fviia, fxa, fxia and activated protein c [ ] . a truncated version of this aptamer ( . t) maintained high affinity to fixa (k d of . ± . nm) and totally inhibited fx hydrolysis by the enzyme complex. the anticoagulation activity of . t was evaluated using activated partial thromboplastin time (aptt). the aptamer increased clotting time in the dose dependent manner and caused a several fold increase in aptt. in continuation of the antidote theory rus coni obtained an rna antidote, which caused revers ible inhibition of . t thus creating a drug/antidote pair for the anticoagulation therapy. using the com plementary base pairing principle the second rna oligonucleotide complementary to the . t aptamer was created. after administration of the antidote nucleotide the anticoagulation activity of the anti fixa aptamer changed in min and this effect per sisted for over hours [ ] . almost in % of million people receiving heparin therapy heparin induced thrombocytopenia (hit) is developed after one year [ ] and this is the reason for cessation of the heparin therapy. patients who need repeated anticoagulant therapy receive hemodialysis, which complicates patient's life. to prolong the effect of the anti fixa aptamer in vivo rusconi c.p. et al. prepared a cholesterol deriva tive (ch . t), which exhibited high affinity (k d = . ± . nm) and anticoagulant activity [ ] . tests on por cine and mouse plasma have shown the same efficacy of the animal models as in the case of human plasma. using the porcine anticoagulant system the aptamer increased pt and aptt comparable with its effects on pt and aptt in human plasma samples. there was significant difference between the modified and initial aptamers: the cholesterol moiety increased half life of ch . t to - min (versus - min for - min). the antidote c neutralized more than % of the aptamer effect within min in animal models [ ] . the antithrombotic effect of the aptamer was investigated in mouse thrombosis model, which was induced by administration of ferric chloride to the carotid artery; mice were pretreated with ch . t or a functionally inactive aptamer with scrambled nucle otide sequence (negative control). all the mice in the negative control group developed an occlusive throm bus in . ± . min. in the aptamer treated group % of mice maintained clear normal carotid artery blood flow during min (time required for the occlu sive thrombus formation ≥ . min) [ ] . the effect of the c antidote was accessed using the model of active bleeding (tail transection). mice were pre treated with ch . t or an aptamer with scrambled sequence and the tail was cut h after the treatment. blood losses were measured for min after tail transection. animals treated with ch . t exhibited significantly more blood loss ( ± . μl) compared with controls ( ± . μl). administration of the c immediately after tail transection prevented hem orrhage in the aptamer treated animals (blood loss was . ± . μl) [ ] . the biopharmaceutical company regado bio sciences continues studies on the fixa aptamer anti dote pair named as reg , which is under first stage of clinical trials. hepatitis c virus (hcv) is a major cause of both sporadic and viral hepatites differed from hepatitis a or b [ ] . the nonstructural protein (ns ) is a serine pro tease that exhibits protease and helicase activity and represents a good target for inhibition of hcv. aptamer selection was performed using a library car rying a nucleotide randomized region and after rounds of selection two aptamers inhibiting protease and helicase activities were obtained [ ] . for identi fication of the aptamer demonstrating affinity to the active site of ns subsequent selection was performed using a truncated polypeptide Δns . using an rna library with a randomized region authors per formed rounds of selection and identified clones, which bound Δns [ ] . according to their nucleotide sequences aptamers were subdivided into three families. they all contained a conservative region ga(a/u)ugggac. these aptamers formed a complex with Δns with the k d value of nm, caused % inhibition of protease activity of the Δns peptide and full sized ns bound to a maltose binding protein (mbp ns ). in vivo hcv proteins are processed by ns and ns a cofac tor. for modeling of physiological conditions the aptamer effect on ns activity was tested in the pres ence of the p peptide, which caused a sevenfold increase of mbp ns activity. under these conditions the aptamer inhibited mbp ns activity by % [ ] . human neutrophil elastase (hne) is involved in various inflammatory diseases, including acute respi ratory distress syndrome (ards), septic shock, arthritis, and ischemia reperfusion injury [ ] . a covalent inhibitor of hne, a diphenyl phosphate derivative of valine (valp), was coupled to an rna library to enhance the binding of the inhibitor with hne [ ] . ten rounds of selection yielded an rna aptamer conjugated to the dna:valp substrate (rna . : dna:valp). the aptamer demonstrated bind ing to hne (k d = nm) and enzyme inhibition (k i = nm) in vitro. in contrast to the rna aptamer . or the substrate dna:valp administered separately the aptamer modified with the substrate inhibited hne ex vivo in the rat model of ards [ ] . the same group also performed a valyl phosphonate: dna library selection to find more potent hne inhibitors. authors used a single enantiomer form of the valyl phospho nate, which was compared with a racemic mixture. inhibitor selection was performed using purified elastase and also secreted elastase in the presence of neutrophils [ ] . after rounds of selection the aptamer ed , which inhibited hne, was found. the aptamer was truncated to a mer, named nx , and tested in a rat model of lung inflammation. a nmol dose of nx inhibited neutrophil infiltration by % in the lung of rat in vivo [ ] . . . vegf angiogenesis plays a central role in various physio logical and pathological processes. vegf (vascular endothelial growth factor) is one of the best character ized growth factors; it is involved into initial steps of angiogenesis and represents one of the most promising targets for anticancer therapy [ ] . an increased vegf level associated with angio genesis was observed during tumor growth and metastases, premature aging and age related degener ation of tissues [ ] [ ] [ ] . using the selex method ruckman performed rounds of selection cycles and isolated aptamers to human vegf with k d of pm in a ' f pyrimi dine rna library [ , , ⎯ ] . for increased sta bility against nucleases two aptamers were additionally modified by the ' o position [ ] . these aptamers were characterized by the k d values of and pm; they were specific to vegf and did not bind to related proteins: vegf and placenta growth factor plgf . the aptamers to vegf inhibited the bind ing of vegf to its receptors, flt and kdr (kinase domain receptor). using i labeled vegf inhibition of receptor binding was evaluated: theic values for aptamer competition with the flt receptor and kdr were ranged from - and - pm, respectively [ ] . therapeutic potential of the aptamers to vegf was evaluated by the miles assay representing simple and rapid means of monitoring the ability of aptamers to inhibit the activity of vegf in vivo. it is assessed as vascular wall permeability in animal models. the test was performed using adult guinea pigs and the most effective aptamer inhibited vascular permeability by % at μm [ ] . pharmacokinetics of the fluoro pyrimidine and ' o methyl purine aptamer to vegf called as nx has been investigated in monkeys. during intravenous administration of this aptamer as a conjugate with a kda polyethylene glycol was characterized by half life of . h and a clearance rate of . ml/h. subcuta neous administration resulted in % absorption into the tissues within - h [ ] . preclinical and clinical trial of nx also called as macugen was performed by eyetech pharmaceuti cals inc for the treatment of age related macular degeneration in diabetic patients [ , ] . the synthetic aptamer nx was also investi gated in the rat model of angiogenesis. these studies confirmed a significant inhibition by % of angio genesis by means of vegf in the presence of this aptamer. the a phase of clinical trials did not reveal any significant complications after a single adminis tration of the drug. in addition % of patients dem onstrated stable improvement during observation for months after injections, % of patients demon strated a threefold improvement of vision among dia betic patients (etdrs) [ ] . clinical trials (phase ) have shown that multiple macugen administration with or without photody namic therapy (pdt) did not cause any serious impairments; moreover . % of patients demon strated stable vision improvement and % of patients demonstrated significant improvement evaluated using the etdrs system (early treatment for dia betic retinopathy study). during the third phase of clinical trials macugen (under the commercial name pegaptanib) was used as the only drug every weeks over a period of weeks at a dose of . , and mg intravitreously [ ] . all three groups of patients demonstrated significant improvement of vision. the severe loss of visual activ ity determined as the loss of letters of visual acuity reduced from to % in the group receiving . mg of macugen. in addition, % of patients receiving this dose maintained their visual acuity or gained acu ity (versus % of control group). no antibodies against macugen were found. eytech in cooperation with pfizer obtained fda approval for the use of macugen for the treatment of amd. these first results demonstrate that aptamers may be effective drug prep arations. it is known that the intensive tumor development is accompanied by neovascularization and therefore an aptamer to vegf was used for inhibition of tumor vascularization (and tumor growth). the aptamer isolated and optimized by ruckman et al. was tested in a mouse model of wilmis tumor (the most common malignant tumor of the kidneys in chil dren). tumor was implanted into a mouse kidney and its growth was maintained for one week, and then the aptamer to vegf ( μg) or vehicle (phosphate buff ered saline) were administered for experimental and control mice respectively, daily for weeks [ ] . after decapitation of animals authors observed that tumors weighed % less in treated versus control animals. lung metastases were seen only in % of the aptamer treated animals (versus % of animals from the control group). the aptamers tested in the murine nephroblastoma exhibited the decrease in tumor growth by % compared with control [ ] . the increase in bfgf correlates with appearance of various diseases including retinopathy, rheumatoid arthritis, leukemia [ ] . jellinek and co authors used a ' amino pyrimi dine derivative rna library and performed rounds of aptamer selection [ ] . they found an aptamer named as m a, which exhibited binding to bfgf with k d of . nm; a competitive binding study revealed that it competed for bfgf binding with unfractionated heparin and low molecular weight hep arin. the inhibitory activity of m a was also investi gated using chinese hamster ovary (cho) cells: the aptamer bound to its target with the k d values of - nm [ ] . the effect of m a on the endothelial cell motility was also investigated using the migration of endothelial cells to a denuded area in bovine aortic cells where endogenous bfgf is essential for activity. at concentrations > nm the aptamer inhibited cell migration in a dose dependent manner (as compared with control). the rna aptamer inhibited bfgf bind ing to its cell receptor [ ] . platelet derived growth factor (pdgf) is a mito gen composed of two homologous (a and/or b) chains linked by three disulfide bonds; this dimeric protein is involved into wound healing and progression of vari ous diseases including atherosclerosis and glomerulo nephritis. many tumor cell lines produce and secrete pdgf [ ] . a dna selection in vitro against human recombi nant pdgf ab was performed and after rounds dna aptamers characterized by k d of pm were iso lated. three aptamers effectively inhibited pdgf bb binding to pdgf α and β receptors with the k i value of nm. the anti pdgf aptamers also inhibited mitogenic effects of pdgf on cells expressing pdgf β receptors with k i of . nm [ ] . one aptamer termed t was truncated, 'o methyl, ' fluoro modified and capped at the ' end (to increase resistance to nucleases) and conjugated to kda peg (to increase its lifetime in blood circula tion) [ ] . the modified aptamer exhibited high affin ity binding to the human protein (k d = pm); it was tested using a rat glomerulonephritis. in this model intravenous administration of this aptamer ( . mg/kg) twice a day decreased mitoses by % on day and by % on day . animals treated with this aptamer were characterized by a decreased mono cyte/macrophage index and glomerular matrix over production. control animals received a scrambled sequenced oligonucleotide or peg for days [ ] . interferon γ (ifn γ)exhibits various immunoregu latory effects. although its antiproliferative effect is less pronounced than in ifn α and ifn β, ifn γ is the most potent activator of macrophages and the inducer of expression of mhc class ii molecules [ ] . in healthy nervous tissue ifn γ is almost absent, how ever, during inflammatory processes in the nervous system and in multiple sclerosis it is overproduced. ifn γ secretion can result in inflammatory and autoim mune diseases. rna selection using fluoropyrimi dine and aminopyrimidine rna or a mixture of these two modifications were screened for aptamers that inhibited receptor binding of ifn γ [ ] . the resultant aptamer, ' amino had a k d value for its complex with receptor of . nm. in the culture of a cells it inhibited receptor binding of ifn γ with k i value of nm. this aptamer also inhibited induction of the mhc complex regulated by ifn γ and icam expression with ic values of and nm, respectively [ ] . endothelial receptor tyrosine kinase tie plays an important role in vascular wall stability. angiopoietin (ang ) is a natural antagonist, which is obviously expressed only during active angiogenesis (e.g. tumor growth) [ ] . for investigation of ang by aptamers rounds of rna selection were performed and rna molecules exhibiting specific binding to ang were iso lated. one aptamer demonstrated high affinity to ang (k d = . nm); it did not bind to ang (k d > μm). this aptamer was truncated to mer (k d = . nm) and the truncated aptamer inhibited ang function in a cell culture and in a rat model, where it significantly inhibited neovascularization by % [ ] . influenza is one of the most widespread disease in the world. control of this disease includes active cam paigns of vaccination, use of drugs blocking neuro minidase action. the use of aptamers helps to block virus binding to cell receptors. binding of isolated dna aptamers to viral hemagglutinins blocked virus penetration into cells [ , ] . extracellular domains of influenza hemagglutinin cause agglutination of blood cells (mainly erythro cytes). hemagglutinin determines virus binding to cells. neuraminidase is responsible for : ) ability of a viral particle to penetrate into the host cell; ) ability of viral particles to leave host cells after reproduction. two dna aptamers were obtained to the hemaggluti nin peptide (residues - ), which is responsible for binding of an oligosaccharide component of cell receptors. the aptamer a exhibited high binding activity and blocked agglutination of chicken red blood cells [ ] . the effect of a was confirmed by microscopic studies; they revealed preservation of cell structure compared with control preparations, in which damage of cell structure in the presence of influenza virus was observed. the same authors iso lated two rna aptamers to hemagglutinin using an rna library containing a nucleotide randomized region. a predicted secondary structure in bases long included nucleotides of the randomized region and also constant sequences of the flanking region. one of the aptamers exhibited binding to hemaggluti nin with the k d value of . nm. the affinity of this aptamer was fold higher than that of a monoclonal antibody to this protein. moreover, the rna aptamer allowed to discriminate hemagglutinins isolated from two various strains [ ] . binding proteins . . tat protein therapeutic applicability of aptamers has been undertaken during studies of hiv replication. human immunodeficiency viruses hiv and hiv that belong to a lentivirus class selectively affect t helpers. a regulatory tat protein activates viral replication. the tar element (of nucleotides long) presented in all predicted viral transcripts is required for func tioning of tat protein. it was proposed to express the nucleotide tar sequence to capture tat protein into an rna decoy [ , ] . tar rna, hiv tran script, was expressed in cem ss cells. the rna decoy inhibited hiv replication over % in vitro. inhibition of viral replication in cd + cells proceeded at the high level tar aptamer expression from the trna promoter. changes in the nucleotide sequence of hair pins or loop in the structure of the tar aptamer abol ished the ability of the tar aptamer to inhibit hiv rep lication [ , ] . the selex library consisted of a randomized sequence of nucleotides was used for selection of aptamers to tat proteins. after rounds of selection a truncated variant of an rna aptamer named as rna tat was obtained; it formed a complex with the tat protein with k d of pm. this mer rna aptamer inhibited hiv in vitro and decreased viral replica tion by % in a cell culture [ ] . other groups also found rna decoys, which bound tat protein and inhibited hiv [ , ] . no significant incompatibility between tar and tat interactions of hiv and hiv belonging to var ious subfamilies were found. however, although tat could transactivate hiv through tar , tat did not interact with tar in hiv [ ] . the viral protein rev promotes transportation of partially spliced rna molecules to cytoplasm, where they provide synthesis of usual retroviral products. the rre (rev responsive element) element composed of about nucleotides forms a complex three dimen sional structure, to which rev protein binds. using a trna promoter for expression of the rre element, the major rev binding site in hiv , overexpression of this construct has been estimated in the cells. expres sion of the chimeric trna rre aptamer caused inhi bition of viral replication by more than % [ ] . the aptamers passed phase clinical trials, in which this construct was transduced in vitro to cd + cells obtained from bone marrow of a hiv infected sub jects followed by subsequent reinfusion into these sub jects. aptamer administration did not cause adverse effects, however, a rather low level of rre gene expression was observed possibly due to inappropriate conditions for gene transfer [ ] . reverse transcriptase (rt) was the first target for the development of the selex method for hiv ther apy. tuerk and gold published the pioneer paper on selex. they used rt as the target for isolation of rna ligands, inhibiting hiv replication [ ] . after rounds selection from rna populations random ized at positions authors isolated rna that specifi cally bound to hiv rt and inhibited activity of this enzyme [ ] . the sructure of the rna aptamer was subsequently characterized and experiments per formed on cells indicated inhibition of hiv replica tion by - % [ ] . in addition, aptamer expressing t cells completely blocked the spread of hiv in cul ture [ ] . proliferation of myocardial and vascular cells is a central problem in the development of such cardiovas cular diseases as hyperplasia, atherosclerosis, malig nant tumors [ , ] . e f plays a central role in regu lation of cell proliferation. this factor exhibits highly specific binding to a double stranded dna containing eight base pairs tttcgcgc. the constructed mer dna aptamer containing a sequences for e f binding was tested for inhibition of e f activity [ ] . in vascular smooth muscle cells (vsmc) stimulated by e f the mer oligonucleotide (odn) inhibited vsmc proliferation and expression of the genes c myc and cdc , controlling cell cycle, and proliferating cell nuclear antigen (pcna). in vivo the mer odn was transfected to rats with experimental carotid injury and this markedly suppressed the fibrosis for mation compared with nontransfected arterial seg ments. furthermore, this inhibition continued up to weeks after a single transfection [ ] . transfer of an e f decoy can therefore modulate gene expression and inhibit smooth muscle proliferation and vascular lesion formation in vivo. using the selex method other aptamers to e f were also prepared; they inhib ited dna binding activity of this protein [ ] . these interesting results prompted dzau's group to test the e f aptamer in humans in order to determine whether it can limit intimal hyperplasia during intra venous administration [ , ] . the e f aptamer was delivered to the infra inguinal vein by transfection. cell transfection efficiency was %, expression of c myc and pcna reduced by and %, respectively, com pared with the control group. after months in the group of patients treated with the e f aptamers fewer occlusions were registered compared with control. the e f dna aptamer is now evaluated by cor genetch inc., in a phase iii study to estimate its effi cacy at limiting coronary and peripheral vascular damages. this aptamer is very close to clinical appli cation. the e f aptamer was also used for evaluation of long term protection from neointimal hyperplasia and atherosclerosis [ ] . hypercholesterolemic rabbits were treated with intravenous injections of e f aptamer or scrambled oligonucleotide. after months (when animals were put on a cholesterol containing diet) the e f aptamer treated group of animals was free of plaque whereas animals treated with the scram bled oligonucleotide and also control animals had extensive plaque formation [ ] . finally, selex was used to obtain an rna aptamer that would bind and inhibit e f. insertion of the e f aptamer into a trna expression cassette yielded rnas exhibited effective inhibition of e f binding to dna [ ] . to test the ability of the e rna aptamer to block proliferation, human fibro blasts were treated with e rna aptamer and prolifer ation was then induced. the rna aptamer inhibited s phase by % compared with control [ ] . thus, natural and in vitro selected aptamer can act as prolif eration inhibitors. transcription factor nf κb activates genes involved into inflammatory processes and synthesis of cytokines, interferons, mhc proteins, growth factors, and cell adhesion molecules, which play a central role in infarctions and various ischemic pathologies [ ] . it is also required for hiv gene expression and regula tion of cell tumors. a double stranded dna aptamer exhibiting high affinity binding to nf κb and named as a "natural decoy" was investigated in vivo using a cardiac ischemic/reperfusion model and a significant effect in inhibiting this injury was observed [ ] . in a rat model of thrombosis animals transfected with the nf κb aptamer showed improved recovery of coro nary flow ( % versus % in control) days after transfection [ ] . the aptamer treated group demon strated a lower percentage of neutrophil adhesion to endothelial cells ( % versus %) and a lower level of interleukin ( versus ng/mg) as compared with control [ ] . a fluorescent labeled aptamer to nf κb was investigated in a murine model of nephritis, where it blocked glomerular inflammation and expression of the inflammatory markers il α, il β, il , icam , vcam [ ] . using the selex method an rna aptamer was also genetade against the p subunit of nf kb. four teen rounds of selection yielded the rna aptamer, exhibiting high affinity binding to p and inhibition of nf kb binding to dna by preventing protein dimerization [ ] . work with aptamers has important advantages over antibodies: ⎯in clinical practice aptamers may be applicable in the same fields where antibodies are already used for treatment, but in contrast to antibodies aptamers are non immunogenic; ⎯aptamers exhibit the same high affinity to their protein targets as antibodies; ⎯aptamers can bind and penetrate to a pathologi cal nidus faster than antibodies; ⎯aptamer antidotes may be developed and they can control activity of the administered aptamer. the selex method originally developed for nucleic acid binding proteins is now actively used for studies of proteins, which lack natural complexes. the method is applicable for manipulations with individ ual proteins and for work with cell cultures. proc. natl. acad. sci. usa trombozy v kardiologii. mekhanismy razvitiya i vozmozhnosti terapii (thromboses in cardiology. mechanisms of develop ment and therapeutic capacities) proc. natl. acad. sci. usa proc. natl. acad. sci. usa proc. natl. acad. sci. usa proc. natl. acad. sci. usa proc. natl. acad. sci. usa proc. natl. acad. sci. usa, proc. natl. acad. sci. usa proc. natl. acad. sci. usa key: cord- -prygoc q authors: segawa, hiroaki; kato, masahiko; yamashita, tetsuro; taira, hideharu title: the roles of individual cysteine residues of sendai virus fusion protein in intracellular transport( ) date: - - journal: j biochem doi: . /oxfordjournals.jbchem.a sha: doc_id: cord_uid: prygoc q the role of intramolecular disulfide bonds in the fusion (f) protein of sendai virus was studied. the cysteine residues were changed to serine residues using site-directed mutagenesis. none of the cysteine mutant f proteins reacted with a monoclonal antibody specific for the mature conformation of the f protein, but eight of ten mutants reacted with an immature conformation-specific monoclonal antibody. the transport of these mutant proteins to the cell surface was drastically reduced. all of the cysteine mutant f proteins remained sensitive to endoglycosidase h (endo h) for h after their synthesis. moreover, cell surface transport of the hemagglutinin-neuraminidase (hn) protein co-expressed with each of these cysteine mutant f proteins was also reduced. these results suggest that all cysteine residues participate in the formation of intramolecular disulfide bonds, that co-translational disulfide bond formation is crucial to the correct folding and intracellular transport of the f protein, and that interaction of the f and hn proteins takes place intracellulary. the two glycoproteins of paramyxoviruses, the fusion (f) and hemagglutinin-neuraminidase (hn) proteins, are present as integral proteins which form spike-like projections on the outer surface of the viral envelope. the hn protein exhibits both the hemagglutinating and the neuraminidase activities, while the f protein has been shown to be involved in virus penetration, hemolysis and cell fusion (for reviews, see refs. and ). the f protein is synthesized as an inactive precursor, f o , which is cleaved by proteases to form the biologically active protein consisting of the disulfide-linked subunits fi and f . the n-terminal portion of the fi subunit is very hydrophobic, a feature which is highly conserved among paramyxovirus f proteins, and is suggested to mediate fusion of the virus envelope with the target membrane as well as cell fusion resulting in the syncytium formation. the sequences of paramyxovirus f proteins reveal the conservation not only of amino acid residues of the fusioninducing domain, cleavage site, and transmembrane domain, but also of cysteine residues at specific positions. as shown in fig. , mature sendai virus f o protein has cysteine residues, designated cl to c , and the relative position of cysteine residues is highly conserved among paramyxovirus f proteins ( , ) . this suggests that these cysteine residues play important structural and functional roles. two cysteine residues are present in the signal peptide and are not likely to participate in intramolecular disulfide bonds of the mature protein. the cl cysteine is in the f subunit, and the remaining cysteine residues are in the fi subunit. eight of the cysteine residues are clustered in a narrow region near the transmembrane domain. the formation of disulfide bonds takes place in the er and is co-translationally catalyzed by protein disulfide isomerase. it is suggested that only native disulfide bonds are found in the principal folding intermediates and that disulfide bond formation plays an integral role in the folding. the mechanisms of folding, oligomeric assembly, and sorting of viral membrane proteins have been characterized in recent years (for review, see ref. ) . to date, the roles of individual cysteine residues in the folding of newcastle disease virus (ndv) ( ) and measles virus ( ) have been reported. on the other hand, the roles of individual cysteine residues of paramyxovirus f proteins have not been characterized and remain largely unknown. intramolecular and intermolecular disulfide bonds are essential components of the structure of the majority of proteins, and it is of great interest to understand their functions. roles of disulfide bonds that have been suggested to include (i) aiding in protein folding and maturation and (ii) maintenance of stability and solubility. two general approaches, in vivo reduction using reducing reagents ( ) ( ) ( ) ( ) and site-directed mutagenesis to substitute each of the cysteine residues with other amino acids ( , , ( ) ( ) ( ) ( ) , have been used to study the functional role of disulfide bond formation within cells. recently, the sites of the disulfide bonds of sendai virus f protein were determined using protein sequencing analysis, and all cysteine residues were shown to participate in disulfide bond formation { ). thus, mutagenesis to substitute each cysteine residue should provide additional tgc-tcc tgt-»tct tgt-uct tgotcc tgt-»tct tgt-uct tgotcc tgt-»tct tgc-»tcc tgt-»tct a findings regarding the functional role of each disulfide bond in the folding, intracellular transport and antigenicity. previously, we described the efficient expression of sendai virus f gene cdna with mutations at its cleavage site and hn gene cdna to induce cell fusion ( ) . in this study, we employed site-directed mutagenesis to prepare mutant f protein genes at each of the cysteines and analyzed the role of each cysteine residue in f protein intracellular processing, cell surface expression, and immunoreactivity with different monoclonal antibodies (mabs) after transient expression of the cysteine mutants in cos cells. plasmids-mutagenesis of pcr fragments by use of mismatched primers was performed in a three-step pcr ( ) . to replace each cysteine, except for c , we used the sets of four primers listed in table i . in the first pcr, two separate pcr reactions were run using the '-mutagenic and '-outer primers and the '-mutagenic and '-outer primers to amplify separate, overlapping sequences of the template plasmid puc-f ( ). the '-mutagenic and '-mutagenic primers have overlapping homologous regions containing a mutation of interest. following amplification, the pcr products were purified, mixed, then denatured and re-annealed. an overlapping duplex was formed and extended by the second pcr reaction without primer to give the full-length target sequence containing the mutation. this mutated pcr fragment was amplified by the third pcr using the '-outer primer and '-outer primer used in the first pcr. plasmid psrd-fcls was constructed as follows: the pcr product containing a mutation at its cl site was digested with clal and ligated to the fc fragment, which was excised from puc-f by digestion with clal and hindlll. the ligated whole f gene was purified by gel extraction, treated with klenow fragment of dna polymerase i (klenow fragment) to create blunt ends, then treated with t polynucleotide kinase and ligated into the expression vector pcdl-srd (psrd) ( ) , which had been plasmid psrd-fc s was constructed as follows: the pcr product containing a mutation at its c site was digested with bglll and pstl and used to replace the corresponding fragment of puc-f. subsequently, the fc s gene was excised by digestion with hindlll, blunt-ended by treatment with klenow fragment, and inserted into psrd, which had been linearized by digesting with £cori and treated with klenow fragment. to construct the expression plasmids psrd-fc s, psrd-fc s, psrd-fc s, psrd-fc s, psrd-fc s, psrd-fc s, and psrd-fclos, we used the following procedure: a cdna fragment containing the carboxyl terminal portion of f protein from to amino acid residues was subcloned into pbluescript ii sk+ (stratagene cloning systems, la jolla, ca) at the pstl site to yield pbs-fc. mutagenesis was performed as described above, and the resulting pcr products containing a mutation at each cysteine site were digested with xmnl and ndel, then inserted into pbs-fc, which had been digested with smal and ndel. the resulting plasmids were digested with pstl, and the mutated fc fragment was ligated with . kb pstldigested psrd-f. plasmid psrd-fc s was constructed as follows: pcr was performed using n primer and mutagenic primer '-flcs . the pcr product was digested with bamkl and inserted into pbs-fc, which had been digested with the same restriction enzyme. the resulting plasmid was digested with psti, and the mutated fc fragment was ligated with . kb psti-digested psrd-f as described above. the mutations at the desired sites of all the mutant dnas were confirmed by dna sequencing ( ) . as shown in fig. , a total of cysteine mutants were prepared and designated fcls to fc s from the amino terminus of the f o protein to the carboxyl terminus, corresponding to the cysteine residues at amino acid positions , , , , , , , , , and , respectively. the expression plasmid psrd-hn was constructed by inserting the whole hn gene into psrd as described previously ( ) . cell culture and dna transfection-monkey cos- cells ( ) were grown in dulbecco's modified eagle's medium (dmem) supplemented with % fetal calf serum at °c in % co . for transient expression of the f protein, cos cells were grown to % confluency in -mm tissue culture dishes, then transfected with jug of expression plasmid per dish by calcium phosphate precipitation ( ) . cells incubated for h after transfection were analyzed by immunoprecipitation ( ) . antibodies-anti-sendai virus antiserum was prepared from sendai virus-infected rabbits. the antiserum used for immunoprecipitation was obtained from rabbits immunized with f proteins, which were expressed in escherichia coli as maltose-binding fusion proteins and purified by amylose resin as described by the manufacturer (new england biolabs, beverly, massachusetts). anti-f serum used to detect cell surface or intracellular expression of the f protein was obtained from rabbits immunized with the fj protein, which was purified from virions by sds-polyacrylamide gel electrophoresis (sds-page) ( ) . anti-hn serum used to detect hn protein by western blotting was obtained from rabbits immunized with the hn protein, which was purified from virions by sds-page. monoclonal antibodies (mabs) f- , f- , f- , and yl- were the generous gift of dr. h. tozawa (kitasato university). mab f- reacts with the mature form of f protein ( ) , and f- recognizes both f o and f! in immunoprecipitation but reacts specifically with f o in western blot analysis. mab f- reacts with the immature form of f protein. in a competitive binding assay, the mabs were divided into two groups, f-i and f-ii. mabs f- and f- belonged to the f-i group and f- belonged to the f-ii group ( ) . mab yl- recognizes the hn protein. indirect immunofluorescence staining-indirect immunofluorescence staining was performed as described previously ( ) . in general, cos cells were grown on glass coverslips and transfected with plasmid dna as described above. forty-eight hours after transfection, cells were washed with ice-cold phosphate buffered saline (pbs) and fixed with acetone for min at - °c or with . % paraformaldehyde in mm sodium phosphate buffer (ph . ) for min at room temperature. the cells were washed twice with pbs for min, then incubated for h at °c in pbs containing % bsa, . % nan , and anti-f rabbit serum or yl- (diluted : ). after incubation with the antiserum, cells were washed with pbs twice, then incubated for h at °c in pbs containing nan , bsa, and fluorescein isothiocyanate (fitc)-conjugated anti-rabbit igg or anti-mouse igg (wako pure chemical industries), each diluted : , as secondary antibody to anti-f rabbit serum or yl- , respectively. immunofluorescence was examined by fluorescence microscopy. western blot analysis-cos- cells (in -well dishes) were transfected with plasmid dna as described above. forty-eight hours after transfection, cells were washed with ice-cold pbs containing mm iodoacetamide (iaa), lysed in a l of ripa buffer [ . m tris-hcl (ph . ), . m nacl, % triton x- , % sodium deoxycholate, . % sds] containing mm pmsf and mm iaa, and sonicated for s. twenty-five microliters of xsds sample buffer [ . mm tris-hcl (ph . ), % -mercaptoethanol ( -me), % glycerol, % sds, . % bromophenol blue] was added to the cell lysates and boiled for min. fifteen microliters of each sample was electrophoresed on a % polyacrylamide gel. proteins were transferred to immobilon-p membrane (millipore) by semi-dry electroblotting. the membrane was preincubated in trisbuffered saline (tbs) [ . m tris-hcl (ph . ), . m nacl] containing . % tween , % low fat milk for h at room temperature. the membrane was then washed in tbs containing . % tween , incubated for h at room temperature in tbs-tween buffer containing a : , each dilution of the anti-f and anti-hn antisera, washed twice for min in tbs-tween, and incubated for h at room temperature in tbs-tween containing a : , dilution of horseradish peroxidase (hrp)-conjugated protein a (e. y laboratories). after extensive washing of the membrane, bound antibodies were detected using the ecl western blotting detection reagent system (amersham). immunoprecipitation-cos- cells (in -mm dishes) were transfected with plasmid dna as described above. forty-eight hours after transfection, the medium was replaced with methionine-and cysteine-free minimum essential medium (mem). twenty minutes later, cells were labeled with ix\ of methionine-and cysteine-free mem supplemented with [ s] methionine and [ s] cysteine ( /^ci/ml, , ci/mmol; du pont/nen), and labeling was continued for min. the cells were chased with dmem supplemented with % fbs, mm methionine, and mm cysteine for the time indicated in each figure legend. then the cells were washed twice with ice-cold pbs containing mm iaa, lysed in /^ of ripa buffer containing mm iaa, sonicated for s, and centrifuged for min at , rpm. fifty microliters of the supernatant (cellular extract) was incubated with //i of antibody raised against recombinant f protein expressed in e. coli or mabs reacting with various epitopes ( ) on ice for h. twenty microliters of a suspension of protein-a sepharose fast flow (pharmacia biotech, uppsala, sweden) was added to the lysates. the mixture was incubated at °c for h with gentle mixing, and immune complexes adsorbed on the beads were washed three times in mm tris-hcl (ph . ), mm nacl, mm edta, . % (w/v) gelatin, . % (v/v) np- , . % (w/v) nan , then denatured in sds sample buffer [ . mm tris-hcl (ph . ), % sds, % glycerol] in the presence or absence of % -me by boiling for min. the samples were analyzed by % sds-page followed by fluorography. glycosidase treatment-the immune complexes adsorbed on protein-a sepharose were resuspended in fi\ of mm sodium acetate buffer (ph . ), then samples were incubated with mu of endo-y -at-acetylglucosaminidase h (endo h: boehringer mannheim biochemica) for h at °c. after digestion, the immune complexes on the beads were recovered by centrifugation and denatured in sds sample buffer containing % -me by boiling for min. the immune complexes were analyzed by sds-page followed by fluorography. expression of the cysteine mutant f proteins-the f protein of the z strain of sendai virus possesses cysteine residues. the first and second cysteine residues are present in the signal peptide while the rest are distributed across the entire ectodomain of the protein. the signal peptide is cleaved in the er, and thus the remaining cysteine residues are present in the mature f protein. the relative locations of these cysteine residues are highly conserved among paramyxovirus f proteins, and these cysteines are likely to participate in intramolecular disulfide bonds in the mature protein. to examine the processing of the intramolecular disulfide bonds and their contribution to the structure of the f protein, site-directed mutagenesis using pcr was used to substitute each cysteine residue by a serine residue to prevent the formation of disulfide bonds. ten expression plasmids of the cysteine mutant f proteins, named fc s to fc s, were prepared as shown in fig. . recombinant plasmids were introduced into cos cells by the calcium phosphate precipitation method ( ) , and the cells were incubated for h. transfected cells were labeled with [ s]methionine and [ s] cysteine for min, chased for h, then lysed in ripa buffer. the lysates were immunoprecipitated with anti-f antiserum and analyzed by % sds-page under reducing ( fig. a) or nonreducing (fig. b) conditions. in fig. a , psrd-f transfected cells gave one main band (lane ) with mobility corresponding to the uncleaved form of f, or f o , synthesized in sendai virusinfected cos cells as reported previously ( ) . the cysteine mutant-transfected cells also gave one main band corresponding to the uncleaved f o protein. a minor band of about kda was also detected in all cells transfected with recombinant plasmids (lanes - ), and this seems to be nonspecifically degraded f proteins or unglycosylated f proteins, because the unglycosylated f proteins and immature f proteins could be detected by the antiserum used here, which was raised against the unglycosylated f proteins expressed in e. coli. the cysteine mutant psrd-fc s ( fig. a, lane ) transfected cells gave three bands, one main band corresponding to the f o protein, the common minor band ( kda), and another higher molecular weight species of f protein. the alteration of the cysteine residue of the f subunit to a serine residue introduced a new glycosylation site in this protein sequence. thus, the higher molecular weight species seems to be the f protein utilizing the additional new glycosylation site. approximately equal levels of most of the f proteins in the cells transfected with the above plasmids were detected by western blot analysis (data not shown). this showed that these cysteine mutant f proteins were efficiently expressed at almost the same level as wild-type f protein in psrd-f transfected cells and were relatively stable. under nonreducing conditions, formation of disulfide bonds in a protein should allow it to assume a more compact form and to migrate faster on sds-page than unfolded forms without disulfide bonds, as reported by machamer et al. ( ) . as shown in fig. b , the compact and unfolded forms of monomer f proteins and aggregates were detected with the mutants as well as the wild-type f protein. the compact form migrated as a kda band and the unfolded form migrated as a kda band, which is similar to the molecular mass of the reduced form of f protein as shown in fig. a . most of the wild-type f protein migrated as the compact form (lane ). on the other hand, the relative amount of the unfolded form of the each cysteine mutant f proteins was increased, and the ratio of the compact form to unfolded form was roughly as determined by densitometry (lanes [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] . this indicated that all cysteine residues contributed to make the compact form. no obvious oligomeric form of f proteins was detected in the mutant or wild-type f protein-expressing cells. this implied that f protein does not form disulfide-linked oligomers. although the aggregates disappeared from the top of the gel when the radiolabeled cell lysates were reduced by boiling in % -mercaptoethanol before sds-page, treatment with thiol alkylation reagents such as iodoacetamide (iaa) or iv-ethylmaleimide (nem) had little effect on the formation of the aggregates. in preliminary experiments, we did not detect such aggregates in sendai virus-infected cells analyzed under nonreducing conditions. these observations indicated that these aggregates were formed before sample preparation and that other viral factors might be required for efficient folding of f protein. antibodies-to examine the antigenicity of the cysteine mutant proteins, the cell lysates were immunoprecipitated with mabs f- , f- , and f- and analyzed by sds-page under reducing conditions. in preliminary experiments, f- detected intracellular f proteins but not cell surface-transported f proteins. mab f- recognized both f o and fj in the immunoprecipitation, but specifically recognized f o in western blot analysis (unpublished data). mab f- recognized the mature form of f protein ( ) . thus, these mabs bind independent epitopes. as shown in fig. , only the wild-type f protein was precipitated with anti-mature conformation mab f- (panel a, lane ), the wild-type f protein and a small amount of fc s protein were precipitated with mab f- (panel b, lanes and ), and the wild-type f protein and most of the cysteine mutant f proteins, except for the fc s and fc s proteins, were precipitated with mab f- (panel c, lanes - and - ). these data indicated that loss of any one of the cysteine residues of the f protein had dramatic effects on the immunoreactivity to anti mature mabs and that all of the cysteine mutant f proteins were blocked at some stage in their maturation. cell surface transport of cysteine mutant f proteins-to examine the expression of cysteine mutant f proteins in the cells and their transport to the cell surface, the cells were fixed with acetone or paraformaldehyde and examined by indirect immunofluorescence staining using anti-f antiserum. to detect subcellular localization, the cells expressing wild-type and cysteine mutant f proteins were fixed with acetone, then examined by indirect immunofluorescence staining (fig. , panels a, c, e, g, i, k, and m). we expected that a pair of cysteine mutants which had altered cysteine residues involving the same disulfide bond might show the same expression phenotype. recent results by iwata et al. ( ) identified specific disulfide bonds between cl and c , c and c , and c and c , while cio appears to be linked to c , c , or c . thus, only the cysteine mutants fc s, fc s, fc s, fc s, and fc s were shown in fig. . the cos cells transfected with vector plasmid psrd showed no fluorescence (panel a), whereas the cells transfected with psrd-f (panel c), psrd-fcls (panel e), psrd-fc s (panel g), psrd-fc s (panel i), psrd-fc s (panel k), and psrd-fc s (panel m) displayed a bright staining pattern, and the frequency of positive cells was estimated to be - % of total cells in each case. as shown in panel c, the cells expressing wild-type f protein displayed internal staining throughout the cytoplasmic reticulum as well as in the juxtanuclear region. on the other hand, the cysteine mutant-expressing cells showed an intracellular staining pattern limited to a reticular perinuclear structure (panels e, g, i, k, and m). this indicates that cysteine mutant f proteins were efficiently expressed in these cells. to detect whether the expressed proteins were transported to the cell surface, the cells were fixed with paraformaldehyde instead of acetone, then examined by indirect immunofluorescence staining. the cells transfected with psrd-f displayed a staining pattern on the cell surface (panel b), whereas the surface fluorescence of the cells transfected with the cysteine mutant f plasmids was clearly much less intense than that of cells expressing wild-type f protein (panels d, f, h, j, and l). this indicates that the wild-type f protein was properly transported to the cell surface, but these cysteine mutant f proteins were not transported to the cell surface. in sendai virus-infected cells, both f protein and hn protein are transported to the cell surface. to examine the effect of hn on cell surface expression of wild-type and cysteine mutant f proteins, cells co-transfected with the hn gene and wild-type f or one of the cysteine mutant f genes were fixed with paraformaldehyde, then cell surface expression of f or hn proteins was tested with anti-f antiserum or anti-hn mab yl- . the cell surface expression of cysteine mutant f proteins was not detected (data not shown). furthermore, as shown in fig. a , the cell surface expression of the hn protein was drastically reduced in the cells co-expressing hn and cysteine mutant f proteins. in panels (b) and (c), hn protein was efficiently detected at the cell surface of psrd-hn-or psrd-f and psrd-hn-transfected cells. on the other hand, the cell surface expression of the hn protein was drastically reduced in the cells co-transfected with psrd-hn and one of the cysteine mutant f genes (examples shown in fig. a panels d, e, and f). as shown in fig. b , in western blot analysis, both hn and wild-type or the cysteine mutant f proteins were detected in the cells co-transfected with hn and wild-type or cysteine mutant f genes (lanes [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] . the expression levels of hn proteins in these cells and in the cells transfected with hn gene alone were similar (lanes - ) . these results indicated that association of f and hn proteins takes place intracellulary and that the transport deficiency of the cysteine mutant f proteins affected the intracellular transport of the hn protein. intracellular transport of f proteins-the f protein has three iv-glycosylation motifs and contains complextype sugar chain(s) ( ) . the f protein is synthesized on the rough er, then glycosylated in a well-defined sequence in the er and golgi apparatus in the process of transport to the cell surface. high mannose-type sugar chains of the glycoprotein in the er and the cis-golgi compartments are sensitive to digestion with endoglycosidase h (endo h), while sugar chains once trimmed and further processed in the medial and trans-golgi compartments are not. we therefore determined the critical state in the transport process of the cysteine mutant f proteins by testing endo h sensitivity of the sugar chains. to follow the process of maturation of glycoproteins, the transfected cells were labeled with [ s]methionine and [ s] cysteine for min at h after transfection. after being chased for h, cells were lysed and the lysates were analyzed after immunoprecipitation with anti-f followed by endo h digestion, as described in "materials and methods." as shown in psrd-fcls-transfected cells was partially glycosylated f protein modified at a novel glycosylation motif generated by the mutagenesis. because the antiserum used in this study reacts efficiently with immature f protein, relatively large amounts of endo h-sensitive species of f proteins were detected even in the cells expressing wild-type f protein. these data showed that cysteine mutant f proteins were trapped in the er or the cis-golgi region, which prevented their transport to the cell surface. we used mutagenesis of the cysteine residue of the f protein to examine the effect of the disruption and rearrangement of the disulfide bonds on the folding of the f protein and its transport to the cell surface. we found that all cysteine mutant f proteins were efficiently expressed in cos cells ( fig. a) , but failed to fold into the compact form and to achieve proper conformation (fig. ) . they were retained in the er to cis-golgi compartments (fig. ) , formed aggregates (fig. b) , and failed to be transported to the cell surface (fig. ) . this indicated that disulfide bond formation involving all cysteine residues is required for the f protein to exit the er. although our cysteine mutant f proteins were glycosylated, none of them acquired native structure. assignment of disulfide bonds of the f protein has been reported ( ); thus, we expected that we would be able to detect some specific folding intermediates corresponding to the lack of individual disulfide bonds by analyzing individual cysteine mutant f proteins under nonreducing conditions. when mutations are made in either one of a pair of cysteines that are normally linked in a mature structure, the phenotype of the mutant proteins is often similar. in our study, two species of f proteins were detected in the cysteine mutant f proteins under nonreducing conditions: folded f protein and unfolded f protein. this indicated that formation of incorrect disulfide bonds led to rapid unfolding or rearrangement of disulfide bonds. as shown in fig. , almost all cysteine mutants exhibited a drastic decrease in reactivity to the mature conformation specific mab f- . this result showed that cysteine residues contribute to the whole protein structure through disulfide bridging, even though they are fairly distant from each other in the primary amino acid sequence. mab f- could not immunoprecipitate fc s and fc s proteins but could immunoprecipitate the rest of the mutants. this implied that the epitope of f- is present in the loop formed by the c and c disulfide bond, and the antigenic site is exposed when the f protein is inside the cells, but buried when the f protein is expressed on the cell surface. fc s showed only slight reactivity to mab f- . these data indicated that f- and f- may recognize cysteine-rich domains and supported the idea that the cysteine-rich domain in the bunched structure of paramyxovirus f proteins is recognized by mabs ( ) . since the formation of the correct intramolecular disulfide bonds seems to be a complex process involving disruption and rearrangement of disulfide bonds during intracellular transport, the three-dimensional structure of these cysteine mutants was changed, and the immunoreactivity of mabs with these mutant proteins decreased dramatically. these data suggested that co-translational disulfide bond formation is an absolute requirement for subsequent folding. because of the incorrect folding of cysteine mutant f proteins, their transport to the cell surface was reduced drastically and they contained only high mannose-type oligosaccharides. these results indicated that the mutant proteins were retained in the er. viral glycoprotein mutants that had a temperature-sensitive phenotype and folded correctly at °c but misfolded at °c have been reported ( ). in hsv- , the cysteine mutant gd proteins showed temperature sensitive cell surface expression ( ) . we examined whether cysteine mutant sendai virus f proteins show temperature sensitivity, but we could not detect cell surface expression of the cysteine mutant f proteins by incubating mutant gene-expressing cells at °c or at . °c (data not shown). these results suggest that the inhibition of the expression of the cysteine mutant f proteins on the cell surface occurred at a stage involved in the intracellular transport. though many reports have shown that cysteine mutant proteins are improperly folded and found as disulfide -linked aggregates in the er, there are some reports that cysteine mutant proteins could be transported to the cell surface, e.g., newcastle disease virus (ndv) hn, measles virus h, and asialoglycoprotein receptor h b subunit, all of which have type ii topology ( , , ) . it seems that type i transmembrane proteins more strictly require disulfide bond formation than do type ii proteins in order to exit from the er. we previously found that cleavage of an intrinsically protease-sensitive mutant f, fmut, was enhanced by coexpression with hn protein ( ) , indicating that f protein associates with hn protein intracellulary. thus, we examined the effect of co-expression of hn on cell surface expression of cysteine mutant f proteins. although the expression of both proteins was detected at roughly equal levels by western blot analysis (fig. b) , cell surface expression of cysteine mutant f proteins was not detected, and a decrease of cell surface expression of hn protein was observed (fig. a) . these observations are in accord with the down-regulation of the human parainfiuenza type (hpiv ) hn protein cell surface expression by the mutant f protein containing an intracellular retention signal ( ) . recently, many reports have shown a direct interaction of paramyxovirus f and hn proteins ( ) ( ) ( ) ( ) . however, the cell surface expression of f and hn proteins is reported to occur at different rates because of the different degrees of association with bip ( ) . thus, decrease of cell surface expression of hn protein in the cysteine mutants coexpressing cells may be caused not only by direct interactions of the f and hn proteins but also by indirect interaction affected by binding of molecular chaperons such as bip or calnexin. morrison et al. reported that ndv f protein undergoes conformational change with disruption and rearrangement of disulfide bonds during intracellular transport, and that the conformational change occurs before the cleavage reaction ( ) . because of the intracellular transport deficiency of the cysteine mutant f proteins, we could not identify the cysteine residues involved in such disulfide bond rearrangements. to study sequential disulfide bond formation and bond rearrangements, in vivo reduction experiments using well-characterized mabs are also now under way. paramyxovirus fusion: a hypothesis of changes the paramyxoviruses determination of the complete nucleotide sequence of the sendai virus genome rna and the predicted amino acid sequences of the f, hn and l protein folding and assembly of viral membrane proteins the role of the individual cysteine residues in the formation of the mature, antigenic hn protein of newcastle disease virus role of individual cysteine residues in the processing and antigenicity of the measles virus haemagglutinin protein ing disulfide bond formation and protein folding in the endoplasmic reticulum quality control in the endoplasmic reticulum folding and misfolding of vesicular stomatitis virus g protein in cells in vitro role of cotranslational disulfide bond formation in the folding of the hemagglutinin-neuraminidase protein of newcastle disease virus disulfide bonds in folding and transport of mouse hepatitis coronavirus glycoproteins cysteine mutants of herpes simplex virus type glycoprotein d exhibit temperature-sensitive properties in structure and function disulfide bond structure of glycoprotein d of herpes simplex virus types and the contribution of cysteine residues to antigenicity and extent of processing of herpes simplex virus type glycoprotein d enhanced folding and processing of a disulfide mutant of the human asialoglycoprotein receptor h b subunit assignment of disulfide bridges in the fusion glycoprotein of sendai virus transfection of sendai virus f gene cdna with mutations at its cleavage site and hn gene cdna into cos cells induces cell fusion a general method for rapid site-directed mutagenesis using the polymerase chain reaction construction of expression plasmids for the fusion protein of sendai virus, and their expression in e. coli and eucaryotic cells sra promoter: an efficient and versatile mammalian cdna expression system composed of the simian virus early promoter and the r-u segment of the human t-cell leukemia virus type long terminal repeat dna sequencing with chain-terminating inhibitors sv -transformed simian cells support the replication of early sv mutants a new technique for the assay of infectivity of human adenovirus dna cleavage of structural proteins during the assembly of the head of bacteriophage t conformational aberrance of sendai virus f protein in thapsigargin-treated cells allowing exit from the endoplasmic reticulum but causing arrest at the golgi complex neutralizing activity of the antibodies against two kinds of envelope glycoproteins of sendai virus expression of the sendai virus fusion protein and the hemagglutinin-neuraminidase protein using a baculovirus vector heavy chain binding protein recognizes incompletely disulfide-bonded forms of vesicular stomatitis virus g protein carbohydrate structures of hvj (sendai virus) glycoproteins perturbation of cellular calcium blocks exit of secretory proteins from the rough endoplasmic reticulum downregulation of paramyxovirus hemagglutinin-neuraminidase glycoprotein surface expression by a mutant fusion protein containing a retention signal for the endoplasmic reticulum identification of regions on the hemagglutinin-neuraminidase protein of human parainfluenza virus type important for promoting cell fusion functional interaction of paramyxovirus glycoproteins: identification of a domain in sendai virus hn which promotes cell fusion association of the parainfluenza virus fusion and hemagglutinin-neuraminidase glycoproteins on cell surfaces detection of an interaction between the hn and f proteins in newcastle disease virus-infected cells selective and transient association of sendai virus hn glycoprotein with bip conformational change in a viral glycoprotein during maturation due to disulfide bond disruption key: cord- - d o y authors: rein, theo title: post-translational modifications and stress adaptation: the paradigm of fkbp date: - - journal: biochem soc trans doi: . /bst sha: doc_id: cord_uid: d o y adaptation to stress is a fundamental requirement to cope with changing environmental conditions that pose a threat to the homeostasis of cells and organisms. post-translational modifications (ptms) of proteins represent a possibility to quickly produce proteins with new features demanding relatively little cellular resources. fk binding protein (fkbp) is a pivotal stress protein that is involved in the regulation of several executers of ptms. this mini-review discusses the role of fkbp in the function of proteins responsible for setting the phosphorylation, ubiquitination and lipidation of other proteins. examples include the kinases akt , cdk and gsk β, the phosphatases calcineurin, pp a and phlpp, and the ubiquitin e -ligase skp . the impact of fkbp on ptms of signal transduction proteins significantly extends the functional versatility of this protein. as a stress-induced protein, fkbp uses re-setting of ptms to relay the effect of stress on various signaling pathways. the physiological stress response is elicited whenever a change in the environment is sensed and interpreted as threat to homeostasis. effector systems comprise the autonomic nervous system and the hypothalamic-pituitary-adrenocortical (hpa) axis [ ] . the effector molecules of these systems are noradrenaline and adrenaline for the autonomic nervous system, and cortisol for the hpa axis (corticosterone in rodents). adrenaline and noradrenaline act through g protein-coupled receptors that are hooked to various intracellular pathways involving determinants of post-translational modifications (ptms) such as kinases and phosphatases [ ] . these determinants are also referred to as 'writers' and 'erases' of ptms [ , ] . using ptms for signal transduction comes with the obvious advantage of allowing for quickly changing the mode of action of pre-synthesized proteins in a vast functional space defined by the huge number of possible combinations of ptms at different amino acids [ , ] . moreover, ptms are reversible at a small time scale, which is another important feature for the stress response, in particular when stress exposure is short [ ] . the hpa axis also activates g protein-coupled receptors, namely the receptors for the hormones corticotropin-releasing factor and adenocorticotropic hormone [ ] . however, its final effector cortisol operates through the steroid receptors glucocorticoid receptor (gr) and mineralocorticoid receptor [ ] . these receptors are transcription factors, and thus this part of the overall stress response typically takes longer than the time required to redefine the function of proteins through re-setting their ptms. nevertheless, the action of these receptors also is intertwined with ptms: they are subject to regulation by ptms and they impact writers and erasers of ptms [ ] [ ] [ ] . in part, this is achieved through the synthesis of specific proteins that take part in the orchestration of proteome function through ptms. fk binding protein (fkbp) turned out to be one of these proteins which is subject of this review. the stress protein fkbp , promoted through translational research fkbp is a show-case of translational research where clinical and basic science approaches stimulated each other. the background of fkbp is laid out here only shortly, and the reader is referred to the numerous recent reviews for more detailed information [ ] [ ] [ ] [ ] [ ] [ ] . originally discovered as part of steroid receptor-heat shock protein hetero-complexes, fkbp was shown to be a potent inhibitor of gr by several laboratories [ ] [ ] [ ] [ ] . by virtue of its binding to immune suppressive drugs such as fk , fkbp also has been classified as 'immunophilin' [ ] . biochemically, fkbp is able to isomerize peptidyl-prolyl bonds [ ] ; the physiological relevance, if any, of this function is not clear [ , , ] . however, the peptidylprolyl isomerase domain of fkbp is engaged in protein interactions, and drug binding to this domain likely affects several functions of this protein [ ] . the inducibility of fkbp gene (named fkbp ) expression by the activated gr [ ] [ ] [ ] [ ] [ ] [ ] gives rise to an intracellular ultra-short negative feedback loop, as one of the hallmarks of adaptive molecular circuits [ ] . of particular interest for neuropsychiatric research was the observation that fkbp was overexpressed in squirrel monkeys featuring altered set-points of the hpa axis [ ] . thus, the molecular settings in these animals were considered a model for gr-resistance [ ] . this was related to the situation in patients suffering from depression where malfunction of gr was hypothesized to be causal for the development of the disease [ ] . based on these considerations, fkbp was included as candidate gene in an early gene association study in depression that found this gene linked to the response to antidepressant treatment [ ] . later, fkbp could be linked to additional stress-related diseases such as post-traumatic stress disorder [ , ] . these findings strongly amplified the interest in fkbp and stimulated research on its function and regulation in several laboratories; this greatly expanded the knowledge base on fkbp 's ( patho)physiological role, regulation on several levels and involvement in multiple molecular pathways, going beyond stress regulation [ , ] . the variety of its physiological functions goes along with its association with several proteins, including proteins involved in writing and erasing ptms, as detailed in the subsequent sections. more than ptms are known with a major impact in the configuration of protein networks [ ] [ ] [ ] ; the vast majority of these modifications are reversible with prominent examples being the attachment of chemical groups (e.g. phosphorylation, acetylation, methylation, nitrosylation, sulfonation), the conjugation with polypeptides (e.g. ubiquitination, nedd [neural-precursor-cell-expressed developmentally down-regulated ], and ubiquitin-like peptides such as sumo [small ubiquitin-like modifier] or atg [autophagy-related gene ]) and the addition of a complex group of molecules including, e.g. prenylation, farnesylation, glycosylation, palmitoylation, myristoylation, glutamylation, adp-ribosylation and ampylation [ , ] . protein phosphorylation is one of the first known ptm [ , ] and probably the best studied one affecting almost all biological processes [ ] . in eukaryotes, proteins are phosphorylated primarily through phosphor-ester bonds formed at the residues serine, threonine and tyrosine, and up to % of the protein-coding genes produce the enzymatic machinery governing this process [ ] . while this review focusses on the effect of fkbp on ptms of other proteins, it should be noted that fkbp also is subject to ptms itself. this has been reviewed very recently [ ] , and thus is mentioned here only briefly: not surprisingly, the first reported ptm of fkbp was phosphorylation. originally, it was inferred from the pattern of this protein in d gel electrophoresis and the modulation of this pattern by the use of kinase inhibitors or phosphatases [ ] [ ] [ ] . more recently, pten-induced putative kinase (pink ) was found to phosphorylate fkbp at yet to be mapped serine residues [ ] . thereby, pink regulates the interaction of fkbp with the kinase akt and the phosphatase phlpp [ ] . this interaction further is controlled by acetylation of fkbp at lysines and [ ] . the sirtuin sirt has been identified as deacetylase acting at these sites [ ] . sumoylation of fkbp was detected and mapped to lysine [ ] . it regulates the inhibitory action of fkbp on gr [ ] . the first evidence for the involvement of fkbp in the regulation of the phosphoproteome was provided by the observation that fk -bound fkbp inhibits the serine/threonine-phosphatase calcineurin (also known as protein phosphatase b), thereby inhibiting nuclear factor of activated t cells (nfat) [ , ] . it also has been reported that fkbp interacts with calcineurin in the absence of fk [ ] , which was not observed by others [ ] . nevertheless, impacting ptms through re-arranging protein association of kinases and phosphatases as in the case of calcineurin/nfat is the mode of action also revealed for the effect of fkbp on many other signaling pathways. the inhibition of calcineurin by fkbp is reported to also affect the nuclear factor (nf)κb pathway [ ] . in this pathway, phosphorylation of the inhibitor of κb (iκb) by the kinase of iκb (ikk) leads to activation of nfκb [ ] . therefore, the protein associations of fkbp with calcineurin as well as with ikkα and other kinases of the nfκb pathway [ ] support a model where fkbp impacts nfκb activity through re-setting phosphorylation at multiple levels with variable outcome [ ] . for example, a direct association between fkbp and both tnf receptor-associated factor (traf ) and ikkγ was found [ ] . traf catalyzes k -linked poly-ubiquitination of receptor-interacting protein (rip ) in response to tnfα, thereby facilitating the recruitment of the ikk complex to its upstream activating kinase tak (transforming growth factorbeta activated kinase ) [ , ] . fkbp enhances and shapes this polyubiquitin-mediated interaction, thereby changing the phosphorylation of ikk and iκb and thus the activity of nfκb [ ] . the serine/threonine kinase akt (also known as protein kinase b) is another well-examined example of fkbp 's impact on ptms through organizing protein complexes. akt is activated by step-wise phosphorylation in response to extracellular signals that involves its translocation from the cytoplasm to the cell membrane [ , ] . fkbp employs its ability to interact with various proteins, frequently referred to as scaffolding, to recruit the phosphatase phlpp that de-phosphorylates and thereby inactivates akt [ , ] (figure a ). akt is a central pathway regulator that inhibits apoptosis and promotes cell growth [ ] . accordingly, fkbp expression is enhanced in most cancer types and is linked to resistance to chemotherapy [ , [ ] [ ] [ ] . evidence has also been provided that fkbp mediates the inactivation of akt induced by stress [ ] . thus, fkbp may also relay the effect of stress on the phosphoproteome [ ] . fkbp 's effect on akt also alters downstream ptm-dependent pathways. for example, through akt fkbp impacts p mapk, thereby differentially regulating the transcription factors gr and peroxisome proliferator-activated receptor-γ [ ] . fkbp also governs the effects of akt on its targets beclin (becn ) and s-phase kinase-associated protein (skp ), constituting a link to autophagy as detailed below [ , , ] . [ , , ] . the recruitment of phlpp leads to lower akt phosphorylation and activity, entailing less phosphorylation of becn and of the e -ligase skp . thereby, skp is less active resulting in lower ubiquitination of becn . whether or not fkbp associates with all these proteins in one complex remains to be elucidated. another downstream target of akt is glycogen synthase kinase β (gsk β) [ ] . consistent with the inhibitory effect of fkbp on akt , it has been reported that overexpression of fkbp decreased the phosphorylation of gsk β at serine [ ] . however, it also has been found that fkbp associates with gsk β, and increases its phosphorylation [ ] . this appears to be accomplished through rearrangement of the protein heterocomplex governing phosphorylation and thus the activity of gsk β. more specifically, fkbp recruits cyclin-dependent kinase (cdk ) and furthermore associates with the three subunits of the phosphatase pp a [ ] (figure b) , which acts in concert with cdk to regulate gsk β affecting downstream targets [ ] . thus, fkbp redefines signaling pathway connections through protein associations. fkbp 's impact on protein phosphorylation furthermore provides a link to epigenetic regulation as well as to metabolic function. the link to epigenetics is evidenced by its impact on phosphorylation, and thus the activity of dna methyltransferase (dnmt ) [ ] . mechanistically, this effect appears to be achieved through the differential association of fkbp and its close homolog fkbp with cdk and its regulatory protein p [ ] . metabolic function is impacted by fkbp through protein associations that dephosphorylate and thus inhibit akt , leading to dephosphorylation and thus inhibition of as and reduced glucose uptake [ ] . ptms are also involved in the effect of fkbp on microtubule dynamics. it is assumed that phosphorylation of tau leads to its dissociation from microtubules and adoption of the trans configuration of specific peptidylprolyl bonds, while the association of phosphorylated tau with fkbp promotes its dephosphorylation and cis configuration that is required for microtubule association [ ] [ ] [ ] . another mass spectrometry-based screen for fkbp -associated proteins revealed the rho (ras homologous) gtpase-activating proteins deleted in liver cancer (dlc) and dlc as novel interaction partners [ ] . accordingly, fkbp enhances rhoa activity and signaling through the serine/threonine kinase rock (rho-associated coiled-coil containing protein kinase) along with the linked processes cell migration and invasion [ ] . the exact mechanism remains to be elucidated. it appears that this is an example for a more indirect effect of fkbp on the phosphoproteome: it inhibits a protein, dlc, that serves as a gtpase activator for members of the rho family of gtpases that regulate downstream kinases [ , ] . ubiquitination is an essential ptm in all eukaryotes [ , ] . it determines protein stability as well as protein function through changing protein-protein interaction. biochemically, ubiquitination is a process where the amino acid protein ubiquitin is covalently linked to other proteins, in most cases through the formation of an amide bond between its carboxy terminus and the ε amino group of lysine residues in the substrate proteins [ , ] . in addition to this isopeptide bond, ubiquitin forms other links to target diverse protein residues such as cysteines, serines or threonines [ ] . ubiquitination comes in the form of mono-ubiquitination and polyubiquitination, where distinct lysines of one ubiquitin serve as attachment sites for additional ubiquitin moieties. the site of linkage destines the modified protein to different functions. for example, poly-ubiquitination through the lysines at position and typically lead to degradation of the protein through the s proteasome [ , ] . the first indication that fkbp influences the ubiquitination of other proteins came from the observation that it stabilizes tau and protects it from becoming ubiquitinated [ ] . the mode of action appears to be an indirect mechanism where fkbp influences the conformation and/or phosphorylation of tau to prevent the access of ubiquitinating enzymes [ , , ] . for the differential effect of fkbp and fkbp on nfκb, it has been proposed that a mechanism is involved that is similar to pin 's promotion of the ubiquitin-mediated proteolysis of the nfκb subunit p /rela [ ] . further experimental evidence is awaited; in any case, this effect also would be indirect. the reported effect of fkbp on nfκb signaling through the association with traf and ikk is dependent on the k -poly-ubiquitination of rip , but does not appear to impact ubiquitination in this context [ ] . the e ubiquitin-protein ligase complex members traf and traf also were discovered as fkbp associating proteins, with currently unknown consequences for their enzymatic activity [ ] . more recently, a direct association of fkbp with glomulin, a regulator of the scf (skp -cul -f-box protein) e ubiquitin-protein ligase complex, has been described in detail and in comparison with other fkbps [ ] . it is likely that this interaction affects the ubiquitination activity of scf; experimental investigation of this potential impact of fkbp on the ubiquitination machinery has not been reported yet. a yeast two-hybrid screen indicated protein associations of fkbp with the ubiquitin-specific peptidases (usps) , and as well as with the e ubiquitin ligases ring finger protein and skp [ ] . while the functional consequences were not explored in this report, another study revealed that usp stabilizes fkbp by the removal of ubiquitin chains [ ] . thus, in this case, the association with a ptm executer results in fkbp being a target rather than a modifier. conversely, fkbp regulates the ubiquitination activity of skp through phosphorylation, probably by functioning as protein scaffolder for the association with phlpp and akt [ ] . this novel fkbp function on protein ubiquitination has far-reaching consequences: the study further discovered the autophagy regulator becn as novel target of skp executing k -linked poly-ubiquitination at this autophagy regulator [ ] . thus, fkbp drives autophagy involving at least two types of ptms of becn , ubiquitination and phosphorylation [ , , ] (figure c ). given the multiple targets of skp [ ] [ ] [ ] [ ] , it is assumed that the ubiquitination of more proteins will be changed by fkbp with diverse functional consequences. autophagy is another fundamental cellular process essential for protein, organelle and energy homeostasis [ ] . it involves several atg products that through several steps form autophagosomes, vesicles that engulf material destined for degradation which is accomplished upon the fusion with lysosomes [ , ] . an important step in this process is the lipidation of atg (also known as microtubule associated protein light chain beta), a ubiquitin-like protein that is integrated into the autophagosomal membrane upon formation of an amide bond the stress protein fkbp is intertwined with gr as both inhibitor and target. as target, it has the potential to relay the stress response to downstream pathways through the association with writers and erasers of ptms. known associations include de-ubiquitinases, ubiquitinases, protein kinases and protein phosphatases (first box, writers and erasers of ptms are grouped according to the type of their biochemical activity). some of the associated proteins are changed in their activity, others are redirected to certain targets. examples of target proteins affected by the altered activities of ptm writers and erasers are provided in the box below. '+' and '−' after the protein names indicate the overall effect of fkbp on the activity of associating proteins or downstream proteins. not all potential interactions are displayed. becn is a downstream protein that also forms a complex with fkbp . however, most of the downstream target proteins are indirectly affected in the sense that they were not shown to associate with fkbp . with phosphatidylethanolamine [ , ] . similar to the process of ubiquitination, the formation of this amide bond is executed by a set of ligases (e -e ) that use cysteine linked thioesters as intermediates. the last step, conjugation to phosphatidylethanolamine, is mediated by the e -like atg -atg :atg complex [ ] . the effect of fkbp on this protein lipidation probably is indirect through driving autophagy by regulating becn . however, fkbp also leads to enhanced levels of atg , a member of the conjugation system [ ] , pointing to different pathways used by fkbp to enhance atg lipidation. • fkbp is receiving increased attention for its pivotal role in research on stress and stressrelated diseases, but also several additional fields such as immunology, metabolism, oncology, neurology, etc. • the stress protein fkbp is engaged in various signaling pathways through diverse protein interactions. it both affects the stress response and is affected by the stress response, and furthermore relays stress to multiple pathways. likewise, it is the subject of ptms and affects the activity of ptm writers and erasers ( figure ). • protein associations of fkbp with ptm writers and erasers could affect ptms of fkbp itself or of other proteins, and deciphering these scenarios will significantly contribute to our mechanistic understanding of this versatile protein. furthermore, it will be of high interest to elucidate which of the ptm-mediated downstream effects of fkbp contribute to the physiological stress reaction. the author declares that there are no competing interests associated with this manuscript. atg, autophagy-related gene; becn , beclin ; cdk, cyclin-dependent kinase; dlc, deleted in liver cancer; dnmt , dna methyltransferase ; fkbp, fk binding protein; gr, glucocorticoid receptor; gsk, glycogen synthase kinase; hpa axis, hypothalamic-pituitary-adrenocortical axis; iκb, inhibitor of κb; ikk, kinase of iκb; mapk, mitogen-activated protein kinase; nfκb, nuclear factor kappa b; nfat, nuclear factor of activated t cells; phlpp, pleckstrin homology domain leucine-rich repeat protein phosphatase; pp a, protein phosphatase a; pparγ, peroxisome proliferator-activated receptor-γ; ptm, post-translational modification; rip , receptor-interacting protein ; scf, skp -cul -f-box protein; skp , s-phase kinase-associated protein ; tnf, tumor necrosis factor; traf, tnf receptor-associated factor; usp, ubiquitin-specific peptidase. the corticotropin-releasing factor family: physiology of the stress response molecular pathways: beta-adrenergic signaling in cancer principles of ubiquitin-dependent signaling orchestrating the proteome with post-translational modifications adaptive posttranslational control in cellular stress response pathways and its relationship to toxicity testing and safety assessment mechanisms of stress in the brain post-translational modifications of the mineralocorticoid receptor: how to dress the receptor according to the circumstances? role of phosphorylation in the modulation of the glucocorticoid receptor's intrinsically disordered domain glucocorticoid receptor control of transcription: precision and plasticity via allostery fkbp and fkbp in signaling and disease fk binding protein integrates pathways of adaptation: fkbp shapes the reactivity to environmental change the fkbp glucocorticoid receptor co-chaperone: regulation, function, and implications in health and disease the many faces of fkbp hsp and fkbp : complex regulators of psychiatric diseases biological actions of the hsp -binding immunophilins fkbp and fkbp fk -binding proteins and differentially regulate dynein interaction and nuclear translocation of the glucocorticoid receptor in mammalian cells the hsp -binding peptidylprolyl isomerase fkbp potentiates glucocorticoid signaling in vivo squirrel monkey immunophilin fkbp is a potent inhibitor of glucocorticoid receptor binding overexpression of the fk -binding immunophilin fkbp is the common cause of glucocorticoid resistance in three new world primates functional analysis of the hsp -associated human peptidyl prolyl cis/trans isomerases fkbp , fkbp and cyp ) peptidylprolylisomerases, protein folders or scaffolders? the example of fkbp and fkbp tissue distribution and abundance of human fkbp , and fk -binding protein that can mediate calcineurin inhibition isolation and characterization of glucocorticoid-and cyclic amp-induced genes in t lymphocytes intronic hormone response elements mediate regulation of fkbp by progestins and glucocorticoids glucocorticoid-resistant b-lymphoblast cell line derived from the bolivian squirrel monkey (saimiri boliviensis boliviensis) allele-specific fkbp dna demethylation mediates gene-childhood trauma interactions glucocorticoid receptor activates poised fkbp locus through long-distance interactions glucocorticoid hormone resistance during primate evolution: receptor-mediated mechanisms the new world primates as animal models of glucocorticoid resistance the corticosteroid receptor hypothesis of depression polymorphisms in fkbp are associated with increased recurrence of depressive episodes and rapid response to antidepressant treatment posttranslational modifications the roles of post-translational modifications in the context of protein interaction networks post-translational modifications in signal integration uniprot: the universal protein knowledgebase conversion of phosphorylase b to phosphorylase a in muscle extracts protein-tyrosine kinases exploring the function of dynamic phosphorylation-dephosphorylation cycles evolution of protein kinase signaling from yeast to man regulation of fkbp and fkbp functions by post-translational modifications molecular cloning of human fkbp and comparisons of immunophilin interactions with hsp and progesterone receptor the -kda heat-shock protein (hsp )-binding immunophilin fkbp is a mitochondrial protein that translocates to the nucleus to protect cells against oxidative stress pink regulates fkbp interaction with akt/phlpp and protects neurons from neurotoxin stress induced by mpp + regulation of serine-threonine kinase akt activation by nad(+)-dependent deacetylase sirt . cell rep the activity of the glucocorticoid receptor is regulated by sumo conjugation to fkbp fkbp , a novel t-cell-specific immunophilin capable of calcineurin inhibition comparative analysis of calcineurin inhibition by complexes of immunosuppressive drugs with human fk binding proteins calcium-and fk -independent interaction between the immunophilin fkbp and calcineurin overexpression of fkbp in idiopathic myelofibrosis regulates the growth factor independence of megakaryocyte progenitors regulation and function of ikk and ikk-related kinases a physical and functional map of the human tnf-α/ nf-κb signal transduction pathway fkbp employs both scaffold and isomerase functions to promote nf-κb activation in melanoma regulation of the nf-κb-inducing kinase by tumor necrosis factor receptor-associated factor -induced degradation nf-κb-inducing kinase and iκb kinase participate in human t-cell leukemia virus i tax-mediated nf-κb activation aktivation mechanisms the pi k pathway in human disease fkbp affects cancer cell response to chemotherapy by negatively regulating akt association of fkbp with priming of autophagy pathways and mediation of antidepressant treatment response: evidence in cells, mice, and humans akt/pkb signaling: navigating downstream role of fk -binding protein in the control of apoptosis of irradiated melanoma cells hsp -binding immunophilin fkbp forms complexes with htert enhancing telomerase activity fkbp immunohistochemical expression: a new prognostic biomarker for oscc? fkbp controls cellular adipogenesis through p kinase-mediated phosphorylation of grα and pparν fkbp /fkbp enhances autophagy to synergize with antidepressant action skp attenuates autophagy through beclin -ubiquitination and its inhibition reduces mers-coronavirus infection inhibition of glycogen synthase kinase- by insulin mediated by protein kinase b fkbp inhibits gsk β and augments the effects of distinct psychotropic medications mice lacking phosphatase pp a subunit pr /b δ (ppp r d) develop spatially restricted tauopathy by deregulation of cdk and gsk β chaperoning epigenetics: fkbp decreases the activity of dnmt and mediates epigenetic effects of the antidepressant paroxetine stress-responsive fkbp regulates akt -as signaling and metabolic function the hsp cochaperone, fkbp , increases tau stability and polymerizes microtubules organization and function of the fkbp and fkbp genes bending tau into shape: the emerging role of peptidyl-prolyl isomerases in tauopathies fkbp regulates cell motility and invasion via rhoa signaling rhoa-rock signaling as a therapeutic target in traumatic brain injury the ubiquitin system protein modifications: beyond the usual suspects' review series cellular quality control by the ubiquitin-proteasome system and autophagy accelerated neurodegeneration through chaperone-mediated oligomerization of tau nf-κb transcriptional activity is modulated by fk -binding proteins fkbp and fkbp : a role for peptidyl-prolyl isomerase activity mitochondria-nucleus shuttling fk -binding protein interacts with traf proteins and facilitates the rig-i-like receptor-mediated expression of type i ifn fkbp and fkbp . -novel and tight interactors of glomulin a quantitative chaperone interaction network reveals the architecture of cellular protein homeostasis pathways usp negatively regulates tumorigenesis and chemoresistance through fkbp -akt signaling skp is required for ubiquitin-mediated degradation of the cdk inhibitor p expression of skp , a p (kip ) ubiquitin ligase, in malignant lymphoma: correlation with p (kip ) and proliferation index kip meets skp : new links in cell-cycle control dysregulated expression of skp and its role in hematological malignancies autophagy: renovation of cells and tissues dynamics and diversity in autophagy mechanisms: lessons from yeast the atg conjugation system is indispensable for proper development of autophagic isolation membranes in mice atg controls phagophore expansion during autophagosome formation atg systems from the protein structural point of view key: cord- - cwb g h authors: shaw, shyh-yu; laursen, richard a.; lees, marjorie b. title: analogous amino acid sequences in myelin proteolipid and viral proteins date: - - journal: febs lett doi: . / - ( ) - sha: doc_id: cord_uid: cwb g h computer analysis of the intrinsic membrane protein, myelin proteolipid, shows strong sequence similarities between the putative extramembrane segments of the proteolipid protein and a number of viral proteins, several of which infect humans. these similarities are even more striking than those reported previously between viral proteins and the encephalitogenic myelin basic protein (mbp). these findings, along with other reports of molecular mimicry by viruses, suggest that immunological cross‐reactions between virus‐induced antibodies or t‐cells and analogous antigenic determinants (epitopes) in myelin proteolipid could be involved in the pathophysiology of multiple sclerosis or post‐infectious demyelinating syndromes. current evidence suggests that the etiology of multiple sclerosis and other demyelinating diseases involves a combination of viral and autoimmune factors [ ] . a particularly well studied model for multiple sclerosis is eae which can be induced by immunization of laboratory animals with cns tissue or with mbp, one of the major myelin proteins [ ] . recently it has been reported that mbp shows homology with certain viral proteins [ ] . furthermore, immunization with peptides having regions common to mbp and hepatitis b virus dna polymerase showed histological eae in rabbits [ ] . however, it has long been recognized that components other than mbp contribute to the encephalitogenic response to cns tissue [ ] . * to whom correspondence should be addressed abbreviations: mbp, myelin basic protein; eae, experimental allergic encephalomyelitis; cns, central nervous system; mlv, murine leukemia virus; aids, acquired immune deficiency syndrome; htlv-iii/lav, human t-lymphotropic retrovirus the most abundant protein of cns myelin is myelin proteolipid protein, which is embedded in the myelin membrane. its amino acid sequence has been reported recently [ , ] , and two different models have been proposed for its conformation in the myelin membrane [ , ] . immunization of rabbits with the proteolipid apoprotein leads to the development of a chronic, progressive or relapsing form of eae [ ] . however, the epitopes in myelin proteolipid which canse the encephalitogenic response are not known. in the present study we compared decapeptide sequences from proposed [ , ] extramembrane hydrophilic regions of myelin proteolipid with various viral proteins by computer analysis, with the aim of searching for similarities, such as found between mbp and viral proteins, that might shed light on the origin of demyelinating diseases. decapeptides were chosen for comparison with the extramembrane segments of the proteolipid protein, the rationale being that decapeptides are more than sufficient to induce an immune response [ ] and that epitopes of membrane proteins are generally located at extramembrane hydrophilic regions. comparisons and statistical scorings were made using the database (release . of august , containing sequences) and programs of the protein identification resource of the national biomedical research foundation. the search program compared every decapeptide in bovine brain myelin proteolipid with every decapeptide in the database. the calculation of similarity scores was accomplished using the mutation data matrix [ ] program. for each proteolipid decapeptide, comparisons were made and the highest scoring segments and their standard deviations were printed out. we selected those viral decapeptides, similar to extramembrane hydrophilic segments of myelin proteolipid, with scores higher than , and which also exceed the average by standard deviations; i.e., the probability that the match is due to random chance is < x - . a vax / computer was used for searching and calculations. sequence alignments with scores of or greater were observed between proteolipid and viral decapeptides, excluding identities seen with proteins from similar strains of virus. these include decapeptides from a variety of types of viruses, such as adenoviruses and epstein-barr, influenza and measles viruses. only those with a score of or greater, or which show at least % identity are shown in table . the degree of similarity between proteolipid and viral proteins is significantly greater than that observed [ ] between mbp and viral proteins. furthermore, although only % of the proteins in the protein sequence database are viral proteins, they account for over % of the similarities. it is interesting and perhaps significant that many of the strong similarities lie between residues and in the proteolipid. this region is predicted (shaw, s.-y., unpublished) to have a strong tendency to form an amphipathic helix (hydrophobic on one side and polar on the other), and thus might be regarded as a prime candidate for an antigenic site. likewise, many similarities are found with the n-terminus of the proteolipid, a region which has also been predicted to be an amphipathic helix [ l. the two proposed models [ , ] for the organization of the proteolipid in the myelin membrane are table comparison of myelin proteolipid decapeptide sequences with those of certain viral proteins all of the peptides shown have alignment scores of or greater and/or have at least identities. amino acids which are identical with the corresponding residues in the proteolipid are underlined similar in that they depict the protein as threading through the lipid bilayer several times with the polar segraents external to the bilayer; they differ, however, in the sidedness of some of the extramembrane segments. while both models put residues - and - on the extracellular face of the membrane, the model of laursen et al. [ ] puts residues - on the extracellular face and - on the cytoplasmic face, whereas the model of stoffel et al. [ ] predicts the opposite orientation for the latter two segments. regardless of which (if either) model is correct, similarities between viral proteins and proposed extracellular and cytoplasmic segments of the proteolipid are seen. it has been proposed [ , ] that eae-like diseases can arise when virus-evoked antibodies and/or sensitized t-cells cross-react with homologous amino acid sequences (epitopes) in mbp. however, since mbp is located entirely on the cytoplasmic face of the membrane and is therefore within the oligodendroglial cell, it would be expected to be relatively inaccessible to immune surveillance. on the other hand, the proteolipid has potential antigenic sites not only on the cytoplasmic face but also on the outer surface of the membrane. recognition of these sites by antivira[ antibodies or sensitized cells could induce a cascade of immunological events leading to cell destruction. in the process, mbp would be released, which might result in further inflammatory consequences. the similarities between mbp and virus proteins have been discussed [ , ] previously in terms of 'homology', which implies a common ancestor for the proteins. when comparing myelin proteins with viral proteins, it is difficult to make a compelling case for homology, since the similarities fall off after a few residues. while it is not uncommon to encounter five or six identical residues in a stretch of ten (see table ), extended regions of similar sequence are encountered only rarely. for example, although a % identity is seen between the first residues of the proteolipid and the late kda protein of adenovirus or , i.e., gia.e('ca r('i v(;ap.i asi.va illll ii ! ii~ adenovirus ( - ) gi.i.i;('hcr('ni_(.'tpi, trsi.vc, more likely that the virus and myelin proteins are 'analogous', i.e., that similar portions arose by convergent evolution to a common sequence or secondary structure. this idea is consistent with the well-known propensity of viruses to undergo rapid mutation, particularly of their surface antigenic sites [ ] . the tryptophan containing region of mbp has long been recognized as being highly encephalitogenic [ , ] . in particular, the nonapeptide, phe-ser-trp-gly-ala-glu-gly-gin-lys, has been shown to be highly effective in inducing eae in guinea pigs, and studies with peptide analogs indicate that the essential features of an encephalitogenic determinant are x-x-trp-x-x-x-x-gln/asn-lys/arg. the proteolipid contains two regions that have a trp and a lys residue separated by residues, but otherwise they are not remarkably similar to the encephalitogenic mbp peptide. it is interesting, however, that one of the tryptophans (trp- ) occurs in a region that shows several similarities with viral proteins (table ). in the course of our analysis, we also found a strong similarity, not previously reported [ ] , between the encephalitogenic mbp protein and the gag-pol polyprotein of murine leukemia virus it would not be surprising if antibodies against the mlv peptide were found to cross-react with mbp. there are now at least two reports of mimicry of normal human proteins by viruses: mbp by hepatitis b virus polymerase [ ] and c~-thymosin by the aids virus htlv-iii/lav [ ] . in addition pruijn et al. [ ] have reported that sera from patients suffering from autoimmune diseases can inhibit adenovirus dna replication, suggesting a similar phenomenon. the similarities between viral proteins and myelin proteolipid may represent yet another case. in this instance, it seems likely that portions of the protein which are potential antigenic determinants are located on the outer surface of the cell membrane, making it more understandable how the cell membrane could come under immunological attack. the similarity does not extend further. it seems viral infections of the nervous system experimental allergic encephalomyelitis: a useful model for multiple sclerosis hoppc seyler's proc. natl. acad. sci. usa proc. natl. acad. sci. usa this work was supported by grants from the national institutes of health (ns- and ns ) and national science foundation (dmb - ). key: cord- - caeui authors: seay, montrell; dinesh-kumar, savithramma; levine, beth title: digesting oneself and digesting microbes: autophagy as a host response to viral infection date: journal: modulation of host gene expression and innate immunity by viruses doi: . / - - - _ sha: doc_id: cord_uid: caeui although research in this area is still in a stage of infancy, it seems likely that the lysosomal degradation pathway of autophagy plays an evolutionarily conserved role in antiviral immunity. the interferon-inducible, antiviral pkr signaling pathway positively regulates autophagy, and both mammalian and plant autophagy genes restrict viral replication and protect against virus-induced cell death. given this role of autophagy in innate immunity, it is not surprising that viruses have evolved numerous strategies to inhibit host autophagy. different viral gene products can either modulate autophagy regulatory signals or directly interact with components of the autophagy execution machinery. moreover, certain rna viruses have managed to “co-apt” the autophagy pathway, selectively utilizing certain components of the dynamic membrane rearrangement system to promote their own replication inside the host cytoplasm. in addition to this newly emerging role of autophagy in innate immunity, autophagy plays an important role in many other fundamental biological processes, including tissue homeostasis, differentiation and development, cell growth control, and the prevention of aging. accordingly, the inhibition of host autophagy by viral gene products has important implications not only for understanding mechanisms of immune evasion, but also for understanding novel mechanisms of viral pathogenesis. it will be interesting to dissect the role of viral inhibition of autophagy in acute, persistent, and latent viral replication, as well as in the pathogenesis of cancer and other medical diseases. the cellular pathway of autophagy is as ancient as the origins of eukaryotic life. derived from the greek and meaning to eat ("phagy") oneself ("auto"), the term autophagy refers to a lysosomal pathway of selfdigestion, involving dynamic membrane rearrangement to sequester cargo for delivery to the lysosome, where the sequestered material is degraded and recycled. for decades, it has been known that autophagy is the primary intracellular catabolic mechanism for the degradation and recycling of longlived cellular proteins and organelles. for decades, it has also been known that the recycling function of autophagy is an important adaptive response to nutrient deprivation and other forms of environmental stress. however, only recently have we discovered that autophagy may also be an important mechanism for the degradation of intracellular pathogens and that autophagy may also be important in cellular protection against the stress of microbial infection. not surprisingly, we have also recently learned that some successful intracellular pathogens have devised strategies either to block host autophagy or to subvert the host autophagic process to foster their own replication. in this chapter, we will review recent progress in understanding the interrelationships between viruses, autophagy, and innate immunity (figures & ) . before discussing the interrelationships between viruses, autophagy, and innate immunity, we will provide a brief overview of the molecular and cell biology of autophagy. while this subject has been covered extensively in a recent book and numerous recent review articles - we will highlight the aspects of this subject that may have particular relevance for viral infections. r r the process of autophagy was first described more than forty years ago, however, for many decades our understanding of autophagy was based largely on morphological observations from electron microscopy (reviewed in ). the field has expanded considerably within the last years after the cloning and molecular characterization of the yeast autophaggy (atg)related genes (reviewed in ) . the analysis of sequenced genomes of higher eukaryotes has identified atg homologues in mammals, c. elegans, drosophila, dictyostelium, and plants, and many of these genes have been shown to be essential for autophagy function in higher eukaryotes (reviewed in ) ( table ). in addition to the identification of the autophagy genes, significant progress has been made in the past decade in understanding some of the signaling events that regulate autophagy (reviewed in , ) . interestingly, some of the signaling molecules that regulate autophagy, as well as some of the autophagy genes, play a role in the host antiviral innate immune response (table , figure ). the initial step of autophagy is the formation and elongation of the isolation membrane. the isolation membrane invaginates and sequesters cytoplasmic constituents including mitochondria, endoplasmic reticulum (er) and ribosomes, and the edges of the membrane fuse with each other to form a double-membrane structure called an autophagosome. the outer membrane of the autophagosome fuses to the lysosome/vacuole with subsequent delivery of the inner vesicle or autophagic body into the lumen of the degradative compartment. the source of the autophagosomal membrane is still unclear, but presently, it is thought that the preautophagosomal structure (pas) acts as the site of vesicle formation during autophagy [ ] [ ] [ ] . the pas is thought to form de novo, but the source of the vesicle membrane is not known. it seems likely that the "typical" autophagosomes observed during viral infection that contain a mix of virions and self-cytoplasmic constituents, originate from the pas. however, it is not yet known whether the pas also serves as the site of vesicle formation for the formation of "atypical" autophagic-like double-membrane vesicles that function as replication sites for certain rna viruses (e.g. poliovirus, mouse hepatitis virus, equine arterivirus) (reviewed in ). more likely, these double-membrane vesicles arise directly from the endoplasmic reticulum (er) . autophagosomes are lipid-rich, protein-poor vesicles that vary in size and membrane thickness depending on the organism and cell type. the composition and abundance of proteins sequestered within autophagosomes reflects the relative composition and abundance of proteins in the surrounding cytoplasm . this observation has led to the concept that autophagosomes indiscriminately sequester cytoplasmic content. however, in yeast, there are well-established pathways of specific autophagy, including the biosynthetic cytoplasm-to-vacuole targeting pathway and pexophagy (reviewed in ), and in mammalian cells, mitochondria-specific autophagy has been reported . although molecular determinants of cargo recognition have been identified in yeast pathways of specific autophagy, virtually nothing is known about the specificity of cargo recognition in higher eukaryotes. in circumstances where there is degradation of viruses observed inside "typical" autophagosomes that also contain cellular constituents, the sequestration step may lack specificity. however, in circumstances where viruses utilize components of the autophagic machinery for the formation of "autophagic-like" double-membrane structures that exclusively contain viral constituents, the sequestration step is likely to have exquisite specificity. the identification of the viral and cellular determinants of this specificity will be an important advance in understanding the cell biology of these types of rna virus infections and may eventually lead to the identification of novel antiviral therapeutic targets. autophagy is tightly regulated by nutritional, hormonal, and other environmental cues. it occurs as a cellular response to extracellular stimuli (e.g. nutrient starvation, hypoxia, overcrowding, high temperature, hormonal or chemotherapeutic treatment), and intracellular stimuli (e.g. accumulation of damaged, superfluous or unwanted organelles, accumulation of misfolded proteins, invasion of microorganisms). although it is not yet known whether different stimuli act through parallel, convergent, or divergent pathways to trigger autophagy, significant progress has been made within the past decade in identifying different signaling molecules that function in the positive (e.g. eif kinases, class iii pi- kinases, pten, death-associated protein kinases) or negative (e.g. tor, insulin-like growth factor signals, class i pi- kinase, rho/ras family of gtpases) regulation of autophagy (reviewed in , ) . the identification of a role for these signaling molecules in autophagy regulation has implications for understanding antiviral immunity, and more speculatively, generates hypotheses about novel principles of virus-host interactions. the recently defined evolutionarily conserved role of the eif kinase signaling pathway in autophagy induction suggests that autophagy regulation may contribute to the antiviral function of the interferon-inducible eif kinase, pkr. pkr and other eif kinases induce a general translational arrest by phosphorylating the serine residue of eif (reviewed in ) . genetic studies in yeast and mammalian cells have also shown that the eif kinase signaling pathway is required for starvation and herpes simplex virus-induced autophagy . while further analyses are required to dissect the relative contributions of autophagy induction vs. translational arrest in mediating the antiviral effects of pkr, these findings link a new cellular function (i.e. autophagy) with interferon signaling. although the eif kinase signaling pathway is the only as-of-yet defined autophagy regulatory signaling pathway that has known antiviral functions, it is interesting to note that most autophagy regulatory signals play a role in other important cellular processes, including cell growth control, cell death, and aging. some of these effects may be the consequence of divergent downstream targets of these regulatory signals and some of these effects may be directly mediated through autophagy. as will be discussed below, given the evidence for a role of autophagy in innate immunity, it is likely that viruses have evolved different strategies to antagonize host f f autophagy, which, at least in the case of herpes simplex virus (see sections . and ) include the targeting of upstream autophagy regulatory signals f . therefore, it is tempting to speculate that some of the effects of viruses on cell growth control and cell death may be either direct or indirect consequences of the evolutionary pressure that viruses face to modulate host autophagy. as one example, the insulin-like/class i pi- k/akt signaling pathway inhibits autophagy , , promotes oncogenesis (reviewed in ) , and decreases lifespan (most likely through autophagy-inhibitory effects) . certain retroviruses have recruited the catalytic subunit of pi -k and its downstream target akt and these viral gene products function as oncoproteins (reviewed in ) . the emerging link between these signaling molecules and autophagy inhibition raises the interesting hypothesis that the initial acquisition of these molecules by viruses was perhaps related to the selective advantage of autophagy inhibition in viral growth. in view of recent evidence supporting a role of autophagy in tumor suppression [ ] [ ] [ ] [ ] , the presence of these genes in retroviral genomes could contribute to oncogenesis at least, in part, through inhibition of autophagy signaling, as well as through modulation of other downstream pathways. the atg genes encode proteins important for responding to upstream signaling pathways as well as proteins needed for the generation, maturation, and recycling of autophagosomes (reviewed in , , , ). the atg proteins can be grouped into four functional groups, including a protein kinase cascade important for responding to upstream signals, a lipid kinase signaling complex important for vesicle nucleation, ubiquitin-like conjugation pathways important for vesicle expansion, and a recycling pathway important for the disassembly of atg protein complexes from matured autophagosomes. the role of some of the atg proteins, including ones that act in the lipid kinase signaling complex and in the ubiquitin-like conjugation pathways, has been studied in plant and mammalian viral infections (see table ). autophagy is a dynamic process that is tightly regulated by protein kinases and phosphatases. one of the first atg genes identified in yeast, atg , encodes a serine/threonine kinase . the atg kinase maintains a weak interaction with a hyperphosphorylated atg protein, atg , in nutrientrich conditions. upon starvation conditions or stress, atg is dephosphorylated resulting in a tighter association with atg . atg binding is essential for autophagy since atg mutants unable to bind to atg are completely defective in autophagy . atg , which is also thought to play a role in atg activation, also interacts with atg . downstream targets of atg have not been identified, although atg interacts with other proteins independently of its kinase activity . the upstream kinase, tor (target of rapamycin), indirectly or directly results in atg hyperphosphorylation, which is one presumptive mechanism by which tor kinase inhibits autophagy. of note, the atg component of the yeast autophagy induction complex plays a conserved role in autophagy in higher eukaryotes. however, as-of-yet, the role of atg in antiviral immunity has not been evaluated. lipid kinase signaling vps , which encodes a phosphatidylinositol- kinase (pi -k), d d phosphorylates the ' hydroxyl group inosotiol ring of phosphoinositides u u . although there is only one pi -k in yeast, there are three classes of pi -k in higher eukaryotes; class iii pi -k has been shown to be analogous to yeast vps . the importance of class iii pi -k signaling in autophagy has been demonstrated pharmacologically and genetically. the nucleotide derivative, -methyladenine, inhibits class iii pi -k activity and is widely used to inhibit autophagosome formation in mammalian cells , . a null mutation in vps causes defects in autophagosome formation in yeast , and microinjection of an inhibitory antivps antibody blocks autophagy in cultured mammalian cells . vps functions through the association with other atg proteins in a large complex that includes vps , atg /vps and atg . this complex is thought to be important in vesicle nucleation by mediating the localization of other atg proteins at the pas , . vps and atg /vps are conserved in higher eukaryotes. importantly, the mammalian (beclin ) and plant (beclin ( ( ) homologues of yeast atg /vps have been the most extensively studied atg genes in viral infections. as will be discussed in more detail below, mammalian beclin restricts viral replication, protects against virus-induced cell death, and is a target of inhibition by different virally-encoded gene products [ ] [ ] [ ] . furthermore, both plant beclin and its binding partner, class iii pi -k/vps prevent the spread of programmed cell death during the plant antiviral hypersensitive response . thus, the lipid kinase complex plays an evolutionarily conserved role in antiviral innate immunity. autophagic vesicle expansion and completion involves conjugation machinery analogous to the ubiquitin conjugation needed for proteasomemediated protein degradation. autophagy utilizes an e -like enzyme (atg ), two e -like enzymes (atg and atg ) that facilitate the conjugation, and activation and localization of different ubiquitin-like modifiers (atg and t atg ). the conjugation modification of atg proteins is necessary for the formation of an autophagosome of appropriate size and shape . however, the precise molecular functions of the conjugation reactions are not known and remain a critical unanswered question in autophagy research. the first conjugation system involves the lipidation of atg , a ubiquitinlike protein whose close mammalian homologues have three-dimensional structures very similar to ubiqutin [ ] [ ] [ ] [ ] . both atg and the mammalian homomlogue lc are cleaved post-translationally by the cysteine endopeptidase atg , . the cleavage of atg /lc is essential for conjugation and further maturation of the autophagosomes . in yeast and mammalian systems, the cleaved atg /lc is immediately activated by atg , an e -like enzyme; transferred to atg , an e -like enzyme; and finally conjugated to the lipid molecule phosphatidylethanolamine (pe) , , , . the second ubiquitin-like reaction is the conjugation of atg to atg . atg is an ubiquitin-like protein that is activated by atg (e -like enzyme), transferred to atg (e -like enzyme), and subsequently conjugated to atg through an isopeptide bond , . the conjugation of atg to atg is necessary for autophagosome formation but not necessary for localization to the pas . almost all of the components of the autophagy machinery that participate in the protein conjugation systems have orthologs in at least some higher eukaryotes. however, only mammalian atg has been studied in the context of its role in viral infections. the contrasting phenotypes of atg null cells infected with two different rna viruses illustrate two distinct mechanisms by which viruses interact with the autophagic machinery. the murine coronavirus, mouse hepatitis virus, which replicates in association with double-membrane vesicles, has severely impaired growth in atg null embryonic stem (es) cells , suggesting that atg is required for the formation of coronavirus replication complexes. in contrast, the prototype alphavirus, sindbis virus, replicates to higher titers in atg null murine embryonic fibroblasts (mefs) than in wildtype controls , suggesting that the autophagic machinery functions to restrict sindbis virus replication. in yeast, atg proteins that act at the stage of vesicle formation are not associated with the completed autophagosome, with the exception of atg . this suggests that atg proteins are retrieved at some point prior to, or upon, vesicle completion, and then reutilized in the generation of new autophagosomes. the process of recycling requires the action of atg and atg , which allow the recycling of atg , the only transmembrane protein that is part of the autophagic machinery , . atg and atg have orthologues in higher eukaryotes, but their function in antiviral responses has not been studied. the unique association of atg /lc with the mature autophagosome has led to an important technical advance in autophagy research. atg /lc is presently the most widely used and reliable marker for labeling autophagosomes , , , , and with the recent availability of transgenic mice that express gfp-tagged lc , it is now possible to study autophagy induction in vivo during viral infections. in addition to its emerging role in innate immunity, autophagy plays a role in diverse other biological processes, including survival during starvation, differentiation and development, tissue homoeostasis, aging, cell growth control, and certain forms of programmed cell death. these biological functions of autophagy have been reviewed in detail elsewhere , , . in this section, we will however, briefly discuss selected biological functions of autophagy that have relevance either to understanding the mechanisms by which autophagy protects cells against virus infections ( figure ) or to understanding the potential consequences for the host of viral evasion of autophagy ( figure ). perhaps the primordial function of autophagy is its ability to recycle nutrients and help sustain life during periods of starvation. several decades ago, starvation was noted to be a potent inducer of autophagy in rodent liver (reviewed in ), leading to the hypothesis that autophagy is an adaptive response to starvation. following the identification of the conserved autophagy genes, genetic studies in different species have confirmed that autophagy genes are required for the maintenance of eukaryotic life in the face of limited environmental nutrient supply. this principle was first demonstrated in yeast, i.e. all atg gene mutant yeasts grow normally in nutrient rich conditions, but unlike wild-type yeasts, die rapidly during carbon or nitrogen starvation . similarly, dictyostelium discoideum that lack atg genes also grow normally in the presence of their food, nonpathogenic bacteria, but die rapidly when subjected to starvation , . during nitrogen or starvation, atg and atg mutant plants display two phenotypes that are thought to result from a defective ability to mobilize nutrients through autophagic delivery, including enhanced chlorosis (yellowing of leaves due to a loss of chlorophyll) and accelerated leaf senescence , . in addition, mammalian cells deleted of the autophagy genes, atg or beclin , also undergo accelerated cell death in response to starvation as compared to their wild-type counterparts . the pro-survival function of autophagy during starvation is thought to be related directly related to its ability to recycle nutrients to generate a sufficient pool of amino acids required for the synthesis of essential proteins. while the eif kinase signaling pathway shuts off general translation during starvation, at least in yeast, gcn signaling simultaneously stimulates the transcription of essential starvation response genes, including autophagy genes , . thus, this signaling pathway provides a coordinated method to effectively generate new amino acids by autophagy and redirect the host cell synthetic machinery to use its limited amino acid supply specifically for the synthesis of essential starvation response proteins. although a downstream transcription factor like yeast gcn (which is downstream of yeast eif and transcriptionally transactivates autophagy genes), has not yet been identified for mammalian pkr signaling initiated during virus infection, it seems likely that there are functionally homologous molecules that direct virus-infected cells to mount an adaptive and selective transcriptional and translational response during virus infection. even in the absence of this postulated arm of pkr signaling, the mere recycling of nutrients in virus-infected cells would be predicted to have a beneficial function for the host. although few studies have compared cellular amino acid pools during nutrient starvation and virus infection, acute viral replication involves the parasitism of not only the host cell's translational machinery, but also the host cell's translational building blocks. therefore, it seems likely that acute viral replication induces what can be thought of as a state of "pseuodostarvation". according to this model, the prediction is that the nutrient recycling function of autophagy plays a similar protective function during viral infection as it plays during nutrient deprivation. differentiation and development both require cells to undergo significant phenotypic changes and must entail a mechanism for the breakdown and recycling of obsolete cellular components. genetic studies have revealed an essential role for components of the autophagic machinery in differentiation and developmental processes in several different organisms, including sporulation in yeast, multicellular development in dictyostelium, dauer development in c. elegans, and embryonic development in mice (reviewed in ). in addition, the mammalian autophagy gene, beclin , appears to play a role in epithelial cell differentiation, since the mammary glands in beclin heterozygous-deficient mice display striking morphological abnormalities . since viral gene products can inhibit the autophagy function of beclin (see below) and potentially other autophagy proteins , it is possible that autophagy blockade represents a mechanism by which viruses can affect cellular differentiation. for example, the bcl- -like bhrf protein encoded by ebv binds to beclin blocks its autophagy function , and also perturbs epithelial cell differentiation . further studies are needed to determine the role of beclin binding in the perturbation of epithelial cell differentiation by bhrf , as well as to investigate the effects of other viral inhibitors of autophagy on cellular differentiation and multicellular development. many different viruses, including retroviruses, gammaherpesviruses, papillomaviruses, and hepatitis viruses are oncogenic. studies of the mechanisms of viral oncogenesis have largely focused on the ability of viruses to alter mitogenic signaling, cell cycle regulation, and/or apoptosis. however, new evidence is emerging that autophagy plays a role in tumor suppression and that autophagy is antagonized by gene products encoded by certain oncogenic viruses. accordingly, it will be important to evaluate the role of viral inhibition of autophagy in viral oncogenesis. normal cell growth requires a well-coordinated balance between the cell's biosynthetic machinery (e.g. protein synthesis and organelle biogenesis) and its degradative processes (e.g. protein degradation and organelle turnover). in the 's, it was first proposed that protein catabolism through autophagy is a major determinant of cell growth , . according to this model, both cell mass and the rate of cell growth is a balance between the amount of protein synthesized and the amount of autophagic protein degradation. although this model has received little attention in recent years, interest in the role of autophagy in cell growth control has reemerged in light of new biochemical and genetic links between autophagy and the negative regulation of tumorigenesis. as stated above in section . , several different oncogenic signaling molecules, including members of the insulin signaling pathway (e.g. class i pi- k, akt) and members of the rho and ras family of gtpases negatively regulate autophagy in mammalian cells and the pten tumor suppressor positively regulates autophagy (reviewed in ). furthermore, the autophagy inhibitor, tor, is an important positive regulator of cell growth in diverse organisms, and the tor inhibitor, rapamycin, has promising anti-tumor effects in human clinical trials (reviewed in ). oncogenic viruses have developed multiple different strategies to activate autophagy-inhibitory signaling pathways. these strategies include encoding viral oncoproteins that represent activated forms of the corresponding cellular proto-oncogene (reviewed in ) or upregulating rho/ras or class i pi- k/akt/tor signaling through alternative mechanisms [ ] [ ] [ ] [ ] [ ] [ ] [ ] . components of the autophagic machinery may also play a direct role in tumor suppression. the beclin gene is monallelically deleted in a high percentage of cases of human breast, ovarian, and prostate cancer (reviewed in ) and has tumor suppressor function in cultured mammary carcinoma cells , . heterozygous disruption of beclin in mice increases the frequency of spontaneous tumorigenesis (including papillary lung carcinomas, b cell lymphomas, and hepatocellular carcinomas) and accelerates the development of hepatitis b virus-induced pre-malignant lesions , . in addition, atg null es cells are more tumorigenic in mice than their wild-type counterparts and result in teratomas that are less welldifferentiated . together, these findings lead to the concept that autophagy genes may represent a novel class of tumor suppressor genes and that genetic disruption of autophagy may represent a novel mechanism of tumorigenesis. as will be discussed in more detail below, two different classes of viral gene products have been identified thus far that bind to beclin and inhibit its autophagy function, including the alphaherpesvirus-encoded neurovirulence protein, hsv- icp . , and the gammaherpesvirus-encoded bcl- -like proteins, kshv vbcl- and ebv bhrf , . the gammaherpesviruses are oncogenic viruses that are etiologically linked to a variety of different malignancies, including lymphoma, nasopharyngeal carcinoma, and kaposi's sarcoma. at present, the precise role of gammaherpesvirus bcl- like proteins in viral oncogenesis is uncertain. nonetheless, given the welldefined role of cellular bcl- in oncogenesis and the emerging evidence that beclin is a tumor suppressor protein, it will be important to evaluate whether viral bcl- antagonism of beclin function plays a role in gammaherpesvirus oncogenesis. of note, preliminary data indicates that kshv may also encode other gene products that interact with other components of the autophagic machinery . thus, oncogenic gammaherpesviruses may have multiple mechanisms to disarm host autophagy. it will be of interest to determine whether other oncogenic dna viruses, especially human papillomavirus, also directly inhibit the host autophagic machinery. in many tissues in the adult organism (especially post-mitotic cells), protein and organelle turnover by autophagy plays an essential cellular homeostatic or housekeeping function, removing damaged or unwanted organelles and proteins. for many decades, it has been presumed that this homeostatic function of autophagy represents an anti-aging mechanism, perhaps by reducing reactive oxidative species and other toxic intracellular substances that contribute to genotoxic stress (reviewed in ). the conserved effects of protein caloric restriction (a dietary inducer of autophagy) on lifespan extension has provided further fuel for this concept (reviewed in ). recent genetic studies, especially those performed in c. elegans, provide more direct evidence for a role of both autophagy regulatory signals and components of the autophagic machinery in anti-aging pathways. loss-offunction mutations in autophagy-inhibitory insulin-like signaling pathway extend lifespan (reviewed in ) , and inactivation of the c. elegans ortholog of yeast autophagy gene, atg /vps , blocks this lifespan extension . while the precise mechanisms by which autophagy extends lifespan are unknown, one theory is that autophagy selectively removes damaged mitochondria, resulting in decreased levels of intracellular reactive oxygen species and cellular protection against oxidative damage. viral infections, as well as the inflammatory response to viral infections, can damage mitochondria and/or increase the intracellular generation of reactive oxygen species, and these effects may contribute to viral pathogenesis. for example, the mitochondrial damage that occurs in hiv infection (even in the absence of antiretroviral treatment) is thought to be a major contributory factor to the metabolic abnormalities and cardiomyopathy that occur in patients with aids (reviewed in ). as another example, in a transgenic mouse model of hepatitic c virus (hcv) infection, oxidative stress in the absence of inflammation has been implicated in hcv-associated hepatocarcinogenesis . similarly, studies in transgenic mice and cultured cells indicate that pre-s /s mutant hepatitis b virus surface antigens, which accumulate in late stages of hbv infection, cause oxidative stress and dna damage . therefore, it is possible that the mechanisms by which autophagy functions as an anti-aging pathway may be relevant to potential roles that autophagy may play in protecting cells against adverse sequelae of oxidative stress during virus infection. diseases associated with an accumulation of misfolded and aggregated proteins, including neurodegenerative disorders and -anti-trypsin liver disease, are associated with an increase in the accumulation of autophagic vacuoles (reviewed in ). in these diseases, it has both been argued that autophagy plays a protective role (i.e. by removing protein aggregates and damaged mitochondria) and a pathologic role (i.e. by promoting liver dysfunction in -anti-trypsin deficiency through excessive mitochondrial autophagy or by promoting autophagic cell death). although both of these roles may be operative in different diseases or even in different facets of a single disease, recent studies provide compelling evidence that autophagy plays a protective role against the toxic effects associated with protein aggregation. for example, mutant -synuclein (associated with early onset parkinson's disease), and aggregate-prone proteins with polyglutamine expansions (associated with huntington's disease) are targeted for autophagic degradation , . rapamycin, which stimulates autophagy, not only enhances the clearance of aggregate-prone proteins but also reduces the appearance of the aggregates and the cell death associated with expression of mutant huntington's proteins . furthermore, induction of autophagy with rapamycin protects against neurodegeneration in both a fly and mouse model of huntington's disease . recent advances have also been made in understanding the mechanisms by which autophagy is induced in response to misfolded protein aggregates. in cell models, transgenic mice, and samples from human brains of patients with huntington's disease, mtor is sequestered into polyglutamine aggregates. this sequestration impairs its kinase activity, leading to induction of autophagy . although it has not yet been evaluated, it is likely that the accumulation of misfolded protein aggregates also induces autophagy through activation of the er stress response, which is mediated by the eif kinase, pkr-like er resident kinase (perk) , , since other stress stimuli (e.g. starvation and virus infection) that activate other eif kinases (e.g. gcn and pkr) induce autophagy through this same signaling pathway . these observations are potentially relevant to understanding the role of autophagy in protection against virus-induced diseases in which protein misfolding and er stress are thought to play pathogenetic roles. similar to genetic neurodegenerative disorders, there is increasing evidence that murine retrovirus-associated spongiform-like neuronal degeneration is also associated with protein misfolding and er stress. for example, viral envelope proteins from avirulent strains are processed normally and fail to induce er stress, whereas envelope proteins from neurovirulent strains are misfolded and activate er stress response pathways [ ] [ ] [ ] . in addition, it has been proposed that the mechanism by which pre-s mutant hbv surface antigens promote oxidative stress and dna damage is through the accumulation of misfolded mutant proteins and activation of er stress , . thus, based on recent studies with non-viral associated neurodegenerative disorders, the prediction is that autophagy induction might be beneficial in attenuating diseases associated with misfolded viral proteins, such as retrovirus-associated spongiform encephalopathy and hepatitis b virusinduced liver damage. as a corollary, the possibility that these viruses might possess mechanisms to evade host autophagy could be an exacerbating factor in the pathogenesis of these infections. an interesting question is whether viral protein aggregates trigger autophagy by mechanisms that are similar to those involved in autophagy induction initiated by cellular protein aggregates. different viral glycoproteins are known to activate the er stress-related eif kinase, perk , , although a role for perk in autophagy induction has not yet been formally demonstrated. it is completely unknown, however, whether viral protein aggregates, like polyglutamine aggregates in huntington's disease, sequester and thereby inactivate the autophagy-inhibitory kinase, mtor. if so, this would represent a highly novel mechanism by which viruses trigger intracellular innate immune responses. autophagy is emerging as a newly described mechanism of antiviral innate immunity that is targeted by viral virulence gene products. although there are not yet many published articles in this area, there are several observations that support this concept. first, during herpes simplex virus infection, the interferon-inducible antiviral pkr signaling pathway regulates the autophagic degradation of cellular and viral components , . second, mammalian autophagy execution genes, including beclin and atg , regulate sindbis virus replication and sindbis virus-induced cell death , . third, plant autophagy execution genes, including beclin , class iii pi -k/vps , atg , and atg , restrict tobacco mosaic virus replication and limit the spread of cell death during the innate immune response . in this section, each of these observations will be described in more detail. the interferon-inducible dsrna-dependent protein kinase r (pkr) plays an important role in innate immunity against viral infections. pkr activation leads to phosphorylation of the subunit of eukaryotic initiation factors (eif ) and a subsequent shutdown of host and viral protein synthesis and viral replication (reviewed in ). to avoid this translational shutdown, many viruses have evolved different strategies to antagonize pkr function. these include interference with the dsrna-mediated activation of pkr or pkr dimerization; blockade of the kinase catalytic site or pkr-f substrate interactions; alterations in the levels of pkr; direct regulation of eif phosphorylation; and effects on components downstream of eif (reviewed in , ). the importance of viral antagonism of pkr function in viral pathogenesis has been most clearly demonstrated using a herpes simplex virus type (hsv- ) model system , . the hsv- neurovirulence protein, icp . , binds to protein phosphatase and causes it to dephosphorylate eif , thereby negating the activity of pkr , . a neuroattenuated hsv- mutant lacking icp . exhibits wild-type replication and virulence in mice genetically lacking pkr , proving that the icp . gene product mediates neurovirulence by antagonizing pkrdependent functions. in addition to regulating host translation during viral infection, the pkr signaling pathway also regulates the autophagic degradation of host proteins . as mentioned above, molecules in the yeast eif kinase signaling pathway (e.g. the eif kinase, gcn , the eif ser residue, and the transcriptional transactivator, gcn ) are required for nitrogen starvationinduced autophagy. interestingly, the autophagy defect of gcn null yeast can be rescued by mammalian pkr transformation . direct evidence that viruses can induce pkr-dependent autophagy has been provided by studies done with herpes simplex virus type (hsv- ) infection in genetically engineered mefs . a herpes simplex virus (hsv- ) mutant virus lacking the icp . inhibitor of pkr signaling (termed hsv- . ), but not wild-type hsv- , is able to induce the autophagic breakdown of long-lived cellular proteins in wild-type mefs. however, hsv- . infection is not able to induce the autophagic breakdown of long-lived cellular proteins in mefs lacking pkr or with a nonphosphorylatable mutation in ser- of eif . these findings indicate the pkr-dependent signaling events regulate the autophagic breakdown of host proteins during viral infection and that this function of pkr is antagonized by the hsv- icp . neurovirulence gene product. as discussed in section . , the breakdown of cellular proteins may help protect host cells against the effects of "pseudostarvation" induced by viral infection. more recent studies indicate that pkr-dependent signaling also regulates the breakdown of viral components during hsv- infection . ultrastructural analyses of wild-type and pkr-deficient mefs and sympathetic neurons infected with wild-type and hsv- . demonstrate that hsv- is degraded in autophagosomes by a pkr-dependent process. in wild-type cells infected with wild-type hsv- , the majority of intracytoplasmic virions are a either randomly dispersed in the cytoplasm or are contained within "viral vacuoles," a structure thought to represent an important intermediate in the egress of hsv- from the nucleus out of the cell . in contrast, in wild-type cells infected with hsv- . , most cytoplasmic virions are localized within autophagosomes that contain a mix of different cytoplasmic constituents ( figure a ). pkr-deficient mefs or pkr-deficient neurons infected with hsv- . have very few autophagosomes, and appear similar to wild-type hsv- -infected wild-type cells, with randomly dispersed intracytoplasmic virions and numerous viral vacuoles. together, these observations demonstrate that hsv- is degraded by autophagy, that hsv- icp . antagonizes the cellular autophagic degradation of hsv- , and that this process requires pkr. recent biochemical analyses confirm that pkr signaling and hsv- icp . regulate viral protein degradation . hsv- protein degradation is significantly accelerated in wild-type mefs infected with hsv- t t . as compared to wild-type mefs infected with wild-type hsv- , indicating that icp . delays viral protein degradation. however, in autophagy-deficient pkr -/-mefs or eif s a mutant mefs, the rate of hsv- protein degradation is similar in hsv- -and hsv- . -infected cells, indicating that hsv- protein degradation is positively regulated by the pkr signaling pathway. thus, the eif kinase-dependent autophagy signaling pathway not only regulates the degradation of long-lived cellular proteins but also regulates the degradation of viral proteins. accordingly, it seems logical to speculate that pkr-dependent autophagic degradation of viruses inhibits viral replication and is an antiviral defense mechanism. however, the relative contributions of the effects of pkr on viral protein synthesis and the effects of pkr on viral protein degradation in the regulation of the hsv- replication have not yet been assessed. for this purpose, it will be necessary to selectively inhibit the autophagic protein degradation machinery and/or have hsv- mutant viruses that selectively block specific downstream functions regulated by pkr. it will also be important to determine whether pkr-dependent autophagy degrades and inhibits the replication of viruses other than hsv- . preliminary observations indicate that pkr-dependent autophagy does lead to the degradation of another neurotropic virus, the enveloped, positive-strand rna virus in the alphavirus genus, sindbis virus ( figure b ). the role of pkr in antiviral innate immunity is well established, but pkr regulates many different cellular processes, and it is not yet known exactly what role autophagy induction plays in the antiviral effects of pkr. however, the concept that autophagy is important in innate immunity is more directly supported by studies involving components of the mammalian and plant autophagic machinery. the first identified mammalian autophagy gene product, beclin , was isolated in a yeast two-hybrid screen, in the context of studies of the mechanism by which the antiapoptotic protein, bcl- , protects mice against lethal sindbis virus encephalitis . similar to the neuroprotective effects of bcl- , enforced neuronal expression of wild-type beclin in a recombinant chimeric sindbis virus vector reduces sindbis virus replication, reduces sindbis virus-induced apoptosis, and protects mice against lethal sindbis virus encephalitis . mutations in the bcl- -binding domain of beclin and mutations in other regions of beclin that block its autophagy function also block its protective effects during sindbis virus infection , . thus, it appears that both the interaction with bcl- and the autophagy function may be required for the antiviral effects of beclin . however, further studies are required to define the precise mechanism of how beclin inhibits viral replication and virus-induced apoptosis and to identify the precise role of bcl- -beclin interactions in these processes. preliminary studies with beclin null es cells and atg null mefs indicate a role for these two endogenous autophagy genes in innate immunity against sindbis virus infection. sindbis virus replicates to higher titers and results in accelerated death in beclin null es cells as compared to wild-type control es cells and in atg null mefs as compared to wild-type control mefs . in the case of sindbis virus infection, it is not known whether the acceleration of virus-induced death in beclin null or atg null cells is a result of increased viral replication or of independent effects of atg gene deficiency on cell death. however, studies comparing hsv- infection in wild-type and beclin -/-es cells suggest that beclin can protect against virus-induced cell death in the absence of inhibitory effects on viral replication . together, the studies of sindbis virus infection in neurons overexpressing beclin or in cultured cells lacking beclin or atg demonstrate a role for mammalian autophagy genes in both restricting viral replication and in protection against virus-induced cell death. it will be important to examine whether other autophagy genes have a similar antiviral function and to examine whether autophagy genes also protect against other types of virus infections. the mechanisms by which autophagy genes exert protective effects in sindbis virus infection are not yet known. presumably, the autophagic breakdown of viral components leads to decrease viral yields. however, it is also possible that autophagy leads to the breakdown of cellular components required for viral replication. as noted above, the protective effects against cell death may be secondary to inhibitory effects on viral replication. alternatively, the protective effects may relate to the nutrient recycling functions or "damage control" functions of autophagy, or in the case of beclin , to interactions between autophagy proteins and antiapoptotic pathways. it is also possible that autophagy may protect against cell death by degrading specific viral proteins (e.g. the sindbis virus e and e envelope glycoprotein's ) that are involved in triggering the apoptotic pathway. the protective effects of beclin and atg on virus-induced cell death are consistent with the "pro-survival" effects of autophagy during nutrient starvation and other forms of environmental stress. it is not yet clear how to reconcile these pro-survival effects with the view that autophagy represents an alternative form of non-apoptotic programmed cell death (reviewed in , , ). while the primary basis for this view has been morphologic correlations between the presence of autophagic vacuoles and dying cells (reviewed in , ), recent genetic experiments establish a more direct role for autophagy genes in certain types of programmed cell death. mammalian atg and beclin rnai blocks cell death in fibroblast and macrophage cell lines treated with the caspase inhibitor, zvad , and atg , , and rnai blocks salivary gland destruction during drosophila development . thus, the relationship between autophagy, cell survival, and cell death is quite complex and likely varies according to the cell type and the specific physiological or pathophysiological setting. it remains to be determined whether autophagy genes primarily play a protective role in preventing cell death during virus infection, or whether they also participate directly in cell death that is induced by certain viruses. the plant homologue of beclin also functions in antiviral host defense in plants. similar to mammalian beclin , plant beclin restricts viral replication; tobacco mosaic virus replication is increased in beclin silenced tobacco plants as compared to vector-treated control plants . however, in contrast to mammalian beclin , which prevents the death of sindbis virus-and hsv- -infected cells, plant beclin plays an interesting role in preventing the death of uninfected cells . in plants, the innate immune response during virus infection is characterized by a hypersensitive response which is a programmed cell death response that occurs around the infected areas (reviewed in [ ] [ ] [ ] ). this hypersensitive response limits virus spread and confers pathogen resistance. it is triggered by a pathogen-encoded avirulence protein, which is recognized by a specific plant cognate resistance protein, termed an r protein. in plants lacking r proteins, there is uncontrolled virus spread and pathogen sensitivity. a tobacco mosaic virus infection model has recently been used to study the role of autophagy genes in plant innate immunity. during tobacco mosaic virus infection, the hypersensitive response is triggered by tobacco mosaic virus protein, tmv p , which is the helicase domain of the viral replicase . tmv p is recognized by an r protein (called the n protein) of n. benthamiana, which is composed of a toll/interleukin receptor domain, a nucleotide binding domain, and a leucine-rich repeat domain . therefore, in tobacco plants containing the n protein (n ( ( +/+ n n ), there is local cell death in cells that are either infected with tmv or that express the tmv p protein but no systemic illness is observed. beclin silencing in n +/+ n n tobacco plants reveals a striking role for this autophagy gene in limiting the spread of cell death during the hypersensitive response . during tmv infection of n n plants, cell death begins as discreet and defined foci but continues to spread beyond the site of tmv infection until there is death of the entire inoculated leaf and other uninoculated leaves. a similar spreading cell death phenotype is seen with local expression of the tmv p protein, suggesting that the cell death occurs in response to a specific signal triggered by the pathogen encoded avirulence protein, and is not due to increased tmv replication or altered virus movement. in addition, in plants that lack the n gene, beclin silencing does not lead to cell death after tmv infection. moreover, beclin silencing also results in spreading cell death during the hypersensitive response triggered by bacterially-encoded pathogen avirulence proteins. together, these observations demonstrate that the spreading cell death phenotype in beclin -silenced plants is mediated by r gene-mediated innate immune responses and that beclin is an important negative regulator of cell death during the plant innate immune response. a similar role for other autophagy genes in limiting the spread of programmed cell death during tmv infection has also been observed. as discussed in section . . , pi -k/vps is a protein that physically interacts with atg /beclin in yeast and mammals and is essential for proper autophagosome formation. interestingly, silencing of the plant class iii pi- k/vps in n +/+ n n plants results in a spreading cell death phenotype during tmv infection that is similar to that observed with beclin silencing. as discussed in section . . , yeast and mammalian atg and atg are essential for conjugation reactions needed for autophagosome formation, and d silencing of the plant homologues of these genes also results in a spreading cell death phenotype after tmv infection. thus, multiple different f f autophagy genes, including those that act in the vesicle nucleation stage (e.g. t beclin , class iii pi -k/vps ) and those that act in the vesicle expansion stage (e.g. atg and atg ) are necessary to prevent the spread of cell death during the plant innate immune response. while plant autophagy genes protect uninfected cells against death whereas mammalian autophagy genes protect infected cells against death during virus infection, the plant data nonetheless further support a "prosurvival", rather than "pro-death" function of autophagy genes during viral f f infections. at present, it is not yet clear how autophagy genes protect uninfected cells against death during the plant hypersensitive response. one possibility is that the absence of autophagy genes in uninfected cells somehow modifies the r gene-mediated signal transduction pathway in a way that instructs uninfected cells to die. an alternative, perhaps more likely, possibility is that the absence of autophagy genes in uninfected cells f renders them more susceptible to pro-death signals emitted from infected cells. regardless of the mechanism, this newly defined role for autophagy genes in preventing the spread of cell death during plant innate immunity has significant implications for understanding the role of autophagy in systemic protection against viral infections. an important question is whether autophagy genes play a similar role during animal virus infections. the evolutionarily conserved function of both mammalian and plant autophagy genes in restricting viral replication and/or protection against cell death suggests an essential role for autophagy in innate immunity. this concept is further supported by recent observations indicating that the herpes simplex virus neurovirulence protein, icp . , possesses multiple mechanisms to disarm host autophagy. it can both antagonize the pkr signaling pathway required for autophagy induction and inhibit the function of one component of the autophagic machinery, beclin . as discussed in section . , icp . blocks pkr-dependent, eif ser- -dependent autophagic degradation of cellular and viral components in f hsv- -infected mefs and neurons . one predicted mechanism by which icp . blocks pkr-dependent autophagy is through its known ability to promote the dephosphorylation of eif via interactions with pp . however, new evidence also suggests a second potential mechanism. roizman et al. isolated the mammalian autophagy protein, beclin , in a yeast two-hybrid screen using icp . as a bait . subsequent studies have shown that icp . directly interacts with beclin in mammalian cells and inhibits the ability of beclin to rescue autophagy in autophagy-defective atg null yeast and in autophagy-defective human mcf breast carcinoma cells . since icp . binds to beclin via a domain that is distinct from its pp -binding domain, it should be possible to construct hsv- viruses containing mutations in icp . that help differentiate between the role of pp -binding (and eif phosphorylation) and the role of beclin -binding in hsv- icp . -mediated neurovirulence. besides hsv- icp . , there are numerous other viral proteins or rnas that suppress pkr signaling through a variety of different mechanism (reviewed in , ). for example, vaccinia virus e , influenza virus ns , hsv- us , reovirus , and rotavirus nsp encode double-stranded (ds) rna-binding proteins that prevent pkr activation. adenovirus vai rnas and hiv tar rnas bind to dsrna substrates and inhibit pkr. hepatitis c virus ns a protein inhibits the dimerization of pkr, and influenza virus recruits a cellular protein, p ipk , that directly interacts with pkr and inhibits its dimerization. the vaccinia virus k l, hepatitis c virus e , and hiv tat proteins act as pseudosubstrates of pkr. as-of-yet, the role of these other viral rnas and proteins in autophagy inhibition has not been investigated. however, given the evolutionarily conserved requirement for an intact eif kinase signaling pathway in autophagy induction, the prediction is that these other viral inhibitors of pkr, like hsv icp . , also function as antagonists of host autophagy. further studies are needed to test this prediction and to study the role of this predicted antagonism of host autophagy in the pathogenesis of diseases caused by these other important viral pathogens that encode putative autophagy inhibitors. not only may other viruses antagonize the autophagy function of pkr, but other viral gene products may also antagonize the autophagy function of specific mammalian atg genes. beclin was originally isolated in a yeast two-hybrid screen with the cellular anti-anti-apoptotic protein, bcl- . subsequently, beclin has also been shown to interact with viral bcl- -like proteins encoded by different gammaherpesviruses, including ebv-encoded bhrf , kshv-encoded v-bcl- , and murine hv -encoded m . like icp . , these viral proteins can also inhibit the autophagy function of beclin in yeast and mammalian assays. in addition, preliminary evidence indicates that other kshv-encoded proteins may interact with other specific atg proteins . an as-of-yet explored area is whether viruses also inhibit autophagy by activating autophagy inhibitory signaling pathways. as noted in section . and . , the class i pi- k/akt signaling pathway negatively regulates autophagy in both mammalian cells and c. elegans , , and several different viruses activate this pathway. certain oncogenic retroviruses encode the catalytic subunit of pi -k and akt (reviewed in ). in addition, the ebv latent membrane proteins, lmp and a, the hepatitis b virus protein, hbx, the kaposi's sarcoma virus protein, k , and the hepatitis c virus protein, ns a all activate the pi -k/akt signaling pathway [ ] [ ] [ ] [ ] [ ] [ ] . presumably, such activation plays a role in autophagy inhibition, although this has not yet been formally tested. while further studies are required to more precisely define the interactions between viral gene products and autophagy regulatory signals and autophagy proteins, there is, however, accumulating evidence that viruses do target multiple different steps of the host autophagy pathway. this observation strongly suggests an evolutionary advantage for viruses to inhibit host autophagy, and by extrapolation, a beneficial role for host autophagy in defense against viral infections. some viruses appear to have even further outsmarted host autophagy. rather than merely devising strategies to block host autophagy, certain positive-strand rna viruses have figured out ways to "co-opt" elements of the autophagy pathway to promote their own replication. this subject has been recently reviewed in detail elsewhere and will therefore only be briefly summarized in this section. as early as , electron microscopic studies of poliovirus-infected cells demonstrated the presence of large numbers of membranous vesicles that were postulated to develop by an autophagic-like mechanism . more recently, work by kirkegaard et al, has extended these findings to further show that poliovirus replication complexes are associated with doublemembrane vesicles that resemble autophagosomes, in that they ( ) have similar double membrane-bound morphology; ( ) have low buoyant density; and ( ) label with the autophagosome marker, gfp-lc , and the lysosome marker, lamp , . unlike classical autophagosomes, these autophagiclike vesicles do not appear to have a destructive role or mature into degradative compartments. in support of this, treatment with autophagy r r inducers, rapamycin or tamoxifen, both increase, rather than decrease poliovirus growth . furthermore, these double-membrane vesicles are also different from classical autophagosomes in that they contain sec and sec , components of the anterograde transport system that bud from the er . therefore, it is possible that poliovirus-induced vesicles arise from an alternate source rather than the pas, but still share some of the same characteristics of classical autophagosomes (e.g. gfp-lc and labeling, augmentation with rapamycin treatment). similar to the replication vacuoles that are associated with certain intracellular bacterial pathogens (e.g. legionella pneumophila) , the poliovirus-induced double-membrane vesicles likely originate from the er . furthermore, these poliovirusinduced vesicles seem to have an alternate function than autophagosomes (i.e. they are pro-replicative, rather than degradative compartments). these r observations suggest that poliovirus may promote its own replication by inducing dynamic membrane rearrangements that share in common certain features of the autophagy pathway (e.g. formation of sequestering doublemembrane vesicles, presence of overlapping markers) but avoid, other unwanted features of the autophagy pathway (e.g. maturation into degradation compartments). of note, specific poliovirus proteins, including bc and a, have been identified that are sufficient for the induction of t these "autophagic-like" double-membrane bound vesicles . however, the mechanisms by which these proteins induce the formation of such vesicles are not yet known. a recent study with the coronavirus, murine hepatitis virus, has provided more direct evidence that components of the autophagic machinery can be utilized for rna virus replication . mhv replication complexes localize to double-membrane vesicles (that are also thought to arise from the er) ( figure a ) and they co-localize with certain autophagy proteins, including lc and atg . in mhv-infected atg -/-es cells, double-membrane vesicles are not detected ( figure b) , and viral replication is dramatically reduced. these observations provide the first genetic demonstration that proteins necessary for autophagic vacuole formation are also required for maximal levels of viral replication. thus, mhv, and potentially other viruses that replicate in association with double-membrane vesicles (e.g. poliovirus, equine arterivirus), utilize components of cellular autophagy to foster their f own growth. presumably, the atg protein conjugation system (involving atg ) that plays a role in autophagic vesicle expansion and completion also plays a role in the formation of double-membrane vesicles involved in viral replication. it is not yet known whether the entire autophagic machinery or only selective components of the autophagic machinery are used for the formation of double-membrane vesicles that are associated with viral replication complexes. these observations with poliovirus and mhv represent two examples of how viruses can "subvert" elements of the host autophagy pathway to promote their own intracellular growth. in these infections, rna replication complexes are observed in association with "autophagic-like" doublemembrane vesicles but not in association with degradative autophagosomes. it is not clear whether this represents fundamental differences in the host pathways leading to the formation of "autophagic-like" double-membrane vesicles and classical degradative autophagosomes, the diversion of the autophagic machinery towards the formation of "autophagic-like" doublemembrane vesicles from the formation of classical degradative autophagosomes, or specific viral mechanisms to antagonize the maturation of "autophagic-like" double-membrane vesicles into mature degradative autophagosomes. however, interestingly, mhv infection does lead to the induction of atg- -dependent long-lived cellular protein degradation, ruling out the hypothesis that the autophagic machinery is entirely diverted to form membranes required for viral replication complexes. perhaps mhv possesses as-of-yet defined mechanisms to shield its replication complexes from autophagic degradation. although research in this area is still in a stage of infancy, it seems likely that the lysosomal degradation pathway of autophagy plays an evolutionarily conserved role in antiviral immunity. the interferon-inducible, antiviral pkr signaling pathway positively regulates autophagy, and both mammalian and plant autophagy genes restrict viral replication and protect against virus-induced cell death. given this role of autophagy in innate immunity, it is not surprising that viruses have evolved numerous strategies to inhibit host autophagy. different viral gene products can either modulate autophagy regulatory signals or directly interact with components of the autophagy execution machinery. moreover, certain rna viruses have managed to "co-apt" the autophagy pathway, selectively utilizing certain components of the dynamic membrane rearrangement system to promote their own replication inside the host cytoplasm. in addition to this newly emerging role of autophagy in innate immunity, autophagy plays an important role in many other fundamental biological processes, including tissue homeostasis, differentiation and development, cell growth control, and the prevention of aging. accordingly, the inhibition of host autophagy by viral gene products has important implications not only for understanding mechanisms of immune evasion, but also for understanding novel mechanisms of viral pathogenesis. it will be interesting to dissect the role of viral inhibition of autophagy in acute, persistent, and latent viral replication, as well as in the pathogenesis of cancer and other medical diseases. development by self-digestion: molecular mechanisms and biological functions of autophagy development by self-digestion: molecular mechanisms and biological functions of autophagy autophagy: a regulated bulk degradation process inside cells. d the molecular mechanism of autophagy functions of lysosomes a unified nomenclature for yeast autophagy-related genes diversity of signaling controls of macroautophagy in mammalian cells the pre-autophagosomal structure organized by concerted functions of apg genes is essential for autophagosome formation convergence of multiple autophagy and cytoplasm to vacuole targeting components to a perivacuolar membrane compartment prior to de novo vesicle formation yeast autophagosomes: de novo formation of a membrane structure cellular autophagy: surrender, avoidance and subversion by microrganisms nonselective autophagy of cytosolic enzymes by isolated rat hepatocytes autophagy as a regulated pathway of cellular degradation the mitochondrial permeability transition initiates autophagy in rat hepatocytes pkr: a sentinel kinase for cellular stress regulation of starvation-and virus-induced autophagy by the eif a kinase signaling pathway distinct classes of phosphatidylinositol '-kinases are involved in signaling pathways that control macroautophagy in ht- cells autophagy genes are essential for dauer development and lifespan extension in c. elegans oncogenic kinase signalling promotion of tumorigenesis by heterozygous disruption of the beclin gene beclin , an autophagy gene essential for early embryonic development, is a haploinsufficient tumor suppressor autophagy as a cell death and tumor suppressor mechanism defective autophagy leads to cancer molecular dissection of autophagy: two ubiquitin-like systems isolation and characterization of autophagy-defective mutants of saccharomyces cerevisiae tor-mediated induction of autophagy via an apg protein kinase complex chemical genetic analysis of apg reveals a non-kinase role in the induction of autophagy phosphatidylinositol -kinase encoded by yeast vps gene essential for prtoein sorting -methyladenine: specific inhibitor of autophagic/lysosomal protein degradation in isolated rat hepatocytes the phosphatidylinositol -kinase inhibitors wortmannin and ly inhibit autophagy in isolated rat hepatocytes the hansenula polymorpha pdd gene product, essential for the selective degradation of peroxisomes, is a homologue of saccharomyces cerevisiae vps p two distinct vps phosphatidylinositol -kinase complexes function in autophagy and carboxypeptidase y sorting in saccharomyces cerevisiae inhibition of autophagy in mitotic animal cells protection against fatal sindbis virus encephalitis by beclin, a novel bcl- -interacting protein unpublished data unpublished data autophagy genes are essential for limiting the spread of programmed cell death associated with plant innate immunity a protein conjugation system essential for autophagy a ubiquitin-like system mediates protein lipidation the x-ray crystal structure and putative ligand-derived peptide binding t t properties of gamma-aminobutyric acid receptor type a receptor-associated protein solution structure of human gaga(a) receptor-associated protein gabarap: implications for biological function and its regulation the cyrstal structure of microtubule-associated protein light chain , a mammalian homolgue of saccharomyces cerevisiae atg formation process of autophagosome is traced with apg /aut p in yeast hsatg b/hsapg b/autophagin- cleaves the carboxyl termini of three humans atg homologues and delipidates lc -and gabarap-phospholipid conjugates the reversible modifcation regulates the membrane-binding state of apg /aut essential for autophagy and the cytoplasm to vacuole targeting pathway apg p/cvt p: a novel protein-activating enzyme essential for autophagy lc , a mammalian homologue of yeast apg p, is localized in autophagosome membranes after processing coronavirus replication complex formation utilizes components of cellular autophagy apg /cvt p is an integral membrane protein required for transport vesicle formation in the cvt and autophagy pathways the atg -atg complex regulates atg and atg retrieval transport from the preautophagosomal structure in vivo analysis of autophagy in response to nutrient starvation using transgenic mice expressing a r r fluorescent autophagy marker autophagy in health and disease: a double-edged sword functions of lysosomes macroautophagy is required for multicellular development of the social amoeba dictyostelium discoideum dictyostelium macroautophagy mutants vary in the severity of their developmental defects leaf senescence and starvation-induced chlorosis are accelerated by the disruption of an arabidopsis autophagy gene the apg / -activating enzyme apg is required for proper nutrientrecycling and sensescence in arabidopsis thaliana unpublished data translation initiation: adept at adapting trascriptional profiling shows that gcn p is a master regulator of gene expression during amino acid starvation in yeast bhrf , a viral homologue of the bcl- oncogene, disturbs epithelial cell r differentiation reduced rates of proteolysis in transformed cells role of the vacuolar apparatus in augmented protein degradation in cultured fibroblasts inhibitors of mammalian target of rapamycin as novel antitumor agents: from bench to clinic human papillomavirus type b virus-like particles are able to activate the ras-map kinase pathway and induce cell proliferation epstein-barr virus lmp a transforms epithelial cells, inhibits cell differentiation, and activates akt latent membrane protein a inhibits transforming growth factor-beta -induced apoptosis through the phosphatidylinositol -kinase/akt pathway differential signaling pathways are activated in the epstein-barr virus-associated malignancies nasopharyngela carcinoma and hodgin lymphoma hepatitis b viral hbx induces matrix metalloproteinase- gene expression trhough activation of erk and pi- k/akt pathways: involvement of invasive potential the k protein of kaposi's sarcoma-associated herpesvirus activates the akt signaling pathway the hepatitis c virus ns a protein activates a phosphoinosite -kinase-dependent survival signaling cascade cloning and genomic organization of beclin , a candidate tumor suppressor gene on chromosome q induction of autophagy and inhibition of tumorigenesis by beclin beclin contains a leucine-rich nuclear export signal that is reguired for its autophagy and tumor suppressor function unpublished data autophagy and aging--when "all you can eat" is yourself the anti-ageing effects of caloric restriction may involve stimulation of macroautophagy and lysosomal degradation, and can be intensifid pharmacologically genetic pathways that regulate ageing in model organisms mitochondria and hiv infection: the first decade oxidative stress in the absence of inflammation in a mouse model for hepatitis c virus-associated carcinogenesis pre-s mutant surface antigens in chronic hepatitis b virus infection induce oxidative stress and dna damage mitochondrial autophagy and injury in the liver in -antitrypsin deficiency aggregate-prone proteins with polyglutamine and polyalanine expansions are degraded by autophagy inhibition of mtor induces autophagy and reduces toxicity of polyglutamine expansions in fly and mouse models off huntington disease identification and characterization of pancreatic eukaryotic initiation factor alpha-subunit kinase, pek, involved in translational control protein translation and folding are coupled by an endoplasmic-reticulum-resident kinase endoplasmic reticulum stress is a determinant of retrovirus-induced spongiform neurodegeneration activation of endoplasmic reticulum stress signaling pathway is associated with neuronal degeneratio in momulv-ts -induced spongiform encephalopathy endoplasmic reticulum (er) stress induced by a neurovirulent mouse retrovirus is associated with prolonged bip binding and retention of a viral protein in the er different types of ground glass hepatocytes in chronic hepatitis b virus infection contain specific pre-s mutants that may induce endoplasmic reticulum stress replication of a cytopathic strain of bovine viral diarrhea virus activates perk and induces endoplasmic reticulum stress-mediated apoptosis of mdbk cells possible involvment of both endoplasmic reticulum-and mitochondriadependent pathways in momulv-ts -induced apoptosis in astrocytes pkr-dependent autophagic degradation of herpes simplex virus type molecular mechanisms of interferon resistance mediated by viral-directed inhibition of pkr, the interferon-induced protein kinase com: maneuvering the internetworks of viral neuropathogenesis and evasion of the host defense mapping of herpes simplex virus- neurovirulence to g . , a gene nonessential for growth in culture specific phenotypic restoration of an attenuated virus by knockout of a hostresistance gene association of a m(r) , phosphoprotein with protein kinase pkr in cells exhibiting enhanced phosphorylation of translation initiation factor eif- alpha and premature shutoff of protein synthesis after infection with gamma . -mutants of herpes simplex virus the ( ) . protein of herpes simplex virus complexes with protein phosphatase to dephosphorylate the subunit of the eukaryotic translation initiation factor and preculde the shutoff of protein synthesis by double-stranded rna-activated protein kinase electron microscopic observations on the development of herpes simplex virus unpublished data bcl- protects mice against fatal alphavirus encephalitis the transmembrane domains of sindbis virus envelope glycoproteins induce cell death the autophagosomal-lysosomal compartment in programmed cell death regulation of an atg -beclin program of autophagic cell death by caspase american society for cell biology th annual meeting plant pathogens and integrated defence responses to infection plant innate immunity -direct and indirect recognition of general and specific pathogen-associated molecules controlled cell death, plant survival and development the helicase domain of the tmv replicase proteins induces the nmediated defense response in tobacco the product of the tobacco mosaic virus resistance gene n: similarity to toll and the interleukin- receptor the tumor suppressor pten positively regulates macroautophagy by inhibiting the phosphatidylinositol -kinase/protein kinase b pathway electron microscopic study of the formation of poliovirus remodeling the endoplasmic reticulum by poliovirus infection and by individual viral proteins: an autophagy-like origin for virus-induced vesicles cellular coopii proteins are involved in production of the vesicles that form the poliovirus replication complex apg p, a novel protein kinase required for the autophagic process in saccharomyces cerevisiae apg p and apg /vps p form a protein complex essential for autophagy in the yeast, saccharomyces cerevisiae the drosophila homolog of aut is essential for autophagy and development human apg p/aut p homologue is an authentic e enzyme for multiple substrates, gate- , gabarap, and map-lc , and facilitates the conjugation of hapg p to hapg p the loss of drosophila apg /aut function modifies the phenotypes of cut and notch signaling pathway pathway mutants a single protease, apg b, is specific for the autophagy-related ubiquitin-like proteins gate- , map -lc , gabarap, and apg l dissection of autophagosome formation using apg deficient mouse embryonic stem cells a new protein conjugation system in human. the counterpart of the yeast apg p conjugation system essential for autophagy structural and functional analyses of apg , a gene involved in autophagy in yeast the human homolog of saccharomyces cerevisiae apg p is a protein-activating enzyme for multiple substrates including human apg p, gate- , gabarap, and map-lc glucose-induced autophagy of peroxisomes in pichia pastoris requires a unique e -like protein mouse apg as an apg conjugating enzyme: analysis by the conugation-mediated yeast two-hybrid method apg p, a novel protein-conjugating enzyme essential for autophagy in yeast murine apg p has a substrate preference for murine apg p over three apg p homologs apg p is required for the function of the apg p-apg p conjugate in the yeast autophagy pathway cvt /gas is required for cytoplasm-to-vacuole transport, pexophagy, and autophagy in saccharomyces cerevisiae and pichia pastoris the work done in the authors' laboratories was supported by nih ro grants ai and ai to b.l, and an nsf plant genome grant dbi- and nih ro grant gm to s.p.d-k. key: cord- -wyipzwo authors: gleeson, paul a.; teasdale, rohan d.; burke, jo title: targeting of proteins to the golgi apparatus date: journal: glycoconj j doi: . /bf sha: doc_id: cord_uid: wyipzwo the golgi apparatus maintains a highly organized structure in spite of the intense membrane traffic which flows into and out of this organelle. resident golgi proteins must have localization signals to ensure that they are targeted to the correct golgi compartment and not swept further along the secretory pathway. there are a number of distinct groups of golgi membrane proteins, including glycosyltransferases, recyclingtrans-golgi network proteins, peripheral membrane proteins, receptors and viral glycoproteins. recent studies indicate that there are a number of different golgi localization signals and mechanisms for retaining proteins to the golgi apparatus. this review focuses on the current knowledge in this field. the survival of a cell depends on maintaining the integrity of the intracellular organelles. this feat is achieved by highly selective sorting and accurate transport of proteins to their correct destinations. over the past few years the dissection of the molecular machinery for targeting and localization of proteins, in particular proteins of the secretory pathway, has been studied vigorously. defined sequence motifs have been identified on proteins which can act as 'address labels'. the golgi apparatus represents the 'hub' of the secretory pathway where intense membrane traffic is controlled. this organelle not only co-ordinates the sorting of newly synthesized proteins but is also responsible for the control of posttranslational modifications, in particular glycosylation. a fundamental question currently being addressed in cell biology is how the golgi apparatus is organized to achieve these demanding functions and how it maintains its structural integrity in spite of the intense membrane traffic which enters and leaves this organelle. this review will focus primarily on our understanding of the molecular signals and mechanisms for the retention of resident golgi proteins. the golgi apparatus is a highly complex and dynamic organelle, which has been difficult to define in three-dimensional terms [ ] . it consists of a number of (reflecting in part the cholesterol content), ph, and most importantly in the populations of resident proteins which they contain. however, detailed biochemical characterization of the individual cisternae is lacking as the current methods are inappropriate to allow their purification. newly synthesized proteins are transported sequentially from the er to the golgi and then to their final destination. the transport of newly synthesized proteins from the endoplasmic reticulum to golgi cisternae, between adjacent cisternae within the golgi stack, and from golgi cisternae to various destinations is mediated by vesicles shuttling between donor and recipient compartments. vesicles bud from one compartment and then target and fuse with the next compartment [ , ] . an increasing number of structural and regulatory components have been identified which are involved in the orchestration of the complex and intriguing processes of budding, specific targeting, docking and fusion [ ] [ ] [ ] . some of the components of this machinery are localized only to golgi membranes and are thought to be specific for membrane transport through and from the golgi apparatus. the restricted location of these components indicates the presence of specific localization signals. until recently it was widely believed that, if correctly folded, newly synthesized proteins are transported from the endoplasmic reticulum, through the golgi stack to the tgn without the requirement for a specific transport signal [ , ] . however, the concept of 'bulk flow' of membrane proteins from the er has recently been challenged by the finding that vesicular stomatitis virus g glycoprotein is significantly concentrated during export from the er [ ] . nonetheless forward transport, at least from the golgi apparatus to the cell surface, appears to constitute a signal-independent or default pathway. despite this extensive flux of proteins, it is imperative that the golgi apparatus maintains the set of resident proteins which define its unique structural and functional properties. thus, it would appear that newly synthesized golgi membrane proteins must stop at the correct cisterna, or subcompartment, and be prevented from being swept into transport vesicles that bud from the dilated rims of the cisternae. clearly, specific localization signals are required for retention of proteins which reside in the golgi apparatus. what do we know about the sorting signals and mechanisms for the localization of non-golgi proteins within the secretory and endocytic pathways? a number of sorting signals have been found associated with the cytoplasmic domains of membrane proteins [ ] . for example, short tyrosine-containing peptide motifs found on cytoplasmic domains direct the sorting of proteins from the plasma membrane via the receptor mediated endocytosis pathway [ ] and the transcytosis of basolateral proteins to the apical surface of certain polarized cells [ ] , while a dileucine-containing peptide motif directs the transport of man- -p receptors from the tgn to the late endosomes [ , ] . these cytoplasmic domain sorting signals mediate interactions with coat structures of budding vesicles and thereby allow the selective vesicular transport of these membrane proteins between a variety of compartments [ ] . much progress has been made in defining the retention signals for resident er proteins. targeting signals have been identified for both soluble and membrane-bound proteins residing in the er. a specific retention signal, comprising the carboxy terminal sequence kdel/hdel, has been identified for a number of resident soluble er proteins [ , ] , and a receptor for this retention sequence has been identified [ ] [ ] [ ] . retention of these soluble er proteins is mediated by a receptor-based salvaging mechanism, whereby escaped kdel-bearing proteins are retrieved from a post-er compartment by a recycling kdel receptor [ ] . for membrane proteins of the e r a double lysine motif (kkxx) at the cytoplasmically exposed carboxy terminus of certain type i membrane proteins has been shown to specify er residence [ ] . for type li membrane er proteins, a related double arginine motif at the cytoplasmically orientated amino terminus has been identified [ ] . interestingly, the localization of ergic- (p ), a type i membrane protein of the intermediate compartment or cgn, requires a kkxx er retention motif, again suggesting that the cgn may be an extension overall, the localization signals of non-golgi proteins are hydrophilic motifs located on either the cytoplasmic or luminal domains of the protein, and some of these signals have been shown to interact specifically with receptor molecules or with protein coats of budding vesicles. it is now clear from recent studies that there are a number of distinct types of golgi localization signals. based on these localization signals and other biochemical features the resident proteins of the golgi apparatus can be divided into five groups (table ) . they are all membrane proteins. interestingly no soluble resident golgi proteins have been identified within the lumen of the golgi apparatus, which probably indicates that the mechanisms for retaining proteins to this organelle are restricted to membraneassociated proteins. the five groups are described individually as the mechanism of golgi localization may be unique for each group. the golgi apparatus plays a key role in the glycosylation of newly synthesized membrane and secreted proteins [ , ] . based on the exquisite specificities of the currently defined glycosyltransferases [ , ] , the synthesis of all the known carbohydrate chains of glycoconjugates must require in the region of - different glycosyltransferase enzymes distributed throughout the golgi stack. however, very little is known about the structural organization of these integral membrane enzymes within the membranes of the golgi cisternae and the signals which define their localization within the golgi are only now beginning to be defined. a number of glycosyltransferases have restricted distributions within the golgi apparatus, notably /~ , galactosyltransferase (/? , galt) (trans-golgi) [ ] [ ] [ ] , :~ , sialyltransferase (e , st) (trans-golgi and trans-golgi network) [ , ] and n-acetylglucosaminyltransferase i (glcnacti) (medial-golgi) [ , , ] . furthermore, simultaneous immunogold localization of /~i, galt and gtcnacti in the same golgi apparatus, confirms that these enzymes have distinct, though overlapping, distributions [ ] . from the purification tables ofgolgi glycosyltransferases, it is clear that individual transferases constitute only a minor percentage of the proteins of the cisternae in which they reside. for example, glcnacti constitutes about ~ of medial-specific golgi membrane protein in rabbit liver [ ] . however, in view of the estimated number of golgi glycosyltransferases, collectively the glycosyttransferases of each golgi cisterna may represent a very significant proportion, if not the bulk, of the resident membrane proteins. numerous mammalian glycosyltransferases have been cloned and sequenced (see reviews [ ] [ ] [ ] [ ] ). individual glycosyltransferases are highly conserved across species, for example rabbit glcnacti shares ~o and ~ amino acid sequence identity with human and mouse glcnacti, respectively [ ] . but comparison of the amino acid sequences between the glycosyltransferases has revealed only isolated cases of sequence similarity. for example, there is a high degree of sequence similarity between blood group a and b glycosyltransferases [ ] , which are products of two alleles, and between a number of e ( ) fucosyltransferases [ ] . furthermore, a conserved motif has been identified in the catalytic domain of cloned sialyltransferases (the 'sialylmotif') [ ] . however, these examples of sequence similarity are the exceptions and, overall, little sequence similarity has been detected between different glycosyltransferases. this is dramatically illustrated by a lack of obvious amino acid similarity between the sequences of four different glcnac transferases involved in the synthesis of the outer antennae of complex n-glycans, namely glcnacti [ , ] , glcnactii [ ] , glcnactiii [ ] , and glcnactv [ ] . one would expect there to be structural similarity between the catalytic sites of these glcnac transferases but this has not been detected from their amino acid sequences. thus, comparison of the primary structures of golgi glycosyltransferases has not revealed a potential golgi localization motif. there is, however, a striking similarity in the domain structure of these golgi enzymes. all golgi transferases cloned to date are nin/cou t (type ii) membrane proteins containing a single hydrophobic membrane-spanning domain ( - amino acids) which also serves as a non-cleavable signal sequence. each has a short n-terminal cytoplasmic domain (many have less than amino acids), and a large carboxyl-terminal catalytic domain situated in the lumen of the golgi apparatus. the catalytic domain is linked to the transmembrane domain by a loosely defined 'stem' region which may play a role in positioning the catalytic domain away from the lipid bilayer facilitating access to the substrate. over the past years a number of groups have attempted to identify the targeting signal responsible for the localization of gtycosyltransferases. three glycosyltransferases have been extensively examined, namely cd, st, /~i, galt, and glcnacti. these enzymes are residents of the tgn, tgn/trans-gotgi, and medial-golgi respectively. a common strategy has been employed by all groups to identify a putative golgi retention signal(s) by analysing the localization, in transfected mammalian cells, of hybrid molecules containing limited sequences derived from golgi glycosyltransferases. in all cases, the membrane-spanning domains of the golgi glycosyltransferases have been shown to direct, at least partial localization of hybrid molecules to the golgi apparatus [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] . indeed it has been further demonstrated that the transmembrane domain of/?i, galt and glcnacti can specifically localize hybrid proteins to the trans and medial cisternae, respectively [ , ] . however, a number of studies have shown that sequences flanking the transmembrane domain also play auxiliary roles in mediating golgi localization [ , [ ] [ ] [ ] . these studies all agreed that the transmembrane domain plays a central role in the targeting and localization of resident golgi glycosyltransferases. this was an unexpected finding as localization signals up until then were hydrophilic glycopinion mini-review regions of the cytoplasmic or luminal domains. the involvement of a hydrophobic stretch of amino acids in targeting indicated a unique mechanism for the localization of these resident golgi proteins. these initial studies were based to a large extent on the premise that a discrete region, or motif, was responsible for the golgi localization signal. further studies have indicated that, although the transmembrane domain is relevant, the situation is far more complex than at first appreciated. figs - summarize graphically the regions of glycosyltransferases that have been examined and their ability to direct reporter molecules to the golgi apparatus. collectively, a large number of constructs have now been analysed. figs - include many of these constructs, however, it is by no means all-inclusive. those selected are the most instructive and highlight the complexity of the situation. there is considerable variability in the results obtained between groups, even when comparing the same glycosyltransferase. for example, wong et al. [ ] reported that the transmembrane domain of e , st resulted in very efficient golgi localization of a hybrid molecule, whereas a hybrid construct of munro [ ] , containing the equivalent c~ , st domain, resulted in considerable leakage to the cell surface ( fig. ) . site-directed mutagenesis of residues of the transmembrahe domain of // , gait, in the context of hybrid molecules, suggested that uncharged polar residues are critical for the ability of these hydrophobic domains to mediate golgi retention [ ] . however, a number of other studies have indicated that considerable alterations can be made to the transmembrane domain of the native enzymes without abolishing gotgi retention. for example, colley et al [ ] showed that sequential replacement of - amino acid blocks of the transmembrane domain of e , st had no effect on golgi localization. further, munro [ ] made the striking demonstration that the transmembrane domain of an e , st hybrid protein (containing the stem and tail of e , st) can be totally replaced by a poly-leucine sequence of similar length without adversely affecting golgi retention. munro [ ] also reported that the length of the polyteucine segment appeared to be important in maintaining efficient golgi localization as a transmembrane domain of leucine residues showed leakage to the cell surface. in contrast to this apparent length requirement, dahdat and colley [ ] replaced the amino acid transmembrane domain of e , st with the long amino acid transmembrane domain from influenza neuraminidase without any apparent disruption of the retention signal. the difficulty in defining the structural elements associated with transmembrane domains in golgi localization has been highlighted by a recent study by low et al. [ ] who demonstrated that swapping the transmembrane domains of two cell surface proteins resulted in hybrid molecules which either accumulated in the golgi or were retarded in transport through the golgi apparatus. gol jeell smc~ace (m) o~ ~ ce~ surface (n,w) -_ ~. t aa i stem aa gotgi ~w) ce~ surface (lv~ go~ce~ surface (d) thus, although both native proteins are transported efficiently to the cell surface, swapping the transmembrane domains of these two proteins altered golgi to cell surface transport. these investigators concluded that the hydrophobic transmembrane domain in relation to its charged flanking sequences is important in transport from the golgi apparatus to the cell surface. for both ct , st and glcnacti it is clear that regions flanking the transmembrane domain can augment the efficiency of golgi localization. for example, additional sequences from the stem region and the cytoplasmic tail increase the efficiency of golgi localization of ~ , st and glcnacti [ , , ] , although the tail and/or stem of ~ , st and glcnacti alone is not capable of retaining a reporter molecule to the golgi - , , ] , and removal of the stem region from wild type c~ , st [ ] or / , gait [ does not disrupt golgi localization. although the potential of the stem region has been addressed in a number of studies, the potential role of the catalytic domain has been overlooked in most studies. yet, removal of the cytoplasmic tail and stem, and considerable alteration of the signal/anchor domain, still allowed hybrid ~ , st molecules containing the catalytic domain to be golgi localized [- , ] . it should be noted that the membrane flanking sequences, comprising short stretches of charged residues, were maintained in the latter constructs which may also be an important factor in localization. the contribution of the membrane flanking sequences of /~i, galt to golgi localization is not known. comparison of the results of all these studies is not straightforward and there are a number of factors which may account for the apparent lack of agreement between them. first, as yet there is no direct evidence that glycosyltransferases localized to different golgi subcompartments are retained by identical mechanisms. there may be subtle differences between the localization signals which specify residency in medialand trans-cisternae and in the tgn. second, in the majority of studies sequences involved in golgi localization have been identified by their ability a. third, the definition of the transmembrane domain has varied from group to group; in some cases the transmembrane domain has been defined as a stretch of hydrophobic residues, excluding charged residues necessary for anchoring membrane proteins within the lipid bilayer [ , ] , whereas in other studies, two or three charged amino acids on either side of the hydrophobic stretch have been included in the sequence defined as the transmembrane domain [ ] . fourth, the appearance of hybrid or mutant glycosyltransferases at the cell surface has been frequently used as a measure of disruption of golgi localization. however, the majority of groups have only used fluorescence microscopy to compare levels of cell surface expression. fluorescence microscopy is relatively insensitive and comparisons are at best qualitative. only a few studies have employed the more sensitive and quantitative technique of flow cytometry to compare levels of cell surface expression between constructs. fifth, the expression levels of the hybrid constructs vary between and within studies. we believe this to be a critical factor in assessing these results. whereas many groups have shown that the native gtycosyltransferases can be expressed at very high levels without saturation of the golgi retention mechanism [ , , , ] [ ] . of expression have been observed between different constructs. in addition, for any one construct there is a wide range of expression within one transfection experiment, and indeed dahdal and colley [ ] noted that surface expression of some hybrid proteins appeared to be related to the level of expression in the cells. in our studies on fll, galt, we have observed that a construct expressed transiently in cos cells showed a different intraceltular distribution to that of the same construct stably expressed in mouse l cells. replacement of the amino acid transmembrane domain of /?i, galt with the amino acids from the transferrin receptor resulted in abundant cell surface expression in cos cells, and with very little detected in the golgi region, whereas in stable l cells, which expressed the hybrid molecules at a to -fold lower level, substantial amounts of the hybrid molecules were specifically retained within the golgi apparatus [ ] . clearly, stable clones expressing low levels of the hybrid molecules are likely to be more informative. sixth, several groups have identified glycosyltransferase sequences which are capable of conferring golgi localization upon reporter proteins, but have neglected to assess the role played by these sequences within the context of the full length enzyme. strategies involving reporter proteins are useful for determining the minimum sequence requirements for golgi localization of hybrid molecules. however, it does not necessarily follow that a sequence which is sufficient to confer golgi localization upon a reporter molecule is the only sequence involved in retention of the native enzyme. this point was illustrated earlier with the golgi localization of glycosyltransferases bearing substituted transmembrane domains (figs - ) . furthermore, most of the studies which have made substitutions of the native enzyme have not assessed the effect of those substitutions on enzyme activity, thus it is unclear whether the structure of the luminal catalytic domain has been perturbed in these studies. in our recent study on glcnacti [ ] we have attempted to address many of these problems and have assessed the relative contribution of the cytoplasmic tail, transmembrane domain, and catalytically active luminal domain in medial-golgi localization. stable l cell clones expressing hybrid molecules were generated, and clones which expressed equivalent amounts of hybrid proteins were selected for analyses. all hybrid molecules expressing the luminal domains of glcnacti were catalytically active, inferring a native structure for this domain. cellular localization was assessed by fluorescence microscopy, immuno-electron microscopy and flow cytometry. overall our study showed that each of the three gicnacti domains contributes significantly to medial-golgi localization. soluble, catalytically active forms of /?i, galt and ~ , st, which lack the cytoplasmic tail, transmembrane domain, and luminal stem region, have been detected in body fluids and are thought to be derived from the membrane-bound forms by proteolytic cleavage [ - . when the cytoplasmic tail, transmembrane domain and luminal stem region of either /~ , st or /~i, galt are replaced by a cleavable signal sequence, the resulting truncated enzymes are also rapidly secreted from transfected cells [ , ] . from these data it has been argued that the catalytic domains of glycosyltransferases do not contain golgi retention signals. however, it is entirely possible that the luminal catalytic domain can only function in golgi retention if it is anchored to the membrane. at this stage the relative contribution of the stem and catalytic domains in the localization of glycosyltransferases is unresolved. overall, it is most unlikely that golgi retention is determined by a discrete and continuous sequence motif or peptide segment, but rather localization of golgi glycosyltransferases could be mediated by interactions spanning the entire length of the molecule. what are the possible mechanisms for the compartmentspecific localization of the membrane-bound glycosyltransferases? it is unlikely that localization of glycosyltransferases involves a simple receptor-ligand interaction where the receptor is fixed in the golgi cisternae, as over-expression of wild-type transferases does not result in saturation of the retention mechanism [ , - , , ]. an alternative glycopinion mini-review possibility is that escaped golgi glycosyltransferases are retrieved from post-golgi compartments, as with soluble er proteins. however, experimental evidence seems to argue against a retrieval system for golgi glycosyltransferases. wong et al. [ ] have demonstrated that ~ , st leaked to the cell surface is not retrieved back to the golgi apparatus. also we [ ] have demonstrated that/~i, galt which has escaped golgi retention undergoes a post-translational modification, probably in the tgn, before appearance at the cell surface; as the golgi-localized /? , gait does not accumulate this modification, a retrieval system would appear unlikely to play a dominant role. what could be the basis of an active golgi retention mechanism? it has been suggested by a number of investigators that retention of golgi glycosyltransferases could be mediated by the formation of protein aggregates within the membranes of the correct golgi cisternae [ ] [ ] [ ] . this model proposes that such oligomers or aggregates of glycosyltransferases would then be excluded from entry into vesicles for forward transport. although an attractive hypothesis, the evidence for aggregation remains largely indirect. recent elegant experiments performed by nilsson et al. [ ] have shown that the addition of an er retention motif to the glcnacti cytoplasmic tail not only causes glcnacti to localize to the er but also partially retains another medial-golgi enzyme, namely ~mannosidase ii, within the er. furthermore, burke [ ] has demonstrated co-precipitation of glcnactii activity, another medial-golgi enzyme, using antibodies specific to glcnacti. as the amino acid sequences of the glcnacti and glcnactii transferases are not related, a likely explanation is the association of enzymes which occupy the same golgi cisternae. warren and colleagues have coined the term 'kin recognition' to denote this self-association of golgi enzymes [ ] . the proposed aggregation of golgi glycosyltransferases is also consistent with earlier observations that the majority of glcnacti and/~i, galt exist as high molecular weight material following detergent extraction of tissue [ , ] . how could the three domains of a glycosyltransferase play a role in aggregation? the fact that each domain of glcnacti is required for complete golgi retention implies that all three domains may be involved in the lateral interactions which lead to aggregate formation. for example, the transmembrane domains of resident golgi proteins may mediate homo-or hetero-dimerization via protein protein interactions along uncharged polar faces of c~-helixes, predicted for some of the glycosyltransferases [ ] , or along one face of the c~-helix containing a leucine zipper, as predicted for glcnacti [ ] . such dimers may form prior to their arrival in the golgi, as indicated by the results of er retention of glcnacti/manii [ ] . these homo-or hetero-dimers may then be induced to interact, within the correct golgi microenvironment, through their large luminal domains, resulting in a two-dimensional aggregate ( fig. ) . aggregation may be induced by differences within the golgi cisternae, such as ph and calcium concentration. this model differs somewhat from that of warren's group which proposes that the golgi enzymes form homo-dimers (via their stem regions) and interact via their transmembrane domains with different neighbours to generate linear hetero-oligomers. ~-mannosidase ii has been shown to be a disulphide-bonded dimer, but there is no evidence of stable covalent dimer formation for any of the golgi glycosyltransferases. the first report of a purified membrane-bound form of glycosyltransferase, namely /~i, galt, indicates that no disulphide bonded dimers exist [ ] , contrary to an earlier suggestion [ ] . in addition, warren's model of linear heteroaggregates cannot readily explain the efficient golgi retention of hybrid molecules containing a transmembrane domain of a glycosyltransferase and a reporter molecule known to be a monomer in the native state, such as ovalbumin [ , , ] . retention of such hybrid molecules could only occur at the ends of the iinear aggregates, via their transmembrane domains, and would effectively cap these linear structures resulting in only a very minimal number of hybrid molecules retained in the golgi apparatus. finally, the cytoplasmic tail of the glycosyltransferases may be necessary for either transmembrane-mediated dimerization or, as proposed by slusarewicz et al. [ ] , may interact in a salt-dependent manner with a putative intercisternat matrix. consistent with this proposal, the differences in solubility of wild-type gicnacti and the glcnacti hybrid proteins indicate that golgi localized molecules may exist in a different physical state from their cell surface counterparts [ ] . an interaction of the glycosyltransferases with the intercisternal matrix (the "golgi glue'), either directly or indirectly, would ensure that the aggregates are immobilized within the golgi membranes and so are excluded from transport vesicles. clearly an aggregation model of retention may involve many additional components and further biochemical analysis is now required. a model of golgi localization also needs to account for the presence of soluble catalytically active forms of fil, galt and ~ , st which have been detected in body fluids. the retention model proposed above could allow the release of soluble catalytic oligomers from the golgi aggregate by proteolytic cleavage, with the subsequent dissociation of the oligomers to monomers. gotgi membranes differ in lipid composition from the er and plasma membranes. such lipid differences may be important in mediating interactions between the transmembrane domains of glycosyltransferases. a lipid mediated mechanism of protein sorting has been proposed by bretscher and munro [ ] who have suggested that the typically shorter transmembrane domains of golgi proteins may interact selectively with the low cholesterol bilayers of golgi membranes and be excluded from the thicker, cholesterol-sphingotipid enriched bitayers of post-golgi membranes. a protein-lipid interaction is compatible with the observations that the length of the hydrophobic stretch coupled with the adjacent flanking residues is important in gotgi retention, rather than the actual amino acid sequence. the models discussed above are by no means mutually exclusive and it is possible that the golgi retention mechanism includes both a protein-lipid interaction (via the transmembrane domains of the proteins) as well as protein-protein aggregation. based on the aggregation model of golgi retention outlined above, wild type glycosyltransferases expressed in transfected cells may be retained within golgi cisternae as a consequence of self aggregation, or by virtue of their ability to interact or 'dock' to existing aggregates within the golgi apparatus of the mouse cells. self aggregation, as opposed to docking, represents a potentially non-saturable means of retention, consistent with the many reports that glycosyltransferases expressed in heterologous cell lines do not leak to the plasma membrane even when expressed at vastly elevated levels [ , - , , ] . on the other hand, hybrid constructs would have a reduced capacity to self-aggregate, due to insufficient domains, and may be retained by interacting or 'docking' to existing aggregates within goigi membranes, either through their luminal domain or transmembrane domain (fig. ) . this would be a more readily saturable means of retention, with only a finite number of exposed, endogenous molecules available as 'docking' sites. the fact that golgi localized glcnacti hybrid proteins, including those which lack the glcnacti cytoplasmic tail, are predominantly localized to medial-golgi cisternae is in agreement with this proposal [ ] . thus, golgi localization of hybrid molecules probably reflects the minimum structure(s) required to 'dock' with endogenous golgi aggregates. this model would also help to explain the discrepancies between studies as the expression levels of the hybrid molecules would be an important factor in the efficiency of golgi localization. the high level of conservation of individual glycosyltransferases across species is also consistent with structural constraints imposed by such an aggregation model. a conserved structure would be required in order to preserve the many interactions with neighbouring enzyme molecules of a heteroaggregate within the golgi compartment. if golgi glycosyltransferases have evolved with a fundamental requirement for such inter-molecular interactions, it would also explain the conservation of the retention mechanism across species and also the ability of an animal glycosyltransferase to be apparently correctly housed in the golgi apparatus of plant cells [ ] . while most viruses mature at the plasma membrane, a limited number of viruses acquire their envelopes by budding into intracellular compartments. viruses which assemble from golgi membranes include, coronavirus, bunyavirus and pox virus [ , ] . viral budding from the golgi apparatus is probably determined by the targeting of one or more viral glycoproteins to the golgi membranes. indeed, a number of viral proteins have been shown to be independently targeted to the golgi apparatus, including the m glycoproteins of an avian coronavirus [ ] and a related murine coronavirus [ , ] , the e and e spike glycoproteins of rubella virus [ ] and the g glycoprotein of punta tora virus [ ] . as a consequence of the specific localization of these viral glycoproteins they represent useful tools for the study of protein targeting to the golgi apparatus. the m (formerly called el) glycoprotein of the avian coronavirus, infectious bronchitis virus (ibv), has been shown to be localized specifically to the cis-golgi cisternae [ ] . in contrast to the type ii membrane orientation of the glycosyltransferases, the ibv m gtycoprotein contains a short glycosylated amino-terminal domain, three membrane spanning domains and a carboxy-terminat cytoplasmic tail. only the first of the three membrane spanning domains of m glycoprotein of ibv is required to retain this protein in the golgi [ ] . furthermore, this membrane spanning domain is sufficient to confer golgi localization upon a plasma membrane localized protein [ ] . thus, as for the gtycosyltransferases of the medial and trans-cisternae and the tgn, the transmembrane domain of a resident protein of the cis-cisternae has also been implicated in retention. extensive mutagenesis showed that four polar residues in the first m transmembrane domain were critical for golgi retention of a hybrid protein with the vsv g glycoprotein [ ] . these four polar residues are predicted to form an uncharged polar face along one side of an c~-helix, which has potential to be involved in protein-protein interactions and mediate oligomer formation. indeed, aggregation has been shown to correlate with retention of this m hybrid protein in the golgi apparatus [ ] . these investigators demonstrated that the appearance of sds-resistant aggregates of the m hybrid protein correlated with golgi localization, whereas mislocatized transmembrane domain mutants do not form oligomers. the aggregates have not been biochemically characterized but it is possible that they include endogenous golgi proteins. however, sds-resistant oligomers of the native m glycoprotein were not detected in this study [ ] , thus the relationship between aggregate formation of the m hybrid molecule and golgi retention of the native m protein remains unclear. in contrast to the findings for the m glycoprotein of ibv, machamer et al. [ ] in the past few years it has become apparent that there is a distinct set of resident gotgi proteins in the tgn of mammalian cells, and the late golgi of yeast, that have features associated with their localization which are distinct from the golgi glycosyltransferases [ ] [ ] [ ] . these differences are associated with the structure of the proteins. the group includes the mammalian tgn / [ ] and furin [ ] , and the yeast proteolytic enzymes kexlp, kex p, and dipeptidyl aminopeptidase a (dpap a) (for review see [ ] ). in contrast to the golgi glycosyltransferases, tgn / , furin, kextp and kex p are type i membrane proteins, however, membrane orientation is not a distinguishing characteristic of the group as dpap a is a type ii membrane protein. in contrast to the golgi glycosyltransferases, the cytoplasmic tail of all these proteins is essential for golgi localization and, in addition, a retrieval signal plays a role in defining residence of these proteins to the tgn or late golgi. tgn / is a heterodimeric membrane protein complex which cycles between the tgn and the cell surface [ , [ ] [ ] [ ] . tgn / has been shown to interact with cytosolic proteins and may be involved in the formation of exocytic vesicles from the tgn [ , [ ] [ ] [ ] . a number of groups have demonstrated that the tetrapeptide sequence yqrl, within the amino acid cytoplasmic tail of tgn , is both necessary and sufficient to target this membrane protein to the tgn [ ] [ ] [ ] . this tyrosine-containing motif also acts as an internalization motif from the plasma membrane, via interaction with clathrin-coated pits. recently this tyrosine motif has been shown to lie within an ~-helix, and not a tight /~-turn conformation which is typical of other tyrosine-containing internalization motifs [ ] . there is evidence that individual amino acids around the tyrosine of the tgn / internalization motif could signal different intracellular locations. for example, mutation of the yqrl sequence to yqdl abrogated tgn localization of tgn but did not affect internalization [ ] . recently furin, a membrane associated subtilisin-like protease, has been shown to be concentrated in the tgn [ ] . like tgn / , furin also cycles between the cell surface and tgn. sequences of the cytoplasmic tail of furin are required for tgn targeting, and a potential tyrosine motif has been identified [ , ] . on the other hand, potential tyrosine motifs for internalization appear to be absent in the cytoplasmic tails of gotgi glycosyltransferases. the yeast proteins dpap a, kex p and kexlp are all integral membrane proteins with cytoplasmic tails of about amino acids. these cytoplasmic tails are required for retention of these enzymes in the late golgi since deletions in the tail reduce the efficiency of retention [ ] [ ] [ ] . the retention signals within the cytoplamic tails of these proteins have been identified and are very similar to the proposed general motif for clustering into clathrin-coated pits of animal cells (see [ , ] ). deletion of the golgi retention signal, or over-expression of these proteins, results in mislocalization to the vacuolar compartment. this initially surprising finding has led to the conclusion that the default destination for membrane proteins in the yeast secretory pathway is the vacuolar compartment and not the plasma membrane. studies on the yeast vps mutants suggest that dpap a may leak from the late golgi and is transported, via the default pathway, to a post-golgi/pre-vacuolar compartment [- ] . the cytoplasmic localization signals of these escaped dpap a moiecules then mediate retrieval back to the late golgi; in the absence of the cytoplasmic tail golgi localization signals these membrane proteins would continue to be transported along this default pathway to vacuoles. thus there are clear similarities in the mechanism of golgi localization of these yeast proteolytic enzymes and mammalian tgn / and furin. the kdel receptor resides in the cgn and possibly throughout the entire golgi stack [ , ] . the kdel receptor is predicted to have six or seven transmembrane domains [ ] . empty receptors do not recycle back to the er, however, after binding to ligand the ligand-receptor complex is then returned by retrograde transport to the er [ ] . thus, this receptor is likely to have signals for goigi localization. however, mutational analysis of the kdel receptor, although defining structural features associated with ligand binding and retrograde transport, revealed very little about the nature of the putative golgi localization signal [t ]. there are a number of structural membrane proteins and proteins associated with the machinery of vesicular transport that are localized specifically to the golgi apparatus, for example/~-cop [ ], rab and tab [ ] , p [ ] , p [ ] , heterotrimeric g proteins [ , ] , sec [ ] and the actin binding protein, comitin [ ] (table ) . these are not integral membrane proteins as they do not have transmembrane domains, but rather are peripheral membrane proteins associated with the cytosolic face of golgi membranes. some of these components recycle between a cytosolic pool and golgi membranes. in general very little is known about the gotgi localization signals for these peripheral membrane proteins. there is evidence that the carboxy-terminat region of the gtp binding protein g,n is required for golgi membrane binding [ ] . membrane association of rab proteins requires the geranylgeranylation of one or two c-terminal cysteines [ ] as well as a localization signal to define the organelle-specificity. the hypervariable c-terminus of rab proteins has been implicated in localization [ ] , although a recent study on rab indicates that efficient localization of this rab protein to golgi membranes requires both n-terminal and c-terminal domains [ ] . the identification of the precise nature of the targeting signals and the mechanism of localization of these peripheral membrane proteins will be important to the understanding of the organization of the golgi apparatus and vesicular transport. it is now apparent that the localization of resident golgi proteins includes more than one mechanism. for some late golgi membrane proteins a retrieval system operates to recycle proteins from post-golgi compartments. on the other hand, golgi glycosyltransferases appear to be actively retained within golgi membranes; there is no evidence that glycosyltransferase molecules which have leaked from the golgi apparatus can be retrieved. from many 'cut and paste' experiments it is apparent that the localization of glycosyltransferases does not involve a discrete retention signal but may be dependent on many interactions spanning the length of the molecule. furthermore, there is increasing evidence to suggest that retention of glycosyltransferases involves the formation of aggregates within the golgi apparatus. the challenge now is to biochemically characterize these aggregates, to identify any associated molecules that may be important in mediating retention, and to identify the conditions which induce aggregation. this will require the development of novel strategies to allow the isolation and biochemical analyses of individual golgi compartments, in particular the lipid composition, the organization of the resident proteins within golgi membranes, and the nature of interactions with the intercisternal matrix. thus, the problem now is understanding the biogenesis of golgi membranes themselves. the enzymes of biological membranes (martonosi an this work was supported by grants from the national health and medical research council of australia and the australian research council. key: cord- -wnkjdg y authors: li, sheng-hsiang; lee, robert kuo-kuang; hsiao, ya-ling; chen, yee-hsiung title: demonstration of a glycoprotein derived from the ceacam gene in mouse seminal vesicle secretions( ) date: - - journal: biol reprod doi: . /biolreprod. . sha: doc_id: cord_uid: wnkjdg y ceacam was purified from mouse seminal vesicle secretions by a series of purification steps that included ion exchange chromatography on a deae-sephacel column and ion exchange high-performance liquid chromatography on a sulfopropyl column. it was shown to be a -kda glycoprotein with an n-linked carbohydrate moiety. the circular dichromoism spectrum of ceacam in mm phosphate buffer at ph . appeared as one negative band arising from the β form at nm. ceacam was expressed predominantly in seminal vesicles of adult mice. both ceacam and its mrna were demonstrated on the luminal epithelium of the mucosal folds in the seminal vesicle. the amount of ceacam mrna in the seminal vesicle was correlated with the stage of animal maturation. castration of adult mice resulted in cessation of ceacam expression, while treatment of castrated mice with testosterone propionate in corn oil restored ceacam expression in the seminal vesicle. during the entire course of pregnancy, ceacam might be silent in the embryo. a cytochemical study illustrated the presence of the ceacam binding region on the entire surface of mouse sperm. ceacam -sperm binding greatly enhanced sperm motility in vitro. mammalian sperm display an intriguing sense of timing in undergoing some modification during their transit in the reproductive tract before encountering an egg. studying how the lumen of the reproductive tract affects sperm function is a prerequisite to unraveling the molecular mechanisms underlying the complex modification of sperm. factors that affect sperm motility have been reported in the seminal plasma of several mammals including the pig [ ] [ ] [ ] , bovine [ ] , mouse [ ] , and human [ ] . the seminal vesicle is a male accessory sexual gland found in many species of more than mammalian species alive on the earth today. after puberty, the gland se- cretes a fluid called seminal vesicle secretion (svs), which accumulates in its lumen. svs contains both protein and nonprotein components. when ejaculated, svs squirts into the urethra, contributing the major part of the liquid portion of seminal plasma, which is the complex biological fluid formed from mixing of various fluid in the male reproductive tract. it has been found that extirpation of the seminal vesicle from mice and rats greatly reduces fertility [ , ] , demonstrating the importance of svs to sperm modification under natural circumstances. svs differs extensively in terms of volume and composition in various species of mammals. however, rodents have proven to be good experimental animals for the molecular study of mammalian reproduction, so attempts have been made to isolate the proteins involved in sperm modification by mouse svs, which contains several minor proteins and seven well-defined major proteins designated svs i-vii, named in decreasing order of molecular mass according to their mobility in sds-page [ ] . previously, we demonstrated that svs vii enhances sperm motility [ ] , and two of the minor proteins modulate sperm activity. one is a caltrin-like trypsin inhibitor/p , which suppresses ca ϩ uptake by sperm [ ] , and the other is a seminal vesicle autoantigen, which serves as a decapacitation factor [ , ] . here we report the purification and identification of an androgen-stimulated -kda glycoprotein, a minor protein component of mouse svs that is able to enhance sperm motility in vitro. we have demonstrated that its core protein is derived from the ceacam gene [ ] , which is a member of the cell adhesion molecule (cam) subgroup belonging to the carcinoembryonic antigen (cea) family. the following materials were obtained from commercial sources: deae-sephacel (amersham pharmacia biotech, uppsala, sweden); protein pak sp pw column (waters, milford, ma); vydac tp c column (separations group, hesperia, ca); aminolink coupling gel, bicinchoninic protein assay kit (pierce, rockford, il); testosterone propionate, nitroblue tetrazolium, -bromo- -chloro- -indolyl phosphate (bcip), pmsf, periodic acid schiff reagent, and silanated glass slides (sigma chemical co., st louis, mo); cdna integrity kit, alkaline phosphataseconjugated streptavidin, and biotin-conjugated goat anti-rabbit immunoglobulin g (igg; kirkegaard & perry laboratories, gaithersburg, md); rhodamine-conjugated goat anti-rabbit igg (zymed laboratories, san francisco, ca); nuclear fast red (vector laboratories, burlingame, ca); enhanced chemiluminescent substrate and [␣- a) fractionation of soluble mouse svs proteins by ion exchange chromatography on a deae-sephacel column. b) resolution of fraction iii sample from a by ion-exchange hplc on an sp column. c) demonstration of the glycoprotein nature. each of the a-to-d peaks from b were digested with n-glycosidase f. the parent proteins (lane , peak a; lane , peak b; lane , peak c; lane , peak d) and their deglycosylated forms (lanes , , , and ) were identified by sds-page on a % polyacrylamide gel slab. the proteins in the gel were stained with coomassie brilliant blue. pgem-t-easy vector (promega, madison, wi); ultraspec-ii rna isolation kit (biotecx laboratories, inc., houston, tx); thermoscript reverse transcriptase (invitrogen life technologies, carlsbad, ca); n-glycosidase f, taq dna polymerase (takara, shiga, japan); and mouse embryo-stage blots designed to observe gene expression during pregnancy (seegene, inc., korea). all other chemicals were reagent grade. outbred icr mice were purchased from charles river laboratories (wilmington, ma) and were maintained and bred in the animal center at the college of medicine, national taiwan university. animals were treated according to institutional guidelines for the care and use of experimental animals. they were housed under controlled lighting ( l: d) at - Њc and were provided with water and nih- laboratory mouse chow ad libitum. for investigation of androgenic effects, -wk-old adult male mice, which had been castrated wk earlier, received a daily s.c. injection of testosterone propionate in corn oil ( mg/kg body weight) for consecutive days. control animals received corn oil only. seminal vesicles were removed from the animals h after the last injection. normal adult mice ( to wk old) were killed by cervical dislocation. the seminal vesicles of mice were carefully dissected to free them from the adjacent coagulating glands, and the secretions were squeezed directly into ml of ice-cold mm tris-hcl in the presence of mm pmsf at ph . . after centrifugation at ϫ g for min, the supernatant was resolved by ion exchange chromatography on a deae-sephacel column ( ϫ . cm) that had been pre-equilibrated with mm tris-hcl at ph . . after the nonretarded fractions were washed out, the column was eluted with . m nacl in the same buffer at a flow rate of ml/h. fractions ( ml) were collected, and their absorbance at nm was recorded (fig. a) . fraction iii was further subjected to ionexchange high-performance liquid chromatography (hplc) on a sulfopropyl (sp) column ( . cm ϫ . mm). the column was sequentially eluted with three linear gradients, including %- %, %- %, and %- % of . m nacl in mm sodium acetate at ph . at a flow rate of . ml/min for min (see fig. b ). thirty micrograms of each peak (a-d) in l of mm tris-hcl ph . were digested with . g of trypsin-tpck at Њc for h. the reaction was stopped by adding l of . % trifluoroacetic acid and the reaction mixture was resolved by a reverse-phase c column ( . ϫ mm). the column was eluted with a linear gradient of %- % acetonitrile at a flow rate of . ml/min for min (see fig. a ). the n-glycoconjugate was removed from a glycoprotein by following the method described by tarentino and plummer [ ] . the protein was boiled in . % sds and incubated with n-glycosidase f ( u/mg of protein) in mm sodium phosphate at ph . in the presence of mm edta, . % nonidet p- , and mm sodium azide for h at Њc. the concentration of ceacam was determined using the bicinchoninic acid protein assay [ ] according to the manufacturer's instructions. the amino acid sequence was determined using automated edman degradation with a protein sequencer with an online c analyzer (applied biosystems, foster city, ca). the circular dichromoism (cd) spectra were measured with a jasco j- spectropolarimeter under constant flushing with n at room temperature. the mean residue elipticity () was estimated from the mean residue weight, which was calculated from the primary structure. antisera against ceacam were raised in new zealand white rabbits. the anti-ceacam antibody was purified from the antiserum on a column ( . ϫ . cm) of aminolink gel coupled with the purified antigen according to our previously described method [ ] . proteins were resolved using sds-page on a % gel slab ( . ϫ . ϫ . cm) according to the method described by laemmli [ ] . the proteins on the gel were stained with coomassie brilliant blue or transferred to a nitrocellulose membrane using an electroblotting method, which was conducted at v at Њc for h in a solution of mm tris-hcl, mm glycine, and . % methanol. membranes were blocked with % normal goat serum in pbs for h, and then incubated with anti-ceacam antibody ( . g/ml) in the blocking solution for h at room temperature. after gently agitating in four changes of pbs for min each, they were immunoreacted with horseradish peroxidase-conjugated goat anti-rabbit igg diluted to : in the blocking solution for h. immunoreactive bands were revealed using an enhanced chemiluminescent substrate according to the manufacturer's instructions. mouse seminal vesicles were fixed in bouin solution, embedded in paraffin, and -m serial cross-sections were mounted on silanated glass slides. deparaffinized sections were blocked in blocking solution for h at room temperature and then incubated with affinity-purified anti-cea-cam antibody at a concentration of g/ml in the blocking solution for h. the slides were gently agitated in three changes of washing solution for min each and then treated with biotin-conjugated goat antirabbit igg ( g/ml) in the blocking solution for h at room temperature. the slides were washed again as described above and then incubated with alkaline phosphatase-conjugated streptavidin ( g/ml) in blocking solution for h at room temperature. protein signals were observed after the slides were incubated for min with . % nitroblue tetrazolium and . % bcip in a solution of mm tris-hcl, mm nacl, and mm mgcl at ph . . the slides were washed in three changes of water for min each and then counterstained with nuclear fast red for min. fig. . identification of the -kda glycoprotein derived from ceacam and its circular dichroism. a) the trypsin-digested sample of peak c from figure b was resolved by reverse-phase hplc on a c column (see materials and methods). b) the protein sequence was deduced from the reading frame of ceacam cdna (genbank accession number nm ). the initial and stop codons are underlined. the potential n-linked glycosylation sites are denoted by open boxes. the deduced protein sequence and the amino acid sequences determined directly from protein analysis for peaks a to d in figure b and peaks and in figure a agree in all positions except that asn from the cdna-deduced protein was not identified in protein sequencing. the cleavage points for the generation of mature protein are indicated by an arrow. c) circular dichroism of ceacam in mm phosphate buffer at ph . at room temperature. finally, the slides were briefly washed with water and photographed using a brightfield microscope (ah -rfca; olympus, tokyo, japan). seminal vesicle tissues were oriented in a small aluminum foil cup and frozen immediately in tissue-tek oct medium. embedded samples were then stored at Ϫ Њc before further processing. serial -m cryostat sections were mounted on uncoated glass slides and stored at Ϫ Њc. before use, sections were fixed in % ethanol and stained with a histogene lcm frozen section-staining kit following the supplier's protocol. slides were dipped in xylene twice for min each time and then air-dried. mucosal folds or smooth muscle cells were harvested using a pixcell ii lcm system (arcturus engineering, mountain view, ca). captured tissues from three sections were collected on capsure hs lcm caps containing transfer film. tissue samples from three different animals were pooled for subsequent analysis. total rna was extracted from the captured cells by using the picopure rna isolation kit, followed by treatment with dnase i before cdna synthesis. the total rna was reverse-transcribed using thermoscript reverse transcriptase for first-strand cdna synthesis according to the manufacturer's instructions. a cdna integrity kit was employed to examine the quality of the cdna. the qualified cdna samples were used as templates for pcr. the primer pair for amplification of a -base pair (bp) ceacam gene fragment was a forward primer ( Ј-tatgctattt-caaaacttcgatcat- Ј), which corresponds to sequence - , and a reverse primer ( Ј-gttatgcggactttattg- Ј), which corresponds to sequence - (genbank accession number nm ). the primer pair for amplification of a -bp mouse glyceraldehyde- -phosphate dehydrogenase (gapd) dna fragment was a forward primer ( Ј-cggcaaattcaacggcacagt- Ј), which corresponds to sequence - , and a reverse primer ( Ј-tgggggtaggaacacggaagg- Ј), which corresponds to sequence - (genbank accession number xm ). pcr reaction mixtures consisted of l of template, . for sec at Њc, extension for sec at Њc, and a final extension for min at Њc. the polymerase chain reaction (pcr) products were analyzed by electrophoresis on a . % agarose gel. the identity of pcr products was confirmed by cloning and sequencing. dna sequencing was carried out with an abi prism - dna sequencer using the abi prism bigdye terminator cycle sequencing ready reaction kit (applied biosystems). total rna was extracted from tissue homogenates using an ultraspec-ii rna isolation kit. a pcr-amplified fragment of ceacam cdna ( bp), which was inserted in pgem-t-easy, and a cdna fragment of the mouse gapd gene ( bp), which was inserted in pgem vector, were used as a template to prepare a p-labeled cdna probe using a promega random-priming kit. rna samples ( g) were subjected to denaturing by . % agarose-formaldehyde gel electrophoresis and then blotted onto nylon membranes by capillary transfer as previously described [ ] . after incubation with the prehybridization buffer ( % deionized formamide, ϫ ssc, ϫ denhardt solution, . % sds, and g/ml of sheared salmon sperm dna) for h at Њc, the membranes were hybridized with one labeled probe overnight at Њc. following hybridization, the membranes were washed using standard procedures. rna messages on one filter membrane were observed after autoradiography and the probes were removed from the membranes as previously described [ ] . the same membrane was then hybridized with another labeled probe. thus, hybridization with ceacam or gapd cdna probe was performed on the same filter membrane. in accordance with a method previously used [ ] , a modified tyrode buffer, which consisted of . mm nacl, . mm kcl, . mm mgcl , . mm nah po , . mm glucose, . mm sodium pyruvate, mm nahco , mm hepes, iu/ml penicillin, and g/ml streptomycin was adjusted to ph . - . by aeration with humidified air/co ( : ) in an incubator for h at Њc before use. mouse epididymides were removed and immersed in the medium. after careful dissection from the connective tissue, spermatozoa were extruded from the distal portion of the tissues for min at Њc. the cells were gently filtered through two layers of nylon gauze, layered on top of a linear gradient of %- % percoll (v/v), and centrifuged at ϫ g for min at room temperature [ , ] . three distinct cell layers formed. the lowest layer, which contained cells with progressive motility, was washed with three volumes of the medium and collected using centrifugation at ϫ g for min at room temperature. the sperm were resuspended and centrifuged two more times in a similar manner. the cell pellets were resuspended, and cacl was added to the culture medium at a final concentration of . mm before the sperm were assayed. freshly prepared epididymal spermatozoa ( cells/ml) were blocked in pbs containing % normal goat serum for min at room temperature. the cells were further incubated with m ceacam for h. at the end of incubation the cells were centrifuged and the cell pellets were washed with pbs to remove the unbound ligands. the cells were air-dried on a glass slide and washed twice with pbs. the slides were incubated with the affinity-purified anti-ceacam antibody at a concentration of g/ml in blocking solution for h. the slides were washed three times with pbs to remove excess antibody before they were incubated with rhodamine-conjugated goat anti-rabbit igg diluted to : in blocking solution for min. all slides were then washed with pbs, covered with % (v/v) glycerol in pbs, and photographed with a fluorescence microscope (axioplan imaging; carl zeiss, oberkochen, germany). ejaculated sperm were collected from semen that existed in the uterine cavity of three vagina-plugged female mice. after extensive washing with pbs the ejaculated sperm without incubation with the exogenous cea-cam were smeared onto slides for immunolocalization of ceacam as mentioned above. sperm motility was determined using a computer-assisted sperm assay (casa) with a sperm motility analyzer (ivos version ; hamilton-thorne research, beverly, ma). a -l sample was placed in a -m-deep makler chamber at Њc. the analyzer was set as follows: negative phasecontrast optics and recording at frames/sec; minimum contrast, ; minimum cell size, pixels; low-size gate, . ; high-size gate, . ; lowintensity gate, . ; high-intensity gate, . ; nonmotile head size, ; nonmotile head intensity, ; medium average path velocity, m/sec; low path velocity, . m/sec; slow motile cells, yes; and threshold straightness, greater than %. ten fields were assessed for each sample. the fresh preparation of soluble svs was divided into fractions. i to iv by ion exchange chromatography on a deae-sephacel column (fig. a) . the fraction iii sample was further resolved into peaks a-e (fig. b) by ion exchange hplc on an sp column. peak e was a fad-dependent sulfhydryl oxidase (unpublished observation). on reducing sds-page gel, each of the a-to-d peak samples gave one rather broad -kda band that could be stained with either coomassie brilliant blue or periodic acid-schiff reagent, demonstrating their glycoprotein nature (fig. c, lanes , , , and ) . each protein sample could be deglycosylated either by trifluoromethane sulfonic acid or exhaustive digestion with n-glycosidase f to a core protein that was identified as a sharp band between and kda by sds-page (fig. c, lanes , , and ) , indicating a similar molecular mass of the protein cores. apparently, peaks a-d were glycoproteins with an n-linked carbohydrate moiety. they were purified to homogeneity. automated edman degradation for each a-to-d peak samples for cycles gave reliable data, which were assembled to the n-terminal sequences. avppxvtadnnvll was determined from the peak a sample. two amino acids were detected in each cycle during protein analysis for each peak (b to d). the actual yield of the two sequences in an individual cycle was such that the ratio of the major sequence to the minor one was estimated to be . - . : . assembly of the major and minor sequences gave a peptide sequence of aqvtveavppxvta and qvtveavppxvta, respectively. the three n-terminal peptides were completely confirmed in the ceacam -deduced protein consisting of amino acid residues in all positions except that x (asparagine), one potential site for an n-linked carbohydrate in ceacam , was not identified in the protein sequencing (fig. b) . the post-translational cleavage at the peptide bond between glu and ala, thr and ala, or ala and gln in the signal peptide of the putative ceacam sequence gives rise to a peak a protein or to peak b-to-d proteins. as a result, peaks a to d share a very similar protein core with a slight difference in their n-terminal sequences. thereafter, we combined them for further study. among the svs protein components on sds-page gel, antibody against cea-cam immunoreacted only to a -kda protein band corresponding to the antigen, showing high specificity of the antibody. taken together, these data indicate that peaks a to d are translational products of the ceacam gene. each peak (a to d) was digested with trypsin, and the digests were subjected to hplc on a c column. the chromatographic patterns of the four trypsin-digested samples were very similar. one representative chromatogram is shown in figure a . three amino acids were identified in each cycle of automated edman degradation of peak on figure a . these data could be assembled to three peptide sequences of vfywyk, etiysn, and aiywyr in cea-cam . the peptide sequence of ndegayaldmlfqnf in ceacam was completely confirmed by automated edman degradation of peak (see fig. b ). ceacam was stable in mm tris-hcl at ph . , but it was degraded to an -kda protein component in % acetic acid (not shown). the cd spectrum of this protein at ph . shows a negative band with a minimum mean residue ellipticity of deg cm dmol Ϫ at nm (fig. c ). in addition, a positive band appeared as the cd profile extending below nm. the spectral profile in the uv region shows some resemblance to that of the ␤ form of protein conformation [ ] [ ] [ ] [ ] , suggesting the presence of a considerable amount of ␤ form, a ␤ turn, or both in the protein molecule. we examined the distribution of ceacam and its rna message in the tissue homogenates of reproductive glands, including the seminal vesicle, epididymis, testis, coagulating gland, vas deferens, prostate, uterus, and ovary. the rna message was predominantly detected in the seminal vesicles (fig. a) . this was confirmed by the results of western blot analysis showing that ceacam was abundant in the seminal vesicle; a trace appeared in the epididymis and prostate only after over autoradiography (fig. b) . when equal amounts of total rna from the homogenate of a nonreproductive organ were compared with those of the seminal vesicle, very litter to no ceacam mrna was found in brain, heart, lung, liver, spleen, kidney, stomach, small intestine, muscle, skin, and thymus (not shown). ceacam was immunolocalized primarily to the luminal epithelium of the mucosal folds in the seminal vesicle slides of adult mice (fig. a) . the smooth muscle layer contained almost none. the strong immunochemical staining in the lumen supports the view that ceacam accumulates in the lumen as a result of its secretion from the luminal epithelium. further, we separated mucosal epithe-lial cells and smooth muscle cells from the tissue slices by lcm. ceacam transcripts were relatively abundant in the mucosal cells, but only trace amounts of the rna message appeared in the smooth muscle cells (fig. b) . the amounts of ceacam mrna in the seminal vesicles of mice at different ages were compared. the rna message first appeared at a considerable level in -wk-old mice. thereafter, the amount of transcript began increasing rapidly at wk and reached a maximum in -wk-old mice (fig. a) . we analyzed mouse embryos from . to . days postcoitus (d.p.c.). the . - . d.p.c. samples included early stage embryos, extraembryonic tissue, and maternal uterus; the . - . d.p.c. samples included embryos and extraembryonic tissues, and the . - . d.p.c. samples were solely embryos. the rna message in the embryo samples was present in trace amounts on . d.p.c, increased remarkably from . d.p.c. to a maximum on . d.p.c., and rapidly declined thereafter to an almost undetectable level until delivery (fig. b) . because seminal vesicle growth is known to be androgen-dependent, we examined how androgen influenced ceacam expression in the seminal vesicles of adult mice that had been castrated wk earlier (fig. ) . ceacam mrna was undetectable in the total rna prepared from the control castrates that had received a daily injection of corn oil only compared with that of normal adults. induction of ceacam mrna was observed in the castrates treated with testosterone ( mg/kg per day) for consecutive days. figure a shows micrographs of epididymal spermatozoa with indirect fluorescence staining. no fluorescence was observed on the cells after they were treated successively with the ceacam antibody and rhodamine-conjugated anti-rabbit igg, demonstrating a lack of cea-cam on the cell surface. when spermatozoa were preincubated with m ceacam in a blocking solution at room temperature for min, rhodamine fluorescence was prominent on the middle piece, relatively weak on the tail, and faint on the head (fig. a, d) . apparently, sperm have ceacam -binding sites that cover the entire cell surface. to access the binding of ceacam to epididymal sperm upon ejaculation, the ejaculated sperm were directly stained with the antibody. ceacam was immunodetected on the surface of the ejaculated sperm, in spite of high fluorescence background due to the free cea-cam that was difficult to removed completely during cell preparation (fig. b, d) . most spermatozoa freshly retrieved from the caudal epididymis of mice in modified tyrode buffer were mobile with visible tail beating. the result of casa for the cell incubation at specified conditions revealed that . m ceacam in cell culture greatly enhanced sperm motility relative to the motility of control cells at any incubation time (fig. c) . this work is the first to purify ceacam from mouse svs. we demonstrated it to be a -kda protein with an n-linked glycoconjugate. asn , asn , asn , and asn , each being part of consensus asn-xaa-(ser/thr) [ , ] in the protein molecule, are the potential acceptor sites for the attachment of the carbohydrate moieties. our results of edman degradation support asn as being one n-glycosylated site but rule out asn in that role. removal of the hydrophobic leader sequence from the putative form of this protein gives a protein core consisting , , or amino acid residues that sum to have a molecular mass of , or daltons, which is close to the molecular size of the deglycosylated proteins as determined by sds-page (fig. c) . the cea family consists of ceacam and pregnancyspecific glycoprotein subfamilies. these evolutionarily and structurally divergent glycoproteins of mammals share many common structural features [ ] . they are characterized by the assembly of immunoglobulin variable (igv)like domain and immunoglobulin constant (igc)-like domains in each member of the family. according to the molecular model established by watt et al. [ ] and tan et al. [ ] , there are nine ␤ strands in one immunoglobulin-like domain involved in the maintenance of the three-dimensional architecture. in the ceacam molecule, residues - form one igv-like domain and residues - form another [ ] . this may account for the characteristic cd shown in figure c . many members of the murine and human ceacam family contain either a transmembrane domain or a glycosyl ptdins moiety, but no such structural element is present in the ceacam -deduced protein sequence, suggesting that ceacam is not a membrane-bound protein [ ] . this is substantiated by our demonstration of its secretion from the luminal epithelium of seminal vesicle (fig. ) . in the sexual glands of adult mice, the ceacam gene is predominantly transcribed and translated in the seminal vesicle. on ejaculation, the ceacam -sperm binding may take place in the semen to enhance sperm motility. finkenzeller et al. [ ] demonstrated that ceacam Ϫ/Ϫ male and female mice developed indistinguishably from wild-type litter mates with respect to sex ratio, weight gain, and fertility, but a significant reduction by % of litter size was observed in ceacam Ϫ/Ϫ mating. although this may be partially attributed to the lack of ceacam in the semen of ceacam -inactivated males, it remains arguable whether in vitro ceacam -enhanced sperm motility may play a significant role after coitus under natural circumstances. the maternal decidua surrounding the implantation site was not removed from the mouse conceptus that was used to prepare the commercially available embryo-stage blot for the observation of gene expression during pregnancy. as a result, the appearance of ceacam mrna in embryo samples at the blastula or gastrula stage ( . - . d.p.c.) (fig. b ) may arise from the maternal decidua as suggested in the study by finkenzeller et al. [ ] . this finding, together with the lack of an rna message in embryo samples at . and . - . d.p.c. (fig. b) implies that ceacam might be weakly expressed if not silent during the entire course of embryonic development. in fact, we were unable to immunodetect the presence of ceacam in the em- (a and b) or the ceacam antibody (c and d) and followed by incubation with rhodamine-conjugated anti-rabbit igg. the slides were observed via light microscopy (a and c) or fluorescence microscopy (b and d). bar ϭ m. c) freshly prepared mouse spermatozoa in modified tyrode solution ( cells/ml) containing . mm cacl were incubated alone (⅙) or in the presence of m ceacam (•) at Њc for to min. cell motility determined at each specified incubation time was expressed as a percentage of control cell motility at time zero. points are mean Ϯ sd for three determinations. *p Ͻ . in a paired statistical comparison with the corresponding control. values were evaluated using one-way analysis of variance. bryo samples obtained by carefully microdissecting the extraembryonic tissues from the conceptus at . to . d.p.c. cytochemical observations shown in figure suggest the presence of ceacam -binding sites on the entire sperm surface. considering the presence of ceacam on human sperm cells [ ] , it raises a possibility that heterophilic adhesion exists between ceacam and other ceacam molecules on the mouse sperm surface. this is unlike the action of other sperm motility effectors in the mouse svs, such as svs vii, which binds neutral phospholipid to enhance sperm motility [ ] , and sva, which predominantly binds membrane phosphatidylcholine to suppress sperm motility [ ] . motility of spermatozoa in hydrosalpingeal and follicular fluid of pigs purification and characterization of a sperm motility-dynein atpase inhibitor from boar seminal plasma. mol reprod purification and characterization of reversible sperm motility inhibitors from porcine seminal plasma low molecular weight components in bovine semen diffusate and their effects on motility of bull sperm effects of seminal vesicle fluid components on sperm motility in the house mouse purification and characterization of the active precursor of a human sperm motility inhibitor secreted by the seminal vesicles: identity with semenogelin the role of the seminal vesicles, coagulating glands and prostate glands on the fertility and fecundity of mice effects of seminal vesicle removal on fertility and uterine sperm motility in the house mouse the androgendependent mouse seminal vesicle secretory protein iv: characterization and complementary deoxyribonucleic acid cloning a novel heat-labile phospholipid-binding protein, svs vii, in mouse seminal vesicle as a sperm motility enhancer developmental profile of a caltrin-like protease inhibitor, p , in mouse seminal vesicle and characterization of its binding sites on sperm surface seminal vesicle autoantigen, a novel phospholipid-binding protein secreted from luminal epithelium of mouse seminal vesicle, exhibits the ability to suppress mouse sperm motility a seminal vesicle autoantigen of mouse is able to suppress sperm capacitation-related events stimulated by serum albumin the cea gene encodes a secreted member of the murine carcinoembryonic antigen family and is expressed in the placenta, gastrointestinal tract and bone marrow enzymatic deglycosylation of asparagine-linked glycans: purification, properties, and specificity of oligosaccharide-cleaving enzymes from flavobacterium meningosepticum measurement of protein using bicinchoninic acid various forms of mouse lactoferrins: purification and characterization cleavage of structural proteins during the assembly of the head of bacteriophage t molecular cloning. cold spring harbor recovery, capacitation, acrosome reaction, and fractionation of sperm choosing among different technical variations of percoll centrifugation for sperm selection motility and fertilizing ability of rat epididymal spermatozoa washed by a continuous gradient of percoll determination of the secondary structures of proteins by circular dichroism and optical rotatory dispersion determination of the helix and beta form of proteins in aqueous solution by circular dichroism circular dichroic analysis of protein conformation: inclusion of the beta-turns analysis of the circular dichroism spectrum of proteins using the convex constraint algorithm: a practical guide sequence differences between glycosylated and non-glycosylated asn-x-thr/ser acceptor sites: implications for protein engineering the carcinoembryonic antigen (cea) family: structures, suggested functions and expression in normal and malignant tissues homophilic adhesion of human ceacam involves n-terminal domain interactions: structural analysis of the binding site crystal structure of murine sceacam a[ , ]: a coronavirus receptor in the cea family redefined nomenclature for members of the carcinoembryonic antigen family zimmermann w. carcinoembryonic antigen-related cell adhesion molecule expressed specifically early in pregnancy in the decidua is dispensable for normal murine development soluble isoforms of ceacam containing the a domain: increased serum levels in patients with obstructive jaundice and differences in -fucosyl-n-acetyl-lactosamine moiety key: cord- - imgztwe authors: frishman, d.; albrecht, m.; blankenburg, h.; bork, p.; harrington, e. d.; hermjakob, h.; juhl jensen, l.; juan, d. a.; lengauer, t.; pagel, p.; schachter, v.; valencia, a. title: protein-protein interactions: analysis and prediction date: - - journal: modern genome annotation doi: . / - - - - _ sha: doc_id: cord_uid: imgztwe proteins represent the tools and appliances of the cell — they assemble into larger structural elements, catalyze the biochemical reactions of metabolism, transmit signals, move cargo across membrane boundaries and carry out many other tasks. for most of these functions proteins cannot act in isolation but require close cooperation with other proteins to accomplish their task. often, this collaborative action implies physical interaction of the proteins involved. accordingly, experimental detection, in silico prediction and computational analysis of protein-protein interactions (ppi) have attracted great attention in the quest for discovering functional links among proteins and deciphering the complex networks of the cell. proteins represent the tools and appliances of the cellthey assemble into larger structural elements, catalyze the biochemical reactions of metabolism, transmit signals, move cargo across membrane boundaries and carry out many other tasks. for most of these functions proteins cannot act in isolation but require close cooperation with other proteins to accomplish their task. often, this collaborative action implies physical interaction of the proteins involved. accordingly, experimental detection, in silico prediction and computational analysis of protein-protein interactions (ppi) have attracted great attention in the quest for discovering functional links among proteins and deciphering the complex networks of the cell. proteins do not simply clump togetherbinding between proteins is a highly specific event involving well defined binding sites. several criteria can be used to further classify interactions (nooren and thornton ) . protein interactions are not mediated by covalent bonds and, from a chemical perspective, they are always reversible. nevertheless, some ppi are so persistent to be considered irreversible (obligatory) for all practical purposes. other interactions are subject to tight regulation and only occur under characteristic conditions. depending on their functional role, some protein interactions remain stable for a long time (e.g. between proteins of the cytoskeleton) while others last only fractions of a second (e.g. binding of kinases to their targets). protein complexes formed by physical binding are not restricted to so called binary interactions which involve exactly two proteins (dimer) but are often found to contain three (trimer), four (tetramer), or more peptide chains. another distinction can be made based on the number of distinct proteins in a complex: homo-oligomers contain multiple copies of the same protein while hetero-oligomers consist of different protein species. sophisticated "molecular machines" like the bacterial flagellum consist of a large number of different proteins linked by protein interactions. the focus of this chapter is on the computational methods for analyzing and predicting protein-protein interactions. nevertheless, some basic knowledge about experimental techniques for detecting these interactions is highly useful for interpreting results, estimating potential biases, and judging the quality of the data we use in our work. many different types of methods have been developed but the vast majority of interactions in the literature and public databases come from only two classes of approaches: co-purification and two-hybrid methods. co-purification methods (rigaut et al. ) are carried out in vitro and involve three basic steps. first, the protein of interest is "captured" from a cell lysatee.g. by attaching it to an immobile matrix. this may be done with specific antibodies, affinity tags, epitope tags along with a matching antibody, or by other means. second, all other proteins in the solution are removed in a washing step in order to purify the captured protein. under suitable conditions, protein-protein interactions are preserved. in the third step, any proteins still attached to the purified protein are detected by suitable methods (e.g. western-blot or mass spectrometry). hence, the interaction partners are co-purified, as the name of the method implies. the two-hybrid technique (fields and song ) uses a very different approachit exploits the fact that transcription factors such as gal consist of two distinct functional domains. the dna-binding domain (bd) recognizes the transcription factor (tf) binding site in the dna and attaches the protein to it while the activation domain (ad) triggers transcription of the gene under the control of the factor. when expressed as separate protein chains, both domains remain fully functional: the bd still binds the dna but lacks a way of triggering transcription. the ad could trigger transcription but has no means of binding to the dna. for a two-hybrid test, two proteins x and y are fused to these domains resulting in two hybrids: x-bd and y-ad. if x binds to y, the resulting protein complex turns out to be a fully functional transcription factor. accordingly, an interaction is revealed by detecting transcription of the reporter gene under the control of the tf. in contrast to co-purifications, the interaction is tested in vivo in the two-hybrid system (usually in yeast, but other systems exist). the above description refers to small-scale experiments testing one pair of proteins at a time, but both approaches have successfully been extended to large-scale experiments testing thousands of pairs in very short time. while such high-throughput data is very valuable, especially for computational biology which often requires comprehensive input data, a word of caution is necessary. even with the greatest care and a maximum of thoughtful controls, high-throughput data usually suffer from a certain degree of false-positive results as well as false-negatives compared to carefully performed and highly optimized individual experiments. the ultimate source of information about protein interactions is provided by high-resolution three-dimensional structures of interaction complexes, such as the one shown in fig. . spatial architectures obtained by x-ray crystallography or nmr spectroscopy provide atomic-level detail of interaction interfaces and allow for mechanistic understanding of interaction processes and their functional implications. additional kinetic, dynamic and structural aspects of protein interactions can be elucidated by electron and atomic force microscopy as well as by fluorescence resonance energy transfer. fig. structural complex between rhoa, a small gtp protein belonging to the ras superfamily, and the catalytic gtpase activating domain of rhogap (graham et al. ) protein interaction databases a huge number of protein-protein interactions has been experimentally determined and described in numerous scientific publications. public protein interaction databases that provide interaction data in form of structured, machine-readable datasets organized according to well documented standards have become invaluable resources for bioinformatics, systems biology and researchers in experimental laboratories. the data in these databases generally originate from two major sources: large-scale datasets and manually curated information extracted from the scientific literature. as pointed out above, the latter is considered substantially more reliable and large bodies of manually curated ppi data are often used as the gold standard against which predictions and large-scale experiments are benchmarked. of course, these reference data are far from complete and strongly biased. many factors, including experimental bias, preferences of the scientific community, and perceived biomedical relevance influence the chance of an interaction to be studied, discovered and published. in the manual annotation process it is not enough to simply record the interaction as such. additional information such as the type of experimental evidence, citations of the source, experimental conditions, and more need to be stored in order to convey a faithful picture of the data. annotation is a highly labor intensive task carried out by specially trained database curators. ppi databases can be roughly divided in two classes: specialized databases focusing on a single organism or a small set of species and general repositories which aim for a comprehensive representation of current knowledge. while the former are often well integrated with other information resources for the same organism, the latter strive for collecting all available interaction data including datasets from specialized resources. the size of these databases is growing constantly as more and more protein interactions are identified. as of writing (november ) , global repositories are approaching , pieces of evidence for protein interactions in various species. all of these databases offer convenient web interfaces that allow for interactively searching the database. in addition, the full datasets are usually provided for download in order to enable researchers to use the data in their own computational analyses. table gives an overview of some important ppi databases. until relatively recently, molecular interaction databases like the ones listed in table acted largely independently from each other. while they provided an extremely valuable service to the community in collecting and curating available molecular interaction data from the literature, they did so largely in an uncoordinated manner. each database had its own curation policy, feature set, and data formats. in , the proteomics standards initiative (psi), a work group of the human proteome organization (hupo), set out to improve this situation, with contributions from a broad range of academic and commercial organizations, among them bind, cellzome, dip, glaxosmithkline, hybrigenics sa, intact, mint, mips, serono, and the universities of bielefeld, bordeaux, and cambridge. in a first step, a community standard for the representation of protein-protein interactions was developed, the psi mi format . (hermjakob et al. ) . recently, version . of the psi mi format has been published , extending the scope of the format from protein-protein interactions to molecular interactions in general, allowing to model for example protein-rna complexes. the psi mi format is a flexible xml format representing the interaction data to a high level of detail. n-ary interactions (complexes) can be represented as well as experimental conditions and technologies, quantitative parameters and interacting domains. the xml format is accompanied by detailed controlled vocabularies in obo format (harris et al. ). these vocabularies are essential for standardizing not only the syntax, but also the semantics of the molecular interaction representation. as an example, the "yeast two-hybrid technology" described above is referred to in the literature using many different synonyms, for example y h, h, "yeast-two-hybrid", etc. while all of these terms refer to the same technology, filtering interaction data from multiple different databases based on this set of terms is not trivial. thus, the psi mi standard provides a set of now more than well-defined terms relevant to molecular interactions. figure shows the intact advanced search tool with a branch of the hierarchical psi mi controlled vocabulary. figure provides a partial graphical representation of the annotated xml schema, combined with an example dataset in psi mi xml format, reprinted from kerrien et al. ( b) . for user-friendly distribution of simplified psi data to end users, the psi mi . standard also defines a simple tabular representation (mitab), derived from the biogrid format (breitkreutz et al. ) . while this format necessarily excludes details of interaction data like interacting domains, it provides a means to efficiently access large numbers of basic binary interaction records. the psi mi format is now widely implemented, with data available from biogrid, dip, hprd, intact, mint, and mips, among others. visualization tools like cytoscape (shannon et al. ) can directly read and visualize psi mi formatted data. comparative and integrative analysis of interaction data from multiple sources has become easier, as has the development of analysis tools which do not need to provide a plethora of input parsers any more. the annotated psi mi xml schema, a list of tools and databases implementing it, as well as further information, are available from http:// www.psidev.info/. however, the development and implementation of a common data format is only one step towards the provision of consistent molecular interaction data to the scientific community. another key step is the coordination of the data curation process itself between different molecular interaction databases. without such synchronization, independent databases will often work on the same publications and insert the data into their systems, according to different curation rules, thus doing redundant work on some publications, while neglecting others. recognizing this issue, the dip, intact, and mint molecular interaction databases are currently synchronizing their curation efforts in the context of the imex consortium (http://imex.sf.net). these databases are now applying the same curation rules to provide a consistent high level of curation quality, and are synchronizing their fields of activity, each focusing on literature curation from a non-overlapping set of scientific journals. for these journals, the databases aim to insert all published interactions into the database shortly after publication. regular data exchange of all newly curated data between imes databases is currently in the implementation phase. to support the systematic representation and capture of relevant molecular interaction data supporting scientific publications, the hupo proteomics standards initiative has recently published "the minimum information required for reporting a molecular interaction experiment (mimix)" , detailing data items considered essential for the authors to provide, as well as a practical guide to efficient deposition of molecular interaction data in imex databases . the imex databases are also collaborating with scientific journals and funding agencies, to increasingly recommend data producers to deposit their data in an imex partner database prior to publication. database deposition prior to publication not only ensures public availability of the data at the time of publication, but also provides important quality control, as database curators often assess the data in much more detail than reviewers. the psi journal collaboration efforts are starting to show first results. nature biotechnology, nature genetics, and proteomics are now recommending that authors deposit molecular interaction data in a relevant public domain database prior to publication, a key step to a better capture of published molecular interaction data in public databases, and to overcome the current fragmentation of molecular interaction data. as an example of a molecular interaction database implementing the psi mi . standard, we will provide a more detailed description of the intact molecular interaction database ), accessible at http://www.ebi.ac.uk/intact. intact is a curated molecular interaction database active since . intact follows a full text curation policy, publications are read in full by the curation team, and all molecular interactions contained in the publication are inserted into the database, containing basic facts like the database accession numbers of the proteins participating in an interaction, but also details like experimental protein modifications, which can have an impact on assessments of confidence in the presence or absence of interactions. each database record is cross-checked by a senior curator for quality control. on release of the record, the corresponding author of the publication is automatically notified (where an email address is available), and requested to check the data provided. any corrections are usually inserted into the next weekly release. while such a detailed, high quality approach is slow and limits coverage, the provision of high quality reference datasets is an essential service both for biological analysis, and for the training and validation of automatic methods for computational prediction of molecular interactions. as it is impossible for any single database, or even the collaborating imex databases, to fully cover all published interactions, curation priorities have to be set. any direct data depositions supporting manuscripts approaching peer review have highest priority. next, for some journals (currently cell, cancer cell, and proteomics) intact curates all molecular interactions published in the journal. finally, several special curation topics are determined in collaboration with external communities or collaborators, where intact provides specialized literature curation and collaborates in the analysis of experimental datasets, for example around a specific protein of interest (camargo et al. ) . as of november , intact contains . binary interactions supported by ca. , publications. the intact interface implements a standard "simple search" box, ideal for search by uniprot protein accession numbers, gene names, species, or pubmed identifiers. the advanced search tool (fig. ) provides field-specific searches as well a specialized search taking into account the hierarchical structure of controlled vocabularies. a default search for the interaction detection method " hybrid" returns , interactions, while a search for " hybrid" with the tickbox "include children" activated returns more than twice that number, , interactions. the hierarchical search automatically includes similarly named methods like "two hybrid pooling approach", but also "gal vp complement". search results are initially shown in a tabular form based on the mitab format, which can also be directly downloaded. each pairwise interaction is only listed once, with all experimental evidence listed in the appropriate columns. the final column provides access to a detailed description of each interaction as well as a graphical representation of the interaction in is interaction neighborhood graph. for interactive, detailed analysis, interaction data can be loaded into tools like cytoscape (see below) via the psi . xml format. all intact data is freely available via the web interface, for download in psi mi tabular or xml format, and computationally accessible via web services. intact software is open source, implemented in java, with hibernate (www.hibernate.org/) for the object-relational mapping to oracle tm or postgres, and freely available under the apache license, version from http://www.ebi.ac.uk/intact. on a global scale, protein-protein interactions participate in the formation of complex biological networks which, to a large extent, represent the paths of communication and metabolism of an organism. these networks can be modeled as graphs making them amenable to a large number of well established techniques of graph theory and social network analysis. even though interaction networks do not directly encode cellular processes nor provide information on dynamics, they do represent a first step towards a description of cellular processes, which is ultimately dynamic in nature. for instance, protein-interaction networks may provide useful information on the dynamics of complex assembly or signaling. in general, investigating the topology of protein interaction, metabolic, signaling, and transcriptional networks allows researchers to reveal the fundamental principles of molecular organization of the cell and to interpret genome data in the context of large-scale experiments. such analyses have become an integral part of the genome annotation process: annotating genomes today increasingly means annotating networks. a protein-protein interaction network summarizes the existence of both stable and transient associations between proteins as an (undirected) graph: each protein is represented as a node (or vertex), an edge between two proteins denotes the existence of an interaction. interactions known to occur in the actual cell ( fig. a ) can thus be represented as an abstract graph of interaction capabilities (fig. b ). as such a graph is limited by definition to binary interactions, its construction from a database of molecular interactions may involve arbitrary choices. for instance, an n-ary interaction measured by co-purification can be represented using either the clique (all binary interactions between the n proteins are retained) or the spoke model (only edges connecting the "captured" protein to co-purified proteins are retained). once a network has been reconstructed from protein interaction data, a variety of statistics on network topology can be computed, such as the distribution of vertex degrees, the distribution of the clustering coefficient and other notions of density, the distribution of shortest path length between vertex pairs, or the distribution of network motifs occurrences (see for a review). these measures can be used to describe networks in a concise manner, to compare, group or contrast different networks, and to identify properties characteristic of a network or a class of network under study. some topological properties may be interpreted as traces of underlying biological mechanisms, shedding light on their dynamics, their evolution, or both and helping connect structure to function (see the "network modules" section below). for instance, most interaction networks seem to exhibit scale-free topology (jeong et al. ; yook et al. ) , i.e. their degree distribution (the probability that a node has exactly k links) approximates a power law p(k) $ k -g , meaning that most proteins have few interaction partners but some, the so-called "hubs", have many. as an example of derived evolutionary insight, it is easy to show that networks evolving by growth (addition of new nodes) and preferential attachment (new nodes are more likely to be connected to nodes with more connections) will exhibit scale-free topology (degree distribution approximates a power-law) and hubs (highly connected nodes). a simple model of interaction network evolution by gene duplication, where a duplicate initially keeps the same interaction partners as the original, generates preferential attachment, thus providing a candidate explanation for the scale-free nature and the existence of hubs in these networks . interacting proteins are denoted as p , p , etc. (b) a graph representation of the protein interactions shown in a. each node represents a protein, and each edge connects proteins that interact. (c) information on protein interactions obtained by different methods. (d) protein interaction network derived from experimental evidence shown in c. as in a, each node is a protein, and edges connect interactors. edges a colored according to the source of evidence: red - d, green -apms, brown -y h, magenta -prof, yellow -lit, blue -loc a corresponding functional interpretation of hubs and scale-free topology has been proposed in terms of robustness. scale-free networks are robust to component failure, as random failures are likely to affect low degree nodes and only failures affecting hub nodes will significantly change the number of connected components and the length of shortest paths between node pairs. deletion analyses have, perhaps unsurprisingly, confirmed that highly connected proteins are more likely to be essential (winzeler et al. ; giaever et al. ; gerdes et al. ) . most biological interpretations that have been proposed for purely topological properties of interaction networks have been the subject of heated controversies, some of which remain unsolved to this day (e.g. (he and zhang ; yu et al. ) on hubs). one often cited objection to any strong interpretation is the fact that networks reconstructed from high-throughput interaction data constitute very rough approximations of the "real" network of interactions taking place within the cell. as illustrated in fig. c , interaction data used in a reconstruction typically result from several experimental methods, often complemented with prediction schemes. each specific method can miss real interactions (false negatives) and incorrectly identify other interactions (false positives), resulting in biases that are clearly technology-dependent (gavin et al. ; legrain and selig ) . assessing false-negative and false-positive rates is difficult since there is no gold standard for positive interactions (protein pairs that are known to interact) or, more importantly, for negative interactions (protein pairs that are known not to interact). using less-than-ideal benchmark interaction sets, estimates of - % false positives and - % false negatives have been proposed for yeast-two-hybrid and copurification based techniques (aloy and russell ) . in particular, a comparison of several high-throughput interaction datasets on yeast, showing low overlap, has confirmed that each study covers only a small percentage of the underlying interaction network (von mering et al. ) (see also "estimates of the number of protein interactions" below). integration of interaction data from heterogeneous sources towards interaction network reconstruction can help compensate for these limitations. the basic principle is fairly simple and rests implicitly on a multigraph representation: several interaction networks to be integrated, each resulting from a specific experimental or predictive method, are defined over the same set of proteins. integration is achieved by merging them into a single network with several types of linksor edge colors-each drawn from one of the component networks. some edges in the multigraph may be incorrect, while some existing interactions may be missing from the multigraph, but interactions confirmed independently by several methods can be considered reliable. figure d shows the multigraph that corresponds to the evidence from fig. c and can be used to reconstruct the actual graph in fig. b . in practice, integration is not always straightforward: networks are usually defined over subsets of the entire gene or protein complement of a species, and meaningful integration requires that the overlap of these subsets be sufficiently large. in addition, if differences of reliability between network types are to be taken into account, an integrated reliability scoring scheme needs to be designed (jansen et al. ; von mering et al. ) with the corresponding pitfalls and level of arbitrariness involved in comparing apples and oranges. existing methods can significantly reduce false positive rates on a subset of the network, yielding a subnetwork of highreliability interactions. the tremendous amounts of available molecular interaction data raise the important issue of how to visualize them in a biologically meaningful way. a variety of tools have been developed to address this problem; two prominent examples are visant (hu et al. ) and cytoscape (shannon et al. ) . a recent review of further network visualization tools is provided by suderman and hallett ( ) . in this section, we focus on cytoscape (http://www.cytoscape.org) and demonstrate its use for the investigation of protein-protein interaction networks. for a more extensive protocol on the usage of cytoscape, see (cline et al. ) . cytoscape is a stand-alone java application that is available for all major computer platforms. this software provides functionalities for (i) generating biological networks, either manually or by importing interaction data from various sources, (ii) filtering interactions, (iii) displaying networks using graph layout algorithms, (iv) integrating and displaying additional information like gene expression data, and (v) performing analyses on networks, for instance, by calculating topological network properties or by identifying functional modules. one advantage of cytoscape over alternative visualization software applications is that cytoscape is released under the open-source lesser general public license (lgpl). this license basically permits all forms of software usage and thus helps to build a large user and developer community. third-party java developers can easily enhance the functionality of cytoscape by implementing own plug-ins, which are additional software modules that can be readily integrated into the cytoscape platform. currently, there are more than forty plug-ins publicly available, with functionalities ranging from interaction retrieval and integration across topological network analysis, detection of network motifs, protein complexes, and domain interactions, to visualization of subcellular protein localization and bipartite networks. a selection of popular cytoscape plug-ins is listed in table . in the following, we will describe the functionalities of cytoscape in greater detail. the initial step of generating a network can be accomplished in different ways. first, the user can import interaction data that are stored in various flat file or xml formats such as biopax, sbml, or psi-mi, as described above. second, the user can directly retrieve interactions from several public repositories from within cytoscape. a number table ). third, the user can utilize a text-mining plug-in that builds networks based on associations found in publication abstracts (agilent literature search; table ). while these associations are not as reliable as experimentally derived interactions, they can be helpful when the user is investigating species that are not well covered yet in the current data repositories. fourth, the user can directly create or manipulate a network by manually adding or removing nodes (genes, proteins, domains, etc.) and edges (interactions or relationships). in this way, expert knowledge that is not captured in the available data sets can be incorporated into the loaded network. generated networks can be further refined by applying selections and filters in cytoscape. the user can select nodes or edges by simply clicking on them or framing a selection area. in addition, starting with at least one selected node, the user can incrementally enlarge the selection to include all direct neighbor nodes. cytoscape also provides even sophisticated search and filter functionality for selecting particular nodes and edges in a network based on different properties; in particular, the enhanced search plug-in (table ) improves the built-in search functionality of cytoscape. filters select all network parts that match certain criteria, for instance, all human proteins or all interactions that have been detected using the yeast two-hybrid system. once a selection has been made, all selected parts can be removed from the network or added to another network. the main purpose of visualization tools like cytoscape is the presentation of biological networks in an appropriate manner. this can usually be accomplished by applying graph layout algorithms. sophisticated layouts can assist the user in revealing specific network characteristics such as hub proteins or functionally related protein clusters. cytoscape offers various layout algorithms, which can be categorized as circular, hierarchical, spring-embedded (or force-directed), and attribute-based layouts (fig. ). further layouts can be included using the cytoscape plug-in architecture, for example, to arrange protein nodes according to their subcellular localization or to their pathways assignments (bubblerouter, cerebral; table ). some layouts may be more effective than others for representing molecular networks of a certain type. the spring-embedded layout, for instance, has the effect of exposing the inherent network structure, thus identifying hub proteins and clusters of tightly connected nodes. it is noteworthy that current network visualization techniques have limitations, for example, when displaying extremely large or dense networks. in such cases, a simple graphical network representation with one node for each interaction partner, as it is initially created by cytoscape, can obfuscate the actual network organization due to the sheer number of nodes and edges. one potential solution to this problem is the introduction of meta-nodes (metanode plug-in; table ). a meta-node combines and replaces a group of other nodes. meta-nodes can be collapsed to increase clarity of the visualization and expanded to increase the level of detail (fig. ). an overview of established and novel visualization techniques for biological networks on different scales is presented in (hu et al. ). all layouts generated by cytoscape are zoomable, enabling the user to increase or decrease the magnification, and they can be further customized by aligning, scaling, or rotating selected network parts. additionally, the user can define the graphical network representation through visual styles. these styles define the colors, sizes, and shapes of all network parts. a powerful feature of cytoscape is its ability of visually mapping additional attribute values onto network representations. both nodes and edges can have arbitrary attributes, for example, protein function names, the number of interactions (node degree), expression values, the strength and type of an interaction, or confidence values for interaction reliability. these attributes can be used to adapt the network illustration by dynamically changing the visual styles of individual network parts (fig. ) . for example, this feature enables highlighting trustworthy interactions by assigning (table ). all protein nodes with subcellular localizations different from plasma membrane are combined into meta-nodes. these meta-nodes can be collapsed or expanded to increase clarity or detailedness, respectively different line styles or sizes to different experiment types (discrete mapping of an edge attribute), to spot network hubs by changing the size of a node according to its degree (discrete or continuous mapping of a node attribute), or to identify functional network patterns by coloring protein nodes with a color gradient according to their expression level (continuous mapping of a node attribute). hence, it is possible to simultaneously visualize different data types by overlaying them with a network model. in order to generate new biological hypotheses and to gain insights into molecular mechanisms, it is important to identify relevant network characteristics and patterns. for this purpose, the straightforward approach is the visual exploration of the network. table lists a selection of cytoscape plug-ins that assist the user in this analysis task, for instance, by identifying putative complexes (mcode), by grouping proteins that show a similar expression profile (jactivemodules), or by identifying overrepresented go terms (bingo, golorize). however, the inclusion of complex data such as time-series results or diverse gene ontology (go) terms into the network visualization might not be feasible without further software support. particularly in case of huge, highly connected, or dynamic networks, more advanced visualization techniques will be required in the future. fig. visual representation of a subset of the gal network in yeast. the protein nodes are colored with a red-to-green gradient according to their expression value; green represents the lowest, red the highest value, and blue a missing value. the node size indicates the number of interactions (node degree); the larger a node, the higher is its degree. the colors and styles of the edges represent different interaction types; solid black lines represent protein-protein, dashed red lines protein-dna interactions in addition to the visual presentation of interaction networks, cytoscape can also be used to perform statistical analyses. for instance, the networkanalyzer plug-in (assenov et al. ) computes a large variety of topology parameters for all types of networks. the computed simple and complex topology parameters are represented as single values and distributions, respectively. examples of simple parameters are the number of nodes and edges, the average number of neighbors, the network diameter and radius, the clustering coefficient, and the characteristic path length. complex parameters are distributions of node degrees, neighborhood connectivities, average clustering coefficients, and shortest path lengths. these computed statistical results can be exported in textual or graphical form and are additionally stored as node attributes. the user can then apply the calculated attributes to select certain network parts or to map them onto the visual representation of the analyzed network as described above (fig. ) . it is also possible to fit a power law to the node degree distribution, which can frequently indicate a so-called scale-free network with few highly connected nodes (hubs) and many other nodes with a small number of interactions. scale-free networks are especially robust against failures of randomly selected nodes, but quite vulnerable to defects of hubs (albert ) . how many ppis exist in a living cell? the yeast genome encodes approximately gene products which means that the maximal possible number of interacting protein pairs in this organism is close to million, but what part of these potential interactions are actually realized in nature? for a given experimental method, such as the two-hybrid essay, the estimate of the total number of interactions in the cell is given by where n measured is the number of interactions identified in the experiment, and r fp and r fn are false positive and false negative rates of the method. r fn can be roughly estimated based on the number of interactions known with confidence (e.g., those confirmed by three-dimensional structures) that are being recovered by the method. assessing r fp is much more difficult because no experimental information on proteins that do not interact is currently available. since it is known that proteins belonging to the same functional class often interact, one very indirect way of calculating r fn is as the fraction of functionally related proteins not found to be interacting. an even more monumental problem is the estimation of the total number of unique structurally equivalent interaction types existing in nature. an interaction type is defined as a particular mutual orientation of two specific interacting domains. in some cases homologous proteins interact in a significantly different fashion while in other cases proteins lacking sequence similarity engage in interactions of the same type. in general, however, interacting protein pairs sharing a high degree of sequence similarity ( - % or higher) between their respective components almost always form structurally similar complexes (aloy et al. ) . this observation allows utilization of available atomic resolution structures of complexes for building useful models of closely related binary complexes. the total number of interaction types can then be estimated as follows: where the interaction similarity multiplier c reflects the clustering of all interactions of the same type, and e all-species extrapolates from one biological species to all organisms. aloy and russel ( ) derived an estimate for c by grouping interactions between proteins that share high sequence similarity, as discussed above. c depends on the number of paralogous sequences encoded in a given genome. for small prokaryotic organisms it is close to while for larger and more redundant genomes it adopts smaller values, typically in the range of . - . . the multiplier for all species e allspecies can be derived by assessing what fraction of known protein families is encoded in a given genome. based on the currently available data this factor is close to for bacteria, which means that a medium size prokaryotic organism contains around one tenth of all protein families. for eukaryotic organisms e all-species lies between and . for the comprehensive two-hybrid screen of yeast by (uetz ) in which interactions between proteins were identified, aloy and russell ( ) estimated c, r fp, and r fn , and e all-species to be . , . , . , and . respectively, leading to an estimated different interaction types in yeast alone, and over all species. based on the two-hybrid interaction map of the fly (giot ) the number of all interaction types in nature is estimated to be . it is thus reasonable to expect the total number of interaction types to be around , , and only are currently known. beyond binary interactions, proteins often form large molecular complexes involving multiple subunits (fig. ) . these complexes are much more than a random snapshot of a group of interacting proteinsthey represent large functional entities which remain stable for long periods of time. many such protein complexes have been elucidated step by step over time and recent advances in high-throughput technology have led to largescale studies revealing numerous new protein complexes. the preferred technology for this kind of experiment is initial co-purification of the complexes followed by the identification of the member proteins by mass spectrometry. as the bakers yeast s. cerevisiae is one of the most versatile model organisms used in molecular biology, it is not surprising that the first large-scale complex datasets were obtained in this species (gavin et al. ; ho et al. ; gavin et al. ; krogan et al. ). the yeast protein interaction database mpact (guldener et al. ) provides access to protein complexes based on careful literature annotation composed of different proteins plus over complexes from large-scale experiments which contain more than distinct proteins. these numbers contain some redundancy with respect to complexes, due to slightly different complex composition found by different groups or experiments. nevertheless, the dataset covers about % of the s.cerevisiae proteome. while many complexes comprise only a small number of different proteins, the largest of them features an impressive different protein species. a novel manually annotated database, corum (ruepp et al. ) contains literature-derived information about mammalian multi-protein complexes. over % of all complexes contain between three and six subunits, while the largest molecular structure, the spliceosome, consists of components (fig. ). modularity has emerged as one of the major organizational principles of cellular processes. functional modules are defined as molecular ensembles with an autonomous function (hartwell et al. ) . proteins or genes can be partitioned into modules based on shared patterns of regulation or expression, involvement in a common metabolic or regulatory pathway, or membership in the same protein complex or subcellular structure. modular representation and analysis of cellular processes allows for inter- pretation of genome data beyond single gene behavior. in particular, analysis of modules provides a convenient framework for studying the evolution of living systems (snel and huynen ) . multiprotein complexes represent one particular type of functional modules in which individual components engage in physical interactions to execute a specific cellular function. algorithmically, modular architectures can be defined as densely interconnected groups of nodes on biological networks (for an excellent review of available methods see (sharan et al. ). statistically significant functional subnetworks are characterized by a high degree of local clustering. the density of a cluster can be represented as a function q(m,n) = m/(n(n À )), where m is the number of interactions between the n nodes of the cluster (spirin and mirny ) . q thus takes values between for a set of unconnected nodes and for a fully connected cluster (clique). the statistical significance of q strongly depends on the size of the graph. it is obvious that random clusters with q ¼ involving just three proteins are very likely while large clusters with q ¼ or even with values below . are extremely unlikely. in order to compute the statistical significance of a cluster with n nodes and m connections spirin and mirny calculate the expected number of such clusters in a comparable random graph and then estimate the likelihood of having m or more interactions within a given set of n proteins given the number of interactions that each of these proteins has. significant dense clusters identified by this procedure on a graph of protein interactions were found to correspond to functional modules most of which are involved in transcription regulation, cell-cycle/ cell-fate control, rna processing, and protein transport. however, not all of them constitute physical protein complexes and, in general, it is not possible to predict whether a given module corresponds to a multiprotein complex or just to a group of functionally coupled proteins involved in the same cellular process. the search for significant subgraphs can be further enhanced by considering evolutionary conservation of protein interactions. with this approach protein complexes are predicted from binary interaction data by network alignment which involves comparing interaction graphs between several species (sharan et al. ) . first, proteins are grouped by sequence similarity such that each group contains one protein from each species, and each protein is similar to at least one other protein in the group. then a composite interaction network is created by joining with edges those pairs of groups that are linked by at least one conserved interaction. again, dense clusters on such network alignment graph are often indicative of multiprotein complexes. an alternative computational method for deriving complexes from noisy large-scale interaction data relies on a "socio-affinity" index which essentially reflects the frequency with which proteins form partnerships detected by co-purification (gavin et al. ) . this index was shown to correlate well with available three-dimensional structure data, dissociation constants of protein-protein interactions, and binary interactions identified by the two-hybrid techniques. by applying a clustering procedure to a matrix containing the values of the socio-affinity index for all yeast protein pairs found to associate by affinity purification, complexes were predicted, with over a half of them being novel and previously unknown. however, dependent on the analysis parameters distinct complex variants (isoforms) are found that differ from in terms of their subunit composition. those proteins present in most of the isoforms of a given complex constitute its core while variable components present only in a small number of isoforms can be considered "attachments" (fig. ) . furthermore, some stable, typically smaller protein groups can be found in multiple attachments in which case they are fig. definitions of complex cores, attachments, and modules. redrawn and modified with permission from (gavin et al. ) called "modules". stable functional modules can thus be flexibly used in the cell in a variety of functional contexts. proteins frequently associated with each other in complex cores and modules are likely to be co-expressed and co-localized. in this section, we offer a computational perspective on utilizing protein network data for molecular medical research. the identification of novel therapeutic targets for diseases and the development of drugs has always been a difficult, time-consuming and expensive venture (ruffner et al. ) . recent work has charted the current pharmacological space using different networks of drugs and their protein targets (paolini et al. ; keiser et al. ; kuhn et al. ; yildirim et al. ) based on biochemical relationships like ligand binding energy and molecular similarity or on shared disease association. above all, since many diseases are due to the malfunctioning of proteins, the systematic determination and exploration of the human interactome and homologous protein networks of model organisms can provide considerable new insight into pathophysiological processes (giallourakis et al. ) . knowledge of protein interactions can frequently improve the understanding of relevant molecular pathways and the interplay of various proteins in complex diseases (fishman and porter ) . this approach may result in the discovery of a considerable number of novel drug targets for the biopharmaceutical industry, possibly affording the development of multi-target combination therapeutics. observed perturbations of protein networks may also offer a refined molecular description of the etiology and progression of disease in contrast to phenotypic categorization of patients (loscalzo et al. ). molecular network data may help to improve the ability of cataloging disease unequivocally and to further individualize diagnosis, prognosis, prevention, and therapy. this will require a network-based approach that does not only include protein interactions to differentiate pathophenotypes, but also other types of molecular interactions as found in signaling cascades and metabolic pathways. furthermore, environmental factors like pathogens interacting with the human host or the effects of nutrition need to be taken into account. after large-scale screens identified enormous amounts of protein interactions in organisms like yeast, fly, and worm (goll and uetz ) , which also serve as model systems for studying many human disease mechanisms (giallourakis et al. ) , experimental techniques and computational prediction methods have recently been applied to generate sizable networks of human proteins (cusick et al. ; stelzl and wanker ; assenov et al. ; ram ırez et al. ). in addition, comprehensive maps of protein interactions inside pathogens and between pathogens and the human host have been compiled for bacteria like e. coli, h. pylori, c. jejuni, and other species (noirot and noirot-gros ) , for many viruses such as herpes viruses, the epstein-chapter . : protein-protein interactions: analysis and prediction barr virus, the sars coronavirus, hiv- , the hepatitis c virus, and others (uetz et al. ) , and for the malaria parasite p. falciparum (table ) . those extensive network maps can now be explored to identify potential drug targets and to block or manipulate important protein-protein interactions. furthermore, different experimental methods are also used to expand the known interaction networks around pathway-centric proteins like epidermal growth factor receptors (egfrs) (tewari et al. ; oda et al. ; jones et al. ) , smad and transforming growth factor-b (tgfb) (colland and daviet ; tewari et al. ; barrios-rodiles et al. ) , and tumor necrosis factor-a (tnfa) and the transcription factor nf-kb (bouwmeester et al. ). all of these proteins are involved in sophisticated signal transduction cascades implicated in various important disease indications ranging from cancer to inflammation. the immune system and toll-like receptor (tlr) pathways were the subject of other detailed studies (oda and kitano ) . apart from that, protein networks for longevity were assembled to research ageing-related effects (xue et al. ). high-throughput screens are also conducted for specific disease proteins causative of closely related clinical and pathological phenotypes to unveil molecular interconnections between the diseases. for example, similar neurodegenerative disease phenotypes are caused by polyglutamine proteins like huntingtin and over twenty ataxins. although they that are not evolutionarily related and their expression is not restricted to the brain, they are responsible for inherited neurotoxicity and age-dependent dementia only in specific neuron populations (ralser et al. ) . yeast two-hybrid screens revealed an unexpectedly dense interaction network of those disease proteins forming interconnected subnetworks (fig. ) , which suggests common pathways affected in disease (goehler et al. ; lim et al. ) . some of the protein-protein interactions may be involved in mediating neurodegeneration and thus may be tractable for drug inhibition, and several interaction partners of ataxins could additionally be shown to be potential disease modifiers in a fly model (kaltenbach et al. ) . a number of methodological approaches concentrate on deriving correlations between common topological properties and biological function from subnetworks around proteins that are associated with a particular disease phenotype like cancer. recent studies report that human disease-associated proteins with similar clinical and pathological features tend to be more highly connected among each other than with other proteins and to have more similar transcription profiles xu and li ; goh et al. ). this observation points to the existence of disease-associated functional modules. interestingly, in contrast to disease genes, essential genes whose defect may be lethal early on in life are frequently found to be hubs central to the network. further work focused on specific disease-relevant networks. for instance, to analyze experimental asthma, differentially expressed genes were mapped onto a protein interaction network ). here, highly connected nodes tended to have smaller expression changes than peripheral nodes. this agrees with the general notion that disease-causing genes are typically not central in the network. similarly, a comprehensive protein network analysis of systemic inflammation in human subjects investigated blood leukocyte gene expression patterns when receiving an inflammatory stimulus, a bacterial endotoxin, to identify functional modules perturbed in response to this stimulus (calvano et al. ) . topological criteria and gene expression data were also used to search protein networks for functional modules that are relevant to type diabetes mellitus or to different types of cancer (jonsson and bates ; cui et al. ; lin et al. ; pujana et al. ). moreover, it was recently demonstrated that the integration of gene expression profiles with subnetworks of interacting proteins can lead to improved prognostic markers for breast cancer outcome that are more reproducible between patient cohorts than sets of individual genes selected without network information (chuang et al. ). in drug discovery, protein networks can help to design selective inhibitors of protein-protein interactions which target specific interactions of a protein, but do not affect others (wells and mcclendon ) . for example, a highly connected protein (hub) may be a suitable target for an antibiotic whereas a more peripheral protein with few interaction partners may be more appropriate for a highly specific drug that needs to avoid side effects. thus, topological network criteria are not only useful for characterizing disease proteins, but also for finding drug targets. the diversity of interactions of a targeted protein could also help in predicting potential side effects of a drug. apart from that, it is remarkable that some potential drugs have been found to be less effective than expected due to the intrinsic robustness of living systems against perturbations of molecular interactions (kitano ) . furthermore, mutations in proteins cause genetic diseases, but it is not always easy to distinguish protein interactions impaired by mutated binding sites from other disease causes like structural instability induced by amino acid mutations. nowadays many genome-wide association and linkage studies for human diseases suggest genomic loci and linkage intervals that contain candidate genes encoding snps and mutations of potential disease proteins (kann ) . since the resultant list of candidates frequently contain dozens or even hundreds of genes, computational approaches have been developed to prioritize them for further analyses and experiments. in the following, we will demonstrate the variety of available prioritization approaches by explicating three recent methods that utilize protein interaction data in addition to the inclusion of other sequence and function information. all methods capitalize on the above described observation that closely interacting gene products often underlie polygenic diseases and similar pathophenotypes (oti and brunner ) . using protein-protein interaction data annotated with reliability values, lage et al. ( ) first predict human protein complexes for each candidate protein. they then score the pairwise phenotypic similarity of the candidate disease with all proteins within each complex that are associated with any disease. the scoring function basically measures the overlap of the respective disease phenotypes as recorded in text entries of omim (online mendelian inheritance in man) (hamosh et al. ) based on the vocabulary of umls (unified medical language system) (bodenreider ) . lastly, all candidates are prioritized by the probability returned by a bayesian predictor trained on the interaction data and phenotypic similarity. therefore, this method depends on the premise that the phenotypic effects caused by any disease-affected member in a predicted protein complex are very similar to each other. another prioritization approach by franke et al. ( ) does not make use of overlapping disease phenotypes and primarily aims at connecting physically disjoint genomic loci associated with the same disease using molecular networks. at the beginning, their method prioritizer performs a bayesian integration of three different network types of gene/protein relationships. the latter are derived from functional similarity using gene ontology annotation, microarray coexpression, and proteinprotein interaction. this results in a probabilistic human network of general functional links between genes. prioritizer then assesses which candidate genes contained in different disease loci are closely connected in this gene-gene network. to this end, the score of each candidate is initially set to zero, but it is increased iteratively during network exploration by a scoring function that depends on the network distance of the respective candidate gene to candidates inside another genomic loci. this procedure finally yields separate prioritization lists of ranked candidate genes for each genomic loci. in contrast to the integrated gene-gene network used by prioritizer, the endeavour system (aerts et al. ) directly compares candidate genes with known disease genes and creates different ranking lists of all candidates using various sources of evidence for annotated relationships between genes or proteins. the evidence can be derived from literature mining, functional associations based on gene ontology annotations, co-occurrence of transcriptional motifs, correlation of expression data, sequence similarity, common protein domains, shared metabolic pathway membership, and protein-protein interactions. at the end, endeavour merges the resultant ranking lists using order statistics and computes an overall prioritization list of all candidate genes. finally, it is important to keep in mind that current datasets of human protein interactions may still contain a significant number of false interactions and thus biological and medical conclusions derived from them should always be taken with a note of caution, in particular, if no good confidence measures are available. a comprehensive atlas of protein interactions is fundamental for a better understanding of the overall dynamic functioning of the living organisms. these insights arise from the integration of functional information, dynamic data and protein interaction networks. in order to fulfill the goal of enlarging our view of the protein interaction network, several approaches must be combined and a crosstalk must be established among experimental and computational methods. this has become clear from comparative evaluations which show similar performances for both types of methodologies. in fact, over the recent years this field has grown into one of the most appealing fields in bioinformatics. evolutionary signals result from restrictions imposed by the need to optimize the features that affect a given interaction and the nature of these features can differ from interaction to interaction. consequently, a number of different methods have been developed based a range of different evolutionary signals. this section is devoted to a brief review of some of these methods. these techniques are based on the similarity of absence/presence profiles of interacting proteins. in its original formulation (gaasterland and ragan ; huynen and bork ; pellegrini et al. ; marcotte et al. a ) the phylogenetic profiles were codified as / vectors for each reference protein according to the absence/presence of proteins of the studied family in a set of fully sequenced organisms (see fig. a ). the vectors for different reference sequences are compared by using the hamming distance (pellegrini et al. ) between vectors. this measure counts the number of differences between two binary vectors. the rationale for this method is that both interacting proteins must be present in an organism and that reductive evolution will remove unpaired proteins in the rest of the organisms. proposed improvements include the inclusion of quantitative measures of sequence divergence (marcotte et al. b; date and marcotte ) and the ability to deal with biases in the taxonomic distribution of the organisms used (date and marcotte ; barker and pagel ) . these biases are due to the intuitive fact that evolutionarily similar organisms will share a higher number of protein and genomic features (in this case presence/absence of an orthologue). to reduce this problem, date et al. used mutual information from sequence divergent profiles for measuring the amount of information shared by both vectors. mutual information is calculated as: miðp ; p Þ ¼ hðp Þ þ hðp Þ À hðp ; p Þ; where hðp Þ ¼ p pðp Þ ln pðp Þ is the marginal entropy of the probability distribution of protein p sequence distances and hðp ; p Þ ¼ À p p pðp ; p Þ ln pðp ; p Þ is the joint entropy of the probability distributions of both protein p and p sequence distances. the corresponding probabilities are calculated from the whole distribution of orthologue distances for the organisms. in this way, the most likely evolutionary distances between orthologues from a pair of organisms will produce smaller entropies and consequently smaller values of mutual information. this formulation should implicitly reduce the effect of taxonomic biases. in an interesting work, published recently by barker et al. ( ) , the authors showed that detection of correlated gene-gain/gene-loss events improves the predictions by reducing the number of false positives due to taxonomic biases. the phylogenetic profiling approach has been shown to be quite powerful, because its simple formulation has allowed the exploration of a number of alternative interdependencies between proteins. this is the case for enzyme "displacement" in metabolic pathways detected as anti-correlated profiles (morett et al. ) , and for complex dependence relations among triplets of proteins (bowers et al. ) . phylogenetic profiles have also been correlated with bacterial traits to predict the genes related to particular phenotypes (korbel et al. ) . the main drawbacks of these methods are the difficulty of dealing with essential proteins (where there is no absence information) and the requirement for the genomes under study to be complete (to establish the absence of a family member). fig. prediction of protein interactions based on genomic and sequence features. information coming from the set of close homologs of the proteins p and p from the organism in other organisms can be used to predict an interaction between these proteins. (a) phylogenetic profiling. presence/absence of a homolog of both proteins in different organisms is coded as the corresponding two / profiles (most simple approach) and an interaction is predicted for very similar profiles. (b) similarity of phylogenetic trees. multiple sequence alignments are built for both sets of proteins and phylogenetic trees are derived from the proteins with a possible partner present in its organism. proteins with highly similar trees are predicted to interact. (c) gene neighbourhood conservation. genome closeness is checked for those genes coding for both sets of homologous proteins. interaction is predicted if gene pairs are recurrently close to each other in a number of organisms. (d) gene fusion. finding the proteins containing different sequence regions homologous to each of the two proteins is used to predict an interaction between them similarity in the topology of phylogenetic trees of interacting proteins has been qualitatively observed in a number of cases (fryxell ; pages et al. ; goh et al. ) . the extension of this observation to a quantitative method for the prediction of protein interactions requires measuring the correlation between the similarity matrices of the explored pairs of protein families (goh et al. ) . this formulation allows systematic evaluation of the validity of using the original observation as a signal of protein interaction (pazos and valencia ) . the general protocol for these methods is illustrated in fig. b . it includes the building of the multiple sequence alignment for the set of orthologues (one per organism) related to every query sequence, the calculation of all protein pair evolutionary distances (derived from the corresponding phylogenetic trees) and finally the comparison of evolutionary distance matrices of pairs of query proteins using pearsons correlation coefficient. protein pairs with highly correlated distance matrices are predicted to be more likely to interact. although this signal has been shown to be significant, the underlying process responsible for this similarity is still controversial (chen and dokholyan ) . there are two main hypotheses for explaining this phenomenon. the first hypothesis suggests that this evolutionary similarity comes from the mutual adaptation (co-evolution) of interacting proteins and the need to retain interaction features while sequences diverge. the second hypothesis implicates external factors. in this scenario, the restrictions imposed by evolution on the functional process implicating both proteins would be responsible for the parallelism of their phylogenetic trees. although the relative importance of both factors is still not clear, the predictive power of similarities in phylogenetic trees is not affected. indeed, a number of developments have improved the original formulation (pazos et al. ; sato et al. ). the first advance involved managing the intrinsic similarity of the trees because of the common underlying taxonomic distribution (due to the speciation processes). this effect is analogous to the taxonomic biases discussed above. in these cases, the approach followed was to correct both trees by removing this common trend. for example, pazos et al. subtracted the distances of the s rrna phylogenetic tree to the corresponding distances for each protein tree. the correlations for the resulting distance matrices were used to predict protein interactions. additionally some analyses have focused on the selection of the sequence regions used for the tree building (jothi et al. ; kann et al. ) . for example, it has been shown that interacting regions, both defined as interacting residues (using structural data) and as the sequence domain involved in the interaction, show more clear tree similarities than the whole proteins (mintseris and weng ; jothi et al. ) . other interesting work showed that prediction performance can be improved by removing poorly conserved sequence regions ). finally, in a very recent work (juan et al. ) the authors have suggested a new method for removing noise in the detection of tree similarity signals and detecting different levels of evolutionary parallelism specificity. this method introduces the new strategy of using the global network of protein evolutionary similarity for a better calibration of the evolutionary parallelism between two proteins. for this purpose, they define a protein co-evolutionary profile as the vector containing the evolutionary correlations between a given protein tree and all the rest of the protein trees derived from sequences in the same organism. this co-evolutionary profile is a more robust and comparable representation of the evolution of a given protein (it involves hundreds of distances) and can be used to deploy a new level of evolutionary comparison. the authors compare these co-evolutionary profiles by calculating pearsons correlation coefficient for each pair. in this way, the method detects pairs of proteins for which high evolutionary similarities are supported by their similarities with the rest of proteins of the organism. this approach significantly improves the predictive performance of the tree similaritybased methods so that different degrees of co-evolutionary specificity are obtained according to the number of proteins that might be influencing the co-evolution of the studied pair. this is done by extending the approach of sato et al. ( ) , that uses partial correlations and a reduced set of proteins for determining specific evolutionary similarities. juan et al. calculated the partial correlation for each significant evolutionary similarity with respect to the remaining proteins in the organism and defined levels of co-evolutionary specificity according to the number of proteins that are considered to be co-evolving with each studied protein pair. with this strategy, its possible to detect a range of evolutionary parallelisms from the protein pairs (for very specific similarities) up to subsets of proteins (for more relaxed specificities) that are highly evolution dependent. interestingly, if specificity requirements are relaxed, protein relationships among components of macro-molecular complexes and proteins involved in the same metabolic process can be recovered. this can be considered as a first step in the application of higher orders of evolutionary parallelisms to decode the evolutionary impositions over the protein interaction network. this method exploits the well-known tendency of bacterial organisms to organize proteins involved in the same biochemical process by clustering them in the genome. this observation is obviously related to the operon concept and the mechanisms for the coordination of transcription regulation of the genes present in these modules. these mechanisms are widespread among bacterial genomes. therefore the significance of a given gene proximity can be established by its conservation in evolutionary distant species (dandekar et al. ; overbeek et al. ) . the availability of fully sequenced organisms makes computing the intergenic distances between each pair of genes easy. genes with the same direction of transcrip-tion and closer than bases are typically considered to be in the same genomic context (see fig. c ). the conservation of this closeness must be found in more than two highly divergent organisms to be considered significant because of the taxonomic biases. while this signal is strong in bacterial genomes, its relevance is unclear in eukaryotic genomes. this is the main drawback of these methodologies. in fact, this signal only can be exploited for eukaryotic organisms by extrapolating genomic closeness of bacterial genes to their homologues in eukaryotes. obviously, this extrapolation leads to a considerable reduction in the confidence and number of obtained predictions for this evolutionary lineage. however, conserved gene pairs that are transcribed from a shared bidirectional promoter can be detected by similar methods and can found in eukaryotes as well as prokaryotes (korbel et al. ) a further use of evolutionary signals in protein function and physical interaction prediction has been the tendency of interacting proteins to be involved in gene fusion events. sequences that appear as independently expressed orfs in one organism become fused as part of the same polypeptide sequence in another organism. these fusions are strong indicators of functional and structural interaction that have been suggested to increase the effective concentration of interacting functional domains (enright et al. ; marcotte et al. b ). this hypothesis proposes that gene fusion could remove the effect of diffusion and relative correct orientation of the proteins forming the original complex. these fusion events are typically detected when sequence searches for two nonhomologous proteins obtain a significant hit in the same sequence. cases matching to the same region of the hit sequence are removed (these cases are schematically represented in fig. d ). in spite of the strength of this signal, gene fusion seems to not be a habitual event in bacterial organisms. the difficulty of distinguishing protein interactions belonging to large evolutionary families is the main drawback of the automatic application of these methodologies. integration of experimentally determined and predicted interactions as described above, there are many both experimental techniques and computational methods for determining and predicting interactions. to obtain the most comprehensive interaction networks possible, as many as possible of these sources of interactions should be integrated. the integration of these resources is complicated by the fact that the different sources are not all equally reliable, and it is thus important to quantify the accuracy of the different evidence supporting an interaction. in addition to the quality issues, comparison of different interaction sets is further complicated by the different nature of the datasets: yeast two-hybrid experiments are inherently binary, whereas pull-down experiments tend to report larger complexes. to allow for comparisons, complexes are typically represented by binary interaction networks; however, it is important to realize that there is not a single, clear definition of a "binary interaction". for complex pull-down experiments, two different representations have been proposed: the matrix representation, in which each complex is represented by the set of binary interactions corresponding to all pairs of proteins from the complex, and the spoke representation, in which only bait-prey interactions are included (von mering et al. ) . the binary interactions obtained using either of these representations are somewhat artificial as some interacting proteins might in reality never touch each other and others might have too low an affinity to interact except in the context of the entire complex bringing them together. even in the case of yeast two-hybrid assays, which inherently report binary interactions, not all interactions correspond to direct physical interactions. the database string ("search tool for the retrieval of interacting genes/ proteins") (von mering et al. ) represents an effort to provide many of the different types of evidence for functional interactions under one common framework with an integrated scoring scheme. such an integrated approach offers several unique advantages: ) various types of evidence are mapped onto a single, stable set of proteins, thereby facilitating comparative analysis; ) known and predicted interactions often partially complement each other, leading to increased coverage; and ) an integrated scoring scheme can provide higher confidence when independent evidence types agree. in addition to the many associations imported from the protein interaction databases mentioned above (bader et al. ; salwinski et al. ; guldener et al. ; mishra et al. ; stark et al. ; chatr-aryamontri et al. ), string also includes interactions from curated pathway databases (vastrik et al. ; kanehisa et al. ) and a large body of predicted associations that are produced de novo using many of the methods described in this chapter (dandekar et al. ; gaasterland and ragan ; pellegrini et al. ; marcotte et al. c) . these different types of evidence are obviously not directly comparable, and even for the individual types of evidence the reliability may vary. to address these two issues, string uses a two-stage approach. first, a separate scoring scheme is used for each evidence type to rank the interactions according to their reliability; these raw quality scores cannot be compared between different evidence types. second, the ranked interaction lists are benchmarked against a common reference to obtain probabilistic scores, which can subsequently be combined across evidence types. to exemplify how raw quality scores work, we will here explain the scoring scheme used for physical protein interactions from high-throughput screens. the two funda-mentally different types of experimental interaction data sets, complex pull-downs and binary interactions are evaluated using separate scoring schemes. for the binary interaction experiments, e.g. yeast two-hybrid, the reliability of an interaction correlates well with the number of non-shared interaction partners for each interactor. string summarizes this in the following raw quality score: logððn þ Þ Á ðn þ ÞÞ; where n and n are the numbers of non-shared interaction partners. this score is similar to the ig measure suggested by saito et al. ( ) . in the case of complex pulldown experiments, the reliability of the inferred binary interactions correlates better with the number of times the interactors were co-purified compared to what would be expected at random: where n is the number of purifications containing both proteins, n and n are the numbers of purifications containing either protein or , and n is the total number of purifications. for this purpose, the bait protein was counted twice to account for bait-prey interactions being more reliable than prey-prey interactions. these raw quality scores are calculated for each individual high-throughput screen. scores vary within one dataset, because they include additional, intrinsic information from the data itself, such as the frequency with which an interaction is detected. for medium sized data sets that are not large enough to apply the topology based scoring schemes, the same raw score is assigned to all interactions within a dataset. finally, very small data sets are pooled and considered jointly as a single interaction set. we similarly have different scoring schemes for predicted interactions based on coexpression in microarray expression studies, conserved gene neighborhood, gene fusion events and phylogenetic profiles. based on these raw quality scores, a confidence score is assigned to each predicted association by benchmarking the performance of the predictions against a common reference set of trusted, true associations. string uses as reference the functional grouping of proteins maintained at kegg (kyoto encyclopedia of genes and genomes (kanehisa et al. ) . any predicted association for which both proteins are assigned to the same "kegg pathway" is counted as a true positive. kegg pathways are particularly suitable as a reference because they are based on manual curation, are available for a number of organisms, and cover several functional areas. other benchmark sets could also be used, for example "biological process" terms from gene ontology (ashburner et al. ) or reactome pathways (vastrik et al. ). the benchmarked confidence scores in string generally correspond to the probability of finding the linked proteins within the same pathway or biological process. the assignment of probabilistic scores for all evidence types solves many of the issues of data integration. first, incomparable evidence types are made comparable by assigning a score that represents how well the evidence type can predict a certain type of interactions (the type being specified by the reference set used). second, the separate benchmarking of interactions from, for example, different high-throughput protein interaction screens accounts for any differences in reliability between different studies. third, use of raw quality scores allows us to separate more reliable interactions from less reliable interactions even within a single dataset. the probabilistic nature of the scores also makes it easy to calculate the combined reliability of an interaction given multiple lines of evidence. it is computed under the assumption of independence for the various sources, in a na€ ıve bayesian fashion. in addition to having a good scoring scheme, it is crucial to make the evidence for an interaction transparent to the end users. to achieve this, the string interaction network is made available via a user-friendly web interface (http://string.embl.de). when performing a query, the user will first be presented with a network view, which provides a first, simplified overview (fig. ) . from here the user has full control over parameters such as the number of proteins shown in the network (nodes) and the minimal reliability required for an interaction (edge) to be displayed. from the network, the user also has the ability to drill down on the evidence that underlies any given interaction using the dedicated viewer for each evidence type. for example, it is possible to inspect the publications that support a given interaction, the set of protein that were fig. protein interaction network of the core cell-cycle regulation in human. the network was constructed by querying the string database (von mering et al. ) for very high confidence interactions (conf. score > . ) between four cyclin-dependent kinases, their associated cyclins, the wee kinase and the cdc phosphatases. the network correctly recapitulates cdc interacts with cyclin-a/b, cdk with cyclin-a/e, and cdk / with cyclin-d. it also shows that the wee and cdc phosphatases regulate cdc and cdk but not cdk and cdk . moreover, the network suggests that cdc a phosphatase regulates cdc and cdk , whereas cdc b and cdc c specifically regulate cdc co-purified in a particular experiment and the phylogenetic profiles or genomic context based on which an interaction was predicted. protein binding is commonly characterized by specific interactions of evolutionarily conserved domains (pawson and nash ) . domains are fundamental units of protein structure and function , which are incorporated into different proteins by genetic duplications and rearrangements (vogel et al. ) . globular domains are defined as structural units of fifty and more amino acids that usually fold independently of the remaining polypeptide chain to form stable, compact structures (orengo and thornton ) . they often carry important functional sites and determine the specificity of protein interactions (fig. ) . essential information on fig. exemplary interaction between the two human proteins hhr b and ataxin- . each protein domain commonly adopts a particular d structure and may fulfill a specific molecular function. generally, the domains responsible for an observed protein-protein interaction need to be determined before further functional characterizations are possible. in the depicted protein-protein interaction, it is known from experiments that the ubiquitin-like domain ubl of hhr b (yellow) forms a complex with de-ubiquitinating josephin domain of ataxin- (blue) (nicastro et al. ) the cellular function of specific protein interactions and complexes can often be gained from the known functions of the interacting protein domains. domains may contain binding sites for proteins and ligands such as metabolites, dna/rna, and drug-like molecules (xia et al. ) . widely spread domains that mediate molecular interactions can be found alone or combined in conjunction with other domains and intrinsically disordered, mainly unstructured, protein regions connecting globular domains (dunker et al. ) . according to apic et al. ( ) multi-domain proteins constitute two thirds of unicellular and % of metazoan proteomes. one and the same domain can occur in different proteins, and many domains of different types are frequently found in the same amino acid chain. much effort is being invested in discovering, annotating, and classifying protein domains both from the functional (pfam (finn et al. ) , smart (letunic et al. ), cdd (marchler-bauer et al. , interpro (mulder et al. ) and structural (scop (andreeva et al. ) , cath (greene et al. )) perspective. notably, it may be confusing that the term domain is commonly used in two slightly different meanings. in the context of domain databases such as pfam and smart, a domain is basically defined by a set of homologous sequence regions, which constitute a domain family. in contrast, a specific protein may contain one or more domains, which are concrete sequence regions within its amino acid sequence corresponding to autonomously folding units. domain families are commonly represented by hidden markov models (hmms), and highly sensitive search tools like hmmer (eddy ) are used to identify domains in protein sequences. different sources of information about interacting domains with experimental evidence are available. experimentally determined interactions of single-domain proteins indicate domain-domain interactions. similarly, experiments using protein fragments help identifying interaction domains, but this knowledge is frequently hidden in the text of publications and not contained in any database. however, domain databases like pfam, smart, and interpro may contain some annotation obtained by manual literature curation. in the near future, high-throughput screening techniques will result in even larger amounts of protein fragment interaction data to delineate domain borders and interacting protein regions (colland and daviet ) . above all, three-dimensional structures of protein domain complexes are experimentally solved by x-ray crystallography or nmr and are deposited in the pdb database (berman et al. ) . structural contacts between two interacting proteins can be derived by mapping sequence positions of domains onto pdb structures. extensive investigations of domain combinations in proteins of known structures (apic et al. ) as well as of structurally resolved homo-or heterotypic domain interactions (park et al. ) revealed that the overlap between intra-and intermolecular domain interactions is rather limited. two databases, ipfam (finn et al. ) and did (stein et al. ) , provide pre-computed structural information about protein interactions at the level of pfam domains. analysis of structural complexes suggests that interactions between a given pair of proteins may be mediated by different domain pairs in different situations and in different organisms. nevertheless, many domain interactions, especially those involved in basic cellular processes such as dna metabolism and nucleotide binding, tend to be evolutionarily conserved within a wide range of species from prokaryotes to eukaryotes (itzhaki et al. ) . in yeast, pfam domain pairs are associated with over % of experimentally known protein interactions, but only . % of them are covered by ipfam (schuster-bockler and bateman ) . domain interactions can be inferred from experimental data on protein interactions by identifying those domain pairs that are significantly overrepresented in interacting proteins compared to random protein pairs (deng et al. ; ng et al. a; riley et al. ; sprinzak and margalit ) (fig. ) . however, the predictive power of such an approach is strongly dependent on the quality of the data used as the source of information for protein interactions, and the coverage of protein sequences in terms of domain assignments. basically, the likelihood of two domains, d i and d j , to interact can be estimated as the fraction of protein pairs known to interact among all proteins in the dataset containing this domain pair. this basic idea has been improved upon by using a maximum-likelihood (ml) approach based on the expectation-maximization (em) algorithm. this method finds the maximum likelihood estimator of the observed protein-protein interactions by an iterative cycle of computing the expected likelihood (e-step) and maximizing the unobserved parameters (domain interaction propensities) in the m-step. when the algorithm converges (i.e. the total likelihood cannot be further improved by the algorithm), the ml estimate for the likelihood of the unobserved domain interactions is found (deng et al. ; riley et al. ). riley and colleagues further improved this method by excluding each potentially interacting domain pair from the dataset and recomputing the ml-estimate to obtain an additional confidence value for the respective domain-domain interaction. this domain pair exclusion (dpea) method measures the contribution of each domain pair to the overall likelihood of the protein interaction network based on domain-domain interactions. in particular, this approach enables the prediction of specific domain-domain interactions between selected proteins which would have been missed by the basic ml method. another ml-based algorithm is insite which takes differences in the reliability of the protein-protein interaction data into account (wang et al. a) . it also integrates external evidence such as functional annotation or domain fusion events. an alternative method for deriving domain interactions is through co-evolutionary analysis that exploits the notion that mutations of residue pairs at the interaction interfaces are correlated to preserve favorable physico-chemical properties of the binding surface (jothi et al. ) . the pair of domains mediating interactions between two proteins p and p may therefore be expected to display a higher similarity of their phylogenetic trees than other, non-interacting domains (fig. ) . the degree of agreement between the evolutionary history of two domains, d i and d j , can be computed by the pearsons correlation coefficient r ij between the similarity matrices of the domain sequences in different organisms: where n is the number of species, m i pq and m j pq are the evolutionary distances between species, and m i and m j are the mean values of the matrices, respectively. in figure the evolutionary tree of the domain d is most similar to those of d and d , corroborating the actual binding region. a well-known limitation of the correlated mutation analysis is that it is very difficult to decide whether residue co-variation happens as a result of functional co-evolution directed at preserving interaction sites, or because of sequence divergence due to speciation. to address this problem, suggested to distinguish the relative contribution of conserved and more variable regions in aligned sequences to the co-evolution signal based on the hypothesis that functional co-evolution is more prominent in conserved regions. finally, interacting domains can be identified by phylogenetic profiling, as described above for full-chain proteins. as in the case of complete protein chains, the similarity of evolutionary patterns shared by two domains may indicate that they interact with each other directly or at least share a common functional role (pagel et al. ) . as illustrated in fig. , clustering protein domains with similar phylogenetic profiles allows researchers to build domain interaction networks which provide clues for describing molecular complexes. similarly, the domainteam method (pasek et al. ) considers chromosomal neighborhoods at the level of conserved domain groups. a number of resources provide and combine experimentally derived and predicted domain interaction data. interdom (http://interdom.i r.a-star.edu.sg/) integrates domain-interaction predictions based on known protein interactions and complexes with domain fusion events (ng et al. b) . dima (http://mips.gsf.de/genre/proj/dima ) is another database of domain interactions, which integrates experimentally demon- fig. co-evolutionary analysis of domain interactions. two orthologous proteins from different organisms known to interact with each other are shown. the first protein consists of two domains, d and d , while the second protein includes the domains d , d , d , and d . evolutionary trees for each domain are shown, their similarity serves as an indication of interaction likelihood that is encoded in the interaction matrix strated domain interactions from ipfam and did with predictions based on the dpea algorithm and phylogenetic domain profiling ). recently, two new comprehensive resources, domine (http://domine.utdallas.edu) (raghavachari et al. ) and dasmi (http://www.dasmi.de) (blankenburg et al. , submitted) , were introduced and are available online. these resources contain ipfam and did data and predicted domain interactions taken from several other publications. predictions are based on several methods for deriving domain interactions from protein interaction data, phylogenetic domain profiling data and domain coevolution. with the availability of an increasing number of predictions the task of method weighting and quality assessment becomes crucial. a thorough analysis of the quality of domain interaction data can be found in schlicker et al. ( ) . beyond domain-domain contacts, an alternative mechanism of mediating molecular recognition is through binding of protein domains to short sequence regions (santonico et al. ) , typically from three to eight residues in length (zarrinpar et al. ; neduva et al. ) . such linear recognition motifs can be discovered from protein interaction data by identifying amino acid sequence patterns overrepresented in proteins that do not possess significant sequence similarity, but share the same interacting partner (yaffe ) . web services like eml (http://elm.eu.org (puntervoll et al. ) ), support the identification of linear motifs in protein sequences. as described above, specific adapter domains can mediate protein-protein interactions. while some of these interaction domains recognize small target peptides, others are involved in domain-domain interactions. as short binding motifs have a rather high probability of being found by chance and the exact mechanisms of binding specificity for this mode of interaction are not understood completely, predictions of proteinprotein interactions based on binding domains is currently limited to domain-domain interactions for which reliable data is available. predicting ppis from domain interactions may simply be achieved by reversing the ideas discussed above, that is, by using the domain composition of proteins to evaluate the interaction likelihood of proteins (bock and gough ; sprinzak and margalit ; wojcik and schachter ) . in a naive approach, domain interactions are treated as independent, and all protein pairs with a matching pair of interacting domains are predicted to engage in an interaction. given that protein interactions may also be mediated by several domain interactions simultaneously, more advanced statistical methods take into account dependencies between domains and exploit domain combinations (han et al. ) and multiple interacting domain pairs (chen and liu ) . exercising and validating these prediction approaches revealed that the most influential factor for ppi prediction is the quality of the underlying data. this suggests that, as for most biological predictions in other fields, the future of prediction methods for protein and domain interactions may lie in the integration of different sources of evidence and weighting the individual contributions based on calibration to goldstandard data. further methodological improvements may include the explicit consideration of cooperative domains, that is, domain pairs that jointly interact with other domains (wang et al. b ). basic interactions between two or up to a few biomolecules are the basic elements of the complex molecular interaction networks that enable the processes of life and, when thrown out of their intended equilibrium, manifest the molecular basis of diseases. such interactions are at the basis of the formation of metabolic, regulatory or signal transduction pathways. furthermore the search for drugs boils down to analyzing the interactions between the drug molecule and the molecular target to which it binds, which is often a protein. for the analysis of a single molecular interaction, we do not need complex biological screening data. thus it is not surprising that the analysis of the interactions between two molecules, one of them being a protein, has the longest tradition in computational biology of all problems involving molecular interactions, dating back over three decades. the basis for such analysis is the knowledge of the three-dimensional structure of the involved molecules. to date, such knowledge is based almost exclusively on experimental measurements, such as x-ray diffraction data or nmr spectra. there are also a few reported cases in which the analysis of molecular interactions based on structural models of protein has led to successes. the analysis of the interaction of two molecules based on their three-dimensional structure is called molecular docking. the input is composed of the three-dimensional structures of the participating molecules. (if the involved molecule is very flexible one admissible structure is provided.) the output consists of the three-dimensional structure of the molecular complex formed by the two molecules binding to each other. furthermore, usually an estimate of the differential free energy of binding is given, that is, the energy difference dg between the bound and the unbound conformation. for the binding event to be favorable that difference has to be negative. this slight misnomer describes the binding between a protein molecule and a small molecule. the small molecule can be a natural substrate such as a metabolite or a molecule to be designed to bind tightly to the protein such as a drug molecule. proteinligand docking is the most relevant version of the docking problem because it is a useful help in searching for new drugs. also, the problem lends itself especially well to computational analysis, because in pharmaceutical applications one is looking for small molecules that are binding very tightly to the target protein, and that do so in a conformation that is also a low-energy conformation in the unbound state. thus, subtle energy differences between competing ligands or binding modes are not of prime interest. for these reasons there is a developed commercial market for protein-ligand docking software. usually the small molecule has a molecular weight of up to several hundred daltons and can be quite flexible. typically, the small molecule is given by its d structure formula, e.g., in the form of a smiles string (weininger ) . if a starting d conformation is needed there is special software for generating such a conformation (see, e.g. (pearlman ; sadowski et al. ) ). challenges of the protein ligand problem are (i) finding the correct conformation of the usually highly flexible ligand in the binding site of the protein, (ii) determining the subtle conformational changes in the binding site of the protein upon binding of the ligand, which are termed induced fit, (iii) producing an accurate estimate of the differential energy of binding or at least ranking different conformations of the same ligand and conformations of different ligands correctly by their differential energy of binding. methods tackling problem (ii) can also be used to rectify smaller errors in structural models of proteins whose structure has not been resolved experimentally. the solution of problem (iii) provides the essential selection criterion for preferred ligands and binding modes, namely those with lowest differential energy of binding. challenge (i) has basically been conquered in the last decade as a number of docking programs have been developed that can efficiently sample the conformational space of the ligand and produce correct binding modes of the ligand within the protein, assuming that the protein is given in the correct structure for binding the ligand. several methods are applied here. the most brute-force method is to just try different (rigid) conformations of the ligand one after the other. if the program is fast enough one can run through a sizeable number of conformations per ligand (mcgann et al. ) . a more algorithmic and quite successful method is to build up the ligand from its molecular fragments inside the binding pocket of the protein (rarey et al. ). yet another class of methods sample ligand conformations inside the protein binding pocket by methods such as local search heuristics, monte carlo sampling or genetic algorithms (abagyan et al. ; jones et al. ; morris et al. ). there are also programs exercising combinations of different methods (friesner et al. ). the reported methods usually can compute the binding mode of a ligand inside a protein within fractions of a minute to several minutes. the resulting programs can be applied to screening through large databases of ligands involving hundreds of thousands to millions of compounds and are routinely used in pharmaceutical industry in the early stages of drug design and selection. they are also repeatedly compared on benchmark datasets (kellenberger et al. ; englebienne et al. ). more complex methods from computational biophysics, such as molecular dynamics (md) simulations that compute a trajectory of the molecular movement based on the forces exerted on the molecules take hours on a single problem instance and can only be used for final refinement of the complex. challenges (ii) and (iii) have not been solved yet. concerning problem (ii), structural changes in the protein can involve redirections of side chains in or close to the binding pocket and more substantial changes involving backbone movement. while recently methods have been developed to optimize side-chain placement upon ligand binding (claußen et al. ; sherman et al. ) , the problem of finding the correct structural change upon binding involving backbone and side-chain movement is open (carlson ) . concerning problem (iii), there are no scoring functions to date that are able to sufficiently accurately estimate the differential energy of binding on a diverse set of protein-ligand complexes huang and zou ) . this is especially unfortunate as an inaccurate estimate of the binding energy causes the docking program to disregard correct complex structures even though they have been sampled by the docking program because they are labeled with incorrect energies. this is the major problem in docking which limits the accuracy of the predictions. recent reviews on protein-ligand docking have been published in sousa et al. ( ) and rarey et al. ( ) . one restriction with protein-ligand docking as it applies to drug design and selection is that the three-dimensional structure of the target protein needs to be known. many pharmaceutical targets are membrane-standing proteins for which we do not have the three-dimensional structure. for such proteins there is a version of drug screening that can be viewed as the negative imprint of docking: instead of docking the drug candidate into the binding site of the proteinwhich is not availablewe superpose the drug candidate (which is here called the test molecule) onto another small molecule which is known to bind to the binding site of the protein. such a molecule can be the natural substrate for the target protein or another drug targeting that protein. let us call this small molecule the reference molecule. the suitability of the new drug candidate is then assessed on the basis of its structural and chemical similarity with the reference molecule. one problem is that now both the test molecule and the reference molecule can be highly flexible. but in many cases largely rigid reference molecules can be found, and in other cases it suffices to superpose the test moelcule onto any low-energy conformation of the reference molecule. there are several classes of drug screening programs based on this molecular comparison, ranging from (i) programs that perform a detailed analysis of the three-dimensional structures of the molecules to be compared (e.g. (lemmen et al. ; kr€ amer et al. ) ) across (ii) programs that perform a topological analysis of the two molecules (rarey and dixon ; gillet et al. ) to (iii) programs that represent both molecules by binary or numerical property vectors which are compared with string methods (mcgregor and muskal ; xue et al. ) . the first class of programs require fractions of seconds to fractions of a minute for a single comparison, the second can perform hundreds comparisons per second, the third up to several ten thousand comparisons per second. reviews of methods for drug screening based on ligand comparison are given in (lengauer et al. ; k€ amper et al. ). here both binding partners are proteins. since drugs tend to be small molecules this version of the docking problem is not of prime interest in drug design. also, the energy balance of protein-protein binding is much more involved that for protein-ligand binding. optimal binding modes tend not to form troughs in the energy landscape that are as pronounced as for protein-ligand docking. the binding mode is determined by subtle side-chain rearrangements of both binding partners that implement the induced fit along typically quite large binding interfaces. the energy balance is dominated by difficult to analyze entropic terms involving the desolvation of water within the binding interface. for these reasons, the software landscape for protein-protein docking is not as well developed as for protein-ligand docking and there is no commercial market for protein-protein docking software. protein-protein docking approaches are based either on conformational sampling and mdwhich can naturally incorporated molecular flexibility but suffers from very high computing demandsor on combinatorial sampling with both proteins considered rigid in which case handling of protein flexibility has to be incorporated with methodical extensions. for space reasons we do not detail methods for protein-protein docking. a recent review on the subject can be found in hildebrandt et al. ( ) . a variant of protein-protein docking is protein-dna docking. this problem shares with protein-protein docking the character that both binding partners are macromolecules. however, entropic aspects of the energy balance are even more dominant in protein-dna docking than in protein-protein docking. furthermore dna can assume nonstandard shapes when binding to proteins which deviate much more from the known double helix than we are used to when considering induced fit phenomena. icm-a method for protein modeling and design: applications to docking and structure prediction from the distorted native conformation gene prioritization through genomic data fusion scale-free networks in cell biology the relationship between sequence and interaction divergence in proteins ten thousand interactions for the molecular biologist structural systems biology: modelling protein interactions scop database in : refinements integrate structure and sequence family data domain combinations in archaeal, eubacterial and eukaryotic proteomes gene ontology: tool for the unification of biology. the gene ontology consortium computing topological parameters of biological networks bind: the biomolecular interaction network database network biology: understanding the cells functional organization constrained models of evolution lead to improved prediction of functional linkage from correlated gain and loss of genes predicting functional gene links from phylogenetic-statistical analyses of whole genomes high-throughput mapping of a dynamic signaling network in mammalian cells the worldwide protein data bank (wwpdb): ensuring a single, uniform archive of pdb data predicting protein-protein interactions from primary structure the unified medical language system (umls): integrating biomedical terminology superti-furga g ( ) a physical and functional map of the human tnf-alpha/ nf-kappa b signal transduction pathway use of logic relationships to decipher protein network organization the grid: the general repository for interaction datasets interaction network containing conserved and essential protein complexes in escherichia coli epstein-barr virus and virus human protein interaction maps a network-based analysis of systemic inflammation in humans disrupted in schizophrenia interactome: evidence for the close connectivity of risk genes and a potential synaptic basis for schizophrenia protein flexibility is an important component of structure-based drug discovery mint: the molecular interaction database on evaluating molecular-docking methods for pose prediction and enrichment factors prediction of protein-protein interactions using random decision forest framework the coordinated evolution of yeast proteins is constrained by functional modularity network-based classification of breast cancer metastasis flexe: efficient molecular docking considering protein structure variations integration of biological networks and gene expression data using cytoscape integrating a functional proteomic approach into the target discovery process identification of the helicobacter pylori anti-sigma factor a map of human cancer signaling interactome: gateway into systems biology conservation of gene order: a fingerprint of proteins that physically interact discovery of uncharacterized cellular systems by genome-wide analysis of functional linkages inferring domain-domain interactions from protein-protein interactions flexible nets. the roles of intrinsic disorder in protein interaction networks profile hidden markov models evaluation of docking programs for predicting binding of golgi alpha-mannosidase ii inhibitors: a comparison with crystallography protein interaction maps for complete genomes based on gene fusion events a novel genetic system to detect protein-protein interactions ipfam: visualization of protein-protein interactions in pdb at domain and amino acid resolutions pfam: clans, web tools and services pharmaceuticals: a new grammar for drug discovery a genomic approach of the hepatitis c virus generates a protein interaction map reconstruction of a functional human gene network, with an application for prioritizing positional candidate genes glide: a new approach for rapid, accurate docking and scoring. . method and assessment of docking accuracy the coevolution of gene family trees microbial genescapes: phyletic and functional patterns of orf distribution among prokaryotes analysis of the human protein interactome and comparison with yeast, worm and fly interaction datasets proteome survey reveals modularity of the yeast cell machinery functional organization of the yeast proteome by systematic analysis of protein complexes experimental determination and system level analysis of essential genes in escherichia coli mg disease gene discovery through integrative genomics similarity searching using reduced graphs a protein interaction map of drosophila melanogaster a protein interaction network links git , an enhancer of huntingtin aggregation, to huntingtons disease co-evolution of proteins with their interaction partners analyzing protein interaction networks the human disease network mgf( )(-) as a transition state analog of phosphoryl transfer the cath domain structure database: new protocols and classification levels give a more comprehensive resource for exploring evolution mpact: the mips protein interaction resource on yeast online mendelian inheritance in man (omim), a knowledgebase of human genes and genetic disorders prespi: a domain combination based prediction system for protein-protein interaction the gene ontology (go) database and informatics resource from molecular to modular cell biology why do hubs tend to be essential in protein networks? the hupo psis molecular interaction format -a community standard for the representation of protein interaction data modeling protein-protein and protein-dna docking systematic identification of protein complexes in saccharomyces cerevisiae by mass spectrometry towards zoomable multidimensional maps of the cell visant: data-integrating visual framework for biological networks and modules an iterative knowledge-based scoring function to predict protein-ligand interactions: ii. validation of the scoring function measuring genome evolution evolutionary conservation of domain-domain interactions a bayesian networks approach for predicting protein-protein interactions from genomic data lethality and centrality in protein networks development and validation of a genetic algorithm for flexible docking a quantitative protein interaction network for the erbb receptors using protein microarrays global topological features of cancer proteins in the human interactome co-evolutionary analysis of domains in interacting proteins reveals insights into domain-domain interactions mediating protein-protein interactions high-confidence prediction of global interactomes based on genome-wide coevolutionary networks huntingtin interacting proteins are genetic modifiers of neurodegeneration lead identification by virtual screaning kegg for linking genomes to life and the environment protein interactions and disease: computational approaches to uncover the etiology of diseases predicting protein domain interactions from coevolution of conserved regions relating protein pharmacology by ligand chemistry comparative evaluation of eight docking tools for docking and virtual screening accuracy intact-open source resource for molecular interaction data broadening the horizon -level . of the hupo-psi format for molecular interactions a robustness-based approach to systems-oriented drug design systematic association of genes to phenotypes by genome and literature mining analysis of genomic context: prediction of functional associations from conserved bidirectionally transcribed gene pairs fast d molecular superposition and similarity search in databases of flexible molecules stitch: interaction networks of chemicals and proteins a protein interaction network of the malaria parasite plasmodium falciparum a human phenome-interactome network of protein complexes implicated in genetic disorders genome-wide protein interaction maps using two-hybrid systems flexs: a method for fast flexible ligand superposition novel technologies for virtual screening smart : domains in the context of genomes and networks a protein-protein interaction network for human inherited ataxias and disorders of purkinje cell degeneration a multidimensional analysis of genes mutated in breast and colorectal cancers network-based analysis of affected biological processes in type diabetes models human disease classification in the postgenomic era: a complex systems approach to human pathobiology hubs in biological interaction networks exhibit low changes in expression in experimental asthma cdd: a conserved domain database for interactive domain family analysis detecting protein function and protein-protein interactions from genome sequences a combined algorithm for genome-wide prediction of protein function a combined algorithm for genome-wide prediction of protein function gaussian docking functions pharmacophore fingerprinting. . application to qsar and focused library design structure, function, and evolution of transient and obligate protein-protein interactions human protein reference database- update systematic discovery of analogous enzymes in thiamin biosynthesis automated docking using a lamarckian genetic algorithm and an empirical binding free energy function new developments in the interpro database systematic discovery of new recognition peptides mediating protein interaction networks integrative approach for computationally inferring protein domain interactions interdom: a database of putative interacting protein domains for validating predicted protein interactions and complexes the solution structure of the josephin domain of ataxin- : structural determinants for molecular recognition protein interaction networks in bacteria diversity of protein-protein interactions a comprehensive map of the toll-like receptor signaling network a comprehensive pathway map of epidermal growth factor receptor signaling submit your interaction data the imex way: a step by step guide to trouble-free deposition the minimum information required for reporting a molecular interaction experiment (mimix) protein families and their evolution-a structural perspective the modular nature of genetic diseases use of contiguity on the chromosome to predict functional coupling a database and tool, im browser, for exploring and integrating emerging gene and protein interaction data for drosophila the mips mammalian protein-protein interaction database dima . predicted and known domain interactions a domain interaction map based on phylogenetic profiling species-specificity of the cohesin-dockerin interaction between clostridium thermocellum and clostridium cellulolyticum: prediction of specificity determinants of the dockerin domain global mapping of pharmacological space mapping protein family interactions: intramolecular and intermolecular protein family interaction repertoires in the pdb and yeast a proteome-wide protein interaction map for campylobacter jejuni identification of genomic features using microsyntenies of domains: domain teams assembly of cell regulatory systems through protein interaction domains assessing protein co-evolution in the context of the tree of life assists in the prediction of the interactome similarity of phylogenetic trees as indicator of protein-protein interaction rapid generation of high quality approximate -dimension molecular structures assigning protein functions by comparative genome analysis: protein phylogenetic profiles network modeling links breast cancer susceptibility and centrosome dysfunction elm server: a new resource for investigating short functional sites in modular eukaryotic proteins domine: a database of protein domain interactions an integrative approach to gain insights into the cellular function of human ataxin- computational analysis of human protein interaction networks docking and scoring for structure-based drug design feature trees: a new molecular similarity measure based on tree matching a fast flexible docking method using an incremental construction algorithm a generic protein purification method for protein complex characterization and proteome exploration inferring protein domain interactions from databases of interacting proteins corum: the comprehensive resource of mammalian protein complexes human protein-protein interaction networks and the value for drug discovery comparison of automatic three-dimensional models builders using x-ray structures interaction generality, a measurement to assess the reliability of a protein-protein interaction the database of interacting proteins: update methods to reveal domain networks partial correlation coefficient between distance matrices as a new indicator of protein-protein interactions the inference of protein-protein interactions by co-evolutionary analysis is improved by excluding the information about the phylogenetic relationships functional evaluation of domain-domain interactions and human protein interaction networks reuse of structural domain-domain interactions in protein networks cytoscape: a software environment for integrated models of biomolecular interaction networks conserved patterns of protein interaction in multiple species network-based prediction of protein function novel procedure for modeling ligand/ receptor induced fit effects quantifying modularity in the evolution of biomolecular systems protein-ligand docking: current status and future challenges protein complexes and functional modules in molecular networks correlated sequence-signatures as markers of protein-protein interaction biogrid: a general repository for interaction datasets did: interacting protein domains of known three-dimensional structure the value of high quality protein-protein interaction networks for systems biology protein-protein interactions: analysis and prediction tools for visually exploring biological networks systematic interactome mapping and genetic perturbation analysis of a c. elegans tgf-b signaling network a comprehensive analysis of protein-protein interactions in saccharomyces cerevisiae herpesviral protein networks and their interaction with the human proteome from orfeomes to protein interaction maps in viruses reactome: a knowledge base of biologic pathways and processes structure, function and evolution of multidomain proteins analysis of intraviral protein-protein interactions of the sars coronavirus orfeome string -recent developments in the integration and prediction of protein interactions comparative assessment of large-scale data sets of protein-protein interactions insite: a computational method for identifying protein-protein interaction binding sites on a proteome-wide scale comparative evaluation of scoring functions for molecular docking analysis on multi-domain cooperation for predicting protein-protein interactions smiles, a chemical language and information system. . introduction and encoding rules reaching for high-hanging fruit in drug discovery at protein-protein interfaces protein-protein interaction map inference using interacting domain profile pairs analyzing cellular biochemistry in terms of molecular networks discovering disease-genes by topological features in human protein-protein interaction network a modular network model of aging evaluation of descriptors and mini-fingerprints for the identification of molecules with similar activity bits" and pieces drug-target network functional and topological characterization of protein interaction networks the importance of bottlenecks in protein networks: correlation with gene essentiality and expression dynamics the structure and function of proline recognition domains protein-protein interactions: analysis and prediction key: cord- -tmpxs og authors: dondapati, srujan kumar; stech, marlitt; zemella, anne; kubick, stefan title: cell-free protein synthesis: a promising option for future drug development date: - - journal: biodrugs doi: . /s - - -y sha: doc_id: cord_uid: tmpxs og proteins are the main source of drug targets and some of them possess therapeutic potential themselves. among them, membrane proteins constitute approximately % of the major drug targets. in the drug discovery pipeline, rapid methods for producing different classes of proteins in a simple manner with high quality are important for structural and functional analysis. cell-free systems are emerging as an attractive alternative for the production of proteins due to their flexible nature without any cell membrane constraints. in a bioproduction context, open systems based on cell lysates derived from different sources, and with batch-to-batch consistency, have acted as a catalyst for cell-free synthesis of target proteins. most importantly, proteins can be processed for downstream applications like purification and functional analysis without the necessity of transfection, selection, and expansion of clones. in the last years, there has been an increased availability of new cell-free lysates derived from multiple organisms, and their use for the synthesis of a diverse range of proteins. despite this progress, major challenges still exist in terms of scalability, cost effectiveness, protein folding, and functionality. in this review, we present an overview of different cell-free systems derived from diverse sources and their application in the production of a wide spectrum of proteins. further, this article discusses some recent progress in cell-free systems derived from chinese hamster ovary and sf lysates containing endogenous translocationally active microsomes for the synthesis of membrane proteins. we particularly highlight the usage of internal ribosomal entry site sequences for more efficient protein production, and also the significance of site-specific incorporation of non-canonical amino acids for labeling applications and creation of antibody drug conjugates using cell-free systems. we also discuss strategies to overcome the major challenges involved in commercializing cell-free platforms from a laboratory level for future drug development. proteins whose functionality is not well characterized form a large percentage of entries in many of the currently available biological databases, including the protein data bank (pdb), and there is a constantly growing demand for reliable and fast synthesis and characterization methods. when it comes to drug discovery, proteins are key components as they can have therapeutic potential themselves (e.g., antibodies, coagulation factors, hormones, growth factors, enzymes, and antimicrobial peptides), but also because they could serve as drug targets for diverse diseases (as ion channels, receptors, enzymes, and transporters, for example) [ ] [ ] [ ] [ ] [ ] [ ] [ ] . a large proportion of approved pharmaceutical drugs target human proteins. beyond that, protein-based therapeutics, such as antibody-drug conjugates, represent a significant percentage of total drug molecules currently approved. they are poised to grow further with increased gene expression technology, improved protein engineering, and refined bioinformatics tools. some proteins are very difficult to express in traditional cell-based systems and this can hamper our ability to define the mechanism of action and structure-function relationship of the individual protein, knowledge which aids the development of drugs targeting these proteins [ ] [ ] [ ] [ ] . generally, to exploit and fine-tune the structural and functional characteristics of a protein, it needs to be expressed and purified with high quality by using recombinant expression technology. traditionally, escherichia coli-based systems were widely used for the production of recombinant proteins due to simplicity in preparation and operation, and cost effectiveness. as a result, broad research and standardization from several years was performed using e. coli-based expression systems, resulting in their often-cited utilization as a state-of-the-art protein expression system [ ] . for complex therapeutic proteins, membrane proteins (mps) originating from humans, and virus-like proteins (vlps), mammalian expression systems fulfill all the requirements like post-translational modifications (ptms), cofactors, and chaperones for correct folding and efficient production. however, batch-tobatch variation in cell culture may be a source of process variation. additionally, overexpression of mps might be toxic for the cultivated cells, resulting in cell death or truncated and misfolded proteins [ ] . ideally, synthesized proteins are functionally folded and exhibit appropriate ptms. due to the lack of extensive research and low yields in recombinant protein expression, many mps are not yet crystallized, thus limiting the computer-aided drug discovery efforts. due to the growing demand for the production of protein biologics and drug discovery targeting proteins, alternative strategies for protein synthesis should be developed. new expression technologies where proteins can be expressed in a simple way and which allow high throughput screening of different reaction conditions, different genes, and different supplements in a cost-effective manner are extremely important for future drug development. in this review, we give an overview of recent advances in cell-free (cf) synthesis platforms and their diverse applications. additionally, we focus on human and therapeutic proteins produced by different types of cf systems and how these cf protein synthesis (cfps) methods can further play a prominent role in future drug development. cfps systems use crude cell extracts prepared from cells of choice by lysis followed by many steps of washing to remove the cell debris and genomic dna [ , ] . these cell extracts can be stored at − °c for years and can be used by thawing just before the reaction. such extracts contain all the principal components necessary for transcription and translation, such as aminoacyl-trna synthetase (aas), ribosomes, and factors necessary for elongation, initiation, and transcription. protein synthesis can be realized by combining cell extracts with necessary substrates like amino acids, energy substrates, dna, cofactors, salts, and nucleotides. depending on the biochemical properties of the protein and its end application, the appropriate cf system can be selected. cfps is a fast protein production system since it does not require transfection or cell culture and lacks cell viability constraints. due to its openness, cfps platforms offer additional advantages when compared with cell-based expression methods. a comparative analysis of cf and cellbased approaches is shown in table . for complex proteins, eukaryotic cf systems are ideal as they contain the endogenous microsomes derived from the endoplasmic reticulum (er), enabling co-translational translocation of proteins and er-based ptms [ , , , ] . there has been a constant improvement in the quality of lysate preparation, system optimization, linear templatebased protein synthesis, and reduction of process costs, which has led to the preparation of cost-effective systems suitable for commercial purposes. a general scheme of cf protein production is depicted in fig. . cfps platforms are based on either prokaryotic or eukaryotic origin. among the prokaryotic cf systems, extracts based on e. coli are regularly used and are available commercially for cfps of a diverse range of proteins. very recently, cf systems based on bacillus subtilis [ ] , pseudomonas putida [ ] , streptomyces [ ] , and vibrio [ ] have been optimized well at the laboratory level due to the ease of preparation of cf lysates. a wide range of detailed protocols is currently available for the preparation of e. coli-based lysates. among the eukaryotic cf systems, extracts based on rabbit reticulocyte lysate (rrl), wheat germ, insect spodoptera frugiperda (sf ), chinese hamster ovary (cho), and cultured human cells are regularly used. an increasing number of eukaryotic cf systems have so far reached technical maturity and become commercially available. requires dna template to be cloned in a plasmid [ ] toxic proteins ideal choice for the synthesis of most of the toxic proteins due to high toxic tolerance of cf systems toxic proteins may be difficult to synthesize [ ] membrane proteins suitable for a wide range of mps of different sizes mp overexpression can lead to cell toxicity and death [ ] [ ] [ ] membrane protein solubilization to solubilize mps, supplements can be added directly to the reaction mixture in the form of nanodiscs, detergents or liposomes (prokaryotic systems and wheat germ systems) or by using endogenous microsomes or proteoliposomes (eukaryotic systems) not possible to add supplements externally during translation mps have to be purified and reconstituted into liposomes or detergents for functional analysis [ , ] reliability most of the reports related to cfps are currently limited to the research and laboratory level, but progress towards the drug discovery pipeline has been made recently most reliable and state of the art for protein production and drug discovery purposes, and approved by drug authorities [ ] functional characterization compared with cell-based systems, standardized biophysical, biochemical assays are limited, but progress has been made recently a wide range of standardized biophysical and biochemical techniques are available for proteins synthesized by cell-based systems [ ] protein yields and downstream applications yields range from µg/ml (complex proteins) to several mg/ml (cytosolic proteins and few mps) with more complex proteins downstream applications are simple and protein can be purified and reconstituted immediately after synthesis yields can be very high, in the range of mg/ml downstream applications are possible but need to additionally lyse the cells for mps [ , , ] post-translational modifications (ptms) ptms possible (mostly in eukaryotic cf systems with translationally active microsomes) limited ptms in prokaryotic and eukaryotic lysates lacking endogenous microsomes. o-glycosylation not possible all ptms are possible including o-glycosylation [ ] incorporation of non-standard amino acids ideal choice for the incorporation of single and multiple noncanonical amino acids difficult to incorporate non-canonical amino acids due to cell membrane barrier and cytotoxic effects [ , ] scale of reaction volume ranging from few µl (chip-based and batch-based in an eppendorf tube) to l reaction (in a fermenter) typical reactions require a minimum of ml. there are exceptional cases where it is performed in µl spots [ ] [ ] [ ] [ ] flexibility completely open system and easy to manipulate the reaction conditions with lack of cell membrane constraints completely closed system and difficult to manipulate [ ] automation cf systems can be automated with high throughput screening of multiple templates, starting in an elisa plate format generally difficult to automate due to the requirement of larger volumes and aseptic techniques [ ] point of care production of biologics lyophilized cf lysates are suitable for the production of therapeutic proteins next to the emergency settings very difficult due to its time-consuming process and requirements of large infrastructure including manufacturing facilities, transport, and cold storage facilities [ , ] recently, several eukaryotic cf extracts based on tobacco [ ] , leishmania [ ] , neurospora [ ] , yeast cells [ ] , and human blood cells [ ] were characterized and optimized for a limited number of proteins at the laboratory level. there is a growing trend in the development of novel cf platforms for taking advantage of the genetic tools available in the literature and the abundant literature available on the in vivo expression of proteins. prokaryotic cf systems based on e. coli are most commonly used for protein production towards drug development due to their simplicity and a vast literature available on the utilization of these cells. protein synthesis starts with crude cell extracts prepared from e. coli cells that contain the translation machinery along with all the essential components required for translation. a modified and reconstituted cf synthesis system known as the pure system (protein synthesis using recombinant elements), where all the components of the translation machinery are purified and added individually along with the dna template to produce the protein, has been reported [ ] . this is a highly controlled system compared with crude extract methods. a major advantage of the pure system is that protein factors participating in the initiation, elongation, and termination of the protein synthesis process are identified and can be adapted individually to the cf system's requirements. although the naturally occurring ptm machinery is not available in the e. coli lysates, recently proteins with n-glycosylation were synthesized by using e. coli extracts enriched with glycosylation components, including oligosaccharyltransferases (osts) and lipid-linked oligosaccharides (llos) [ ] . using release factor (rf ) deficient e. coli lysates, proteins were phosphorylated by incorporation of non-canonical amino acids, which will be addressed in a later part of this review [ ] . due to a constantly growing demand for more complex proteins of pharmaceutical value, cf systems based on eukaryotic lysates have been developed to produce highquality proteins. cf systems based on wheat germ lysates (wgl) are among the most popular eukaryotic platforms due to their capacity to produce eukaryotic proteins with high yields [ ] . cfps based on wgl have been used frequently for the discovery of novel vaccine candidates as well as for producing several proteins of high quality for structural analysis. despite the high yields and quality of the lysate, this system does not offer all the ptms like glycosylation and does not support the solubilization of complex mps [ ] . in the case of wheat germ and rrl, there are no translationally active endogenous microsomes present in the system. in the case of rrl, exogenous microsomes are typically supplied from the canine pancreas for protein translation [ , ] . it is quite laborious and difficult to enrich rrls with heterologous microsomes. cf systems derived from cultured insect (sf ) cells represent the most popular eukaryotic-based approach for synthesizing a wide variety of proteins. sf lysates contain translationally active endogenous er membranes, thereby supporting the signal peptide-mediated translocation of proteins across the membrane, and further provides functions such as signal peptide cleavage, post-translational modifications like n-glycosylation, and lipid modification [ , , ] . cho cell-based expression is well established and is approved for the large-scale synthesis of several biologics by the fda because it undergoes human-compatible ptms. nearly % of the approved mammalian therapeutic proteins are currently expressed in cho cells. however, these cells have limitations when it comes to difficult-to-express proteins like overexpression of complex mps, toxic proteins, and multi-subunit proteins as discussed above. cf systems based on cho lysates are evolving as an alternative strategy for the expression of difficult-to-express proteins [ , [ ] [ ] [ ] ] . apart from many general advantages of cf systems, cho-based cf systems retain most of the features of cho cells while being more flexible due to the lack of cell membrane boundaries. cho-based lysates harbor endogenous microsomal vesicles enabling translocation of transmembrane proteins and secretory proteins. furthermore, ptms of de novo synthesized mps, such as glycosylation, are possible using cho lysate. thus, using cho cell lysate for cfps has a potential value and enables new opportunities, in particular, the high-yield production of pharmaceutically relevant mps [ , [ ] [ ] [ ] . there is a significant increase in the number of publications based on cho lysates for cfps. table compares different cf systems and their advantages and limitations with some selected examples. fig. general scheme depicting the overall process of cell-free protein production. aatrna aminoacyl-trna, aas aminoacyl-trna synthetase, atp adenosine triphosphate, ef elongation factor, gsh glutathione, gssg glutathione-disulfide, gtp guanosine- '-triphosphate, if initiation factor, ires internal ribosome entry site, mp membrane protein, ncaa non-canonical amino acid, pdi protein disulfide isomerase, peg polyethylene glycol, ptm post-translational modification, r ribosomes, t-rna transfer rna, tf transcription factor, utr untranslated region, vlp virus like particle chinese hamster ovary (cho) mimic the cho cell-based production ptms (n-glycosylation, disulfide bridging, and lipidation) suitable for a wide range of eukaryotic and complex proteins presence of translational active endogenous microsomes [ ] high yields in cecf mode endotoxin free lysates used for point-of-care testing [ ] low yields especially in the batch mode [ ] cost ineffective and difficult to establish unlike e. colibased system streptokinase (cecf): µg/ml [ ] human tlr receptor [ ] ( -h batch): µg/ml ( -h cecf): µg/ml hegfr [ ] (batch): µg/ml (cecf): µg/ml cfps can be performed in different formats. the batchbased format is the most commonly used method both in the prokaryotic and eukaryotic systems. this method is relatively fast and cheap, and synthesis can be performed within . - hours depending on the system. e. coli-based systems can provide protein yields ranging from µg/ ml to - mg/ml. although the yields from batch-based eukaryotic systems are comparatively low, mps are automatically incorporated into microsomal membranes and the functionality can be addressed immediately after the synthesis [ ] . for researchers who would like to further scale up the protein yields via batch-based eukaryotic systems, a repetitive batch-based synthesis format has been proposed where the microsomes incorporating the mp of interest generated in an initial synthesis reaction can be added to a fresh cf synthesis reaction that has been depleted of its microsomes [ , ] . another popular cf synthesis format that has been used for a rapid increase in the protein yields is the so-called continuous exchange cell-free synthesis platform (cecf). in this format, a semi-permeable dialysis membrane separates the reaction chamber and a feed chamber and thereby a feed chamber provides the fresh reaction components and enriches the reaction chamber. in exchange, the inhibitory components accumulated during the reaction are removed [ , , , ] . typically, the cecf format prolongs the reaction time and increases the protein yields. until now, the cecf format has been used to increase the protein yield by multiple fold, and is widely used as cf platforms (table ). this section highlights some of the key parameters that might influence the protein production using cf lysates. designing synthetic dna and sequence manipulation for cf synthesis by adding regulatory elements plays a significant role in high-yield protein production. in eukaryotic cf systems, initiation factors (ifs) in particular limit the initiation of protein synthesis, thereby leading to low protein yields. one alternative is to use internal ribosome entry site (ires) elements found in the ′-untranslated region ( ′utr) of the different viral genomes upstream of the start codon for cap-independent translation initiation [ , , ] . ires elements from three different viral sources were compared for their translational efficiency in sf , cho, and human leukemia k cf lysates. the ires from the cricket paralysis virus (crpv) typically increased protein yields by a factor of - [ ] . inserting the crpv-ires into the corresponding vector upstream of the epidermal growth factor receptor (egfr) gene, and using the cecf reaction format, egfr yields were significantly increased to more than -fold compared with batch reaction format without crpv-ires [ ] . additionally, replacement of the initiator codon (atg) to a gcu-codon in combination with the crpv-ires resulted in a further improvement of protein expression levels in cho and k cf systems [ ] . the vector backbone also plays an important role in cfps. a detailed study comparing commercially available vectors harboring the luciferase gene in combination with crpv-ires showed that there is a significant -fold increase in protein yield with a change in the vector backbone [ ] . species-independent translational sequences (sits) are another group of synthetic ′utrs capable of initiating cap-independent translation in multiple prokaryotic and eukaryotic cf systems [ ] . typically, polymerase chain reaction (pcr) products are generated with sits downstream of the t promoter and upstream of the start codon atg [ ] . the ′ hairpin region of the sits increases the residence time of the preinitiation complex in the vicinity of the start codon [ ] . using l. tarentolae cfps in the presence of genes encoding rab encoding variable fragments in combination with a universal sits, nearly a full complement of human rab gtpases were produced with a yield of around µg/ml [ ] . similarly, egfp with a yield of around µg/ml [ ] , and an active multisubunit enzyme heterodimeric farnesyl transferase (ftase) [ ] were synthesized using the l. tarentolae cfps [ ] . codon optimization is another important parameter that plays a crucial role in increasing the expression yields of proteins. codon optimization has been shown to influence the translation efficiency of several proteins [ ] . by taking advantage of the cf lysates derived from n. crassa and s. cerevisiae, transcription and translation reactions were uncoupled for ribosome profiling, which provided strong biochemical evidence that codon optimization enhances the rate of translational elongation, thereby affecting the ribosome traffic on the mrna [ ] . on the one hand, codon optimization usually improves protein yields, but on the other hand, it was shown that faster translation rates might negatively affect the protein folding and function of the individual protein [ , ] . this problem often cannot be solved even by altering the trna population in the case of cfps. the addition of anti-spliced leader oligonucleotide to l. tarentolae cell extracts suppressed the translation of endogenous l. tarentolae mrnas, thus increasing the translation efficiency of exogenously supplied mrna [ ] . using the er-specific signal sequence of honeybee melittin (melittin signal peptide) instead of the native signal peptide increased the translocation of synthesized proteins such as wnt proteins, single-chain antibody variable fragments, and the htlr -ectodomain into microsomes in the case of sf and cho-based cf systems [ , , , ] . iterative optimization processes are required to develop highyield cfps. factors that influence both protein quality and quantity include reaction temperature, reaction time, plasmid concentration, salt concentration, t polymerase, and other supplements. the influence of these factors on the synthesis rates is also protein specific. very recently, cfps of human toll-like receptor protein (htlr ) in cho-based lysates has been reported by using a cecf method with high yields of around . mg/ml. by increasing the temperature from to °c, the protein yields were increased by almost %. stable monitoring and maintenance of ph throughout the entire cf reaction along with sufficient adenosine triphosphate (atp) supply are essential for efficient and maximum yield protein production. by using amino acid decarboxylase, the ph is controlled throughout the cf reaction [ ] . supplementation of chaperones influences the functional folding of many proteins. supplementation of chaperones such as groes/el and dnak/dnaj/grpe in prokaryotic cf systems was used to increase the yield and solubility of colicin m from to %, resulting in enhanced cellkilling activity [ ] . li et al. demonstrated that by using cfps based on wheat germ extracts, expression of j-domain containing chaperone proteins (dnajb and dnajb ) along with potassium channels plays a critical role in the folding, stabilization, and tetramerization of k+ channels [ ] . ion concentrations (potassium and magnesium) in the cf reaction have a significant effect on protein production. in the case of cho-based cecf reactions, an increase in the magnesium ion (mg + ) concentration from . to . mm led to a . -fold increase in egfr yield [ ] . for efficient regeneration of atp, several methods have been developed in cf systems. in prokaryotic systems, compounds like phosphoenolpyruvate (pep), glucose + glutamate decarboxylase, glucose- -phosphate, fructose- , -biphosphate, acetyl phosphate, maltodextrin, and creatine phosphate are widely used as energy sources [ ] . in eukaryotic cf systems, a combination of creatine phosphate and creatine kinase is typically used for energy regeneration. apart from these, phosphoglycerate (b. subtilis), and polyphosphate are used in cf systems [ , ] . cf systems have evolved over the last decade from their use as a prototype method in research laboratories to commercial and large-scale applications. in this section, the utility of cf systems in mp synthesis, antibody production, vaccine development, protein labeling, and antimicrobial peptide synthesis are addressed. mps are structurally and functionally diverse, and constitute % of the proteins encoded in the human genome. drugs targeting mps such as ion channels, transporters, and g-protein coupled receptors (gpcrs) represent out of the top global revenues in the pharmaceutical industry [ ] . due to the presence of transmembrane domains, ranging from to , these proteins are highly hydrophobic and are very challenging to express by traditional cell-based systems. expression of human proteins in heterologous cellular hosts is very much limited due to the difference in their lipid composition, which can prevent the mps from attaining maximum functionality [ ] . synthesis of mps by cell-based methods often leads to cytotoxicity, aggregation, and improper folding [ ] . to analyze mp functionality, the protein needs to be folded properly and in the appropriate hydrophobic environment. cf systems derived from prokaryotic, as well as eukaryotic lysates lacking endogenous microsomes, require specific supplements for the solubilization of mps in the form of detergents, nanodiscs, or liposomes. non-ionic and zwitterionic detergents are commonly used as supplements in the majority of cfps reactions for the solubilization of mps during their production. detergent-solubilized mps can be either used directly for functional analysis or may be reconstituted into liposomes by mixing with artificial lipids followed by detergent removal [ , , ] . alternatively, nanodiscs (nds) and liposome-based reconstitution are detergent-free strategies where nds and liposomes, prepared and characterized externally, could be supplemented directly into the reaction mixture for the reconstitution [ , ] . a detailed review of the cf synthesis of mps and the usage of solubilization supplements for isolation and functional analysis is presented in the literature [ ] . some of the advantages of nds are easy purification and flexibility in using different lipids and membrane scaffold proteins for creating different sizes, and their availability as monodisperse and homogenous nds. nonetheless, nds have their limitations, particularly when working with a protein whose functionality depends on its orientation and also working with transporter proteins. liposome-based reconstitution covers the limitations of the nds, but the separation of liposomes after the cf reaction is quite challenging and often suffers from disruption due to osmotic instability. further, such passive reconstitution strategies do not offer the advantages of post-translational modifications within native membranes and are limited for mps whose function does not depend on active translocon-based translation. cf systems derived from eukaryotic lysates equipped with endogenous microsomes (e.g., sf , cho, cultured human cells, and tobacco-by ) satisfy all the necessary requirements for proper folding of mps. the microsomes offer a native environment and intact translocon machinery for a proper embedment and folding of mps [ , , , , , , ] . there are continuing efforts in analyzing the functionality of microsomal reconstituted mps, indicated by a good number of publications reporting on this reconstitution strategy, which should help the pharmaceutical industry to develop more dynamic drug screening assays involving mps [ , , ] . here we present recent works on ion channels, transporters, and gpcrs, which constitute more than % of the major drug targets in the pharma industry [ ] . ion channels constitute approximately % of all currently existing human drug targets and play a crucial role in diverse physiological processes involving cell excitability, neuronal transmission, metabolism, sensory transduction, cognition, and electrolyte homeostasis. transporters mediate the translocation of a variety of substrates across biological membranes [ ] . the solute carrier (slc) family is the largest class of transporters and is implicated in metabolic conditions and diseases, and in the transport of drugs. these proteins typically have - transmembrane domains and are difficult to express by traditional methods [ , , , ] . slc transporters are an emerging drug target class and the molecular target of several approved inhibitor drugs [ ] . despite this, these classes of proteins remain largely unexplored in recent years due to the high costs involved and lack of proper expression methods [ , ] . table highlights some of the selected publications using cf methods for synthesis, reconstitution, and functional analysis of ion channels and transporters. the most widely used method of reconstitution for functional analysis is detergent-based reconstitution into liposomes or passive integration of mps into liposomes and nds. the majority of the functional assays were performed with plbe in the case of ion channels and substrate uptake assays in the case of transporters. gpcrs transduce extracellular stimuli to the inside of cells, after activation by a variety of different molecules such as neurotransmitters, hormones, odorants, and peptides, thereby triggering several signal transduction cascades. the involvement of gpcrs in almost all processes in living cells has resulted in significant pharmaceutical interest in this protein class, and the development of robust and high-throughput-suitable assays for the discovery of novel ligands and drugs targeting these proteins. in principle, the screening of ligands can be performed in whole-cell assays by measuring a downstream signaling event, or in cf assays, which are decoupled from the living organism. usually, these decoupled methods are preferable for high-throughput screenings, as they are easy to handle and therefore amenable for automation and downsizing. these parameters can be well combined with cfps. an automated cf synthesis procedure for the production of different mps is already reported [ ] . this procedure might be further expanded for the parallel analysis and identification of molecules that target different gpcrs. to date, only a few studies have analyzed in detail the activity of receptors produced by cf systems. the main reason for this is that there are limited well established activity assays. this section addresses possible activity assays that might be transferred to cf systems in the future. radioligand binding assays, the gold standard for identifying binding molecules, are already adapted for gpcrs that have been synthesized in eukaryotic and prokaryotic cf systems, and demonstrate similar binding affinities in comparison with in vivo produced gpcrs [ , [ ] [ ] [ ] . alternatively, fluorescently labeled ligands can be analyzed by an optical read-out system using eukaryotic cf systems harboring endogenous membrane structures [ ] . nevertheless, for these systems, radiolabeled or fluorescently labeled ligands are required, thereby limiting the analysis mainly to gpcrs with already known ligands. in addition, simple ligand binding assays usually do not differentiate between an agonistic and an antagonistic effect of the bound substance. in this context, measuring downstream signaling to distinguish between an activating and inhibitory ligand is preferable. one possible method of choice is the receptor-mediated coupling of g proteins [ ] . this early event immediately follows after gpcr activation and is detected by the binding of [ s]gtpγs to gα subunits. this method is not yet established in cf systems but might be transferable assuming the presence of gα proteins in the eukaryotic lysate. alternatively, the gα proteins can be additionally co-synthesized to the target gpcr or directly applied to the reaction based on the open nature of cf systems. after gpcr activation, gtp binding and hydrolysis should be detectable. in addition to ligand binding and g protein coupling, intra-and intermolecular interactions can be visualized by förster and bioluminescence resonance energy transfer techniques. different sensor models are known in living cells [ ] . the monitoring of intermolecular interactions can be performed as well in cf systems using the already established in vivo models. one model includes the tagging of the c-terminus of a gpcr of interest to a fluorophore (gfp/yfp) and fusing a binding partner such as β-arrestin to luciferase or a second fluorophore [ ] . upon activation of the gpcr, β-arrestin binds to the receptor and both tags are in close proximity, resulting in a measurable energy transfer. this model requires active g protein-coupled receptor kinases for the phosphorylation of the c terminus of the synthesized gpcr to get recognized by β-arrestins. this requirement has to be analyzed in detail in the individual cf systems. the second known in vivo model visualized intramolecular changes after agonist and antagonist binding by introducing fluorophores into the third extracellular loop and the c terminus of different gpcrs. upon activation, the distance between both fluorophores changes and an alteration in the energy transfer can be measured [ , ] . initial experiments to transfer these energy transfer-based sensors were recently performed in cf systems [ ] . both models can be applied to high-throughput analyses. in summary, the successful cf synthesis of a variety of gpcrs has been demonstrated in recent years and a transfer of these gpcr production systems to a drug discovery format in a high-throughput manner has recently started. in the near future, we might see novel technologies for ligand screenings, thereby utilizing the advantage of the automatization and downsizing capacity of cf systems. the gold standard for synthesis, development, and production of antibody-based drugs (based on full-length antibodies) is mammalian cell culture-based expression systems. although cultivation of mammalian cells is well established and widely used, the development of monoclonal antibodies (mabs) and antibody-drug conjugates (adcs) remains time-consuming and challenging. thus, methods for highthroughput screenings, especially in the early-stage evaluation of antibody candidates, are valuable. in view of this, the use of the cf technology constitutes a promising strategy to shorten the time from antibody discovery to production. using cf technology, antibodies can be produced in a flexible scale within a couple of hours. besides the synthesis of individual antibodies, cf technology can be used to display libraries of antibodies. in contrast to phage and yeast display, in vitro cf systems such as ribosome and mrna display are open, and thus result in higher library sizes. in theory, the size of the library is only limited by the quantity of supplemented mrna/dna, the volume of the cf reaction, and the number of ribosomes within the system, resulting in library sizes of ~ − /ml cf reaction [ ] . in comparison, phage and yeast display exhibit library diversities of ~ - . selection technologies such as ribosome display [ ] , mrna display [ ] , and cis display [ ] have been developed based on reticulocyte lysate [ ] and e. coli cf systems [ ] . these systems focused on smaller antibody fragments because their functionality does not rely on the assembly of multiple polypeptide chains. nonetheless, recently two groups have succeeded in developing completely cf display technologies that allow the selection of fab fragments [ ] . the challenge to assemble the heavy and light polypeptide chain (hc/lc) of the fab fragment was approached in different ways. while sumida et al. succeeded by combining mrna display based on two mrna sub-libraries, one encoding hc, the other one encoding lc, with in vitro compartmentalization pcr to link and then amplify hc and lc gene pairs [ ] , stafford et al. developed a ribosome display method where they displayed only one of the two fab chains, while the other one was not presented in display format [ ] . successful synthesis of different antibody formats, including single-chain variable fragments (scfvs), fab fragments, as well as complete iggs, has already been shown in e. coli [ , ] , sf [ , ] , reticulocyte [ ] , wheat germ [ ] , and cho cf systems [ , , ] . furthermore, the upscaling of cf reactions to the liter-scale [ , ] as well as downscaling [ ] and high-throughput applications [ ] have been demonstrated. in addition, advances in bioorthogonal reaction chemistries have paved the way to expand the possibilities for adc development. the site-specific introduction of non-canonical amino acids into a genetically engineered sequence can be used to create site-specifically labeled adcs [ ] . currently, seven adcs are approved for therapy. to date, all of these adcs have been generated by coupling of mabs to the cytotoxic linker-payload via surface-exposed lysines, or partial disulfide reduction and conjugation to free cysteines, which typically results in a controlled but heterogeneous adc population with varying numbers and positions of drug molecules attached to the mab [ ] . homogeneous adc populations can be achieved by introducing the payload at one or more defined positions. by developing a bioorthogonal trna/synthetase pair, zimmerman et al. have shown that the optimized non-canonical amino acid para-azidomethyll-phenylalanine (pamf) can be site-specifically incorporated into the tumor-specific, her -binding igg trastuzumab [ ] . subsequently, the cytotoxic linker payload dbco-peg-monomethyl auristatin (dbco-peg-mmaf) was conjugated to pamf via strain-promoted azide-alkyne cycloaddition (spaac) copper-free click chemistry. in the context of dual-functioning molecules, bispecific antibodies have also emerged as promising anti-cancer agents. one of the advantages of these proteins is their capability to target two different epitopes simultaneously, thereby increasing target engagement, where mono-specific antibodies might fail [ ] . due to their open design, cfps reactions can easily be manipulated, for example by varying the template ratios and concentrations of hc and lc. for example, xu et al. showed the successful assembly of bispecific 'knobs-into-holes' antibodies in multiple scaffolds by using an e. coli-based cf expression platform [ ] . taken together, antibody evolution, selection, and engineering can dramatically benefit from the technological advances in the field of cfps. ( ) novel display technologies based on cf methods enable the in vitro evolution of multimeric proteins and allow for more sophisticated protein engineering. ( ) due to the very short time frame from synthesis to functional testing, cf systems can accelerate antibody construct evaluation by a repetitive (one after one) and/or parallel screening. ( ) the introduction of non-canonical amino acids expands the chemical repertoire and thus the possibilities to modify and improve antibody-based therapeutics. advanced labeling technologies allow for a very fast qualitative analysis of drug-to-antibody ratio (dar), linker, linker/position, drug, drug/position (research application), and allow full control of the adc design (commercial application). cf systems are becoming a potential option for synthesizing vaccine antigens. most of the vaccine antigens produced by cf systems to date have used e. coli and wgl. recent progress on eukaryotic cf systems may offer additional advantages. in this context, eukaryotic cf systems are endotoxin free and lack the complex plasma membrane that makes the protein purification simple. some of the antigens synthesized by using cf systems are highlighted in table . they are able to induce a strong immune response in experimental animals and could serve as a proof of concept for future vaccine development. using recent advances in cfps technology, a freeze-dried, cell-free (fd-cf) expression system was created based on e. coli cf lysates [ ] . using this fd-cf technique, diphtheria toxoid antigen variants (dt and dt ) were produced following rehydration with water and functional characterization of the synthesized proteins was verified following administration in mice and measuring the immune response [ ] . the fd-cf method could enable the production of on-demand, point-of-care biologics requiring just the simple addition of water for activation and synthesis. recently, cf-based expression has proven successful in producing difficult-to-express proteins like major outer membrane protein (mmomp) of chlamydia spp., a major vaccine antigen. using e. coli-based cfps, mmomp was synthesized in a native trimeric form in the presence of nanolipoproteins (nlps) with a yield of around . mg/ml. when injected into mice in the presence of an adjuvant, the protein elicited an enhanced humoral immune response [ ] . this method of synthesizing and simultaneous incorporation of antigens into nlps using a cf approach is a promising method for future vaccine development. conjugate vaccines are one of the safest and most effective biologics [ ] . bioconjugate vaccines are produced using protein glycan coupling technology (pgct). however, pgct has its own limitations such as time-consuming in vivo processes. additionally, fda-approved carrier proteins, such as toxins derived from clostridium tetani and corynebacterium diphtheria, have not yet been demonstrated to be compatible with an e. coli-based production process. relevant ptms are often difficult to synthesize in e. coli cf systems. meanwhile, there have been further advances in using bacterial glycoengineering combined with cfps for producing bioconjugate vaccines. this cf glycoprotein synthesis (cfgps) used glycooptimized e. coli extracts integrating both n-linked glycosylation and protein synthesis. using cfgps, two bioconjugate vaccines were synthesized against f. tularensis and e. coli o [ , ] . besides posttranslational modification, the assembly of macromolecular structures in cf systems is highly ambitious. virus-like particles (vlps), for example, are nanoscale structures that are formed from the self-assembly of viral proteins without the viral genome responsible for the infection. usually, vlps mimic the capsid structure of the real virus. vlp antigens are vaccine candidates for several diseases [ ] . one of the vaccine candidates, which is currently in clinical trials, contains vlp antigens addressing noroviruses responsible for gastroenteritis in humans [ ] . cf synthesized vlps were structurally confirmed by electron microscopy [ ] . using e. coli cf systems, antimicrobial colicins (colicin m, la, e , and e ) have been synthesized with high yields (around µg/ml) and solubility. the synthesized colicins are able to effectively kill the target cells without any purification [ ] . antimicrobial peptides (amps) are another class of defense molecules that have a wide spectrum of targets; for example, bacteria, viruses, fungi, parasites, and cancer. amps are evolving as alternatives to antibiotics [ ] . using lyophilized e. coli cf lysates, ten different amps have been synthesized successfully and the functionality of plasmodium falciparum wheat germ interaction of pfmsa with cd was confirmed by erythrocyte binding assay [ ] . antigen-specific igg responses to lsa -c were profiled by an alphascreen assay [ ] . western blotting and elisa confirmed the interaction of purified recombinant msp with human sera [ ] . immunization of purified pfron with freund's complete adjuvant into japanese white rabbits generated pfron antisera [ ] . rabbits immunized with expressed pfripr produced specific antibodies [ ] . anti-exp antibodies were generated by immunization of recombinant exp [ ] . wheat germ proteoliposomal engineered cldn- antigens induced anti-cldn - -ecr antibodies in mice [ ] αcd -id small diabody (db) molecule containing both a b-cell-targeting moiety (anti-cd ) and a lymphoma id, c -scfv idiotype-specific single-chain variable fragment of the immunoglobulin from the c mouse b-cell lymphoma, cldn- claudin- , when non-canonical amino acids (ncaas) are incorporated into proteins, novel functional, structural, and imaging properties can be generated. this synthetic biology application is fast emerging and has wide applications such as incorporating precise ptms and adding novel functions to proteins [ , , , ] . by taking advantage of the openness of the cfps, one can add the machinery responsible for the co-translational incorporation of ncaas directly to the standard reaction components. one possible method to incorporate ncaa is to use precharged trnas harboring the ncaa. one of the most commonly used methods is the amber suppression technology using an orthogonal pair of aminoacyl-trna synthetase/trna (o-trna/aars pairs from distinct organisms), which functions independent of endogenous aarss and trnas in the host and is used to direct the incorporation of ncaas to specific positions such as the amber stop codon (uag). after incorporation of an ncaa with a reactive group, bioorthogonal click reactions can be performed to conjugate a molecule of interest. the most general biorthogonal click reactions for conjugating molecular probes or polymers are the copper-catalyzed azide-alkyne cycloaddition (cuaac), staudinger-ligation, photo click cycloaddition, strain-promoted azide-alkyne cycloaddition (spaac) and inverse electron-demand diels-alder cycloadditions (iedda + spiedac). using e. colibased cf systems, cui et al. showed the incorporation of two fluorescent labels, bodipy fluorophore and tamra-dibo, by using a precharged trna + orthogonal system for fret measurements [ ] . using sf -based cf systems, quast et al. demonstrated the incorporation of p-azido-l-phenylalanine at defined amber positions in parallel in the two subunits of the human egfr protein dimer. later, the azido group of the incorporated azfs was coupled by photoaffinity cross-linking using a bis-combo linker to create a stable synthetic dimer of egfr [ ] . the dimerized protein shows autophosphorylation in the presence of tyrosine kinase. in general, release factor (rf ) competes with orthogonal ncaa-trna for the amber codon, which results in truncated products along with successfully suppressed products. so, cf lysates derived from genetically modified e. coli lacking release factor (rf ) can be used to enhance the incorporation efficiency of ncaas. using the orthogonal system and e. coli-based cfps, human mek kinase with ptms was synthesized up to milligram quantities by site-specific, co-translational incorporation of phosphoserine at specific positions [ ] . various polyethylene glycol (peg) moieties have been widely used to decorate therapeutic proteins. the peg moiety usually offers high stability and extends the half-life of proteins while in circulation inside the body. the food and drug administration (fda) has recognized peg moieties as safe due to their structural flexibility, hydrophilicity, and minimal toxicity, and several pegylated drugs have been approved by the fda. using sf -based cf systems, a site-specific pegylated human epo was produced and characterized by autoradiography [ ] . apart from the amber suppression strategy, there are other strategies like frameshift suppression, sense codon reassignment, and unnatural base pairing. a detailed review of prominent methods for the incorporation of ncaas into proteins using cfps has been recently published [ ] . a wide range of commercial cf systems is available in the market based on lysates derived from diverse sources. as well, a few companies provide services for cf synthesis of proteins. some of the products derived from the cf systems based on e. coli lysates are already in clinical trials, such as adcs targeting cd and folate receptor alpha highly expressed in myeloma and cancer cells (sutro biopharma, inc, usa). table lists commercial systems currently available for the cf production of proteins. evolving cf systems from a laboratory level to a robust production platform is necessary to fulfill their potential. prior to full realization of cf systems as emerging tools for drug discovery and evaluation, several factors need to be addressed, like synthesis of the high-quality functional protein with proper folding and ptm, cost of production, scalability, and safety issues. a more detailed understanding of the components in the cf lysates will substantially improve the quality and stability of the extract preparation. the quantity of the protein depends on the translation efficiency of the cf system. the most important factors that influence the protein yields are quality of the celllysate, reaction conditions, and template optimization as addressed in section . . to increase the translation efficiency, further efforts are required to increase the quality of lysate production. this can be achieved by using genetic engineering tools to remove the factors responsible for nucleic acid degradation, ribosome inactivation, and protein degradation. brodiazhenko et al. showed that genomic disruption of genes encoding ribosome-inactivating factors (hpf in b. subtilis and stm in s. cerevisiae) has improved the activities of bacterial and yeast translation systems [ ] . in this context, advanced engineering tools like crispr cas could help to improve the translation efficiency of the cf systems [ ] . activation and enrichment of translation-relevant factors could also increase translation efficiency [ ] . when it comes to eukaryotic cfps platforms, translocation through microsomes currently remains a black box. optimizing the efficiency of coupling translation and translocation needs to be addressed. the most important issue with cf systems, especially when working with cecf systems, is to maintain the balance between the amount of protein synthesized and the stability and quality of the protein. although cecf has been capable of producing . - mg protein per ml, especially with the mammalian expression systems, only a small fraction of the produced protein was subject to detailed functional analysis [ , ] . this is one of the reasons why the functional assays are limited to binding assays (gpcr, tlr, antibody), plbe (ion channels), and colorimetric assays (enzymes). by optimizing the redox conditions, the problem of ab translocation into the lumen of microsomes is addressed already [ , ] . however, when it comes to the synthesis of complex transmembrane proteins in mammalian systems, the insertion efficiency might be already saturated at the low synthesis rates due to restrictions on the level of the translocon's functionality. a more detailed analysis of lipid composition and proteins constituting the microsomes present in the insect, cho, and human-derived lysates will help to improve the quality of synthesized membrane proteins. one could use alternative supplements like nanodiscs or liposomes reconstituted from microsomal membranes to support mp integration [ ] . intense efforts on designing novel and improved mammalian cf systems should be maintained as the majority of the drug targets are related to complex eukaryotic proteins. optimizing cf reactions in order to decrease protein aggregation during the purification processes and increasing the quality of the protein purification, especially when using the cecf method, is strongly required. another point to address in the field of cfps is to decrease the costs of production, especially in the preparation of cf lysates and the individual reaction components. substantial costs arise from the usage of phosphorylated energy systems, cofactors, nucleotides, amino acids, and dna. alternative energy regeneration systems are available in the place of phosphorylated substrates (e.g., glucose, maltodextrin, etc.) for sustainable atp regeneration throughout the synthesis reaction [ ] [ ] [ ] . use of nucleoside monophosphates instead of nucleoside triphosphates as the nucleotide source in the cf systems could be another cost-effective parameter [ ] . avoiding the use of exogenous trnas and cyclic amps and reducing the concentration of amino acids and nucleotides are some of the cost-effective parameters one could optimize during protein synthesis. additionally, new high-cell-density cultivation strategies and improvement in the quality of cell lines by genetic engineering could help to produce cost-effective high-quality cf systems. costs can also be decreased by engineering and optimization of eukaryotic lysates to extend the lifetime of these systems, thereby increasing the yield of the produced protein. there has been considerable progress in the point-ofcare production devices for on-demand biologic synthesis of small quantities of therapeutic proteins using cho lysates and e. coli lysates through on-site good manufacturing practice (gmp) [ ] . this type of miniaturized device could be useful for quick testing of proteins and thus help in treating common and rare diseases, and cfps could help solve the challenges associated with in vivo expression. due to the open nature of the cf systems, proteins can be modified with chemically synthesized glycans by bioconjugate chemistries. this will help to increase the quality and therapeutic efficiency of the synthesized proteins. there is an exponential increase in the number of publications from the last years using cf lysates for producing a wide range of proteins [ ] . due to the increased awareness of the biosynthetic potential of the cf systems, protocols becoming simpler, improvement in the lysate quality, and its applicability in the preparation of a diverse range of proteins, there will be unexpected outcomes in the field of protein production towards future drug development. funding this research is supported by the european regional development fund (efre) and the german ministry of education and research (bmbf, no. b a). open access this article is licensed under a creative commons attribution-noncommercial . international license, which permits any non-commercial use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the creative commons licence, and indicate if changes were made. the images or other third party material in this article are included in the article's creative commons licence, unless indicated otherwise in a credit line to the material. if material is not included in the article's creative commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder.to view a copy of this licence, visit http://creat iveco mmons .org/licen ses/by-nc/ . /. ion channels in drug discovery and safety pharmacology slc transporters as therapeutic targets: emerging opportunities unexplored therapeutic opportunities in the human genome properties of protein drug target classes antibodies and venom peptides: new modalities for ion channels prospects for pharmacological targeting of pseudokinases advances in development of antimicrobial peptidomimetics as potential drugs. molecules new tools for recombinant protein production in escherichia coli: a -year update membrane protein synthesis in cell-free systems: from bio-mimetic systems to bio-membranes cell-free protein synthesis for producing difficult-to-express proteins a user's guide to cell-free protein synthesis establishing a high-yielding cell-free protein synthesis platform derived from vibrio natriegens cell-free protein synthesis: pros and cons of prokaryotic and eukaryotic systems high-yield cell-free synthesis of human egfr by iresmediated protein translation in a continuous exchange cellfree reaction format cellfree synthesis of functional and endotoxin-free antibody fab fragments by translocation into microsomes cell-free synthesis and characterization of a novel cytotoxic pierisin-like protein from the cabbage butterfly pieris rapae improving the recombinant human erythropoietin glycosylation using microsome supplementation in cho cell-free system cell-free synthesis of human toll-like receptor (tlr ): optimization of synthesis conditions and functional analysis cell-free systems based on cho cell lysates: optimization strategies, synthesis of "difficultto-express" proteins and future perspectives systematic optimization of cell-free synthesized human endothelin b receptor folding cell-free protein synthesis: applications come of age cell-free synthetic biology: engineering in an open world cotranslational incorporation of non-standard amino acids using cell-free protein synthesis cell-free protein synthesis from genomically recoded bacteria enables multisite incorporation of noncanonical amino acids microscale to manufacturing scale-up of cell-free cytokine production-a new approach for shortening protein production development timelines on-chip automation of cell-free protein synthesis: new opportunities due to a novel reaction mode highthroughput transfection of interfering rna into a d cell-culture chip microarray platform affords improved product analysis in mammalian cell growth studies automated production of functional membrane proteins using eukaryotic cell-free translation systems point-of-care production of therapeutic proteins of good-manufacturing-practice quality on-demand biomolecular manufacturing development of a bacillus subtilis cell-free transcription-translation system for prototyping regulatory elements development of a pseudomonas putida cell-free protein synthesis platform for rapid screening of gene regulatory elements establishing a high yielding streptomyces-based cell-free protein synthesis system tobacco by- cellfree lysate: an alternative and highly-productive plant-based in vitro translation system leishmania cell-free protein expression system the cell free protein synthesis system from the model filamentous fungus neurospora crassa a combined cell-free transcription-translation system from saccharomyces cerevisiae for rapid and robust protein synthe rapid recombinant protein expression in cell-free extracts from human blood the pure system for the cell-free synthesis of membrane proteins single-pot glycoprotein biosynthesis using a cell-free transcription-translation system enriched with glycosylation machinery the application of cell-free protein synthesis in genetic code expansion for post-translational modifications wheat germ systems for cell-free protein expression cell-free expression and functional reconstitution of homo-oligomeric alpha nicotinic acetylcholine receptors into planar lipid bilayers cell-free protein synthesis as a novel tool for directed glycoengineering of active erythropoietin high-yield production of "difficult-to-express" proteins in a continuous exchange cell-free system based on performance benchmarking of four cell-free protein expression systems production and stabilization of the trimeric influenza hemagglutinin stem domain for potentially broadly protective influenza vaccines cell-free expression of a functional pore-only sodium channel high-level cell-free production of the malarial lactate transporter pffnt as a basis for crystallization trials and directional transport studies cell-free production of pore forming toxins: functional analysis of thermostable direct hemolysin from vibrio parahaemolyticus functional characterization of cell-free expressed kv . channel using a voltage-sensitive fluorescent dye crystallization of the membrane protein hvdac produced in cell-free system elimination of ribosome inactivating factors improves the efficiency of bacillus subtilis and saccharomyces cerevisiae cell-free translation systems cell-free protein synthesis from fast-growing vibrio natriegens membrane assembly of the functional kcsa potassium channel in a vesicle-based eukaryotic cellfree translation system cell-free synthesis of functional human epidermal growth factor receptor: investigation of ligand-independent dimerization in sf microsomal membranes using non-canonical amino acids accelerating the production of druggable targets: eukaryotic cell-free systems come into focus cell-free production of a therapeutic protein: expression, purification, and characterization of recombinant streptokinase using a cho lysate functional g-protein-coupled receptor (gpcr) synthesis: the pharmacological analysis of human histamine h receptor (hrh ) synthesized by a wheat germ cell-free protein synthesis system combined with asolectin glycerosomes accelerated pharmaceutical protein development with integrated cell free expression, purification, and bioconjugation ires-mediated translation of membrane proteins and glycoproteins in eukaryotic cell-free systems an efficient mammalian cell-free translation system supplemented with translation factors a technique to increase protein yield in a rabbit reticulocyte lysate translation system host-regulated hepatitis b virus capsid assembly in a mammalian cell-free system species-independent translational leaders facilitate cell-free expression cell-free expression, purification and immunoreactivity assessment of recombinant fasciola hepatica saposin-like protein- exploiting leishmania tarentolae cell-free extracts for the synthesis of human solute carriers an optimized yeast cell-free system: sufficient for translation of human papillomavirus l mrna and assembly of virus-like particles a cell-free expression and purification process for rapid production of protein biologics synthesis of membrane proteins in eukaryotic cell-free systems codon usage influences the local rate of translation elongation to regulate co-translational protein folding nonoptimal codon usage influences protein structure in intrinsically disordered regions cell-free eukaryotic systems for the production, engineering, and modification of scfv antibody fragments cell-free synthesis of functional antibodies using a coupled in vitro transcription-translation system based on method for cell-free protein synthesis involved with ph control with amino acid decarboxylase optimizing cell-free protein synthesis for increased yield and activity of colicins tetrameric assembly of k(+) channels requires er-located chaperone proteins energy systems for atp regeneration in cell-free protein synthesis reactions recent advances in development of cell-free protein synthesis systems for fast and efficient production of recombinant proteins effects of atp regeneration systems on the yields and solubilities of cell-free synthesized proteins combining in vitro folding with cell free protein synthesis for membrane protein expression membrane protein production in escherichia coli cell-free lysates reconstitution and functional characterization of ion channels from nanodiscs in lipid bilayers challenges in the development of functional assays of membrane proteins a comprehensive map of molecular drug targets a cell-free translocation system using extracts of cultured insect cells to yield functional membrane proteins in vitro synthesis of a major facilitator transporter for specific active transport across droplet interface bilayers functional reconstitution of cell-free synthesized purified kv channels the sensorless pore module of voltage-gated k + channel family embodies the target site for the anticonvulsant retigabine cell free expression and functional reconstitution of eukaryotic drug transporters coupled cell-free synthesis and lipid vesicle insertion of a functional oligomeric channel mscl mscl does not need the insertase yidc for insertion in vitro enhanced functional expression of aquaporin z via fusion of in situ cleavable leader peptides in escherichia coli cell-free system green fluorescent protein changes the conductance of connexin (cx ) hemichannels reconstituted in planar lipid bilayers a small viral potassium ion channel with an inherent inward rectification incorporation of adenine nucleotide transporter, ant p, into proteoliposomes facilitates atp translocation and activation of encapsulated luciferase synthesis and site-directed fluorescence labeling of azido proteins using eukaryotic cell-free orthogonal translation systems modulation of g-protein coupled receptor sample quality by modified cell-free expression protocols: a case study of the human endothelin a receptor production of g protein-coupled receptors in an insect-based cell-free system cellfree synthesis of a functional g protein-coupled receptor complexed with nanometer scale bilayer discs qualifying a eukaryotic cell-free system for fluorescence based gpcr analyses principles: extending the utility of [ s]gtp gamma s binding assays optical probes based on g proteincoupled receptors -added work or added value? the role of beta-arrestins in the termination and transduction of g-protein-coupled receptor signals a flash-based fret approach to determine g protein-coupled receptor activation in living cells intramolecular and intermolecular fret sensors for gpcrs-monitoring conformational changes and beyond a combined cell-free protein synthesis and fluorescence-based approach to investigate gpcr binding properties in vitro selection and evolution of functional proteins by using ribosome display antibody-ribosome-mrna (arm) complexes as efficient selection particles for in vitro display and evolution of antibody combining sites cis display: in vitro selection of peptides from libraries of proteinâ€"dna complexes antibodies to watch in in vitro selection of fab fragments by mrna display and gene-linking emulsion pcr in vitro fab display: a cell-free system for igg discovery engineering toward a bacterial "endoplasmic reticulum" for the rapid expression of immunoglobulin proteins aglycosylated antibodies and antibody fragments produced in a scalable in vitro transcription-translation system production of functional antibody fragments in a vesicle-based eukaryotic cell-free translation system efficient synthesis of a disulfide-containing protein through a batch cellfree system from wheat germ development of a cho-based cellfree platform for synthesis of active monoclonal antibodies sealable femtoliter chamber arrays for cell-free biology high-throughput screening of biomolecules using cell-free gene expression systems synthesis of site-specific antibody-drug conjugates using unnatural amino acids location matters: site of conjugation modulates stability and pharmacokinetics of antibody drug conjugates production of site-specific antibody-drug conjugates using optimized non-natural amino acids in a cell-free expression system dual targeting strategies with bispecific antibodies production of bispecific antibodies in "knobs-into-holes" using a cell-free expression system cell-free production of a functional oligomeric form of a chlamydia major outer-membrane protein (momp) for vaccine development protein carriers of conjugate vaccines: characteristics, development, and clinical trials efficacy of a potential trivalent vaccine based on hc fragments of botulinum toxins a, b, and e produced in a cellfree expression system hela based cell free expression systems for expression of plasmodium rhoptry proteins ralp is a rhoptry neck erythrocytebinding protein of plasmodium falciparum merozoites and a potential blood-stage vaccine candidate antigen a vaccine directed to b cells and produced by cell-free protein synthesis generates potent antilymphoma immunity cellfree production of scfv fusion proteins: an efficient approach for personalized lymphoma vaccines pfmsa is a novel plasmodium falciparum vaccine antigen that interacts with human erythrocyte integrin associated protein (cd ) pv , a novel plasmodium falciparum merozoite dense granule protein, interacts with exported protein in infected erythrocytes anti-msp igg inhibits plasmodium falciparum merozoite invasion into erythrocytes in vitro antibodies against a plasmodium falciparum ron inhibit merozoite invasion into erythrocytes identification of plasmodium falciparum reticulocyte binding protein homologue -interacting protein, pfripr, as a highly conserved blood-stage malaria vaccine candidate. vaccine plasmodium falciparum exported protein is localized to dense granules in merozoites wheat germ cell-free system-based production of malaria proteins for discovery of novel vaccine candidates immunological characterization of plasmodium vivax pv , a novel predicted gpi-anchored merozoite surface protein assessing sequence plasticity of a virus-like nanoparticle by evolution toward a versatile scaffold for vaccines and drug delivery malaria derived glycosylphosphatidylinositol anchor enhances anti-pfs functional antibodies that block malaria transmission immunogenicity of glycosylphosphatidylinositolanchored micronemal antigen in natural plasmodium vivax exposure wheat germ cell-free system-based production of hemagglutinin-neuraminidase glycoprotein of human parainfluenza virus type for generation and characterization of monoclonal antibody engineered membrane protein antigens successfully induce antibodies against extracellular regions of claudin- toy building set method for rapid in vitro synthesis of bioconjugate vaccines via recombinant production of n-glycosylated proteins in prokaryotic cell lysates immunodrugs: therapeutic vlpbased vaccines for chronic diseases immunogenicity of takedas bivalent viruslike particle (vlp) norovirus vaccine (nov) candidate in children from months up to years of age cell-free protein synthesis of norovirus virus-like particles rapid production and characterization of antimicrobial colicins using escherichia coli-based cell-free protein synthesis application of antimicrobial peptides of the innate immune system in combination with conventional antibiotics-a novel way to combat antibiotic resistance? front cell infect microbiol combining sense and nonsense codon reassignment for site-selective protein modification with unnatural amino acids robust production of recombinant phosphoproteins using cell-free protein synthesis biosynthesis of membrane dependent proteins in insect cell lysates: identification of limiting parameters for folding and processing cell-free synthesis of membrane proteins: tailored cell models out of microsomes methods for activating natural energy metabolism for improving yeast cellfree protein synthesis: / cell-free gene expression: an expanded repertoire of applications an economical method for cell-free protein synthesis using glucose and nucleoside monophosphates cell-free biosensors for biomedical applications key: cord- -e vvi qo authors: pandit, sb; balaji, s.; srinivasan, n. title: structural and functional characterization of gene products encoded in the human genome by homology detection date: - - journal: iubmb life doi: . / sha: doc_id: cord_uid: e vvi qo availability of the human genome data has enabled the exploration of a huge amount of biological information encoded in it. there are extensive ongoing experimental efforts to understand the biological functions of the gene products encoded in the human genome. however, computational analysis can aid immensely in the interpretation of biological function by associating known functional/structural domains to the human proteins. in this article we have discussed the implications of such associations. the association of structural domains to human proteins could help in prioritizing the targets for structure determination in the structural genomics initiatives. the protein kinase family is one of the most frequently occurring protein domain families in the human proteome while p‐loop hydrolase, which comprises many gtpases and atpases, is a highly represented superfamily. using the superfamily relationships between families of unknown and known structures we could increase structural information content of the human genome by about %. we could also make new associations of domain families to human proteins that are potentially linked to genetically inherited diseases. iubmb life, : ‐ , an important event in the history of humankind has been made at the turn of the st century in the form of the draft version of human genome data ( , ) . this monumental effort provides us with an opportunity to better understand the various biological processes and possible cognition. the functional characterization of the gene products encoded in the human genome could provide insights into such complex biological process. there are various experimental endeavors to understand the functions of gene products encoded in the human genome. in addition to these experimental efforts, computational analyses of human proteins could form an important step in the functional inference of the genome data. the computational approaches for the prediction of functional features of proteins encoded in genomes relies on establishing relationships to homologues that are experimentally studied. there have been several attempts, using various sophisticated homology search tools, to assign functions to gene products encoded in various proteomes (see for example references - ) . such functional predictions could be used as a guiding tool in order to direct the relatively time consuming, more difficult and expensive experimental methods for exploring protein functions. furthermore, functional inferences of human proteins that are implicated in diseases could provide valuable insights on the molecular basis of human diseases. such an understanding could aid identification of effective drug targets and rational design of lead compounds to combat the diseases. the most commonly used computational approach for genome-wide association of functions to proteins is by identification of well-characterized homologues using sequence-based search procedures such as blast ( ) and fasta ( ) . but, pairwise sequence alignment based search procedures are unlikely to be able to identify related proteins with low sequence similarity. however, these distantly related proteins could often be identified with the use of threedimensional ( -d) structural information ( ) as the structure is conserved better than sequence during evolution ( , ) thus, use of structural information could potentially enhance the functional assignments ( - ) . moreover, structure prediction with relevant biochemical motifs can provide more detailed functional insights than sequence comparisons alone ( - ) . the search methods for such an analysis could be improved by the use of multiple sequence alignment of the homologues in a family, which can indicate structurally/ functionally important positions. the information in these multiple sequence alignments can be converted into position specific scoring matrices (pssm) usually referred as profiles ( ) or into a probabilistic model called the hidden markov model (hmm) ( ) . the use of profile-based search methods is known to improve sensitivity of detection of remotely related homologues ( - ) . hence, use of structure and profile-based method would enable detection of remote homologues and thus enrich the functional assignments. some of the commonly used profile-based search methods include psi-blast ( ) , impala ( ) , rps-blast and hidden markov model (hmmer )-based ( ) procedures. these methods have been shown to detect remote and subtle similarities ( , ) between proteins that were previously possible only by structure comparison procedures, which obviously demand the knowledge of -d structures. although the profile-based annotation methods are among the widely used procedures to detect remote similarities, there are other procedures such as genthreader ( ) and environment-based profiles ( ) , which are fold recognition methods for assigning the structural domains to amino acid sequences. furthermore, the comparison of proteins across various genomes could also aid in enhancement of the assignment of functions to the proteins ( - ) . the comparative genomics methods are based on the functional characterization of the gene products by detecting the orthologous proteins in the closely related organisms where the experimental functions of the proteins have been proposed. however, the effectiveness of this approach is dependent on at least two factors: ( ) ability to identify homologues of a given protein in other organisms. ( ) the extent of divergence of amino acid sequences of the homologues across the organisms and its implication on the similarity of functions between the two proteins. in order to understand the biological function of the human proteins we have associated functional or structural domains using sensitive profile-matching procedures. association of functional domain would provide clues to biochemical role of the protein. the structural domain association could provide enhanced abilities to assign function and also provide the molecular basis of action of proteins. furthermore, we have enhanced the structural information content of human genome by relating families apparently with unknown structures to known structural families, as in the supfam database, which was developed by us earlier ( , ) . the functional domain information was considered from pfam database (http://www.sanger.ac.uk/software/pfam) ( ) and structural domain from pali (http://pauling.mbu.iisc. ernet.in/*pali) ( , ) databases. pali contains structurebased sequence alignment and phylogeny of proteins in the families derived from the scop database ( ) , which is a hierarchical structural classification database. the profiles for pfam and pali families have been generated as described by pandit et al ( ) . these profiles were searched using rps-blast. we have also used hmm-based search procedure against the hmm libraries of pfam to associate functional domain to human proteins. profiles of transmembrane sequences have been associated with human proteins in order to predict membrane localisation of the proteins. using sensitive profile-matching procedures, we could make a comprehensive compilation of functional/structural domains to gene protein encoded in human genome. similar attempts have been made in the past by muller et al. ( ) . they have used psi-blast for much of their analysis, apart from impala, to assign structural/functional domains. in their approach psi-blast has been scanned against the nonredundant database of protein sequences augmented with scop domain sequences. in the present review we will discuss the current status of large-scale function association, using various computational procedures, of various gene products encoded in the human genome. furthermore, human proteins potentially involved in diseases have been specifically analyzed by associating functional/structural domain to these protein sequences. the amino acid sequences of the open reading frames (orfs) that correspond to the gene products encoded in human genome have been obtained from the ensembl database ( ) (release . d. , http://www.ensembl.org). the total number of gene products predicted in this release is , . the omim database ( ) is a comprehensive collection of genes and genetic disorders in humans. in our analysis the protein sequences corresponding to the entries in the omim database have been derived from the swissprot database ( ) . it is also possible to obtain details of genes involved in genetic disorders through the genelink table provided at ensembl database ( ) . the genelink table indicates the association of human proteins to omim identifiers. we were able to associate unique protein sequences, which have one or more reference, to the omim database. the number of omim entries referenced in swissprot is , as in march release of the swissprot database. a possible reason for the difference in the numbers of genes could be that more than one genetic disorder entry in omim database is associated with a given protein. in the data set used by muller et al. ( ) there were proteins linked to omim database entries. we have used profile-matching method rps-blast that matches a sequence to sequence-profile obtained from structural (pali -release . ) and functional domain (pfam -version . ) families. we used stringent e-value cut off of in our search methods to ensure reliability of the domain association. this e-value cut-off has been extrapolated from the one reported by schaffer et al. ( ) ( ) as well as based on the benchmarking (n. s. mhatre, b. anand and n. srinivasan, unpublished results) using the database of structure-based sequence alignments of similarly folded proteins. we have used hmmer based procedure against pfam hmm profiles with an e-value cut off of to extract reliable domain association. subsequent to functional/ structural domain identification, all the sequences were subjected to tmhmm . ( ) in order to assign transmembrane helical regions. the functional and structural domain assignments for human proteome along with other organisms are made publicly available at: http://hodgkin.mbu.iisc.ernet.in/*human. the following sections discuss some of the interesting results derived by analysing this large dataset of human proteins. we could assign a total of , functional/structural domains to , ( %) human proteins out of , proteins encoded in the human genome. we also surveyed for transmembrane regions in human proteins, since this would suggest putative localization of these proteins to the membrane hence, could aid in function prediction. using tmhmm ( ) we could identify transmembrane regions in gene products. of these, are found to be present in combination with extracellular or intracellular functional/ structural domains. the total number of residues covered in structural/ functional domain and transmembrane region assignments are about % of the proteome. these functional/structural domain assignments would indicate probable biochemical functions for the assigned proteins, which could be useful for biological function prediction. a total of human proteins with no domain assignment, hence with no function or cellular localization information, could form an interesting set for experimental exploration for their properties and biological roles. the association of gene products with structure can give valuable insights, since structural information provides molecular details of the function of a protein. the structural domain assignment will also help in prioritizing the target for structural genomics consortium by indicating gene products with no structural predictions. with a view to enhance structural information present in human genome, we have used structural information as in pali profiles that is generated using structure-dependent sequence alignments of a large number of protein domain families, since the incorporation of -d structural information could aid in effective detection of remotely related proteins. using pali profiles alone, we could associate additional structural domains to human proteins that are remotely related. furthermore, we tried relating families with unknown structures to known structural families as in supfam database, which was developed by us earlier ( , ) in order to enhance information on the structural content of human proteome. the supfam database relates two or more homologous protein families, of either known or unknown structure, using profiles derived from structure-based sequence alignments. integrating the relationships derived in supfam we could provide structural information for an additional * % of domain families (fig. ). these family assignments would increase known structural content in the genome. a total of pfam families are assigned in human genome, of which pfam families, have structural information documented in pfam. out of pfam families, apparently with no structural information, families could be related to a family of known structure in supfam. there are now families (* %) with structural information known directly or indirectly through relationships present in sup-fam (fig. ). these unique families with structural information are present in , domains, hence would provide further insights into their functions. a total number of , functional domains could be assigned to the human proteins. these assigned functional domains belong to sequence/functional families of the pfam database ( ) . we have surveyed for the most commonly occurring pfam families in human genome. fig. shows the most frequently occurring functional domain families in the human genome. the most frequently occurring family is the zf-c h family, which is a classical zinc-finger domain with very short length (typically residues). identification of such a family with short motifs using bioinformatics tools could be unreliable. hence, we did not consider them in our analysis. the other most frequently occurring globular protein family is protein kinase. it was previously shown that protein kinases occur with typical and atypical combination of domain families in the gene products encoded in human genome. these kinase domain-containing proteins are involved in a wide variety of biological roles ( ) . among the other most frequently occurring pfam families, the majority are involved in or in part responsible for protein-protein interactions (immunoglobulin, ankyrin repeat, tpr domains), cell attachment adhesion function (fibronectin, collagen, cadherin domains), signalling function (ph, sh , c domains), nucleic acid binding function (zf-c h , homoeobox, rrm domains). a considerable number of human proteins are characterized by short lengths, although they match significantly with protein domain families which are typically much longer. there are functional families, associated in human proteins, apparently with no structural information but could be associated to distantly related families of known structures using relationship described in supfam. these proteins have number of domains. most of these families correspond to enzymes. from the structural genomics perspective, structure association for pfam families meant that clues about structure and function could be extended for proteins. some of the pfam protein families are known to be characteristic of prokaryotic organisms or viruses only. however members of some of these families from human genome could be identified from the current analysis and these families are referred to as atypical families. we could associate bacterial specific families and viral specific families to and human proteins respectively. the list of bacterial and viral specific families, identified in human, along with associated gene products in human genome are listed in table a and b respectively the complete list of proteins with the region of pfam domain assignment is made available at http:// hodgkin.mbu.iisc.ernet.in/*human. the assigned domain family includes for example cobalamin biosynthesis protein, minor capsid protein, bacteriophage lambda head decoration protein. such functions have not been shown before to be present in humans. there are two possible explanations that could be drawn in the context of occurrence of the bacterial and viral specific families in human genome. first explanation is that the superfamily relationship exists between the assigned bacterial or viral domain families and the corresponding eurkaryotic domain families as the sequence similarity of these domains with human proteins is low, while significant. these regions in the human proteins could have diverged significantly and sequence data corresponding to these families in other eukaryotes is currently lacking. an alternative possibility is horizontal gene transfer of these bacterial/viral specific families to humans. we surveyed the human genome for the occurrence of specific pfam families, which are known to be present only in eukaryotes. such eukaryote specific families are known to be involved in specific functions in eukaryotes. out of eukaryote specific pfam families, we could not associate eukaryotic specific families to the human proteome. further, we assessed reasons for the absence of these eukaryotic specific families in human genome. most of these families are organism or lineage specific. some of them have no known functions and other, as mentioned below, are involved in functions not required or present in humans, hence not identified in the human proteome. the probable reasons for absence of eukaryotic specific pfam families in human genome are: ( ) class of toxin families (which also includes snake and scorpion toxins) ( ) families that are unique to the plant kingdom, like seed storage class of proteins, potato inhibitor and plant disease resistance response protein. the zf-c h has been excluded from this histogram due to low reliability in assignments of short domains. protein srb, c. elegans sre g protein-coupled chemoreceptor and c. elegans srg family integral membrane protein. this analysis showed that human proteome has eukaryotic specific families ( ) which are involved in eukaryotes-like functions. however, absence of some of the eukaryotic specific families could be explained from the observations that such biochemical functions are undesirable for human or they are highly specific to lower eukaryotes. the pfam families, without known -d structure, could be clustered into sequence superfamilies and such superfamily relationships are documented in the supfam database. in the current release of supfam, of the pfam families, with no structural information, could be clustered into new potential superfamilies. it is expected that members of all the families in each new potential superfamily would share the same fold and might have gross similarity in their functional properties. these relationships could help in prioritizing the target for structural genomics, since the -d structural determination of one of the representative member in each superfamily would result in structures that can serve as framework models. using these sequence superfamilies information we could identify of the sequence superfamilies in the human genome. the list of these new potential superfamilies with their constituent families identified in human genome is listed in table . the sequence superfamilies identified in human genome consist of pfam families, with no known -d structure for any of their members. there are domains belonging to these new potential superfamilies that could be assigned to the unique gene products in human genome. hence, an experimental structure for domains or proteins one each from these superfamilies could provide templates for interpreting the functions of other members in the superfamily. this results in substantial reduction (from to ) in the number of d-structures to be determined experimentally in order to get clues about their functions experimentally. these superfamilies may be considered as priority targets for structural genomics initiatives in order to improve the coverage of structural information for the human proteins. the nature of the superfamily relationships for some of the new potential sequence superfamilies that are identified in human genome is discussed further. this superfamily consists of three families namely patched domain, acr_tran and secd_secf domain families. the acr_tran family is an integral membrane protein family whose members are known to be involved in drug resistance in bacteria ( ) . the other family in this superfamily, patched domain, is a receptor for the morphogene sonic hedgehog and transduces hedgehog signals ( ) . this secd and secf family consists of various prokaryotic secd and secf protein export membrane proteins ( ) . we could identify human proteins with patched family domain assigned. the functional and structural elucidation of other two families could be could be extended to patched domain because of superfamily relationships. this superfamily has four families cluster together viz. sugar_tr, oatp_c, duf and duf . the sugar_tr family is single-polypeptide capable only of transporting small solutes, such as sugar, in response to chemiosmotic ion gradients and lies in uniporter-symporter-antiporter family ( ) . oatp_c is eukaryotic organic-anion-transporting polypeptides that vary in tissue distribution and substrate specificity ( ) . the functions of dufs (domains of unknown function) are not known. we could associate sugar_tr and oatp_c domains to and gene products respectively. this superfamily constitutes methyltransf_ and trna_u -meth_tr pfam families. both families have methyltransferase activity, however, the trna_u -meth_tr family is involved in methylation of t-rnas ( ) . we could identify and human homologues of methyltransf_ and trna_u -meth_tr respectively. the gde_c and duf pfam families could be clustered in this superfamily. the gde_c family is glycogen branching enzyme and has glucosidase activity ( ) . we could identify gde_c and duf in and human proteins respectively. from, this relationship it could be suggested that duf might have glucosidase-like activity. the -d information provides precise molecular details about the function of the protein. the association of gene products encoded in human genome to -d structures would assist in providing further insights into their function. the databases of protein structures in which domains with similar -d architecture are grouped together could be used for such structural analysis. we have used pali database derived from scop for the present analysis. scop classifies protein domain having high sequence and structural similarity into families. the families are grouped in superfamilies when they share similar functional features and have an evolutionary common ancestor. superfamilies are grouped in fold when major secondary structures are topologically equivalent with similar topological connectivity. the assignment of structural domains to the proteins would aid in the investigation of the preponderance of superfamilies and fold in the human genome. using the various search procedures we could associate , structural domains to , human proteins either directly or by using the sequence superfamily relationships as described in supfam. further, we classified these domain assignments at the level of fold or superfamilies to understand the most commonly used function present in human genome. we analyzed the most commonly occurring superfamilies in human proteome. the figure shows the top few superfamilies along with their extent of representation in the human genome. this distribution of superfamilies is similar to the one obtained by muller et al. ( ) . the most commonly occurring superfamily is c h zinc finger, followed by immunoglobulin domain. because of the short length of c h zinc-finger domain and associated low complexity region, there is bias in identification of these domains. hence, all the gene products having this domain might not have zinc-finger like function and we excluded them from our present analysis. p-loop containing nucleotide triphosphate hydrolases domain is the next most represented superfamily and it is involved in many different critical biological functions such as cell growth, differentiation, repair and modification of dna, transcription, etc. this superfamily comprises various atpases and gtpases that are essential for cell survival. for example gtpases include elongation factors, ga subunit of the heterotrimeric g-proteins that are absolutely critical in major cellular processes. the other superfamilies among frequently occurring superfamily are involved in various functions in the cell as cellular signalling (protein kinase-like, ph domainlike), cell adhesion (cadherin, fibronectin type iii), nucleic acid binding function (rna-binding domain). interestingly, 'family a-g protein-coupled receptor-like' superfamily that consists of many receptors as other most populous superfamilies. the complete list of structural superfamilies that occur in human genome with their respective frequency of occurrence in human genome is provided at http://hodgkin.mbu.iisc.ernet.in/*human. figure shows population distribution of few most populated folds, which occur in the human proteome. figure shows the -d folding patterns in the most populated folds. the c h and c hc zinc finger is the most frequent occurring fold in human genome. for the reasons mentioned before, we have excluded c h and c hc zinc finger from this analysis. ferredoxin-like fold has the highest number of superfamilies in the human proteome as well as in scop. however, of the superfamilies occur in the human proteome out of the currently known superfamilies in the ferredoxin fold. ribonuclease h-like motif fold has six out of currently known seven superfamilies in the human proteome. except the superfamily of hypothetical protein mth from methanobacterium, all other superfamilies of ribonuclease h-like motif occur in the human proteome. these superfamilies are actin-like atpase domain, creatinase/ prolidase n-terminal domain, ribonuclease h-like, translational machinery components, dna repair protein muts domain ii and methylated dna-protein cysteine methyltransferase domain. this could be expected as the nucleic acid binding/related superfamilies are highly represented in the human proteome. the complete list of protein structure folds that occur in human genome with their respective frequency of occurrence in human genome is provided at http://hodgkin. mbu.iisc.ernet.in/*human. the sequence to profile matching procedure described in the methods section resulted in the association of at least one functional domain family in pfam database to the proteins of swissprot database linked to omim entries ( . % of the total of proteins in the omim database). the remaining disease-related proteins could not be associated to any functional or structural domain family. hence these proteins could be high priority targets in structural genomics to provide further insights into the molecular basis of the function of these proteins. these proteins contain functional and structural domains from pfam families. it is important to note that domains out of domains could be linked to pfam families with known structural information. in terms of the amino acids coverage in these domains about % of the amino acids in the proteins are in the functional or structural domain ( ) assigned regions in these proteins. figure shows the distribution of the domains in the top most populous families in the proteins, these families contain domains which is about % of all assigned domains. protein kinase is the most frequently occurring domain family in the human disease proteins. among the top most populous families have known structural information. the most populous structural superfamily that is assigned to the proteins is p-loop containing nucleotide triphosphate hydrolases and this has domains in it. the largest representations in the p-loop superfamily come from the domain families like ras, helicase_c, and dead. the list of highly populated superfamilies has much in common with the analogous list generated by muller et al. ( ) . much of these highly represented superfamilies are associated with regulatory roles in development, differentiation and proliferation. further analysis revealed that there are proteins that have been assigned additional functional domains apart from previously assigned functional domains. these proteins are listed in table . these newly assigned domains may play a significant role in furthering our understanding of overall functions of these proteins. using various methods of domain association we could associate at least one domain to about % of gene products in the human genome. interestingly, the assignments of remote homologues to the human proteins revealed the occurrence of some of the viral and bacterial specific proteins in the human genome. among most commonly occurring functional family, protein kinases is one of the most frequently occurring domains, and the p-loop containing nucleotide triphosphate hydrolases is the one of the most represented superfamily. the assignment of domains to families with apparently no structural information to structural families would aid in the prioritization of targets for structural genomics of human genome. the assignment of new domains in addition to previously identified domains to the proteins possibly linked to genetically inherited human diseases could form a basis for the experimental verification of the roles of these domains as well as the molecular basis of disease. initial sequencing and analysis of the human genome the sequence of the human genome predicting function: from genes to genomes and back structural assignments to the mycoplasma genitalium proteins show extesive gene duplication and domain rearrangements the cath extended protein-family database providing structural annotations for genome sequences trends in protein evolution inferred from sequence and structure analysis studying genomes through the aeons: protein families, pseudogenes and proteome evolution predicting protein function by genomic context: quantitative evaluation and qualitative inferences basic local alignment search tool improved tools for biological sequence comparison distant homology recognition using structural classification of proteins the relation between the divergence of sequence and structure in proteins protein evolution. how far can sequences diverge? how representative are the known structures of the proteins in a complete genome? a comprehensive structural census homology-based fold predictions for mycoplasma genitalium proteins the relationship between protein structure and function a comprehensive survey with application to the yeast genome enhanced genome annotation using structural profiles in the program d-pssm predicting structures for genome proteins from protein structure to function genomic-scale comparison of sequence-and structure-based methods of function prediction: does structure provide additional insight? profile analysis: detection of distantly related proteins hidden markov models in computational biology. applications to protein modeling fold and function predictions for mycoplasma genitalium proteins applying motif and profile searches gapped blast and psi-blast: a new generation of protein database search programs supfam-a database of potential protein superfamily relationships derived by comparing sequence-based and structure-based families: implications for structural genomics and function annotation in genomes supfam: a database of sequence superfamilies of protein domains enhanced functional and structural domain assignments using remote similarity detection procedures for proteins encoded in the genome of mycobacterium tuberculosis impala: matching a protein sequence against a collection of psi-blast-constructed position-specific score matrices profile hidden markov models benchmarking psi-blast in genome annotation identification of related proteins on family, superfamily and fold level genthreader: an efficient and reliable protein fold recognition method for genomic sequences a method to identify protein sequences that fold into a known three-dimensional structure comparative genome and proteome analysis of anopheles gambiae and drosophila melanogaster the identification of functional modules from the genomic association of genes genecensus: genome comparisons in terms of metabolic pathway activity and protein family sharing structural characterization of the human proteome protein fold recognition using sequence profiles and its application in structural genomics functional and structural genomics using pedant genome sequences and great expectations the pfam protein families database pali-a database of phylogeny and alignment of homologous protein structures integration of related sequences with protein three-dimensional structural families in an updated version of pali database scop: a structural classification of proteins database for the investigation of sequences and structures the ensembl genome database project omim passes the -disease-gene mark the swiss-prot protein sequence database and its supplement trembl in a hidden markov model for predicting transmembrane helices in protein sequences the repertoire of protein kinases encoded in the draft version of the human genome: atypical variations and uncommon domain combinations acrab efflux pump plays a major role in the antibiotic resistance phenotype of escherichia coli multiple-antibiotic-resistance (mar) mutants the drosophila patched gene encodes a putative membrane protein required for segmental patterning secd and secf are required for the proton electrochemical gradient stimulation of preprotein translocation major facilitator superfamily. microbiol molecular identification and characterization of novel members of the human organic anion transporter (oatp) family dual function of the trna (m( )u ) methyltransferase in trna maturation identification of the catalytic residues of bifunctional glycogen debranching enzyme the human serum paraoxonase/arylesterase gene (pon ) is one member of a multigene family setor: hardware lighted three-dimensional solid model representations of macromolecules this research is supported by the award of senior fellowship to n.s. by the wellcome trust, london as well as by the computational genomics initiative supported by the department of biotechnology, new delhi. s.b. and s.b.p. are supported by the wellcome trust, london and csir, new delhi respectively. key: cord- -lbjesv y authors: durmuş tekir, saliha d.; Ülgen, kutlu Ö. title: systems biology of pathogen‐host interaction: networks of protein‐protein interaction within pathogens and pathogen‐human interactions in the post‐genomic era date: - - journal: biotechnol j doi: . /biot. sha: doc_id: cord_uid: lbjesv y infectious diseases comprise some of the leading causes of death and disability worldwide. interactions between pathogen and host proteins underlie the process of infection. improved understanding of pathogen‐host molecular interactions will increase our knowledge of the mechanisms involved in infection, and allow novel therapeutic solutions to be devised. complete genome sequences for a number of pathogenic microorganisms, as well as the human host, has led to the revelation of their protein‐protein interaction (ppi) networks. in this post‐genomic era, pathogen‐host interactions (phis) operating during infection can also be mapped. detailed systematic analyses of ppi and phi data together are required for a complete understanding of pathogenesis of infections. here we review the striking results recently obtained during the construction and investigation of these networks. emphasis is placed on studies producing large‐scale interaction data by high‐throughput experimental techniques. despite immense technological advances in medicine, pathogenic organisms remain the source of much human morbidity and mortality. hiv/aids, acute lower respiratory tract infections, hemorrhagic fever, diarrheal diseases, tuberculosis and malaria are particularly notorious for high mortality rates [ ] [ ] [ ] . the continuous emergence of new diseases and drug-resistant pathogens has heightened the global burden of infectious diseases in the st century [ , ] . to tackle such biological threats, an improved understanding of pathogenic microorganisms and their interactions with host organisms is needed since pathogen-host molecular interactions have crucial roles in initiating, sustaining, or preventing infection. pathogenic microorganisms communicate with human cells through interactions with human proteins both on the surface of the cell and within the interior of the cell. these interactions allow the microorganisms to enter the host cell and manipulate cellular mechanisms in order to use the host cell's capabilities to their own advantage, resulting in infection in the host organism. detailed knowledge of pathogen-host protein interactions may enable us to comprehend the mechanisms of infection and to identify better strategies to prevent or cure infection [ , ] . however, the identification of new drug and vaccine targets for infectious diseases is only possible when the molecular machinery within individual pathogenic and host organisms is understood. for instance, anti-infection therapeutics should target essential genes in the pathogens which have no homology with human genes [ ] . the very first genome sequencing was published in with the dna sequence for the genome of a virus, bacteriophage phix [ ] . following the sequencing of the bacterial pathogen haemophilus influenzae in [ ] and the human genome in [ ] , sequence data for prokaryotic and eukaryotic genomes have appeared at an accelerated rate. today, genomic data for most of the pathogen and host organisms are available [ ] . these data are used to study individual genes and corresponding proteins as well as to identify intra-and interspecies connections between proteins. in the light of these advances, the initial steps towards complete understanding of infection mechanisms through protein interactions have been recently published. in this review, the efforts to systematic determination and analysis of protein interaction networks underlying infection pathogenesis are summarized (mainly in a chronological order) to present the current picture of the research on infectious diseases. from a classical perspective, a protein is a functional unit that specifies a small, but discrete, part of the cellular physiology of an organism. in the post-genomic era, a protein is seen to function as an element within network of its interaction, and its role should be evaluated within this network together with its interacting partners [ ] . advances in genomics and proteomics have been followed by the first large-scale efforts to identify functional networks of interacting proteins using the two-hybrid method [ ] [ ] [ ] , pull-down assays [ , ] , and protein chips [ ] . to increase our understanding of the mechanisms of infection, protein-protein interaction (ppi) networks of pathogenic organisms should be determined in order to capture their functional and structural organizations. pathogenic ppi maps reveal biological pathways and processes, allowing prediction of protein functions and discovery of new drug and vaccine targets. the first genome-wide protein interaction networks were determined for viruses [ ] [ ] [ ] . the first large-scale bacterial networks [ ] [ ] [ ] followed successes in eukaryote mapping [ , [ ] [ ] [ ] . today, the genome-wide ppi maps for a number of pathogens and hosts are available in public databases: bind [ ] , biogrid [ ] , dip [ ] , hprd [ ] , intact [ ] , mint [ ] , mips [ ] , reactome [ ] and string [ ] . primarily due to their small genome size, whole genome ppi maps were first constructed for viruses. the first interaction map of whole proteome was determined for escherichia coli bacteriophage t , mapping interactions among viral proteins [ ] . subsequently, genomewide analyses of important human pathogens, hepatitis c virus [ , ] , vaccinia virus [ ] , herpesviruses [ , ] , and sars coronavirus [ , ] were performed through intraviral ppi maps. hepatitis c virus (hcv), a flaviviridae family member causing severe liver disease, is a positive-sense singlestranded rna virus. it encodes only a single polyprotein which is co-or post-translationally processed into at least viral proteins [ ] . a controlled two-hybrid strategy based on a random genomic hcv library screen was used by flajolet et al. [ ] , resulting in the identification of known and novel ppis. interactions among structural and non-structural proteins were revealed in the study, leading to the conclusion that almost all of the viral proteins encoded by the genome function in the hcv life-cycle, as in the cases of other members of the flaviviridae [ ] . the roles of these functional interactions were discussed within the framework of the constructed genome-wide interaction map. interacting domains of the viral polyprotein were also identified to shed light on the development of anti-viral agents [ ] . another genome-wide ppi map of hcv was then generated for the viral non-structural proteins [ ] . vaccinia virus, well-known as a smallpox vaccine and also the source of potential recombinant vaccines against cancer and infectious diseases, is a member of poxviridae family. it is a large, double-stranded dna virus. poxviruses replicate themselves in the cytoplasm of the host cells without depending on the host's transcriptional machinery. the large genome of vaccinia virus can potentially express more than proteins [ , ] . mccraith et al. [ ] performed a comprehensive two-hybrid analysis of full-length vaccinia virus proteins and detected ppis (including novel interactions) among both characterized and uncharacterized proteins. many of the ppis mapped involved one partner which was known to function in a specific process, coupled with another of unknown function, allowing functions to be assigned to previously unannotated proteins within dna replication, transcription, virion structure, or host evasion. another double-stranded dna virus family is herpesviridae whose members encode - proteins. herpesviruses cause human diseases such as kaposi sarcoma, b-cell lymphomas, chickenpox, shingles, and nasopharyngeal carcinoma [ ] [ ] [ ] . the genome-wide intraviral protein interaction maps for three members of this family, kaposi sarcoma-associated herpesvirus (kshv), varicella-zoster virus (vzv), and epstein-barr virus (ebv) were generated by two-hybrid and analyzed comprehensively to reveal viral network properties [ , ] . in the work of uetz et al. [ ] , ppis for kshv and ppis for vzv were identified, the largest dataset published to date, allowing the construction of the first viral networks. topological network analyses of these interactome maps indicated that the viral networks appear as a single, highly coupled module ( fig. ) with relatively many hubs and few peripheral nodes [ ] in contrast to scale-free cellular networks with well-separated functional modules [ , ] . just after this study was published, calderwood et al. [ ] reported the detection of ppis among ebv proteins. the construction of a ppi map for ebv by merging these interactions with already published ones resulted in a network of proteins with interactions. this large-scale network allowed the prediction of functions of uncharacterized proteins, further defining viral mechanisms. in these consecutive studies [ , ] , core proteins common to all herpesviruses and noncore ones specific to each strain were investigated thoroughly. the severe acute respiratory syndrome coronavirus (sars-cov) is a positive-sense single-stranded rna virus belonging to the family of the largest rna viruses known, coronaviridae. its genome encodes open reading frames expressing up to structural and non-structural proteins that have roles in viral replication, assembly, and other functions for viral amplification in host cells [ , , ] . for a genome-wide analysis of ppis of sars-cov, interactions between all sars-cov proteins were determined [ , ] by two-hybrid producing and interactions, respectively. intraviral ppis were analyzed to elucidate the functions of the proteins as well as to identify the essential proteins in viral replication. von brunn et al. [ ] compared the intraviral network topology of sars-cov with a previously defined viral network [ ] and cellular networks [ ] [ ] [ ] , concluding sars-cov network contained similarities to the kshv network [ ] . insights gained into molecular mechanisms and topological network properties provided by the genome-wide analyses of intraviral ppi maps (table ) may be used as a basis for further characterization of the functions and mechanisms of viral proteins, especially for other members of the same virus families. having successfully built genome-wide ppi maps for viruses, similar two-hybrid methodology was applied to construct ppi networks for the larger, more complex genomes of pathogenic bacteria. the first prokaryotic ppi map was built for helicobacter pylori [ ] . other large-scale prokaryotic networks eventually emerged for campylobacter jejuni [ ] , treponema pallidum [ ] mycobacterium tuberculosis [ ] , and bacillus subtilis [ ] . genome-scale analysis of interacting proteins that assemble into protein complexes were performed for e. coli [ , ] and mycoplasma pneumoniae [ ] . the first large-scale intrabacterial ppi map was constructed for the human gastric pathogen, and gram-negative bacterium h. pylori, identifying interactions between . % of all bacterial proteins using the twohybrid method [ ] . the comparison of these h. pylori ppis with previously described interactions between orthologous e. coli proteins resulted in prediction of protein functions within biological pathways such as chemotaxis and urease activity, essential for h. pylori pathogenicity. in this study, the interacting domains of h. pylori proteins were also identified and used in protein function predictions. interacting domains may serve in mapping new functional domains, providing crucial information for antibacterial drug design studies. gram-negative bacterium e. coli, the main cause of urinary tract infections and a model bacterial system, is one of the best characterized and early studied organisms [ ] [ ] [ ] . however, any large-scale analysis of protein complexes in e. coli was not performed until the studies of butland et al. [ ] and arifuzzaman et al. [ ] . first, binary interactions involving essential and nonessential proteins, were identified by pull-down assay using tandem affinity purification-mass spectrometry, targeting orfs (about one-quarter of the e. coli genome) [ ] . a small number ( %) of these interactions were already available in dip, bind, and string. ten newly described e. coli ppis were found as orthologous to the interactions reported for h. pylori [ ] . the novel interactions were analyzed for functional annotations of uncharacterized proteins, allocating them within ribosome function, rna processing, rna binding, and so on. the graph theoretical analysis of the ppi map of e. coli revealed scale-free behavior and a high correlation between connectivity and the degree of conservation. the genome-wide ppi map of e. coli k- strain with interactions among proteins was then constructed by a similar method [ ] . the comprehensive analysis of this large-scale network also validated the scale-free nature and the connectivity-conservation correlation found previously [ ] . arifuzzaman et al. [ ] identified functional units which have roles in metabolic pathways, transcriptional and translational machinery, recombination and flagella assembly. analysis of ppis based on this functional unit categorization provided further functional annotations. the gram-negative, food-borne pathogen c. jejuni is the major cause of gastroenteritis. the proteome-level analysis revealed interactions involving % of c. jejuni proteins [ ] , the most comprehensive bacterial ppi map determined by two-hybrid. a scale-free network was obtained, removing low confidence-scored interactions. this ppi map of c. jejuni was used to identify evolutionarily conserved subnetworks through comparison with protein networks of h. pylori [ ] , e. coli [ ] and saccharomyces cerevisiae in dip. further analyses of the identified conserved sub-networks allowed the prediction of new c. jejuni interactions using orthologous interactions. this comparative analysis also enabled the identification of essential c jejuni genes based on their orthology to essential genes in other organisms. this comprehensive interactome data were next used to predict protein roles and to map functional pathways such as chemotaxis. the causative agent of syphilis, t. pallidum, has one of the smallest genomes known in extracellular bacteria, encoding proteins [ ] . the global ppi network of t. pallidum, involving interactions connecting bacterial proteins, was identified by two-hybrid [ ] . the high-confidence subset connects proteins by interactions. in that study, an integrated network of dna-metabolism related processes was constructed and proteins were functionally annotated within this network. additionally, various orthologous interactions were predicted for completely sequenced genomes, allowing the description of phylogenetically conserved interaction patterns. atypical pneumonia causing human pathogen, m. pneumoniae also has one of the smallest genomes in self-replicating organisms with protein-encoding genes, making it a good model organism to study proteome organization in prokaryotes [ ] . a proteome-wide analysis was performed by tandem affinity purificationmass spectrometry, identifying homo-multimeric and hetero-multimeric protein complexes [ ] . about a third of the found hetero-multimeric complexes were observed to interact with proteins forming larger, multiprotein complexes implying higher level of proteome organization and protein multifunctionality, allowing functional annotations of assemblies as well as prediction of biological roles of individual proteins within the complexes. m. tuberculosis causes millions of deaths each year with tuberculosis infection [ ] . after computational efforts to construct large-scale ppi maps of m. tuberculosis [ , ] , its genome-wide network was identified experimentally by two-hybrid [ ] . this global network is composed of interactions among proteins which represent . % of the whole proteome. the topological properties of the undirected network of these interactions were calculated and compared with those of the previously defined prokaryotic ppi networks [ - , , ] . similar scale-free behavior following a power-law distribution was observed. in fact, the networks obtained by pull-down assay [ , ] differ in values of clustering coefficient from the networks obtained by two-hybrid analysis [ , , ] . moreover, wang et al. [ ] performed a cross-species network comparison analysis of m. tuberculosis interactions with the available large-scale ppi data [ - , , ] and identified conserved sub-networks. additionally, the highly connected critical proteins and mechanisms of the protein secretion pathways which have roles in its pathogenesis were revealed. a large-scale ppi network was recently constructed for the gram-positive bacterium b. subtilis (which is rarely pathogenic) by two-hybrid [ ] . this network of interactions involves bacterial proteins. due to its role as a model organism, many studies were performed to characterize the biological functions of its ppis in cellular processes [ ] [ ] [ ] . however, many processes remained uncharacterized. hence marchadier et al. [ ] performed a comprehensive analysis with the integration of transcriptomic data focusing on cell division, cell responses to stresses, the bacterial cytoskeleton, dna replication and chromosome maintenance. these sequential efforts on construction of large-scale ppi networks for prokaryotes (table ) constitute the first comprehensive description of the intraspecies mechanisms of the bacterial pathogens. the protozoan pathogen plasmodium falciparum causes malaria which results in deaths of nearly a million of people each year [ ] . a comprehensive protein interaction map of this pathogen was generated by two-hybrid, identifying a highly interconnected, scale-free network of interactions within proteins (~ % of the predicted p. falciparum proteins) [ ] . in this network, % of the interactions are between two uncharacterized proteins whereas % of the interactions include one such protein. bioinformatic analysis of this network yielded functional annotations of the proteins within the processes; chromatin modification, transcription, messenger rna stability, ubiquitination, and invasion of host cells. more detailed studies of ppis within p. falciparum are required in order to unravel its pathogenesis mechanisms thoroughly. despite the increasing rate in the identification of genome-wide ppi networks, they remain unconstructed for most pathogens. in the light of accelerating advances in genomics, proteomics, and interactomics, large-scale maps for many more organisms are expected to be built in the near future. increasing numbers of ppi networks will allow the comparison of networks across diverse organisms, resulting in generalized conclusions about pathogenic molecular mechanisms. the first examples of such comparative studies have been highlighted in the sections . and . above. integration of several highthroughput interaction datasets to generate more detailed networks is also possible, as indicated by recent examples for the e. coli system [ , ] . the frequency of such integrated networks is expected to increase, owing to the large number of diverse data sets. these will be invaluable in defining whole proteomic maps of the pathogens. one of the most striking results of bioinformatic analyses on the constructed ppi maps is the identification of essential proteins functioning within pathogens. these proteins should be examined thoroughly to test their potential as novel therapeutic targets. the exploration of genome-wide ppi maps of the pathogens permits the assignment of unannotated proteins to biological pathways with function prediction. the proteins annotated to the host invasion processes may provide a launching point for pathogen-host interaction studies. biochemical interactions of pathogens with their hosts are necessary to invade the host organism. these connections between pathogens and hosts include interactions between proteins, nucleotide sequences, and small ligands [ , ] . however, the protein interactions of pathogen-host systems have been identified as the most important, and therefore the most studied, type of pathogen-host interactions (phis) [ , ] . since these interspecies crosstalks determine the pathogenesis, focusing on the whole phi system, instead of investigating a pathogen or host individually, may allow us to capture critical mechanisms (i.e. strategies used by pathogens and host immune responses) during infection that cannot be provided by traditional methods. due to a lack of sufficient experimental phi data until recent years, many computational phi prediction methods have been developed [ ] [ ] [ ] [ ] [ ] [ ] [ ] . these studies focused mainly on interactions of p. falciparum and human immunodeficiency virus (hiv), as these are some of the most threatening pathogens to humans. very recently experiments have been carried out to determine the first large-scale molecular interactions between human and viruses [ , , ] and bacteria [ , ] . as a result of an increase in data available for pathogen-host systems, phispecific databases have been introduced such as phibase [ ] , virusmint [ ] , virhostnet [ ] , patric [ ] , and phisto [ ] . although these advances in data archiving are promising, most data relevant to phi are still buried in the biomedical literature. some rare efforts have been performed to obtain hidden phis from the literature by text mining [ ] [ ] [ ] . as in the case of intraspecies pathogen ppis, large-scale phi data were generated for viral systems before bacterial systems ( table ). the first examples are for commonly observed human pathogens, ebv [ ] , hcv [ ] and influenza a virus (h n and h n ) [ ] and then recently for hiv [ ] . in calderwood et al. [ ] , protein interactions between herpesvirus, ebv and human were mapped by twohybrid in conjunction with ebv intraviral ppi mapping, providing phis between ebv proteins and human proteins. a systematic analysis of these interactome maps of ppis and phis enabled hypotheses of the roles of ebv proteins in pathogenesis to be generated. furthermore, intraspecies protein interaction data for human were integrated from databases (bind, dip, hprd, mips) and from the literature [ , ] to analyze the organization of the human proteins targeted by ebv within human molecular machinery. it was found that ebv proteins tend to target human proteins which are highly connected (hubs) and central to many paths (bottlenecks) in the human ppi network. on the other hand, the degree distribution of the ebv-human protein interaction network could not be fitted to any model because of its incompleteness (fig. ) . attempts to analyze incomplete maps of ppis and phis are still able to supply a partial understanding of mechanisms underlying infection. a similar thorough analysis was earlier performed with herpesviral protein networks of kshv and vzv and their interaction with the human proteome [ ] . in that study, protein interactions between herpesviruses and human were predicted using the interacting orthologs of both proteins in other organisms [ ] . combined virus-human networks were constructed by starting with the viral networks, adding their human protein targets, and then adding the cellular interactions among the targeted human proteins. the topological analyses of the combined herpesviruses-human networks revealed distinct properties from both viral and human interactomes providing insights into the impact of the two organisms on each other [ ] . a proteome-wide phi map for the flavivirus hcv was mapped by two-hybrid and then by literature mining of previously found interactions between hcv and human [ ] . a map of interactions between hcv proteins and human proteins was generated ( phis by twohybrid). % of this phi network included novel interactions. the integrated human network of ppis among proteins [ ] was used to evaluate the interplay between hcv and human. very similar behavior to ebv [ ] was observed for hcv in terms of attacking hub and bottleneck proteins in the human network. to assess the human pathways targeted by hcv, kegg functional annotation pathways [ ] were used. four pathways were detected to be enriched in hcv-targeted human proteins. three of them were associated already with hcv clinical www.biotechnology-journal.com syndromes as insulin, tgf-β and jak/stat pathways. the last enriched pathway, focal adhesion, is a novel observation as a human pathway affected during hcv infection [ ] . influenza a is a member of negative-sense singlestranded viruses of orthomyxoviridae family. it is the sources of all flu pandemics infecting multiple species. for h n a/pr/ / strain of influenza virus, intraviral ppis among viral proteins and phis between viral and human proteins, most of which are expressed in primary human bronchial cells, were detected by twohybrid [ ] . some of the phis constructed had been published previously [ ] . the topology of the constructed intraviral network revealed a highly interconnected nature, as observed previously for other viral networks [ , ] . in the case of the influenza a-human interaction network, important properties about connectivity of proteins were observed. first, viral proteins interact with significant number of human proteins, reflecting the multifunctionality of the small number proteins encoded in rna viruses. second, each of human proteins connects with two or more viral proteins forming virus-human multiprotein complexes. additionally, it was observed that viral proteins generally target human proteins which are highly connected within their own network, as it was the case in herpesviruses-human system [ ] . in shapira et al. [ ] another phi network was identified for strain of influenza virus, h n a/udorn/ by the same experimental approach. this phi network consists of interactions between viral and human proteins, reflecting a similar nature to the network for h n strain-human system. this confirms the conserved functions of influenza virus proteins through strains. besides direct physical interactions between viral and human proteins, host responses in bronchial cells to influenza infection was identified by expression profiling, generating a regulatory map of interactions between influenza proteins and their human targets. comprehensive analysis of the physical and regulatory maps of the phi system elucidated human mechanisms involved in infection. for example, nf-κb, mitogen-activated protein kinase, apoptosis, and wnt signaling pathways are regulated through transcriptional and/or physical interactions during influenza a infection. one of the most dangerous human pathogens, hiv, belongs to positive-sense single-stranded rna virus family retroviridae. acquired immunodeficiency syndromecausing hiv has been extensively studied since its first observation near the end of the th century [ ] [ ] [ ] [ ] . similar to other rna viruses, hiv has a small genome and depends largely on human cellular machinery to be replicated. identifying the physical contacts between hiv and human proteins during hiv replication is critically important for a full understanding of hiv infection. being one of the most studied pathogens, there are many phi data for hiv- in virusmint and phisto. the current phi data have been produced mainly by small-scale experiments [ ] [ ] [ ] . very recently, a global phi network was generated for hiv-human protein complexes by affinity tagging and purification mass spectrometry, producing phis between hiv- proteins and human proteins [ ] . it was observed that hiv-targeted human proteins are highly conserved across primates. the novel interactions identified in that study requires further work to detail their biological significance in terms of hiv infection. besides whole proteins, domains of the interacting proteins were investigated and the enriched domain types in targeted human proteins were indicated for facilitating future structural modeling studies regarding hivhuman system. the first large-scale interaction networks between viruses and humans [ , , , ] provide crucial clues about the viral infections, verifying the critical importance of phi analyses in infection researches. until very recent years, the phi data were scarce for bacterial systems because of lack of any large-scale experiments. the first extensive bacterial phi networks were identified for important human pathogens, bacillus anthracis, francisella tularensis, and yersinia pestis [ ] , then another high-throughput experimental study generating phi data of y. pestis was reported [ ] . gram-positive bacteria b. anthracis and y. pestis and gram-negative bacterium f. tularensis are respiratory pathogens causing anthrax, bubonic plague, and acute pneumonic disease, respectively. using a two-hybrid assay, large-scale interaction data were generated between these bacteria and human producing phis between b. anthracis proteins and human proteins, phis between y. pestis proteins and human proteins, and phis between f. tularensis proteins and human proteins [ ] . the first conclusion of computational analyses of these comprehensive bacteria-human networks, in combination with the integrated human ppi network from databases bind, dip, hprd, intact, mint, mips, and reactome, was that bacterial proteins tend to target hubs and bottlenecks in the human network. secondly, the roles of human proteins targeted by these bacteria were investigated using their gene ontology annotations [ ] . the tendency of all three pathogens to target human proteins involved in immune responses was observed as previously reported [ ] [ ] [ ] . besides being effectors of immune signaling, the bacteria-targeted human proteins also have crucial roles in apoptosis [ ] . thirdly, the conserved protein interaction modules of the three phi networks were computed [ , ] for a more systematic comparative analysis. conserved modules revealed common attacks by the bacterial pathogens to same human pathways. subsequently, another phi map was generated for plague causing y. pestis by a different two-hybrid strategy by choosing only potential virulence factors as bait pro-teins [ ] . phis were yielded between y. pestis proteins and human proteins and then previously reported phis were integrated to construct a comprehensive network between y. pestis and human. a graph theoretic analysis confirms that y. pestis preferentially targets hub and bottleneck proteins in the human intranetwork as concluded previously for viruses [ , ] and bacteria [ ] . signaling pathways, crucial for human immune system, were found to be enriched in human proteins targeted by y. pestis. these pathways include mitogen-activated protein kinase signaling and toll-like receptor signaling and also pathways functioning in focal adhesion, regulation of cytoskeleton, and leukocyte transendoepithelial migration. finally, y. pestis-targeted human proteins were compared with those targeted by viruses whose phi networks were identified previously. of y. pestis-targeted human proteins are included in phi networks of ebv [ ] and hcv [ ] indicating the common infection strategies of both viruses and bacteria. the recent detected first large-scale phi networks of bacteria-human systems [ , ] contribute largely to the understanding of bacterial infection mechanisms with immune evasion. as phi data available for various pathogens increase, a need to analyze comprehensive phi data for all pathogen types together arises in order to draw a generalized picture. although infection mechanisms of individual pathogens have been studied through intraspecies pathogenic ppi maps and interspecies phi maps, a general overview of infection mechanisms was missing until analyses of phi data from different infection agents were attempted [ , ] . in the absence of large-scale phi networks for bacterial, protozoan and fungal systems, dyer et al. [ ] performed the first global analysis of protein interactions between pathogen strains of viruses, bacteria, protozoa, and human through properties of targeted human proteins. diversity of the available phi data was not rich, . % of phis belonged to the virushuman systems with . % of the interaction data drawn from hiv -human interaction systems. the importance of the pathogen-targeted proteins was evaluated within the intraspecies human ppi network of interactions. these phi and ppi data were integrated from public databases; mint, intact, dip, hprd, reactome, bind and mips. firstly, targeting hub and bottleneck proteins was concluded to be global behavior for all pathogens, as reported for individual pathogen strains previously [ , [ ] [ ] [ ] . gene ontology [ ] functions enriched in the targeted human proteins by different pathogens revealed common infection mechanisms. attack of human transcription factors and key proteins that control the cell cycle and regulate apoptosis and transport of genetic material across the nuclear membrane were found to be among the common viral strategies. despite its scarcity ( interactions in the datset), bacterial phi data allowed identification of specific human proteins that function in the host immune response (via toll-like receptors and i-κb kinase/nf-κb signaling cascade) as a target of bacterial infection strategy [ ] . recently we performed another study with comprehensive phi data to explore common and special infection strategies for viruses and bacteria [ ] . a significant amount of bacterial phis, constituting . % of all data, was avaiable thanks to dyer et al. [ ] . we analyzed interactions between proteins of viral, bacterial, protozoan and fungal pathogens (totally strains) and proteins of human obtained from phisto (www.phisto.org). to generate the intra species human protein network, ppis were integrated from biogrid, dip, intact, mint and reactome. the significant amount of bacterial and viral phi data allowed us to focus on comparisons between their specific infection mechanisms. firstly, attacking hub and bottleneck proteins in the human ppi network was verified as a common infection strategy of both bacteria and viruses. furthermore, viruses were observed to target human proteins of much higher connectivity and centrality values in comparison to bacteria. secondly, gene ontology enrichment analysis of the targeted human proteins verified the special mechanisms of bacteria and viruses use to manipulate of human immune defense mechanisms and cellular processes, respectively (as reported in dyer et al. [ ] but relying on lower amounts of phi data). a first attempt at the investigation of the human proteins targeted by both bacteria and viruses revealed that attacking human metabolic processes is a common strategy used by both pathogens during infections [ ] . global analysis of phi data provides insights into the strategies adapted by bacteria and viruses to subvert human cellular processes and immune system for the infection. however, large-scale phi networks for pathogens other than bacteria and viruses are still undetermined, leaving their pathogenesis mechanisms to be relatively uncharacterized. research on infectious diseases through phis has accelerated within the post-genomic era (fig. ) . however, large-scale phi networks have been infrequently studied. efforts to identify and analyze large-scale phis for diverse pathogen types would be expected to parallel the acceleration of biotechnology and bioinformatics research. increasing amounts of data available will allow more complete data sets to be compiled, resulting in characterization of topological properties of phi networks. the first attempt to fit the degree distribution of ebv-human interaction network failed due to scarcity of data [ ] . on the other hand, bioinformatic analyses of the pathogen-tar-geted human proteins succeeded in unraveling some infection strategies such as targeting human hubs and bottlenecks, subverting cellular processes for the usage of pathogens' own advantages and evasion of immune defenses [ , , , , , ] . the huge amount of data expected to be generated for phi systems will enable us to capture all details of infection processes. potentially leading to the development of new and more efficient therapeutics. conventional treatments for infectious diseases often aim to kill pathogens by targeting their essential proteins. this approach unfortunately forces the pathogens to evolve for survival and consequently selects resistant strains (especially in the case of rna viruses with a high mutation rate). to fight drug-resistant patho gens, novel alternative therapeutics are emerging which target host proteins required by pathogens to replicate and persist within the host organism. if these host factors are indispensable for pathogens, but not essential for host cells, their silencing may inactivate pathogenic activity, allowing them to serve as therapeutic targets [ , ] . in the light of phi studies, some human factors required by viral and bacterial pathogens have been determined for hiv [ ] [ ] [ ] [ ] [ ] , hcv [ ] , west nile virus [ ] , influenza virus [ , ] , and m. tuberculosis [ ] in recent years. despite the efforts reviewed here, the use of systems biology approaches to investigate phi is still considered relatively undeveloped. the availability of new phi network data, together with further topological and functional analyses of pathogen-host systems, are expected to shed more light on infection mechanisms and novel therapeutic targets for infectious diseases in the near future. we particularly thank dr. tunahan Çakır for critical reading of the manuscript and for his contributions to figure . the financial support was provided by the research funds of bogaziçi university, through project d. the doctoral scholarship for saliha durmuş tekir is sponsored by tÜbitak, is gratefully acknowledged. the authors declare no conflict of interest. figure . the number of scientific publications including phi-related terms in pubmed in the post genomic era. the searched phi-related terms: "pathogen host interactions", "host pathogen interactions", "pathogen host interaction", "host pathogen interaction", "pathogen-host interactions", "pathogen-host interaction", "host-pathogen interactions", "host-pathogen interaction". infectious diseases: for considerations the st century host-pathogen systems biology filoviruses are ancient and integrated into mammalian genomes new strategies to fight infectious diseases -arms race on a microscale signaling during pathogen infection the landscape of human proteins interacting with viruses and other pathogens antibacterial drug discovery and structure-based design nucleotide sequence of bacteriophage ϕx dna whole-genome random sequencing and assembly of haemophilus influenzae rd the sequence of the human genome protein function in the post-genomic era a novel genetic system to detect protein-protein interactions interaction mating reveals binary and ternary connections between drosophila cell cycle regulators toward a functional analysis of the yeast genome through exhaustive two-hybrid screens functional organization of the yeast proteome by systematic analysis of protein complexes systematic identification of protein complexes in saccharomyces cerevisiae by mass spectrometry global analysis of protein activities using proteome chips a protein linkage map of escherichia coli bacteriophage t a genomic approach of the hepatitis c virus generates a protein interaction map genome-wide analysis of vaccinia virus protein-protein interactions the protein-protein interaction map of helicobacter pylori interaction network containing conserved and essential protein complexes in escherichia coli largescale identification of protein-protein interaction of escherichia coli k- toward a protein-protein interaction map of the budding yeast: a comprehensive system to examine two-hybrid interactions in all possible combinations between the yeast proteins a comprehensive analysis of protein-protein interactions in saccharomyces cerevisiae protein interaction mapping in c. elegans using proteins involved in vulval development the biomolecular interaction network database and related tools: update the biogrid interaction database: update the database of interacting proteins: update human protein reference database the intact molecular interaction database in mint, the molecular interaction database: update mpact: the mips protein interaction resource on yeast reactome: a database of reactions, pathways and biological processes the string database in : functional interaction networks of proteins, globally integrated and scored protein-protein interactions between hepatitis c virus nonstructural proteins herpesviral protein networks and their interaction with the human proteome epstein-barr virus and virus human protein interaction maps analysis of intraviral protein-protein interactions of the sars coronavirus orfeome genome-wide analysis of protein-protein interactions and involvement of viral proteins in sars-cov replication hepatitis c virus: structure, protein products and processing of the polyprotein precursor flaviviridiae: the viruses and their replication the complete dna sequence of vaccinia virus genetically engineered poxviruses for recombinant gene expression, vaccination, and safety the complete dna sequence of varicella-zoster virus identification of herpesvirus-like dna sequences in aids-associated kaposi's sarcoma the genome of epstein-barr virus type strain ag emergence of scaling in random networks specificity and stability in topology of protein networks unique and conserved features of genome and proteome of sars-coronavirus, an early split-off from the coronavirus group lineage towards a proteome -scale map of the human protein-protein interaction network a human proteinprotein interaction network: a resource for annotating the proteome a first-draft human protein-interaction map a proteome-wide protein interaction map for campylobacter jejuni the binary protein interactome of treponema pallidum -the syphilis spirochete a global protein-protein interaction network in the human pathogen mycobacterium tuberculosis h rv an expanded protein-protein interaction network in bacillus subtilis reveals a group of hubs: exploration by an integrative approach proteome organization in a genome-reduced bacterium primary structure of the succinyl-coa synthetase of escherichia coli one-step inactivation of chromosomal genes in escherichia coli k- using pcr products new partners of acyl carrier protein detected in escherichia coli by tandem affinity purification complete genome eequence of treponema pallidum, the syphilis spirochete complete sequence analysis of the genome of the bacterium mycoplasma pneumoniae epidemiology, strategy, financing mycobacterium tuberculosis interactome analysis unravels potential pathways to drug resistance uncovering new signaling proteins and potential drug targets through the interactome analysis of mycobacterium tuberculosis an expanded view of bacterial dna replication dna polymerase i acts in translesion synthesis mediated by the y-polymerases in bacillus subtilis cell-cycle-dependent spatial sequestration of the dnaa replication initiator protein in bacillus subtilis chemical genetics of plasmodium falciparum a protein interaction network of the malaria parasite plasmodium falciparum inferring genome-wide functional linkages in e. coli by combining improved genome context methods: comparison with high-throughput experimental data global functional atlas of escherichia coli encompassing previously uncharacterized proteins the battle of two genomes: genetics of bacterial host/pathogen interactions in mice structural microbiology at the pathogen-host interface mining host-pathogen interactions host pathogen protein interactions predicted by comparative modeling computational prediction of host-pathogen protein-protein interactions a data integration approach to predict host-pathogen protein-protein interactions: application to recognize protein interactions between human and a malarial parasite ortholog-based protein-protein interaction prediction and its application to interspecies interactions prediction of interactions between hiv- and human proteins by information integration prediction of hiv- virus-host protein interactions using virus and host sequence motifs structural similarity-based predictions of protein interactions between hiv- and homo sapiens hepatitis c virus infection protein network a physical and regulatory map of host-influenza interactions reveals pathways in h n infection the human-bacterial pathogen protein interaction networks of bacillus anthracis, francisella tularensis, and yersinia pestis insight into bacterial virulence mechanisms against host immune response via the yersinia pestis-human protein-protein interaction network phi-base update: additions to the pathogen host interaction database virus-mint: a viral protein interaction database virhost-net: a knowledge base for the management and the analysis of proteome-wide virus-host interaction networks patric: the comprehensive bacterial bioinformatics resource with a focus on human pathogenic species infection strategies of bacterial and viral pathogens through pathogen-human protein-protein interactions document classification for mining host pathogen protein-protein interactions text mining for discovery of host-pathogen-interactions literature mining of hostpathogen interactions: comparing feature-based supervised learning and language-based approaches global landscape of hiv-human protein complexes analysis of the human protein interactome and comparison with yeast, worm and fly interaction datasets gene annotation and pathway mapping in kegg the multifunctional ns protein of influenza a viruses connecting viral with cellular interactomes gay compromise syndrome how does hiv cause aids? science hiv- nef impairs mhc class ii antigen presentation and surface expression hiv- tar mir-na protects against apoptosis by altering cellular gene expression anchorage of hiv on permissive cells leads to coaggregation of viral particles with surface nucleolin at membrane raft microdomains human immunodeficiency virus type vpr interacts with antiapoptotic mitochondrial protein hax- hiv- envelope triggers polyclonal ig class switch recombination through a cd -independent mechanism involving baff and c-type lectin receptors gene ontology: tool for the unification of biology francisella tularensis induces cytopathogenicity and apoptosis in murine macrophages via a mechanism that requires intracellular bacterial multiplication macrophage apoptosis by anthrax lethal factor through p map kinase inhibition inhibition of mapk and nf-kb pathways is necessary for rapid apoptosis in macrophages infected with yersinia graemlin: general and robust alignment of multiple large interaction networks conserved patterns of protein interaction in multiple species network-based prediction and analysis of hiv dependency factors identification of host proteins required for hiv infection through a functional genomic screen global analysis of hostpathogen interactions that regulate early-stage hiv- replication genome-scale rnai screen for host factors required for hiv replication host cell factors in hiv replication: meta-analysis of genome-wide studies a genome-wide genetic screen for host factors required for hepatitis c virus propagation rna interference screen for human genes associated with west nile virus infection genome-wide rnai screen identifies human host factors crucial for influenza virus replication human host factors required for influenza virus replication genome-wide analysis of the host intracellular network that regulates survival of mycobacterium tuberculosis key: cord- -mrh pbi authors: dumitrascu, georgiana r.; bucur, octavian title: critical physiological and pathological functions of forkhead box o tumor suppressors date: - - journal: nan doi: . /d. . sha: doc_id: cord_uid: mrh pbi the forkhead box, subclass o (foxo) proteins are critical transcription factors, ubiquitously expressed in the human body. these proteins are characterized by a remarkable functional diversity, being involved in cell cycle arrest, apoptosis, oxidative detoxification, dna damage repair, stem cell maintenance, cell differentiation, cell metabolism, angiogenesis, cardiac development, aging and others. in addition, foxo have critical implications in both normal and cancer stem cell biology. new strategies to modulate foxo expression and activity may now be developed since the discovery of novel foxo regulators and non-coding rnas (such as micrornas) targeting foxo transcription factors. this review focuses on physiological and pathological functions of foxo proteins and on their action as fine regulators of cell fate and context-dependent cell decisions. a better understanding of the structure and critical functions of foxo transcription factors and tumor suppressors may contribute to the development of novel therapies for cancer and other diseases. the forkhead box (fox) proteins represent a wide family of transcription factors that display an extraordinary functional diversity, regulating a variety of critical biological processes , . fox proteins are well known to control the following physiological procceses: apoptosis, cell-cycle, cellular metabolism, immune response, differentiation, development (such as cardiac development) or aging. fox proteins are also involved in various pathologies, such as cancer, diabetes and neurodegenerative diseases , . the first forkhead transcription factor was first identified in , and its function was related to the development of the anterior and posterior gut of the drosophila embryo. the "forkhead" name was given because of the two spiked-head structures found in the embryos of the drosophila melanogaster forkhead mutant presenting modifications of the gut formation . one year later, the sequence comparison between forkhead and mammalian hepatocyte-enriched transcription factor hnf- a showed a conserved -aminoacid dna binding domain, bringing evidence that forkhead proteins are a new class of transcription factors . hence, forkhead transcription factors are defined by their winged-helix dna binding domain, a conserved structure called "forkhead box", the symbol fox being assigned to all vertebrate forkhead transcription factors, according to the revised nomenclature . since its discovery, numerous forkhead genes have been identified in a broad range of organisms, from yeast and worms to humans . interestingly, up to now, fox genes have not been identified in plants . however, an update published in reported that are fox genes identified and classified in the human genome and in the mouse genome . therefore, a standard nomenclature system was required for this extended number of discovered factors. in , daniel e. martinez and his team established fox nomenclature committee, the first step towards an unified nomenclature for the winged-helix / forkhead transcription factors. the committee has changed the initial terms for the forkhead proteins (e.g. freac -forkhead related activator; fkh -drosophila gene fork head) . currently, based on phylogenetic analysis, foxo genes are classified in subclasses that range from foxa to foxs to yield subclasses in total , , , . each subclass has several members noted with an arabic number. while for human forkhead proteins abbreviations have all letters in uppercase (e.g., foxd ), for mouse only the first letter is capitalized (e.g., foxd ) and for all other chordates the first and subclass letter are in uppercase (e.g., foxd ) . one of the largest and the most important subclass of fox family is represented by foxo (forkhead box, subclass o) transcription factors. foxo transcription factors (foxos) are characterized by the same conserved dna binding domain that define the family of forkhead proteins . however, foxos share an additional unique amino acid sequence insertion within the dna binding domain that is not present in other forkhead proteins . only one foxo transcription factor is known in invertebrates (named abnormal dauer formation protein /daf- in the nematode worm caenorhabditis elegans and dfoxo in the fruit fly drosophila melanogaster), while in mammals four foxo proteins encoded by four different genes were identified: foxo , foxo , foxo and foxo . initially, foxo transcription factor (previously known as fkhr-forkhead in rhabdomyosarcoma) was identified through its involvement in chromosomal translocations t( ; ) and t( ; ) in alveolar rhabdomyosarcoma tumors due to pax / -fkhr fusion transcript , . a few years later, foxo (previously known as fkhrl -forkhead in rhabdomyosarcoma like protein ) was characterized and named, based on similarities to fkhr . the gene for foxo was described as being fused to mll transcription factor as a result of the t(x; ) chromosomal translocation in acute lymphoblastic leukemia, therefore, the initial term for foxo was afx (the acute leukemia fusion gene located on chromosome x) . foxo proteins are considered unique cellular targets, regulating a wide variety of critical cellular processes, such as: apoptosis, oxidative stress, dna damage repair, cell cycle, stem cell proliferation and maintenance, metabolism, angiogenesis, vascular tone, cardiovascular development, fertility, immune response and neuronal survival , . the aim of this review is to discuss the most recent advances in elucidating the functions, structure and transcriptional regulation of forkhead box o transcription factors, both in physiological and pathological conditions. the advances in understating the mechanism of foxos regulation of stem cells, cancer stem cells and how non-coding rnas are regulated and regulate the function of foxo genes/protein are presented. in humans, the primary structure of the foxo proteins is characterized by a length of approximately - amino acid residues (aa) for foxo ( aa) and foxo ( aa), and a shorter sequence of approximately amino acids for foxo ( aa) and foxo ( aa) , , , . all members of the foxo family consist of four domains: a forkhead dna-binding domain (dbd), a nuclear localization signal (nls), a nuclear export sequence (nes) and a c-terminal transactivation domain (figure ) . the forkhead dna-binding domain (dbd), also called the forkhead box, is described as a "winged helix" due to the butterfly-like aspect on x-ray crystallography and nuclear magnetic resonance . analysis of the amino acid sequences alignment of foxo proteins revealed a highly conserved dna binding domain among all the members of the broader group of forkhead transcription factors, but also among species, in eukaryotic organisms, from yeast to humans , . moreover, several studies point out that forkhead dbd is not the only region in the foxo molecule that is highly conserved. similarities were also observed in the n-terminal region near the first akt/protein kinase b (pkb) phosphorylation site (thr for foxo and thr for foxo ), the region containing nls, and sequences from c-terminal transactivation domain . as mentioned above, the forkhead domains in foxo subclass of fox family are similar to the forkhead regions of other subclasses. for example, structural similarities have been proven in the core region of foxo compared to foxa , foxk and foxp . the forkhead/winged helix motif is a shared sequence between subclasses of foxo, of around amino acids that folds into three alphahelices (h , h and h ), three beta-strands (s , s and s ) and two wing-like loops (w and w ). the structure has a h -s -h -h -s -w -s -w topology and the strands s , s and s interact with each other forming a beta-sheet. while the n-terminal part of the domain is formed by a cluster of three alpha-helices, the c-terminal half consists of beta strands s and s and two large loops (wings w and w ) . the main dna recognition site is represented by the highly conserved helix h of forkhead dna-binding domain. the stability of forkhead dbd -dna complexes is increased by the more variable regions of the dbd, including h , the region between h and h , wing w and c-terminal segment . although the foxo forkhead box is clearly related to those found in other forkhead genes, all foxo members contain an aditionally segment of five amino acid ( -gdsns- ) between helices h and h , that creates a small extra loop, forming a coil structure in the foxo dbd, and a short helix in the foxo dbd , . this is important in sequence-specific interactions with dna-binding sites . foxo proteins mediate transcriptional activation through binding to the conserved consensus core motif ttgtttac in the dna . they bind dna through a foxo-recognized element with a t/c-g/a-a-a-a-c-a-a consensus sequence in the forkhead dbd (c-terminal) . there are fourteen protein-dna contacts described, which mediate the activation/inhibition of foxo's target genes, such as bim, trail, p , p and catalase. the main recognition site is the α-helix h . forkhead proteins function as transcription factors that bind to dna through their forkhead domain in order to upregulate or downregulate the expression of a tremendous number of target genes . hence, the foxo transcription factors control the expression of a wide spectrum of genes that regulate essential physiological cellular processes, such as cell death/cell survival, cell cycle, cell proliferation, cell differentiation/development, angiogenesis, cell metabolism, stress response and stem cell maintenance , , , , , (figure ) . moreover, foxos are also involved in a wide range of pathological cellular processes, such as cancer, neurodegenerative diseases (parkinson's disease, alzheimer), diabetus mellitus, cardiac failure, atherosclerosis, hypertension and anovulation , , , , , , , . differences between functions and regulatory mechanisms of foxo and foxo proteins are considered to be partially redundant, with several exceptions. for example, akt phosphorylates all foxo family members, inducing their translocation into the cytoplasm and inactivation, while pp a dephosphorylates part of the akt phosphorylation sites in foxo and foxo , reactivating them . several particular differences regarding the regulatory kinases, e ligases or other important enzymes and regulators exist. thus, their activity is in part controlled by different mechanisms . foxo transcription factors regulate the expression of multiple pro-apoptotic and anti-apoptotic proteins, and they have the ability to induce apoptosis by activating either intrinsic or extrinsic pathways of apoptosis , , , , . moreover, the consensus foxo recognition element (fre) -(g/c)(t/a)aa(c/t)aa -which differs from that of other forkhead proteins, seems to have a very important role in both apoptotic pathways, since matching functional fre sites have been identified in the promoters of foxo target genes encoding fas ligand (fasl), insulin like growth factorbinding protein (igfbp ), the apoptotic regulator bcl- interacting mediator of cell death (bim) and others . foxo triggers the mitochondria-dependent intrinsec apoptotic pathway through upregulation of multiple pro-apoptotic bcl- family members, such as puma, bim, noxa, bnip , and downregulation of the anti-apoptotic bcl- family member bcl-xl , , , , . the expression of the anti-apoptotic protein bcl-xl is suppressed by foxo proteins, such as foxo , after increasing the expression of the transcriptional repressor bcl- , in order to trigger apoptosis . both foxo-induced expression of the proapoptotic bcl- family members and foxodownmodulated anti-apoptotic bcl- family members lead to apoptosis due to mitochondrial outer membrane permeabilization (momp), as a response to intracellular stress, growth factor deprivation or other factors . these mitochondrial modifications result in release of cytochrome c, smac/diablo and omi/htra from mitochondria, subsequently triggerring downstream caspases activation , . apaf binds cytochrome c forming a complex named apoptosome, required for caspase activation (initiator caspase) . caspase activation further activates the effector caspases, such , or , triggering caspase cascade . foxo proteins are also able to induce apoptosis by activating the receptor-dependent extrinsic pathway of apoptosis. they induce the upregulation of the death receptor ligands fasl and trail, promoting an autocrine and/or paracrine action of these death ligands on the death receptors . fas signaling pathway is important in apoptosis induction through the extrinsic pathway, since jurkat cells lacking several critical components of fas pathway fail to induce foxodependent cell death . upregulation of tumor necrosis factor related apoptosis inducing ligand (trail) by increased levels of foxo or foxo in cancer cells, such as prostate carcinoma cells, leads to apoptosis . thus, binding of fas ligand (fasl) and trail to their receptors (fas/cd /apo- for fasl and dr , dr for trail) triggers a death-inducing signaling complex (disc) with a subsequent activation of caspases, mainly initiator caspase and effector caspase , leading to apoptosis . moreover, foxo transcription factors directly regulate the expression of tumor necrosis factor receptorassociated death domain (tradd). activation of foxo proteins by pi k-akt pathway inhibitors results in increased expression of tradd protein . tradd is an important adaptor protein interacting with tnfr and fas receptors, mediating apoptosis and nfkb pathway activation . under normal physiological conditions, cyclins and their associate cyclin dependent kinases (cdks) are very important for the progression of the cell cycle . in response to dna damage, foxo transcription factors increase the expression levels of the cdk inhibitors binding to cyclin/cdk complexes, such as p (also known as cdk inhibitor ) and p (kip ) , . cdk inhibitors together with foxo-induced inhibition of cyclins expression act by stopping the cell cycle at different checkpoints in order to repair the dna damage or to remove the damaged cell . for example, a study performed on d murine myeloid cells and on baf murine pre-b-cell lines brings evidence that endogenous foxo proteins are required to enforce cell cycle checkpoints after the cell cycle arrest at g /s transition is induced by foxo by upregulating the negative regulators of the g /s phase, such as the cdk inhibitors: p kip , p kip , p cip /waf (cip/kip family), p ink , p ink (ink family) and the retinoblastoma protein family member p .also, foxo decreases the positive regulators, such as cyclin d , or cyclin d , blocking the g /s transition . additionally, foxo increases the expression of negative regulators such as gadd alpha and cyclin g , resulting in cell cycle arrest at g /m . surprisingly, foxo proteins act as transcription factors for plk expression during cell cycle, plk being critical in promoting g -m phase transition, m phase progression and end of mitosis . a complete knock-down of foxo protein levels induces an arrest in cell cycle, suggesting that a basal, low levels of foxos activity is required for the cell cycle progression . surprisingly, in specific settings, foxo may actually serve as a promoter of proliferation. for example, neutrophils isolated from foxo deficient mice show high levels of fasl expression and apoptosis, revealing that foxo may represses fasl expression in neutrophils, leading to proliferation in this type of cells . foxo was also shown to positively influence proliferation induction in pancreatic β cells in vitro, under low nutritional circumstances. the mechanisms implicated in this process are not completely understood. however, the induction of ccnd gene transcription, which encodes cyclin d , may at least partially be involved. cyclin d represents one of the earliest cell cycle-related events, being critical for the g to s phase progression during the cell cycle . thus, foxo proteins coordinate the expression of multiple important cell cycle regulators, in order to block the g /g , g /s or the g /m transitions during cell cycle when needed, such as after dna damage. however, a basal level of foxo proteins is required for g -m cell cycle transition, when foxo-dependent plk expression seems to be necessary. these results suggest that foxo proteins have to be tightly regulated and are important and sensitive regulators of cell cycle, proliferation and other critical cellular processes. foxo proteins play an important role in regulating differentiation of a wide variety of tissues. it is well known that foxos can control the differentiation of precursor cells into muscle, adipose tissue or blood cells. interestingly, foxos effect on differentiation is context dependent and sometimes foxo isoform dependent. while foxo promotes differentiation of erythroid cells, foxo suppresses the differentiation of precursor cells in adipose and muscle tissues . foxos can also suppress bone formation by inhibiting wnt-β catenin-tcf signaling. wnt signaling is known to stimulate bone formation . moreover, foxo proteins are required for maintenance of somatic/adult stem cells, such as hematopoietic stem cells, and they also regulate cancer stem cells (see details in chapter ). metabolic signaling mediated by forkhead transcription factors is conserved among multiple species, including but no limited to mammals, drosophila melanogaster and caenorhabditis elegans . it was first noticed for the foxo homolog named dauer formation- (daf- ), in the caenorhabditis elegans worm . foxo proteins are involved in critical physiological processes that regulate cellular metabolic activity in many organs, such as liver, pancreas, adipose tissue and hypothalamus , , , . in the liver, foxo forms a complex with another transcription factor, the liver specific pgc alpha, in order to induce gluconeogenesis by upregulating g pase and pepck genes. this is important for maintaining glucose homeostasis . confirming these results, loss of hepatic foxo genes in mice induces a downmodulation of gluconeogenesis and an upregulation of glycolysis . in addition, ectopic expression of foxo in rat primary hepatocytes increases apociii, an inhibitor of lipoprotein lipase, suggesting that foxo is involved in regulating the lipid metabolism . recently, nicotinamide phosphoribosyltransferase (nampt) gene was described to be a new transcriptional target gene of foxos for regulating hepatic triglyceride levels . in pancreas, foxo plays an important role in the maintenance of beta cell function, but also in the development of the pancreas, through repression of the pancreatic transcription factor pdx . aditionally, foxo increases the food intake by upregulating agrp and npy, and by downregulating pomc in hypothalamus, antagonizing the anorexigenic hormone leptin , . recently, g-protein-coupled receptor gpr was described to be a foxo target that activates agrp, suggesting a new mechanism for foxo -agrp mediated food intake that might provide a new treatment for obesity . all these results establish foxo proteins as important regulators of cellular metabolism and potential target in metabolic related diseases. foxo transcription factors function as sensors for reactive oxygen species (ros) and play an important role in cellular resistance to stress, increasing cellular survival. foxo proteins are able to protect the cell from oxidative damage, decreasing the availability of ros. they stimulate the expression of certain genes responsible for the ros inactivation, such as the antioxidant enzymes manganese superoxide dismutase (mnsod), that catalyzes the conversion of superoxide to hydrogen peroxide, and catalase, that converts hydrogen peroxide into water and oxygen . studies on human cardiac fibroblasts revealed that foxo also modulate the antioxidant enzyme peroxiredoxin iii, fighting against cell damage induced by oxidative stress. concomitant peroxiredoxin iii knockdown and foxo knockdown resulted in higher levels of hydrogen peroxide in response to serum starvation, as compared to peroxiredoxin iii knockdown only . other foxo transcriptional targets (figure ), such as sterrole carrier proteins (scps), are now known to play an important role during the defence against oxidative stress. scps are implicated in protecting fatty acids from peroxidation . foxo proteins are also able to increase dna repair, inducing a cell cylce arrest at g /m checkpoint, in order to provide time for repairing the dna damage. one of the transcriptional target of foxo, implicated not only in cell cycle arrest but also in dna repair and cell survival in response to cellular stress, is gadd . similarly, ddb gene protein product is able to repair the dna damage through upregulation by foxo . foxos regulate the longevity of the cell mainly by increasing resistance to stress, modulating the dna repair process and maintaining the stem cells . the lifespan extension is an important function of foxo that is conserved among several species. a well studied example is the role of foxo homologous daf- (caenorhabditis elegans) in aging . loss of foxo in human skin fibroblasts results in aging specific morphological changes, suggesting that foxo is necessary for maintenance or promotion of cellular longevity . moreover, loss of foxo activity leads to decreased mnsod and enhanced cell injury in vascular smooth muscle explanted from aged mice, due to limied ros inactivation . thus, cellular lifespan maintenance requires a certain level of foxos activity and this is mainly achieved by inactivation of akt activity . foxo proteins do not only play an important protective role during senescence/aging, but also during exercise. it was previously suggested that foxos may at least partially be involved in the exerciseinduced beneficial cardiac effects. interestingly, exercise induces an upregulation of foxo and sirt in the heart, with subsequent increased mnsod, catalase and gad alpha, and decreased cyclin d , suggesting the protective role of foxo proteins . interestingly, other studies revealed that, in conditions of oxidative stress, foxo is also able to induce apoptosis by triggering a fas-mediated death pathway in cultured motoneurons, by activating trail, bh -only proteins noxa and bim, and by promoting pro-apoptotic activity of p . additional reports suggest that suppression of foxo proteins expression during oxidative stress can be protective to some extent for cells, since protein inhibition or gene knockdown of foxo and foxo decreases the ischemic infarct size in the brain, protects the metabotropic glutamate receptors during vascular injury, enhances pancreatic β-cell or neuronal survival through nad + precursors and provide trophic factor protection with erythropoietin (epo) and neurotrophins . this suggests that while a certain level of foxo activation is required in cellular stress resistance, sustained activation or activation over a threshold of foxo is detrimental and may induce apoptosis. thus, foxo plays an important role in cellular decision during stress, helping the cells to survive by multiple mechanisms (such as dna repair, ros inactivation etc) and inducing apoptosis when dna or cellular damage can't be repaired. foxo proteins play a central role in maintaining the immune response of the human body. cell type-specific deletions of foxo and/or foxo in mice revealed an important role of foxo proteins in regulating immunological homeostasis and tolerance, by controlling the function and development of b and t lymphocytes. thus, foxo and foxo are essential transcription factors involved in early b cell development and peripheral b cell function, since early deletion of foxo blocked b cell differentiation at the pro-b cell stage. this is due to a defect in interleukin receptor alpha (il- ralpha) expression. deletion of foxo in peripheral b cells was associated with defective expression of both cd l and aid, with subsequent failure in class-switch recombination and reduced igg production upon immunization. moreover, it is known that the pi k-akt-mediated inactivation of foxo is essential for the optimal proliferation of b cells, while ectopic expression of pi k-independent variants of foxo or foxo (active triple mutants at the akt phosphorylation sites) resulted in cell cycle arrest and increased cell death in b cells . another study on mice with t cell-specific deletion of foxo showed a decreased expression of il- receptor on mature t-cells, suggesting that il -r is a transcriptional target of foxo , which is mediating through binding with il- the survival and homeostatic proliferation of peripheral t cell . excesive inflammatory cells may become harmful for the human body through the generation of ros and through the production of cytokines. studies performed on mice deficient of foxo ilustrated lymphoproliferation, inflammation of the salivary glands, lung, and kidney, and increased activity of helper t cells. these observations demonstrate the beneficial role of foxo proteins in human body, by preventing t cell hyperactivity . also, it was demonstrated that mir- has a central role in the late phase of clonal expansion of the helper t lymphocytes, by inducing il- which is able to inactivate foxo . foxo was found to be a suppressor of proliferation in resting helper t lymphocytes and, in order to allow proliferation, t cell activation via tcr/cd and il- r signaling must inhibit foxo . other studies reported that t cells derived from bim and puma deficient mice were resistant to apoptosis after il- deprivation, demonstrating that foxo , through upregulation of bim, puma and p kip is important for the induction of cell cycle arrest and apoptosis of t cells, in the absence of cytokines . foxo proteins may be benefical for autoimmune disorders by inducing a fas mediated apoptosis that target activated t cells, followed by a decrease in cytokine stimulation in patients with autoimmune lymphoproliferative syndromes . foxo proteins play an important role not only during physiological cellular processes, but also in few pathologies, such as cancer. foxos are well known tumor suppressor proteins. although foxo proteins protect the human body by playing a central role in a wide range of mainly physiological functions (as described earlier), under some circumstances, foxo's roles can become harmful for the human cells. this is because foxo is also involved in a number of pathological functions, such as inflammation and muscle atrophy. for example, apoptosis places foxo proteins on the good side when it leads to tumor suppression. however, cellular apoptosis can become itself a significant component for pathology in diseases such as neurodegenerative disease, diabetes mellitus (dm), and cardiovascular injury , , , . cancer: foxo proteins are tumor suppressors foxo proteins are inactivated in a wide variety of malignancies, eiher posttranslational (mainly by pi k-akt mediated phosphorlation) or by fusion mutations (such as pax -foxo ; mll-foxo , mll-foxo ) , . inactivation of foxo by pi k-akt pathway activation is a common feature of many malignancies, such as prostate cancer, breast cancer, leukemia and glioblastoma . conditional deletion of foxo family members in mice leads to the lymphomas and hemangiomas, which suggests that loss of foxos maintains or promotes survival of tumor cells . foxos loss or inactivation is known to play an important role in cancer tumorigenesis or progression, in vivo . for example, a recent study described a significant correlation between low expression of foxo and a poor prognosis for gastric cancer patients, bringing evidence that foxo could be a valuable prognostic biomarker for patients with gastric cancer . in addition, foxo overexpression was shown to reduce motility, invasiveness, and aggressiveness in estrogen receptor α-positive (erα+) breast cancer cells . foxo proteins exert their tumor suppressor functions predominantly by promoting cell cycle arrest, apoptosis, ros inactivation and dna repair, through expression of their target genes , . for instance, foxo -induced expression of bim, a pro-apoptotic member of the bcl- family of proteins induced a caspase-dependent cell death in several types of cancers, such as in chronic leukemias, breast and gastric cancers . similarly, foxo-induced trail and noxa expression induces apoptosis in many malignancies, including leukemias . interestingly, foxo can also inhibit the protooncogene c-myc, indirectly controlling the transcription of a wide set of target genes implicated in cell survival, cell cycle, apoptosis and tumorigenesis. myc is a transcription factor and a well known promoter of survival, proliferation and tumorigenesis, being found upregulated in a large variety of malignancies. noteworthy, activation of foxo proteins not only induces cell cycle arrest or apoptosis, but also a differentiation program. in chronic myeloid leukemia (cml), foxo can induce cml leukemic cells differentiation, by inhibiting the expression of id (inhibitor of dna binding ). cml is characterized by the presence of the bcr-abl fusion protein, which is a constitutive active kinase, controlling many crucial downstream pathways implicated in cell cycle, proliferation, apoptosis and cell adhesion. bcr-abl activates pi k-akt pathway, inactivating foxo proteins and their pro-apopotic and cell cycle arrest signals . the use of the tyrosine kinase inhibitors (tkis) and of the proteasome inhibitor bortezomib results in inhibition of bcr-abl activity and its downstream pathways, with a subsequent activation of foxo proteins . tkis treatment induces a foxo-dependent suppression of id expression, leading to k bcr-abl positive cell line differentiation. thus, bcr-abl can maintain the leukemic state not only by promoting proliferation, cell cycle progression and inhibiting apoptosis, but by also inhibiting foxo-mediated differentiation . inflammation (rheumatoid arthritis, osteoarthritis, systemic lupus erythematosus) it was shown that loss of functional foxo proteins lead to inflammatory cell activation in several disorders, with the subsequent cellular damage, through oxidative stress and excess of cytokines. inactivation of foxo in t lymphocytes, as well as inactivation of foxo and foxo in synovial macrophages in patients with rheumatoid arthritis and osteoarthritis result in inflammatory cell activation. in addition, loss of foxo proteins may be a potential etiology for systemic lupus erythematosus (sle) and rheumatoid arthritis, since foxo gene transcript levels are downregulated in peripheral blood mononuclear cells of these patients . a link between inflammation, insulin resistance and foxo transcription factors was previously suggested when downregulated foxo decreased the levels of c/ebp beta transcription factor in adipocytes, with the subsequent reduction in expression of the pro-inflammatory cytokines ccl (chemokine ligand ) and il- . thus, foxo indirectly might induce an inflammatory status of the adipose tissue, responsible for the insulin resistance in type diabetes. yet, there are studies that describe the ability of foxo proteins to directly induce inflammation through upregulation of the inflammatory cytokine il- b . however, further experiments are required in order to clearly understand the context related mechanisms of foxo-dependent modulation of inflammation. muscle atrophy appears in a variety of diseases, including cancer, diabetes and sepsis, and is characterized by accelerated proteolysis that can be induced through two pathways: the ubiquitinproteasome pathway and through lysosomal pathway, as a consequence of autophagy . studies revealed that foxo proteins can induce muscle atrophy, characterized by decreased muscle function . thus, foxo proteins have been shown to increase the transcription of key regulators of both lysosomal and proteasomal proteolysis: the autophagy followed by the lysosomal proteolysis is stimulated by upregulation of bnip , lc , and gabarapl , while the proteasomal proteolysis is induced by increasing the ubiquitin ligase atrogin . it was also shown that foxo activity is both required and sufficient for induction of autophagy in muscle cells, since studies performed on adult muscle fibers from mice show that the ectopic expression of an active foxo mutant leads to lysosomal proteolysis after formation of autophagosomes, while the knockdown of foxo in these muscles fibers blocked autophagosome formation after starvation , . autophagy has a critical role in maintaining the cellular and metabolic homeostasis. it seems that the metabolic status of the cell strongly influences this process in both normal cells and cancer cells, despite the profound differences in their metabolism . in cancer cells, atp is predominantly produced through the constitutive activation of aerobic glycolysis, process that is modulated by the transcription factor hif α . since p α is required to maintain the levels of hif α target genes, researchers demonstrated that in colorectal cancer cells, the inhibition of p α causes a rapid drop in atp levels, with an acute energy need which activates foxo in an ampkdependent manner, in order to induce autophagy, cell cycle arrest and cell death in these cells , . moreover, the knockdown of foxo was sufficient to induce hypertrophy in cultured neonatal rat cardiomyocytes. in these cells, stimulation with insulin inhibits foxo function, with subsequent downregulation of the antioxidant enzyme catalase and increased levels of ros, that in low levels may act as second messengers for intracellular signaling, making possible the increase in cell size . atrogin , upregulated by foxo and foxo , plays important roles in cardiovascular system, since mice lacking atrogin- are susceptible to cardiac hypertrophy . foxo proteins are inducing atrophy of differentiated cardiac and skeletal muscle cells through protein synthesis inhibition, which leads to a decrease in cell size . in skeletal muscle, this mechanism involves myostatin, a foxo transcriptional target and a secreted molecule that can induce atrophy by protein synthesis inhibition . foxo might be implicated in the development or progression of type diabetes, since increased foxo expression in diabetic mouse liver is associated with increased expression of pepck and g pase, and inhibition of foxo activity downregulated both pepck and g pase expression and normalized the blood glucose levels. thus, foxo -mediated expression of g pase and pepck is critical for gluconeogenesis in the liver during fasting, but its deregulation may be involved in diabetes etiology . physiological functions of foxo take place under certain circumstances, since sometimes the ability to maintain the proper control is overwhelmed . experiments on insulin-producing mouse pancreatic beta cells (betatc- ) show that chronic exposure to high glucose activates foxo transcription factors and leads to upregulation of endogenous inflammatory cytokines interleukin- beta (il- beta) and suppressors of cytokine signalling (socs). these events trigger the activation of caspase- with subsequent apoptosis, suggesting a new mechanism that leads to the destruction of endocrine pancreas in type diabetes . also, exposure to high glucose of the cardiac microvascular endothelial cells (cmecs) isolated from hearts of adult rats show that foxo transcription factors leads to reactive oxygen species (ros) accumulation and apoptosis, suggesting that foxos might be involved in microvascular complications of diabetes . foxo is also involved in insulin resistance and metabolic syndrome, since the activation of foxo in cardiomyocytes leads to increased akt activity and attenuated cellular response to insulin, followed by decreased glucose uptake . interestingly, clinical studies regarding metabolic status profile on age-related diseases, fertility, fecundity and mortality revealed higher hba c levels and increased mortality risk associated with specific haplotypes of foxo . foxo proteins are also activated in an attempt to protect the human body against the oxidative stress resulted due to hyperglycemia that leads to increased production of ros in endothelial cells, liver cells, and pancreatic β-cells. this hyperglycemia-dependent ros increase leads to a subsequent development of insulin resistance and significant neurodegenerative and cardiovascular diseases in the patients with diabetus mellitus . foxos are necessary for endothelial cell development and angiogenesis, since mice that are deficient in foxo lack development of the vascular system and die by embryonic day eleven . in addition, foxo and foxo were shown to be the most abundant foxo isoforms in mature endothelial cells, having also an important role in the regulation of postnatal vessel formation, not only in the embryogenesis . unfortunately, angiogenesis is not only involved in critical physiological processes, such as embryogenesis and postnatal vessel formation, but it is also involved in pathological events, such as chronic inflammation and tumor growth . thus, it is fascinating how the angiogenesis mediated by foxo proteins may become a negative element for the organism, antagonizing the tumor suppresor's main function through new vessel formation, that can lead to tumor cell growth . foxo was associated with both cardiomyocyte survival after oxidative stress and heart muscle loss with subsequent ventricular dysfunction , . foxos are activated after akt inhibition by insulin or other factors, leading to atrogin- induction, which results in a suppression of heart muscle cell size . foxo proteins seem to inhibit the vascular smooth muscle cell proliferation and growth in a rat balloon carotid arterial injury model, suggesting a role of foxos in the regulation of vascular tone and systemic arterial blood pressure, preventing or at least lessening the effects of atherosclerosis and hypertension . also, decreased foxo expression due to high flow states in vessels leads to proliferation of vascular smooth muscle cells, vascular neointimal hyperplasia, and subsequent hypertension . moreover, experiments on lowdensity lipoprotein (ldl) receptor knockout mice resulted in the prevention of atherosclerosis when the triple ablation of foxo , foxo and foxo was induced in endothelial cells . interestingly, the same experiment on myeloid cells lead to more severe atherosclerosis compared to the controls, explained by authors through increased proliferation of granulocyte-monocyte progenitors and high levels of inducible nitric oxide synthase (inos) and oxidative stress, which predispose to atherosclerosis . noteworthy, analysis of mouse oocytes revealed overexpressed foxo transcription factors in primordial and early primary follicles, but downregulated foxo in primary and more developed follicles, suggesting that foxo proteins also have reproductive functions, modulating oocyte and follicular cell maturation. to confirm the hypothesis, constitutively active foxo was induced in transgenic mouse oocytes in primary and more developed follicles, which affected the oocyte growth and follicular development, leading to anovulation and luteinization of unruptured follicles, with subsequent infertility . in addition, foxo and foxo mutations were detected in a small percentage of womens with premature ovarian failure . involvement of foxo family of transcription factors in stem cells self-renewal, survival, proliferation and differentiation is currently under investigation. foxos have been shown to play critical functions in maintaining self-renewal potential and quiescence of hematopoietic stem cells, however, the mechanisms of these processes are not yet well understood , . recent reports show that foxo-mediated regulation of cell cycle, oxidative stress and apoptosis plays an important role in these processes . stem cells are characterized by the capacity of self-renewing and the ability to differentiate . they are necessary in maintenance and propagation of several adult tissues, including but not limited to blood, skin and gastro-intestinal epithelium. adult stem cells or cells with stem cell properties were also found in other critical organs/systems, such as the central nervous system and the lung . previous studies suggested that hematopoietic stem cells are sensitive to reactive oxygen species (ros) levels. foxo family members are known to play a central role in ros detection and in inducing an adaptative response after ros exposure, by inducing the expression of critical enzymes that neutralise ros, such as catalase and manganase superoxide dismutase (mnsod). interestingly, foxo , foxo , and foxo inactivation leads to an upregulation of ros levels in hematopoietic stem cells and their death . deletion of these three foxo family members in mice revealed their importance in controlling ros in stem cells, in vivo . while the number of hematopoietic stem cells in bone marrow of foxo-deficient mice is low, an increase in myeloid progenitor cells in blood is observed. moreover, the repopulation ability is decreased in the absence of foxos and the treatment of foxo-deficient mice with n-acetylcysteine (nac) at least partially rescued these effects . thus, persistent akt activation, which induces inactivation of foxo transcription factors by akt-mediated phosphorylation, results in ros-induced cell death. this is due to the fact that the cells can't synthesize the foxo-dependent ros neutralizing factors catalase and mnsod . between foxo family members, the most important regulator of hematopoietc stem cells survival and self-renewal is foxo , since foxo knockdown induces hematopoietic stem cells depletion . ros neutralizing agent n-acetylcysteine (nac) can rescue hematopoietic stem cells quiescence and at least partially resque their ros-induced loss . all these results suggest that foxo , foxo and foxo -induced resistance to ros is critical in maintaining homeostasis of hematopoietic stem cells in bone marrow . pten tumor suppressor was revealed as an important modulator of hematopoietic stem cells self-renewal and survival. pten is a major inhibitor of akt activation. depletion of pten, induces akt activation and subsequent foxo inactivation. it is likely that foxos mediate at least a part of the pten effects on hematopoietic stem cells . although foxo was established as the most important foxo family member regulator of hematopoietic stem cell's self-renewal, foxo is critical for the human embryonic stem cells (hesc) pluripotency maintenance. foxo induced sox and oct expression is one of the mechanisms responsible for this process. interestingly, in embryonic stem cells akt is not the major regulator of foxo . fascinating, a recent study brings evidence that foxo is a critical regulator of stem cell maintenance in hydra vulgaris, a member of the phylogenetically old animal phyla cnidaria, which has been suggested to be biologically immortal due to the unlimited self-renewal capacity of their stem cells , . foxo transcription factors are not only required for maintenance of somatic stem cells, but they also play an important role in cancer stem cells . recent reports provided evidence that in many types of malignancies, such as leukemia, colon, or gastro-intestinal malignanciens, a small population of cells similar to stem cells exists . these cells, called cancer stem cells, have the potential of forming new tumors. notably, most of the time, the cancer stem cells are resistant to current cancer treatments, and these therapies may result in cancer stem cells enrichment. these cells serve as a starting point in cancer recurrence . similarities between stem cells and cancer stem cells were best described in the hematopoietic system, where similar surface markes and signal transduction patterns were described between the hematopoietic stem cells and leukemia-initiating cells . presented results have not only implications in uncovering the stem cells/cancer stem cells regulation, self-renewal and differentiation, but also for the development of novel therapeutic strategies in cancer, degenerative diseases and many other pathologies . foxo family members are expressed in almost every tissue of the human body, including the nervous system, cardiovascular system, reproductive system of males and females, lung, liver, spleen, pancreas, thymus, and skeletal muscle. however, each member of the foxo family has its own expression pattern, since they are not equally expressed in all tissues . foxo is better represented in adipose tissue . foxo has the highest expression in liver, but it is also being predominantly expressed in heart, brain, kidneys, and ovaries, while foxo is found mainly in the muscle and heart , . the newest member of the transcription family, foxo , is present in the brain. the association of this member with other tissues is still a matter of study . cell lines (immortalized and/or cancer cells) are some of the most used and useful tools for studying the structure, function, regulation and expression of proteins in general, and foxo family members in particular. foxo members expression in different cell lines (including nci group of cell lines) is partially known , . foxo is found to be expressed at high levels in igrov (human ovarian carcinoma cells), astrocyte cells, rl (human follicular lymphoma cells), ht (human colon carcinoma cells) and hek (human embryonic kidney cells). knowing the qualitative and quantitative expression pattern of foxo family members is important in elucidating their functions and regulation. thus, databases summarizing these patterns in various tissues and cell lines are very helpful and necessary. for example, biogps presents experimental results showing the mrna expression levels of a wide number of genes in most of the human tissues and many (mostly human) cell lines , , . regulation of foxos expression is not yet well understood. p was shown to control foxo gene expression by binding to the proximal region of the foxo promoter (cre tandem sites) . non-coding rnas were shown to suppress translation of foxo family members in various tissues and contexts. in particular, the micrornas-dependent inhibition of foxo and foxo expression is better studied and is summariez below, in chapter . however, the main transcription factors implicated in foxos expression are not well understood. interestingly, methylation of fox promotors induces suppression of their expression. as an example, promoter methylation induced by braf results in inactivation of fox genes expression . the mirnas are - nucleotide-long noncoding rna molecules with an important role in post-transcriptional regulation of protein expression that regulate a variety of cellular processes, including cell differentiation, cell cycle progression and apoptosis. these mirnas can function as oncogenes or tumor suppressors, and oncogenic mirnas (oncomirs) are upregulated in cancer cells. in cancer, mirnas were found to be situated both upstream and downstream of the carcinogenesis process and modified expression of some mirnas is the outcome of carcinogenic transformation or progression, revealing that mirnas may be potential diagnostic or prognostic tools in cancer. for instance, micrornas gained a special attention in melanoma studies and the altered pattern of mirna in melanoma seems to be related to apoptosis (mir- b), cell cycle (mir- b) and invasion/metastasis (mir- ) . the microrna (mirna)-mediated regulation of foxo transcription factors was demonstrated by several groups within the last three years. mir- has been shown to specifically target foxo transcription factors irrespective of cell type, since mir- seems to target foxo in melanoma cells, foxo in breast cancer cells and foxo in activated helper t (th) lymphocytes . thus, in melanoma cells, mir- modulate the expression of both foxo and microphthalmia-associated transcription factor (mitf). the inhibition of mir- by anti-mirs (blocking antisense oligonucleotides) hindered melanoma cell migration and triggered their apoptosis . in breast cancer cells, foxo was coordinately targeted by mir- a, mir- , and mir- , while the inhibition of each mirna resulted in induced levels of foxo and reduced breast cancer cell survival . mir- also targets foxo in osteoblasts lineage cells in order to inhibit osteoblast proliferation and differentiation, repressing the osteogenesis . many other micrornas were shown to regulate foxos activity and functions, including apoptosis or cell cycle. mir- has been shown to regulate foxo -induced apoptosis in transitional cell carcinoma . also, increased expression of mir- in breast cancer cells have been shown to downregulate foxo transcription factor with consequent induction of cell proliferation . also, increased mir- , mir- and mir- downregulate the expression of foxo transcription factor in classical hodgkin lymphoma (chl) cell lines, suggesting that decreased foxo expression is involved in lymphomagenesis . moreover, a recent study performed on du and lncap human prostate cancer cells show that upregulated microrna- induces proliferation due to downregulation of the foxo transcription factor . in addition, mir- , which is highly induced in mature activated t and b cells and in treg cells, has recently been shown to target foxo in t cells, but it remains to be shown whether foxo via mir- contributes to the observed phenotypes in b and t lymphocytes . new strategies to modulate foxo expression may now be developed since the discovery of mirnas targeting foxo transcription factors . however, some difficulties may appear in the mirna-based therapeutic manipulation of foxo transcription factors. for example, a single mirna may target hundreds of genes among foxos, and the therapeutic manipulation of a specific mirna could have unanticipated adverse effects by influencing whole gene networks, while having only moderate effects on foxo genes . since delivery of mir- mimics could induce lymphoproliferative disease or other forms of cancer by systemic repression of foxo transcription factors, another challenge in the mirna-foxo therapeutic manipulation is the specific delivery of the mirna into the target cell, in order to avoid adverse effects in other cell types or tissues . several micrornas were identified as being targets of foxo transcription factors. an akt -foxo -mir- d signaling pathway was recently identified. after inhibition of akt, activated foxo leads to upregulation of mir- d. mir- d acts as a tumor suppressor in renal cell carcinoma, further inhibiting the oncoprotein metadherin (mtdh) . another recent study shows that foxo stimulates the expression of a microrna cluster located on a x chromosome, in a direct manner, dependent on rna polymerase ii, but not on the de novo protein synthesis. thus, foxo upregulates mir- , mir- , mir- c, mir- a- , mir- a- . also, the same study shows that inhibition of pi k-akt axis in lncap and mcf human carcinoma cell lines is followed by increased mir- . as suggested by authors, mirnas could be valuable biomarkers of foxo activity . a study performed on primary cultures of neural stem/progenitor cells (nspcs) from adult mice show that the expression levels of the mir- b~ cluster members (mir- b, mir- , and mir- ) is modulated by foxo transcription factors in a complex manner . the precursors of the mir- b~ cluster members are located on mcm gene. foxo directly binds to the first intron of this gene, modulating the expression of the micrornas. thus, foxo transcription factors could be an important tool in preventing the loss of neurogenesis during aging . further studies are needed to completely understand the regulation of micrornas expression by activated foxo proteins. foxo transcription factors and tumor suppressors are ubiquitously expressed in the human body, with some specific differences between its members. as resumed above, foxo proteins are characterized by a remarkable functional diversity, being implicated in regulation of many critical cellular functions, such as cell cycle arrest, apoptosis, oxidative detoxification, dna damage repair, stem cell maintenance, cell differentiation, cell metabolism, angiogenesis, cardiac development, aging and others. foxo proteins play an important role not only during physiological cellular processes, but also in few pathologies, such as cancer. they are well known tumor suppressors proteins. although foxo proteins protect the human body by playing a central role in a wide range of mainly physiological functions, under some circumstances, foxo's roles can become harmful for the human cells. this is because foxo is also involved in a number of pathological functions, such as inflammation, muscle atropy, and a number of physiological functions that become harmful for the organism. for example, apoptosis places foxo proteins on the good side when it leads to tumor suppression, but in some cases cellular apoptosis can become itself a significant component for pathology in diseases such as neurodegenerative disease, diabetes mellitus (dm), and cardiovascular injury. interestingly, while excessive foxo levels induce cell cycle arrest and cell death, complete knock-down of foxos leads to cell cycle progression impairment, suggesting that certain levels of foxo activation are required for cell cycle progression. moreover, foxo proteins play a variety of roles in the cells dependending on the context. in deciphering the role of forkhead transcription factors in cancer therapy foxo tumor suppressors and bcr-abl-induced leukemia: a matter of evasion of apoptosis snapshot: forkhead transcription factors i sly as a foxo": new paths with forkhead signaling in the brain the homeotic gene fork head encodes a nuclear protein and is expressed in the terminal regions of dynamic foxo transcription factors forkhead transcription factors contribute to execution of the mitotic programme in mammals inflammatory arthritis requires foxo a to prevent fas ligandinduced neutrophil apoptosis foxo transcription factors and stem cell homeostasis: insights from the hematopoietic system foxos attenuate bone formation by suppressing wnt signaling foxo is an essential regulator of pluripotency in human embryonic stem cells the fork head transcription factor daf- transduces insulin-like metabolic and longevity signals in c. elegans insulin regulated hepatic gluconeogenesis through foxo -pgc- alpha interaction the forkhead transcription factor foxo links insulin signaling to pdx regulation of pancreatic beta cell growth forkhead transcription factor foxo in adipose tissue tissue regulates energy storage and expenditure forkhead protein foxo mediates agrpdependent effects of leptin on food intake deletion of hepatic foxo / / genes in mice significantly impacts on glucose metabolism through downregulation of gluconeogenesis and upregulation of glycolysis foxo mediates insulin action on apoc-iii and triglyceride metabolism hepatic foxos regulate lipid metabolism via modulation of expression of the nicotinamide phosphoribosyltransferase gene minibrain/dyrk a regulates food intake through the sir -foxo-snpf/npy pathway in drosophila and mammals foxo regulates peroxiredoxin iii expression in human cardiac fibro blasts regulation of sterol carrier protein gene expression by the forkhead transcription factor foxo a dna repair pathway stimulated by the forkhead transcription factor foxo a through the gadd protein daf- /foxo directly regulates an atypical amp-activated protein kinase gamma isoform to mediate the effects of insulin/igf- signaling on aging in caenorhabditis elegans down-regulation of a forkhead transcription factor, foxo a, accelerate s cellular senescence in human dermal fibroblasts down-regulation of manganese-superoxide dismutase through phosphorylation of foxo a by akt in explanted vascular smooth muscle cells from old rats akt negatively regulates the in vitro lifespan of human endothelial cells via a p /p -dependent pathway exercise training promotes sirt activity in aged rats lymphocyte signaling: regulation of foxo transcription factors by micrornas an essential role of the forkhead-box transcription factor foxo in control of t cellhomeostasis and tol erance high glucose induces suppression of insulin signalling and apoptosis via upregulation of endogenous il- beta and suppressor of cytokine signalling- in mouse pancreatic beta cells high glucose induced oxidative stress and apoptosis in cardiac microvascular endothelial cells are regulated by foxo a apo ligand/tumor necrosis factor-related apoptosis-inducing ligand in prostate cancer therapy common mechanism for oncogenic activation of mll by forkhead family proteins applications of post-translational modifications of foxo family proteins in biological functions foxos are lineage-restricted redundant tumor suppressors and regulate endothelial cell homeostasis decreased expression of the foxo a gene is associated with poor prognosis in primary gastric adenocarcinoma patients the dna damage repair protein ku interacts with foxo to coordinate a conserved cellular stress response therapy-resistant acute lymphoblastic leukemia (all) cells inactivate foxo to escape apoptosis induction by trail and noxa induction of mxi -sr alpha by foxo a contributes to repression of myc-dependent gene expression combination of bortezomib and mitotic inhibitors down-modulate bcr-abl and efficiently eliminates tyrosine-kinase inhibitor sensitive and resistant bcr-abl-positive leukemic cells foxo increased pro-inflammatory gene expression by inducing c/ ebpbeta in tnf-alpha-treated adipocytes foxo links insulin resistance to proinflammatory cytokine il beta production in macrophages inhibition of foxo transcriptional activity prevents muscle fiber atrophy during cachexia and induces hypertrophy foxo controls autophagy in skeletal muscle in vivo foxo coordinately activates protein degradation by the autophagic/lysosomal and proteasomal pathways in atrophying muscle cells inhibition of p alpha unveils an ampk-foxo a axis linking autophagy to cancer-specific metabolism foxo a inhibits cardiomyocyte hypertrophy through transactivating catalase regulation of myostatin expression and myoblast differentiation by foxo and smadtranscription factors inhibition of foxo function is associated with improved fasting glycemia in diabet ic mice foxo transcription factors activate akt and attenuate insulin signaling in heart by inhibiting protein phosphatases haplotypes in the human foxo a and foxo a genes; impact on disease and mortality at old age abnormal angiogenesis in foxo (fkhr)-deficient mice multifaceted link between cancer and inflammation constitutive phosphorylation of the foxo transcription factor in gastric cancer cells correlates with microvessel area and the expressions of angiogenesis -related molecules foxo transcription factors promote cardiomyocyte survival upon induction of oxidative stress forkhead transcription factors inhibit vascular smooth muscle cell proliferation and neointimal hyperplasia expanded granulocyte/monocyte compartment in myeloid-specific triple foxo knockout increases oxidative stress and accelerates atherosclerosis in mice mutational screening of foxo a and foxo a in women with premature ovarian failure the pi- kinase pathway in hematopoietic stem cells and leukemia-initiating cells: a mechanistic difference between normal and cancer stem cells mortality pattern suggest lack of senescence in hydra foxos: signalling integrators for homeostasis maintenance modulators of sensitivity and resistance to inhibition of pi k identified in a pharmacogenomic screen of the nci- human tumor cell line collection global proteome analysis of the nci- cell line panel biogps: an extensible and customizable portal for querying and organizing gene annotation resources biogps and mygene.info: organizing online, gene-centric information a gene atlas of the mouse and human protein-encoding transcriptomes control of foxo gene expression by co-activator p braf mutation-specific promoter methylation of fox genes in colorectal cancer tissular and soluble mirnas for diagnostic and therapy improvement in digestive tract cancers immune-related biomarkers for diagnosis/prognosis and therapy monitoring of cutaneous melanoma mir- is a negative regulator of osteoblast proliferation, differentiation, and skeletogenesis through targeting foxo mir- regulates foxo -mediated cell apoptosis in bladder cancer unregulated mir- induces cell proliferation in human breast cancer by downregulating transcriptional factor foxo a foxo is a tumor suppressor in classical hodgkin lymphoma upregulation of mircorna- induces proliferation in human prostate cancer cells by downregulating the transcription factor foxo akt/foxo pathway in renal cell carcinoma foxo regulates expression of a microrna cluster on x chromosome the microrna cluster mir- b~ regulates adult neural stem/progenitor cell proliferation and neuronal differentiation key: cord- -hzqayt n authors: chen, jianlin; liu, xiaorong; chen, jianhan title: targeting intrinsically disordered proteins through dynamic interactions date: - - journal: biomolecules doi: . /biom sha: doc_id: cord_uid: hzqayt n intrinsically disordered proteins (idps) are over-represented in major disease pathways and have attracted significant interest in understanding if and how they may be targeted using small molecules for therapeutic purposes. while most existing studies have focused on extending the traditional structure-centric drug design strategies and emphasized exploring pre-existing structure features of idps for specific binding, several examples have also emerged to suggest that small molecules could achieve specificity in binding idps and affect their function through dynamic and transient interactions. these dynamic interactions can modulate the disordered conformational ensemble and often lead to modest compaction to shield functionally important interaction sites. much work remains to be done on further elucidation of the molecular basis of the dynamic small molecule–idp interaction and determining how it can be exploited for targeting idps in practice. these efforts will rely critically on an integrated experimental and computational framework for disordered protein ensemble characterization. in particular, exciting advances have been made in recent years in enhanced sampling techniques, graphic processing unit (gpu)-computing, and protein force field optimization, which have now allowed rigorous physics-based atomistic simulations to generate reliable structure ensembles for nontrivial idps of modest sizes. such de novo atomistic simulations will play crucial roles in exploring the exciting opportunity of targeting idps through dynamic interactions. proteins are central components of regulatory networks that dictate virtually all aspects of cellular decision-making [ ] . demand for more sophisticated signaling in complex multicellular organisms has been met with increasing utilization of proteins that are highly flexible [ ] [ ] [ ] . in particular, so-called intrinsically disordered proteins (idps) account for~ % of signaling-associated proteins in eukaryotes [ ] . these proteins have lower sequence complexity compared to folded proteins, lacking large hydrophobic residues and enriched with charged and polar ones [ ] . they do not have stable tertiary structures in the unbound state under physiological conditions, even though they frequently undergo folding transitions upon binding to specific targets [ ] . the inherent thermodynamic instability of the structural features of this class of proteins allows their conformational properties to respond sensitively to numerous stimuli, including the binding of various small and large molecules, changes in cellular environments (e.g., ph), and post-translational modifications [ ] [ ] [ ] [ ] [ ] [ ] . multiple signals could also be naturally integrated through cooperative responses of the dynamic structure ensemble biomolecules , , of (such as coupled binding and folding) [ ] . these properties make idps uniquely suitable for fulfilling the complex signaling need of higher organisms. at the same time, deregulation of idps has been associated with many human diseases, including cancers, neurodegenerative diseases, heart disease, and diabetes [ , [ ] [ ] [ ] [ ] [ ] [ ] . for example, over two-thirds of cancer-associated proteins have been predicted to contain extensive regions of intrinsic disorder [ ] , and predicted disordered regions have been estimated to house almost one quarter of disease-associated missense mutations [ ] . there is thus tremendous interest in determining if and how idps may be targeted for therapeutic purposes. the dynamic and heterogeneous nature of unbound idps presents substantial challenges for characterization and this has proven to be a major bottleneck for establishing a reliable sequence-structure-function-disease relationship of idps [ , [ ] [ ] [ ] [ ] [ ] . the lack of a clear understanding of the molecular basis of idp function and deregulation in diseases has created significant ambiguity on the druggability of most idps, including transcription factors [ ] . most existing case studies of targeting idps have focused on extending the traditional structure-based screening and drug design strategies and emphasize exploiting residual structures and pre-existing potential binding pockets of the unbound state [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] . nonetheless, it is clear that the disordered nature of idps would require novel strategies for targeting as well as new conceptual frameworks for thinking about how small molecule binding could modulate idp structure and function. in particular, it has been recognized that it may be more useful to consider the problem of targeting idps in the context of structural ensemble modulation [ ] , even though it is generally believed that one still needs to achieve specific interactions, such as by exploiting pre-existing structural features [ ] . many outstanding reviews have already been dedicated towards existing examples along these lines and they also provide extensive discussion of the successes, opportunities, and challenges of targeting idps via specific interactions of small molecules in neurodegenerative diseases, cancers, and other diseases [ , [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] . in this review, we will first summarize important recent advances in physics-based de novo simulations of disordered protein ensembles, including graphic processing unit (gpu) computing, enhanced sampling, and re-balanced protein force fields, and then focus on emerging examples that suggest the exciting possibility of targeting idps by directly modulating the disordered ensembles through dynamic and transient interactions. we will discuss the promise of such a broader view of how idps may be targeted as well as key challenges and required methodological developments to support targeting idps via dynamic interactions. a principal challenge in understanding the druggability and best targeting strategy of idps resides in the difficulty of detailed characterization of disordered protein states [ , , , ] . these states need to be represented using heterogeneous structure ensembles and are not amenable to traditional high-resolution structure determination methods. here, we briefly discuss the current status and challenges of disordered protein ensemble determination, which has a direct impact on the ability to devise effective strategies of designing idp binders and optimizing leads identified from traditional screening efforts. experimentally, a wide range of biophysical methods can be applied to characterize disordered protein states, including nmr, circular dichroism (cd), small-angle x-ray scattering (saxs), förster resonance energy transfer (fret), hydrogen/deuterium (h/d) exchange, mass spectrometry, and others [ , ] . these methods can provide complementary information on the local, intermediate, and long-range structural organizations of idps. nmr in particular is arguably the most powerful technique for structural studies of idp. many nmr observables can be measured at residue and atomic levels to infer secondary and tertiary structural properties. saxs and fret are highly complementary to nmr and provide information on the long-range global organization of the disordered ensemble. yet, a fundamental limitation is that these experimental measurements generally reflects the average properties, which alone are not sufficient to uniquely define the underlying heterogeneous ensemble due to the severely underdetermined nature of the structural calculation problem [ , , , [ ] [ ] [ ] . at present, the most robust methods generally involve first generating a large number of candidate random structures and then using experimental structural restraints to select and construct optimal sub-ensembles according to various statistical criteria [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] . nonetheless, these methods rely critically on the ability to generate initial candidate structures that are not only diverse enough to cover the range of accessible states of the protein but also specific enough to contain any nontrivial local (and long-range) structure features associated with a particular protein state. these two requirements are difficult to satisfy simultaneously for most proteins of moderate sizes (e.g., - residues or longer) and complexity. as a consequence, disordered ensembles constructed using these approaches depend critically on the underlying protein model and/or coil library. furthermore, these ensembles are generally not proper thermodynamic ensembles. they should be considered just as ensemble models and cannot be used to reliably quantify statistical properties and extract thermodynamic parameters. as such, these experimental restraint-based ensemble construction approaches are likely inadequate for capturing potentially subtle effects of ligand binding on the disordered ensemble. given the fundamental challenges of disordered ensemble modeling based on experimental restraints alone, physics-based atomistic simulations have a crucial role to play in helping elucidate the conformational properties of idps and establishing a reliable molecular basis of their function and regulation [ ] [ ] [ ] [ ] [ ] . a particularly attractive approach is to first generate the atomistic ensemble using a transferable physics-based force field in absence of any experimental restraints and then use the experimental data for independent evaluation of the quality of the simulated ensemble. such a de novo simulation approach can effectively overcome the under-determined nature of disordered ensemble calculation, by leveraging the laws of physics that govern the nature of conformational fluctuation of the protein. successful simulations of disordered protein ensembles have proven to be very challenging, requiring both accurate description of the conformational dependence of energy and sufficient sampling of relevant conformational space of the protein. early idp simulations suffered from both systematic biases in the general-purpose protein force fields and severe limitations in conformational sampling [ ] . nonetheless, these physics-based simulation approaches promise to provide rigorous thermodynamic ensembles required for reliable description of idp-ligand interactions. the accuracy and capability of de novo idp simulations can be expected to improve continuously over time, benefiting from robust advances in molecular simulation methodologies and high-performance computing hardware. indeed, important breakthroughs have been made in the last few years in force field accuracy and sampling capability, which arguably have now allowed reliable de novo simulations of at least moderate-sized idps in general. one of the most important recent advances in molecular dynamics (md) simulations is the widespread availability of efficient gpu-enabled algorithms in virtually all major molecular simulation packages [ ] [ ] [ ] [ ] [ ] [ ] . modern gpus can process thousands of threads in parallel to accelerate explicit solvent md simulations by up to × compared to traditional cpu computing, significantly boosting the sampling capability. for example, the most efficient gpu-enabled md codes can yield - ns per day for systems of~ , , atoms on a single nvidia rtx ti gpu card that costs onlỹ $ . the ability to efficiently sample the protein conformational space has further benefitted from the emergence of enhanced sampling techniques [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] , particularly various replica exchange (rex)-based methods. among the various rex methods, replica exchange with solute tempering methods (rest) [ , ] is particularly suitable for atomistic simulations of idps in explicit solvent. in rest, a selected region of the system (e.g., the protein solute or a flexible segment of the protein) is subjected to tempering (i.e., random walk in the temperature space) while the rest of the system is maintained at a constant temperature (e.g., room temperature). this is achieved by scaling the interactions within the selected region and between it and the rest of the system. because the number of required replicas of replica exchange simulations scales as the square root of the number of atoms, rest can significantly reduce the number of replicas required for covering the needed temperature space (by~ -fold), thus overcoming a critical drawback in traditional temperature rex simulation in explicit solvent. the role of enhanced sampling in the recent successes of simulating dynamic idp-ligand interactions will become evident in the later sections of this review. the importance of enhanced sampling for the generation of converged ensembles cannot be over emphasized. for example, figure a shows the evolution of the per-residue β-sheet structure during a -µs conventional md simulation of an intrinsically disordered aβ peptide in explicit solvent at k, performed with anton specialized hardware using the well optimized a sb-disp force field [ ] . note that this trajectory is one to two orders of magnitude longer than typical md simulations performed using general purpose cpu-or gpu-based high-performance computing platforms. yet, very few reversible transitions are observed. for example, a transient β-hairpin spanning residues to persists for about µs from~ to µs and never appears again for the rest of the simulation. as a result, the final average residue helix and β-sheet probability profiles calculated from the first and second halves of the trajectory differ greatly, reflecting a very limited level of convergence. there is thus danger in relying on standard md simulations in deriving quantitative characterizations of disordered protein ensembles. it should also be emphasized that achieving a sufficient level of convergence required for resolving potentially subtle changes in the disordered ensemble, such as upon ligand binding, can be extremely challenging even with enhanced sampling. it is critical to carefully analyze and establish the level of convergence for proper interpretation of simulated ensembles. ideally, one should perform two or more independent simulations using distinct initial conformations and compare the resulting ensembles. the simulated ensemble from a single continuous run may appear to stop changing with respect to simulation time due to trapping in numerous local energy minima, giving rise to a misleading impression of convergence. biomolecules , , x of temperature space (by ~ -fold), thus overcoming a critical drawback in traditional temperature rex simulation in explicit solvent. the role of enhanced sampling in the recent successes of simulating dynamic idp-ligand interactions will become evident in the later sections of this review. the importance of enhanced sampling for the generation of converged ensembles cannot be over emphasized. for example, figure a shows the evolution of the per-residue β-sheet structure during a -μs conventional md simulation of an intrinsically disordered aβ peptide in explicit solvent at k, performed with anton specialized hardware using the well optimized a sb-disp force field [ ] . note that this trajectory is one to two orders of magnitude longer than typical md simulations performed using general purpose cpu-or gpu-based high-performance computing platforms. yet, very few reversible transitions are observed. for example, a transient β-hairpin spanning residues to persists for about μs from ~ to μs and never appears again for the rest of the simulation. as a result, the final average residue helix and β-sheet probability profiles calculated from the first and second halves of the trajectory differ greatly, reflecting a very limited level of convergence. there is thus danger in relying on standard md simulations in deriving quantitative characterizations of disordered protein ensembles. it should also be emphasized that achieving a sufficient level of convergence required for resolving potentially subtle changes in the disordered ensemble, such as upon ligand binding, can be extremely challenging even with enhanced sampling. it is critical to carefully analyze and establish the level of convergence for proper interpretation of simulated ensembles. ideally, one should perform two or more independent simulations using distinct initial conformations and compare the resulting ensembles. the simulated ensemble from a single continuous run may appear to stop changing with respect to simulation time due to trapping in numerous local energy minima, giving rise to a misleading impression of convergence. the dramatically improved sampling capability has facilitated extensive efforts to reparametrize general-purpose protein force fields to achieve greater balance of describing protein conformational equilibria. studies of disordered protein states have been a key driver of these developments and several well-characterized idps have been widely used as training systems and/or benchmarks for force field optimization [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] . many force field variants have been developed in recent years, including amber ff sb [ ] , ff sb*-ildn [ ] and variants [ , ] , ff ws [ ] , ff sb [ ] , ff sbnmr [ ] , charmm * [ ] , charmm m (and c mw) [ ] , a sb-disp [ ] , and others. a key focus of these optimization efforts has been to rebalance the protein-protein, protein-water, and water-water interactions. earlier versions of general-purpose protein force fields consistently over-stabilize nonspecific protein-protein interactions and lead to overly compact conformational ensembles for disordered protein states [ , ] . it was demonstrated that such bias could be effectively compensated by directly increasing the strengths of protein-water dispersion interactions [ , , ] , even though other components of the force field should also be reparametrized for self-consistency. the latest charmm m and a sb-disp force fields, in particular, have been systematically reparametrized based on extensive simulations of tens of globular and disordered proteins and achieve impressive levels of accuracy for describing both structured and unstructured proteins. in a recent benchmark study, six of the latest protein force fields were evaluated using the -residue n-terminal transactivation domain (tad) of tumor suppressor p , which is a very challenging system due to its size and complex conformational features [ ] . it has been extensively characterized by nmr, saxs, and single-molecule fret and shown to contain a range of nontrivial local and long-range residual structures [ ] . the disordered ensemble of p -tad was calculated using rest -enhanced sampling using gpu-accelerated gromacs . . [ , ] patched with plumed . . [ ] [ ] [ ] . each rest simulation lasted . µs per replica, representing one of the most extensive atomistic simulations of idps of similar sizes. the results show that the ensembles generated using the force field a sb-disp yield the best agreement with the experimental data at both secondary and tertiary structure levels. for example, the back-calculated nmr paramagnetic relaxation enhancement (pre) effects are highly consistent with the experimental results for all four available labelling sites (figure ). this suggests that the simulated ensembles not only have the proper overall chain dimension but also recapitulate much of the transient long-range ordering within the unbound state of p -tad. the latter is an extremely challenging task. the fact that this could be achieved by a sb-disp represents an exciting breakthrough, suggesting that de novo atomistic simulations are now ready to provide a reliable approach for detailed characterization of the disordered ensembles of at least moderately sized idps. the dramatically improved sampling capability has facilitated extensive efforts to reparametrize general-purpose protein force fields to achieve greater balance of describing protein conformational equilibria. studies of disordered protein states have been a key driver of these developments and several well-characterized idps have been widely used as training systems and/or benchmarks for force field optimization [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] . many force field variants have been developed in recent years, including amber ff sb [ ] , ff sb*-ildn [ ] and variants [ , ] , ff ws [ ] , ff sb [ ] , ff sbnmr [ ] , charmm * [ ] , charmm m (and c mw) [ ] , a sb-disp [ ] , and others. a key focus of these optimization efforts has been to rebalance the protein-protein, protein-water, and water-water interactions. earlier versions of general-purpose protein force fields consistently over-stabilize nonspecific protein-protein interactions and lead to overly compact conformational ensembles for disordered protein states [ , ] . it was demonstrated that such bias could be effectively compensated by directly increasing the strengths of protein-water dispersion interactions [ , , ] , even though other components of the force field should also be reparametrized for selfconsistency. the latest charmm m and a sb-disp force fields, in particular, have been systematically reparametrized based on extensive simulations of tens of globular and disordered proteins and achieve impressive levels of accuracy for describing both structured and unstructured proteins. in a recent benchmark study, six of the latest protein force fields were evaluated using the residue n-terminal transactivation domain (tad) of tumor suppressor p , which is a very challenging system due to its size and complex conformational features [ ] . it has been extensively characterized by nmr, saxs, and single-molecule fret and shown to contain a range of nontrivial local and long-range residual structures [ ] . the disordered ensemble of p -tad was calculated using rest -enhanced sampling using gpu-accelerated gromacs . . [ , ] patched with plumed . . [ ] [ ] [ ] . each rest simulation lasted . μs per replica, representing one of the most extensive atomistic simulations of idps of similar sizes. the results show that the ensembles generated using the force field a sb-disp yield the best agreement with the experimental data at both secondary and tertiary structure levels. for example, the back-calculated nmr paramagnetic relaxation enhancement (pre) effects are highly consistent with the experimental results for all four available labelling sites (figure ). this suggests that the simulated ensembles not only have the proper overall chain dimension but also recapitulate much of the transient long-range ordering within the unbound state of p -tad. the latter is an extremely challenging task. the fact that this could be achieved by a sb-disp represents an exciting breakthrough, suggesting that de novo atomistic simulations are now ready to provide a reliable approach for detailed characterization of the disordered ensembles of at least moderately sized idps. . calculated (lines) and experimental (grey bars) nmr paramagnetic relaxation enhancement (pre) effects induced by paramagnetic spin labelling at residues d , e , a , and d of p -tad. red and green traces were calculated from an independent control and folding rest simulations of p -tad in a sb-disp, respectively, to evaluate the level of convergence. control and folding simulations were initiated from helical and fully unstructured structures, respectively, and the length of rest simulations were μs per replica. this figure was adapted from [ ] . see [ ] for details on the simulation and analysis. red and green traces were calculated from an independent control and folding rest simulations of p -tad in a sb-disp, respectively, to evaluate the level of convergence. control and folding simulations were initiated from helical and fully unstructured structures, respectively, and the length of rest simulations were µs per replica. this figure was adapted from [ ] . see [ ] for details on the simulation and analysis. we note that implicit solvent protein force fields have also been developed and deployed for atomistic simulations of idps with various levels of success [ , , , ] . implicit treatment of solvent reduces the simulation system size~ -fold by direct estimation of the solvation free energy. it could provide important advantages for satisfying the simultaneous requirements of adequate sampling and sufficient force field accuracy for simulating disordered protein states. the absinth model, in particular, has demonstrated significant successes in mapping the sequence-conformational space relationship of idps [ ] . an improved version named absinth-c was recently developed by including the backbone torsion cross-terms optimized based on experimentally derived statistics [ ] . independently, the generalized born with the molecular volume (gmmv ) model, which is considered one of the most accurate implicit solvent models, was recently implemented on gpu within the charmm/openmm interface [ ] . gpu-accelerated gbmv is about -fold faster and provides a competitive alternative to explicit solvent simulations for studying the idp structure and interaction. this model was previously optimized based on enhanced sampling of model peptides and shown to be capable of accurately describing the conformational properties of both folded and unfolded peptides [ ] . the development of the gpu-accelerated version thus removed a key bottleneck to broader application of gbmv for atomistic simulations of the idp structure and interactions. nonetheless, whether these implicit solvent models could provide a viable alternative to explicit solvent simulations in studies of idp-ligand interactions is yet to be demonstrated. to date, small molecular targeting of idps has mostly focused on proteins involved in neurodegenerative diseases, such as amyloid β (aβ) peptides, α-synuclein, and tau protein, and disordered regions of cancer-associated transcription factors, such as p , c-myc, ews-fli , klf , and others [ , ] . advances in experimental and computational methods for studying disordered protein ensembles have allowed examination in greater details the interactions between idps and small molecules. there has been an emergence of examples showing that small molecules could modulate the disordered ensemble itself through nonspecific and dynamic interactions and achieve specific functional effects, in complete contrast to the traditional paradigm of drug binding that emphasizes strong specific interactions. such a dynamic mode of idp-small molecule interactions is reminiscent of "fuzzy complexes" in protein-protein interactions involving idps [ ] [ ] [ ] . the observation that small molecules could induce substantial effects through dynamic interactions is fascinating and suggests a broader and more effective strategy for targeting idps in general. [ , ] . nmr, cd, and fluorescence studies initially suggested that these inhibitors bound specifically to multiple independent sites in the monomeric and disordered c-myc, which induced max-binding-incompatible conformations to disrupt the c-myc-max interaction [ ] . subsequent explicit solvent simulations by michel and cuchillo in [ ] revealed that the interaction between c-myc and one of the inhibitors, -f , was actually dynamic and involved many short-lived contacts. such a dynamic and nonspecific nature of the interaction could explain the observation that small modifications to the ligand had limited effects on the binding affinity [ ] . a similar mode of interaction was proposed by jin et al. in [ ] for another c-myc inhibitor, -a , where atomistic simulations in explicit and implicit solvent force fields revealed the ligand to form a "ligand cloud" and interact dynamically with the disordered "protein cloud". importantly, the dynamic nature of the interaction between -f and c-myc - can confer sequence specificity, even though the interaction involves many transient contacts instead of well-defined specific ones [ ] . it has been further suggested that such dynamic interactions may be driven by entropic expansion of the idp conformational space [ ] . indeed, thermodynamics analysis showed that binding of -f to c-myc - was dominated by the entropic contribution (− . ± − . kj/mol out of the total binding free energy of − . ± − . kj/mol) [ ] . the caveat, however, is that binding of hydrophobic ligands is generally associated with large entropic contributions due to the release of restricted water molecules near the hydrophobic surface. it is thus not clear if c-myc indeed undergoes entropic expansion upon -f binding. an in-depth nmr study of the interaction of a small molecule with intrinsically disordered p kip found little evidence of ligand-induced conformational space expansion [ ] . instead, ligand binding was shown to mainly shift the populations of pre-existing states. dynamic interactions were also found to underlie the mechanism of α-synuclein aggregation inhibition by analogs of cyclized nordihydroguaiaretic acid (cndga) [ ] . the structural basis of cndga inhibition was characterized using an array of biochemical and biophysical methods, including nmr and fluorescence correlation spectroscopy. the results revealed that cndga induced modest compaction of the conformational ensemble of monomeric α-synuclein, apparently mediated by dynamic and transient interactions with the protein and without hindering membrane association. cndga-treated α-synuclein is resistant to aggregation even when seeded with α-synuclein aggregates. importantly, cndga was further shown to be effective in reducing α-synuclein-driven neurodegeneration in c. elegans. the observation that dynamic interactions between a small molecule and idp could be functionally effective both in vitro and in vivo is very encouraging and supports the promise of targeting idps using dynamic interactions for therapeutics. induced compaction of the disordered conformational ensemble has also been predicted to underlie the inhibition mechanism of two drugs under clinical trials for treating alzheimer's diseases [ ] . the disordered ensembles of aβ with and without tramiprosate (homotaurine; ht) and scyllo-inositol (si) were calculated using rest simulations in the charmm m force field that lasted µs per replica, making them the most extensive atomistic simulations of aβ to date. the resulting ensembles are well converged and appear consistent with the nmr chemical shifts. comparing the ensembles with and without the ligand showed that both ht and si mainly reduced the β propensity in the c-terminal region with minimal secondary structure perturbation in the rest of the peptide. intriguingly, both ht and si were found to induce modest compaction of the conformational ensemble, particularly in the c-terminal segment that is known to be important for amyloid fibril formation (figure a-c) . detailed analysis further revealed that the effects of both ht and si binding were achieved via dynamic and nonspecific interactions with various backbone and sidechain moieties of the peptide. it is noteworthy that the conformational modulation effects of both ht and si can be very difficult to detect at the ensemble level using bulk measurements, highlighting a critical role for reliable atomistic simulations that leverage recent advances in both protein force field quality and sampling capability. nonetheless, additional validation is needed to support the predicted conformational shifts induced by drugs and to establish the roles of such conformational changes in the mechanisms of drug action. fda-approved drugs and identified compounds that could bind nupr , a multi-functional idp involved in pancreatic ductal adenocarcinoma [ , ] . nmr chemical shift analysis suggested that nupr remained disordered in complex with all compounds, which is consistent with an inhibition mechanism involving transient and dynamic idp-ligand interactions. importantly, these compounds showed efficacy in cell-based assays, the most effective of which was found to completely arrest tumor growth in a mouse model. . the conformational space of aβ is projected onto the number of backbone hydrogen bonds and end-to-end distance and that of p -tad is projected onto the first two principal components. the conformational ensembles were calculated using long timescale rest simulations in explicit solvent ( and μs per replica for aβ and p -tad, respectively). representative conformations are shown in backbone traces. this figure was adapted from [ , ] . see [ , ] for details on the simulation and analysis. idps have remained an extremely challenging class of proteins to target using small molecules. albeit limited, successful inhibitors have been discovered and designed for several idps involved in cancers and neurodegenerative diseases, suggesting that idps are not undruggable. nonetheless, the unstructured and dynamic nature of idps is distinct from typical protein targets with well-defined binding pockets. it requires new conceptional frameworks to guide the development of novel strategies for discovering and designing small molecules that can modulate idp functions. traditional structure-based screening and lead optimization strategies are clearly inadequate, even though some success has been demonstrated in deploying existing tools to identify possible binding pockets and target pre-existing structural elements. such structural elements are lightly populated and generally too small to harbor significant pockets for small molecular binding that relies on specific interactions to achieve high affinity. in fact, there is a great uncertainty on whether highaffinity binding to idps is feasible with small molecules (e.g., to meet the typical industrial standard of dissociation constants of nm or lower). it is encouraging that examples are emerging that small molecules could modulate the idp ensembles entirely through dynamic nonspecific interactions. importantly, there is evidence that high-affinity binding may not be necessary to induce functional responses in vitro and in vivo. this may reflect a fundamental nature of how idps mediate function in biology, in that the disordered ensemble of an idp is poised to respond sensitively to a wide array figure . conformational ensembles of aβ with and without the ligands (a-c) and p -tad with and without ligands (d,e). the conformational space of aβ is projected onto the number of backbone hydrogen bonds and end-to-end distance and that of p -tad is projected onto the first two principal components. the conformational ensembles were calculated using long timescale rest simulations in explicit solvent ( and µs per replica for aβ and p -tad, respectively). representative conformations are shown in backbone traces. this figure was adapted from [ , ] . see [ , ] for details on the simulation and analysis. de novo atomistic simulations have also been integrated with nmr and biophysical experiments to examine how an anticancer drug, epigallocatechin gallate (egcg), modulates the disordered unbound state of p -tad [ ] . egcg is a major active ingredient of green tea and has been reported to have anticancer effects in both animal studies and clinical trials [ ] [ ] [ ] . the results suggested that egcg also interacted dynamically with p -tad through numerous transient and nonspecific interactions, which appeared consistent with nmr chemical shift titration results. multiple hydrophobic and particularly aromatic sidechains contribute significantly to egcg binding. the dynamic interaction with egcg was predicted to induce significant conformational compaction of p -tad in the n-terminal region (figure d ,e), which appears consistent with the saxs measurements. the compaction could shield the p -tad site required for interacting with mdm and thus inhibit p degradation to promote its anticancer activities. it is noteworthy that egcg has also been shown to be active in inhibiting the aggregation of multiple proteins, including aβ peptides and α-synuclein [ , ] . the implication is that dynamic interactions of egcg could provide a molecular basis for promiscuous selectivity, which is a fascinating property to investigate further using a combination of experiments and simulation. recently, neira et al. screened a set of fda-approved drugs and identified compounds that could bind nupr , a multi-functional idp involved in pancreatic ductal adenocarcinoma [ , ] . nmr chemical shift analysis suggested that nupr remained disordered in complex with all compounds, which is consistent with an inhibition mechanism involving transient and dynamic idp-ligand interactions. importantly, these compounds showed efficacy in cell-based assays, the most effective of which was found to completely arrest tumor growth in a mouse model. idps have remained an extremely challenging class of proteins to target using small molecules. albeit limited, successful inhibitors have been discovered and designed for several idps involved in cancers and neurodegenerative diseases, suggesting that idps are not undruggable. nonetheless, the unstructured and dynamic nature of idps is distinct from typical protein targets with well-defined binding pockets. it requires new conceptional frameworks to guide the development of novel strategies for discovering and designing small molecules that can modulate idp functions. traditional structure-based screening and lead optimization strategies are clearly inadequate, even though some success has been demonstrated in deploying existing tools to identify possible binding pockets and target pre-existing structural elements. such structural elements are lightly populated and generally too small to harbor significant pockets for small molecular binding that relies on specific interactions to achieve high affinity. in fact, there is a great uncertainty on whether high-affinity binding to idps is feasible with small molecules (e.g., to meet the typical industrial standard of dissociation constants of nm or lower). it is encouraging that examples are emerging that small molecules could modulate the idp ensembles entirely through dynamic nonspecific interactions. importantly, there is evidence that high-affinity binding may not be necessary to induce functional responses in vitro and in vivo. this may reflect a fundamental nature of how idps mediate function in biology, in that the disordered ensemble of an idp is poised to respond sensitively to a wide array of cellular signals to support signal transduction and cellular regulation [ ] . therefore, there is a great potential and promise for targeting idps through dynamic interactions with small molecules. it is noteworthy that idps have been found to play central roles in mediating liquid-liquid phase separation (llps) that underlies a range of cellular processes [ ] [ ] [ ] . how small molecules may modulate the equilibrium and properties of these biological condensates is essentially unknown at this point. however, it is conceivable that llps may also be ideally targeted using dynamic interactions with small molecules that modulate the conformational flexibility and preference of disordered regions, which in turn modify the multivalent interaction profiles and entropic contribution that affect the condensation process as well as the properties of the condensate itself. elucidating the molecular details of dynamic interactions between idps and small molecules will require further development and integration of new experimental and computational methodologies. the capability for reliable disordered ensemble characterization with and without small molecules will almost certainly be required for any future design and optimization strategy to discover drugs that target idps through dynamic interactions. this unfortunately remains a formidable task. bulk experimental measurements on average properties alone are not sufficient to uniquely define the disordered ensemble. the dynamic and transient nature of molecular contacts can be very difficult to detect and resolve experimentally [ , ] . leveraging significant recent advances in the protein force field quality, sampling techniques, and gpu computing, de novo atomistic simulations are now poised to help meet these challenges and play a pivotal role in establishing the molecular basis of dynamic idp-small molecule interactions. author contributions: data analysis, x.l.; writing-draft and edit, j.c. (jianhan chen); writing-review and edit, j.c. (jianlin chen) and x.l.; all authors have read and agreed to the published version of the manuscript. funding: this work is partially supported by national institutes of health (gm to jhc). targeting transcription factors in cancer-from undruggable to reality intrinsically unstructured proteins: re-assessing the protein structure-function paradigm intrinsically unstructured proteins flexible nets-the roles of intrinsic disorder in protein interaction networks intrinsic disorder in cell-signaling and cancer-associated proteins biomolecules sequence complexity of disordered protein coupling of folding and binding for unstructured proteins showing your id: intrinsic disorder as an id for recognition, regulation and cell signaling intrinsically unstructured proteins and their functions sending signals dynamically intrinsic disorder as a mechanism to optimize allosteric coupling in proteins ensemble allosteric model: energetic frustration within the intrinsically disordered glucocorticoid receptor a comprehensive ensemble model for comparing the allosteric effect of ordered and disordered proteins towards the physical basis of how intrinsic disorder mediates protein function intrinsically disordered proteins in human diseases: introducing the d- concept targeting intrinsically disordered transcription factors: changing the paradigm intrinsically disordered side of the zika virus proteome untapped potential of disordered proteins in current druggable human proteome intrinsic disorder associated with - - proteins and their partners intrinsic disorder here, there, and everywhere, and nowhere to escape from it disease mutations in disordered regions-exception to the rule? atomic-level characterization of disordered protein ensembles constructing ensembles for intrinsically disordered proteins intrinsically disordered proteins in a physics-based world relating sequence encoded information to form and function of intrinsically disordered proteins molecular recognition features (morfs) in three domains of life multiple independent binding sites for small-molecule inhibitors on the oncoprotein c-myc discovery of novel myc-max heterodimer disruptors with a three-dimensional pharmacophore model targeting the disordered c terminus of ptp b with an allosteric inhibitor sci rep a small molecule causes a population shift in the conformational landscape of an intrinsically disordered protein binding cavities and druggability of intrinsically disordered proteins presmo target-binding signatures in intrinsically disordered proteins identification of small-molecule binding pockets in the soluble monomeric form of the a beta peptide identification of small molecule inhibitors of tau aggregation by targeting monomeric tau as a potential therapeutic approach for tauopathies a fragment-based method of creating small-molecule libraries to target the aggregation of intrinsically disordered proteins structure-based inhibitor design for the intrinsically disordered protein c-myc a moving target: structure and disorder in pursuit of myc inhibitors conservation of potentially druggable cavities in intrinsically disordered proteins epi- , a compound active against castration-resistant prostate cancer, targets transactivation unit of the androgen receptor targeting the intrinsically disordered structural ensemble of alpha-synuclein by small molecules as a potential therapeutic strategy for parkinson's disease intrinsic disorder within akap fine-tunes anchored phosphatase activity toward substrates and drug sensitivity proteus: a random forest classifier to predict disorder-to-order transitioning binding regions in intrinsically disordered proteins structural ensemble modulation upon small-molecule binding to disordered proteins targeting intrinsically disordered proteins at the edge of chaos intrinsically disordered proteins: from sequence and conformational properties toward drug discovery drugs for 'protein clouds': targeting intrinsically disordered transcription factors intrinsically disordered proteins are potential drug targets intrinsically disordered proteins and novel strategies for drug discovery how to design a drug for the disordered proteins targeting disordered proteins with small molecules using entropy biomolecules druggability of intrinsically disordered proteins targeting protein-protein interactions (ppis) of transcription factors: challenges of intrinsically disordered proteins (idps) and regions (idrs) targeting intrinsically disordered proteins in rational drug discovery recent insights into the development of therapeutics against coronavirus diseases by targeting n protein pe-db: a database of structural ensembles of intrinsically disordered and of unfolded proteins intrinsically disordered proteins in cellular signalling and regulation intrinsically disordered proteins studied by nmr spectroscopy structural interpretation of paramagnetic relaxation enhancement-derived distances for disordered protein states principles of protein structural ensemble determination a critical assessment of methods to recover information from averaged data improved structural characterizations of the drkn sh domain unfolded state suggest a compact ensemble with native-like and non-native structure nmr characterization of long-range order in intrinsically disordered proteins structure of tumor suppressor p and its intrinsically disordered n-terminal transactivation domain recovering a representative conformational ensemble from underdetermined macromolecular structural data atomistic ensemble modeling and small-angle neutron scattering of intrinsically disordered protein complexes: applied to minichromosome maintenance protein atomistic modelling of scattering data in the collaborative computational project for small angle scattering (ccp-sas) combined monte carlo/torsion-angle molecular dynamics for ensemble modeling of proteins, nucleic acids and carbohydrates conformational propensities of intrinsically disordered proteins from nmr chemical shifts modeling intrinsically disordered proteins with bayesian statistics conformational space of flexible biological macromolecules from average data biomolecules simulations of disordered proteins and systems with conformational heterogeneity a preformed binding interface in the unbound ensemble of an intrinsically disordered protein: evidence from molecular simulations net charge per residue modulates conformational ensembles of intrinsically disordered proteins atomistic details of the disordered states of kid and pkid. implications in coupled binding and folding residual structures, conformational fluctuations, and electrostatic interactions in the synergistic folding of two intrinsically disordered proteins the biomolecular simulation program openmm : a reusable, extensible, hardware independent library for high performance molecular simulation high performance molecular simulations through multi-level parallelism from laptops to supercomputers scalable molecular dynamics with namd routine microsecond molecular dynamics simulations with amber on gpus. . generalized born accelerate sampling in atomistic energy landscapes using topology-based coarse-grained models scalable free energy calculation of proteins via multiscale essential sampling replica-exchange molecular dynamics method for protein folding replica exchange with solute tempering: a method for sampling biological systems in explicit water hamiltonian switch metropolis monte carlo simulations for improved conformational sampling of intrinsically disordered regions tethered to ordered domains of proteins a hybrid md-kmc algorithm for folding proteins in explicit solvent enhanced sampling and applications in protein folding in explicit solvent practically efficient and robust free energy calculations: double-integration orthogonal space tempering replica exchange with solute scaling: a more efficient version of replica exchange with solute tempering (rest ) developing a molecular dynamics force field for both folded and disordered protein states charmm m: an improved force field for folded and intrinsically disordered proteins improved peptide and protein torsional energetics with the opls-aa force field water dispersion interactions strongly influence simulated structural properties of disordered protein states optimization of the gbmv implicit solvent force field for accurate simulation of protein conformational equilibria optimizing solute-water van der waals interactions to reproduce solvation free energies improvements to the absinth force field for proteins based on experimentally derived amino acid specific backbone conformational statistics residual structures and transient long-range interactions of p transactivation domain: assessment of explicit solvent protein force fields comparison of multiple amber force fields and development of improved protein backbone parameters improved side-chain torsion potentials for the amber ff sb protein force field optimizing protein-solvent force fields to reproduce intrinsic conformational preferences of model peptides balanced protein-water interactions improve properties of disordered proteins and non-specific protein association improving the accuracy of protein side chain and backbone parameters from ff sb balanced amino-acid-specific molecular dynamics force field for the realistic simulation of both folded and disordered proteins how robust are protein folding simulations with respect to force field parameterization? optimization of the additive charmm all-atom protein force field targeting improved sampling of the backbone φ, ψ and side-chain χ and χ dihedral angles extending the treatment of backbone energetics in protein force fields: limitations of gas-phase quantum mechanics in reproducing protein conformational distributions in molecular dynamics simulations computational and theoretical advances in studies of intrinsically disordered proteins modulation of the disordered conformational ensembles of the p transactivation domain by cancer-associated mutations tackling exascale software challenges in molecular dynamics simulations with gromacs plumed : new feathers for an old bird on easy implementation of a variant of the replica exchange with solute tempering in gromacs hamiltonian replica exchange in gromacs: a flexible implementation balancing solvation and intramolecular interactions: toward a consistent generalized born force field accelerating the generalized born with molecular volume and solvent accessible surface area implicit solvent model using graphics processing units targeting the intrinsically disordered proteome using small-molecule ligands fuzzy complexes: polymorphism and structural disorder in protein-protein interactions potential conformational heterogeneity of p bound to s b(betabeta) dynamic multivalent interactions of intrinsically disordered proteins the impact of small molecule binding on the energy landscape of the intrinsically disordered protein c-myc improved low molecular weight myc-max inhibitors ligand clouds around protein clouds: a scenario of ligand binding with intrinsically disordered proteins sequence specificity in the entropy-driven binding of a small molecule and a disordered peptide cyclized ndga modifies dynamic alpha-synuclein monomers preventing aggregation and toxicity modulation of amyloid-beta conformation by small molecules through nonspecific binding modulation of p transactivation domain conformations by ligand binding and cancer-associated mutations epigallocatechin gallate (egcg) is the most effective cancer chemopreventive polyphenol in green tea primary cancer prevention by green tea, and tertiary cancer prevention by the combination of green tea catechins and anticancer compounds green tea extracts for the prevention of metachronous colorectal polyps among patients who underwent endoscopic removal of colorectal adenomas: a randomized clinical trial egcg remodels mature alpha-synuclein and amyloid-beta fibrils and reduces cellular toxicity green tea epigallocatechin- -gallate (egcg) reduces beta-amyloid mediated cognitive impairment and modulates tau pathology in alzheimer transgenic mice designing and repurposing drugs to target intrinsically disordered proteins for cancer treatment: using nupr as a paradigm identification of a drug targeting an intrinsically disordered protein involved in pancreatic adenocarcinoma considerations and challenges in studying liquid-liquid phase separation and biomolecular condensates polymer physics of intracellular phase transitions methods of probing the interactions between small molecules and disordered proteins characterization of the binding of small molecules to intrinsically disordered proteins this article is an open access article distributed under the terms and conditions of the creative commons attribution (cc by) license we thank d. e. shaw research for giving us access to the md trajectories of aβ . the authors thank chungwen liang for helpful discussions. we would also link to thank anonymous reviewers for their careful reading and insightful suggestions that have greatly improved both the content and writing of this review. the authors declare no conflict of interest. key: cord- -xgikbdns authors: Ühlein, e. title: Übersicht Über neue ernährungswissenschaftliche publikationen date: - - journal: z ernahrungswiss doi: . /bf sha: doc_id: cord_uid: xgikbdns nan shoe~taxer, w. c., yanof, h. m., tui~k, l. n., u. wilson, t.h.: glucose and fructose absorption in the unanesthetized dog. gastroenterology [ ] nr. , s. ff . ( s.) . * sieaei~, p. s., u. correia, iv] . j. : speed of resumption of eating following distraction in relation to number of hours food-deprivation. physiol. ree. [ ] nr. i, s. ff. ( s.) . si~eth, r. n., u. christie, it. j. : studies on the absorption of insulin from the gastrointestinal tract of the rabbit. diabetes [ ] nr. , s. ff. ( s.) . *st~oh~.x~r, g.: ~iessung der ksrperradioaktivitiit zur bestimmung der eisenresorption beim mensehen. nuclear med., suppl. [ ] s. ] . tallis, g. ] ~., moore, ~. w., u. gream, b. d. : specific gravity of live sheep. nature [ ] nr. , s. . tanaka, s., nu-kasawa, a., yoshikawa, h., u. yoneyama, y. : protein nutrition and radiation damage in mice. nature [ ] nr. , s. / . tho~tson, a. m., u. billv, wlcz, w. z. : nutritional status, maternal physique, and repro- ductive efficiency. prec. i~utrition see. [ ] nr. , s. / . toothill, j. : the effect of certain dietary factors on the apparent absorption of magnesium by the rat. brit. j. nutrition [ ] nr. , s. . *trail, j. c. m. : upgrading the indigenous poultry of uganda. i.: the growth rates and feed conversion from hatching to maturity of indigenous poultry crossed with four imported breeds. j. agrie. sei. {} [ ] nr. , s. ft. ( s.) . van i~iekerk, b. d. h., i~eid, j. t., be~s~uoun, a., u. paladines, . l.: urinary ereatinine as an index of body composition. j. nutrition [ ] nr. , s. / . voigtl~ndeg, h. : der rhythmus des brotessens. lebensmittel u. ern~hrung [ ] nr. , s. ] . wadsworth, g. r. : nutrition surveys: clinical signs and biochemical measurements. prec. l~utrition see. [ ] nr. , s. ] . watson, w. c., gordon, r. s., kx~en, a., u. jove~, a. : the absorption and excretion of castor oil in man. ft. pharmacy pharmacol. [ ] nr. , s. / . wkeel~r, r. r. : evaluation of various indicator techniques in estimating forage intake and digestibility by range cattle. diss. abstr. [ ] nr. , . williams, w. p., dawxs, r. e., u. couch, j. r. : the utilization of earotenoids by the hen and chick. poultry sei. [ ] nr. , s. ft. ( s.) . wolford, j. i-i., ringer, ~. k., col~_ai% t. h., u. zrsdel, h. c. : individual feed consumption of turkey breeder hens and the correlation of feed intake, body weight, and egg production. poultry sci. [ ] nr. , s. ff. ( s.) . youl~o, g. m., tensuan, r. s., saimt, f., u. hol~es, f. : estimating body fat of normal young woman. visualizing fat pads by soft-tissue x-rays. j. amer. dietetic assoc. [ ] nr. , s. ff . ( s.) . zuck~r, t. f., u. ziicker, l. hi. : fat accretion and growth in the rat. j. nutrition [ ] nr. , s. / . intermediary metabolism c - kohlenhydrate -garbohydrates +n. n. : enzymatic defects in the hereditary fructosurias. nutrition i~ev. [ ] i~r. , s. / . *agosi~, hi., scar~elli, n., di~amarca, iv] . l., u. arav~i~a, l. : intermediary carbohydrate metabolism in triatoma infestans (insecta: hemiptera). ii. : the metabolism of c-glucose in triatoma infestans nymphs and the effect of ddt. comp. bioehem. physiol. [ ] nr. , s. ff . ( s.) . baens, g. s., lv-~deen, e., u. cornblxth, m. : studies of carbohydrate metabolism in the newborn infant. vi. : levels of glucose in blood in premature infants. pediatrics [ ] nr. , s. ff . ( s.) . * b~akeb, j., u. mapson, l. w. : studies in the respiratory and carbohydrate metabolism of plant tissues. xil : l~tlrther studies of the formation of co~ and the changes in lactate, alcohol, sucrose, pyruvate, and cr in potato tubers in nitrogen and in air following anaerobic conditions. prec. royal soc. b. [ ] nr. , s. ff. ( s.) . *bean, r. c., porter, g. g., u. bur, b. k.: carbohydrate metabolism of avocado. il: formation of sugars during short periods of photosynthesis. plant physiol. [ ] nr. , s. ff. ( s.) . b~gy~, e. n.: quantitative aspects of glucose metabolism in pregnant and nonpregnant sheep. amer. j. physiol. [ ] nr. i, s . / . b~-~t~au~, p. l.: effect of temperature on the fate of certain sugars in the gut of oxycarenus hyalinipennis costa (heteroptera: lygaeidae). naturwissenschaften [ ] nr. , s. . * broit~x~, s. a., u. zamchec~, n. : utilization and excretion of d-xylose in thyroxinetreated rats. j. l~bor. elin. med. [ ] nr. , s. ff. ( s.) . davidsoi% e. a., u. small, w. : metabolism in vivo of connective tissue mncopolysaccharide. i. : cimndroitin sulfate c and keratosulfate of nucleus pulposus. biochimica biophysiea aeta [ ] nr. , w. : metabolism in vivo of connective tissue mucopolysaccharide. il : chondroitin sulfate b and hyaluronic acid of skin. biochimica biophysica acta [ ] nr. , s. / . --u. s~a~l, w.: metabolism in rive of connective tissue mucopolysaccharide. iil: chondroitin sulfate and keratosulfate of cartilage. bioehimica biophysica aeta [ ] nr. , s. / . dea~, j. m.: a comparative study of carbohydrate metabolism in fish as affccted by temperature and exercise. diss. abstr. [ ] nr. , - ( s.) . de ] ode, t~,. c., steele, r., altszuler, n., du~, a., u. bishor, j. s. : effects of insulin on hepatic glucose metabolism and glucose utilization by tissues. diabetes [ ] l~t. , s. ft . ( s.) . *edel)ian, j., recaldi~, d. a. c. l., u. dic~erson, a. g. : the metabolism of fructose polymers in plants. . : the activity of f-fruetosylsucrose]sucrose transfruetosylase in living tissue of helianthus tuberosis l. bull. res. council israel a [ ] nr. , s. ff . ( s.) . * farooqi, m. i. h., u. karl, k. n.: potassium nitrate and carbohydrate contents of argemone mexieana l. at different stages of its growth. current sci. [ ] nr. , s. ft . (i s.) . * geissb#~ler, f., u. fav~oe~, p. : ] ~ffet des rayons x sur le m tabolisme du glucose et des acides gras chez la souris m~le. helvetica physiologica et pharmacologica acta [ ] nr. , s. c ff. ( s.) . gra~cr, b. r. : an investigation of the mechanism of absorption of sugars by plant cells. diss. abstr. [ ] nr. , - ( s.) . * gayeos~i, . d., t~ay~r, w. r., gayboskl w. a., gabaielson, i. w., u. sl'mo, h. m. : a defect in disaceharide metabolism after gastrojejunostomy. :new engl. j. med. [ ] nr. , s. s/s . * hxtch, m. d., u. glasziou, k. [ ] nr. , s. ff. ( s.) . heald, p. j. : the mctabolism of carbohydrate by liver of the domestic fowl. biochem. j. [ ] nr. , s. / . h~sch~l, m. j., i-lrr,tj, w. b., u. poster, j. w. g. : carbohydrate digestion in the small intestine of the young steer. prec. nutrition soc. [ ] nr. , s. v/vi. heb~h~d~z, a., u. sols, a. : transport and phosphorylation of sugars in adipose tissue. biochem. j. [ ] nr. , s. / . h, , .z, h., eric, , , c., u. glaubitt, d. : veriinderung yon zelldichte und polysaccharidstoffwechsel im alternden bindegewebe. klin. wschr. [ ] nr. , s. / . holt, p. r., h&essler, h. a., u. issv, lbacher, k [ ] nr. , s. ff. ( s.) . je ni~os, a. c., u. morton, r. k. : changes in carbohydrate, protein, ni~ogenous compounds of developing wheat grain. australian j. biol. sei. [ ] , s. ff. ( s.) . * kok, b., hoch, g., u. coorv, l~, b .: sensitization of chloroplast reactions. i. : sensitization of reduction and oxidation of cytochrome c by chloroplasts. plant physiol. [ ] nr. , s. ff. ( s.) . *levin, :b., berholzer, v. g., slqodorass, g. j. a. i., stimm~er, l., u. wilmers, m. j. : fruetosaemia. an inborn error of fructose metabolism. arch. disease childhood [ ] nr. , s. ff. ( s.) . lindquist, b., u. m_eeuwlsse, g. w. : intestinal transport of monosaccharides in generalized and selective malabsorption. acta paediatrica suppl. , s. ll ff. ( s.) . lui~dholm, l., u. i~ottme-lundholi~i, e r, j. a. : i. : glucose tolerance in swine as related to post-mortem muscle characteristics. ii. : the relationship of niacin and niacin coenzymes with various post-mortem properties of porcine muscle. diss. abstr. [ ] nr. , - ( s.) . *ivir/~z, m. : carbohydrate metabolism following traumatization in the noble-collip drum and shock due to burns in rats. physiologica bohemoslovenica [ ] nr. , s. ft. ( s.) . * murder, r., u. smith, f. h.: abnormal carbohydrate metabolism in pancreatic carcinoma. i~ied. clinics north americ~ [ ] nr. , s. ff. ( s.) . *randle, p. ft., garland, p. b., hales, c. n., u. newsholme, e. a.: the glucose fattyacid cycle. lancet [ ] nr. , s. ff. ( s.) . rossi, f., z~ri, m., u. g] av.enbau~, a. l.: evidence for the existence of ~he hexose monophosphate pathway for glucose metabolism in the normal and denervated skeletal muscle of rats. biochcm. j. [ ] nr. , ] . *ruiteb, j., wei-i~bei~o, f., u. morrisoi~, a. : the stability of glucose in serum. clinical chem. [ ] bit. , s. ff. ( s.) . $sac]ter, j. a., ]: l~tch, ~r d., u. glasziou, k. t [ ] nr. , s. ff. ( s.) . *scm~ass~k, h.: metabolite des kohlenhydratstoffwechsels der isoliert pcrfundierten rattcnleber. biochem. z. [ ] iqr. , s. ff. ( s.) . *sch~reiu, c. : relation between fructose content of semen and fertility in man~ j. reproduction fertility [ ] i~ir. , s. ff. ( s.) . * schlubac~, it. h., u. g~em~, m neptune, e. ~., u. sudduth, h. c. -" toxic effects of oxygen at hight pressure on the metabolism of d-glucose by dispersion of rat brain. biochem. j. [ ] nr. , s. / . *waaei~, h. g.: pathways of utilization of ( - c) glucose and ( -x c) glucose in slices of peas. j. exper. botany [ ] nr. , s. ff. ( s.) . weic-~r, h., schs~r~cl, h., u. re~schle~, h. e.: ~ber physiologische glucose-ansscheidung im urin yon stoffwechsel-gesunden. klin. wschr. [ ] nr. , s. . w~ovse, s., fr~ed~ia~r~, b., u. reicm~d, g. : effects of insulin on hepatic glucose production and utilization. diabetes [ ] nr. i, s. lff i~ou~, l~i., u. kum~e~ow, f. a. : fatty acid composition of lymph lipids from rats fed fresh and thermally oxidized fats. j. dairy sci. [ ] i~r. , s. ff. ( s.) . * blou~rr, a. w., u. co~, m.: tissue lipid pattern in a case of xanthoma disseminatum. arch. intern. ivied. lu [ ] nr. , s. ] . bortz, w., anr~m~, s., u. c~o~, i. l.: localization of the block in lipogenesis resulting from feeding fat. j. biol. chem. [ ] nr. , s. / . bronsert, u., hartmani~', f., u. mitzkat, ii.-j.: die wirkung yon milchs~ure im fettstoffwechsel der leber bei experimenteller fettlcber der ratte. naturwisscnschaften [ ] i~r. , s. . br w~, m. l. : effect of a low dietary level of three types of fat on reproductive performance and tissue lipid content of the vitamin b -deficicnt female rat. j. nutrition [ ] nr. , s. . *busfm~k, e. r., tho~rrson, r. h., u. weedo~, g. d. : metabolic response to cold air in men and women in relation to total body fat content. j. appl. physiol. [ ] nr. of mast cells to fat transport. ann. new york acad. sci. [ ] ar~. , s. ff. ( s.) . katorsx~, b. a.: the influence of growth hormone on fat and protein metabolism. diss. abstr. [ ] nr. , -- ( s. amer. j. clinical nutrition [ ] nr. , s. ff. ( s.) . mxmo~, j. e.: the influence of diet and age on lipid metabolism of chickens. diss. abstr. [ ] nr. , - ( s.) . *me, i, f. : a study on the lipid metabolism in the patients with atherosclerosis especially on the cholesterol in serum lipoprotein fl-fraction. japan. heart j. [ ] nr. brown, l. d., grev~es, %. m., u. duncan, c. w. : effect of protein level in milk replacers on growth and protein metabolism of dairy calves. j. dairy sci. [ ] nr. , s. ff. ( s.) . *ma~i)elst~, j. : protein turnover and its function in the economy of the cell. arm. new york acad. sci. [ ] nr. , s. ff. ( s.) . mast~as, c. j. : nucleic acids and protein stores in the merino sheep. australian j. biol. sci. [ ] nr. , s. ff. ( s.) . noack, r.: zur biochemischen funktion des proteins als grundlage der bedarfsaormen. ernehrungsforschung [ ] nr. , s. / . *par.r. anin, a. u : le m tabolisme pro~ique dans le systems nel-ceux. studii si cercet~ri de biochimie [ ] nr. , s. ft. ( s.) . roouski, j., ~i[akowska, k., i~sn~, j., stankowska, a., hryniewiecki, l., u [ ] nr . [ ] . suppl. to nr. , s. ff. ( s.) . co~solazio, c. f., ~ixtous~, l. ., nv. lson, r. a., ~[a~drng, r. s., u. can" a~, j. e.: excretion of sodium, potassium, magnesium, and iron in human sweat and the relation of each to balance and requirements. j. nutrition [ ] nr . goodwin, a. f., u sci. [ nr. , s. ff. ( s.) . *gerlinger, p., much, j.-v., u. clukvert, j.: note pr liminaire sur raction du eyclophosphamide (endoxan) sur le d veloppement de l'embryon. c. r. acad. sci. [ nr. , s. ff. ( s. [ nr. , - ( s.) . nelson, f. e., jensen, l. s., u *caruzzo, c., tartara, d., pagano, p. g., pellegrini, a., costanzo, f., u [ ] nr. , -- ( s.) . [ ] nr. , s. ll ff. ( s.) . gal/~bos, j. t., asada, m., u. s~ks, j. z. : the effect of intravenous ethanol on serum enzymes in patients with normal or diseased liver. gastroenterology [ ] nr. , s~ ff. ( s.) . gall, c., u. weissw~.~le~, p. : un~ersuchungen fiber die labf~higkeit der milch und ihre beziehung zur mineralsteff-fiitterung der kfihe. milchwissenschaft [ ] nr. , s. ff. (? s.) . zitat: dt. lebensmittel-rdsch. [ ] nr. , s. . gasic, g., u. morrison, a.b.: mucopolysaccharides of renal collecting tubule ceils in potassium deficient rats. prec. soc. exper. biol. med. [ ] l~r. , s. ff. ( s.) . gear~-, t. f. : oak wilt development and its reduction by growth regulators. i. production and activity of oak wilt fungus pectinase, cellulase, and auxin. ii. effect of halogenated benzoic acids on oak trees, the oak wilt diseases, and the oak wilt fungus. diss. abstr. [ ] nr. , -- ( s.) . *g~l~,~cs~,r, f., g~ti, t., gy~g~, k., u. s s, j.: effec~ of cardiopathogenic diet on the thiopental anaesthesia. acts physiologica aead. sci. hung. suppl. nr. , s. . gar~'ls, j., piazu~lo, e., u [ ] nr. , -- ( s.) . ikir~a, h., u. t~, k. v. : activity of gibberellin 'd' on the germination of photosensitive lettuce seeds. nature ] nr. , s. / die wirkung des glucagons auf den blutzuckerspiegel in abh~ngigkeit yore alter. z. aiternsforsch. [ ] nr. , s. / . *loosli, j. k. : primary signs of nutritional deficiencies of laboratory animals. j. amer. veter, reed. assoc. [ ] nr. , s. ft. ( s.) . lowrey, r. s., pond, w. g., lo sli, j. k. u. barnes, r. h.: effect of dietary protein and fat on growth, protein utilization, and carcass composition of pigs fed purified diets. j. animal sci. [ ] nr. , s. ft. ( s.) . lvnd, c. c., u. ) [ ] nr. , s. ff. ( .) . nr~cleod, l. b. : effect of liming and potassium fertilization on soil solution and on yield and composition of alfalfa and orchard grass mixtures. dlss. abs~r. [ ] nr. , -- ( s.) . ~ds~n, k. ., u. edmonds, e. j. : prolonged effect on caries of short-term feeding of rice hulls to cotton rats. j. dental res. [ ] nr. , s. ff. ( s.) . m~a~.~, a. c. : biological responses of young rats fed diets containing genistin and genistein. j. nutrition [ ] nr. , s, ] . ~_ajaj, a.s., dnc~g, j.s., azzam, s.a., u. da_~by, w. j.: vitamin e responsive megaloblastic anemia in infants with protein-calorie malnutrition. amcr. j. clinical nutrition [ ] nr. , s. ft. ( s.) . v~jcm~owicz, e., u. quxstel, j. h. : effects of aliphatic alcohols on the metabolism of glucose and fructose in rat liver slices. canad. j. bioehem. physiol. [ ] nr. , s. ff. ( s.) . m_al~otra, o. p., n~a-wl)ov, a.v., r~ber, e. f., u. norton, h . w., effects of rat strain, stilbestrol, and testosterone on the occurrence of hemorrhagic diathesis in rats fed a ration containing irradiated beef. j. nutrition [ ] nr. , s. ff. ( ,) . --, u. r~ber, e. f. : effect of methionine and age of rat on the occurrence of hemorrhagic diathcsis in rats fed a ration containing irradiated beef. j. nutrition [ ] nr. , *misga, u. k. : effect of corn oil feeding on the lipids of dog bile. indian j. exper. biol. [ ] nr. , s. / . * miyao, m., tsvn•isnz, m., nagano, k., u. ttosogi, t.: experimental studies on the digestibility and absorbability of milk proteins. . effects of carbohydrate addition on the digestibility and absorbability of cow's milk proteins. tokushima j. exper. med. [ ] nr , nr. , s. ff. ( s.) . nagra, c. l., brerrenbach, r. p., u. meyer, ~. k. "-influence of hormones on food intake and lipid deposition in castrated pheasants. poultry sci. [ ] nr. , s. ff. ( s.) . *+na~:ler, w. g.: the significance of calcium ions in cardiac excitation and concentration. amer. heart j. [ ] nr l~, b. l., u. re(~a~, w. s., u. dobozy, a.: effect of vitamin a on the nucleic acid metabolism of rats. acts biologics acad. sci. hungariae , suppl. , s. . ~itz, k., u. loeser, a. : uber den einflufl appetithemmender substanzen auf das fettgewebe. klin. wschr. [ ] nr. , s. . * rstadius, k., nordstrom, g., u. lannek, n. : combined therapy with vitamin e and selenite in experimental nutritional muscular dystrophy of pigs. cornell veterinarian [ ] nr. , s. ft. ( s.) . ern~hrung und therapie. med. u. ern~hrung [ ] nr. , s. / . *--pflanzliche polysaccharide zur steigerung der kiirpereigenen abwehr. ivied. welt , nr. , s. pxrkrnson, t. m., u. lso~r, j. a. : inhibitory effects of bile acids on the uptake, metabolism, and transpor~ of water-soluble substances in the small intestine of the rat. life sci. , nr. , s. ] . *pxl~o% j.-l., u. mord~l~t-dx~rin~, )i..' action du potassium et du calcium sur l'histaminopexie s~rique. recherches sur le cobaye, le rat et l'homme. j. physiol. [ ] nr. , s. ff. ( s.) . * patek, a. j., de fr tsc] t, n. m., kendall, f. e., n. hirsch, r. l.: corn and coconut effects in dietary cirrhosis of rats. arch. pathology [ ] nr. , s. ff. ( s.) . patil, s. s.: the relation of ehlorogenic acid and total free phenols in potato plants to resistance to infection by verticillum alboatrum. diss. abstr. [ ] nr. , -- ( s.) . petuet,y, f.: diskussionsbemcrkungen zu rcferaten fiber sauermilchprodukte auf einer vortragsveranstaltung der gesellsehaft fiir erniihrungsbiologie e.v., miinchen, . juni . milchwissensehaft [ nr. , s. ] . pfei~ter, ( . j., gass, g. h., u. schw~rz, ( . s.: reduction by chlorpromazine of ulcem due to acute starvation in mice. nature [ ] nr. , s. . prrrllros, a. w., newc m~, tl r., u. sha.wklrs, d. r.: long-term rat feeding studies on irradiated chicken stew and irradiated cabbage. toxicol. appl. pharmacol. [ ] nr. , s. ] . prrmt~s, p. h., sutti~, j. w., u. ze~rowski, e. j. : effects of dietary sodium fluoride on dairy cows. vii.: recovery from fluoride ingestion. j. dairy sei. [ ] l~r. , s. ff. ( s. browi~, t. h., u. levee, r. v.: the effect of milk intake on nematode infestation of the lamb. prec. nutrition oc. [ ] nr. , s. / . * srnwrvasan, m., n~o~bhusha_~a~, a., u. sm~rrvas~, k. s.: changes in serum inorganic phosphate following ingestion of protein. curr. ci. [ ] nr. , s. . stallwoth, h. : some effects of . -dichlorophen-oxyacetie acid on sweet corn (zea mays rugosa l.) with emphasis on yield, tillering, root development, and exudation of electrolytes from roots and stems. diss. abstr. [ ] nr. , --- ( s.) . starcher, b., u. i~.ratzer, f~ h.: effect of zinc on bone alkaline phosphatase in turkey poults. j. nutrition [ ] nr. , s. / . stei~-~off, d., u. ~rqu~lrdt, p. : kombination yon kaliumpyrosulfit und ~xthy]alkohol ira tr~nkungsversuch an ratten. arzneimittelforschung [ ] nr. , s. / . stevermer, e. j. : influences of level of nutrition of the boar and of ionic environment of the spermatozoa on the properties of boar semen. diss. abstr. [ ] nr. , -- ( s.) . stn~p~., f., u. c~warz, k.: incorporation of valine-l-x*c into serum and tissue proteins of rats fed torula yeast diets. j. nutrition [ ] nr. , s. / . ton~., d. b., u. co~or, w. e. : the prolonged effects of a low cholesterol, high carbohydrate diet upon the serum lipids in diabetic patients. diabetes [ ] nr. , . ft. ( s.) . *stormont, j. •., u. waterhouse, c. : effect of variations in previous diet on fasting plasma lipids. j. labor. clinical med. [ ] nr. , s. ff. ( s.) . * stuart, a. e., u vity of the heart extract of rats. medlcina exporimentalis [ ] nr. , s. ff. ( s.) . szepsenwol, j.: carcinogenic effect of egg white egg yolk and lipids in mice. prec. soc. exper. biol. med. [ ] nr. , s. ff. ( s.) . *t~cs, l, u. ny~i, i.: effect of saline infusion, acth infusion, and blood transfusion on the hormone excretion of patients with hypereme~is. act~ medics aead. sei. hun. garicae [ ] nr. , s. ff. ( s.) . tanner, j. w.: an external effect of inorganic nitrogen on nodulation. diss. abstr. [ ] nr. , -- ( s.) . --, u [ ] nr. , s. ff. ( s.) . *t~i~er, l., m~az, m., u. cs:menafiov , m. : the effect of glucose and glucose together with insulin on the resistance of fasted rats to trauma in the noble-collip drum. physiologia bohemoslovenica [ ] nr. , s. ff. ( .) . ude~iend, s. : factors in amino acid metabolism which can influence the central nervous systems. amer. j. clinical nutrition [ ] nr. , s. ff. ( . .) vahouny, g. v., moede, a., silver, b., n . treadw~ll, c. r. : nutrition studies in the cold. iv. effect of cold environment on experimental atherosclerosis in the rabbit. j. nutrition [ ] nr. , s. / . van p lsum, j. f., olsen, b., taylor, d., rozyc'ki, t., u voss, r. d.: yield and foliar composition of corn as affected by fertilizer rates and environmental factors. i)iss. abstr. [ ] nr. , -- ( s.) . *vyval,ro, i. g. : the effect of gibberellin on the transformation of substances in germinating corn seeds. i)oldady akad. nauk sssr [russ.] [ ] nr. , s. ff. ( s.) . wao~ei~, g. r., cl~mk, a. j., hays, v. w., u amer. j. clinical nutrition [ ] lgr. , s. ff. ( s.) . the assessment of marginal protein malnutrition. prec. nutrition soc. [ ] mr. , s. ] . --, u. stnp~, j. m. l.: the free ly~ine and amino nitrogen content of liver, muscle, and serum in normal and protein-depleted rats. prec. nutrition soc. [ ] mr. i, s . viii/ix. *watson, w. c.: the morphology and lipid composition of the erythrocytes in normal and essential-fatty-acid-deflcient rats. bri . j. haematology [ ] lgr. , s. ff. ( s.) . waite, r., u. : blackburn, p. s. : the relationship between milk yield, composition and tissue damage in a case of subclinical masticis. j. dairy res. [ ] , nr. , s. ff. ( s.) . *n. n.: paraty ~hoid fever from frozen chinese eggs. brit. reed. j. . ( s.). *n. n. : radioactivity and human diet. chem. ind. , nr. , . . ~q. ~.: eh~digungen dutch konservlerungsmittel bei zitrusfrfichten? dr. reed. wschr. [ ] nr. , s. . *n. n. : salt-poisoning in infancy. lancet [ ] l~r. , s. . n.n.: kouoquium des arbeitskrcises hamburg der gdch-fachgruppe lebensmittelchemic und gerichtliche chemic am . januar . lebensmittelchem. u. gerichtl. chem. [ nr. , s. . *n. n. : smoking and heath disease. new england j. med. [ ] nr. , s. . +hi. n. : toxic components of lathyrus peas. nutrition rev. [ ] nr. , s. / . +n. n. : cariogenic ability of different diets. nutrition roy. [ ] nr. , s. ] . +n. ~.: aminoaeiduria in lead intoxication. nutrition rev. [ ] nr. , s. / . +n. n. : pyridoxine and dental caries. human studies. nutrition rev. [ ] nr. , s. . +n. n.: pyridoxine and dental caries. animal studies. i~utrition rev. [ ] nr. , s. . +hi. n. : rcporb: the prophylactic requirement and the toxicity of vitamin d. pediatrics [ ] nr. , s. ff. (? s.). axrkroa, a.: caesium- from fall-out in human milk. nature [ ] nr. , s. ff. ( s.) . *allcroft, r., u. car~agha~, r. b. a. : groundnut toxicity: an examination for toxin in human food products from animals fed toxic groundnut meal. veteri. rec. [ ] nr. , s. ff. ( s.) . *--, l~wis, g., u. hill, k. ~.: groundnut toxicity in cattle: experimental poisoning of calves and a report on clinical effects in older ca~tle. ve~er. rcc, [ ] nr, , s. ff. ( s. nr. , , nr. , , , , nr. , nr. , influence of calcium and ouabai bain upon potassium influx in human erythrocytes the enzymatic assimilation of nitrate in the tomato plant translocation of '~p, i~n, and ~c in plants einige neue gesiehtspunktr zum caleiumstoffwechsel dis~ibution of water, sodium, and potassium in resting and stimulated mammalian muscle. canad the influence of vitamin bi, on calcium ('sca) metabolism of maxillodental tissues kidney, water, and electrolyte metabolism intermediiirer elektroly~toffwechsel und zellgrenzfl~chenphysiologie im theoretischen zusammenhang mit der krebsentstehtmg. tell i dcr einflul~ yon vollkornbrot auf den calcium-stoffwechsel bei schulkindern recherches sur le m tabolisme du soufre. x. : la non- quivalence de la eystine et de la eyst ine dans la couvcrture des besoins sufr~s du rat adulte potassium-magnesium antagonism in soils and crops low serum iron levels in obese adolescents metal content of human organs studies on the requirements and interaction of copper and iron in broad breasted bronze turkeys to weeks of age iron absorption and excretion in experimental iron deficiency the measurement of exchangeable magnesium in dogs the copper metabolism of warmblooded animals with special reference to the rabbit and the sheep comparative studies of the metabolism of strontium and barium in the rat the utilization of iron in erythropoiesis binding of strontium in blood ~iolybdenum, copper, and zinc contents of mouse liver and sarcoma treated with molybdenum compounds biochemical effects of zinc deficiency in tomato plants excretion of sodium, potassium, magnesium, and iron in human sweat and the relation of each to balance and requirements turnover rate of zinc in the body as determined by the study of szn in rats a study of the iron absorption in mice as modified by various agents funktionsteste des radiojodstoffwechselstudiums und ihre bedeutung in der diagnostik der sehilddrfisenerkrankungen. ~rztl. laboratorium the fate of radioiodine after parcnteral administration a possible humeral regulator of iron absorption beitrag zur kl~rung der ursachen der anreicherung yon caesinm- im rganismus blood-and serum-level of watersoluble vitamins in man and animals significados metabslicos do ~cido ~-lip ico. . o ~cido r e o metabolismo do ferro observations on a magnesium-fluoride interrelationsip in chicks prevention of ,meat anemia" in mice by copper and calcium iron metabolism in experimental pyridoxine deficiency aluminium in soils and plants on the coastlands of british guinea physiology of adolescence. ii. : nutrition -basal oxygen consumption -energy expenditure and balance -nitrogen metabolism -calcium metabolism -iron metabolism -red cell mass and hemoglobin dietary strontium and calcium, and deposition of sr and asca in the bones of rats -~iengenelementansatz wachsender sehweine bei unterschicdiichen cuso -zulagen differences in copper retention in two strains of chickens untersuchungen fiber anreicherung und verteilung yon rubidium in gerstenkcimpflanzen [ ] nr. , s. / . *+lv~ointyr~, i.: an outline of magnesium metabolism in health and disease. a review uptake by the root and subsequent distribution within the potato plant of strontium- leached from the foliage nor zinkstoffwechsel in der schwangerschaft foetal metabolism of caesium- in the rat magnesium metabolism of chickens zinc metabolism in patients with the syndrome of iron deficiency anemia hepatospenomegaly dwarfism, and hypogonadism. labor. clinical mcd c~isium- trod kalium in menschlichen organen und in der milch / l~ole of the genotype in controlling accumulation of strontium- by plants copper and zinc interrelationships in the pig effect of chromium, cadmium, and other trace metals on the growth and survival of mice studies in the metabolism of zinc. iv. some observations on the urinary zine-porphyrin relationship in non-porphyries and in a patient with aeutm intermittent porphyria aastrontium balances in man studies on zinc metabolism. ii.: effect of the diabetic state on zinc metabolism: an experimental aspect effect of diabetic state on zinc metabolism: a clini. cal aspect copper metabolism and the liver iron metabolism and the liver with particular reference to the pathogcnesis of haemachromatosis studies on iron metabolism uber den eisenstoffwechsel. (bemerkung zu t. su~di~, miinchener reed yifinchener reed. wschr. [ ] nr. , s. . c - wirkstoffe -biocatalysts +n. n. : vitamin a in human livers. nutrition biological half.llfe of vitamin b~ in plasma hypervitaminosis a and mast cells. a study of the interrelationship of mast cells and vitamin a in vivo and in vitro evidence concerning the human requirement for vitamin bi~. use of the whole body counter for determination of absorption of vitamin blz vitamin c in plasma and leucocy~s of smokers and non-smokers some factors affecting the absorption of vitamins the determination of vitamin a in animal tissues and its presence in the liver of the vitamin a deficient rat metabolic activities of vitamin a and related compounds in animals. i.: role of vitamin a in intestinal muscular contraction zum vitamin-b--haushalt dcr ratte bei sorbitfiitterung contribution a l'~tude do la relation entre la vitamine big et la glande thyrolde effects of deficiencies of certain b vitamins and ascorbic acid on absorption of vitamin blv amer ascorbic acid metabolism in plants. ii. : biosynthesis dietary and thyroid interrelationships affecting vitamin a status of feedlot beef cattle die wirkung gesteigerter kupferzufuhr auf den vitamin-c-haushalt vom meerschweinchen bei parental zugeffihrter ascorbins~ure kritische auswe~ung der naeh erschienenen arbeiten fiber gebundene ascorbins~ure im tierischen gewebe metabolism and biological activity of vitamin a acid in the chick biochemical studies of vitamin metabolism in poultry urine. ii. : on the excretion of thiamine in poultry urine after subcutaneous and oral administrations of some thiamine derivatives untersuchungen fiber die speiehcrung und fiber die ansscheidung yon vitamin a nach ungeniigender vitamin-a-versorgung bei legenden hfihnern studies on metabolism of vitamin a. .: the biological acticity of vitamin a acid in rats vitamin a and cholesterol absorption in the chicken studies on the interaction of vitamin bi~, intrinsic factor, and receptors. il the possible absorption of intrinsic factor human growth hormone in infant malnutrition macro-and micromethods for the determination of serum vitamin a using trifluoroacetic acid the activation of sulphate by extracts of cornea and colonic mucosa from normal and vitamin a-deficient animals the importance of blood as a pool of vitamin d studies on metabolism of vitamin a. .: enzymic synthesis ~nd hydrolysis of phenolic sulphates in vitamin-a-deficient rats human metabolism of l-ascorbie acid and erythorbic acid tissue distribution and storage forms of vitamin b~, injected and orally administered to the dog relation between vitamin a, tocopherol, and cholesterol serum levels in the elderly interrelationships of vitamin bi~, folio held, and ascorbie acid in the megaloblastie anemias zum wirkungsmechanismns des vitamins e. helvetica physiologica et pharmacologica effect of some physiologic factors on the absorption of vitamin bi~ in rats transport of dietary nitrogen mitochondrial fatty acids of fish and fish-eating birds the biosynthesis of fatty acids influence of age and dietary protein on cerbain free amino acids in chick blood plasma nitrogen metabolism in coldexposed rats ~ber des vermehrte auftreten yon fettsiiuren mit bis c-atomen in den depotfetten siiugender ratteu und den ubergang der linolsi~ure yon den mfittern auf die jungen die bildung yon antiksrpcrn gegen verschicdene kuhmilehproteine bei neugeborenen, kindern, erwachsenen und graviden the myocardial arterio-venous differences of free amino acids and of free fatty acids in healthy individuals, patients with diabetes and with essential hypercholesterolemia urinary amino acids on phenylalanine-tyrosine-supplemented diets difference in the metabolic fate of acetate and ethanol fed to higher plant tissues iiemodynamic relationships of anaerobic metabolism and plasma free fatty acids during prolonged, strenuous exercise in trained and untrained subjects effects of palmitate on the metabolism of leukooytes from-guinea pig exudate the dynamics of plasma free fatty acid metabolism during exercise ~ber die retention, den abbau und die ausseheidung yon -thion-tetrahydro-l. . -thiadiazinen transport systems for amino acids effects of protein intake and cold exposure on selected liver enzymes associated with amino acid metabolism quantitative studies on tryptophan metabolism in the pyridoxine-deficient rat effect of desoxypyridoxine-induced vitamin b~ deficiency on polyunsaturated fatty acid metabolism in human beings studies on the wheat plants using carbon- compounds. xix.: observations on the metabolism of lysine- c. canad genetic defects of amino acid metabolism. pediatric clinics north america metabolism of tryptophan in diabetes mellitus the two-carbon chain in metabolism der einfluss yon vitamin a auf den citronens~urestoffwechsel ~)ber phenolspeicherung und phenolabbau in wasserpflanzen. naturwissensehaften effect of physical exercise on nitrogen balance in obese subjects metabolism of nitrogen compounds in the rumen of ruminants. izvest. acad der intermedli~r-stoffwechsel s~ffweehsel der carotine im hiihnerembryo nucleic acid metabolism of germinating corn seedlings carbon metabolism of ~ c-labelled amino acids in wheat leaves. ii.: serine and its role in glycine metabolism bovine metabolism of insecticides. the metabolism of ~evin in dairy cows stoffweehsel der carotine. wiss. veroff. dr. oes. ern~hrnng metabolism of labelled linolcic-l-tac acid in the sheep rnmen nonessential nitrogen supplements and essential amino acid requirements. nutrition rev vitamin c requirements of man re-examined. new values based on previously unrecognized exhalatory excretory pathway of ascorbie acid studies on the requirements and interaction of copper and iron in broad breasted bronze turkeys to weeks of age water requirements of men as related to salt intake evidence for a high zinc requirement at the onset of egg production effect of lysine and glyeine upon arginine requirement of guinea-pigs the cobalt requirement of subterranean clover in the field fluid and electrolyte, requirements of newborn infants with intestinal obstruction the requirement and availability of dietary iron for young pigs studies on the protein and methionine requirements of young bobwhite quail and young ringnecked pheasants phenylalanine requirement of women consuming a minimal tyrosine diet and the sparing effect of tyrosine on the phenylalanine requirement effects of starvation on the cardiovascular system of the chicken calcium and phosphorus requiremeats of finishing broilers using phosphorus sources of low and high availability water intake of normal children sex differences in the ~ oeopherol requirement of rats as shown by the haemolysis test further studies on protein and energy requirements of chicks selected for high and iow body weight smoking and blood dotting dental caries and trace elements a statement approved by the board of directors of the canadian heart foundation the question of fats. il : fats and disease moldy peanuts and liver cancers vitamin c and healing of wounds berieht fiber die vortragstagung des fachverbandes lehensmittelchemie der chemisehen gesellschaft in der d])r veto . his diet and human depot fat ethanol and plasma free fatty acid in man dietary water and protein efficiency in rats. nutrition rev. [ ] nr. , s. / . +n. n. : nature of the coagulation defect in rats fed diets producing thrombosis or experimental atheroselerosis factors affecting growth depression by raw soybeans bones in undernourished animals. nutrition rev. [ ] nr. , s. . +n. n. : vitamin e and the etiology of muscular dystrophy in the rabbit riboflavin eoenzymes and congenital malformations the thyroid gland in infant malnutrition. nutrition rev. [ ] nr. , s. . +n. n. : a proposed mechanism for the effect of fats on serum cholesterol effect of varying levels of dietary protein on synthesis and excretion of urea dietary phosphates and dental caries folic acid restriction and cancer inhibition changes induced by lipoie acid in normal rat liver vitamin b deficiency and tryptophan metabolism effect of ubichromenol on development of encephmomalacia in vitamin e deficient chicks leucine-induced hypoglycemia nutritional muscular dystrophy in lambs. nutrition rev. [ ] nr. , s. / . +n. n.: amino acid imbalance in cold-exposed rats. nutrition rev. [ ] nr. , s. . +n. n.: milk and athletic performance effect of vitamin a deficiency on the ubiquinone content of rat liver idiopatjaie stea~orrhea gastrointestinal protein loss in iron-deficiency anemia nutritional cirrhosis of the liver. nutrition rev. [ ] nr. , s. / . +n. n. : exercise and heart disease calcium deficiency in the etiology of osteoporosis the relation of dietary fat to the fatty acids in the intestinal wall clot-strength and elot-lysis in rats fed hyperlipemic diets effect of potassium iodide and duodenal powder on the growth and organ weights of goitrogen-fed rats ver~nderungen im stoffwechsel und wachsturn junger tomatenpflanzen nach giberillins~,urebehandiung effect of restricted feeding during the growing period on reproductive performance of large type white turkeys keys, a. l glucose, sucrose, and lactose in the diet and blood lipids in man ambient temperature and survival on a protein-deficient diet some considerations of changes in total body composition in relation to nutritional status the effect of variations in the energy and protein levels of the ration upon performance in the pig studies in choline deficiency. fate of injected - ~c-pal. mitio acid and fatty acid spectra in fasting and refed rats bartou i~ vitamin a requirements of chicks at moderately elevated temperature influence of age and dietary protein on eer~ain free amino acids in chick blood plasma the effects of nicotine on weight increment, activity, food intake, and water intake in weanllng albino rats effect of pyridoxine deficiency upon delayed hypersensitivity in guinea-pigs vitamin a deficiency in chickens ccrebellar encephalomalacia produced by diets deficient in toeopherol vitamin b deficiency in indian infantm plasma liplds in scurvy: effect of ascorblc acid supplement and insulin treatment effect of gibberellin on the variations of the growth-point in winter wheat uptake of dinitrophenol and its effect on transpiration. calcium accumulation in barley seedlings l~[ethylmalonate excretion in vitamin b~ deficiency reduction of plama cholesterol levels in atherosclerosis by diet and drug treatment. australasian ann effects of reduced dietary intake on the activities of various enzymes in the livers and kidneys of growing male rats the effect of feeding d-methionine on the d-amino acid oxidase activity of chick tissues l~etabolic effects of dietary protein level in cold-exposed rats relative effects of rapeseed oil and corn oil on rats subjected to aclrenalectomy, cold, or pyridoxine deprivation metabolic effects of dietary protein level with caloric restriction in coldexposed rats. canad metabolic effects of three dietary protein levels fed isocalorically to coldexposed rats. caned die nolle versehiedener fette im eiweis des organismus. nahrung the bursa of fabricius and xanthine oxidase activity of liver and kidney following dietary supplementation of iodina~d casein to chickens effects of linolsate and dietary fat level on plasma and liver cholesterol and vascular lesions of the cholesterol-fed rat a comparative study of the effect of bile acids and cholesterol on cholesterol metabolism in the mouse, rat, hamster, and guineapig the effects of ruminal and plasma sodium concentrations on the sodium appetite of sheep effect of level and sequence of feeding and breed on ovulation rate, embryo survival, and fetal growth in the mature ewe inhibitory effects of carbohydrates on histamine release and mast cell disruption by dextran fett-s~urestoffwechsel bci the effect of calcium infusions, parathyroid hormone, and vitamin i) on renal clearance of calcium nutritional supplementation during pregnancy effect of level of dietary protein with and without added cholesterol on plasma cholesterol levels in man die schilddrfisenfunktion bei enteralem eiweiflverlust effects of pelleting and varying grain intakes on milk yield and composition fatty acid composition of lipids of serum and aorta in the chicken on different diets rat intestinal suerase. ii.: the effects of rat age and sex and of diet on suerase activity the effect of selenium administration on the growth and health of sheep on scottish farns die wirkung yon vitamin b bei leukopenien effect of energy source and level of alfalfa pellets on growth and tissue hpids of beef calves effect of magnesium deficiency on mast cells and urinary histamine in rats histamine-liberating effect of magnesium deficiency in the rat zur wirkung des wassers bei der seitenwurzelbildung an luftwurzeln influence of the aqueous potato extract and its fractions on growth and spore formation of the b. pumilus and the production of the antibiotic tetaine influence of mineral nutrition on the resistance of peach tree to fusicoceum amygdali de la croix the effect of dietary protein on the course of various infections in the chick bifidus intestinal flora in infants fed on mamysan b. acta paediatriea effect of food fats on concentration of ketone bodies and citric acid level in blood and tissues is there a hemostatie effect of peanuts in hemophilioid disorders? milk allergy in infancy dental effects of fluoridation of water with particular reference to a study in the united kingdom influence of previous feeding with a high-fat diet on liver steatosis produced by acute starvation of growth hormone in mice effect of amino acid imbalance on nitrogen retention. ii.-interrelationships between methionine, valine, isoleucine, and threonine as supplements to corn protein for dogs supplementation of cereal proteins with amino acids. v. : effect of supplementation lime-treated corn with diffe. rent levels of lysine, tryptophane, and isoleueine of the nitrogen retention of young children the interrelation of nutrient supply, leaf nutrient content, and vegetative growth of ilex crenata gastric content of fasted primates. a survey serum cholesterol in vitamin c deficiency in man the effect of carbohydrates on the production of staphylococcal pigment effect of a low dietary level of three types of fat on reproductive performance and tissue lipid content of the vitamin b -defieient female rat effect of magnesium deficiency on synthesis of hear~ and liver mit~chondria phospholipids ~ber den einflub l~nger dauernder ksrperlicher inaktivit~t auf die blutzuckerkurve nach oraler glucosebelasttmg. helvetica medica acts action of trace elements on the metabolism of fluoride zur frage des einflusses yon kondensmilch und einer protein-valerina-polyphosphat-komplexverbindung auf die kreislaufwirktmg des kaffees beim menschen modifications de la croissance de la plantule de lapin blanc (lupinns albns l.) provoqu es par une diminution exp rimentale des r serves influence of the dietary protein level on the magnesium requirement effects of high levels of copper and chlorotetracycline on performance of pigs effect of dietary calcium lactate and lactitic acid on faecal escherichia coli counts in pigs die entwickhing von calcium-mangelsymptomen. z. pflanzenern~hrung, diingung, bodenkde. [ ] nr. , s. / . --calcium-mangelsymptome an hsheren pfianzen copper deficiency in relation to swayback in sheep. i. : effect of molybdate and sulphate supplements during pregnancy long-term, low fat, low protein diets and their effect on normal trappist subjects further studies of the influence of diet on radiosensitivity of guinea-pigs, with special reference to broccoli and alfalfa effects of the infusions of ammonia, amides, and amino acids on excretion of ammonia answirlmngen langfristig fettreicher ern~ihrung auf das plasma-cholesterin the relationship of dietary energy level and density to the growth response of chicks to fats influences of dietary carbohydrate-fat combinations on various functions associated with glycosis and lipogenesis in rats. i. effects of substituting sucrose for rice starch with unsaturated and with saturated fat compensatory carcass growth in steers following protein and energy restriction fatty acid composition and glyceride structure in rats fed rapeseed oil or corn oil. canad influence of selective and nonselective hydrogenation of rapeseed oil on carcass fat of rats. canad studies in serum lipids. with special reference to spontaneous variations and the effect of short-term dietary changes experimentelle untersuchungen zur wirkung yon kaffeefett evaluation of the effect of breed on vitamin be requirements of chicks -iigh salt content of western infant's diet: possible relationship to hypertension in the adult the effect on the serum cholesterol levels of the consumption of a special dietary fat with a high content of unsaturated fatty acids in elderly people effect of dilution of the diet with an indigestible filler on feed intake in the mouse effect of tea and its tannins upon capillary resistance of guinea-pigs food input and energy extraction efficiency in carassins auratns effect of calcium and magnesium upon digestibility of a ration containing corn oil by lambs effects of calorie restriction during the growing period on the performance of egg-type replacement stock effects of insulin on hepatic glucose metabolism and glucose utilization by tissues cellulase and polygalacturonase in tomato fruits and the effect of calcium on fruit cracking effect of nutritional muscular dystrophy and of starvation on amino acid penetration in rabbit tissues plasma protein synthesis in nutritional muscular dystrophy inulin and sucrose distribution in tissues of vitamin e-deficient and control rabbits protein-bound dyes in the serum and liver of rats fed aminoazo dyes vanadium. excretion, toxicity, lipid effect in man the influence of vitamin a status on the proteoly~ic activity of lysosomes from the livers and kidneys of rats disauxie metainfettive e da malnutrizione induced drinking in dogs: comparative effects of hypertonic sodium chloride and sorbitol the influence of early nutrition on brain cholesterol accumulation during growth changes in composition of the saliva of cows on grazing heavily fertilized grass. res. veter changes in composition of the saliva of sheep on feeding heavily fertilized grass efect of varying alfalfa: barley ratios on energy intake and volatile fatty acid production by sheep the influence of calcium on the secretory response of the submaxillary gland to acetyi-choline or to noradrenaline clinical dentistry and fluoride food allergy as a cause of abdominal pain the effect of various dietary levels of ddt on liver function, cell morphology, and ddt storage in the rhesus monkey effect of natural and purified diets on survival of x-irradiated mice effect of autoclaving and ])'sine supplementation of skimmilk-powder diets on growth and caries in rats effects of alcohol intake on subjective and objective variables over a five-hour period nitrogen metabolism of african cattle fed diets with an adequate energy feeding value of fl-caroteno following treatment with n~o~ the relationship of the quantity and quality of dietary fats to serum cholesterol levels in men of different ages and weights effects on girls of greater intake of milk, fruits, and vegetables effect of antioxidant, protein, and energy on vitamin a and feed utilization in steers the growth-maintaining activity of ascorbic acid the effects of an induced pyridoxine and pantothenie acid deficiency on excretions of oxalic and xanthurenic acids in the urine influence of lactose and dried skim milk upon the magnesium deficiency syndrome in the dog. i.: growth and biochem chronic malnutrition in turkey. v. study on serum fatty acids in malnourished children the effect of nutrition conditions on the growth of and nitrogen accumalation by fodder beans when sown together with indian corn effects of a diet high in polyunsaturated fat on the plasmalipids of normal young females citrate and action of vitamin d on calcium and phosphorus metabolism beeinflussung der sportliehen lelstungsf/~higkeit durch eine geeignete er-nehrung effect of dietary erotic acid on liver proteins effect of barbiturie acid and ehlortetraeyeline upon growth, ammonia concentration, and urease activity in the gastrointestinal tract of chicks effects of feeding low levels of dimethoate on milk and whole blood eholinesterase activity of dairy cattle changes in serum lipoproteins after a large fat meal in normal individuals and in patients with isehemic hear~ disease the relationship of specific nutrient deficiencies ~) antibody response in swine. i. : vitamin a. ii. : pantothenie acid, pyridoxine or riboflavin. i)iss. abstr relationship of specific nutrient deficiencies to antibody production on swine. i. : vitamin a relationship of specific nutrient deficiencies to antibody production in swine. il : pantothenic acid, pyridoxine or riboflavin histechemistry of dietary cardiac lesions effect of various levels of fluorine, stilbestrol, and oxytetraeycline, in the fattening ration of lambs uptake of copper and its physiological effects on chlorella vulgarls the effects of a small dose of ethyl alcohol on certain basic components of human physical performance. i. the effect on cardiac rate during muscular work. arch. internat some effects of feeding stilbestrol, chlortetracycline and penicillin with alfalfa soilage on steer performance and carcass quality experimental induction of ciguatera toxicity in fish $hrough diet effects of potassium fertilizer, age of ewe, and small magnesium supplementation on blood magnesium and calcium levels of lactating ewes the influence of higher volatile fatty acids on the intake of urea-supplemented low quality cereal hay by sheep un~emuchungen fiber den umsatz wachsender schweine ab geburt. . mitt eczema and cow's milk. brit. med. j. , nr. , s. . isaacso~, a. : the effects of zinc on responses of frog skeletal muscle effects of zinc on responses of skeletal muscle effect of diet on work metabolism carbohydrate-phosphorus metabolism in the skeletal muscles of epinephrectomized animals durlng treatment with cortisone and vitamins c and p. ukrainskii biokhimichnii zhurnal composition of dietary fat and the accumulation of liver lipid in the choline-deficlent rat nutrition and palatability the incidence of protein-calorie malnutrition of early childhood theories on the mode of action of fluoride in reducing dental decay saccharase deficiency, familial entailing intolerance ~ cane sugar. acts paediatrica raw and heat-treated soybeans for growing-finishing swine and their effect on fat firmness effect of the administration of isoniazid and a diet low in vitamin be on urinary excretion of oxalic acid dietary and thyroid interrelationships affecting vitamin a status of feedlot beef cattle ration effects on rumen acids, ketogenesis, and milk composition. i.: unrestricted roughage feeding the effect of supplements of groundnut flour or groundnut protein isolate fortifed with calcium salts and vitamins or of sklmmilk powder on the digestibility coefficient, biological value and net utilization of the proteins of poor indian diets given to undernourished children a comparison of skin-testing with natural foods and commercial extracts die wirkung yon hungern auf den ammoniakgehalt und das ph der pansenfi(issigkeit sowic auf die harnstoff-, cholesterin-und zuckerkonzentration im blu~. acts veterinaria acad increase of plant virus infection by magnesium in the presence of phosphate effect of intramuscular injection of magnesium sulphate solution on the growth rate and serum composition in rats diskussionsbemerkungen zu referaten fiber sauermilchprodukte auf einer vortragsveranstaltung der gesellschaft fiir eru~ effect of massive sodium bicarbonate infusion on renal function excessive insulin response to glucose in obese subjects as measured by immunoehemical assay die wirkung gesteigerter kupferzufuhr auf den vitamin-c-hanshalt yon meerschweinehen bci paren~ral zugef'dhrter ascorbinsrure the influence of growth hormone on fat and protein metabolism. dies. abs~r. [ ] nr phenolearbons~iuren in mensehlichen nahrungsprodukten. zum vorkommen yon phenolcarbons~uren in mensehlichen nahrungsprodukten und ihr einflul~ auf den intermedit~ren stoffwechsel influence of pregnancy and an oxidized lipid diet on the fatty acid composition of blood and tissue an experimental approach to the mechanisms of weight loss. ii. a comparison of effects of thyroxine, fat-mobilizing substance (fms) and food deprivation in achieving weight loss in mice fat accumulation in the regenerating media of arteries in rats given an atherogenic diet the effect of nutrition on the growth of faseiola hepatica in its snail host the effects of specific viruses, virus complexes, and nitrogen nutrition on the growth, flowering, and mineral composition os strawberry plants body weight and food intake as initiating factors for puberty in the rat the role of catecholamines in the free fatty acid response to cigarette smoking die renale steroidausseheidung bei experimentellem vitamin-e-mangel peculiar features of respiration and redox processes in the rice plants grown under different nutritional conditions der bohnenkaffee und die migrrne repeatability of litter size and weight in the laboratory rat as affected by selection and plane of nutrition wirkungen yon muskelextrakt auf den stoffwechsel. arzneimittelforschung is [ ] nr. , s. / einflul] der silageftitterung auf die qualit~t der butter einflul] der silageftitterung auf die qualit~t yon milch und milchproduk. ten. . mitt.: einflul] von silagefiitterung auf die organoleptischen eigenschaften dcr milch effect of protein intake and cold exposure on selected liver enzymes associated with amino acid metabolism body iron levels and hematologic fin. dings during excess methionine feeding der einttul~ des kulturmediums auf die bildung von streptolysis s durch ruhcnde zellen influence of polyphosphates in chilling water on quality of poultry meat influence of breed-type, feed level, and sex on characteristics of the lamb carcass, and some relationships among live animal and carcass measurements the toxic action of phenothiazine and some disturbances of intermediary metabolism in undernourished sheep efficiency of feed use in beef cattle uber die wirkung der nalmmgsfette auf die blutlipoide, teil i. ernahrungs-umsehau [ ] nr. , s. / . --~ber die wirkung der nahrungsfette auf die blutlipide fette und blutgerinnung. bibliothcca hacmatologiea effect of heavy cigarette smoking on postprandial triglycerides, free fatty acids, and cholesterol role of calcium in fibrin formation glucose- -phosphate dehydrogcnase and aldolase in lenses of lactose-fed rats -effect of riboflavin, choline, pantothenic acid and vitamin a on the excretion of sodium in urine of rats the effect of waterwashed rice in the diet on the growth, excretion of sodium in urine and blood pressure of rats effect of aseorbic acid, vitamin b, and milk on the dark adaption effect of single deficiency of vitamin a, thiamine der einflub yon vitaminen auf die psyehomotorisehe leistungsi'~higkcit beim menschen. naunyn-schmiederberg's arch. exper feeding response of adult tribolium to carbohydrates in relation to their utilization nutritional secondary hyper hieronymi: influence of nutritional conditions on the cellular rna metabolism in rive and in vitro diffusible auxin increase in a rosette plant treated with gibberellin transamination in muscular dystrophy and the effect of exogenous glutamate: a study on vitamin e deficient rabbits, and mice with hereditary dystrophy. canad effect of auxin on the emergence of lateral roots in p. mungo seedlings the effect of nutritive status on oestrus, ovulation, and graafian follicles in merino ewes biologische wirkungcn autoxydierter, epoxydierter und bestrahlter fette s~ure-basen-gleichgewicht mad chronische acidogene und alkalogene eruehrung effect of protein level in milk replacers on growth and protein metabolism of dairy calves effect of sodium bicarbonate in the drinking water of ruminants on the digestibility of a pelleted complete ration sucrose diet and bfliary chelate excretion in rats: with note on procedure for chelate determination eirdlub sauerer milcherzeugnisse auf die darmflora untersuchungen fiber die speicherung und die ausscheidung yon vitamin a nach ungenfigender vitamin-a-versorgung bei legenden hiihnern the effects of low magnesium intake on lactating ewes effect of vitamin a on the content of pyridinnucleotides, pyrovic, and lactic acid and on anaerobic phosphorylation. ukrainskii biokkimichnii zhurnal the pyridine nucteotides. a study of a method of measurement. a study of the alterations in rat fiver under the conditions of diabetes and starvation. a preliminary study of various marine fish tissues with the emphasis on the islet of langerhans iron uptake-transport of soybeans as influenced by other cations hshe und zeitpm~kt der dfingung yon sommerweizen mit chlorcholinchlorid zur vcrkiirzung der halmlange nutritional significance of soluble nitrogen in dietary proteins for ruminants effects of long-term feeding of vegetable fats on atherosclerosis effects of feeding various mile, corn, and protein levels on laying performance of egg production stock some observations on the influence of a magnesiumdeficient diet of rats, with special reference to renal concentrating ability effect of gibberellic acid on flowering of apple trees the effects of dietary fat and energy levels on the performance of caged laying birds motivational producing properties of the feeding system of the rat hypothalamus the influence of diet and age on lipid metabolism of chickens effect of age on the response of chickens to dietary protein and fat absorption and excretion of biotin after feeding minced liver in achlorhydria and after partial gastrectomy variations de la calcsmie du chien normal apr~s ingestion de cholesterol eristallis~ dans l' thanol ou dans rhexane changes in activity of rat epididymal adipose tissue in vitro due to time elapsed since last feeding differences in rat strain response to three diets of different compositions l'alcoolisme ~ l'hspital psychiatrique de colsou (martinique). ann. m~dico-psychologiques nitrogen, lipid, glycogen, and deoxyribonu. eleie acid content of human liver. the effect of brief starvation and intravenous administration of glucose some metabolic and nutritional factors affecting the survival of erythrocytes erythrocyte survival of rats deficient in vitamin e or vitamin b . j. nutrition [ ] nr. , s. / nutrition and lactation the effect of administering sodium chloride, sodium bicarbonate, and potassium bicarbonate to newly born piglets strontinm- and calcium in milk of miniature swine further studies on cariostatic effect of organic and inorganic phosphates urinary excretion of magnesium in man following the ingestion of ethanol the effects of magnesium compounds and of fertilizers on the mineral composition of herbage and on the incidence of hypomagnesaemia in dairy cows behavioral, dietary, and autonomic effects of ehlordiazepoxide in the rat the effect of a high and low sodium diet in a patient with familial periodic paralysis the effect of quaternary ammonium anion exchange resin on plasma and egg yolk cholesterol in the laying hen vitamin e deficiency and ion transpor~ in rat liver slices effect of level of nitrogen on growth and reproductive physiology of young buus and rams influence of low protein rations on growth and semen characteristics of young beef bulls, if treatment of nutritional cirrhosis in rats with choline and methionine; with special reference to fibrogenesis and fibroelasia probleme der beurteilung yon sauermilcherzeugnissen. milchwissen. schaft [ ] nr. , s. / . --antwort auf die diskussionsbemerkungen auf einer vortragstagung der gesellschaft ffir ern~hrungsbiologie e. v., miinehen, am response of early-weaned pigs to variations in dietary calcium level with and without lactose effect of low calcium diet on bone crysta]linity and skeletal uptake of sca in rats response of rural guatemalan indian children with hypocholesterolemia to increased crystalline cholesterol intake source of plasma free fatty acids in dogs receiving fat emulsion and heparin alcoholic intoxication in the newborn infant. bril mcd dental caries of rats fed a rice diet and modifications a study of zinc deficiency in the dairy calf effects of different levels of zinc and phosphorus on the growth of subterranean clover (trifolium subterraneum l) lrber den einflu yon fluor auf den wassergehalt des knochens gegevens ovvr vitamine b,-deficientie, -behoefto en -voorziening the liquefying action of pancreatic, cereal, fungal, and bacterial co-amylases ern~ihrung und endokrines system . mitt.: der einfiul der erniihrung auf die schilddriise the effect of saline water on kidney tubular function and electrolyte excretion in sheep zinc and iron deficiencies in male subjects with dwarfism and hypogonadism but without ancylostomiasis, schistosomiasis or severe anemia die yextriiglichkeit yon xyli~ beim diabetiker einflni der laevulose auf die fu~ionsbrelte. sport~tl. praxis der einflul~ yon vitamin a auf den zitronensi~urestoffwechsel studies on the growth-promoting value and digestibility of passion fruit seed oil ration effects on drylot steer feeding patterns dextrothyroxine on metabolism of cholesterol some effects of feeding lactates to dairy cows copper deficiency problems in south-east scotland bone changes in iron deficiency anaemia a preliminary report on nucleic acid levels in mineral deficient plants metabolism of histidine in protein malnutrition ttypoglycemie effect of l-leucine during periods of endogenous hyperinsulinism nutritional studies with the guinea-pig. viii. : effect of different proteins, with and without amino acid supplements, on growth some effects of sulphur-containing amino acids on the growth and composition of wool effects of hunger and vi value on vi pacing potency of conditioned reinforcem based on food and on food and punishment sfibwaren und karies in theorie und praxis effect of magnesium on the changes of myocardial potassium confent untersuchungen fiber den einflub oral verabreiehten oxytetraeyclins auf leberlipide und serumeholesterin der weil~en ratte further studies on manganese nutrition of tobacco in relation to virus infection and synthesis aminobutyric acid (),-aba)-vitamin be relationships in the brain serum lipids and diet: a comparison between three population groups with low, medium, and high fat intake effects of light and gibberellin on elongation of intact wheat coleoptiles experimental magnesium deficiency in the cow thyroid function in chickens and rats: effect of iodine content of the diet and hypophysectomy on iodine metabolism in white leghorns cockerels and long-evans rats kuhmilehallergie beim sgugling und a rapid method for assessing drug inhibition of feeding behavior variation in, and the effects of vitamins on vertieillinm albo-atrnm influence of high level vitamin a supplement on semen characteristics and blood composition of breeding bulls influence of diet on viral hepatitis der einflul] der stiekstoffdfingung auf die zusammcnsetzung yon karf~ffeleiweib insulin response to fructose and galactose the effect of excess vitamin a on the oxygen consumption of young female rats effect of dietary amino acid pattern on plasma ~mino acid pattern and food intake gibbere/lln: effect on diffusible auxin in fruit development effect of intravenous versus oral fat administration in fat-deiicient dogs plasma vitamin bi~ levels in some nutritional deficiency states nutritional factors influencing the conversion of tryptophan to niacin pancreatic hypertrophy and chick growth inhibition by soybean fractions devoid of trypsin inhibitor production, interior egg quality, and some physiological effects of feeding raw soybean meal to laying hens effect of palm jaggeries on the growth and blood and liver composition of albino rats kept on rice and jowa effects of polyphosphates on water uptake, moisture retention, and cooking loss in broilers untersuchungen fiber den ern~hrungsphysiologischen wer~ yon kasein entgegnung zu diskussionsbemerkungen auf einer vortragsveranstaltung der gesellschaft for ern~ihrungsbiologie e. v., mfinchen, am . juni die erg~nzungswirkung yon dl-methionin allein oder in kombination mit l-lysin beim wachsenden schwein der einflul~ der nahnmg auf den kauapparat der einflu der nahrung auf den kauapparat. teil il changes in bone mass and density in living rats during the manipulation of calcium intake effect of chromium, cadmium, and other trace metals on the growth and survival of mice a study of fermentation in the production of mahewu, an indigenous sour maize beverage of southern africa the vitamin b~ deficiency syndrome in human infancy, biochemical and clinical observations lipids in chick urine: the influence of dietary rapeseed oil effects of dietary nitri~ on the chick" growth, liver vitamin a stores, and thyroid weight influence of radioactive sodium- on higher nervous activity of dogs, when chronically administered into the organism in comparatively small doses the change of erythroeyte blood composition in persons with prolonged complete alimentary starvation (without limiting the water intake) and subsequent feeding dental abnormalities in rats attributable to protein deficiency during reproduction the effect of environment, and nutrition of pathogen and host, in the damping off of seedlings by rhizoctonia solani effect of dietary protein on fructose, citric acid, and -nucleotldase activity in the semen of bulls the effect of fluorine on dairy cattle. v. : fluorine in the urine as an estimator of fluorine intake some effects of diet and therapy on the survival and metabolism of adrenalectomized rats effect of methonine and choline deficiency on liver choline oxidase activity in young rats untersuchungen fiber die wasserlssliehen hemmstoffe aus dem chnittholz der weinrebe (vi~is vinifera l.). i. zur wirkung der hemmstoffe auf die keimlmg mad entwicklung yon rebs~mlingen nutrition and palatability. lancek the effect of feeding fluoride on some enzymes of bovine tissues diet and histamine in the rmninant effect of food reflexes on cholinestera~e activity of cortical tissue. federation essential fatty acid deficiency and rat liver homogenate oxidations the effect of vitamin and antibiotic injections on early turkey poult growth and mortality alimentary production of gallstones in hamsters. . studies with rice starch diets with and without antibiotics nitrogen studies with apple and cranberry the influence of diet on the quality of faecal fat in patients with and without steatorrhoea uber die unterschiediiche beeinflussung des tryptophansteffwechsels dutch vitamin b -mangel in der ratte. hoppe-seyler's influence of vitamin b and its coenzyme upon incorporation in rive of amino acids into tissue proteins in rats relativer vitamin b -mangel hei erkrankungen der schilddriise strukturanomalien der z/ihne bei vitamin d-mangel-raehitis und der vitamin d-resistenten renalen rachitis the effect of fluoride on bone effects of insulin on hepatic glucose production and utilization prevention by hydrocortisone of changes in connective tissue induced by an excess of vitamin a acid in amphibia acute hypervitaminosis a in guinea-pigs. i. : effect on acid hydrolases der einfiu~ yon vollkornbrot auf den calciumstoffwechsel bei schulkindern effect of feeding milk replacers with varying amounts of f~ for hothonsc lamb production egg yolk and serum cholesterol levels: importance of dietary cholesterol intake effect of protein intake on glutamic dehydrogenase and amino acid desruination in rive observations in experimental magnesium depletion effect of gibbercluc acid on growth, gibberellin content, and chlorophyll content of leaves of potato ~ physiological factors influencing growth, reproduction, and production of well-fed dairy heifers. i. age at first breeding. ii. feeding of diethytstilbestrol tm~ influence of diet on the development of parotid salivation and the rumen of the lamb bericht fiber den wissenschaftlichen kongreb der deutschen geseuschaft fiir erniihrung influence of variations in dietary calcium: phosphorus ratio on performance and blood constituents of calves the lack of a consistent chick growth response to norwegian kelp meal some effects of kinctin on the growth and flowering of intact green plants incorporation of [, p] orthophosphate into phospholipids of frog tissues during feeding and stmrvation growth-modifying and antimetabolite effects of amino acids in chrysanthemum a study of techniques used by advertisers in dealing with weight control. a national health problem lipid excretion. .: examination of faecal lipids of rats injected intravenously with serum lipoprotcin containing ~ac-labelled cholesterol effect of diet and diabetes on plasma glucose, fatty acid, and insulin effect of cigaret smoking during pregnancy: study of cases. obstetrics gynecology respiration and phosphorylation in live homogenates from rats exposed to hypoxia and food restriction the influence of mi]l~ fat depressing rations on the yield and composition of bovine milk phosphatides and cholesterol in the rat bed: effects of growth, diet, and age the effect of plant nutrients and antagonistic microorganisms on the damp. ing-off of cotton seedlings caused by rhizoctonia solani kurus l~utrition of gram-negative anaerobic bacilli. nutrition rev. effects of glucose on the production by escherichia coli of hydrogen sulphide from cysteine. j. general iylierobiol. enumeration of psyehrophilie microorganisms vitamin requirements of three pathogenic fungi calorie requirements of rat intestinal microorganisms specificity of the salt requirement of halobacterium cutirubrum influence of the aqueous potato extract and its fractions on growth and spore formation of the b. pumilus and the production of the antibiotic tetaine the relationship between hormonal activity and sugar metabolism in protoparce scxta (joka~sen) and blabcrus craniifer bur~ieister apparent incorporation of ammonia and amino acid carbon during growth of selected species of ruminal bacteria l~ber die wirkung anorganischer st~ube auf das wachstum yon mikroorganismen effect of dietary calcium lactate and lactic acid of faecal eseherichia coli counts in pigs uber den einflul des n~ihrsubstrats auf die hemmung des bakterienwachstums durch cyanid autoradiographic studies of the differential incorporation of glycine, and purine and pyrimidine ribosides by paramecium aurelia correlation between the essential amino-acid requirements of staphylococcus aureus, their phage types, and antibiotic patterns nutrition and metabolism of marine bacteria. xii.: ion activation of adenosine triphosphat~se in membranes of marine bacterial cells carbon dioxide fixation in bacillus anthracis bacterial growth under conditions of limited nutrition interrelationship between temperature and sodium chloride on growth of lactic acid bacteria isolated from meat-curing brines morphological variation and nutrition of a new monoeentric marine fungns feeding stimulants required by a polyphagons insect, schlstocera gregaria vitamin requirements of root nodule bacteria phcnotypic, genotypic, and chemical changes in starving populations of aerobacter aerogenes studies on the d-amino acids. ii.: utilization of d-amino acids by lactic acid bacteria role and formation of the acid phosphatase in yeast der einflub yon co~-partia]druck und glucose-konzentration auf wachsturn und stiekstoffbindung yon azotobacter chroococcum bei~ inorganic polyphosphate metabolism in chlorobium thiosulfatophilum effects of molybdenum, vanadium, tungsten, and cobalt on growth of rhizobia and their hosts nutrition of leptospira pomona. ii.: fatty acid requirements sterilization by beta-propiolactone of solid nutrient media for eultivation of moulds the digestion of natural food protein by the elearnose skate raja eglanteria (bose.) utilization of amino acids during metabolism in escheriehia coil the effect of nutrition on the growth of fasciola hepatica in its snail host cobamide coenzyme contents of soybean nodules and nitrogen fixing bacteria in relation to physio]ogical conditions determination of carbohydrate metabolism of marine bacteria the amino acid requirements of various types of shigella mushroom culture. factors affecting the growth of morel mushroom myecelium in submerged culture lebensmittelzusatzstoffe und mutagene wirkung. vii. : priifung einiger xanthen-farbs~offe auf mutagene wirkung an escherichia coll the biological control of glycogen metabolism in agrobaeterium tumefaeiens the maintenance requirement of escheriehia coll methionine requirement for growth of a species of mieroeoecus ~iorphogenesis and nutrition in the memnionella-stachybotrys group of fungi viable organisms from feces and food-s~uffs from early antarctic expeditions the metabolism of yeas~ sporulation. v. : stimulation and inhibition of sporulation and growth by nitrogen compounds the effect of lipids on citric acid production by an aspergillns niger mutant relationship between deuterium inhibition of growth and glucose catabolism in saecharomyees cerevisiae function of trehalose in baker's yeast (saccharomyces cerevisiae). arch. biochem preparation and lyophilization of colicine suspensions. i. production of eolicines in liquid nutrient media lvber den einflub der kulturbedingungen auf die stramenempfmdliehkeit der glueoseoxydation in baeterium cadaveris nutritional requirements and metabolism of myeoplasma laid-lawil j. gen. microbiol. die wirkung subletaler konzentrationen yon sorbinsi~ure auf escherichia coli und aspergillus niger the genetic analysis of carbohydrate utilization in aspergillus nidulans the amino acid nutrition of respiration deficient and respiration competent saecharomyces. a. van leewenhoek nutritional studies of some membem of mucorales. iv.: . sugars, amino-, and organic acids of the myceaium selektivn~hrboden fiir staphylokokken effects of certain amino acids in anthranilate production in neurospora crassa studies on the polysaccharide-fermenting lactic acid bacteria. in. : nutritional requirements and the existence of fermentation promoting factors for sucrose and inulin the catabolism of proteins and nucleic acids in starved aerobacter aerogenes aerobic fermentation and the depletion of the amino acid pool in yeast cells influence of hydrogen ion concentrations on the utilization of sodium nitrite by fungi oxidative metabolism of glucose in leaf tissues infected with tobacco mosaic virus differential effects of amino acid deficiencies on bacterial cytochemistry nutritional requirements of an aspergiuus niger mutant for citric acid production culture de tissus d'insectes ~, l'aide d'extrait d'embryon de poulet en l'absenee d' h molymphe. c. r. acad utilization of some carbon and nitrogen sources by pseudomorms fluorescens on the selection of microorganisms for use in bacterial fertilizers in vitro and -rive uptake of carbon- labelled alanine and glucose by ascaridia galli, parasitic nematode of chickens growth of psychrophiles. i. : lipid changes in relation to growgh-temperature reductions vitamin requirements of listeria monoeytogenes parasitism and nutrition of gonatobo~rys simplex the effect of alkali metals on the growth of staphylococcus pyogenes the uptake of potassium and rubidium by staphylococcus pyogenes metabolism of nucleic acids and of nucleotides in the course of synchronous development of azotobacter vihelandii studies on the biotin-oleie acid requirements of a lactobaeillus plantarum variant isolated from chick feces unusual response to iron-dextran. brit. *ned. j. , nr. , s. . +in'. n.: tissue trypsin and trypsinogen levels in pancreatitis skeletal development of suckling kittens rate of liver regeneration atherogenesis in the monkey the significance of serum triglyeerides anaemia and parasitism in man physiological mechanisms in nutritionally-induced differences in ovarian activity of mature ewes bone development in suckled pigs production performance of artificiauy and non~r~ifically sired herd-mates in wisconsin search for an unidentified nutrient in mammalian liver. part i.: growth studies with various ox liver preparations proline control of the feeding reaction of cordylo-phora relationship between longevity and production in holstein-friesian cattle circadian adrenal cycle in c mice kept without food and water for a day and a half metabohc pmduc~s form labelled ethanol. iv. : disappearance of ethanol-carbon from morphological fractions and lipids of rat tissues acetate utilization by maize roots vajda, ] . : i~c~sll-rcs of body fat and hydration in adolescent boys some characteristics of a proteolytic enzyme system of pseudomonas fluorescens some metabolic relationships between host and parasite with particular reference to the eimcriae of domestic poultry nucleotide degradation in the muscle of iced haddock (gadns aeglefinns), lemon sole (pleuronectes microcephalus), and plaice (pleuronectes platessa) effect of starvation and -mcthylprednisolone (m_edrol) on the acid phosphatase of rat liver and muscle metabolic patterns in preadolescent children. vii. : intake of niacin and tryptophan and excretion of niacin or tryptophan metabolites biochemical changes in fish muscle during rigor morris studies on ornithine synthesis in relation to benzoic acid excretion in the domestic fowl effect of manual total collection of feces upon nutrient digestibilities polarographie studies on storage of meats. xxii. : influence of proteolytic enzymes on the polarographie wave of beef protein solutions post-mortem changes in chilled and frozen muscle genetic-nutritional interactiions as affecting the early growth rate of chickens effect of unequal milking intervals on lactation milk, milk fat, and total solids production of cows changes with age in glutamic oxalacetic transaminase activity of sonically oscillated tureen juice compared to total steam volatile fatty acids in calves fed different roughages catecholamine metabolism and some functions of the nervous system a study of some of the conditions affecting the rate of excretion and stability of creatinine in sheep urine changes in feeding behavior after intracerebral injections in the rat kanzcrogene substanzen in wasser und boden food additives and contaminants and cancer milk allergy in infancy food poisoning due to salmonella cnteritidis vat the mineral element content of spring pasture in relation to the occurrence of grass tetany and hypomagnesaemia in dairy cows insecticide residues in meat. residues in body tissues of livestock sprayed with sevin or given sevin in the diet over de giftigheid van solanum-alkaloidcn toxic products in groundnuts zur beziehung zwischen lipidcn hepatotoxicity of foods: a consideration of the hepatotoxicity of a few phanerogams and eryptogams. their possible influence in the pathogenesis of cirrhosis and hepatoma food-poisoning potential of the enterococei vanadium. excretion, toxicity, lipid effect in man an institutional food-poisoning outbreak examination of market milk of novokuznctsk for brucellosis an outbreak of food poisoning in a mental hospital food allergy as a cause of abdominal pain radioactivity in the diet the effect of microbial contamination on the requirement of chicks for certain nutrients the acute oral toxicity of cottonseed pigment glands and intraglandular pigments sur l'absorption du edsium radioactif par rorge. c. r. hebd. s ances aead die experimentelle alimentiire lebernekrose a]s empfindlicher indikator bei thermiseher belastung der milch. uber die magermilehtroekntmg the comparative toxicity of ethylene dibromide when fed as fumigated grain and when administered in single daily doses repository polyvalent insect antigen treatment for patients sensitive to hymenoptera. a clinical evaluation precursors of carcinogenic hydrocarbons in tobacco smoke toxin production in naturally separated liquid and solid components in preparations of heated surface-ripened cheese inoculated with clostridium botulinum allergieversuehe am tier zur ~tiologie der sogenannten margarinekrankheit. dr. reed. wschr. [ ] nr. , s. / . --, allergenwirkung oder immunologisohe adjuvanswirkung in der ~tiologie der sogenarmten margarinekrankheit radium- in human diet and bone miodine in the thyroids of north american deer experimental groundnut poisoning in pigs cholesterol as carcinogen safety factors in water fluoridation based on toxicology of fluorides entelo epidemiologische gegevens over her ,planta-exantheen" te rotterdam, verkregen door enquetc-onderzoek planta-~x~ntheem" epidemie te rotterdam in de m~nden ~ugnstu~ en september salmonella-verontreiniging van plantaardige grondstoffen veer voedingsmiddelen van mens en dier increase of strontium- and caesium- sodium fluoride intoxication salmonellosis in the netherlands zwei seltene salmonellenfunde untersuehungen fiber die chronisehe toxizit~t der ascorbins~ure bei der ratte captan in green vegetables rfickst~nde yon pflanzenschutzmitteln, insektiziden mid dergleichen in der nahrung und ihre bedeutung ffir die gesundheit der gehalt der milch an j, ,~co ' u ba _{_ la in der deutschen )iileh in der zeit yon langfristige nutritive anwendung yon antibiotika in der tierern~hrung im hinblick auf die menschliche gesundheit mi~ besonderer beriicksichtigung yon chlortetrazyklin a milk-borne outbreak of food poisoning due to salmonella heidelberg ergebnisse yon schwebversuehen an farbstoffen zur farbmattierung yon tabakwaren nutritional secondary hyperparathyroidism of the cat insecticide residues in fat. a screening method for chlorinated pesticide residues in fat without cleanup untersuehungen fiber die quantitative verteilung radioaktiver falloutprodukte in milch too many vitamins radios~ron$ium removal from milk. determination of apparent equilibrium constants of the exchange reactions of sodium, potassium, calcium, and magnesium with amberlite ir- ern~hrungsphysiologische eigenschaften der margarine. fette, seffen, anstrichmittel smoking and cancer: retrospective studies and epidemlologieal evalution beobachtungen fiber den verlauf der alkoholkrankheit am krankengut einer heilanstalt die verschmutzung yon trinkwasser dutch i)etergentien grain fumigant residues. occurrence of bromides in the milk of cows fed sodium bromide and grain fumigated with methylbromide insecticide persistence. the disappearance of endrin residues on cabbage lebensm ittelchem u. gerichtl reproductive performance of female miniature swine ingesting strontium- daily toxicity of nitrate nitrogen to cattle methods of residues of phosphated insecticides and miticides in foods on bacillary excretion in food toxinfections of salmonella etiology relationships between the deposition of strontium- and the contamination of milk in the united kingdom staphylococci in cottage cheese is~cs and potassium in people and diet. -a study of finnish lapps effect of treatment of seeds with -chloroethanol on the resistance to boron toxicity in wheat seedlings desferrioxamin, eine neue das eisen bindende und eliminierende substanz zur behandlung der rim~rcn und sckund~ren i-i~mochromatose akuter eisenvergiftungen toxic products in groundnuts smoking, arteriosclerosis, and age the incidence of milk sensitivity and the development of allergy in infants einflul] yon fluor und jod auf den stoffwechsel, insbesondere auf die schilddrtise quelques exemples illnstrant la valour et l'utllit des m~thodes de lysotypie clans certaines salmonelloses humaines d'origine alimentaire food poisoning caused by panthogenic halophilic bacteria (pseudomonas enteritis txkikawa): %ep rt of four autopsy cases procaine penicillin g in milk following intramuscular injections comparative subacute toxicity for rabbits of citric, fumaric, and tartaric acids distribution of pesticides in fermentation products obtained from artificially fortified grape musts mercurial fungicidal seed protectant toxic for sheep and chickens the problem of salmonella food poisoning dietary factors in the pathogenesis and treatment of cirrhosis of the liver. med. clinics of north america an outbreak of salmonella food poisoning in l~ehmadabad town, kaira i)istxiet, gnjaxat ~a~e la tossinfezionl alimentarl da salmonella nell' spedale maggiore di milano dal al water intoxication due to oxytocin: reporb of a case c~sium- und kalium in menschlichen organen und in der milch / caesium- in dried milk produets in relation to phytoellmatic zones smoking and oral cancer occurrence of hepatomas in rats fed diets containing peanut meal as a major source of protein nachweis yon mangan- und kobalt- in pflanzen als fo]ge russischer kernwaffenvcrsuche e-ruhr-epidemie durch speiseeis bcricht fiber eine arbeitstagung bei der internationalen atomenergic-behsrde in wien vom . bis . i)ezember . i)t. lebensmittel-rdsch the development of microbiological standards for foods. j. milk food technol a case of breslau salmoneliosis caused by eating chicken internationales rundgespr~ch fiber lebensmittelchemische probleme in wiesbaden und eltvllle a. rh. ( vortragsreferate) staphylococcal infection of raw milk as a cause of food poisoning. monthly bull. ministry health pub carcinogenic effect of egg white, egg yolk, and lipids in mice eczema and cow's milk exitus letalis nach lebensmittelvergiftung dutch bacillus cereus repression of staphylococcus aureus by food bacteria. ii. : causes of inhibition a further report on the radioactive contamination of milk and milk products in japan. determination of strontlum- and cesium- concentrations in milk powder in japan concerning sporadic salmonelloses insecticide residues. extraction and cleanup studies for parathion residues on leafy vegetables salmonellosis epidemiology zur frage der deponierung yon nutrltiven allergenen im organismus. allergic, _~sthma allergic children with various symptoms caused by cows' milk messungen der umweltsradioaktivit~t und der radioak-tivit~t yon lebensmitteln im jahre ein cxperimenteller bcitrag zur tabakrauehkanzerogenese. dr. reed. wschr. [ ] nr. . u n. : contamination of leaves by radio active fall-out insecticide residues in milk and meat. residues in butterfat and body fat of dairy cows fed at two levels of kelthane ( . and insecticide residues in milk. residues in milk from dairy cows fed low levels of toxaphene in their daily rations tier-und pfhnzenerniihrnug _anlrnal and plant nutrition summary of ,tropical crops soil, analysis, and its relation to plant composition and growth fertilisers and plant nutrients ulcers in swine tnfluence of chelating agents on the concentration of some nutrients for plants growing in soil under acid and under alkaline conditions nutritional evaluation of permanent pastures with dairy cattle in louisiana the herbage intake of eattle grazing lucerne and cocksfoot pastures terminology and methods for feeding and weighing animals the effect of feeding on evaporative heat loss and body temperature in zebu and jersey heifers studies on the requirements and interaction of copper and iron in broad breasted bronze turkeys to weeks of age some factors affecting iron uptake by strawberry plants feed consumption during lactation and involution in sprague-dawley-rolfsmeycr rats the effect of variations in the energy and protein levels of the ration upon performance in the pig use of barley in high-efficiency broiler rations. . poultry sci tierarzncimittcl und aufzuchtmittel in der landwirtschaftlichen praxis. gesund. heitliche erwggungen zum schutze des konsumenten bei der anwendung yon tierarzneimitteln und aufzuchtmitteln in der landwirtschaftlichcn praxis, tell ii mechanism for movement of plant nutrients from soil and fertilizer to plant root growth rate of the tea leaf as determined by shade and nutrients. empire j. exper note on induction of flowering in ~railing shoots of clones of saccharum spontaneum effect of level and sequence of feeding and breed on ovulation rate, embryo survival, and fetal growth in the mature ewe evidence for a high zinc requirement at the onset of egg production aufnahme und wirkung des mikronghrstoffs knpfer in ionogencr und ehelatisierter form bei gerste effects of pelleting and varying grain intakes on milk yield and composition the relationship of gibberellic acid to flower initiation in column stock, math~ela incana the effect of selenium administration on the growth and health of sheep on scottish farms the horsebean (vicia faba l.) as a vegetable protein concentrate in chick diets size and feeding of different types of fishes the influence of age of chicks on their sensitivity to raw soybean oil meal influence of the mineral nutrition on the resistance of peach trees to fusicoceum amygdali de la croix granular fertilizer. influence of associated salts on plant response to dicalcium phosphate a comparison of feeding growing pigs once or twice daily the interrelation of nutrient supply, leaf nutrient content, and vegetative growth of ilcx crenata 'green island' effect of rationing grass on the growth rate of dairy heifers and on output per acre, with a note on its significance in experimental design experiments on the nutrition of the dairy heifer. iv.: protein requirements of -year-old heifers grass silage vs. hay for lactating dairy cows hay vs. silage for two to six months old dairy calves weaned at or days effects of high levels of copper and ehlortetracycline on performance of pigs seedlings resistance of corn to leaf feeding of the european corn borer ostrina nubflalis ease of hydrolysis of the hemieeiluloses of forage plants in relation to digestibility bodenkde. [ ] mr. , s. / . --caleium-mangelsymptome an h heren pflanzen effects of frequency of feeding on urea utilization and growth charae%oristics in dairy heifers factors affecting the voluntary intake of foods by cows. . : a preliminary experiment with ground, pelleted hay the relationship of dietary energy level and density to the growth response of chicks to fats salt tolerances of pinus thunbergii compensatory carcass growth in steers following protein and energy restriction a guide to production, care, and use of laboratory animals. federation prec. estimation of essential fatty acid intake in swine automatic dispensing at frequent regular intervals of liquid diet for piglets chelation in nutrition. chelates and the trace element nutrition of corn der einflul yon humuss~ure auf wachstum und ver~inderungen des freien zuekergehaltes bci winterweizenpflanzen, die im dtmkeln kultiviert wurden a comparison of the growth of chicks in the gustafsson germ-free apparatus and in a conventional environment, with and without dietary supplements of penicillin an evaluation of weed competition and the effects of weed extracts and leachates on the development of field corn (zea mays l.) and oats (arena sativa l digestibility of rations containing different sources of supplementary protein by young pigs the effects of urea supplements on the utillzation of straw plus molasses diets by sheep production performance of artificially and nonartifieiallysired herd-mates inwiseonsin dietary phosphorus for laying hens tolerance to acid soil conditions in barley effect of calcium and magnesium upon digestibility of a ration containing corn oil by lambs effects of caloric restriction during the growing period on the performance of egg-type replacement stock untersuchungen fiber die verwertung yon calcium-und phosphorsalzen aus fisehgr~itenmehl einigcr frischfische und fischkonserven bei der verffitt~rung a nut~rient requirement for optimum water absorption by intact root systems preparation of feed for animal nutrition experiments responses of winter wheat to nitrogen and soil nitrogen status studies on calcium requirements of broilers znm problem dcr nahrungspflanzenwabl der aphiden some factors affecting leaf development and longevity and the subsequent yield of corn grain efficiency of energy utilization by young cattle ingesting diets of hay, silage, and hay supplemented with lactic acid the effects of a plant steroid on body weight and feed efficiency of broilers feeding troughs for guinea-pigs beitrag zur ]~ackfruchtmast mit schweinen unter besondercr beriicksichtigung des n~hrstoffgehaltes der beifuttermischungen und der the feeding of thyroprotein to lactating sows zur planung, durchfiihrnng und auswertung yon schweinemastvcrsuchen bei gruppenfiitterung the influence of barbituric acid derivatives on the development of plant roots and root hairs factors affecting the voluntary intake of food by cows. .: the relationship between the voluntary intake of food, the amount of digesta in the retieulo-rumen, and the rate of disappearance of digesta from the alimentary tract with diets of hay, dried grass or concentrates artificial food for oak-silkworm raising the comparative toxicity of ethylene dibromide when fed as fumigated grain and when administered in ingle daily do~ ~ gibberellin at the vineyard oak wilt development and its reduction by growth regulators. i. production and activity of oak wilt fungus pectinase, ecmulase, and auxin. ii. effect of halogenated benzoic acids on oak trees, the oak wilt diseases, and the oak wilt fungus regulating nutrient intake in laying hens with diets fed ad libitum some effects of different soils on composition and growth of sugar beet production of fodder yeast from barley. i. preliminary studies on the use of the waldhof fermenter development and nutrition of new species of thranstochytrium studies on the effect of frequency of feeding upon the biology of a rabbit-adapted strain of pediculns humanus the influence of previous vitamin k nutrition on thromboplastie activity of brain extract the effect of nutrition conditions on the growth of and nitrogen accumulation by fodder beans when sown together with indian corn. i)oklady akad. nauk sssr dutch phenylbors~ure induzierte fragen der resistenzsteigerung in der modernen gefliigelhaltung chelation in nutrition. metal chelates in plant nutrition beziehungen zwischen dcm kupfergehalt und dem zeitlichen auftreten yon kupfermangelsymptomen an hafer in wasserkultur mit kleincn bodenmengen increased tolerance of bean plants to soil drought by means of growth-retarding substances effect of monocaleium and diammonium phosphates on crop yield, and their influence on soil solution movement and characteristics when associated with different salts effect of barbituric acid and ehlortetraeyeline upon growth, ammonia concentration, and urease activity in the gastrointestinal tract of chicks effects of feeding low levels of dimethoate on milk and on whole blood cholinesterase activity of dairy cattle die ziichtung von fleischschweinen und die folgeerscheinungen, die sich besenders im hinblick auf die qualit~t yon fleiseh und fett ergeben the relationship of specific nutrient deficiencies to antibody response in swine. i.: vitamin a. ii.: pantothenie acid, pyridoxine or riboflavin further studies of diet composition on egg weight effect of various levels of fluorine, stilbestrol, and oxytetraeycline, in the fattening ration of lambs studies on the properties of l~ew zealand butterfat. viii. the fatty acid composition of the milk fat of cows grazing on ryegrass at two stages of maturity and the composition of the ryegrass lipids soil potassium and the growth of vegetable seedlings artificial feed for silkworm, bombyx mori some effects of feeding stilbestrol, ehlortetracyeline, and penicillin with alfalfa soflage on steer performance and carcass quality food agric. [ ] nr. , s. / . --, mineral supplements for sheep the influence of higher volatile fatty acids on the intake of urea-supplemented low quality cereal hay by sheep untersuehungen fiber den umsatz waehsender sehweine ab geburt. . mitt growth of edible emorophyllous plant tissues in vitro chelates in agriculture. metal chelation by glucose-ammonia derivatives economic analysis of high-level grain feeding for dairy cows evaluation of the dacron bag technique as a method for measuring cellulose digestibility and rate of forage digestion n~ihrlssungen fiir zuckerriiben in wasser-und sandkultur activity of gibbereuin:'d' on the germination of photosensitive lettuce seeds raw and heat-treated soybeans for growing-finishing swine, and their effect on fat firmness ration effects on tureen acids, ketogenesis, and milk composition. i.: unrestricted roughage feeding a new growth stimulant, ~ growth hormone the effects of specific viruses, virus complexes, and nitrogen nutrition on the growth, flowering, and mineral composition of strawberry plants peculiar feature of respiration and redox processes in the rice plants grown under different nutritional conditions einflul] der silagefiitterung auf die qualit~t yon milch und milchprodukten. . mitt. : einflul] der silagcfiitterung auf die organoleptischen eigenschaften dcr milch effect of grinding and pelleting on the utilization of coastel bermuda grass hay by dairy heifers langfristige nutritive anwendung yon antibiotika in der tierernlihrung im hinblick auf die menschliche gemmdheit mit besonderer beriicksichtigung yon chlor-te~azyklin mode of action of growth retarding chemicals yield of sugarcane in louis-ana as influenced by soil moisture status and climate diss effect of auxin on the emergence of lateral roots in p. mungo seedlings compound mouse diets a semipurified caries-test diet for rats present status of feeding antibiotics to htctating dairy cows effect of sodium bicarbonate in the drinking water of ruminants on the digestibility of a pelleted complete ration semi-purified diets for sheep effect of vacuum-drying, freeze-drying, and storage environment on the viability of pea pollen. ii. : effect of boron, sucrose, and agar on the germination of pea pollen hshe und zeitpunkt der diingung yon sommerweizen mit chlorcholinchlorid zur verkiirzung der halml~nge nutritional signifieance of soluble nitrogen in dietary proteins for ruminants primary signs of nutritional deficiencies of laboratory animals ]~ffe[~b of dlctary pro~in and fat on growth, protein utilization, and carcass composition of pigs fed purified diets trans-fetts~iuregehalt yon schweineschmalz nach fiitterung yon sehweinen mit rindertalghaltigem kraftfutter. (ein beitrag zur quantitativen infrarotspektroskopischen bestimmung yon trans-fetts~uren in fetten effect of liming and potassium fertilization on soil solution and on yield and composition of alfaffa and orchard grass mixtures effects of feeding various milo, corn, and protein levels on laying performance of egg production stock effect of gibberellie acid on flowering of apple trees the effects of dietary fat and energy ]evels on the performance of caged laying birds effect of age on the response of chickens to dietary protein and fat chemical control of flowering. concentration of a floral-inducing entity from plant extracts strontium- and calcium in milk of miniature swine studies on the properties of newzealand butterfat. vii. effect of the stage of maturity of ryegrass fed to cows on the characteristics of butterfat and its carotene and vitamin a contents new radioactive tests show how termites feed mechanisms regulating the feeding rate of daphni~ magna straus influence of low protein rations on growth and semen characteristics of young beef bulls a study of zinc deficiency in the dairy call effects of different levels of zinc and phosphorus on the growth of subterranean clover (trifolium subterraneum l.). australian j. agrie absorption, translocation, exudation, and metabolism of plant growth-regulating substances in relation to residues the effect of the performance of growing pigs of the level of meal fed in conjunction with an unrestricted supply of whey increase in yield of legumes by fer~iliser mixture with lime chemically defined medium for growth of animal cells in suspension dis sieherung der eiweiljwrsotgung in dor l~ndwirt influences of previous calcium and phosphorns intake and plant phosphorus on the requirement of developing turkeys for calcium and phosphorus relationship between isotopicauy exchangeable calcium and absorption by plants effect of adding buffers to all-concentrate rations on fcedlot performance of steers, ration digestibility, and intrarumen environment lysine supplementation of corn -and barley-base diets for growing-finishing swine the effect of gibbereliin on the germination of seeds of arboreal plants effect of physical state of coastal bermuda grass hay on passage through digestive tract of dairy heifers nitrate reduction and carotene stability. effect of nitrate and some of its reduction products on carotene stability, d. agric. food chem chemical preparations for plant protection untersuchungen fiber die ksrperzusammensetzung und den stoffansatz waehsender mastschweine und ihre beeinflussung dutch die erniihrung. . mitt the cobalt requirement of sub-~erranean clover in the field comparing mile and corn in broiler diets on an equivalent nutrient intake basis effect of mineral nutrition on the invasion and response of turnip tissue to plasmodiophora brassicae wor the relation of chlorogenic acid and total free phenols in potato plants to resistance to infection by verfieillium alboafxum nitrogen and potassium as variables influencing soluble nitrogen and organic acid accumulation in soybean (glyoine max). di~s. abstr. bett~r british beef and barley feed. veter effect of nitrogen fertilization upon yield and digestibility of aftermath timothy forages fed to dairy heifers ration effects on dltlot steer feeding patterns effects on zea mays seedlings of a strontium replacement for calcium in nutrient media evaluation of albumen quality in a poultry breeding program nutritional studies with the guinea-pig. viii. : effect of different proteins, with and without amino acid supplements, on growth some effects of heredity and environment on appetite in dairy animals further studics on manganese nutrition of tobacco in relation to virus infection and synthesis amillo acid supplementation of pig diets chelation as a basic biological mechanism der einflu~ der stiekstoffdiingung auf die znsammensetzung yon kartoffeleiweiil z studies on photosynthesis. i. : biosynthesis of sucrose from glycolate. par~ ii. : bicarbonate utilization by washed algae production, interior egg quality, and some physiological effects of feeding raw soybean meal to laying hens alteration of post-mortem changes in porcine muscle by preslaughter heat treatment and diet modification chelation in nutrition. soft microorganisms and soil chelation. the pedogenie action of lichens and lichen acids die ergiinzungswirkung yon dl-•ethionin allein oder in kombination mit l-lysin beim wachsenden schwein untersuchungen iibcr den einflub unterschiedlicher wasservemorgung auf ertr~ge, gehalte an ~therischem , trenspirationsquotienten, biattgrsl~en und relative ~)ldriisendichtsn bei einigen arten aus der familie der labiaten. . teil gehalte an iitherischem , transpirationsquotienten, blattgrsflen und relative -driisendichten bei einigen arten aus dcr familie der labiaten. iii.: blattgrsflen, relative driisendichten, anzahl am haupttricb inseriertcr blattpaare und internodien. liingen cadmium: uptake by vegetables from supcrphosphate in soil studies on the protein and methionine requirements of young bobwhite quail and young ringnecked pheasants chelation in nutrition. evidence for natural chelates which aid in the utilization of zinc by chicks selective fertilization of apple-trees some soils and fertilizer relationships of the cavendish banana (muss cavendlshl lambert) on three different soils in costa rice soil organic phosphorus and the phosphorus nutrition of plants the effect of heat treatment on the nutritive value of milk for the young calf. . : a comparison of spray-dried skim milks prepared with different preheating treatments and roller.dried skim milk, and the effect of chlortetracyclinc supplementation of the spray-dried skim milks the effect of heat, ~reatmen~ on the nutritive value of milk for the young calf. . :the effect of the addition of calcium a biological assay for metabolizable energy in poultry feed ingredients together with findings which demonstrate some of the problems associated with the evaluation of fats feed additives in livestock rations: part i. : urea in dairy rations. part.i/: use of thyroprotein in cattle nutrition diet and histamine in the ruminant synthetic ion-exchange resins as a medium for plant growth nutrition of vibrio fetus theoretical basis of unicellula algae cultivation amino acid supplementation of barley diets for growing swine some effects of . -dichiorophen-oxyacetic acid on swect corn (zea ]~ays rugosa l.) with emphasis on yield, tillering, root development, and exudation of electrolytes from roots and stems. i)iss. abstr feeding and management of broiler strain breeder hens relationships among seven elements in the nutrition of corn in sand culture an external effect of inorganic nitrogen on nodulation influence of enzyme supplements in lamb fattening rations gravenstein and jonathan apples produced with giberellle acid the role of carotene in the dairy cow. wiss. ver ff. dr. ges. ern~hrung vitamin a-wirksamkeit der carotine bei versehiedenen tierarten. wiss. ver- ff. dt. ges. eru~hrung upgrading the indigenous poultry of uganda. i. : the growth rates and feed conversion from hatching to maturity of indigenous poultry crossed with four imported breeds effect of different kinds of litter on growth and feed efficiency in chick rearing investigation of the mineral nutrition of datura innoxia the effect of flooding in the availability of phosphorus and on the growth of rice nutrition of the boll weevil larva ascorbic acid in the nutrition of plant-feeding insects effects on the s~maeh worm, i-iaemonehus contortus, of feeding lambs natural versus semipuriiicd diets yield and foliar composition of corn as affected by fertilizer rates and environmental factors. i)iss. abstr. ~ [ ] nr praktische erfahrm]gen in der carotinoidversorgung yon vsgeln effect of protein-energy relationship on the performance and carcass quality of growing swine the use of quarter samples in the assessment of the effects of feeding treatments on milk composition * calcium and phosphorus requirements of finishing broilers using phosphorus sources of low and high availability amino acid supplementation of peanut meal diets for broiler chicks the effects of feeding various levels of vitamin a on chicks with cecal coccidiosis chelation in nutrition. review of chelation in plant nutrition water use by irrigated arabia coffee in the failure of certain dietary ingredients to affect the incidence of blood spots in chicken eggs dcr einflul~ der anbauverh~ltnisse auf die eigensehaften der kartoffelknolle und der st~rke effect of feeding milk replacers with varying amounts of fat for hothouse lamb production l~hysiologieal factors influeneelng growth, reproduction, and production of wcll-fed dairy heifers. i. ago at first breeding. ii. feeding of diethylstflbestrol results of an experiment ot rothamsted testing farmyard manure and n, p, and k fertilizers on five arable crops. i. : yields results of an experiment at rothamsted testing farmyard manure and n, p, and k fertilizers on five arable crops. ii.: nutrients removed by crops the utilization of carotenoids by the hen and chick some effects of potassium and lime on the relation between phosphorus in soil and plant, with particular reference to glasshouse tomatoes, carnations, and winter lettuce the lack of a consistent chick growth response to norwegian kelp meal further studies on protein and energy requirements of chicks selected for high and low body weight some effects of kinetin on the growth and flowering of intact green plants individual feed consumption of turkey breeder hens and the correlation of feed intake, bocly weight, and egg production crop analysis technique for studying the food habits and preferences of chickens on range supplemental value of turkey protein for wheat herbicides and plant growth regulators preparation of purified ration for chick. parg iv. : preparation of crystalline amino acid diet evaluation of algae as a food for human diets the influence of milk fat depressing rations on the yield and composition of bovine milk the effect of plant nutrients and antagonistic microorganisms on the damping-off of cotton seedlings caused by rhizoetonia solani kukn ki;-;sehe ern~ihrung und di~tetlk clinical nutrition and dietetics a statement approved by the board of directors of the canadian heath foundation radioactivity and human diet probleme der ern~hrung durch gefrierkost. sympomum der d utschen gcgell~ehaft ffir ernghrung veto . bis . mgrz klinische ernghrungslehre" und wissenschaftlicher kongreb der deutschen gesellschaft flit ern~hrung an der johannes-gutenberg-universit~t mainz veto . bis wissenschaftlicher kongreb der deutsehen gesellschaft ftir ern~hrung an der johannes-gutenberg-universit~t mainz am . und ernghrung nnd digit" . deutseher kongreb ffir ~irztliehe fortbildung in berlin veto . bis arbeitstagung fiber klinisebe ernghrungslehre. ern~ foods of the future (forts.). problems in space foods and nutrition. foods for extended space travel and habitation the question of fats. ii.: fats and disease behandlung fettbedingter gerinnungsstgrungen mit lipostabil sugar and dental caries obesity and sugar addiction hunger and malnutrition lancet nutrition and general practice bericht fiber die vortragstagung des fachverbandes lebensmittelchemie der chemischen gesellschaft in der ddr vom arbeitstagung fiber kommission ffir volksern~hrung, lebensmittelgesetzgebung und -kontrolle (eek) zu yi~inden des eidg the national diet-heart study low fat diet in familial mediterranean fever the thyroid gland in infant malnutrition evaluation of fao amino acid reference pattern studies on the physiology of nutrition in surinam rickets in southern israel diet and heart disease maiskeims in ernt~hrung und ditltetik apha conference report safe and nutritious food supply malnutrition and disease expert committee on medical assessment of nutritional status protein malnutrition the fat tolerance curves of patients with hyperllpidcmia and athcrosclcrosis die lactose im rahmen der ernt~hrung effect of environment on nutritional status zur theorie und praxis der zuckerkrankheit. wiener z dietetically induced experimental flous of rats physikalisch-diiitetische therapie yon hautkrankheiten. arch. phys. therapie err~hrungsforschung [ / ] :nr. , s. / . +bo rtiw~l, p. w. : milk-borne disease consumers' reactions to instand foods de voeding van woonwagenbcwoners experimental investigations on nutrition and human behavior. a post-script. amer di~tetische therapie der chronischen herzinsuffizienz construction and validation of the food attitude scale why we have a safe and wholesome food supply use of food in a psychiatric setting stature and nutrition in cystinuria and hartnup disease ])as endokrinologisehe syndrom des proteinmangels urinary excretion of . -dlhydroxyphenylalanine (dopa) in two children of short stature with malnutrition current problems affecting consumption of milk and indnstry's response to them preparedness for emergency feeding fluoridation and public relations dieetprodukten in vlaanderen incorporation of labelled glycine into erythrocyto glutathione of rabbits; effect of nutritional muscular dystrophy hot wcreldvoedselvraagstuk sniker -glycogcen -tandbederf dietary in take in patients with arthritis and other chronic diseases a clinical trial of iron-fortified bread effects of freater intake of milk, fruits, and vegetables joint fao/w/to expert committee on nutrition" fiber eine sitzung in genf vom . his serum cholesterol in a military population. its relation to obesity and the military diet ). ~z~, a. c. some nutritional problems of older age groups neuere biochemisehe untersuchungen zur diagnostik und therapie yon b-vitamin-mangelzust~nden call-harvard nutrition project. iii.: the erythroid atrophy of severe protein deficiency in monkeys cali-harvard nutrition projeet. ii. : the erythroid a~rophy of kwashiorkor and marasmus zur hshe des erwiinsehten fottverbrauehs ern~hrung des sportlers voeding moderne ern~hrungsbedarfsnormen. i. mitt. z. ges. hyg. [ ] nr. , s. . *--~ioderne ern~hrungsbedarfsnormen. . mit~.: z. ges. hyg. a comprehensive home-care program for the chronically ill fat-modified foods for serum cholesterol reduction besonderheiten der ern~hrung alter menschen chronic malnutrition in turkey. v. studies on serum fatty acids in malnourished children prevention of ,meat anemia" in mice by copper and calcium beeinflussung der sportlichen leistungsfdhigkeit dutch eine geeignete er-n~ihrung physiology of adolescence. ii, l~u-trition -basal oxygen consumption -energy expenditure and balance -nitrogen metabolism -calcium metabolism -iron metabolism -red cell mass and hemoglobin ern~hrungsproblemc bei chirurgisehcn kranken. wiss. versff. dr. ges. ern~hrung moderne vitamin b~-therapie: oral, rektal oder parenteral ? a palatable diet for producing experimental folate deficiency in man smoking in hospital was ist hungern und was heibt the cultivation of tflapia. this prolific fish as a fine source of proteinrich food in underdeveloped areas pyridoxine supplementation during pregnancy. clinical and laboratory observations on japanese foods nutritional sequelae of ga~trio surgery koehsalzarme kost und nierenerkrankungen theoretische und praktische grundiagen der ernilhrung in der fett in der diabeteskost foods or supplements? g zur hygiene und ,di~tetik des rauehens zur dis behandlung der uremic anaphylactie shock of the lungs triggered by mieroaspiration of cows' milk: a form of sudden unexpected death in early infancy the food service industry and its relation to the control of foodborne illness die konservative therapie des peptischen gesehwiirs gezondheid op reis addictive aspeeta in heavy cigarette smoking die ern~hrung im rahmen de~ heilvers im kurort the diet in renal faiiure is the rationale for gaetrointestinal diet therapy sound? familie-beruf-ern~hrung die dfiitbehandiung der leberkrankheiten. i)t. reed vom hunger his zum ~beritul -weltweite ern~hrungsprobleme die bedeutung der vitamine in der t~tglichen erniihrung s~ure-und baseniibersehiissige naln'ung. therapiewoehe [ ] nr. , s. / . --ss und chronische acidogene und alkalogene erns z. em~hrungswiss industrial lunches and public health the assessment of nutritional status in man: chairman's opening remarks therapie der essentiellen hypertonie speeifieke voedings-en voorlichtingsproblemen in tropische landen wie kann man unsere kos~ und unsere kostgewohnheiten beeinflussen? neue konzeptionen in der wasser-und salzsubstitution some aspects of the relation of nutrition and pregnancy is coronary heart disease preventable? world hunger demineralization of whey. use of its protein in infant feeding elaidinized olive oil and cholesterol atherosclerosis soziatrischo aspekte dee genul)-und arz~aeimittelkonsums verwendung yon htilsenfriichten in der diabetiker-di~t sanitation and dishes sanitation and dishes. aspects old and new. part ii smoking, arteriosclerosis, and age neue weg~ zur erni~hrungsphysiologi~chen aufwertung yon getreide-erzeugnissen diphyliobothrium latum and human nutrition, with particular reference to vitamin bli deficiency milk and diverticulosis dietary factors in the pathogenesis and treatment of cirrhosis of the liver. ivied. clinies north america zur frage der quanti~tiven charakteristik der ern~hrung der berufst probleme der gemeinschaftsverpflegung aus der sieht des ern~hrungsphysio-logen gegevens over vitamine b,-deficientie, -behoefte en -voorziening use of government-donated foods in a rural community fluoride, teeth, and the analyst eiweil bedarfsnormen im rahmen unserer ern~hrungsrichts~tze physiologic discomforts in navy protective shelter tests di~tvorschl~ige : akute nierenentziindtmg siii]waren und karies in theorie und praxis ernehrtmgsberatung im krankenhaus use of a low-sodium formula as an improved karell diet, with emphasis upon the outpatient management of heart failure and lymphedema pflanzliches eiweil~ fiir die erniihrung des menschen influence of diet on viral hepatitis influence of siblings on student smoking patterns voeding yon leerlingen van een lagere technische school. ii. calorie~n-en nutri~ntenwaarde early use of circulating blood volume, weights, and normal diet in acute renal failure fette in tier nahrung. dr. recd. wschr. [ ] nr. , s. / . *scm~-idt-b~bach, a.: ~j~ber die di~tetik der sauermilch begriff und anfgabe di~tetischer lebensmittel zur ursache yon geschmackskalamit~ten in trinkwassertalsperren psychologische motive im wandel des brotverzehrs improving levels of nutrition through better food practices the vitamin b~ deficiency syndrome in hun~n infancy. biochemical and clinical observations treatmen~ of ,refractory obesry" with ,formula die~ nr. , s. ff. ( s.). elwzea, c. c.: morphologie constitution and smoking prediction the outcome for obese dieters symposium fiber probleme der ern~hrung durch gefrierkost in karlsruhe yore coordination of long-term care of :pku children die di~t bei diabetes meuitus a nutritional supplement (nutrament) for elderly patients dietary intake of five groups of subjects. -hr. recall diets vs. dietary patterns the prolonged effects of a low cholesterol, high carbohydrate diet upon the serum lipids in diabetic patients stand und perspektiven der eiwei~versorgung. zur zielsetzung des verhandiungsthemas de voeding van rijswerkers signification des standards calorieo-azotes utilists en france erg~nzungen veto standpunkt des lebensmittelehemikers zu (dem beitrag) official acceptance of homogenized milk in the united states advances in nutrition and dietetics nutrition of naval recruits during a shelter habitability study hot ombuigen van voedingsgewoonten de voeding van schippemkinderen san boord en in de internaten voeding [ ] iqr. , s. / . --le traitement de rinsuffisance rtnale her zoutarme-eiwitarme dicer ein beitrag zur allgemeinen el problematik, ausgehend yon einer anorexia nervosa rroblem~ in nutrltlon~l supplementation an ~mri~hnl~ut detection of nutritional imbalances theorie und praxis der schwangerenern~hrung die pharmakologisehe beeinflussung yon hunger mad s~ttigung problems in the evalutaion of nutritional status in chronic illness europ~ische di~ttagung in amstercl~m ( . bls entwieklung des brot-und gctreidevcrzehrs in der neuercn zeit nutrition and palatability coehac dis~tse. -biochemical and technological aspects die di~itetik der :fettsucht kinderemll}~mg nutrition of infants and children report on infant feeding childhood nutrition in lapland. :nutrition rev the thyroid gland in infant malnutrition. nutrition rev. [ ] nr. , s. . +n. n.: physical activity of obese girls appraisal of nutritional adequacy of infant formulas used as cow milk substitutes isomaltose intolerance causing decreased ability to utilize dietary starch passagere hypereh]or~mische azidose bei zwei ausgetragenen s~uglingen wghrend s~uremflch-em~hrung ober erfahrungen in der frfihgeborenonaufzucht mit einer neuen bedarfsangepa ten friilmahrung nutritional defects in adolescence g p intravenous glucose tolerance in the normal newborn infant: the effects of a double dose of glucose and insulin ! attitudes towards physical ac. tivity, food, and family in obese and nonobese adolescent girls urinary excretion of . -dihydroxyphenylalanine (dopa) in two children of short stature with malnutrition high salt content of western infant's diet: possible relationship ~o hypertension in the adult vitamin e to premature infants des enzym yon ~'leming (lysozym) und seine bedeutung flit die si~ugllngsern~ihrung. ann investigation on the relation of between-meal eating and dental caries of sixth-year molars in school children studies in infantile malnutrition. i.: nature of the problem in peru chronic malnutrition in turkey. v. studies on serum fatty acids in malnourished children emi~hrnng mad faekalc lysozymaktivitiit beim s~iugiing role of linoleio acid in infant nutrition factors related to the eating behavior and dietary adequacy of girls to years of age. dies. abstr. des vitamin c im jugendalter. ii. mitt.: uber die wirkung yon natiirlichem und synthctisehemvitamin c bei l~ngeren zugaben the incidence of protein-calorie malnutrition of early childhood ein besonderer znsammenhang zwischen dem bedarf an nahrungsfett und dem stoffwechsel in den ersten lebensjahren the effect of supplements of groundnut flour or groundnut prorein isolate fortified with calcium salts and vitamins or of skim-milk powder on the digestibility coefficient, biological value, and net utilization of the proteins of poor indian diets given to undernourished children lysine fortifications of wheat bread fed to haitian school children lrber den -stunden-rhythmus der kalorienerzeugung bei friihgeborenen beitriige zur frage der spezifisch-dyrmmi~ehen wlrkung auf grund yon glykokoll-belastungen bei friihgeborenen. acta paediatriea acad carotine in tier stiuglingserni~hrung. wiss. versff. d$. ges. ernkhrung liver and depot lipids in children on normal and high carbohydrate diets response of rural guatemalan indian children with hypocholesterolemia to increased crystalline cholesterol intake feeding value of soy milks for premature infants early feeding and birth difficulties in childhood schizophrenia. a brief study zusammenh~nge zwischen stoffwechsel und fl~issigkeitsbedarf beim s~ug-ling kuhmilchauergie beim s~ugling und ,cot death". die unspezifische kumulative sensibilisicrung malnutrition and the health of children practical aspects of infant feeding breast-feeding, weaning, ~nd acculturation appetithemmer in der ]~ehandlung der fettsucht bei kindern. miinchener med the effect of different amounts of vitamin d on growth and serum levels of calcium, inorganic phosphorus, and alkaline phosphatase in premature infants partition of urinary nitrogen in children with kwashiorkor treated with animal and vegetable proteins der einflud yon voukornbrot auf den caleiumstoffweehsel bei schulkindern ist eine rektale vitamin blz-behandlung vertretbar? dr ethyl alcohol in the pathogenesis of gout c|e~rance of infused fat emulsion in diabetic dogs praktische durehffihrung der parentcralon ern~hrung die parcntcrale ern~hrung chirurgischor paticntcn. wiss. ver ff. dr. ges. em~hrung verwertung intravenss verabfolgter aminos~,urengemische. wiss. versff. dr. ges die erkcnnung yon fe~-transportstsrungen und ihre bedeutung ftir die intra-ven se fettzufuhr intravenous glucose tolerance in the normal newborn infant: the effect of a double dose of glucose and insulin new intravenous fat emulsion indikationen und kontraindikationen der intraven sen fettzufuhr in der chirurgie anwendung intraven s gegebener aminos~urengemische in dcr p~diatrie ymtravensse fettinfusionen. wiss. versff. dt. ges. ern~rung ~qotwendigkcit und erfolge der parenteralen mad sonder-ern~hrung moderne vitamin blz-therapie: oral, rektal oder p~renteral? mcd anwendung intravenss gegebener aminos~urengemische in der gyn~kologie und geburt~hilfe aminos~ureninfusionen. schweiz. reed. wsehr. klinische anwendung mad erfahrungen bei der verabreiehung intravensser fettemulsionen an ehirurgischen patienten diskussionsbemerkung zum thema: die parenteralo ern~hrung ~tude exp~rimentale de la tolerance d'une solution de graisse vsgstale d'administration intraveineuse ern~hrungsphysiologische grundlagen der parenteralen ern~hrung erfahrungen mit der parenteralen ern~hrung mittels fettinfusionen. helvetica chirurgica aeta nitrogen, lipid, glycogen, and deoxyribonucleic acid content of human liver. the effec~ of brief starvation and intravenous administration of glucose techn~ und indikationen der parenteralen ern~hrung des neugeborenen die praktische organisation der klinisehen infusionstherapie mit zuckerund elektrolytlt)sungen. l ~ed untersuchungen und bcobachtungen fiber intravensse fettinfusionen in der inneren klinik. wiss. versff. d$. ges. ern~hrung l'alimentation parenttrale, mulsions lipidiques. (a suivre) ann intravcnsse ern~hrungstherapie mit fettemulsionen parenteral-und sondeneru~hrte patienten zur rekt~len kaliumsubstitufion parenteraie ern~hrtmg mit fettemulsionen konservierung mad zubereitung yon lebens-und futtermitteln nutritional hygiene, preservation and preparation of foodstuffs and feeds n. n.: vortragsveranstaltung ,fleischhygieno" der th seminar on the use of radioisotopes in nutrition science and of ionising radiation in food technology. strasbourg, st - th october progress of food irradia~on work and programmes in o.e.c.d. member countries ( berichte) safe heat processing of canned cured meats with regard to bacterial spores the role of food science i and technology on the freeze dehydration of foods public health aspects of handling animal products in the txopics fack)rs affecting bacfcrial spoilage of animal products at elevated temperatures. food technol sterilized concentrated millr. food teehnol. [ ] nr. , s. / , . n.n.: foods of the future. now opportunities for flavor modification unrestricted approval for irradiated bacon -a sanitary standards for multiple-use rubber and rubber-like materials used as product contact surfaces in dairy equipment -a sanitary standards for batch and continuous freezers for ice cream, ices, and similarly-frozen dairy foods bericht fiber die vortrag~tagung des fachverbandes lebensmittelchemio der chemischen gesellsch~ft in der ddr vom . bis lebensmittelchem. u. geriehtl. chem. fluoridation in great britain die enzymatische phycinspaltung in geschrotetem getreide in abh~ingigkeit yon der relativen lufffeuchtigkeit the effect of certain antioxidants during freezer storage of pork chops and sausage the mechanics of treating hatching eggs for disease prevention beitrag zur sfil~gerinnung yon kakaotrunk jodophore als desinfektionsmittel in der milchwirtschaft. milchwissenschaft [ ] nr. , s. ff. (? s.). zitat: dr. lebensmittel tierarzneimittel und anfzuchtmittcl in der landwirtschaftllehen praxis. gesundheitliche erwrgungen znm sehutze des konsumenten bei der anwendung yon tierarzneimitteln und aufzuchtmitteln in der landwirtschaftlichen praxis n~ihrwertminderung dureh zubereitung dcr nahrung suue modifieazioni della flora mierobiea dei mollusohi eduli par effetto di eonservazione impropria verlinderungen des inhaltes yon dosenkonserven w~hrend lgngerer lagerung digtbrote aus der sieht ihrer praktischen gestmtung. wiss. versff. dr. ges. ern~hrung l erniihrungshygienische untersuchungen in kindergarten an budapester kindern im alter yon bis jahren. z. ges. hyg. die eignung der bakteriologischen untersuehung yon kannenmilchproben als grundlage eines eutergesundheitsdienstes adequacy of cooking procedures for the destruction of salmonellae zur revitaminierung des mehles bzw. brotes. wiss. versff. dr. ges. er-n~hrung conventionele verwarningsmethoden beitrag zur kenntnis der wechselwirkungen zwischen proteinen und poly. phenolcn der kakaobohnen wtihrend der fermentation nihydrazone feed medication ag~ins~ ar~iiieiaily induced escheriehia eoli air-sac infection foam-mat dried orange juice. i. time-temperature drying studies lebensmittelhygicnische probleme bei der herstellung yon gemeinschaftsverflpegung. . mitt. : z. ges. hyg. lebensmittelhygienische probleme bci der herstellung yon gemeinschaftsverpflegung. . mitt.: z. ges. ttyg. untersuchungen fiber die temperaturvorg~nge im innern yon lebensmittein w~hrend ihrer thermischen zubereitung, erl~uter~ am kochen yon kartoffelklsl~en. z. ges. hyg. zur gewinnung yon niederverestertem pektin aus toehnisehen apfelpektinextrakten mit ammoniak: einflul] der entesterungsbedingungen auf das geliervermsgen hygienisohe beurteilung einer dutch clostridlum verursaehten massen. lebensmittelvergiftung [ungar neuartige teehnik der lebensmlttelverpaekung filr ge. schmaeks-aromastoffe und andere artikel bei ~berdruek studies with a natural source of xanthophylls for the pigmentation of egg yolks and skin of poultry die problematik der tuberkulosebeurteilung in der sehlachttier-und fleischuntersuchung freezing rate of beef as affected by moisture, fat, and wrapping materials ~ber mit komblnier~en konservierungsmitteln hergestellte konfitiiren und consumers' reactions to instant foods. food teclmol effect of supplementing lime-~reated corn with different levels of lysine, tryptophane, and isoleueine on the nitrogen retention of young children effect of freezing on autoxidation of oxymyoglobin solutions the control of gloecsporium album rot of stored apples by orchard sprays which reduce sporulation of wood infections bakteriologische befunde bei der spelseeisuntersuchung im sommer the microbiology of vacuum packed sliced bacon the stability of canned foods in long erm storage the effect of proofing and baling on concentrations of organic acids, carbonyl compounds, and alcohols in bread doughs prepared from pre-ferments nutrients in raw vs. cooked globe artichokes effect of gamma-radiation, chemical, and packaging treatments on refrigerated life of strawberries and sweet cherries. food teehnol beeinflussung der wirkung yon kaffeeinhaltsstoffen dutch be-s~immte behandiungsverfahren der l~hbohne. (eine tierexperimentell-toxikologische studie influence of surface pasteurization and ehlortetraeycline on bacterial incidence on fryers the hydrolysis of grass hemicelluloses during ensilage post-harvest storage studies with selected fruits the science of food technology in venezuela beitrag zur aufbewahrung von sti~rkesirup in verzinkten ge-f~ben inactivation-rate studies on a radiation.resistant spoilage microorganism. ii. : thermal inactivation rates in beef the oceurrence and growth of staphylococci on packed bacon, with special reference to staphylococcus aureus zur verhiitung yon lebensmittelinfektionen in grobklichenanlagen dutch desinfektionsmabnahmen. ~rztl the influence of selected bacteria upon the flavor of a precooked frozen poultry product flour maturing and bleaching with aeyclie acetone peroxides effect of processing conditions on dry-heat expansion of bulgar wheat zur vcrwendung von pentachlornitrobenzol bei der lagerung yon kohl. dr. lebensmittel-rdsch. [ ] nr. , s. los mati~res f cales des pores et les selles des ouvriers d'aba~toir constituent une source permanente de diss mination des salmonella studies on cooking fats and oils aspect sanitaire et l~gal aetuel des aliments conserv&s. rev. d'hyg over de betekenis van postduiven als besmettingsbron van levensmiddelen met salmonella-kiemen. ti]dschr. v. diergeneesk over bet voorkomen van salmonella-kiemen bij slagerijen. tijdschr. v. diergeneesk ursache und entstehung yon brotfehlern les salmonella des oeufs et ovoproduite frangais eg trangers the microstructure of baked products and doughs. food technol tomsto powder by foam-mat drying zitat: dt. lebensmittel-rdsch. [ ] nr. , s. . --die verwendung yon gefrosteter saline zur butterherstellung. teil iv. die ergebnisse der in d~nemark, frankreich und in den l~iederlanden durchgefiihrten praktischen versuche und ihre bedeutung ffir die in der deutschen molkereiwirtschaft erfolgendo verwendung yon gefrosteter sahne bei der butterherstellung tenderne~ of the turkey meat as influenced by pre-cooling before proce~ing and hand masv~ging vortragsmaterialien flit die ern~hrungsproplldeutik. (erlruterungen zu insgesamt groben sehautafeln.) . mitt. : behandlung der tafeln iv bis vi. ernahrungsforschung die unterschiedliehe problematik und ihre konsequcnzen bei der bekrmplung der rinder-und schwefnefinnen salmonellae from flies in a mexican slaughterhouse factors affecting quality of pies prepared from frozen bulkpack red sour pitted cherries zur bedeutung antimikrobiellcr stoffc in der nahrung modified equipment for pasteurizing and deodorizing market milk and for pasteurizing, deodorizing, and slightly concentrating cheese milk adhesion of coatings on frozen fried chicken oob~age eheeae problems in production and sanitation. publle health aspects ~ber den einflub yon licht, auerstoff und tempera~ur auf die hal~barkoib yon verpaektem emmentaler ks in scheiben aktuelle notwendigkeiten -gesetzliche m glichkeiten zur fischkfihlung in eis und seewasser techniques de recherche des salmonella dans les oeufs frais et de conserve food hygiene on board ship safety factors in water fluoridation based on toxicology of fluorides the effect of oiling before and after cleaning in maintaining the albumen condition of shell eggs lebensmi~t~l-aerosole. fette preservation of the natural color in processed sweetpotato products. i.: flakes. food technol temporary inhibition of fermentation in apple juice preservatives and artificial sweeteners the mechanism of the development of rancidity in frozen fresh pork sausage and practicable methods for its inhibition die entwicldung der trinkwasseriinoridierung in den usa microbiological principles in prcpaeking meats st~ndard-kapazit~tstest ffir die bestimmung der desinfektionswirkung yon desinfektionsmitteln in der milchwirtsehaft. internationaler standard fil/ii)f - die anwendung einiger arteu, bzw. st~mme, yon propions~urebakterien zur herstellung bestimmter k~scsorten mit hohem vitamin b -gehalt the extraction of pectins from apple marc preparations zur hygiene und ,di~tetik des rauchens studies on control of respiration of mcintosh apples by packaging methods. food teehnol effects of ingredients used in condermed and frozen dairy products on thermal resistance of potentially pathogenic staphylococci der frisehkllse und seine verpackung studies on the viscosity of mayonnaise. ii.: the influence of addition of vinegar on the vi~co~isy of mayonnaise e~'ec~ of chemical v~lditives on the spreading quality of butter. ii. laboratory and plant churnings studies on browning mechanisms of fruit juice products. i.: changes in chemical composition which accompany browning of commercial concentrated lemon juice during storage ergebnisse der dlg-qualit~tspriifung fiir speiseeis. dr. molkerei-ztg beeinttnssung versehiedenartig verpaekter lebensmittel dureh desinfektion mit formaldehyd grunds~itzliches zur stabilisierung und solubilisierung yon carotin und carotinoidpr~paraten riickst~nde yon pflanzensehutzmitteln, insektiziden und dergleichen in der nahrung und ihre bedeutung fiir die gesundheit absehliebende stellungnahme lebensmittel-rdsch langfristige nutritive anwendung yon antibiotilm in der tierern~hrung im hinbliek auf die menschliehe gesundheit mit besonderer beriieksiehtigung yon chiortetrazyklin nutritional studies on the utilization of distiller's stillage. part l: insolubles of me]lasses-butanol distiller's stillage auswertung der dlg-priifung fiir frischk~ise in verbraueherpaekungen zu den fermentativen eigensehaften der milchs~urebakterien (,laetobacillus meijerinek"), zugleieh ein beitrag zur vermeidung yon fehlfabrikaten bei roh-nnd briihwurst. arch. lebensmittel-hyg microbiological aspects of one-trip glass bottles as used by the carbonated beverage industry de bereiding van bouillon aktnelle milchhygienische aufgaben und zicle des organisation der ~berwachung der umweltradioaktivit~t unter besonderer beriieksichtigung der l~berwaehung des gehaltes yon lebensmitteln an radioaktiven stoffen. dr. lebensmittel bakteriologie der auermilcherzeugnisse l~ber die italtbarkeit yon lebensmittelkonserven preparation of aeid-modifid flour for tub sizing radiostrontium removal from milk. determination of apparent equilibrium constants of the exchange reactions of sodium, potassium, calcium, and magnesium wish amberlite ir- probleme der vitaminierung yon brot the effect of several operational variables on the rate of freeze-drying of beef studies on beef quality. x. effect of temperature, freezing, frozen s~orage, thawing, and p~ on the rate of hypoxanthine production. div. food preservation techn retardation of gelation in high temperature-short-time sterile milk concentrates with polyphosphates nonenzymatic bread browning and flavor. changes in amino acids and formation of earbonyl compounds during baking ttinweise fiir konservierende wirkungen synthetiseher senfslbildner naeh versuehen an fisehen neuo wege zur herstellung haltbarer fisch-pr~iserven behavior of ethylene dibromide, methyl bromide, and their mixtures. i. : in columns of grains and milled materials der einflui~ des wksserns auf die kartoffel irradiation of fruits and vegetables in india effect of storage in nitrogen on the soluble sugar and dry matter contents of ryegrass drying of seaweeds and other plants. v. throughcirculation drying of asophyllum nodosnm in a semi-continuous dryer niacin, thiamin, and riboflavin in fresh and cooked pale, soft, watery versus dark, firm, dry pork muscle nouvelles observations concernant la survie des salmonellae clans les fromages pyroearhonie acid diethyl ester as a potential food preservative the effect of phosphates on moisture absorption, retention, and cooking losses of broiler carcasses gur hygienisehen beur~ilung der trinkwasserverh~ltnisse des oberon vogtlandes -eine hydrobiologische s~udie. z. ges. hyg. studies on preserving quality in market eggs rapid detection of faecal coliform bacteria in the food processing plant. j. milk food technol the relationship between the loss of water and carbon dioxide from eggs and the effect upon albumen quality plastic pacckaging of eggs. study on improvement of digestibility of milk protein. i.: the effect of heating, adjustment of activity of calcium ion, addition of whey protein, homogenization, and elimination of coarse casein micelle from milk by ultracentrifuge on the digestibility of milk especially on the coagulability of it part iii.; the digebtibility of slightly hydrolized milk with proteinase and the preparation of rnill~ which has same eoagulability as human milk. g. agrie, chem. see study on improvement of digestibility of milk protein. part iv. : the nature of coagulation of casein of milk preparation which has same coagulability as human milk the influence of added microorganism on the quality of margarine. i. : the influence of mold inoculation die ilberung yon tafelw/~ssern. dr. lebermmittel-rdsch technological aspects of the radiation pasteurization of foods rapid hydration of dried fruits. food technol untersuchungen fiber polygalakturonase-enzyme aus sehimmelpilzen. . mitt.: eigenschaften der polygalakturonasen aus schimmclpilzen role of individual phospholipids as antioxidants association of veterinary food hygienists symposium on the marketing, transport, and slaughter of calves. iil: scientific aspects. ve~cr ricerche sulla resistenza della brucella abor~us helle salsicce. riv pr senee des salmonelles dans les viandes. donn es frangaises et tran-gbres biochemisehe vorg~nge inl fleisch bei der lagerung einflul des r stgrades yon kaffee auf die extinktion w~briger extrakte und die menge der trockensubstanz die herstellung yon quark und weillk~e unter ansnfitzung ss eiweibstoffe der milch vcrluste yon vitamin b~ und c beim kochen und turmkoehen yon gemfise effect of chilled storage on the frozen storage life of whiting salmonellenfunde in einer importsendung amerikanischer tiefgefrierhiihner. arch. lebensmittel-hyg iron sulfide blackening in canned protein foods: oxidation and reduction mechanisms in relation to sulfur and iron raft research on food preservation by irradiation in poland no~: gas chromatography of chicken and turkey volatiles: the effect of temperature, oxygen, and type of tissue on composition of the volatile fraction l'inaetivation dens l'eau de meret l'eau d'alimentation de eertains entdrovirus de voedingswaarde van aardappelen van versehiuende re, sen en de invloed daarop van bemesting en bewaring effecb of l-arab]nose and d-xylose on dough fermentation and crust browning gelation of egg yolk corn carotenoids: effects of temperature and moisture on losses during storage salmonellenfunde in einer importsendung amerikanischer tiefgefriorhiihner. arch. lebensmittel-ttyg bacteriological examination of unbottled soft drink ~berbliek fiber kunststoff-folien und -kombinatlonen ale verpackungsmaterial in der mflchwirtschaft pigmentierung des eidottem bei gettiigel. wiss. versff. dr. ges. eruiilu'ung s~ beltrag zur bedeutung wasserlsslieher hochmolekularer kohlenhydrate f'tir die verkleisterung der st~irke einfiul ehemischer verbindungen auf die antimikrobielle konservierungs-~toffwirkung. . mitt.: einflub verschiedener stoffgruppen auf die konservierungswirkung gegen aspergillus niger a quantitative s~udy of changes in dried skim-milk and lactose cnscin in the 'dry' state during storage the role of the major sugars of potatoes in ~he browning roa tion during chipping probleme der zuverl~sigkeit yon kunststoffen zur lebensmittelverpackung in europi~ischer sicht bakteriologiseh-hygienisehe beur~ilung yon speiseeis weizenkeime ale wertvoller rohstoff-einige ~ragestellungen und probleme probleme der frischhaltung und haltbarmachung yon brot end backwaren the effect of selected polymers upon the albumen quality of eggs after storage for short periods preparation and quality evaluation of processed fruits and fruit products with sucrose and synthetic sweeteners the microflora within the tissue of fruits and vegetables changes in carbohydrate and phosphorus content of potato tubers during storage in nitrogen preparation of "natural" cow-milk fat globules; preliminary investigation of materials adsorbed at their surfaces lethal doses of gamma radiation of some fruit spoilage microorganisms alteration of post-mortem changes in porcine muscle by preslaughter heat treatment and diet modification ober die msglichkeiten end grenzen eines effects of polyphosphates on water uptake, moisture retention, and cooking loss in broilers flavors imparted to dairy products by phenol deriva aromatisehe crackproduktc yon sterinen. (i). z. ern~hrungs-wiss dose requirements for the radiation sterilization of food berichb fiber eine arbeitstagung bei der internationalen atomenergie-beh rde in wien vom zur bok~mpfung d r rinderfinne zum einfiu]~ handelsfiblicher, in lebensmittelbetrieben gebr~uch-]icher desinfektionsmittel auf lactobakterien; zugleich ein beitrag zur desinfektion in der marinadenindustrie einflu~ chemischer umsetzungen bei trockenen lebensmittelgemischen in hinsich~ auf die lagerfestigkeit. vi. mitt. : lebensmittelgemische mit troekenmagermileh als hauptkomponente. z. lebensmittel-untersuchung u die technologie yon sauren milcherzeugnissen, insbesondere der sauermilcharten und sauerrahmarten effects of several edible coatings on poultry meat quality how to control insects in stored foods. part die antibiotika und die ans ihrer anwendung fiir die ~iilehwirtsehaft sich ergebenden probleme zur haltbarkeitsverli~ngerung empfindlicher l~ahrungs-und genuflmittel dureh abpaeken unter vakuum. fette, seifcn the diffusion of hydrogen through tinplate containers packed with grapefruit juice effect of ice cream stabilizem on the freezing characteristics of various aqueous systems ist die infektion mit trichinen aus amtlich untersuchtem schweinefieisch im liehte der mathematisehen analyse der bestimmungen der fleischbcschau yon schweinefleisch msglieh? hygiene in milk production, processing, and distribution effect of pre-eooling eggs and cartons upon quality after storage a biological after-effect in radiation-processed chicken muscle accounting for farm tank milk factors related to the flavor stability during storage of foam-dried whole milk. iii. effect of antioxidants untersuchungen zur hygienischen beurteihing yon ~ietallverunreinlgungen in lvben~mitteln the effect of bleed time prior to scald and refrigerated storage upon bacterial counts in the axillary diverticula of the interclavicular air sac of chickens techniques de recherche des salmonella darts les viandes the serotypes of salmonella isolated from foods carotinverluste beider zubereitung der nahrung. wiss. vcrsff. dr. gcs. er-n~hrung stability of ascorbie acid in a liquid multivitamin emulsion containing sodium fluoride the effect of storage time and holding temperature on egg interior quality in uganda uber den einflul versehiedener fangverfahren auf die qualit~t und lagerreserve der fische zerst~ubungstrocknung yon tomatenkonzentraten. dr. lebensmittel radiation pasteurization of fresh fruits and vegetables a bacteriological survey of certain processed meat~. part l population studies at packet and retail levels association of veterinary food hygienists symposium on the marketing, transport, and slaughter of calves. l : marketing and slaughter die kombinierte verarbeitung yon kartoffeln auf st~rke und alkohol (fortschrittsbericht) usaec program in radiation research preservation of certain fish and fruits biochemical and quality changes in chicken meat during storage at above-freezing temperatures inleidend onderzoek naar strnctuurveranderingen die ontstaan bi] verhitten van plantaardige produkten veranderingen van hot vetgehalte bij de bereiding van vlces a study on the relationship between the factors influencing the time of cheese salting maple sirup. xxi. : the effec~ of temperature and formaldehyde on the growth of pseudomonas geniculata in maple sap vacuum-tempering corn for dry. milling organoleptische eigenschappen, thiamine-en ascorbinezuurgehalte van enige week-en diepvriesgroenten growth of psyehrophiles. il : growth of poultry meat spoilage bacteria and some effects of chlortctracycline tests of corn stored four years in a commercial bin association of veterinary food hygienists symposium on the marketing, transport and slaughter of calves. ii. : the slaughter and inspection of calves. veter the effect of processing conditions upon the nutritional quality of vegetable oils berieht fiber den wisscnsehaftlichen kongrel der dcutschen gesellschaft ftir ern~hrung ( vortragsrcferate) an objective measurement of the freshness of ready-to-cook broilers the fieldman's responsibilities in milk quality and procurement studies on the bacteria found in the wine during its making. i.: multiplication of bacteria in wine and male-lactic fermentation hydrophilic colloids as additives in white layer cakes the acceptability of cooked poultry protected by an edible acetylated monoglyeeride coating during fresh and frozen storage istes zu vertreten, dal das fleiseh sehwachfinniger rinder auch in gebriitetem zust~nd eingcfrorcn wird? arch. lebensmittel-hyg =[efenlnfit.ierte kondensmilch als ursache yon fehlfabrikation bei schokolade. arch. lebensmittel.hyg. [ ] nr. , s. . m, ober die bedeutung aerober sporenbildner als bombageerreger yon wfimtehenkonscreen. arch. lebensmittel-hyg aluminiumfolio zur verpaekung tiefgekiihlter und gefriergetrockneter lebcnsmittel. fette fiber die milcldtuorierung. bull. schwciz. akad. reed. wiss vitamin stability in diets sterilized for germfree animal~ die trinkwasserversorgung ernliln-ungsstatistlk nutritional statistics african nutrition problems der vcrbrauch yon alkoholisehen getr~,nken in sterreich childhood nutrition in lapland overweight children in stockholm iron deficiency in the finnish population vitamin b deficiency in indian infants the indices of nutritional change in great britain dietary values from a h recall compared to a -day survey on elderly people her vaststellen van de voedingstoestand van sen bevolldng in de tropen en subtropen dental effects of fluoridation of water with particular reference to a study in the united kingdom de voedlng van woonwagenbewoncrs nutritional attitudes of some london housewives de vocdingsgewoonten van bcjaarden in amsterdam community studies of drinking behavior fats and carbohydrates as factors on atherosclerosis and diabetes in yemenite jews nutritional beliefs among a low-income urban population algemene gezondimidsaspccten van de vocding in de ontwikkelingsgcbieden a comparative study of the nutritional adequacy of the morning intake of women clerical workers and women factory workers serum cholesterol in a military population. its relation to obesity and the military diet nutrient intakes of healthy older women analysis of the structures of food consumption by groups in japan thiamin (vitamin b,)-untercru~hrung in deutschland? lvied vortragsmaterialien ftir die ern~hrungsprops (erli~uterungen zu insgesamt groben sehautafeln.) . mitt.: behandlung der tafein iv bis vi. erns [ ] nr. , s. zur ern~ihrungssituation in arbeitexfamilien aus verschiedcnen bezirken der ddr. . mitt. : ern~hrungssoziologischc answertung der lebensmittelverzehrungen in itaushaltungen mit erwachscnen und kindern~ stand studies in infantile malnutrition. i. : nature of the problem in peru zur ~ethodik yon ern~ihrungserhebungen bei der gemeinschaftsverpflegung weight changes in relation to birthweight of papuan, indonesian, and chinese children during the first two weeks of life numbers of tasters required to determine consumer preferences for fruit drinks onderzoek naar de menupatronen in de noordoostpolder smoking habits of medical and non-medical university staff i**cs and potassium in people and diet. -a study of finnish lapps. ann. acad. scientiarum fennicae a ~berbliek fiber die radioaktivit~t der in sterreich im jahre konsumierten lebensmittel diet and plasma cholesterol in bank men der mengenmi~bige getr~inkeverbrauch je einwohner im bundesgebiet predictors of human food consumption use of goverument.donated foods in a rural community bericht fiber die durum-und teigwarentagung der arbcitsgemeinschaft der oetreldeforschung e. v. vorn . bis . miirz voeding van leerlingen van con lagere teehni~eho school. ii. calorie~in-en nutriiintenwaarde nutritional deficiencies in developing countries dietary survey in surinam vitamine a-tekor~en op de aarde schwierigkeiten beim erreichen einer vollwertigcn ern~ihrung in ausgew~hl-ten vcrbrauchergruppen die entwicklung des brot-und getreideverzehrs in der neueren zeit. wiss. ver ff. dr. ges. ern~hrung n~rwert und zusammensetzung yon lebens. und futtermltteln nutritive values, composition of food~tuffe and f physical chemistry of ice cream vitamin e in human nutrition appraisal of nutritional adequacy of infant formulas used as cow milk substitutes enkele gegevens betreffende de calorisehe waarde van klsine hepjes, gerechtcn en maaltijden the public health aspects of the use of antibiotics in food and foodstuffs. report of an expert committee niihrwertminderung dutch zubereitung der nahrung metals and other elements in foods die farbe der nahrungsmittel in anthropologischer sicht the enzymatic destruction of carotene and carotenoids foodstuff flavors. some factors affecting the flavor of sodium caseinate chemical and radiochemical composition of the rongelapese diet the organic constituents of food. i. : lettuce the influence of dehydration of foods on the digestibility and the biological value of the protein a stfidy of two methods of assessing vitamin b nutriture mincraisalze mad spurenelemente in der nahrung distribution of the bound form of nicotinic ac|d in natural material~ the distribution of c~rotenoids in nature and their biological significance zur bedeutung antimikrobieller stoffe in der mahrung die konsistenz yon margarine mad fetten an improved nutrient solution for diploid chinese hamster and human cell lines extraneous materials in foods and drugs the composition of food flavors studies on pantothenic acid intake. i. pantothenic acid content in japanese foods haben wir mangel an essentiellen fettsiiurcn? phonolcarbons~uren in menschlichen nahrungsprodukten. zum vorkommen yon phenolcarbons~uren in menschlichen nahrungsprodukten und ihr einttud auf den intermedi~ren sboffwechscl drugs in feeds relationship between the sulphur/nitrogen ratio and the protein value of diets gums in foods zur definition der begriffe ,aroma" und begriff und aufgabe dii~tetischer lebensmittel valettr vitaminique des carot nes pour rhomme. wiss. versff. dr. ges. ern~hrung internationales rundgespriich fiber lebensmittelchemische probleme in wiesbaden und eltville a. rh. ( vortragsreferate) consumer awareness of texture and other food attributes organisehe und organisierte substanz in der lebensmittelchemie frost resistivity of fruit plants californ'ia association of chemistry teachers : inorganic nutrients in the sea fluoride in food antibiotics in feeds and other products planktorm as foods relation between color of cranberries and color and stability of sauce berieht fiber die vortragstagung des fachverbandes lebensmittelchemie der chemischen gesellsehaf$ in der ddr vom . bis bulletin on tobacco evaluation of algae as a food for human diet~ digestibility of high-amylose corn starch. nutrition red. [ ] nr. , s. / . ~. n. : comparative evaluation of corn mesa and steam-processed whole corn flours the major anthocyanin pigments of vitis vinifera varieties flame tokay, emperor, and red ~alaga peonidin- -monoglucoside in vinifera grapes formation and distribution of amylosc and amylopectin in the starch granule nutrient in seeds. amino composition of some seeds sugar levels in fruits of the lowbush blueberry estimated at four physiological ages the relation of pectic substances to firmness of processed sweet potatoes (ipomoea berates) processed vegetable produc~s the protein composition of different flours and its relationship to nitrogen content and baking performance relationship between 'antitryptic factors' of some plant protein feeds and products of proteolysis precipitable by trichloroacetic acid. g. sci. food agric a thermostable haemolytic factor in soybeans foam-mat dried orange juice. i. time-temperature drying studies bound" growth inhibitor in raw soybean meal leaf analysis as a guide to the nutrition of fruit crops. ii. : distribution of total n, p, k, ca and mg in the laminae and petioles of raspberry (rubns idaeus l.) as influenced by soil treatments nutritive value of pumpkin seed. essential amino acid content and protein value of pumpkin seed (cueur bi~a farinoaa) effect of cooking and of amino acid supplementation on the nutritive value of black beans (phaseolns vulgaris l.) supplementation of cereal proteins with amino acids. iv.: lysine supplementation of wheat flour fed to young children at different levels of protein intake in the presence and absence of other amino acids uber den chemischen total ascorbic acid in potatoes. raw, fresh, mashed, and reeonstituted flakes moisture contents of hard red winter wheat as determined by meters and by oven drying, and influence of small differences in moisture content upon subsequent deterioration of the grain in storage rheological studies with canned tomato juice the chemical composition of maple sugar sand soluble carbohydrate content of varieties of te~raploid ryegrass natiirlicher gehalt und stabilit~t yon carotine und carotinoiden in citrnss~ften ~ber wein und weinuntersuchungen fatty acids and other lipids in mayonnaise cyclic fatty acid yields from linseed oil factors ~ffecting enzymatic solubilization of beef proteins weibulls original ring, en ny medeltidig varvetesort. (a new medium early variety of spring wheat, weibull's ring.) agri hortique genetiea ober die askorbins~uresynthese in zerschnittene kartoffeln die bestimmung der amylaseaktivit~t und einige studicn fiber amylaseaktivit~t in gekeimtem roggen effect of processing conditions on dry-heat expansion of ~ulgar wheat oils, fats, and waxes ~.~ber das der johannisbro~.kerne. fetto, seifen fruit and fruit products uber die variabili~it einiger eigcnschaften der kartoffelstilrke in abhiingigkcit yon witterung a short.term effect of weather on malie acid in pineapple fruit the specific surface of flour and starch granules in a hard winter wheat flour and in its five subsieve-size fractions oranges and lemons factors affecting quality of pies prepared from frozen bulk-pack red sour pitted cherries studies on the consish~ncy of thiamin and protein contents of pure.bred strains of rice a comparison of the nutritional value of protein from several soybean frac ions zum vitamin-und aminos~uregehal~ yon maisquellwasser dark discoloration of canned all-green asparagus. i. chemistry and related factors enzymatic enhancement of flavor peetinestcrase in normal and abnormal tomato fruit storage effects on winter squashes. varietal differences and storage changes in the ascorbie acid content of six varieties of winter squashes grape pigments. concord grape pigments corn meal as a source of ribonuclease banana odor components. volatile components of bananas. part i. isolation of an odor concentrate. part il separation and identification lipids of algae. ill. : the components of unsaponifable matter of the algae chlore]la volatile esters of bartlett pear. ii studies on the nutritive value of raw and cooked soybeans for growing rats and swine and their effect on fat firmness characterization of fruit juices by acid profiles nitrate content of beets, collards, turnip greens lysine fortifications of wheat bread fed to haitian school children studies on the flavor of green tea. par~ iv. : dimethyl sulfide and its preeursor funktionelle eigenschaften yon lehens-mittclst~rken ~)ber das vorkommen yon xylit im speisepi]z champignon ein beitrag zur znsammensetzung yon apfeisinens~ften aus spanischcn und l~iarokko-apfelsinen an examination of the free amino acids of the common onion (allium cepa) a trypsin inhibitor in wheat flour the protein quality, digestibility, and composition of algae, chlorella chemical and color changes in canned apple sauce digestibility of the a-cellulose and pentosan components of the cellulosic mieelle of rescue and alfalfa safeness and serva. bility of meringued pie alcoholic beverages diurnal-nocturnal changes in the starch of tobacco leaves sugar and sugar products modification of flour proteins by dough mixing: effects of suffhydryl-blocking and oxidizing agents ~ber das haferprotein. z. lebensmittel-untersuchung u sodium and potassium in wines and distilled spirits non-volatile organic acids of the dwarf cavendish (chinese) variety of banana~ biological evaluation of soybean meal and cottonseed meal by amino acid digestibility and protein efficiency ratio studies. oiss. abstr. [ ] l~r oxidation-reduction potentials of sak~f and synthetic sak . xi.: on the relationship of the various fermenting processes of sak~ to oxidereduction potentials and indicator time test (i.t.t.) values of sak mash neue wege zur ern~hrungsphysiologischen aufwcrtung von getreideerzeugnissen cereal products ascorbie acid in dehydrated po~es die eiweibqualit~t yon ,getoastetem" (dampferhitztem) und ungetoastetem sojaextraktionssehrot color studies on processed dried fruits studies on the basic amino acid of the soy sauces and the seasoning liquids. ii.: the quantitative changes of l-arginine in the process of soy sauces brewing studies on the flavorons substances in soy sauce. xxii. : the differences between the soy sauce made from soy bean and wheat and that made from defatted soy bean and wheat feeding value of soy milks for premature infants flavors and non-alcoholic beverages fat content and fatty acids in some commercial mixes for baked products sensory examination of four organic acids added f~ wine ber die inhaltsstoffe der robktmtanie und versuehe zu ihrcr gehaltsbestimmung. diss. univ. hamburg, . malt beverages, sirups, extracts, and brewing materials vitamin b and niacin in potatoes. retention after storage and cooking chemical investigation of some wild indian legumes zusatzstoffe der margarine. forte componen~ glyeerides of an indian fresh-water fish fat einflub des rsstgrades yon kaffce auf die extinktion w~$riger extrakte und die v[enge der trockensubstanz gaschromatographische untersuehungen yon fusclslen aus vcrsehiedchen g~irprodukten. . mitt.: problemsteilung und literaturfibemieht de voedingswaarde van aardappelen van versehillende rassen en de invloed daarop van bcmesting en bewaring eleetrophoretic separation of beet pigments studies on the growth-promoting value and digestibility of passion fruit seed oil polycyclisehe und aliphatisehe kohlenwasserstoffe dc~ tabakrauehes carotenoid, oil, and tocopherol content of corn inbreds location and possible role of esterified phosphorus in starch fractions untersuchungen fiber die chemischen u beim a]tern yon r stkaffee. . mitt.: gaschromatographische analyse der leichtflfichtigen aromabestandteile nature, origin, and prevention of hydrogen sulphide aroma in wines the magnesium contents of soil and crops versuehe zur ehemischen differenzierung der eiweibst~ffe des weizens und roggens lemon juice composition. iii.: characterization of california-arizona lemon juice by use of a multiple regression analysis weizenkeime als wertvoller rohstoff -einige fragestellungen und probleme isolation of gram quantities of a rhamnoglucoside of apigenin from grapefruit determination of distribution of water in wheat grains by interference microscopy preparation and quality evaluation of processed fruits and fruit products with sucrose and synthetic sweeteners changes in carbohydrate and phosphorus content of potato tubers during storage in nitrogen chemical composition of some natural and processed orange juices effects of various factors in the candy test zum stand der kenntnlsse fiber die v{eclmelwir-kung zwischen nativer sts und wasser some volatile compounds from cooked potatoes firmness of canned apple slices as affected by maturity and steam-blanch temperature nutritive values of ten samples of western canadian grains proteins of wheat and flour. the separation and purification of the pyrophosphate-soluble proteins of wheat flour by chromatography on dear-cellulose changes in quality and composition produced in wine by s~ gamma irradiation citrus essential oils. iii. evaluation of silician natural lemon oils mineral analysis of plant tissues a. : nature of colloids in clarified cane juices studies on amino acid content of rice. i. : amino acid composition of polished rice glutelln estimated by beckman amino acid analyzer yliichtige carbonylverbindungen in honig neue aminos~iuren in h heren pflanzen studium der wirkung der gammastrahlung auf die l-ascorbins~iure flour liplds and oxidation of sulfhydryl groups in dough zur kenntnls eincr weiteren in der sti~rke vorkommenden kohlenhydrat-komponente. ern~ihrungsforschung lemon juice composition. ii. : characterization of california.arizona lemon juice by its polyphenolic content lemon juice composition. i.: characterization of california-arizona lemon juice by its total amino acid and -malic acid content safflower amino acids amino acid composition of safflower kernels, kernel protein, and hulls, and solubility of kernel nitrogen citrus fruit enzymes. i. : ascorbie acid oxidase in oranges untersuehungen fiber die verf~rbung gekochter kartoffeln an den sorten des kulturlrartoffelsortiments ides instituts ffir pflanzenziiehtung groi~-liisewitz nutritive value of red kidney beans (phaseolns vulgaris) for chicks instability in potable spirits. ii.: rum and brandy effect of size classification and maturity on the protein content of alaska and perfection peas ma~tr~, d. c.: ascorbie acid retention and color of strawberries as related to low-level irradiation and storage time historical aoac data on four fema samples of vanilla extract the acid-extracted pentosan content of wheat as a measure of milling quality of pacific northwest wheats petroleum ether extraetables in tobacco isolation, origin, and synthesis of a bread flavor constituent the protein composition of airclassifled flour fractions * studies on the flavor of green tea. v. : examination of the essential oil of the tea-leaves by gas lipid chromatography nutritive value of starches. iv.; comparison of digestibility of natural starches estimated by a new procedure o c lebensmittel tierischen ursprungs _foodstu~$ o/animal origin n.n.: gdch-faehgruppe ,lebensmib~el. und gerichtliche chemie bericht fiber die vortragstagung des fachverbandes lebensmittelchemie der chemischen gesellsehaft in der ddr yore . bis a taxometric study of the propionic acid bacteria of dairy origin comparisons of the caseins of buffalo's and cow's milk matiirlicher gehalt und stabilit~t der carotinoide in fetten und milchprodukten. wiss. versff. dr. ges. ernhhrung antibiotics in milk vitamin a and d enrichment of nonfat dry milk some characteristics of yolk solids affecting their performance in cake doughnuts. i. effects of yolk type, level, and contamination with white proteine des eidotters post-mortem changes in the muscles oflandrace pigs t~ber den fettgehalt yon flcischkonserven. dr. lebensmittel der einflul~ der verfahrenstechnik auf die ks jahreszeitliche einfliisse alff den gehalt der fischmuskulatur an freien ami-nos~iuren u. deren bedeutung fiir die qualitiit fangtechnik und fisch-qualit~t. fette spcei~c distribution of fatty acids in marine lipids a comparison of pigs slaughtered at three diffcrcn~ weights. i. : carcass quality and performance a comparison of pigs slaughtered at three different~ weights. ii. : association between dissection results, various measuremen~ and visual assessments a note on the effect of heat on the colour of goat's milk chemical and nutritional changes in stored herring meal. .: nutritional significance of oxidation of the oil relation of pork muscle quality factors to zinc con~ent and other properties the chemical nature of the characteristic flavor of cultured buttermilk occurence of vaniuin in hea~ed milks the electrophoretie properties of the proteins in cottage cheese curd the influence of post-mortem glycolysis on poultry tenderness seasonal variations in cod liver oil isolation and characterization of the flavor components of rancid pork some problems in the evaluation of egg albumen quality quality evaluation studies of fish and shellfish from certain northern european waters indices for lamb carcass composition subjective and objective evaluations of prefabricated cuts of beef ils, fats, and waxes die bildung yon eiskristallen in diinnen milchschichten. milchwissensehaft the incidence of bacteria in cheese milk and cheddar cheese and their association with flavour body composition of market weight pigs some factors affecting tenderness of turkey meat ovine bioenergetics and nutritional efficiency, with special reference go forage utilization die ziichtung yon fleisehsehweinen und die folgeerseheinungen, die sich insbesondere im hinbliek auf die qualit~t yon fleisch und fett ergeben. arch. lebensmitgel-ityg n the structure of highly unsaturated fatty acids of fish oils by high resolution nuclear magnetic resonance spectral analysis the fatty acid composition of the milk fat of cows grazing on ryegrass at two stages of maturity and the composition of the ryegrass hpids effect of intrauterine infusion of penicillin-streptomycin and furacin and vaginal deposition of furacin on chemical residues tevei~ in millr beziehungen zwisehen l-aseorbinsaure und milch comparison of chemical and organoleptic data obtained on thawed and unthawed frozen cod, haddock, and perch fillets die ~ m~uosfiurenzusammensetzung der ziegenmflch und des ziegenmileh-caseins food flavors and odors. meat flavor: lamb dairy products de ehemische samenstelling van visen visprodukten chemical studies on the herring (clupen harengus). vii.: collagen and cohesiveness in heat-processed herring and observations on a seasonal variation in collagen content growth and pro~eolycie activity of pseudomonas fluoresccns in eggs and egg products the fatty acid composition of some perirenal and subcutaneous beef depot fats inactivation of peroxidase in milk by homogenization a study of the "cured meat" color producing reaction and the effects of some curing adjuncts t~yoer den fettgchalt yon br'tihwiirsten stress effects, eareass composition, and carcass quality in lambs effect of chemical additives on the spreading quality of butter. ii. laboratory and plant ehurnings preliminary studies on protein and moisture relationship in fresh and proeessed hams die zusammensetzung des kuhmilchfettes in abh~ngig-keit yon der ffitterung. fette chemical characterization of off-flavors in concentrated and nonfat dry milk zur ausseheidung yon xanthindehydrase mid molybdiin in der kuhmilch copper distribution in milk during early lactation die physikaliseh-ehemischen ursaehen der hitzestabilitet yon mileheiweil]stoffen. milehwissenscha sobn~a: ~oer einige verenderungen in den caseinfraktionen normaler und anomaler milch (colostralmilch und milch an lviastitis erkrankter kiihe). milchwissensehaft some characteristics of yolk solids affecting their performance in cake doughnuts. ii. variability in commercial yolk solids zum problem des znsammenhanges zwisehen der konsistenz und der physikalischen struktur der butter. fette relationships between milk fat acidity, short-chain fatty acids, and rancid flavors in milk induced and natural inhibitory behavior of milk and significance to antibiotics disc assay testing. j. dairy sei. [ ] nr. , s. ff. ( s.). --some distribution patterns of cottage cheese particles and conditions contributing to curd shattering natural inhibitory eharacteristies of some irish manufacturing milks thermophylie aetinomycetes in milk and dairy products. mikrobiologiya the free fatty acids of purdue swiss-type eheese die zusammensetzung yon siilzen lind ihre beurteilung im regierungsbezirk diisseldorf not~ on tyrosine production in frozen stored liver studies on the muscles of meat animals. hi. : comparative composition of various muscles in pigs of the three weight groups studies on beef quality. x. effect of temperature, freezing, frozen storage, thawing, and pe on the rate of hypoxanthine production. die. food preservation t~chn increased iodine in milk as a countermeasure for ~liodine time-temperature studies of baked, loaves. meat, fish, and poultry vergleichende untersuehungon an butter und einem butter~hnli-chen umgeesterten fort. forte, seifen, anstrichmittel zur beziehung zwisehen fettgehalt und wassergehalt bei ungewa~ohener s/il~rahmbutter (fritzbutter) variation of ovine fat composition within the carcass studies on the properties of new zealand butterfat. vii. effect of the stage of maturity of ryegrass fed to cows on the characteristics of butterfat and its carotene and vitamin a contents studies on turkey body composition. . poultry sci thermal conductivity of beef studies on turkey body composition. free fatty acid, tyrosine, and ~ changes during ripening of blue cheese made from variously treated milks l : factors influencing the nutritional value of fish flour. il: availability of lysine and sulphur amino acids. canad characterization of flavor compounds isolated from evaporated milk effect of egg yolk size on yolk cholesterol concentration ~tudo immuno~lectrophor~tiquo du lair dans los divers types de mammites. i. milchwissenschaft rdsultats hemmstoffe in der anlieferungsmflch und methoden zu ihrem nachweia eiweibverenderungen gefriergetroekneter muskulatur meat and meat products effect of high doses of vitamin a palmitate on vitamin a aldehyde, esters, and alcohol and carotenoid contents of hen's eggs. brig. j. nutrition [ ] nr. , s. / . --the amounts of vitamin a aldehyde, esters, and alcohol and of earotenoids in hen's eggs and in day-old chicks nutritive value of marine oils i. : ]~lenhaden oil at varying oxidation levels, with and without antioxidants in rat diets a quantitative study of changes in dried skim-milk and lactose-casein in the 'dry' state during storage physico chemical characteristics of canadian milk fat. unsaturated fatty acids collagen content and its relation t~ tenderness of connective tissue in two beef muscles ~ber eine braune verf~rbung yon mariniertem hering. dr. lebensmitbel versuche fiber den sonnenlichtgesehmaek in mit ass markierter milch effect of unequal milking intervals on lactation milk, milk fat, and total solids production of cows wei~re untersuehungen fiber die nitratreduktase verschiedener an der reifung yon rohwurst beteiligter mlkroorganismen nutritive value of leg of lamb roasts. l~oisture, energy, protein, fat, and iodine values ~ber einige gul~sigkai~s~renzen bei konsistenzfehiern yon butter influence of linoleic acid content of milk lipids on oxidation of milk and milk fat fish hydrolysates-iii.: influence of degree of hydrolysis on nutritive value einige beobaehtungen fiber die bildung des durch lieht verursaehten oxydationsgesehmackes s.). zitat: dt. lebensmittel comminuted meat emulsions: factors affecting meat proteins as emulsion stabilizers factors related to the flavor stability during storage of foam-dried whole milk. iil effect of antioxidants upgrading the indigenous poultry of uganda. hi. : shell and egg interior quality relation between carcass composition and live weight of sheep diethylstilbestrol occurence in eggs of subcutaneously injected hens biochemical and quality changes in chicken meat during storage a~ above-freezing temperatures investigations on the allerged goitrogenie properties of milk fish and other marine products bioehemieal properties of pork muscle in relation to curing potassium content of dried milk vergleiehende histometrische untersuehungen fiber den kollagengehalt yon brfihwfirsten a comparison of the volatile compounds of fresh and decomposed cream by gas chromatography studies on the volatile carbonyl compounds in ladino clover and their influence on the flavor of milk aut~xidation of fish oils. ii.: changes in the carbonyl distribution of autoxidizing salmon oils the public health aspects of the use of antibiotics in food and feedstuffs. repor~ of an expert committee composition and digestibility of corn silage as affected by fertilizer rate and plant population factors influencing the nitrate content of forage the component sugars and rate of hydrolysis of forage hemicelhflose as related to digestibility digestibility trials on forages in trinidad and their use in the prediction of nutritive value acetyl-(para-nitrophenyl)-sulfanilamide in feeds distribution of major and trace elements in some common pasture species -dimcthyl -( , , -trichlorophcnyl)phosphorothioate) in feeds feeding value of low-moisture alfalfa silage from conventional silos ovine bioenergeties and nutritional efficiency, with special reference to forage utilization a system for naming and describing feeds, energy terminology, and the use of such information in calculating diets supplemental methionine in a sixteen percent protein diet for laying chickens l : organic arsenicals in feeds cellulose degradation by enzymes added to ensiled forages influence of corn distillers dried grains with sohbles on the feeding value of wheat silage zinc content of certain feeds, associated materials, and water der natiirliche gehalt und die stabilit~it yon carotin und carotinoiden in lieu und silage forage digestibility. benzene-ethanol extracts of forage and faeces as indicators of digestibility the prediction of the metabolizable energy content of poultry fcedingstuffs from a knowledge of their chemical composition factors affecting the metabolizable energy content of poultry feeds. facters affeeting the metsbohzablo energy content of poultry feeds unidentified chick growth factor in fish solubles diet and histamine in the ruminant. occurrence of histamine in silage ethopabate in feeds die isolierung yon apocarotinalen aus luzernemehl isoflavone contents of red and subterranean clovers metabolizable energy of some oil seed meals and some unusual feedstuffs ii methoden der untersuchung yon lebens-und futtermitteln techniques analysis of foodstuffs and feeds n.n.: gdch-faehgruppe analysis of foods by neutronaetivation techniques continuous measurement of dissolved solids in food processes by critical-angle refractometry berieht fiber die vortragstagung des fachverbandes lebensmittelchemie der chemischen gesellschaft in der ddr vom . bis a rapid test for anionic detergents in drinking water fortschritte in der lebensmittelehemio dureh moderne analysenmethoden direct potcntiometrio determination of chloride in cheese modification of the polarimetrie starch determination on hig-hamyloso corn ~iethoden der ]~berwachung des wassers auf radioaktiviuit. btmdes prediction of quality in protein concentrates by laboratory procedures involving determination of soluble nitrogen direct chromatographic analysis of milk die eignung der bakteriologischen untersuehung yon kannenmilchproben als grundlage eines eu~ergesundheitsdienstes. arch. lebensmittel-i~yg nachweis fremder carotinoide in rangensiiften mittels diinnschichtchromatographie. i)t. lebensmit~el-l~dseh determination of phosphate composition of stock food calcium phosphate. . assoe. off. agric. chemists molybdenum in plants and animals. determination of molybdenum in biological materims with dithiol control of copper interference the determination of dissolved oxygen in canned drinks using a vibrating mercury-plated platinum electrode determination of chromium and lead in periodic acid solution and dialdehyde starch the utilization of infrared and ultraviolet spectrometric procedures for assay of pesticide residues ultramicro determination of potassium and sodium in biologic fluids gum quautativen naehweis des pektins und der alginsiiure ein schliffapparat zur abtrennung yon flfichtigcn stoffen mittel~ wasscrdampfdcstillation ~tude par chromatographic on phase gazeuse des acidea gras du beurre fabriqu en italic et clans d'autres pays. application ~, la recherche des falsifications clans le bcurre commercim. ann. falsiticatiorm expertise chim note on the determination of caffeine in coffee a comparison of the press method with taste-panel and shear measurements of tenderness in beef and lamb muscles zur diehtebestimmung der milch mit der neuen milehspindel. lebensmittelchem, u. geriehtl. chem. [ ] nr. , s. / . --aufschlu -.i~thercxtrakt und titrationswert bei der teigwarenunbersuehung als beispiel einer dynamisehen lebensmittelanalyse quantitative measures of carcass composition and qualitative evaluations fruit preservatives analysis. determination of calcium in cherry brines by versenat~ titration: elimination of anthocyanin interference by means of carbonyl reagents, g. agric. food chem allgemeine prinzipien der analytik yon carotinen und carotinoiden zur bestimmung yon ethoxyquin kolorimctrische bestimmung yon dipterex-riickst~nden yon lebensmittein the direct determination of shear stress-shear rate behavior of foods on the presence of a yield stress methods for evaluating bhe feeding quality of meat-andbone meals a paper chromatographic method for determination of vanillin and ethyl vanillin in vanilla flavorings ~drber die anwendung der papierchromatographisehen analyse auf dem fettgebiet. . mitt.: uber ls~iurereiehe samenole. ermittlung der konsti~uierenden fettsiiuren der samensie yon margosa (azadiraehta indies), cashewkern (anacardicum oecidentale) und putranjiva roxburghli. nahrung die jodzahl des riickenspeeks im verhiiltnis zu der qualit~t des futterfettes, dem alter der schweine und dem fettungsgrad bei sehweinen der ditnisehen landrasse determination of parathion, methyl parathion, epn, and their oxons in some fruit and vegetable crops an improved chromatographic method for determining trace elements in foodstuffs determination of guthion residues on fruits techniques used in meat flavor research the analysis of edible oils contaminated with synthetic ester lubricants colorimetrische bestimmtmg yon nitrat und nitrit in biologisehem material confrontation de quelques proe~d s de dosage iodometrique de l'anhydride sulfureux dans les vins zur anwendung der massenspektroskopie zur strukturermittlung yon naturstoffen, mit besonderer berfieksiehtigung der lebensmittelanalytik z lebensmittel cellulose solubility as an estimate of cellulose digestibillty and nutritive value of grasses the -thiobarbiturie acid reagent for determination of oxidative rancidity in fish oils bet die eignung yon daphnia magna zur ermittlung yon riieksti~nden auf frisehem bst und gemfise bemcrkung zum diinnschiehtchromatographischen kakaoschalennaehweis nach the determination of vitamin a in animal tissues and its presence in the liver of the vitamin a-deficient rat elution column preparation of leaf sample for flame photometry. ii. : determination of calcium in tobacco enzymatic-ultraviolet method for determination of uric acid in flour die bestimmung der amyiaseaktivit~t und einige studien fiber amylaseaktivit~it in gekeimtem roggen identification and determination of ascorbic acid (vitamin c) with janus green and its localisation in mitochondria cho]estehnbestimmung im kleinen laboratorlum chick edema factor. iii.: application of mieroeoulometric gas chromatography to detection of chick edema factor in fats or fatty acids bioassay of chick edema factor. collaborative study determination of nih te and nitrate in meat products the measurement of the surface areas of milk powders by a permeability procedure sedimentbeurteilung und sehaim-~iastitis-test als sortierverfahren zur ermittlung yon sekretionsstsrungen bei der ~iassenuntersuchung yon milchproben aoac methods for nutritional adjuncts die ermittlung der ribonucleins~ure im pflanzenmaterial beitrag zur analytischen beurteilung des frischezustandes der pharmazeutisch verwendeten e und fe~te sehnellbestimmung yon kupfer in fe~n applications of oscillographic polarography to the determination of organophosphorus pesticides. ii. : a rapid screening procedure for the determination of parathion in some t~its and vegetables zur standardisierung der vitaminb~-bestimmung in getreide und getreideprodukten eine mikromethode zur bestimmung des fettgehaltes der milch kleiner laboratoriumstiere the determination of organophosphato pesticides and their residues by paper chromatography die l~fikrochemie beim studium yon nahrung und erni~hrung aearicide residues. an improved method for kelthane residue analysis with applications for determination of residues in milk collaborative study of the determination of ethoxyquin in feeds ~rber den naehweis yon quellstoffcn in fleischwaren und m gliche stsrungen durch andere polysaccharide ersatz manueller labormethoden der l sungsspektralanalyse durch den automaten determination of total acids in wines: american society of enologists determination of aldehydes in wines and spirit~ by the direct bisulilte method ~fber einen vereinfachten nachweis des vitamin b mit poteriochromonas slipitata extraction of nys~tin in animal feeds for microbiological analysis zur quantitativen bestimmung der sorbinsi~ure mit dem thiobarbitur-s~urereagenz cottage cheese problems in production and sanitation. quality control in cottage cheese sitzung des arbeitskreises berlin der gdch-fachgruppe lebensmittelchemie und gerichtliche chemie am . . in ]~erlin-dahlem ( vortragsreferate) ein neues elektronisches sctmeuverfahren zur ermittlung der ~risehe yon seefischen fatty acids of lard. a. identification by gas.liquid chromatography oxydative abbauprodukte der l-aseorbins~ure. . mitt. : papierchromatographischer l~achweis analysis of orange juice for total earotenoids, carotenes, and added betacarotene. food technol. [ ] nr. , s. / . u. r~ck~, a. a. : refractometrio measurement of soluble solids in orange juice analysis of iron chelates in plant extracts. il: ferric ethylenediamine'bis'(~ acid) determination of n-aeetylglucosamine- -phosphate and n-aeetylglucosamine in milk arsenic in foods: collaborative comparison of the aminemolybdenum blue and the silver diethyldithiocarbamate methods improved method for testing macaroni products neuere beitri~ge zur chemic der st~rkefraktionen. . mitt.: die mikro-ameisens~urebestimmung bei der perjodat- xydatonsbestimmungsmethode yon stiirke recherche des falsifications dans les extraits de vanille. ann. falsifications expertise chim bestimmung des gesamtstckstoffgehaltes yon milch nach der kjwld~t:l-methode. internationaler standard fil vorteile und grenzen des einsatzes yon markiertem phosphat bei untersuchungen am hiihnerei paper chromatography of carotene and carotenoids determinaton of sevin insecticide residues in fruits and vegetables insecticide residues in meat and eggs. determinaton of sevin insecticide and its metabolites in poultry tissues and eggs bcstimmung der jodzahl yon fetten und en mittels n-bromsuccinimid. . mitt.: l~ber eine !~i gliehkeit zur bestimmung der gesamtjodzahl yon el~iostearins~ure und holz . nahrung [ ] nr. , s. / . kxrr:~e~r, m. a. : ~ ber die anwendbarkeit des thiobarbi~urs~iuretestes boi der untersuchung yon milchprodukten application of gas chromatography to the measurement of gas permeability of packaging materials a sensitive method for quantitative microdetermination of lipids comparison of chemical and microbiological methods for the determination of procaine penicillin in prcmixes and mixed feeds electrophoretc analysis of flour proteins from various varieties of wheat katalytische methode zur bestmmung kleinster manganmengen in lebensmitteln am beispiel der milch eine methode zur papierchromatographischen qualits yon silagen comparison of methods of measuring potassium in pork and lamb and prediction of their composition from sodium and potassium electron capture gas chromatography for determination of ddt in butter and some vegetable oils eine methode zum schnelinachweis yon nitriten in yleiseh-und wurstwaren. arch. lebensmittel-hyg determination ofglyodin residues on pears and peaches eleetrophoretic analysis of flour proteins erstes internationales symposium fiber methoden zur analyse yon lebensmitteln in bordeaux-tatence (frankreich) veto . bis . oktober on the problem of luminescence technique of protein definition in milk beitr~ge zur aminos/iuren-bestimmung in biologischem material aktuelle fragen der lebensmitteluntersuehung, insbesondere der histologischen wurstanalyse untersuchungen yon importiertem tiefgefrorenem tiasenfleisch argentinischer herkunft fish hydrolysates. iv.: microbiological evaluation untersuehungon zum naehweis yon emulgatoren in lebensmit~eln. . mitteilung. forte use of a slice-tenderness evaluation device with pork studies on improvements in quantitative paper chromatography of amino acids in foods. i. : research on procedure of development studies on improvements in quantitative paper chromatography of amino acids in foods. ii.: selection of solvent systems determination of the equilibrium relative humidity of foods untersuchungen fiber die methodik der quantitativen bestimmung yon heizolen und flfissigen treibstoffen im wasser determination of ~-lactalbumin in complex systems hydrogen sulphide in cheddar cheese; its estimation and possible contribution to flavour ascorbic acid measurement. polarographic determination of total ascorbic acid in foods trans-fettsguregehalt yon sehweineschmalz nach fiitterung von schweinen mit rindertalghaltigem kraftfutter. (ein beitrag zur quantitativen infrarotspektroskopischen bestimmung yon trans-fetts~uren in fetten the importance of starch on the microscopic identification of cereal grains in feeds insecticide residues. chromatographic identification of some organophosphate insecticides in the presence of plant extracts chemical and biological estimation of the carotene content in fresh and processed italian apricots sialir acid as an index of the u-casein content of bovine skimmilk foodstuffs analysis. nonvolatile acids of blueberries le dosage de la matibre grasse dans les fromages. ]~tude critique de la m thode en usage au laboratoire municipal la d termination de residus d'insecticides et de fongieides par la m thode polarographique fusel oil determination by gas.liquid chromatography * die brabanter mastitis-reaktion, ein neues verfahren zur ermittlung yon sekretionsstsrungen des euters dutch die kannenmilchuntersuchung uber eine enzymatisehe _&pfels~urebestimmung in wein und traubensaft direct microscopic technique to detect viable yeast cells in pasteurized orange drink die enzymatisehe bestimmung der glucose und saecharose und ibre anwendung in der lebensmittelanalyse a modified zirconium-alizarin method for determining fluoride in natural waters paper chromatography of some cholesterol derivatives determination of moisture by nuclear magnetic resonance and oven methods in wheat, flour, doughs, and dried fruits dye binding by soybean and fish meal as an index of quality the absorptiometrie determination of silicon in water: part l: formation, stability, and reduction of cr and fl-molybdosilicie acids column chromatography of soybean whey proteins enzymatic determination of carbon dioxide in lightly carbonated wines. collaborative study nachweis yon biiffelmilch als verf/flschungsmittel in kuhmilch durch serologisehe methoden photometric determination of phosphate in wines pertinent references to analytical lipid methods published recently, ft spectrophotometric estimation of nucleic acid of plant leaves hemmstoffe in der aalieferungsmilch und methoden zu ihrem nachweis determination of potassium in tobacco determination of chlorides in tebacco phospholipase c determination by egg yolk turbidimetry oher die inhaltsstoffe der rol]kastanie und versuche zu ihrer gehaltsbestimmung mierobiologlcal method for assaying nystatin in animal feeds / ] nr. , s. / . --gaschromatographische untersuchungen yon fuselslen aus versehiedenen g~rproduk-ten. . mitt.: methodik der fusel bestimmung lactose activity measurements. evaluation of laetase preparations for use in breadmaking comparison of methods for determination of lysino in cereals direct determination of calcium in plants, soils, and milk by means of a flame photometer colorimetrie determination of urea in feeds kolorimetrische bestimmung des dihydrox-yacetons fluoride, teeth, and the analyst the quantitative micro-determination of biphenyl in citrus fruit kolorimetrisehes verfahren zur gleichzcitigen bestimmung der weinsiiure und milchsiiurc in wein und most estimation of extra-cellular starch of dehydrated potatoes versuehe zur chromatographischen trennung yon kohlenhydra~en und eiweil en aus dem w~ibrigen extrakt yon roggenvollkornmehl quantitative determination of the amino acid content of rumen fluid from twin steers fed soybean oil meal or urea vemnche zur chemischen differenzierung der eiwei•stoffe des weizens und roggens aromastoffe des brotes. versuch einer auswertung chemiseher gesehacksanalysen mit hilfe des schwellenwertes zur trennung yon sacchariden an kohle]celit~-s~ulen. ern~hrungs-fomchung untersuehungen zur bestimmung der lsslichkeit yon milchpulver * selection of a medium for the isolation and enumeration of enterocoeei in dairy products colorimetric determination of amino nitrogen in corn syrups stxogene und versuche zu ihrem nachweis in gefliigelflelsch. dt. lebensmittel ein beitr~g bert. die verwendung der anatysen-q~arzlampe zu friihzeitiger erkennung der l~anzigkeit colorimetrlsche methode zur bestimmung yon -monoglyceriden in eiskrem determination of fusel oil in distilled spirits zu den m glichen fehlerquellcn bei der histometrischen ermittlung des kollagen-, bzw. gelatine-gehaltes bei briihwiirsten studium der uv-spektren der auf hshere tempcraturen erhitzten e measuring of oil-binding characteristics of flour naehweis der konservierungsmittel mit hilfe cl~r papierehromatographie identification of ch]orogenic acid in castor bean and oranges. canad evaluation of forages in the laboratory. iii.: comparison of various methods for predicting silage digestibility feed microscopy essential oils. determination of botanical and geographical origin of spearmint oils by gas chromatographic and ultraviolet analysis fluorometric determination of chlortetracycline in premixes dfinnschicht-chromatographie yon carotin-und carotinoidgemischen measurement of the sub-sieve particle size distribution of flour chemisehe bestimmung der riickst~nde yon parathion, malathion und diazinon auf blumenkohl, kohlrabi, bohnen und gurken determination of cadmium anthranilate in feeds nachweis yon penicillin und anderen antibiotika in milch microbiological evaluation of protein quality with tetrahymenapyriformis w. .: a simplified assay procedure peanut lipoprotein. ii. : analysis in foods by gas chromatography beitrag zum papierchromatographisehen und spektrophotometrisehen i~achweis fettlsslieher synthetiseher farbstoffe in lebensmitteln und kosmetika elektroehemische sauerstoffbestimmung in olefinischen fetich bestimmung der gesamten sehwefligen s~ure in getr~nken the modified whiteside test. recommended procedures for bulk or blended milk deliveries die bestimmung yon mileheiweib in fleiseherzeugnisscn. dr. lebensmittel zur bestimmung des veresterungsgrades yon pektin a quantitative fluorometrie method for the determination of serpasil (reserpine) in feeds at the micro level fluorimetric mierodetermination of carbohydrates zur l~iethodik der klebrigkeitsbestimmung yon brot frage der papierchromatographischen untersuchung von amylopektin und amylose chemical determination of diethylstilbestrol residues in the tissues of treated chickens gas chromatographic identification of components in m~ple sirup fl~vor extract dark discoloration of canned all-green asparagus. ii. development of a new tin plate for its control thermal conductivity and density of chicken breast mnsele and skim beitrag zur bestimmung der fluoreszenz in spriten microbiological determination of alanine in proteins and foods collaborative study of the method for counting microorganisms in maple sirup glass fiber paper strip charring. a rapid and simple method for monitoring column chromatography of lipids verbesserte mcthode zum serienm~bigcn quantitativen nachweis yon insektizidrficksti~nden bei bst und gemfise use of the shear press in determining fibronsness of raw and canned green asparagus betrachtungen fiber verschiedene methodcn zur bestimmnng des bindegewebeanteiles in rohem fleisch und fleischwaren. arch. lebensmittel-hyg the determination of citric acid in milk and milk sera formaldehyde in maple sirup: an adaption of the nash method polarographische bestimmung dcr ascorbinsiiure und des gesamt-vitamin c untersuchungcn fiber die mbglichkeit zur objektiven beurteilung der organoleptischen eigensehaften yon kokosraspeln analysis of the flavor and aroma constituents of florida orange juices by gas chromatography assay for cyzine in finished feeds dfinnsehicht-ehromatographische trennnngen yon synthetischen lebensmitteffarbstoffen auf ceuulose-schichten evaluation of quantitative methods of determining peroxidase in vegetables. i.: the indophenol and the o-phenylenediamine methods ein beitrag zur untersuchung thermisch oxydierter fette. dt. lehensmittel a more accurate method for determination of caffeine in decaffeinated coffee bestimmung yon vitamin c in friichten, fruchtsiiften, gemiise und konserven naeh der methode naeh tillma~s unter ausschaltung reduzierender stoffe lebensmittelrecht und lebensmitteliiberwaehung nutritional laws and nutritional control briihwiirst~ einfaeher qualit~t, die zu einem kaliber unt~r mm in den verkehr gebracht werden, sind i.s. des w nr. lmg irreffihrend aufgemaeht new regulations under the food and drugs act lebcnsmittel-rdseh orb der kenntlichmaehung fremder stoffe probleme des geltenden lebensmittelrechts glasurmittel ftir r stkaffee beschr~nkt zugelassen food technol. [ ] nr. , s. . n. iv. : revised u. k. preservatives regulations. food teehnol ausschub fiir lebeusmittelrechtliche fragen der fachgruppe lebensmittelehemio und gerichtliche chemic in der geseuschaft dcutscher chemiker vi. t~tigkeitsberieht der eidg. kommission fiir volksern~hrung, lebensmittelgesetzgebung und -kontrolle (eek) zu h~nden des eidg. departementes des i_nnern, umfassend die jahre federal food, drug, and cosmetic act, as amended. selected u. s. government publ. nr. ; h. catalog i~o verkauf yon frikadellen mit brotzusatz in gastwirtsehaften tierarzneimittel und aufzuchtmittel in der landwirtschaftlichen praxis. gesundheitliche erw~gungen zum schutze des konsumenten bei der anwendung yon tierarzneimitteln und aufzuchtmit~ln in der landwirtschaftliehen praxis. teil hi der amtsarzt und das neue lebensmitt~lgesetz. off ist der beschlub des bundesgerichtshofes vom . . - stl~ -geeignet, den vertrieb yon hackfleiseh befriedigend zu regeln ? arch. lebensmittel-hyg zur beurteilung des saccharingehaltes yon meerrettiehzubereitungen. dt. lebensmittel zum begriff ,mai]gebend" in w a abs. des lebensmittelgesetzes aspect sanitaire et ldgal actuel des aliments conservds verbrauchererwaxtung und lebensmitte]kontrolle bei fleiseherzeugnissen bei ger~ueherten fleischwaren. sehwarz-w~lder speck, schwarzw~lder sehinkenspeck, sehwarzw~lder sehinken. arch. lebensmittel-ttyg kampf der wasserverseuchung. aktuelle notwendigkeiten -gesetzliehe ~/isgliehkeiten. ~iiinchener reed. wschr. lebensmittelreehtliehe stellung yon carotin und carotinoiden in der schweiz die lebensmittelgesetzgebung sterreiehs, der sehweiz und der bundesrepublik deutschland. eine vergleiehende untersuchung arzneimittel-lebensmittel dr. lebensmittel-rdseh erwiderung des autors (zur stellungnahme yon f. nitzsc~, d~. lebensmittel-rdseh. [ ] nr. , s. ). i)t. lebensmittel das lebensmittelgesetz und seine auswirkung auf die gummiindtlstrie codex alimentarius austriacus absehliel]ende stellungnahme der ort der kenntlichmachtmg fremder stoffe. dr. lebensmittel-rdsch _rreffihrende bezeiehnung und angaben bei limonade aus mineralamem tafelwasser. dr. lebensmittel zur herkunftsbezeichnung yon lebensmitteln. dt. lebensmlttel aktuelles zur lebensmitteliiberwachung einige beispiele fiir die auswirkung der neuen lebensmittelrechtliehen vorschriften auf erniihrungs-und landwirtschaft auf dem wege zum einheitliehen lebensmittel-rech~ zur ~r yon b. rsssl~: welehe anforderungen sind an alkoholhaltige siigwaren zu stellen? dr die silbertmg yon tafelwiissern mokka-kaffee" auch im handel mit kaffeebohnen keine tterkunftsbezeiehnung [ ] nr. , s. . --zuliissigkeit trod grenzen bildlicher darstellungen yon fleiseh und ylelscherzeugnissen auf paclmngen yon suppen in trockener form. db. lebensmit~el-l~lsch bedeutung und beur~eitung yon galtstreptokokken in vorzugsmileh. arch. lebensmittel-hyg auf clipverschliissen zul~ssig ist die infektion mit trichinen aus amtlich untersuehtem schwelnefleiseh im lichte der mathematischen analyse der bestimmungen der fleischbesehau yon schweinefleiseh msglich? arch. lebensmittel-hyg intema~ionales rundgespr~ch fiber lebensmitte]ehemische probleme in wiesbaden und eltville a. rh. z. lebensmittel-untersuch. u. -forschung. [ ] nr. , s. . --kommission zur priifung fremder stoffe bei lebensmitteln (fremdstoff-kommission) der deutsehen forsehungsgemeinsehaft entwicklung des lebensmittelrechts im nationalen und internationalen bereich entwicklung des lebensmittelrechts im nationalen und inter~ationalen bereich entwicklung des lebensmittelrechts im nationalen und internationalen bereich de warenwet en de hierop berustendc besluiten zum entwurf einer harmonisierung der lebensmittelrechtlichen bestimmungen fiber konservierungsstoffe im gebiet der europaischen wirtschaftsgemeinschaft. er lebensmittelrechtliche stellung yon carotinen und carotinoiden in der bundesrepublik deutschland die instrumente der lebensmittelfiberwachung in osterreich sterreichischer standpunkt zur l~rage der f~rbung yon lebensmitteln mit carotinen und carotinolden. wiss. ver ff. dr. ges. ernkhrung lst es zu vcrtreten, dab das fleisch schwacbfinniger rinder auch in gebr~tetem zustand eingefroren wird? z lrch. lebensmittel-iiyg richtlinien fiber die zulassung yon gegeusachverst~ndigen zur untersuchung yon lebensmittel-gegenproben die ]ebensmittelrechtliche bedeutung yon bildiichen darstellmlgen auf verpackungen diverse problems (education, documentation associations, terminology etc the nutritional education of the food technologist proposed formation of a food engineering panel within the food group studium der hauswirtschafts-und ern~hrungswissenschaften. ern~hrungswirt-sehaft l an information service for the american food industry st international congress of food science and technology. symposium on education and training technically trained people for developing countries lft paces the food frontier un committee on food additives international standardization of fruit and vegetables. food technol terminology and methods for feeding and weighing animals food additives and food standards beitriige zur durehffihrung der umweltsradloaktivitiits- berwaehung training dairy personnel in denmark training opportunities for the sanitarian-specialized in-service training r die benennungen honigkueheniihnlicher oebiieke als lobkuehen und lebzelten sowie als biber und bibenzelten. dt. lebensmittel nutrition and dietetics for the medical student problem of keeping dairy plants supplied with the foreman-type employee literaturdokumentation fiir hochschulassistenf~n a system for naming and describing feeds, energy terminology, and the use of such information in calculating diets a brief sketch of veterinary periodical literature in great britain before the foundation of the veterinary record a conference on nutrition teaching in medical schools the food service industry and its relation to the control of foodborne illness research and educational progress in nutrition informationsm gliehkeiten fiir tieriirzte auf dem measuring readability of health education literature the dai~y literature problem the central food technological research institute the education and function of the nutritionist training in food service for nursing homes. l: tools for evaluation training in food service for nursing homes. iii.: observations on management of twelve units pennsylvania takes a look at nutrition in the orthopedic program zur definition der begriffe ,aroma" und l~ber die notwendigkeit einer priifung for beamtete sehlaehthofleiter. arch. lebensmittel-hyg organisation der dokumentation in einem forschungsinstitut (bundes. forsehungsanstalt f'tir lebensmittelfrischhaltung in karlsruhe). dt. lebensmittel-rdsch on changing the name of our assoziation. for a name change pporbunities in nutrition education questionnaires to identify nursing homes most in need of dietary counsel. publ. health irep lebensmittelwissenschafttiche institute in bulgarien zur stellung des psychiaters in der alkoholfrage naas nutrition chemists. vcter on changing the name of our association. against a name changing begriffe) in sprachen: deutseh, russiseh, polnisch, tsehechisch, slowakisch, ungariseh, bulgariseh, serbokroatiseh, rum~nisch, englisch the role of nutrition in the teaching of medicine key: cord- - igesqw authors: patra, prasanta; mondal, niladri; patra, bidhan chandra; bhattacharya, manojit title: epitope-based vaccine designing of nocardia asteroides targeting the virulence factor mce-family protein by immunoinformatics approach date: - - journal: int j pept res ther doi: . /s - - - sha: doc_id: cord_uid: igesqw nocardia asteroides is the main causative agent responsible for nocardiosis disease in immunocompromised patient viz. acquired immunodeficiency syndrome (aids), malignancy, diabetic, organ recipient and genetic disorders. the virulence factor and outer membrane protein pertains immense contribution towards the designing of epitopic vaccine and limiting the robust outbreak of diseases. while epitopic based vaccine element carrying b and t cell epitope along with adjuvant is highly immunoprophylactic in nature. present research equips immunoinformatics to figure out the suitable epitopes for effective vaccine designing. the selected epitopes vlgssvqta, vnielkpef and vvpsnlfav amino acids sequence are identified by hla-drb alleles of both mhc class (mhc-i and ii) molecules. simultaneously, these also accessible to b-cell, confirmed through the abcpred server. antigenic property expression is validated by the vaxijen antigenic prediction web portal. molecular docking between the epitopes and t cell receptor delta chain authenticate the accurate interaction between epitope and receptor with significantly low binding energy. easy access of epitopes to immune system also be concluded as transmembrane nature of the protein verified by using of tmhmm server. appropriate structural identity of the virulence factor mce-family protein generated through phyre server and subsequently validated by prosa and procheck program suite. the structural configuration of theses epitopes also shaped using distill web server. both the structure of epitopes and protein will contribute a significant step in designing of epitopic vaccine against n. asteroides. therefore, such immunoinformatics based computational drive definitely provides a conspicuous impel towards the development of epitopic vaccine as a promising remedy of nocardiosis the gram positive bacillus bacteria, nocardia asteroides causing nocardiosis targeting to immunocompromised patients in direct state (wilson ) . a total of species have been estimated till now under the genera of nocardia (bennett et al. ) . it is an opportunistic pathogen responsible for several diseases in human as well as other vertebrate animal (moylett et al. ) . practically these bacteria serve as predominant causing factors for pulmonary infection. consequently it shows acute necrotizing pneumonia that also reflects in the inflammation of cutaneous and subcutaneous tissue. while the brain is recognized as most favored site for secondary infection in % patients amongst all victims (ellis and beaman ; quinones-hinojosa ) . subsequently n. asteroides is the prime cause of serious cerebral abscess as reported by researcher (fleetwood et al. ) . nocardiosis is distributed worldwide but it is most prevalence through the united states (patil et al. ; saubolle and sussland ) . infection of nocardiasis is influences by the sex and age of the patient. as noted the male female patient ratio : reflects that men are prone to nocardiasis rather than the female. recently, around . nocardia infection for every individuals have been reported throughout the world (palmieri et al. ) . exclusively - individuals are affected in united states every year in a random mode (sethy et al. ) . numerous endeavors are taken to overcome the fatal disease but till now the disease is prevalence as research data reported. the nocardiosis disease is predominant to the alcoholic, diabetic, cancer, aids and organ transplanted patient so, its need immediate control and monitoring supports. in present scenario records of these patients are quiet in frequent numbers. therefore, this disease already occupied great concern and requires immediate remedy by promising, cost effective ways. vaccination against the disease will be the finest and reliable option for controlling the disease outbreak (nichol et al. ) . a vaccine would have more effective if it can elicit both b and t-cell dependent immune responses. vaccine designing against any disease can be performed using two specific approaches, conventional approach and reverse vaccinological approach. within the present research plan the reverse vaccinological approach has been applied to design epitopic vaccine against n. asteroides as the conventional technique has some limitations. the conventional technique requires much time and also very expensive to formulate. furthermore, the technique applied to killed, attenuated or indolent pathogens for vaccine development which will prove deadly if the pathogen alters to its pathogenic appearance (khan et al. ) . besides from such conventional approach, the reverse vaccinology harnesses the immunoinformatics for developing vaccine against pathogenic diseases (vivona et al. ) . the advance approach involves several web portals and computational tools to predict the potential vaccine candidate. the fundamentality of reverse vaccinology showing it targets of protein derived epitopic sequence for novel vaccine generation. the reverse vaccinology provides easy and reliable techniques to find out the accurate b and t-cell epitopes (srivastava et al. ) . while the exomembrane and secreted protein of pathogens are the ideal for vaccine generation as these can easily interact with immune system (gourlay et al. ) . considering the bacterial proteins are foreign to host body, it will be further recognized as antigenic determinants. expression of both b and t-cell epitopes will enhance the immunogenicity driven by b and t-cell proliferation. the virulence factor of mce family protein found to be responsible for the virulency of n. asteroides and pathogenicity expression. the current research focuses on the computational analysis and identification of potential epitopes present within the bacterial mce family protein candidate. therefore, identified epitopes will be served as ideal component for future vaccine development against the infection of n. asteroides. in order to execute the subsequent research performance the supreme computational aided techniques has been employed. consequently the targeted protein sequence has been crucially analyzed through several authentic web prediction portals focus to extract the functional epitopes leading to narrative vaccine generation against nocardiosis disease. the amino acid sequences of a protein is the preliminary requirement for development of computer aided epitopic vaccine component. amino acid sequence of the virulence factor mce-family protein has been retrieve from ncbi protein database (coordinators ) . identified amino acids sequence was downloaded and carried out further computer based analysis particularly supported in fasta file format. b-cell epitope identification is one of the most crucial steps intended to development of epitopic vaccine (jamal et al. ) . potential b-cell epitope has been predicted employing the abcpred web server (saha and raghava ) . the server utilizes recurrent neural network technique to locate the specific b-cell epitopes present in targeted protein sequence. the server also uses five parameters to validate the predicted outputs. these factors are: q ppv (positive prediction value) = tp tp + fp tp, fn, tn and fp represents true positive, false negative, true negative and false positive output respectively (saha and raghava ) . the collective fasta sequence of targeted protein virulence factor mce-family protein has been submitted to the abcpred server. further, for the b-cell epitope identification, threshold score is set to . and window length is set to in order to clarify the result. moreover an overlapping filter has been used to eliminate the overlapped epitopic prediction. in order to generate the immense immune response, an antigenic epitope necessarily to be accessible for the both mhc class type molecules (naz et al. ) along with the b-cell (barh et al. ; bhattacharya et al. ) . in support to the prediction of potent t-cell epitopes, the propred (singh and raghava ) and propred-i (singh and raghava ) servers are appointed for mhc-ii and mhc-i molecules correspondingly. the propred and propred-i web servers predict epitopes that can be recognized by mhc-ii and mhc-i alleles (mustafa and shaban ) . both the server applied matrices prediction algorithm method (lafuente and reche ; lin et al. ) to find out the potent t-cell epitope. current research work predicted epitopes for b-cell, are submitted to the propred and propred-i server with default parameters to identify the common epitopes anticipate by both b and t-cell. predicted common epitopes for both the b and t-cell are further investigated for their antigenic property through the vaxijen (v. . ) server (doytchinova and flower ) . the server requires a query sequence of amino acids to evaluate the antigenic propensity with - % accuracy level (dimitrov et al. ) . the server also specifies the field of antigenic prediction in five subsequent target group like-bacteria, virus, tumor, parasite and fungal (zaharieva et al. ). the auto cross covariance (acc) method has been also incorporated to convert the submitted sequence into uniform length of amino acids chain (doytchinova and flower ) . the acc is estimated by the formulas ( ) where 'j' is the z-scale (j = , , ), 'n' is the amino acids present in sequence (i = , ,…n) and 'l' is the lag (l = , ,…l). the eq. ( ) is used when there are two different zscales. the query sequence has been submitted to the server by selecting bacteria as target organism. in order to obtain more precise result, a threshold value of . has been set instead of default threshold value ( . ). the common epitopes of b and t-cell having antigenic property are promising candidate for the epitopic vaccine designing. the epitopes predicted by abcpred, propred and propred-i server also having vaxijen predicted antigenic nature are selected for the vaccine development. three dimensional conformation of the epitope have its significant role leading to vaccine development. it is the most vibrant object for investigation of epitope-antibody docking system (alam et al. ). due to short amino acid length of the epitope conventional modeling servers are not in use so, distill . (baú et al. ) server have been introduced for the process. the server uses two sets of bidirectional recurrent neural networks technique (pollastri et al. ) . for the result output, filtering is applied in two successive stages. in first stage network input has been amplified through prophecies and averaged over several adjacent windows. if σj = (αj, βj, ϒj) signifies output of j, subsequent to helix, strand and coil prediction reflects the inputs. in second stage network, i j is the input instead of j. in which k f = j + f( ω + ), ω + represents window size (ω = ) and p + represents window numbers (p = ) taken into account (pollastri and mclysaght ) . pbd file of the experimental epitopic sequence is provided by the server as result element. the three dimensional ( d) architecture of a protein plays essential role in protein function and stability (roy et al. ; willard et al. ) . so, unrevealing the protein structural configuration is quiet necessary and purposeful for the study. the virulence factor mce-family protein structure of n. asteroides is unavailable in pdb database (berman et al. ) . because of the unavailability of pdb structure, the phyre (kelley et al. ) web server has been introduced to generate the protein structural data. the phyre server applies the hidden markov model alignment to detect structure of target protein through hhsearch, open-source software package (kumar and jena ; nema and pal ) . this server also uses poing folding simulation to figure out the non allied part of the protein sequence (nema and pal ) . amino acid sequence of virulence factor mce-family protein has been submitted to the server and a zip file containing predictions is comes out as systematic result component. justification of a predicted protein model is very much fundamental aspect for establishing the specified protein d configuration (laskowski et al. ; rodriguez et al. ) . to properly justify the model quality, two web portals are methodologically assigned. the procheck web server (laskowski et al. ) , to analyze stereochemical properties of the protein model emphasizing on the torsion angle of c α atoms of amino acids. a pdb file of the targeted protein model has to provide to the online web server. the server offered a ramachandran plot of all available amino acids, that also vital for the validation of targeted protein model (laskowski et al. ). the prosa-web is introduce to calculate the z score of the protein model and furnishes a plot of the protein model within all known protein structure (wiederstein and sippl ) . this server also furnishes energy plot for perfect assessment of the model quality. while the negative energy value of the amino acid residues indicates a good model quality of examined protein (belkina et al. ; wiederstein and sippl ) . transmembrane helix prediction of the protein model was performed through tmhmm (ver. . ) server, the standalone software package (sonnhammer et al. ). the server predicts transmembrane helices based on a hidden markov model method with - % accuracy of result (krogh et al. ) . fasta sequence of the protein has been submitted to the server and the server provides lists of transmembrane prediction along with an apparent graphical interpretation. molecular docking between epitope and antibody component is the critical for proper functioning of vaccine candidate. in order to carry out molecular docking the patchdock server have been appointed in present research (duhovny et al. ) . the geometric contour complementarity method has been applied for docking the peptide and protein sequences (schneidman- duhovny et al. ) . hence, the pdb file or pdb code of the protein and epitope has to be submitted to the server for molecular docking analysis. the accurate docked complexes ranked according to geometric shape complimentary score and presented along with ace value, interface area and pdb data (schneidman- duhovny et al. ) . in molecular docking the aggregated desolvation energy of atom pairs termed as ace. atom set inside s and s with threshold distance d, the ace is: here |s − t| signifies euclidean distance amid s and t, t [s,t] signifies the prearranged score of s and t atom pair. t [s,t] is estimated using the subsequent formula: here signifies the solvent. (n s,t ) is number of s,t connection and the number of s- connection (n t, ) are suitable connection numbers of recognized complexes. moreover, cs,t and cs, are signifies as the possible numbers of s,t connection and s- connection (guo et al. ) . the amino acid sequence of the protein virulence factor mce-family protein has been retrieved from the ncbi protein database. the virulence factor mce-family protein consisting of amino acids and the sequence was downloaded in fasta format (genbank: sfl . ) (benson et al. ) . afterwards, this sequence processed through several web servers to find out the potential epitopes for promising vaccine development. abcpred server predicts linear epitopes (table ) within the virulence factor mce-family protein of amino acids length (window length). the sensitivity, specificity and accuracy are . %, . % and . % correspondingly at window length of (saha and raghava ) . a score is also assigned against predicted epitopes and are ranked accordingly (han et al. ) . predicted sequence with the higher score secures better chance to be an potent epitope (jones and carter ) . the predicted epitopes are superior because of high threshold score of . . the identified b-cell epitope were further investigated for detection of prospective t-cell epitope through propred and propred-i to improve immunogenicity (oprea and antohe ). the b-cell epitopes those are recognized by both mhc molecules are taken under additional consideration. the epitopes common to both mhc classes and b-cell are listed in the table which may be recognized for ideal vaccine candidate. following mhc alleles are also enlisted in the supplementary table. the propred server predicts epitopes of only nine amino acids moreover, the common epitopes comprised of mer sequence. manifestation of antigenicity is the prime criterion of a novel epitope in immunobiological aspect (chen et al. ). the common b and t-cell epitopes are successively validated against antigenic propensity via vaxijen server; table presents the epitopes along with relevant antigenic score. the mer epitopes availing antigenic score beyond the threshold value . secures antigenic characteristics. in that consequence out of meric epitopes proved futile to express antigenicity. three mers vlgssvqta, vnielkpef and vvpsnlfav having antigenic score . , . and . respectively proved to secure antigenic propensity. the three mer b and t-cell epitopes vlgssvqta, vnielkpef and vvpsnlfav having antigenicity (table ) will be easily accessible to the immune system (comerford et al. ) . these epitopes can be utilized for the future prospective of epitopic vaccine development. the epitopes will elicit strong immune response when administrated in the body as potent vaccine element. the selected epitopes vlgssvqta, vnielkpef and vvpsnlfav are submitted to distill server ( . ). the distill server contributes five models of each of the epitope in pdb format. only the top ranked model of epitopes is taken under consideration. ucsf chimera (ver. . . ) program has been implemented for visualization of the selected pdb files of the targeted epitopes (pettersen et al. ). subsequently the generated images are retrieved and presented in the fig. a , b, c in order of vlgssvqta, vnielkpef and vvpsnlfav protein. the virulence factor mce-family protein lacking pdb entry, for that reason the structure of protein has been depicted by homology modeling. modeling of virulence factor mce-family protein has been performed via hidden markov model in phyre server. amongst amino acids of virulence factor mce-family protein %, %, and % amino acid construct α-helix, β-strand and transmembrane helix respectively. phyre predicted pdb data of the protein processed through ucsf chimera program in order to generate structural image. figure showing the three dimensional conformation of virulence factor mce-family protein in n. asteroids. the topology of virulence factor mce-family protein is justified using prosa and procheck web servers. according to the prediction of procheck generated ramachandran plot (fig. ) , . % of residues exist in most favoured region and only . % exists in disallowed region (table ). existence of only two non-glycine and non-proline residues in disallowed region established the model quality. prosa estimated z score (− . ) of the model resides within the range of experimentally proved protein structures (fig. ) (sharma and jaiswal ) . as presented in fig. , most of the residues of the model possess negative energy value only few have positive value then the model is justified (bodade et al. ). pdb file of vlgssvqta, vnielkpef and vvpsnl-fav epitopes and variable domain of t cell receptor delta chain (pdb id: tvd) are submitted to patchdock algorithm based server (li et al. ) . the server provides docking complex for each epitope ranking them against geometric shape complimentary score. only the top ranked complex of each epitopes is picked for computational analysis. epitope along with score, area of interface and ace value are listed in table . the significantly low ace value of docking complexes indicated elevated reactivity between epitope and (guo et al. ; lavi et al. ). the docking of selected epitopes and t cell receptor confirms that the epitopes will accessible to immune system and generate specific immunogenicity. rational identification, authentication and in sillico analysis of epitopic components facilitates the successful generation of novel vaccines. as noted the vaccine element performs a key role not only recovering from the infections but also controlling the future disease outbreak. hence, the nocardiosis has carried a dreadful influence over the immunocompromised patient as they manifest high level of disease susceptibility. particularly the organ transplant patients encounter the greater chances of nocardiosis. the casualty of organ recipient patients for nocardia infection often reflects in higher degree (husain et al. ) . consequently, the causing pathogen also highly resistance to several well known, market available antibiotics (husain et al. ) . greater survivability, infections and antibiotic resistance nature of the bacterial pathogen is one of the greatest apprehensions for medical biotechnologist. present advanced, computational analysis assisted research emphasizes much on the discovery of effective vaccine against to n. asteroides. the virulence factor mce family protein of n. asteroides is responsible for the pathogenesis of the bacteria to other organism. hence this protein served as the supreme component for designing epitopic vaccine against nocardiosis. the virulence factor mce family protein is processed through several bioinformatic tools to execute the suitable epitopes having antigenicity. the transmembrane localization of the protein permits it to intermingle with exact immune system. the transmembrane helix prediction server tmhmm server (ver. . ) attests the exomembrane localization of the protein component. both the b and t-cell epitopes are considerate for obtaining the maximum immune response through humoral and cell mediated immunity. the epitopes were identified through sequence based prediction method using the several web prediction servers. after retrieving the protein sequence of virulence factor mce family protein (genbank: sfl . ) from ncbi database, it is processed through the abcpred server for b-cell epitope motif. the numbers of b-cell epitopes having considering threshold value are again submitted to propred and propred-i servers simultaneously for mhc-ii and mhc-i binding allele (barh et al. ) . the common epitopes are shortened to only mers because of propred prediction module provides only meric epitopes. the epitopes vlgss-vqta, vnielkpef and vvpsnlfav were selected for vaccine designing after accurate validation against antigenic property through vaxijen server. these are highly antigenic (vlgssvqta = . , vnielkpef = . and vvpsnlfav = . ) being laid over the customized threshold antigenic score. this research design reveled that, three epitopes are the finest vaccine components as identified by b-cell and mhc molecules. the motif of the particular epitopes was mapped out through distill . for conformational uniqueness and molecular docking. intend to proper binding of epitopes with the t cell receptor delta chain reflects through the significantly lower ace values. the structural profile of virulence factor mce family protein also been mapped out via phyre server that will be fundamental for therapeutics to design the vaccine. the epitopes vlgssvqta, vnielkpef and vvpsnlfav will be much more effectual for perfect designing of epitopic vaccine against n. asteroides limiting nocardiosis and subsequent casualties linked with it. specialized immunoinformatic studies focuses the virulence factor mce-family protein of n. asteroides established its significances lead to expression of bacterial pathogenicity. existing research will be surely valuable in modern therapeutics purposes, to resist the nocardiosis outbreak. particular manifestation of antigenicity by the common b and t-cell epitopes (vlgssvqta, vnielk-pef and vvpsnlfav) substantiates the critical aptitude to generate humoral and cell mediated immunity. consequently, the targeted epitopes assist for easy interaction with the immune receptors favor to transmembrane localization of protein element. literal structural signature of considerable protein along with its epitopes served as decisive factor for novel vaccine development. however, the epitopes requires substantial in vivo and in vitro justification for accurate refinement to generate finest vaccine component restricting the nocardia infection. such computer aided research techniques are also highly influential and efficient for designing of desired epitopic vaccine against several associated diseases in light of immunoinformatics. from zikv genome to vaccine: in silico approach for the epitope-based peptide vaccine against zika virus envelope glycoprotein a novel strategy of epitope design in neisseria gonorrhoeae distill: a suite of web servers for the prediction of one-, two-and three-dimensional structural features of proteins modelling of three-dimensional structures of cytochromes p b and b mandell, douglas, and bennett's principles and practice of infectious diseases: -volume set the protein data bank computational characterization of epitopic region within the outer membrane protein candidate in flavobacterium columnare for vaccine development homology modeling and docking study of xanthine oxidase of arthrobacter sp. xl prediction of linear b-cell epitopes using amino acid pair antigenicity scale identification of t-and b-cell epitopes of the e protein of human papillomavirus type database resources of the national center for biotechnology information pymol: an open-source molecular graphics tool a cohesive and integrated platform for immunogenicity prediction. vaccine design vaxijen: a server for prediction of protective antigens, tumour antigens and subunit vaccines efficient unbound docking of rigid molecules murine polymorphonuclear neutrophils produce interferon-γ in response to pulmonary infection with nocardia asteroides nocardia asteroides cerebral abscess in immunocompetent hosts: report of three cases and review of surgical recommendations exploiting the burkholderia pseudomallei acute phase antigen bpsl for structure-based epitope discovery/design in structural vaccinology protein-protein binding site identification by enumerating the configurations identification of immunodominant b-cell epitope regions of reticulocyte binding proteins in plasmodium vivax by protein microarray based immunoscreening nocardia infection in lung transplant recipients identification of b-cell epitope of leishmania donovani and its application in diagnosis of visceral leishmaniasis prediction of b-cell epitopes in listeriolysin o, a cholesterol dependent cytolysin secreted by listeria monocytogenes the phyre web portal for protein modeling, prediction and analysis epitopebased peptide vaccine design and target site depiction against ebola viruses: an immunoinformatics study predicting transmembrane protein topology with a hidden markov model: application to complete genomes understanding rifampicin resistance in tuberculosis through a computational approach prediction of mhc-peptide binding: a systematic and comprehensive overview pro-check: a program to check the stereochemical quality of protein structures procheck: validation of protein-structure coordinates detection of peptide-binding sites on protein surfaces: the first step toward the modeling and targeting of peptidemediated interactions structure of the vδ domain of a human γδ t-cell antigen receptor evaluation of mhc class i peptide binding prediction servers: applications for vaccine research clinical experience with linezolid for the treatment of nocardia infection propred analysis and experimental evaluation of promiscuous t-cell epitopes of three major secreted antigens of mycobacterium tuberculosis identification of putative vaccine candidates against helicobacter pylori exploiting exoproteome and secretome: a reverse vaccinology based approach exploration of freely available web-interfaces for comparative homology modelling of microbial proteins influenza vaccination and reduction in hospitalizations for cardiac disease and stroke among the elderly reverse-vaccinology strategy for designing t-cell epitope candidates for staphylococcus aureus endocarditis vaccine soil-acquired cutaneous nocardiosis on the forearm of a healthy male contracted in a swamp in rural eastern virginia ucsf chimera-a visualization system for exploratory research and analysis porter: a new, accurate server for protein secondary structure prediction improving the prediction of protein secondary structure in three and eight classes using recurrent neural networks and profiles schmidek and sweet: operative neurosurgical techniques -volume set: indications, methods and results in silico identification of catalytic residues in azobenzene reductase from bacillus subtilis and its docking studies with azo dyes homology modeling, model and software evaluation: three related resources cofactor: an accurate comparative algorithm for structure-based protein function annotation prediction of continuous b-cell epitopes in an antigen using recurrent neural network membrane association and epitope recognition by hiv- neutralizing anti-gp f and e antibodies nocardiosis: review of clinical and laboratory experience taking geometry to its edge: fast unbound rigid (and hinge-bent) docking patchdock and symmdock: servers for rigid and symmetric docking pleuroplumonary nocardiosis in an immunocompetent host egf domain ii of protein pb from plasmodium berghei interacts with monoclonal transmission blocking antibody . propred: prediction of hla-dr binding sites propred : prediction of promiscuous mhc class-i binding sites a hidden markov model for predicting transmembrane helices in protein sequences design of novel multi-epitope vaccines against severe acute respiratory syndrome validated through multistage molecular interaction and dynamics computer-aided biotechnology: from immuno-informatics to reverse vaccinology prosa-web: interactive web service for the recognition of errors in three-dimensional structures of proteins vadar: a web server for quantitative evaluation of protein structure quality nocardiosis: updates and clinical overview epitope mapping of two immunodominant domains of gp , the transmembrane protein of human immunodeficiency virus type , using ten human monoclonal antibodies immunogenicity prediction by vaxijen: a ten year overview key: cord- -i z tdrk authors: dangi, mehak; kumari, rinku; singh, bharat; chhillar, anil kumar title: advanced in silico tools for designing of antigenic epitope as potential vaccine candidates against coronavirus date: - - journal: bioinformatics: sequences, structures, phylogeny doi: . / - - - - _ sha: doc_id: cord_uid: i z tdrk vaccines are the most economical and potent substitute of available medicines to cure various bacterial and viral diseases. earlier, killed or attenuated pathogens were employed for vaccine development. but in present era, the peptide vaccines are in much trend and are favoured over whole vaccines because of their superiority over conventional vaccines. these vaccines are either based on single proteins or on synthetic peptides including several b-cell and t-cell epitopes. however, the overall mechanism of action remains the same and works by prompting the immune system to activate the specific b-cell- and t-cell-mediated responses against the pathogen. rino rappuoli and others have contributed in this field by plotting the design of the most potent and fully computational approach for discovery of potential vaccine candidates which is popular as reverse vaccinology. this is quite an unambiguous advance for vaccine evolution where one begins with the genome information of the pathogen and ends up with the list of certain epitopes after application of multiple bioinformatics tools. this book chapter is an effort to bring this approach of reverse vaccinology into notice of readers using example of coronavirus. it compelled us to apply the well-known reverse vaccinology (rv) approach on available proteome of coronavirus. rv approach has been successfully applied on many prokaryotes, but there are very few known applications on eukaryotes and viruses. so, it is worthwhile to explore the potential of this approach to identify potential vaccine candidates for coronavirus. rv basically does the in silico examination of the viral proteome to hunt antigenic and surface-exposed proteins. this approach was initially applied successfully to neisseria meningitidis serogroup b (kelly and rappuoli ) against which none of the prevailing techniques could develop a vaccine. the present book chapter is intended to explore the potential of rv approach to select the probable vaccine candidates against coronavirus and validate the results using docking studies. undoubtedly, the traditional approaches for vaccine development are fortunate enough to efficiently resist the alarming pathogenic diseases of its time. however, the traditional approach suffers from certain limitations like it is very timeconsuming, the pathogens which can't be cultivated in the lab conditions are out of reach, and certain non-abundant proteins are not accessible using this approach (rappuoli ) . consequently, a number of pathogenic diseases are left without any vaccine against them. all these limitations are conquered by reverse vaccinology approach utilizing genome sequence information which ultimately is translated into proteins. hence all the proteins expressed by the genome are accessible irrespective of their abundance, conditions in which they expressed. the credit of fame of reverse vaccinology should go to the advancements in the sequencing strategies worldwide. accordingly, improvement in the sequencing technologies has flooded the genome databases with huge amount of data which can be computationally undertaken to reveal the various crucial aspects of the virulence factors of the concerned pathogen. reverse vaccinology is based on same approach of computationally analysing the genome of pathogen and proceeds step by step to ultimately identify the highly antigenic, secreted proteins with high epitope densities. the best epitopes are selected as potential vaccine candidates (pizza et al. ) . this approach has brought the unapproachable pathogens of interest in spotlight and is evolving as the most reassuring tool for precise selection of vaccine candidates and brought the use of peptide vaccines in trend (sette and rappuoli ; kanampalliwar et al. ). bexsero is the first universal serogroup b meningococcal vaccine developed using rv, and it has currently earned positive judgement from the european medicines agency (gabutti ) . whether it is discovery of pili in gram-positive pathogens which were thought to not have any pili or the sighting of factor g-binding protein in meningococcus (alessandro and rino ), the reverse vaccinology steals all the credits from other conventional approaches. most of the applications of rv are against prokaryotes and very few against eukaryotes and viruses because of complexity of their genome. corynebacterium urealyticum (guimarães et al. ) , mycobacterium tuberculosis (monterrubio-lópez et al. ) , h. pylori (naz et al. ) , acinetobacter baumannii (chiang et al. ) , rickettsia prowazekii (caro-gomez et al. ) , neospora caninum (goodswen et al. ) and brucella melitensis (vishnu et al. ) are the examples of some pathogens that are recently approached using this in silico technique in order to spot some epitopes having potential of being a vaccine candidate. herpesviridae (bruno et al. ) and hepatitis c virus (hcv) (kolesanova et al. ) are the examples of the viruses that are addressed using this approach. (altschul et al. ; okonechnikov et al. ; golosova et al. ) . multiple sequence alignment (msa) was done via clustalw, and the phylogenetic tree was constructed using nj method from unipro ugene . . bioinformatics toolkit (okonechnikov et al. ). analysis of secondary structure of the proteins of seed genome was done by means of expasy portal. the aim is to forecast the solvent accessibility, instability index, theoretical pi, molecular weight, grand average of hydropathicity (gravy), aliphatic index, number of charged residues, extinction coefficient etc. (http://web. expasy.org/protparam/; gasteiger et al. ) . virus-mploc was used to identify the localization of proteins of virus in the infected cells of host (http://www.csbio.sjtu.edu.cn/bioinf/virus-multi/; hong-bin shen and kuo-chin chou ) . this information is important to understand the destructive role and mechanism of the viral proteins in causing the disease. in total six different subcellular locations, namely, host cytoplasm, viral capsid, host plasma membrane, host nucleus, host endoplasmic reticulum and secreted proteins, were covered. these predictions could help in formulation of better therapeutic options against the virus. as per the protocol of rv, secreted and membrane proteins are of special interest, therefore, filtered for further analysis. to predict the number of transmembrane helices tmhmm server v. . (http://www.cbs.dtu.dk/services/tmhmm/; krogh et al. ) was used. signal peptides are known to impact the immune responses and possess high epitope densities. moreover, most of the known vaccine candidates also possess signal peptides. hence, it is worthwhile to predict signal peptides in proteins prior to epitope predictions. signal-blast web server is used to predict the signal peptides without any false predictions (http://sigpep.services.came.sbg.ac.at/signalblast.html; frank and sippl ) . the prediction options include best sensitivity, balanced prediction, best specificity and detect cleavage site only. we choose to make the predictions using each option, and the proteins predicted as signal peptide by all the four options were preferred for further investigation. the most appropriate targets as vaccine candidates are those which possess the adhesion-like properties because they not only mediate the adhesion of pathogen's proteins with cells of host but also facilitate transmission of virus. adhesions are known to be crucial for virulence and are located on surface which makes them promptly approachable to antibodies. the stand-alone spaan with a sensitivity of % and specificity of % was used to carry out the adhesion probability predictions, and the proteins with having adhesion probabilities higher than or equal to . were selected (sachdeva et al. ). betawrap motifs are dominant in virulence factors of the pathogens. if the proteins are predicted to possess such motifs, then they are appropriate to be taken under reverse vaccinology studies. betawrap server is the only online web server to make such predictions. the proteins having p-value lower than . were anticipated to contain betawraps (http://groups.csail.mit.edu/cb/betawrap/betawrap.html; bradley et al. ). for added identification of the antigenic likely of the proteins, they were subjected to vaxijen server version . . it is basically an empirical method to hunt antigenic proteins. so, if the proteins are not found antigenic using other sequence-based methods, then they can be identified using this method. this step confirms the antigenicity of proteins selected using above-mentioned steps (http://www.ddgpharmfac.net/vaxijen/vaxijen/vaxijen.html; doytchinova and flower ). for being a probable vaccine candidate, the protein should not exhibit the characteristics of an allergen as they trigger the type- hypersensitivity reactions causing allergy. therefore, to escape out such possibilities, the proteins were also subjected to allergenicity predictions using allertop (http://www.pharmfac.net/ allertop; dimitrov et al. ) and algpred tools (http://www.imtech.res.in/ raghava/algpred/submission.html; saha and raghava a, b). to check whether the filtered proteins possess any similarity to host proteins or not, the standard blastp (http://blast.ncbi.nlm.nih.gov/blast) searches were performed. in case of sequence similarity, there is a feasibility of generation of immune responses against own cells. predicting the epitopes binding to mhc class i is the main decisive phase of the rv to carry out valid vaccine predictions. the predicted epitopes were docked with receptor that is hla-a* using cluspro (http://cluspro.bu.edu/login.php; kozakov et al. ) that is an automated protein-protein docking web server. the literature searches provided the information of conserved residues of the receptor site. the default parameters were used for docking (comeau et al. a, b; kozakov et al. ). a total of different sequenced strains of coronavirus are available at ncbi. among them strains are pathogenic to humans. various information regarding source, host and collection of these strains are presented in table . and . . this information can be obtained from ncbi's genome database, the virus pathogen database and analysis resource and genomes online database (liolios et al. ; pickett et al. ) . the mers strain is taken as seed genome as it is the most prevalent and disastrous strain among others. its proteome consists of total proteins as shown in table . . the results of sequence similarity to reveal orthologs using blastp are shown in table . . the sequences with greater than % identity score are considered as homologs. the phylogenetic tree is depicted in fig. . and the mers-cov, taken as seed genome, found clustered with different bat coronaviruses. the results of analysis of secondary structure of the proteome using expasy tools are shown in the table . . from the analysis of charge on the residues and ph values, it is concluded that six of the proteins are basic and positively charged unlike allergens which are acidic in nature. however, five proteins are acidic and show negative charge. the negative gravy score of five proteins justify them to be of hydrophilic nature with majority of the residues positioned towards the surface. for the rest of six proteins, the gravy score is positive; it means that these are the accession number and identity of orthologs obtained in different strains is shown in the table hydrophobic proteins. the proteins with less than value of instability index are quite stable than those with higher values. all the proteins are having the molecular weight less than kda except (yp_ . , yp_ . and yp_ . ). this exhibits the effectiveness of lightweight proteins as targets as they can be easily purified because of their low molecular weights. the protein yp_ . is reported as a spike glycoprotein. it is acidic with prominent negative charge, with negative gravy score which suggests its hydrophilicity and figure . depicts the subcellular localization of proteins of the seed genome, i.e. mers-cov. only one protein was predicted to be localized in host cytoplasm, four in host membrane, two in both host cell membrane and endoplasmic reticulum (er) while two in only er, and two are left unrecognized. the known spike protein is predicted to be localized in host er. from these results we decided to pick the proteins which are located in host membrane or were predicted to be localized in both host membrane and er. the two are known envelop protein and membrane protein from bibliographic studies, and along with that, the known spike protein was also included in the filtered results. out of the filtered proteins, only two (yp_ . and yp_ . ) contain more than two transmembrane helices, therefore filtered out. the results of transmembrane helices prediction are tabulated in table . . figure . depicts the subcellular localization of proteins of all the four selected genomes using virus-mploc prediction tool. the proteins that are predicted to possess the signal peptides by signal-blast web server are yp_ . and yp_ . . the results of signal-blast web server are tabulated in the table . . this step takes into account the concept of adhesion-based virulence. adhesions cause pathogen recognition and initiation of inflammatory responses by the host. spaan predicted (yp_ . and yp_ . ) out of proteins of mers strain as adhesive (table . ). only one protein (yp_ . ) was predicted to contain betawrap motifs within it (table . ). hence, it is considered virulent and might be responsible for initializing the infection in the host. a total of out of proteins of mers strain were predicted antigenic (prediction values greater than . ). the protein with accession number yp_ . and yp_ . were among the filtered proteins, however, not predicted antigenic, therefore filtered out. as a result, only four proteins (yp_ . , yp_ . , yp_ . and yp_ . ) were kept for further analyses. none of the proteins of mers-cov possessed any clue of allergenicity as per prediction results from algpred and allertop tools; it means that no vigorous immune responses will be mounted if the epitopes from these proteins will be adopted as vaccine candidates. none of the protein of mers strain shows similarity with the proteins of host that demonstrates that the epitopes from these proteins can safely elicit the required immune response without the hazard of autoimmunity. in total different -mer epitopes with potential to bind to receptors of both b-cell and t-cell were predicted. the list of the predicted epitopes can be found in the table . and are specific for mers-cov strain. all these epitopes displayed no conservancy with proteins of other human and non-human pathogenic strains. docking permits to reveal the binding energy or potency of connection among epitopes and the receptor in appropriate orientation. the cluspro docking server was used to dock the predicted epitopes against hla-a* . the structure of the receptor was available from pdb and was optimized before docking to free it from the complexed self-peptide ( u y, resolution . Å, bouvier et al. ). pepstr (peptide tertiary structure prediction server; kaur et al. ) was used to derive the tertiary structure of the predicted peptides. figure . depicts the quaternary structure of the receptor hla-a* with its conserved active site known to form complex with the peptides (bouvier et al. ). the binding energy results obtained after performing docking analysis are listed in table . . the -mer epitope vvcaitllv at site of protein yp_ . docked to the receptor with smallest amount of binding energy (À . ) and hydrogen bonds. the next epitope in the list was also from the same protein yp_ . at site , i.e. tllvcmafl. the predicted structure of the top potent epitopes on the basis of docking energy and the snapshots of docking results are displayed in figs. . , . , . , . and . . the most chief restriction for developing a safe and sound vaccine against any of the virus is to identify the protective antigens. the present study is an effort of application of reverse vaccinology approach to investigate a choice of coronavirus proteomes to identify possible vaccine targets. this technique has demonstrated to be a competent way to forecast different epitopes from the selected seed genome. these epitopes are from spike glycoprotein, ns protein, ns b protein and envelope protein. unfortunately none of the epitope is found conserved in other strains, and all are specific to mers-cov. the docking analysis studies revealed perfect binding between hla-a* receptor and epitopes. the conserved residues of the receptor site are also involved in h-bonding with epitope residues. further, the selected antigenic epitopes must be validated using in vitro and in vivo studies to confirm their potential as vaccine candidates. review: reverse vaccinology: developing vaccines in the era of genomics basic local alignment search tool the middle east respiratory syndrome coronavirus -a continuing risk to global health security crystal structures of hla-a* complexed with antigenic peptides with either the amino-or carboxyl-terminal group substituted by a methyl group betawrap: successful prediction of parallel beta -helices from primary sequence reveals an association with many microbial pathogens geminiviridae. in: virus taxonomy-ninth report of the international committee on taxonomy of viruses lessons from reverse vaccinology for viral vaccine design discovery of novel cross-protective rickettsia prowazekii t-cell antigens using a combined reverse vaccinology and in vivo screening approach identification of novel vaccine candidates against acinetobacter baumannii using reverse vaccinology cluspro: a fully automated algorithm for protein-protein docking cluspro: an automated docking and discrimination method for the prediction of protein complexes middle east respiratory syndrome coronavirus (mers-cov): announcement of the coronavirus study group allertop v. -a server for in silico prediction of allergens vaxijen: a server for prediction of protective antigens, tumour antigens and subunit vaccines high performance signal peptide prediction based on sequence alignment techniques meningococcus b: control of two outbreaks by vaccination protein identification and analysis tools on the expasy server unipro ugene ngs pipelines and components for variant calling, rna-seq and chip-seq data analyses discovering a vaccine against neosporosis using computers: is it feasible? genome informatics and vaccine targets in corynebacterium urealyticum using two whole genomes, comparative genomics, and reverse vaccinology reverse vaccinology: basics and applications pepstr: a de novo method for tertiary structure prediction of small bioactive peptides reverse vaccinology and vaccines for serogroup b neisseria meningitidis way to the peptide vaccine against hepatitis c piper: an fft-based protein docking program with pair wise potentials the cluspro web server for protein-protein docking predicting transmembrane protein topology with a hidden markov model: application to complete genomes an integrative approach to ctl epitope prediction, a combined algorithm integrating mhc-i binding, tap transport efficiency, and proteasomal cleavage prediction improved method for predicting linear b-cell epitopes the genomes on line database (gold) v. : a monitor of genome projects worldwide identification of novel potential vaccine candidates against tuberculosis based on reverse vaccinology identification of putative vaccine candidates against helicobacter pylori exploiting exoproteome and secretome: a reverse vaccinology based approach database resources of the national center for biotechnology information unipro ugene: a unified bioinformatics toolkit scheme for ranking potential hla-a binding peptides based on independent binding of individual peptide side-chains virus pathogen database and analysis resource (vipr): a comprehensive bioinformatics database and analysis resource for the coronavirus research community identification of vaccine candidates against serogroup b meningococcus by whole-genome sequencing reverse vaccinology spaan: a software program for prediction of adhesins and adhesin-like proteins using neural networks algpred: prediction of allergenic proteins and mapping of ige epitopes prediction of continuous b-cell epitopes in an antigen using recurrent neural network reverse vaccinology: developing vaccines in the era of genomics virus-mploc: a fusion classifier for viral protein subcellular location prediction by incorporating multiple sites propred : prediction of promiscuous mhc class-i binding sites identification of potential antigens from non-classically secreted proteins and designing novel multitope peptide vaccine candidate against brucella melitensis through reverse vaccinology and immunoinformatics approach key: cord- -v f dx e authors: gupta, varsha; sengupta, manjistha; prakash, jaya; tripathy, baishnab charan title: production of recombinant pharmaceutical proteins date: - - journal: basic and applied aspects of biotechnology doi: . / - - - - _ sha: doc_id: cord_uid: v f dx e the proteins produced in the body control and mediate the metabolic processes and help in its routine functioning. any kind of impairment in protein production, such as production of mutated protein, or misfolded protein, leads to disruption of the pathway controlled by that protein. this may manifest in the form of the disease. however, these diseases can be treated, by supplying the protein from outside or exogenously. the supply of active exogenous protein requires its production on large scale to fulfill the growing demand. the process is complex, requiring higher protein expression, purification, and processing. each product needs unique settings or standardizations for large-scale production and purification. as only large-scale production can fulfill the growing demand, thus it needs to be cost-effective. the tools of genetic engineering are utilized to produce the proteins of human origin in bacteria, fungi, insect, or mammalian host. usage of recombinant dna technology for large-scale production of proteins requires ample amount of time, labor, and resources, but it also offers many opportunities for economic growth. after reading this chapter, readers would be able to understand the basics about production of recombinant proteins in various hosts along with the advantages and limitations of each host system and properties and production of some of the important pharmaceutical compounds and growth factors. the proteins produced in the body control and mediate the metabolic processes and help in its routine functioning. any kind of impairment in protein production, such as production of mutated protein or misfolded protein, leads to disruption of the pathway controlled by that protein. this may manifest in the form of the disease. however, these diseases can be treated, by supplying the protein from outside or exogenously. the supply of active exogenous protein requires its production on large scale to fulfi ll the growing demand. the process is complex, requiring higher protein expression, purifi cation, and processing. each product needs unique settings or standardizations for large-scale production and purifi cation. as only large-scale production can fulfi ll the growing demand, thus it needs to be cost-effective. the tools of genetic engineering are utilized to produce the proteins of human origin in bacteria, fungi, insect, or mammalian host ( fig. . ) . usage of recombinant dna technology for large-scale production of proteins requires ample amount of time, labor, and resources, but it also offers many opportunities for economic growth. after reading this chapter, readers would be able to understand the basics about production of recombinant proteins in various hosts along with the advantages and limitations of each host system and properties and production of some of the important pharmaceutical compounds and growth factors. in all living cells, the expression of gene occurs where genetic information contained in dna is passed on to rna in the process of transcription and from rna to protein in the process of translation . thus, the body synthesizes rna and then proteins according to the instructions from the dna. dna-dependent rna polymerase or rna polymerase carries the transcription . eukaryotes have three rna polymerases: polymerase i transcribes ribosomal rna genes ( s, . s and srrna), polymerase ii transcribes all proteincoding genes (mrna and small rna), and polymerase iii transcribes the genes for srrna and trna. in prokaryotic cells ( e. coli ), rna polymerase consists of fi ve subunits-two identical α subunits and one subunit each of β, β′, and σ subunit. the σ subunit dissociates after polymerization ensues. thus, the term "holoenzyme" is used for complete enzyme and "core enzyme" is used without σ subunit. the process of transcription is initiated by binding of rna polymerase to dna molecule at a very specifi c site called " promoters ." the promoters are critical for the start of the transcription. the promoter sites are nearly bp long and are mostly located before the fi rst base, which is copied into rna ( fig. . a ) . this fi rst base is called as start point of transcription and is denoted by + . promoters for rnapol ii allow differential expression of genes and determine the rate at which the genes are transcribed. there are some for expression of the cloned gene, the gene is with all the essential regulatory elements required for transcription of the gene. the gene is attached to the selective gene, which helps in the selection of the clones with gene of interest. the cells are screened for the synthesis of desirable product and then processed for large-scale production in the bioreactor with optimum condition for high yields promoters that cause the inserted genes to be expressed all the time; in all parts of the system, they are known as "constitutive" promoters. others allow expression only at certain stages/ certain tissue/organ of individuals and at certain time points. gene expression is under temporal and spatial regulation. in prokaryotes the position before start site at − and − can interact with σ subunit of holoenzyme of rna polymerase. in eukaryotes the sequence (analogous to − -consensus sequence of prokaryotes), tataaaa, is present at − position in the promoter region. eukaryotic promoter is shown (fig. . a ) with tata box forming the core promoter at − position (from − to − ), upstream of transcription start site. caat box and gc box are at approximately − - and − , respectively. the location of promoter is always on the same dna molecule which they regulate. they are referred as cis-acting elements. the spacing of various elements is more important and much is dependent on locus-specifi c activators, either at core promoter or at distant sites. various other signals as enhancers are also involved which are far apart from the target gene. they exert stimulatory effects on promoter activity and can be upstream, downstream or in the middle of the gene. promoters of housekeeping genes or genes with complex patterns of expression have cpg islands rather than tata box. the gene to be transcribed has ʹ-untranslated region ( ʹ utr), exons (coding region) separated by introns (noncoding region). the end has ʹ-untranslated region ( ʹ utr). in the cloning for gene expression, usually intronless version of the gene is used. in eukaryotes, the transcription is further enhanced by enhancers, which may be , - , bases away from the promoter region but are able to affect the rate of transcription . the simplest host for the work of recombinant dna technology is prokaryotic bacterial system . in the early s, the fi rst recombinant fdaapproved pharmaceutical, the human insulin (humulin-us/humuline-eu), was obtained from genetically engineered escherichia coli ( e. coli ) for treatment of diabetes . due to increasing demand, many strains of microbial species are being designed with increased throughput and better recovery of the therapeutic protein from large-scale culture. the recombinant proteins approved by fda are obtained either from escherichia coli or other prokaryotes; from saccharomyces cerevisiae or other fungal species; from insect cells , mammalian cells , or human cells ; or from transgenic plants or animals. cloning and production of protein in a particular host system are dependent upon a property of host to clone and express the desirable size of protein-encoding genes; production of correctly modifi ed, folded, and functional protein; high yield of the protein; and low-cost requirements. the choice of host systems requires best system, which can fulfi ll the requirements [ ] . the advantages of producing proteins using recombinant dna technology are: • as human gene may be cloned and expressed, it minimizes the risk of immune reaction and the specifi c activity of the protein is high. • the therapeutic protein can be produced efficiently, maintaining its cost-effectiveness. • it minimizes the risk of transmission of unknown pathogens present in animal and human sources. • appropriate modifi cation with higher specifi city, increased half-life, and improved functionality. • allow to create critical changes for better specifi city and activity. expression and production of eukaryotic protein require cloning of its cdna in an expression vector and subsequent transfer of the vector in suitable host. dna is modifi ed, cloned, and expressed in other host for the production of the protein; thus, optimum production conditions are required in each host. however, there are yield variations in different expression systems but high-level expression of the protein may be achieved by considering the following points: • though various host systems are available for production of recombinant proteins, microbial hosts offer several advantages over other systems, as production is fast, cheap, and economic: . the molecular biology and physiology is well characterized and documented. . easy to maintain and manipulate. . utilizes inexpensive nutrition sources. . rapid growth and biomass accumulation to achieve high cell densities. . scale-up is easy and convenient. . their expression machinery can be with variety of strong inducible promoters . as with the cellular structure of bacteria, it can rapidly adapt to culturing conditions with very short replication time ( min). the media requirement of bacterial cell is simple and consists of simple carbon and nitrogen source. thus, the overall inputs in bacteria are % lower than for mammalian cells . some of the the inducible promoter may require an inducer, or the depletion or addition of a specifi c nutrient, or ph change or changes in physicochemical factors in order to initiate the process of gene expression. the inducible systems suffer from the disadvantage that chemical inducers may be expensive and toxic and would require elimination during downstream processing when the product is intended for human usage. thus, usage of thermoregulated systems has been used for production of recombinant pharmaceutical proteins as expression is dependent upon strong heatregulated promoter minimizing the risks of any addition of chemical agent. approved bacteria-derived (by either the european union or fda, usa) therapeutics include hormones (human insulin and insulin analogs, calcitonin, human growth hormone , glucagons, parathyroid hormone , somatropin, and insulin-like growth factor ), interferons (alfa- , alfa- a, alfa- b, and gamma- b), interleukins and , light and heavy chains raised against vascular endothelial growth factor-a, tumor necrosis factor alpha, cholera b subunit protein, granulocyte colony-stimulating factor, and plasminogen activator [ ] . the microbe of fi rst choice for production of recombinant protein is enterobacterium e. coli . the system offers quick and easy modifi cations, ease of growing in manageable environmental conditions and short life cycle. the bacterial cell can tolerate and adapt to changes in the environment rapidly, thus scale-up is easier. however, the system suffers from some of the disadvantages [ , ] . . human or mammalian genes cloned in bacteria cannot undergo splicing due to lack of splicing machinery, thus intron-less version of the gene is cloned for optimum results (fig. . b ). . the signals involved in transcription of genes may vary; thus, the gene of interest is usually fused with bacterial gene under the control of its promoter , and the protein is obtained as a fusion product, which can be later cleaved, purifi ed, and used. . lac promoter is one of the most popular bacterial promoters . however, for high-level expression, t promoter is also preferred (present in pet vectors . for addition of amino acids during the process of translation , as there are more than one codon for several amino acids, thus, codon biasing occurs which is the preference of a particular codon of amino acid in a particular species (fig. . c ). . complexity: eukaryotic cells have the advantage of producing fully functional and properly folded proteins. the antibodies with four subunits may be secreted by eukaryotic cells in fully functional form. on the other hand, it is very diffi cult to obtain multidomain protein from e. coli . even if protein is obtained, the renaturation and folding in laboratory condition may either be very expensive or protein may lack its activity. . lack of posttranslational modifi cations (ptms) is the problem, which cannot be solved, and is mandatory for activity of many therapeutic proteins. the glycosylation is the most common modifi cations, and others are phosphorylation and formation of disulfi de bond, which are essentially required for the full functional capability of many human proteins. ptms play an important role in proper protein folding, processing, stability, tissue targeting, activity, immune reactivity, and half-life of the protein. lack of these results in insoluble, unstable, or inactive product. however, the n-linked glycosylation system of campylobacter jejuni has been successfully transferred to e. coli , thus opening a possibility for the production of glycosylated protein in it. certain mutant e. coli are being developed to promote disulfi de bond formation (ad , origami, rosetta-gami) with reduced protease activity (blz ). . overproduction of recombinant protein in bacteria might result in the loss of solubility and deposition of many protein species as protein aggregates or inclusion bodies. alteration in growth conditions might render the product in insoluble form. many eukaryotic proteins are found trapped in inclusion bodies with resistance to further processing. success has been obtained in purifi cation of insulin and betaferon from inclusion bodies. the retrieval of proteins using denaturating condition with subsequent refolding and renaturation might not always be easy and prove to be extremely expensive. . with the e. coli host, it is very diffi cult to obtain protein larger than kda in soluble form. due to certain limitations for production of proteins in e. coli , other host systems are being discussed for production of proteins. and two n-acetylglucosamine residues. the core may attach to different oligosaccharides to form different glycoproteins. man glcnac glcnac --asn-pentasaccharide core glycosylation is one of the important posttranslational modifi cations, which occurs inside the lumen of the endoplasmic reticulum (er) and in golgi complex. the er and golgi complex are important in protein targeting and transport. n-linked glycosylation starts in the endoplasmic reticulum and continues in the golgi complex after the polypeptide is synthesized on ribosomes. however, o-linked glycosylation exclusively occurs in the golgi complex. in the process, oligosaccharide to be attached to the protein associates with a specialized lipid present in er, dolichol phosphate, which consists of about isoprene (c ) units. through phosphate of dolichol phosphate, oligosaccharide is transferred to specifi c asparagine residue of polypeptide chain on ribosomes. the enzyme responsible for glycosylating protein and activated oligosaccharide are located on the lumen side of er. then these are transported to the golgi complex, where the carbohydrate units are altered and fi nalized. golgi complex is responsible for o-linked sugar attachment and modifi cation of n-linked sugar. then the proteins are targeted and transported to their destination . covalent attachment of carbohydrate group to the protein to form glycoprotein is called glycosylation. in glycoproteins, proteins constitute a major fraction. these play important roles in various physiological processes and are components of cell membranes. the carbohydrates commonly attached to proteins may be fucose (fuc), galactose (gal), n-acetylgalactosamine (galnac), glucose (glc), n-acetylglucosamine (glcnac), mannose (man), and sialic acid (sia). these sugar moieties may associate through amide nitrogen atom of side chain of asparagine (asn) termed as n-linked glycosylation or to the oxygen atom in the side chain of serine (ser) or threonine (thr) termed as o-linked glycosylation. not all the asparagine (asn) present in polypeptide can accept the carbohydrate moiety. the residues with the sequence asn-x-ser or asn-x-thr, where x is any amino acid except proline, are targets for glycosylation. not only the residual sequence but other aspects of the structure of the protein and cell type determine the glycosylation site. all the n-linked sugar residues have a common core of pentasaccharides. these pentasaccharide consists of three mannose (continued) due to the problems encountered in e. coli for production of larger proteins or modifi ed proteins, the next cost-effective, fast, high-density, and easy to handle system is of fungi. saccharomyces cerevisiae ( yeast ) was the system of choice when it was diffi cult to obtain therapeutic protein in soluble form and with appropriate posttranslational modifications in bacterial host. in yeast , mutants are available which can give high yield. the approved products obtained from yeast are hormones , vaccines, recombinant granulocyte macrophage colony-stimulating factors (gm-csf), albumin, hirudin, and platelet-derived growth factor ( pdgf ). the advantages of s. cerevisiae are the following: ( ) it secretes recombinant protein in the culture, ( ) protein is properly folded, and ( ) it performs most posttranslational modifi cations. with the yeast system, high amounts of recombinant protein are obtained, and yeast is also capable of performing posttranslational modifi cations as o-linked glycosylation, phosphorylation, acetylation, and acylation but differs drastically in patterns of n-linked glycosylation ( fig. . ). chaperones are family of highly conserved different proteins. the important functions of chaperones are: • prevention of aggregation and misfolding of newly synthesized polypeptide chain. • they prevent irreversible aggregation of nonnative conformation and maintain the protein on the productive folding pathway. • they prevent nonproductive interactions with other components of the cell. • they help and guide the direct assembly of multisubunit protein complexes and larger proteins. • the chaperones involved in folding recognize nonnative substrate proteins mainly via their exposed hydrophobic residues. the major classes of molecular chaperones are: heat shock proteins are present in a variety of systems and prevent damage to the proteins under high heat. (continued) differences in n-glycosylation in yeast are with high or hypermannose which is highly immunogenic. unmodifi ed proteins are suitable for production in yeast . other members of fungi are pichia pastoris , pichia methanolica , candida boidinii , and pichia angusta , which are facultative methylotrophic yeast having great potential. pichia pastoris is favored as high cell densities can be obtained; protein is secreted in high concentration ( g/l), less hypermannosylation as compared to yeast and thus less immunogenic. however, the disadvantage is that it requires methanol to induce gene expression as transgene is under the control of the promoter of alcohol oxidase (aox ) gene. methanol may be fl ammable and is toxic to cells and humans if not thoroughly removed. because of hypermannosetype glycosylation , the fungi are also unsuitable for production of many recombinant proteins . insect cells : insect cell can be infected with baculoviruses which are double-stranded circular dna viruses with arthropods as host. baculovirus-mediated gene expression in insects is a method of choice and is cost-effective, giving the much higher yield of recombinant protein compared to other systems. it is possible to produce large protein resulting in production of correctly processed and biologically active protein. a baculovirus autographa californica nuclear polyhedrosis virus (acmnpv) is used as a cloning vector for insect cell lines. in this viral polyhedron protein is used, which is required in its normal habitat and exhibits high rate of transcrip- • it is a bacterial chaperone. • it binds with partially folded and misfolded proteins. • for its functionality groel requires its cofactor groes. • it is tetradecameric mitochondrial chaperonin. • it is implicated in protein import and macromolecular assembly. • required for folding of precursor polypeptides in atp-dependent manner. • prevents aggregation and mediates refolding of protein after heat shock. • they are central components of the cellular network of folding catalysts and molecular chaperones . • they assist in different types of processes of protein folding in the cell by transient association of their substrate binding domain with short hydrophobic peptide. • they bind and release their substrate by switching to low-affi nity atp-bound state and the high-affi nity adp-bound state. • they form complex network of folding machines [ ] . • it is highly abundant chaperone. • it plays an important role in many cellular processes, for example, cell cycle control, cell survival, and hormone and other signaling pathways. • it is a key player in maintaining cellular homeostasis during stress . • has atpase activity, whose binding and hydrolysis affects conformational dynamics of the protein. • it has become a major therapeutic target for cancer, and its role is being explored in neurodegenerative disorders and infectious diseases [ ] . • this is eukaryotic chaperonin tailless complex polypeptide (tcp ) ring complex (tric). • it facilitates the proper folding of many cellular proteins. (continued) tion, but is not needed in cell culture. thus, the coding sequence of the gene is replaced with foreign dna. the gene is transcribed under the control of powerful polyhedron promoter with high yields (~ % of total cell protein). the observed yield may be variable due to the course of virus infection and viral titer. the production of recombinant protein in insect cell is time consuming (as compared to bacterial system ) as cell growth is slow and the cost of medium is high. every time fresh cells are required, viral infection is lethal for cells. it also has limitations in performing posttranslational modifi cations as it performs non-syalated n-linked glycosylation . all the other optimizations need to be perfect as yield depends upon the virus titer and time taken from infection to expression. insect cells are preferred when active protein is diffi cult to obtain in e. coli system. genetic engineering has been used to select mimic™ (invitrogen) and sfswt- , which are transgenic cell lines expressing all necessary enzymes to obtain humanized, complex n-linked glycosylation pattern. the system has been extensively used for structural studies as correctly folded eukaryotic proteins may be obtained in secreted form simplifying purifi cation protocols. some of the approved biopharmaceuticals from infected insect cell line hi five are cervarix (recombinant papillomavirus c-terminal truncated major capsid protein l types and , used as cancer vaccine). glycosylation is a problem which is encountered when insect cells are used for production of recombinant human glycosylated proteins. lots of genetic engineering is required to produce humanlike glycosylation in insect cell . thus, the preferred system for therapeutic human protein production is mammalian system (chinese hamster ovary cell line ). due to time and diffi cultly in maintaining insect cells , the mammalian cells were explored for production of recombinant protein. mammalian cells , because of their properties of protein folding, assembly, and posttranslational modifi cations, have become the preferred system for protein production and are now accounting for major recombinant protein production . mammalian cells : production of complex proteins requires extensive processing and posttranslational modifi cations. mammalian cells have the advantage of performing ptms (fig. . ) correctly; they secrete recombinant protein into the medium in their natural form, thus skipping the critical steps of renaturation and refolding which sometimes leads to inactive proteins. therefore, major therapeutic proteins ( - %) are produced in mammalian cells primarily chinese hamster ovary (cho) cells and baby hamster kidney cells (bhk). cho cells are relatively easy to manipulate and their properties favor large-scale production in them [ ] . the proteins produced are safe to use in humans with no adverse reactions because of similar glycosylation pattern. chinese hamster ovary (cho) and baby hamster kidney (bhk) cells are the prominent producers of recombinant proteins (fig. . ) . the recombinant therapeutic product used for clinical applications was produced from mammalian cells. mammalian cells were maintained in serum-/blood-/plasma-based medium; therefore, the presence of any infectious agent in the product might be deleterious if not properly removed. infections may range from hiv , coronavirus (severe acute respiratory syndrome (sars), non-lipid-enveloped (nle) viruses as circoviruses (torque tenovirus (ttv) and torque tenominivirus (ttmv)), hbv, hcv, htlv (human t-cell lymphotropic virus), to west nile virus. prions that are self-replicating infectious proteins may also be present which may lead to variant creutzfeldt-jakob disease (vcjd). pathogen transmission was a major concern in the manufacture of blood-derived coagulation factor. in the early s, the factor replacement products derived from plasma, which were used to treat hemophilia, were found to be contaminated with hiv , hbv, and hcv viruses. in the year , up to % of us-based hemophilic patients were infected with hiv and - % were infected with hcv. parvoviruses, b (b v) and parv , were present as con-in mammalian cells, genes can be expressed either transiently or stably. obtaining stably transformed cell lines requires the usage of some selectable marker. another major advantage of these cells is they can be grown in suspension, in serum-free (sf), protein-free, and chemically defi ned media . the product is safe without the risk of prions of bovine spongiform encephalopathy (bse) from bovine serum albumin and infections of variant creutzfeldt-jakob disease (vcjd) from the human serum. presently due to virus and prions in the donor plasma or blood samples, the manufacturers are opting for plasma-/serum-free growth medium for culturing the cell lines . cells require some of these components (albumin, transferring) for their growth. nowadays recombinant human albumin is available like human insulin (produced in e. coli and yeast ) and yeast-derived animal-free recombinant human transferrin is available. these support plasma-free mammalian cell culture. else, cho cells may be engineered to produce its own transferring or insulin-like growth factor. the therapeutic protein obtained by mammalian cell is treated and tested for the presence of any pathogenic agents or viruses as contaminant. the therapeutic products obtained during processing are treated with various agents to remove inactive virus, but the presence of asymptomatic virus poses a serious risk. common virus inactivation technique is using solvent/ detergent, which is effective against lipidenveloped viruses such as human immunodeficiency virus (hiv) , hepatitis b and c virus (hbv and hcv), human tcell lymphotropic virus (htlv), and west nile virus (wnv). non-lipidenveloped viruses such as parvoviruses, enteroviruses, and circoviruses are resistant to inactivation via solvent detergent treatment. human herpesvirus , responsible for kaposi's sarcoma, is shown to be transmitted through blood and blood product. following outbreaks, strict requirements were imposed on manufacturers of biologics and medical devices. such concerns prompted manufacturers to switch to sugar-based fi nal formulations and develop recombinant plasma-free albumin produced in yeast for usage in biopharmaceutical manufacturing. plasma-free manufacturing involves the elimination of plasma derivatives in every step (like cell line development, upstream processing, downstream processing, and fi nal formulations) of the process with appropriate postproduction checks. the manufacturers shifted from the use of serum to serum-free cell culture media with animal product-free media and ultimately to protein-free, completely synthetic chemically defi ned media . the media consists of protein hydrolysates derived from yeast , soy, and wheat with amino acids, peptides, carbohydrates, vitamins, and essential elements, which are ultrafi ltered to remove any unwanted contaminants [ ] . (continued) ) , and replagal-alfa-galactosidase a (lysosomal hydrolase) have been approved by the european union (eu) or food and drug administration (fda, usa). as these products are fully glycosylated when expressed in human cell lines and used as therapeutics in human beings. transgenics for protein production the transgenic animals are successfully used for production of recombinant proteins (for details refer to chap. ). protein production poses great risk in terms of safety as transmission of infectious agents, allergic responses, immune reactivity, and autoimmune responses might occur. atryn was the only approved (approved in by european medical agency and in by fda) recombinant biopharmaceutical using transgenic animals. it contains human antithrombin ( aa) with % glycosylated moieties and is secreted into milk of transgenic goats. rhucin intended for acute attacks of angioedema in patients with congenital c inhibitor activity defi ciency, obtained from transgenic rabbit, was denied approval. transgenic plants are being explored as recombinant protein producers for research and diagnostic uses. obtaining therapeutic proteins from their natural source poses threat for spread of diseases. therefore, alternative systems for the production of therapeutic agents have their own benefi ts. molecular farming in plants has been widely explored for production of recombinant pharmaceutical proteins. their advantages are low cost, high mass production, scale-up, lack of human pathogens , and addition of eukaryotic ptms. the fi rst recombinant protein obtained in from tobacco plants was human growth hormone . however, sometimes plant-specifi c ptms might result in adverse immune reactivity. production of recombinant heterologous proteins in plants is simple and is used for production of non-naturally occurring proteins as single chain fv fragments (scfvs). high yield of recombinant protein is the main goal of production system in transgenic plants . therefore appropriate expression vectors and constructs are designed to achieve high yields of the engineered gene products [ , ] . taminant in plasma-derived factor viii. therefore, the production urgently required regulatory measures. then came fi rst recombinant factor viii , advate (baxter), in the usa in . advate was produced in cho cells grown in serum-free and protein-free medium with ultrafi ltered soybean peptides with subsequent purifi cation by immunoaffi nity chromatography. the usage of advate helped in eliminating the risk of transmitting emerging blood pathogens . prions pose a serious risk, as they are highly resistant to physical/chemical inactivation. early stage of prion infection is almost impossible to detect in plasma donor. iatrogenic transmission of prions has occurred in patients who received humanderived pituitary hormones as human growth hormone (hgh) and gonadotropins. cjd was transmitted to over recipients of cadaveric pituitary hgh before its withdrawal. cadaveric pituitaryderived gonadotropins for infertility were associated with iatrogenic transmission of cjd. later on cadaveric pituitary hgh and gonadotropins have been replaced with recombinant gh (produced in microbial system) and recombinant gonadotropins (produced in cho cell lines ). nowadays plant system is effi ciently engineered to produce human growth hormone , human serum albumin, erythropoietin, α-interferon, antibodies and scfvs, toxins, subunit vaccines , and insulin [ ] . with increasing demand from the consumer, the companies are trying to increase the productivity [ ] . few challenges with large-scale production of proteins are: the transgene in expression construct is chimeric structure as it is surrounded by various active regulatory elements. polyadenylation sites play an important role for the high level of expression of transgene. caulifl ower mosaic virus (camv) s promoter works well with dicots. it is a strong constitutive promoter that is made more active by duplicating the enhancer region. however, in monocots maize ubiquitin- promoter is the preferred promoter . the presence of an intron in the ′-untranslated region ( ′-utr) enhances transcription in monocots. for obtaining high yield of the protein, several factors may be appropriately considered: • the incorporation of polyadenylation sites may be from camv s transcripts or the agrobacterium tumefaciens nos gene or the pea ssu gene. • the yield can be controlled by placing the gene under the control of the promoter which is active in a particular tissue or developmental stage or particular environment (e.g., rice glutelin, pea legumin). • usage of inducible promoter (e.g., tomato hydroxyl- -methylglutaryl coa reductase- (hmgr )) which has mechanical gene activation system developed by cramer (crop tech corp., virginia, usa). transcription starts when harvested tobacco leaves are sheared during processing. • codon bias in the host plant may be overcome by engineering of transgene at positions, which might lead to truncation and/or misincorporation, or slowing the process. • subcellular targeting is also very important factor which affects the process of folding, assembly, and posttranslational modifi cation and can be effi ciently achieved by inclusion of an n-terminal signal peptide. • position of transgene integration. • structure of transgene locus. • gene copy number. • presence of truncated or rearranged transgene copies. • affi nity tags as his or the flag epitopes can be used to ease the process of purifi cation; however, these modifi cations not only affect the primary structure but also the properties of the protein. figure . shows the general vector pcambia (it is small in size ( - kb) maintained in high copy number with pvs replicon that imparts chloramphenicol or kanamycin resistance and high stability in agrobacterium ) that is used for transgene expression in the plants. the modifi ed vector has shown success in insulin production. (continued) . challenges of production of therapeutic proteins important that gene of interest should give adequate protein production. however expression may be lost due to many factors, like, if there are structural changes in the recombinant gene or inactivation or disappearance of the gene from host cell. other factors infl uencing yield may be increase in the copy number of insert, maintenance of optimum temperature, and toxicity of the expressed protein to the host. chromosomal integration of the foreign gene might overcome the problem of expression stability, but in plasmid -based system, high copy number leads to increased yield. . posttranslational processing: protein folding requires foldases (accelerates protein folding) and chaperones (prevents protein formation of nonnative insoluble folding intermediates). glycosylation is complex ptm requiring consecutive steps and enzymes. glycosylation is important as it determines protein stability, solubility, antigenicity , folding, localization, biological activity, and circulation half-life. getting correctly glycosylated and folded protein is required for therapeutic usage. in prokaryotes, with the discovery of n-glycosylation system in campylobacter jejuni , several other systems of o-glycosylation were unraveled in both pathogenic and symbiotic bacteria. the production of recombinant proteins is commonly done by using e. coli , yeast , or cell lines derived from insects (sf ), mice (sp / ), or cho, but obtaining fully human ptms is a challenging task. fig. . the fi gure shows the structure of pcambia vector (cambia.org) used for plant transformation. the vector has camv s promoter , multiple cloning site, and reporter gene (gus or gfp may be used). the vector can be modifi ed to express genes for insulin (tomato) or hep-b surface antigen (hbsag) for recombinant therapeutics providing a human glycosylation pattern have increased attention, and the efforts are being made for the development of novel glycoengineered cell lines for production of fully glycosylated protein therapeutic. . overexpression of therapeutic protein might result in the formation of inclusion bodies in prokaryotic system. rapid intracellular protein accumulation and expression of large proteins increases the probability of aggregation. aggregation protects proteins from proteolysis and can facilitate protein recovery. when the expressed protein is toxic to the host, the presence of protein in the inclusion body tends to protect it. precautions recombinant protein production requires some precaution resulting in a loss of yield and/or product: . contamination: at any stage, contamination might occur from any source. it poses big challenge and may be adverse when contaminated material is put for human usage. . immune response: for the therapeutic agent, the body can mount the immune reactions which lead to deposition of immune complexes in various tissues, and condition of anaphylactic shock might occur (e.g., when some essential agent is lost since birth like factor viii , then the patient might raise antibody response against the treatment). this also occurs when antibodies are used as therapeutic agents in the treatment of variety of cancers. . protein aggregation: any nonfunctional condition might result in aggregation or loss of activity of recombinant protein. after purifi cation, the proper modifi cations and refolding are required for therapeutics. . disulfi de bond formation: it stabilizes protein structure; thus, strategies for specifi c extracellular excretion pathway or overexpression of chaperones is required for optimum production. . degradation: sometimes proteins, which are active in host cells like proteases or proteinmodifying chemicals, might degrade the recombinant protein (e.g., peg interferon is modifi ed to have polyethylene glycol with prolonged presence and reduction in enzymatic degradation and renal clearance, thus extending its presence with lesser immunogenicity ) [ ] . ischemic stroke and myocardial infarction are one of the leading causes of cardiovascular morbidity and mortality in the world. important thrombolytic agents are urokinase (obtained from urine), tissue plasminogen activator (tpa), and streptokinase (obtained from bacteria). among these, t-pa is largely used commercially. plasminogen activators are serine proteases, which are responsible for conversion of inactive proenzyme plasminogen (plg-a single chain glycoprotein) to serine protease called plasmin (plm). the plasmin degrades the network of fi brin of the blood clots ( fig. . a ). there are two immunologically unrelated groups of plasminogen activators, the kda urokinase-type-pa (u-pa) and kda tissue-type pa (t-pa) (ec . . . ). the t-pa is the physiological vascular activator consisting of single polypeptide chain of kda consisting of amino acids. it shows strong activity in the presence of fi brin [ ] . the potential inhibitors of the thrombolytic cascade are type i plasminogen activator inhibitor (pai- or serpin e ) and pai- (secreted by placenta and present in signifi cant amount during pregnancy). they act by competing with t-pa for binding sites on fi brin thus preventing the fi brinolytic cascade. pai- complexes with t-pa for binding to fi brin. thus, truncated t-pa in which the residues responsible for interacting with pai- ( - ) are replaced with four alanine amino acids and three domains (fi nger, epidermal growth factor (egf) , and kringle domains)) are deleted, and chimeric tetrapeptide gly-his-arg-pro (ghrp) with high affi nity to fi brin was added. reteplase is the deletion mutant with a prolonged half-life, in which the fi nger, egf , and kringle domains of the full-length molecule are all deleted; thus, it is not inhibited. defi ciency of pai leads to over fi brinolysis and hemorrhagic diathesis (like defi ciency of clotting factors). tiplaxtinin is the inhibitor and is used for remodeling of blood vessels [ ] . plasminogen activator has great clinical relevance for the management of stroke and myocardial infarctions. for production of t-pa, e. coli and yeast system did not work properly due to lack of posttranslational modifi cation and over glycosylation , respectively. a novel truncated form of t-pa with an improved fi brin affi nity and an increased resis-tance to pai- was expressed in a cho dg expression system. therapeutic protein was produced in stably transfected cho dg cell lines . these cell lines were maintained in serum-free medium, with glutamine, hypoxanthine, and thymine in stirred tank bioreactor . the cells were grown at °c, rpm with % co and % humidity. the protein was then purifi ed, with higher yield (fig. . b ). factor viii is one of the important factors of all blood clotting factors. defi ciency of factor viii causes bleeding disorder called hemophilia a. the hemophilia may be mild to severe depend- ing upon factor viii concentration in the body. in moderate and severe factor viii defi ciency, there can be spontaneous bleeding episodes in the joints. hemophilia a affects in , - , males. replacement therapy is the treatment option for hemophiliacs either with human plasma-derived factor viii (pdfviii) or recombinant fviii (rfviii) [ ] . transfection of hek cell cultures in serum-free suspension is being tried for optimal yield. recombinant factor viii (rfviii) is produced by culturing mammalian cells as baby hamster kidney (bhk) or chinese hamster ovary cells (cho), using large-scale bioreactors . standardizations are done to maximize yields . insulin is a peptide hormone consisting of amino acids. it is secreted by β cells of islets of langerhans of pancreas. the hormone is responsible for maintaining normal blood glucose level in blood. insulin is stored in the form of proinsulin which contains two polypeptide chains, a and b, and is connected with a third peptide cchain), which before secretion is cleaved with production of insulin and c-chain. the cleavage results in the removal of c-chain, and the a ( amino acids) and b chain ( amino acids) are linked by disulfi de linkage to form mature insulin. in the beginning, efforts were made to isolate mrna for pre-and proinsulin from rat islets of langerhans of pancreas and to synthesize cdna. thereafter, it was inserted into a plasmid . the recombinant plasmids were transferred into the e. coli cells, which secreted proinsulin [ , , , ] . scientists have chemically synthesized dna sequences for two chains, a and b, of insulin and separately inserted into two pbr plasmids by the side of β-galactosidase gene. the recombinant plasmids were separately transferred into e. coli cells which secreted fused β-galactosidase-a chain and β-galactosidase-b chain separately. these chains were isolated in pure form by detaching from β-galactosidase with yields of about mg/ g of healthy and transformed cells. production of recombinant insulin is shown in (fig. . a, b ) . growth hormone is produced by the anterior lobe of the pituitary gland and is released in multiple pulses. the hgh is encoded by ghn gene cluster (an array of fi ve closely related genes), which is localized on chromosome . it belongs to diverse gene family that has evolved by gene duplication events and has lots of structural similarity and some common functions. of the various forms, the predominant form of hgh is kda protein of amino acids with two disulfi de bonds. gh does not control the functions directly, but acts on certain hormones or somatomedins for its activity, for example, insulin-like growth factor (igf- ). it has a wide spectrum of roles to play as promotion of long bone growth, promotion of normal sex organ development through puberty, regulation of metabolism, stimulation of tissue growth and repair, anabolic/anti-catabolic effect via improved nitrogen retention, modulation of bone mineral density and metabolism throughout life, proliferation of some cell types of the immune system, appetite stimulation, and breakdown of fat (lipolysis). in clinical conditions, the therapy of growth hormone is given in the treatment of dwarfi sm, bone fractures, skin burns, bleeding ulcers, and aids . recombinant human growth hormone (rhgh) is kda consisting of long chain of amino acids. it is used in defi ciency disorders of growth hormone . in children, it is used for growth abnormality as short stature and is used in chronic renal insuffi ciency. the therapy of growth hormone is also approved for adult growth hormone defi ciency. gh is one of the most widely used hormones with the estimated market of more than . billion usd. long experience in its administration has proven the therapy as safe and effective in various conditions of growth abnormality. earlier pituitary-derived hgh was used but later on it was prohibited when found associated with creutzfeldt-jakob disease. because of recombinant dna technology , safe and abundant recombinant hgh was produced in various heterologous systems. as the non-glycosylated human growth hormone was biologically active, thus, the preferred system for its production is e. coli , which allows its rapid and economical production in large amounts [ ] . recombinant hgh (rhgh) is now used to treat: • gh-defi cient (ghd) short-stature children. • acceleration of wound healing. • increase in insulin-like growth factor (igf)- levels. • rhgh increases igf- , osteocalcin, type i procollagen pro-peptide (picp), and bone density, when administered to children with ghd. for optimal productivity, strong inducible promoters are preferred as ipl, ipr, trc, and t in e. coli . they are advantageous as they drive overproduction of recombinant proteins. apart from e. coli , human somatotropin (hst) expression was tried in a biologically active, disulfi de-bonded form in tobacco chloroplasts. the hormone is used for the treatment of hypopituitary dwarfi sm in children; additional indications are in treatment of turner syndrome, chronic renal failure, hiv wasting syndrome, and possibly treatment of the elderly. growth hormone defi ciency in human occurs both in children and adults [ , ] . hematopoietic growth factors consist of cytokines and protein hormones produced by the body which govern the production and maturation of the various cells produced during the process of hematopoiesis in the bone marrow from hematopoietic stem cell. the precursor cells in the presence of a particular growth factor differentiate and become a specialized kind of cell as monocyte, macrophage, lymphocytes, or red blood cell. one of the important growth factor is erythropoietin which is a protein hormone produced by a specifi c type of cells in the kidney. in the presence of erythropoietin, progenitor cells are stimulated in the bone marrow to form mature erythrocytes (red blood cells). thus the patients with chronic kidney disease are unable to maintain adequate amount of erythropoietin for normal development of erythrocytes in blood, resulting in low numbers of red blood cells and subsequent anemia. these patients either require blood transfusion or erythropoietin from outside. as supply is limited from the natural source that is kidney cells, thus a recombinant human erythropoietin epogen® which is amgen's trade name for epoetin alfa is marketed for anemic condition involving erythropoietin. the human gene encoding erythropoietin was cloned into the chinese hamster ovary cell line for production of the human protein. this cell line continues to be used today for the production of epogen®. the half-life of erythropoietin can be increased by incorporating the glycosylation of the protein growth factor. thus, darbepoetin-α is an analog which is engineered for two extra amino acids which are substrates for glycosylation . thus, production is done in cho cell lines ; the product has fi ve n-linked sugar chains and has almost three times longer life than erythropoietin . pdgf regulates cell growth and division and plays a signifi cant role in blood vessel formation (angiogenesis), the growth of blood vessels from already existing blood vessel in tissue, and may act in autocrine and paracrine stimulation of cell growth in vivo. pdgf plays the role in development, cell proliferation, cell migration, and angiogenesis and has been linked to atherosclerosis, fi brosis, and malignant diseases. pdgf has fi ve different isoforms: pdgfa, pdgfb, pdgfc, pdgfd, and ab heterodimer. pdgf-a and pdgf-b have % similarity in amino acid sequence, but experiments suggest different biological functions for the two chains and different locations of these under different transcriptional controls. pgdf-aa is released in the medium, and pdgf-bb are insuffi ciently secreted and remain attached to the plasma membrane, and of these pgdf-bb and pdgf-ab are strong mitogens and are probably responsible for biological roles of pdgf. pdgf receptor (pdgfr) is receptor tyrosine kinase (rtk) (alpha and beta type). upon activation by pdgf, these receptors dimerize leading to autophosphorylation of several sites on their cytosolic domain. pdgf being a mitogen promotes the proliferation of fi broblasts and smooth muscle cells in vitro. pdgf shows considerable heterogeneity with sizes of - kda; however, purifi ed pdgf is cationic protein of kda. recombinant human platelet-derived growth factor (rh-pdgf) was the fi rst recombinant protein to be approved by the us food and drug administration for treatment of chronic foot ulcers in diabetic patients (regranex, ethicon inc. somerville, nj). it has the potential for use in bone regeneration and increasing bone density in long bones and spine. pdgf is commercially produced by using e. coli and mammalian cell s . human egf protein has aa and three intracellular disulfi de bonds and plays an important role in the regulation of cell growth and proliferation. it shows strong sequential and functional homology with human type-alpha transforming growth factor (htgf alpha), which is a competitor for egf receptor site. egf acts by binding with high affi nity to egfr on cell surface and stimulates the intrinsic protein tyrosine kinase activity. egf has many biological activities. initial observations were centered around their proliferative effects on fi broblasts, keratinocytes, and epithelial cells. egf modulates luteinizing hormone and thyroid hormone . egf is produced commercially by engineered e. coli . the other systems are also being explored for optimum egf production [ ] . fgfs are commonly mitogens with multifunctional proteins with a wide variety of regulatory, morphological, and endocrine effects. there are mammalian fgfs ( - and - ) which affect growth and functions of a wide variety of mesenchymal, endocrine, and nerve cells. the functions of fgfs in developmental processes include mesoderm induction, anteroposterior patterning, limb formation, neural tube induction, and brain development and in mature tissue/systems angiogenesis, keratinocyte organization, and wound healing processes. fgf is very important during normal development of both vertebrates and invertebrates, and any irregularities in their function lead to a range of developmental defects [ ] . fgf and fgf show strong angiogenic properties with the promotion of endothelial cell proliferation and the physical organization of endothelial cells into tubelike structures and the growth of new blood vessels from the preexisting vasculature. fgf and fgf (also known as keratinocyte growth factors (kgf) and kgf , respectively) stimulate the repair of injured skin and mucosal tissues by stimulating the proliferation, migration, and differentiation of epithelial cells, and they have direct chemotactic effects on tissue remodeling. most fgfs are secreted proteins that bind heparan sulfates and can therefore be caught up in the extracellular matrix of tissues that contain heparan sulfate proteoglycans. this allows them to act locally in a paracrine fashion. however, the fgf subfamily (including fgf , fgf , and fgf ) which binds less tightly to heparan sulfates can act in an endocrine fashion on far away tissues, such as the intestine, liver, kidney, adipose, and bone. for example, fgf is produced by intestinal cells but acts on fgfr -expressing liver cells to downregulate key genes in the bile acid synthase pathway; fgf is produced by the bone but acts on fgfr expressing kidney cells to regulate the synthesis of vitamin d and in turn affect calcium homeostasis. fgf may be synthesized using e. coli as host system . fgf is involved in stimulating collateral vascularization and recovery from ischemia as well as enhancing wound healing, nerve regeneration, and repair of cartilage and has been alternately referred to as "pluripotent" (capable of developing into more than one cell type or tissue) growth factors and as "promiscuous" (biochemistry and pharmacology concept of how a variety of molecules can bind to and elicit a response from single receptor) growth factors due to its multiple actions on multiple cell types. the fgfs and small-molecule fgf receptor kinase inhibitor are used in the treatment of cancer and cardiovascular disease and have potential in the treatment of metabolic syndrome and hypophosphatemic diseases: • receptor tyrosine kinase inhibitor (sunitinib) is approved for indications in renal cell carcinoma and gastrointestinal stromal tumors. • small fgfr inhibitors, su , pd , and nordihydroguaiaretic acid, are effective in multiple myeloma cell lines . • pd can induce cell cycle arrest in endometrial cancer cells with mutated fgfr . • antibody against fgfr has been shown to effectively cause apoptosis in mouse models of multiple myeloma and bladder cancer. thus fgfr inhibition can be very effective in the treatment of cancer. the nerve growth factor (ngf) is an important member of the family of neurotrophins. five protein nerve growth factors of the neurotrophin family are important. they regulate the development of the nervous system and play an important role in maintaining the structure, plasticity , and repair of the adult nervous system. all the neurotrophins are basic proteins of about aa, share % sequence homology, and are highly conserved in mammalian species. nerve growth factor (ngf) is a small secreted protein which induces the differentiation and survival of particular target neurons (nerve cells). little is known of the biological action of neurotrophin apart from ngf. the nucleotide sequence of cdna predicts that ngf is synthesized as pre-pro-ngf. upon removal of hydrophobic signal, either kda or kda pro-ngf is generated depending on the size of the transcript. however, processing of the precursors in different tissues is not well understood. they are essential for normal development, growth, and differentiation of the sympathetic and sensory neurons and are also essential to maintain the normal function of these cells in adults. thus it is important for maturation and survival of neurons and prevents degeneration of adult neurons. apart from its important role in the nervous system, it has been shown to possess protective action of human pressure ulcer, corneal ulcer, and glaucoma. reduced sensation may be observed in leprosy , wound healing, nerve injury, and diabetes . ngf may help to regulate the sensory fi ber sensitivity and function directly or indirectly by stimulating other effectors. administration of recombinant ngf may improve sensation and pain [ ] . cholinergic neurons of the basal forebrain show receptors for ngf; specifi c mrnas for various ngfs have been identifi ed in different areas of the brain. cholinergic neuron loss is a cardinal feature of alzheimer's disease. nerve growth factor (ngf) stimulates cholinergic function, improves memory, and prevents cholinergic degeneration in animal models of injury, amyloid overexpression, and aging. ngf acts on intracellular calcium through tyrosine kinase receptor mechanism. nerve growth factor enhances early regeneration of severed exons and is also important in maintaining the biochemical and morphological phenotype of mature basal forebrain cholinergic neurons (bfcnls) after lesions or injury of the central nervous system (cns). thus, ngf may provide therapeutic option for preventing death of cholinergic neurons and other clinical conditions and is produced using e. coli . tgf-α exerts its function in an autocrine and endocrine fashion for various cell types of ectodermal origin, including most epithelial cells. it is normally a transmembrane protein and functions in cell communication through its ability to activate a receptor tyrosine kinase. ectodomain of tgf-α is cleaved in a highly regulated manner, releasing soluble tgf-α which activates paracrine signaling. the receptor is egf/tgf alpha receptor; therefore, the focus is on understanding the important roles of tgf-α and egf receptor signaling in carcinoma development. tgf-β is a large group of related proteins. some of its family members include bone morphogenetic protein (bmps), growth, and differentiation factors (gdp). it affects tissue remodeling, wound repair, hematopoiesis, morphogenesis, embryonic development, adult stem cell differentiation, immune regulation (is switch factor for iga), and infl ammation. it exists as multiple forms as tgf-β , tgf-β , and tgf-β . it acts through transmembrane serine/ threonine receptor kinase leading to the activation of smads. it acts as tumor suppressor during cancer initiation but promoter during tumor progression. it has a role in the control of embryonic development, cellular differentiation, hormone secretion, and immune function. its role as mesenchymal differentiation factor, with focus on the muscle, fat, and bone cell, might provide insights into its deregulation in skeletal and developmental diseases and is the area of active research. it is produced using cho cell lines . there is huge potential for future therapies using proteins as therapeutic agents. the recombinant proteins are not only benefi cial, but the researchers can further engineer them to improve their activity and prolonged stay in the body, for example, engineering of monoclonal antibodies to have toxin or radioisotope or generation of bispecifi c antibody . still the technology is struggling hard to make the diseases completely a text of books and having a society free of diseases and the pathogens . • the production of proteins for therapeutic purpose is very important not only because of their specifi c action with minimum side effects but also due to their unique form and functions. • nowadays therapeutics (pancreatic enzymes from hog and pig pancreas or a- -proteinase inhibitor from pooled human plasma) are not extracted from their native sources (from humans, animals) due to risk of transmission of pathogens . other problems faced for extraction from native sources were the scarce availability of animal tissue for production, high cost of purifi cation with less yield, and immunological reactions in the recipients. • majority of the biological therapeutic agents are produced by using various advance tools and technologies as cloning, selection , purification, and stability monitoring. it is also very important to monitor risks and side effects of therapeutics (safety analysis ) obtained from a wide range of cells. • the various biological systems are available for production of recombinant protein as bacteria, fungal system ( yeast ), insect cell , mammalian cell , human cell, and transgenic plants and animals . • the system of choice is dependent upon the total cost of production and obtaining fully functional and appropriately modifi ed (ptms) (glycosylation, phosphorylation, or properly folded) protein. all these are essential for the biological activity of the protein. • the bacterial system is the simplest with short cell division times, easy maintenance , easy to modify, and cost-effective production. the bacteria have limitation in production of (c) growth factors are required for progression of tumors (d) all of the above . darbepoetin is used for the treatment of: the fgf family: biology, pathophysiology and therapy nature reviews combined tge-sge expression of novel pai- -resistant t-pa in cho dg cells using orbitally shaking disposable bioreactors microbial factories for recombinant pharmaceuticals the preparation and characterization of human insulin of recombinant dna origin chemistry and clinical use of insulin biosynthetic human proinsulin: review of chemistry, in vitro and in vivo receptor binding, animal and human pharmacology studies, and clinical trial experience biopharmaceuticals derived from genetically modifi ed plants emerging trends in plasma-free manufacturing of recombinant protein therapeutics expressed in mammalian cells clinical applications of epidermal growth factor hsp : structure and function engineering of therapeutic proteins production in escherichia coli the production of recombinant pharmaceutical proteins in plants nerve growth factor: basic studies and possible therapeutic applications genetic engineering and biotechnology of growth hormones, genetic engineering -basics, new applications and responsibilities hsp chaperones: cellular functions and molecular mechanism microbial factories for recombinant pharmaceuticals production of recombinant proteins optimization of production of recombinant human growth hormone in escherichia coli bacterial production of human insulin cloning, transformation and expression of proinsulin gene in tomato (lycopersicum esculentum mill) transient transfection of serum-free suspension hek cell culture for effi cient production of human rfviii heat-induced production of human growth hormone by high cell density cultivation of recombinant escherichia coli the structure of the human tissue-type plasminogen activator gene: correlation of intron and exon structures to functional and structural domains production of recombinant protein therapeutics in cultivated mammalian cells large-sized protein and are unable to perform posttranslational modifi cation as glycosylation. the glycosylation is very important as it affects the activity, half-life, and immunogenicity of the recombinant protein.• due to limitations, the other host systems were used for production which could give better product with minimal immune reactions and are cost-effective. fungal and insect systems are utilized for protein production but sometimes result in inappropriate glycosylation. improper glycosylation may result in nonfunctional and highly immunogenic protein.mammalian cells are the most preferred system for production of these proteins. cho system is the system of choice for therapeutic protein production. it accounts for nearly % production of all products available in the market. key: cord- -jop rx authors: vignais, pierre v.; vignais, paulette m. title: challenges for experimentation on living beings at the dawn of the (st) century date: - - journal: discovering life, manufacturing life doi: . / - - - - _ sha: doc_id: cord_uid: jop rx “we can talk endlessly about moral progress, about social progress, about poetic progress, about progress made in happiness; nevertheless, there is a type of progress that defies any discussion, and that is scientific progress, as soon as we judge it within the hierarchy of knowledge, from a specifically intellectual point of view.” introduction of new exploratory methods such as biocomputing or bioinformatics and high-throughput screening, which involves the simultaneous processing of hundreds or even thousands of samples. this approach is in contrast with traditional biology, in which the research strategy is based upon the observation of effects obtained as a function of experimental parameters that are modified one by one. another aspect of modern times is that, with the irresistible trend in genetic manipulation towards a focus on human beings, certain areas of fundamental research are finding themselves locked into philosophical dilemmas that are matter for ethical and sociocultural consideration, and the subjects of fierce debate. instead of setting out to discover unknown mechanisms by analyzing effects that are dependent on specific causes, with some uncertainty as to the possible success of the enterprise being undertaken, which is the foundation stone of the bernardian paradigm of the experimental method, many current research projects give themselves achievable and programmable objectives that depend upon the means available to them: sequencing of genomes with a view to comparing them, recognition of sequence similarities in proteins coded for by genes belonging to different species, with the aim of putting together phylogenetic trees, synthesis of interesting proteins in transgenic animals and plants, analysis of the three-dimensional structure of proteins, in order to find sites that are likely to fix medicinal substances, and synthesis of molecular species able to recognize pathogenic targets. the facilities that are called into play include instruments that are often sophisticated, the performance of which, in terms of miniaturization, computerization and robotization, is far beyond that of apparatus that was in use a few decades ago. these facilities, applied to research into living beings, have entered the framework of a methodology that has been given the label biotechnology. proceeding handin-hand with applications that have become more and more meaningful in the domains of medicine, pharmacology, agronomy and animal husbandry, the biotechnological process has come to the fore as a new paradigm for the experimental method as applied to living beings. in addition to new discoveries, the driving forces behind biotechnologies are related to economic imperatives as well as the interest and support they receive from the political powers-that-be. the academic spirit that presides over fundamental science gives way to the entrepreneurial spirit that implements a rational programming of facilities and an efficient organization of scientific collaborations. as an example, the sequencing of the human genome, which includes three billion nucleotide base pairs, required the coordination of several dozen scientific teams around the world and the matching of several tens of thousands of results. research on dna provides a typical illustration of the way in which research has become divided, over the last few decades, between an approach and an interest that had previously been purely academic, and the increasing role of technology, which can be justified by the results that arise in the life of society at large, but which, because of these results, also gives rise to questions concerning how wellfounded some of these results are, particularly in the health domain. the experimental method, which had been confined to the laboratory, is now a matter for public debate. before it won acclaim, dna, which was isolated under the name of nuclein by johann friedrich miescher ( - ), at the end of the th century, had to undergo a series of structural evaluation tests that were spread out over the first five decades of the th century. an overall conclusion then came to the fore. dna is a polydeoxyribonucleotide that carries four cyclic bases, adenine, thymine, cytosine and guanine. each base is involved in the structure of a mononucleotide where it is itself associated with a sugar, deoxyribose, which is associated with a phosphate residue. dna was compared to a ladder, the rungs of which (mononucleotides) were linked by ester bonds between an acid group of a phosphate residue of a nucleotide and the free hydroxyl group of the deoxyribose of the following nucleotide. research committed them to this path. it was the curiosity of each, a new way of considering old problems, that led a few men and women to solve the problems of heredity." the middle of the th century saw an accumulation of experimental evidence showing that dna carries genetic information, and because of this, that it controls the transmission of hereditary characteristics: the proof provided in by oswald avery ( - ) , colin macleod ( - and maclyn mccarthy ( - of the transforming power of dna in pneumococcus, the highlighting by alfred hershey ( - ) and martha chase ( - , in , of the role played by bacteriophage dna as an infectious agent for bacteria, the revelation by erwin chargaff at the beginning of the s of the equivalence of molar concentrations of adenine (a) and thymine (t), on the one hand, and of cytosine (c) and guanine (g), on the other hand, in dnas arising from a multitude of sources, animal, plant and microbial, thus suggesting a complementary pairing of adenine and thymine, and cytosine and guanine. based on the pairing of a/t and c/g bases, the model of the double helix structure of dna, formulated in by james watson and francis crick, made it possible to understand the identical synthesis of double strands of dna by replication during cell division (figure iv. ) and, as a consequence, the conservation of hereditary characteristics in descendants. afterwards, it was found that the information contained in the dna base sequence determines the amino acid sequence in proteins. then the roles played by messenger rna and transfer rnas were elucidated, the former acting as a carrier of information between dna and the proteins being synthesized and the latter acting as double-headed adaptors, able to recognize nucleotide triplets (codons) in messenger rna and to specifically fix amino acids in order to position them on the ribosomes, the final result being the synthesis of a protein chain. in , the genetic code was deciphered. the veil of mystery that had covered the mechanism of the synthesis of proteins was lifted, and the decisive role played by nucleic acids in this synthesis was shown. later on, there were a few adjustments. although, in bacteria, proteins are coded for by a continuous sequence of nucleotide triplets in dna, in the s the surprising discovery was made that in eukaryotic organisms, genes are discontinuous and made up of coding dna sequences (exons) interrupted by non-coding sequences (introns). from the end of the s, françois jacob and jacques monod had postulated the existence of a dual determinism for protein synthesis and shown that, next to structural genes expressed as proteins, there are regulatory genes able to control the expression of the structural genes. the importance of the differential regulation of gene expression in cell differentiation in higher organisms was quickly recognized. from this point on it was possible to explain why a particular species of protein is more specifically expressed in a given tissue and another species of protein is more particularly expressed in another tissue, each type of tissue finding its specificity in its molecular components. this fantastic framework of knowledge, which was built up over a couple of decades, has been used as the foundation stone for the so-called central dogma of molecular biology, which explains the transcription of dna sequences into messenger rnas and the translation of messenger rnas into proteins and which, with only a few variants, is the same throughout the living world (figure iv. ) . the fascinating history of molecular biology was well described by james d. watson in molecular biology of the gene ( rd edition ). there are now many works concerning this subject and how it has progressed, including two well-documented books in french: the secrets of the gene by françois gros ( ) and histoire de la biologie moléculaire by michel morange ( ) . new strand a -double helix structure of dna and simplified representation of its self-replication. each strand of the parent molecule of dna acts as a matrix for the synthesis of a daughter molecule of complementary dna, in conformity with the rules of pairing: adenine (a) with thymine (t) and guanine (g) with cytosine (c). the double strands that appear are identical to each other and identical to the parent dna molecule. b -transcription of dna into messenger rna and its translation into an amino acid chain. diagram of gene expression in a eukaryotic cell. one of the strands of dna (the coding strand) has coding sequences (exons) and non-coding sequences (introns). it is said that the gene is split. the transcription of the exons, accompanied by their splicing leads to the formation of a messenger rna, the codons (nucleotide triplets) of which are translated into amino acids that are linked to each other by covalent bonds in order to form a protein chain. in prokaryotic cells (bacteria), the genes do not contain introns and are not split. a: adenine; c: cytosine; g: guanine; t: thymine; u: uracil. met: methionine; his: histidine; tyr: tyrosine; gly: glycine; phe: phenylalanine. interlinked with the epic rise of molecular biology, there was a succession of technical innovations that led to the synthesis of dna by chemical or enzymatic means, and to its being cleaved at specific locations, with the pieces that were obtained being joined together again . in , in geneva, werner arber (b. ) and daisy dussoix highlighted the restriction phenomenon, which involves the degradation of bacteriophage dna by a recipient bacterium. they discovered that an extract of e. coli has a restriction activity, and that this activity is of an enzymatic nature, caused by a nuclease that breaks the phosphodiester bonds in dna. in , the americans hamilton smith (b. ) and kent wilcox purified the first restriction enzyme from a strain of haemophilus influenzae. in , daniel nathans ( - and kathleen danna (b. ) at johns hopkins university in baltimore (usa) drew up the first restriction map based on the circular dna of the monkey sv virus, using a restriction enzyme that was named hindiii and a follow-up of the sequential appearance of shorter and shorter fragments resulting from the partial digestion of dna. in the following years, dozens of restriction enzymes were isolated, all of them endowed with a surprising specificity with respect to specific base sequences in dna ( figure iv . ). these enzymes were to be indispensable tools in genetic recombination experiments. the transformation of rna back into dna was observed by howard temin ( - ) and s. mizutani , in experiments on the rous sarcoma virus, a virus with rna that, when it proliferates in host cells, is able to synthesize a dna that is complementary to its rna. the enzyme responsible, reverse transcriptase, was purified by both h. temin and david baltimore (b. ) . starting with a determined messenger rna, it then became possible to work back to the dna, i.e., to the gene, by a simple enzymatic reverse transcription operation. dna that has been synthesized in this way is called complementary dna (cdna). in eukaryotic organisms, reverse transcription has proved to be all the more useful as a technique in that all cdna is coding, unlike the situation in vivo in which the genes are divided up into portions that are coding (exons) and portions that are non-coding (introns). the ability to cleave dna and to join together the fragments obtained in a deliberately chosen order, or, in other words, to manufacture previously unseen dna sequences by making new combinations, led to the dawning of recombinant dna technology and caused scientists to come to the sudden realization that the pandora's box that contains the secrets of life had been opened, that uncontrollable catastrophes might arise from this, and that there was a potential danger of causing tumorigenic viruses to reproduce in commensal bacteria such as the enterobacterium escherichia coli. in , around a hundred molecular biologists gathered together at the asilomar conference center near monterey in california, in order to discuss the dangers of the new dna technology. they proposed strict regulation to govern genetic manipulation. time and experience have shown that the risks being run were very low. in , dna sequencing methods were published. one of them made use of chemical techniques , the other made use of enzymatic techniques . applications were not slow in appearing. from , the team led by frederick sanger (b. ) in cambridge (uk) determined the first sequence of a genome, that of bacteriophage phix , which is nucleotides long. this was the beginning of an audacious adventure, the apparently senseless challenge being met with unbelievable rapidity thanks to the innovative methods of bioengineering, resulting, during the first years of the st century, in the sequencing of the human genome. analysis of the human dna sequence involved the participation of two rival groups, one of them being academic, coordinated by francis collins (b. ) , and bringing together dozens of laboratories around the world, and the other being a private californian company directed by craig venter (b. ) . at the beginning of the s, when everyone was persuaded that the rnas could be placed into three well defined categories, messenger rnas, transfer rnas and ribosomal rnas, it was with great surprise that it was learned that there were rnas that have catalytic properties (see thomas cech [b. ] ). these rnas, called ribozymes, have, in keeping with enzymatic proteins, structured catalytic sites that are able to catalyze rna or dna cleavage or ligation reactions. recently, engineering techniques have been used to obtain artificial ribozymes that have been found to be able to catalyze reactions as varied as oxidations or the synthesis of peptides and nucleotides, thus opening up wide-ranging possibilities of applications in molecular therapeutics, and, in addition, reinforcing the famous theory of the "world of rna" at the beginning of the appearance of life on earth . another discovery of the s was the role of methylation of dna bases, cytosine and adenine, and its deregulation in a certain number of pathologies: fragile x syndrome, scapulohumeral dystrophy, certain forms of cancer … in the past decade or so, basic proteins known as histones that are associated with the nuclear dna of eukaryotes in the form of a complex called chromatin and which had previously been assigned a structural role, have now acquired the status of functional partners. thanks to specific modifications of certain amino acids (acetylation, methylation, phosphorylation), histones control the state of condensation of the chromatin and the efficacy of transcription of dna contained in the chromatin, to such an extent that we now speak of the "histone code". the development of our understanding of histones is a good illustration of the complexification of a concept, the dna code, into an entity that comes closer to living reality, the dna code in partnership with the histone code. there has also been the discovery of interfering micrornas, small polymers made up of around twenty nucleotide units, the role of which is to control protein synthesis (chapter iv- . . ). methylation of dna, structural modifications of the chromatin histones, and blocking of transcriptional activity by interfering micrornas are a few of the major areas of research in a scientific domain that is in full expansion, epigenetics, which could be said to have "pipped the science of genetics at the post," and which explains the plasticity of the functions of living beings. with the arrival of restriction enzymes and reverse transcription, the foundation stones of genetic engineering have now been laid, and are ready to be used, this being all the easier in that synthetic chemistry is now able to manufacture dna chains that are several hundreds of nucleotides long, and progress in robotics and computing techniques made it possible for chemists to avoid carrying out tedious routine tasks by using completely-programmable machines. the hope that it would be possible to experiment on living beings by means of the manipulation of dna became a reality when the american researchers paul in the first work carried out on the expression of foreign genes, the use of plasmids as vectors was preferred, particularly that of plasmid pbr , because of its considerable replication capacity. in , a first success was obtained by herbert boyer and his co-workers, with the expression of the gene for somatostatin, a peptide hormone comprising twelve amino acids that negatively regulates the secretion of growth hormone, in the bacterium e. coli. because of its small size, the somatostatin gene was synthesized by chemical means. the expression of somatostatin in e. coli was verified using immunological and physiological criteria, thus demonstrating the validity of the procedure that was used. the following year, human insulin was produced in e. coli. fairly soon, yeast was substituted for this bacterium because, as a eukaryotic organism, it has enzyme systems that are able to carry out chemical finishing operations on neosynthesized proteins that bacteria are unable to do, for example the formation of disulfide bridges in insulin. genetic recombination is used in order to cause bacteria to manufacture a foreign protein of animal or plant origin. this involves the insertion of the fragment of animal or plant dna that codes for this foreign protein into a plasmid. the plasmid, a small ring of bacterial dna, acts as a vector for the foreign dna. in order to carry out insertion, plasmid dna is cleaved by an appropriate restriction enzyme. the foreign dna is obtained by reverse transcription from a useful messenger rna. its duplication is catalyzed by a dna polymerase. the s nuclease makes it possible to break the covalent bond between two strands of dna. in the following step, a terminal transferase is used to add four nucleotides for which the base is a cytosine (c) to each of the two dna strands. the same lengthening operation is carried out on the bacterial plasmid, but, in this case, the addition involves a sequence of four nucleotides for which the base is a guanine (g) (complementary to the cytosine c). the bacterial plasmid is hybridized in vitro with foreign animal or plant dna and then introduced into the bacterium which, using its own machinery, perfects the junction between the integrated dna and the plasmid dna. a very small volume of dna ( . - ml) is injected under the microscope into eukaryotic cells (hela cells in inset) using a micropipette with a very fine end that pierces the cell membrane. the swelling of the cells at the moment of injection can be seen (inset). supported by these successes, genetic engineering started to come to the fore as an application-oriented discipline. levels of performance that would never have been imagined half a century before were achieved, such as the production of growth hormone, interferons, blood coagulation factors and vaccines. in the final decades of the th century, phenotype transformations using genetic modifications that had previously been carried out in bacteria and yeasts were successfully attempted in animals and plants. it was observed that a mutated dna integrated into a plasmid and introduced into a fertilized mouse egg (by micromanipulation) modifies the mouse's genetic inheritance, which affects first the embryo and then the adult mouse with phenotype modifications. such mice, which are said to be transgenic because of the stable integration of a foreign dna into their genome, are now widely used as animal models in studies that aim to understand the mechanisms involved in high-incidence human pathologies such as cancer, diabetes, and rheumatoid conditions. in , two american researchers , ralph brinster (b. ) and richard palmiter (b. ) carried out a spectacular transgenesis experiment in mice. using microinjection, they introduced the growth hormone gene (obtained from the rat) into oocytes of mice from a "little" germ line. once the transgeneic mice had reached adulthood, they were giants. at present, the transgenesis technique is being applied both to the animal kingdom and to the plant kingdom. in the s, kary mullis (b. ) perfected an ingenious technique, the polymerase chain reaction (pcr), which makes it possible to produce several tens of thousands of copies of a fragment of dna. using this technique, it is possible to detect traces of a fragment of dna of a given sequence down to an attomolar concentration, i.e., one billion billion ( - ) times smaller than molar concentration. by the end of the th century, genetic engineering had become well-established and wide-spread, thanks to a mastery of techniques involving the manipulation of dna such as the accurate cleavage of a gene into fragments using commercially available restriction enzymes, the covalent assembly of two fragments of dna by ligases, the automated chemical synthesis of fragments of dna of more than one hundred nucleotides and the possibility of manufacturing a complementary dna (cdna) from a messenger rna by using a reverse transcriptase and automated dna sequencing. given this particularly well-equipped toolbox, the molecular biologist is now able to manipulate dna, that is to say, the chemical material that contains the information that is central to the functioning of living structures (microorganisms, plant and animal organisms), and thus to modify, at will, the genotype of these structures that the selective pressure of evolution has previously favored. genomics has produced enormous quantities of data that are stored in databanks. automated procedures have been invented to make the information contained in these data intelligible, these procedures forming the basis for a new discipline, biocomputing, or bioinformatics, which develops programs, or algorithm-based strategies, that are able to solve specific problems, of which the annotation of genomes, i.e., the identification of coding and non-coding sequences. while the annotation of the prokaryotic genomes is relatively easy, because of the absence of introns, that of the eukaryotic genomes is considerably more difficult because of the alternating exons and introns and the small proportion of coding exons (fewer than % in the case of the human genome). this explains why, at the time of writing, several hundred genomes of prokaryotes (around at the beginning of ) have been sequenced, as opposed to only a few dozen genomes of eukaryotes. annotation was carried out manually at first, but it has become automated and it is now possible to analyze thousands of items of genomic data. the comparison of nucleotide base sequences in dnas and of amino acids in proteins of different origins involves biocomputing. the identification of similar or identical regions that provide information about functional similarities and phylogenetic proximity involves the use of alignment methods. one of these methods, which is in current use, is called blast (base local alignment search tool). comparison of protein sequences has been particularly instructive in the science of evolution. it has highlighted evolutive processes in the phylogenesis of proteins and linked these processes to precise functions. it has been possible to deduce that, over time, different families of proteins with similar functions appeared independently and evolved along different routes. this is the case for membrane proteins whose polypeptide chain crosses the thickness of the membrane six or twelve times; thus, the mitochondrial membrane proteins that transport metabolites are formed by triplication of an element with two transmembrane segments, while proteins located in other membranes of the cell are derived from duplication of an element with three transmembrane segments. from this academic context arose the study of paleogenetics, a new discipline that compares dna sequences extracted from fossils and amplified by pcr with dna sequences of current species. in addition to being of immense interest to fundamental biology, genetic bioengineering has led to innumerable industrial applications making use of genetically modified microorganisms that are able to synthesize molecules with a high added value that can also be used in xenobiotic depollution operations. so much data has already been deposited, equivalent to the sequencing of more than one hundred billion nucleotides, that it is inevitable that there have been errors, some of which might prove prejudicial for future use (comparison of sequences, screening of drugs…). nevertheless, the ever-increasing numbers of genome sequencing projects for animal, plant and microbe species show the interest that is shown in understanding the genetic information present in different types of cells, in order to be able to exploit their potential. dna chips appeared in the last decade of the th century, and came to the fore as part of a new technical revolution, the "high throughput" revolution ( figure iv. ). an article written by the group headed by ronald davis and patrick brown (b. ) at the university of stanford gives a precise description of the hybridization technique used in dna chips. thus, around a hundred short dna strands, corresponding to portions of genes of the plant arabidopsis thaliana, commonly known as mouse-ear cress, a small plant in the brassicaceae (formerly cruciferae) family, are synthesized. a robot is used to deposit microquantities of these dnas in solution in a dot pattern on a small glass slide coated with poly-l-lysine, thus comprising a "chip" on which the covalently fixed dnas act as probes for specific molecules. a later step involves both the use of reverse transcription to produce complementary dnas (cdnas) from messenger rnas arising from the expression of genes in the same plant and the labeling of these cdnas with fluorescent ligands for use in screening. once these fluorescent cdnas have been denatured, i.e., after separation into single strands, they are brought into contact with the dna chip. the unhybridized molecules are removed by washing. a -the term dna chip corresponds to a small, chemically-treated glass (or sometimes silicon) plate on which a robot has deposited dna strands of known sequence in a pre-determined order. b -the dna chip may be used for different types of diagnostic procedures. in the differential diagnosis experiment represented here, messenger rnas are prepared from two cell samples that have been treated in parallel, the control sample (normal cell) and the experimental sample (pathological cell). these messenger rnas are reverse transcribed into complementary dnas (cdnas) by means of a reverse transcriptase. each of these two types of cdna, corresponding to the two types of messenger rna, is labeled by a chemical reaction with a specific fluorescent ligand (cy , which emits at nm and cy , which emits at nm). they are then hybridized with chip dna strands. after hybridization and then washing, the fluorescence emitted at nm and nm under laser irradiation are analyzed using an appropriate detection system. the differential expression of the genes in the control cell sample and the experimental sample (shown by a color difference) can thus be analyzed. hybridization between the fluorescent dnas called targets and the complementary nucleotide probes fixed to the dna chip is detected by means of an automated fluorescence detection system. at the beginning of the s, stephen fodor and his group, who were working at the affymax research institute (palo alto, california), developed an ingenious procedure for microphotolithography that led to the synthesis of a network of a thousand peptides on chemically pre-treated glass microscope slides. the resolution of the network was shown by epifluorescence microscopy after fixation of specific antibodies labeled with fluorescent probes. soon after this, microphotolithography was used for the manufacture of dna molecule networks on solid supports. from then on, two competing techniques for the preparation of dna chips became well-established, either the depositing on a solid support of cdna obtained by gene amplification (technique used by davis and brown), or the synthesis in situ of oligonucleotides carried out directly on a solid support (technique used by the american affymetrix company, arising from affymax). one considerable advantage of dna chip technology is that it provides information on the level of transcription of thousands of genes into messenger rnas (mrnas), in a simultaneous manner and in a relatively short lapse of time. experiments that previously required weeks, months or even years to be completed can now be carried out in a matter of hours. we therefore have a sort of instantaneous, precise, freeze-frame picture of the state of a cell at a given moment, with a great number of parameters explored in a semi-quantitative manner. dna chips are a typical example of the application of high throughput technology to the study of living beings. the panoply of mrnas produced by the transcription of dna is called the transcriptome. the method by which transcriptomes are obtained is called transcriptomics. it should be added here that there is not necessarily a correlation between the abundance of a mrna, evaluated on a dna chip, and the functionality of the corresponding protein, which depends on multiple factors that particularly involve post-translational modifications: phosphorylation, glycosylation, hydroxylation, and so on. at the end of the s, dna chips were being used extensively in the research programs of many biology laboratories. they are used in a variety of domains: human pathology, to differentiate between forms of cancer linked to multiform mutations; microbiology, to identify pathogenic germs; comparative genomics, to look at model eukaryotic organisms that have in common a certain number of genes; or even in populations genetics, to detect polymorphisms linked to a change in a single base in a dna sequence. as a complement to the dna chip technique, the fish (fluorescent in situ hybridization) method holds a key position in the study of cytogenetics. this method is based on hybridization between fluorescent nuclear probes of known nuclear sequence and of complementary motifs located in the dna of the chromosomes. it allows the detection of chromosomal modifications with a gain or loss of genetic material, such as those that are found in certain tumors. it is used in prenatal diagnostics to diagnose such modifications. protein chips, which are used to characterize the reactivity of proteins with respect to specific ligands, are another example of the application of high throughput technology. dozens of proteins of different types (antibodies, for example), as well as derivatives of nucleic acids or even molecules capable of being ligands of proteins that might arise from combinatorial chemistry (chapter iv- . ), are arranged in a network on small glass plates that are chemically treated to act as hooks to entrap specific proteins present in a tissue extract or serum ( figure iv . a). this procedure, which is essentially analytical in nature, is complemented by a functional study in which proteins that have been isolated in their native form, i.e., those that are capable of expressing the same functions that they posses in vivo, are deposited on a glass microplate. this type of biochip makes it possible to analyze the reactivity of the proteins that are fixed to it with respect to a multitude of targets (proteins, nucleic acids or pharmaceutical substances) (figure iv. b). antigens peptides a -biochip for the identification of proteins. different types of ligands, antibodies, antigens, dna or rna, small molecules with a high affinity and specificity, are deposited on a reactive surface. these biochips can be used to determine the level of expression of proteins and the type of proteins expressed in cell extracts. they can be used for clinical diagnostics. b -biochip for the functional study of proteins. native proteins or peptides are arranged in micronetworks on a reactive medium. biochips produced in this way are used to analyze the activities of proteins and their affinities as a function of posttranslational modifications. they are useful for identifying drug targets. analytical proteomics, which was still in its infancy in the last decades of the th century (chapter iii- . . ) has become a vigorous discipline. the association of liquid nanochromatography and of mass spectrometry allows the identification of peptides obtained by the trypsin hydrolysis of samples of proteins of around one picomole in size. applied to fundamental and pathological cell biology, the aim of analytical proteomics is not only to decipher the list of proteins on the scale of the cell, but also to highlight variations in the abundance of synthesized proteins as a function of environmental conditions. it also aims to determine the posttranslational modifications that are undergone by the proteins inside the cells, which, for a large part, control the specificity of their operation. in parallel with the study of proteomics, the study of peptidomics, or the study of all of the peptides (peptidome) present in animal and plant cells and in the fluids that bathe these cells, has developed. for example, several hundred different peptides have been found in the cerebrospinal fluid. structural proteomics, which deals with the three-dimensional structure of proteins, has also become a domain in which the activity has been increasingly dominated by high-throughput techniques. this is partly due to the fact that the pharmaceutical industry has given considerable, sustained attention to the understanding of the structure of proteins that could play the role of therapeutic targets. this is the case for protein kinases that catalyze, via atp, the phosphorylation of endocellular proteins, of proteases involved in hydrolysis reactions, or even of cell surface receptors that are able to combine with ligands such as hormones. the use of automated crystallization systems has become common in structural biology. from the classical technique of the hanging drop, in plates comprising wells, we have moved on to plates with , and, recently, wells. obviously, this increase in dimensions requires the use of an automated system that includes a robot in charge of transferring microaliquots of the protein solution into the wells and adding media that differ according to their ph, ionic force and molecular composition to these wells. the crystallization process is followed by an automated microscopic examination coupled with video photography. making use of the recent development of genomics and of proteomics, and a detailed inventory of the structures and functions of the different protein species of living beings, contemporary biology is now able to sketch out a scheme of molecular systematics, including a classification into phylla, families, and classes that echoes those of the zoological and botanical systematics of the th and th centuries. however, modern systematics does not tell us how protein macromolecules interact within dynamic networks. there is still an enormous amount of work to be done in order to achieve an understanding of the meaning of the dialogue between macromolecules in a normal or pathological cell context. this work will require a detailed analysis of metabolic pathways and of how they are controlled, and their evaluation in kinetic and thermodynamic terms. it will be accompanied by modeling (chapter iv- ). there is no doubt that it will be successful. making use of subtle differences in the qualitative and quantitative expression of genes, it will become possible to understand the molecular principles that modulate differences in morphology and in function between neighboring animal, plant or microbial species. the science of evolution should benefit from this. in medicine, the forecasting of predispositions for certain diseases should be made easier (chapter iv- . ), opening up the perspective of prevention strategies. using recombinant dna technology, metabolic engineering applied to microorganisms and plants should make it possible to improve the production of molecules that are of economic interest or can be used for drugs. the diversity of bacteria is amazing, much greater than might be supposed by looking at the number of bacterial species identified by culturing on appropriate media. in fact, the number of bacterial species that can be cultivated only represents % of the total number of existing bacterial species on the surface of the earth. there are two major reasons for this: we do not know the appropriate conditions for culturing these bacteria; a certain number of environmental bacteria live in symbiosis, acting as commensal organisms that benefit from the products secreted by other organisms. nevertheless, the study of the bacterial genome, without any clonal culture, has been carried out, and comprises a branch of genomics known as metagenomics. instead of looking at an isolated, well-identified bacterial species, in order to analyze the sequence of its dna, as has been done traditionally, researchers look at a heterogeneous bacterial sample from which the dna is extracted, amplified, and then sequenced by high throughput methods. computer processing of the data provides information about individual germs. craig venter, who had already gained notoriety with the sequencing of the human genome, recently applied "metagenomic" procedures to the study of the sequence of the "metagenome" of the bacterial species of the sargasso sea . he came up with nucleotide sequences corresponding to approximately million kilobases of non-redundant nucleotides, attributable to more than two thousand different genomes. the challenge to be met by metagenomics is to connect a function to its phylogenetic source and to extend this information to specific species within a bacterial community. group . in human biology, a metagenomics approach has been applied to the study of the population of bacteriophages present in the intestinal flora. approximately genotypes have been identified, a number that greatly exceeds the bacterial species of this flora . this result leads us to think that the luxuriant community of bacteriophages which cohabits with that of the intestinal bacteria may influence the diversity of the latter by selective bacterial lysis and also by promoting the exchange of genes between bacteria. a rapid overview of the history of the exploration of genomic dna over the last fifty years shows the rapidity with which a traditional experimental paradigm can move thanks to modern computing and robotics procedures. in less than twenty years, we have moved from the manual sequencing of dna that was developed at the end of the s to automated high throughput sequencing. at the turn of the st century, the sequencing of communities of genomes (metagenomics) has been substituted for the sequencing of individual genomes. dna and protein chips have become objects of everyday use in fundamental and applied biology. transgenesis is widely practiced. dna, a molecule that remained mysterious for a long time after it was discovered, delivered some of its secrets during the second half of the th century. the purpose of the first experiments on dna was to understand how dna, detector of the genetic code, transmitted its message. after having questioned dna, researchers moved on to manipulating it. the current aim is to use oligonucleotides to build nanoscale constructions with original and, if possible, useful, properties. in addition, the possibility that has recently become available of being able to interfere with the expression of the genome in living cells, with the intervention of small rna molecules, allows the programmed manipulation of the genome. another challenge, the extending of the coding power of the genetic code, now appears to be achievable. from the fact that each of the strands of the double helix overhangs in one direction, and in the other the strand with which it is paired, thus leaving a few bases free ( figure iv. ) . if two strands of dna with sticky ends are brought into contact, when the bases of these ends are complementary, a branched structure will appear spontaneously. using this principle as a basis, cube-shaped nanometric constructions that make it possible to encage molecules of interest have been built. the opening of the cage by appropriate devices liberates the encaged molecules, which can act as substrates in specific reactions. the cutting of a double strand of dna using restriction enzymes able to create fragments with cohesive ends (a) has been used to "build" an artificial construction (b), which, in this case, is a cube (c), but which could be an object of a different geometrical type. a dna nanomachine that is capable of movement is becoming a reality. one dna nanomachine, which is admittedly still rudimentary, has been put together based on the structural difference that exists between b-dna, the classical double helix that twists to the right, and z-dna, a double helix that twists to the left. a propensity to adopt the z-form is triggered when there is an alternating sequence of cytosine (c) and guanine (g) (cg sequence) in the dna. the experiment illustrated in figure iv. makes use of a duplex formed of b-double helices. the dna nanomachine constructed by n.c. seeman comprised a duplex of double strands of dna. one of the double strands, of the classical b-form of dna (right-hand twist), has been cleaved in such a way as to fix fluorescent molecular probes onto the cleavage zones. facing this cleavage zone, a short nucleotide sequence, in which the cytosine (c) guanine (g) motif is repeated, can be found in the other dna double strand, which is also of b-form. the addition of cobaltihexammine induces the transition of the cg segment from a right-hand twist (b-dna) to a left-hand twist (z-dna), which leads to a rotation of this segment and to a rotation of the assembly, which can be detected using fret (fluorescence resonance energy transfer) spectroscopy. one of the double helices has a short cg segment. facing the cg segment, the other double helix is interrupted, and its ends, where the interruption is, carry fluorescent molecular probes. the simple fact of adding a cationic substance such as cobaltihexammine, which neutralizes the negative charges of the phosphate groups, triggers a conformational transition, with the cg segment taking the z-form, causing a rotational movement of the assembly that is detected by the movement of the probes. there is no doubt that the use of dna strands in order to build nanomolecular constructions that are capable of programmed movement marks the beginning of an adventure that we may imagine will be rich in outlets for domains such as computer technology, nanomechanics and even the life sciences. in addition, the discovery that dna conducts electrical current gives rise to dreams of a revolutionary technology in which dna may be used in the design of electrical circuits, in competition with classical electronics . interfering rnas are non-coding rnas of around twenty nucleotides that control gene expression at post-transcriptional level. as with many discoveries, that of rna interference was the result of serendipity. it began during the s with observations made by two american research groups, that of victor ambros now at darmouth medical school, hanover, and that of gary ruvkun (b. ) at boston's massachusetts general hospital, that a gene named lin- , which is involved in the post-embryonic development of the nematode c. elegans, did not code for a protein, but for a small size rna that played an antisense role. this odd discovery was supported and made more explicit a few years later by the research groups of andrew fire (b. ) at baltimore's carnegie institute and craig c. mello (b. ) at the university of massachusetts in worcester . in order to block the production of certain proteins in the nematode c. elegans, the researchers used synthetic antisense rnas. the control involved the use of sense rnas according to a classical protocol. unexpectedly, protein synthesis was blocked in both cases, suggesting that a contaminant was present in the sense and antisense rna preparations. this contaminant was identified as a double strand rna (dsrna -double strand) that is, an rna that is folded back on itself in a "hairpin" loop because of the pairing of complementary bases (adenine vs uracil and guanine vs cytosine). in order to verify the mechanism by which the translation of messenger rnas into proteins is silenced, the nematode c. elegans was injected with a synthetic dsrna, part of the sequence of which was complementary to that of the gene unc- , known to code for a protein involved in muscular contraction. within a few hours, the worm was making disordered movements, suggesting that the dsrna interferes with the production of proteins in the process of muscular contraction. the mechanism of action of dsrna was quickly unraveled: dsrna gives rise to two single strand rnas after cleavage by a specific enzymatic mechanism. one of the single strand rnas (sirna -small interfering rna) is paired thanks to a complementarity of bases with a short sequence of messenger rna transcribed from the gene unc- . the result is a blockage of the translation of messenger rna into a protein, followed by the destruction of messenger rna. this phenomenon was named rna interference ( figure iv the dicer cleavage enzyme, which has a ribonuclease activity, cuts the double strand rna into two strands. in the presence of the risc (rna-induced silencing complex) protein complex, one of the rna strands finds a complementary nucleotide sequence in a messenger rna (mrna) and associates itself with this rna, making it unable to be translated into protein. we now know that eukaryotic cells from animals and plants produce and host interfering rnas that are said to be "natural" . natural interfering rnas, of around twenty nucleotides, are called micrornas (mirnas). although a few details differ between the modes of formation and action of natural interfering rnas and those of synthetic interfering rnas, in particular the fact that messenger rnas are not destroyed by mirnas but blocked in their translation, the effect of negative regulation on the production of specific proteins comes to the same thing ( figure iv. ). there is a far from negligible number of genes that code for mirnas. already, several hundred mirnas have been identified in the genomes of plants and animals. the amount of interest that they arouse, and the feverishness of the research being carried out on them, are in keeping with the major mechanisms that they control: embryogenesis, hematopoiesis, neuronal differentiation, etc. given an understanding of the genome sequence in man, the rat and the mouse, trials have begun that aim to achieve an understanding of how the expression of mammal genes of known sequence might be manipulated by the interplay of interfering rnas (chapter iv- . ). the treatment of viral infections such as aids or hepatitis b, which are worrying public health problems, could benefit from this new technology. it appears that interfering rnas have much more to give to us in the near future than they have taught us up until now. the deciphering of the genetic code in the middle of the s was the end of a first step in elucidating the mechanism by which a sequence of nucleotides in dna is translated into a sequence of amino acids in a protein (chapter iv- . ). during the years that followed, the subtleties of the transcription of dna into messenger rna and of the translation of messenger rna into protein via transfer rnas were explored in hundreds of laboratories around the world. particular attention was paid to the understanding of how a given amino acid is activated and bound to a transfer rna (trna) after being picked up by an aminoacyl-trna synthetase. nevertheless, the idea remained of a code in which triplets of purine and pyrimidine bases of messenger rnas are translated into natural amino acids. recently, methods have been developed that give more flexibility to the action of the aminoacyl-trna synthetases, or, in other terms, relax their specificity . synthetases that have been manipulated in this way are able to recognize non-natural amino acids and to incorporate them into proteins by working together with the ribosomal machinery. it is in this way that, at the time of writing, around thirty non-natural amino acids, obtained by insertion of different types of chemical residue (photoactivable, fluorescent or radioactive residues capable of acting as probes for structural and functional analyses) (figure iv. ) have been incorporated into protein structures. with such an innovation, an unexpected field of exploration has opened up to research in domains as far apart as pharmacology and the science of evolution, giving rise to burning questions: could such non-natural proteins have therapeutic properties? could they give a selective advantage to the organisms that host them? with the addition of non-natural amino acids to the genetic code and the demonstration that proteins containing such amino acids can function in living cells, in sum, with the transgression of the potentialities of the natural genetic code, the experimental method appears to challenge the order of living beings. the triumph of genetic engineering via the study of dna is not unique to biology. many other sectors are undergoing changes in their type of experimental approach, dictated by the technosciences and making use of computer sciences, robotics and high-throughput screening. however, given the many questions that its operation continues to raise, and its central position at the heart of scientific ethics, the study of dna remains a typical example of the way in which the experimental life sciences and the techniques that underlie them are evolving nowadays. "i perfectly agree that when physiology is sufficiently advanced, the physiologist will be able to make new animals or plants, as the chemists produces substances that have potential, but do not exist in the natural order of things." more than a century after claude bernard predicted a genetically manipulated world, it has come to pass. the molecular biologist, having original, highperformance methods for "tinkering" with dna, has moved on to the application and use of his technical expertise for utilitarian ends. during the s, with transgenesis, research on bacteria (chapter iv- . . ) opened up a new biological domain, that of genetically modified organisms (gmos). results led to predictions that it would be possible to transfer a fragment of dna corresponding to a gene of a certain species into the genome of another species and have this foreign gene express itself as a protein in the host cell. in , the successful trans-genesis of a gene for resistance to an antibiotic, kanamycin, in tobacco plants, signaled the beginning of the technology of the first plant-type genetically modified organisms, still called gmps (genetically modified plants). in , the birth of dolly the ewe unveiled the era of reproductive cloning in mammals, i.e., the identical reproduction of an already-existing organism. an additional step was taken with the first tests on the differentiation of embryonic stem cells towards different types of lines that are characteristic of well-defined tissues, such as nerve tissue, cardiac tissue or the hepatic parenchyma, thus opening up promising perspectives in regenerative medicine. the frontiers of the experimental method continue to be pushed back to the limit of what is feasible and sometimes into the realm of fiction, as in immunology, for example, with the idea of xenotransplantation, using "humanized" animal organs. given the universality of the genetic code, any gene that is introduced into the genome of a plant, whether that gene is of animal, plant or microbial origin, is able to replicate itself and be expressed as a specific protein. thus plant gmos or genetically modified plants (gmps) are able to express specific foreign proteins from another plant, a bacterial microorganism or an animal organism. in the s, a short time after fundamental research had revealed the feasibility of plant transgenesis, the first transgenic plant, the flavr savr tomato, was marketed in the usa. since this time, numerous other plant gmos have been cultivated on a large scale and become available on the world market, including corn, soya, rice, cotton and the poplar. one of the desired aims is to produce modified plants that are able to resist destruction by the herbicides that are commonly used to eliminate weeds, while another is to prevent predation by harmful insects. in the first case, transgenesis involves the insertion of a herbicide-resistant gene, and in the second case, the inserted gene codes for an insecticidal toxin. recently, plant gmos that produce proteins with a therapeutic effect have appeared, ranging from antibiotic peptides to antibodies or proteins as unexpected as human hemoglobin. current projects aim to create plants that are resistant to adverse conditions such as the dryness of arid climate zones. the preferred procedure for producing a plant gmo is to use a bacterium, agrobacterium tumefaciens, a microorganism that is able to insert fragments of its own dna into plant cells ( figure iv . ). the useful gene that we wish to transfer into the plant may be a gene for resistance to a pesticide such as glyphosate, marketed under the name of roundup, phosphinothricin (basta) or glufosinate (liberty). the plasmid is reintegrated into a. tumefaciens. during infection of a plant cell by a. tumefaciens the t-dna carrying the gene of interest is inserted into one of the chromosomes of this cell. the ti plasmid is isolated and cut using a restriction enzyme. a foreign gene, known as a gene of interest, is inserted into the t-dna of the plasmid. a plant is generated from a modified clone. all of its cells carry the foreign gene. transgene of interest in the case of the fight against insect predators, the useful gene is carried by a fragment of dna contained in the genome of the bacterium bacillus thuringiensis. this gene, called bt, expresses a toxin responsible for the insecticidal capability of b. thuringiensis. a current application involves the protection of bt corn with respect to the corn-borer, a devastating insect whose caterpillars are particularly destructive. another, more direct, gene transfer method, known as biolistics, involves bombarding plant cells with tungsten microbeads covered with modified dna. with the implementation of large-surface-area experimental fields and the first marketing of gm soya, in , the question of whether or not the advantages achieved with respect to crop yields are counter-balanced by risks for the environment and for consumers came to the fore. food risks could arise from the toxicity or allergenic power of artificially synthesized proteins. at the time of writing, this question remains unanswered, due to the lack of epidemiological studies carried out rationally over several years. when the first creations of gmos took place, the transfer of the gene of interest was carried out by means of the co-transfer of an antibiotic resistance gene. the transformed cells were selected according to the criterion of their resistance to this antibiotic, which involved a risk of dissemination of the resistance gene. this selection technique has been abandoned. in practice, it is difficult to evaluate the theoretical ecological risk of wild plants being invaded by genes that have been inserted artificially into gmos. as a precaution, zones used for experimentation of plant gmos are now surrounded by refuge zones, i.e., fields in which the same species of plants, in non-gmo form, are cultivated. there has been a much fiercer and completely legitimate debate concerning the presence of the terminator gene in seed from the first gmos marketed by the monsanto company in the usa. the terminator gene blocked germination of the seed from the cultivated plant, so it was necessary for the farmer to buy more seed from the company each season, thus creating a state of dependency. this technique is no longer in use, but the fact remains that most transgenic seed is patented, and therefore farmers who use such seed are dependent on the companies that posses this genetic know-how. the culture of plant gmos has spread around the world, covering more than a billion hectares of our planet, more than half of which are in the united states of america. this type of culture is used on a large scale for soya and in a less extensive way for corn, rape seed and cotton, but there are many other applications of plant transgenesis. among the countries that are actively involved we may mention argentina, brazil, canada and china, and more recently india, paraguay and south africa. while the policies of these countries are based on the fact that gmo products do not differ fundamentally from non-gmo products with respect to checks carried out a posteriori, and that there is thus no reason to prohibit them, european policy has taken refuge behind a principle of precaution, and it remains basically restrictive. although the moratorium on the culture and marketing of plant gmos that was put in place in was lifted in , mandatory labeling for any consumable product containing more than . % gmo remains dissuasive. the united states of america has refused to use such labeling. the worries that are aroused by plant transgenesis, which are often exacerbated by the diktats of ecology groups, must be analyzed in a reasoned manner. common sense and lucid thought dictate that the debate should be situated within a scientific perspective in which the main role is played by the experimental method in long-term applications. simple reflection leads us to think that with time parasites and self-propagating plants will develop a resistance to the most drastic treatments, as was the case for bacteria confronted with antibiotics. the perspective of an acquisition of uncontrolled resistances, which gives rise to so much passionate debate, is, in fact, only the first stage of a technology with promising applications. the mastery of plant transgenesis that was acquired through the first experimentations should, in fact, allow the emergence of plant gmos that are assigned to the production of molecules with therapeutic effects (drugs, vaccines, human proteins, vitamins…). in this domain, there have already been creations that include golden rice, which carries β-carotene, the precursor of vitamin a, banana plants that express a vaccine against hepatitis b and tobacco that produces human lactotransferrin and hemoglobin. if we just look at the production of golden rice as a palliative for vitamin a deficiency, it should be remembered that, in certain countries of our planet, this deficiency affects people's sight and is a frequent cause of blindness, that it generates problems with development and the immune response to infections, that it affects more than a hundred million children around the world and that it is responsible for the death of three million of them each year. if these plants are considered to be a material of choice for the production of proteins with a therapeutic effect, this is partly due to the yield of such crops over large surface areas, and also partly due to the low risk of transmission of viral pathogens to man, because of the species barrier, a risk that is less negligible when animal productions are involved. genetically modified plants are also potential factories for the manufacture of chemical products with an industrial impact, for example lubricants, perfumes and aromas. given the unpredictable outlets that plant gmos may have in human medicine and the different domains of the economy, plant gmo technology should be considered in a manner that is free of any pressure or passion, and, as far as the political authorities are concerned, it should be subject to appropriate measures to surround and protect certain strategic experiments. when looking at the worries being expressed by the european society, it should be remembered that the genetic inheritance of plants has never ceased changing, not only in the most of natural of manners, over millions of years, particularly with the mobility of transposable elements located in the genome, but also artificially, at the hands of farmers from ancient times onwards, with their methods of hybridization and selection. the nervousness of european authorities, showing an ignorance of basic scientific ideas, with the pretext of a principle of precaution, and sometimes political compromises that are exemplified by fluctuating and contradictory positions, runs the risk, in the short term, of causing their countries to lag disadvantageously behind the united states of america, which holds the majority of plant biotechnology patents. the principle of gene therapy is simple: the introduction of an appropriate gene into the cells of a patient who carries a mutation can correct the phenotypical consequences of this mutation, or, in other terms, cure the disease affecting the patient, or at least slow down its evolution. the technical difficulty involved in gene therapy is that of finding an appropriate vehicle or vector for the transfer of the gene and addressing it to an appropriate location in the genome of the host cell. the most commonly used vectors in human gene therapy are viral. a certain number of criteria are necessary for a transfer to be efficacious, including a high concentration of viral particles carrying the gene to be transferred (more than a billion viral particles per milliliter) and a good capability on the part of the foreign gene to be integrated into the host's genome. the patient's immune response remains a major worry in the use of viral vectors: at cell level it often leads to a proliferation of cytotoxic lymphocytes and, especially at humoral level, to the synthesis of antibodies directed against the viral proteins. in order to minimize its immune response, the genetic material of the viral vectors is modified. for ethical reasons, gene therapy is currently only applied to somatic cells, germinal gene therapy being rejected. somatic gene therapy has been experimented in the treatment of hereditary illnesses linked to hematopoiesis. one of the technical reasons for this choice is easy access to the progenitor cells of the bone marrow, with the aim of transfection. it was with this in mind that mouse gene therapy models were developed a few years ago. the sickle cell mouse is one of these models. human drepanocytosis (sickle cell anemia) is a serious disease that is caused by a mutation in the β protein chain of normal human hemoglobin a. the molecules of sickle cell hemoglobin s tend to aggregate and form fibers that obstruct the blood capillaries of the microcirculation. somatic gene therapy has been applied to these sickle cell mice. this involves an autograft of bone marrow hematopoietic cells transfected with a retrovirus hosting the gene coding for the β subunit of normal hemoglobin. encouraging results have shown the validity of this approach. in , a gene therapy protocol that had been applied with success to man was described by the group of alain fischer (b. ) and marina cavazzana-calvo at the necker hospital in paris (science, vol. , pp. - ). the purpose of this therapy was to bring about a long term remission in the case of an immune disease known as scid-x (severe combined immunodeficiency linked to a mutation on the x chromosome). because of their susceptibility to microbial and viral infections, babies who are affected can only survive in sterile rooms. they are known as bubble babies. in this illness, the hematopoietic progenitor cells of the bone marrow are unable to differentiate into t and nk (natural killer) lymphocytes because of a mutation that affects a cytokine receptor. previous experiments carried out on model mice show that scid can be corrected by in vivo transfer of the cytokine receptor gene into hematopoietic progenitors. the transfer of the gene of interest paired with a retroviral vector was carried out first in march , in two babies, one of them eleven months and the other two months old. progenitor cells from their own bone marrow, cultured and modified genetically, were injected into them. these were therefore autografts, without any risk of immune rejection. a remission of symptoms over a period of nearly a year, shown by the almost normal behavior of the babies' immune cells, encouraged the application of the same therapy to other babies. in total, ten babies were given this therapy. the enthusiasm that greeted the successes that were recorded was nevertheless tempered by fact that in the spring of , and again in the following year, a child who had undergone the gene therapy developed a leukemia characterized by an anarchical proliferation of lymphocytes, necessitating chemotherapy. these two occurrences were explained by the random character of the insertion of the gene of interest into the patients' genomes: insertion into a site close to a proto-oncogene had led to activation of this proto-oncogene and the proliferation of the lymphocytes. while the trial carried out at the necker hospital gave rise to great hopes, it nevertheless showed that there is still a long way to go before we achieve a targeted transfection of genes so that no undesirable consequences follow. here we have a typical example of the limits of an experimental method that is based on an in-depth technological know-how, but also on a still imperfect understanding of the complex arcana of the mechanisms that regulate the positioning and interaction of genes in the chromosomes of eukaryotic cells. this example highlights a harrowing ethical dilemma: should we not treat a patient whose illness is likely to be fatal, or attempt a therapy that may save the patient, without having any formal assurance of its success? an experimental medicine that has the power to modify the human organism via its genetic material is now able to take over from the experimental method that up until now operated on animals and plants. we can easily understand, given the progress that has already been accomplished and that which is to come in the domain of gene therapy, that the temptation will be great, in the future, to consider manipulations of the human germ cell genome as being licit, insofar as such manipulations make it possible to eradicate a handicapping defect in our descendents. at present, the idea of any attack on the germinal genetic inheritance has been rejected unconditionally on the basis of ethical considerations. nevertheless, the history of science shows that prohibitions that were once considered to be untouchable finish by being contravened. this was the case for abortion. in a text entitled why genetic engineering should continue its battle , james watson writes of his confusion when faced with a choice that is likely to become more and more insistent over the years: "dare we be entrusted with improving on the results of several million years of darwinian natural selection? or do the human germ cells represent on the contrary rubicons that geneticists will never dare to cross?" a mastery of the differentiation of stem cells and of cloning are two essential weapons in the biotechnological arsenal, the use of which for utilitarian ends, particularly in human medicine, gives rise to hope and disquiet, agreement and disapproval. at the beginning of the s, experiments carried out by the canadian biologists ernest mcculloch (b. ) and james till (b. ) attracted attention to the particular properties of cells in the bone marrow, the stem cells, which would subsequently be found in other tissues . the experimental protocol is simple. bone marrow cells from a mouse are injected into another mouse that has previously been irradiated in order to destroy its stem cells. the injected cells go to the spleen where they divide and form colonies that take the form of nodules of different sizes. the researchers realized that the cells of these nodules present differences in their potential for renewal, which is more or less rapid. they reinjected the nodule cells into mice from a second batch. the reinjected cells showed themselves capable of multiplying and generating several types of blood line. these observations suggest the presence in the nodules of progenitor cells that have a strong potential for self-renewal and self-differentiation. in the following years, these observations were confirmed and explained by two characteristic criteria of stem cells; self-renewal and differentiation into multiples cell lines with specific characteristics. from this point on, it was possible to understand the enigma of the amputated hydra in the experiments carried out by trembley, two centuries beforehand (chapter ii- . . ). we now understand why, like the hydra, organisms like the flatworm, the salamander, the starfish and the zebrafish are able to recreate an amputated or damaged part of their bodies. the hydra mobilizes stem cells that it has preserved since its birth. in the case of the salamander, regeneration involves the reprogramming of cells that have already been differentiated. like all stem cells, embryonic stem cells (or es cells) are able to self-renew and differentiate into the different types of known adult cell line, giving rise to different types of cell such as neurons, cardiac cells that are able to contract, or hepatocytes ( figure iv . ). this potential has led to the hope that es cells could be used in regenerative medicine. in a fertilized egg that has developed to the blastocyst stage, it is possible to distinguish a cell mass (inner cell mass, icm) which protrudes inside the blastocyst. the icm cells are removed and placed on a mat of irradiated (and thus unable to divide) fibroblasts that provide them with a support and nutrients (steps and ) so that they can proliferate. the stem cells arising from the icm cells, placed in a medium that has been specifically conditioned to provide cytokines and other biomolecules, are able to differentiate into various cell types (step ). at what stage of embryo development is it possible to remove es cells for experimental purposes? after fertilization by a sperm cell, the ovum undergoes a series of divisions that give rise to a microstructure, the blastocyst, the cells of which are called blastomeres. each isolated blastomere remains capable of producing an entire organism of fetus and placenta, by division and differentiation. at this stage, blastomeres are totipotent. five days after fertilization, the embryo has the form of a hollow sphere. an external layer of cells, the trophoectoderm, surrounds a cavity, the blastocele, inside which a small mass of cells, the inner cell mass, protrudes. from the beginning of the implantation of the blastocyt in the uterus, the trophoectoderm evolves to form the placenta. the cells of the inner cell mass take part in the process of differentiation that generates all of the tissues of the future adult organism. these are called embryonic stem cells (es cells). es cells are said to be pluripotent. isolated, they have lost their ability to give rise to a complete individual, but they have maintained the possibility of differentiating, according to their environment, into any of the two hundred cell types that make up animal tissues. during their division, es cells evolve from a stage of being pluripotent to a stage of being unipotent, passing through a stage of multipotency beyond a hundred cells. a state of multipotency characterizes cells that give rise to a restricted number of cell lines in the tissues in which they nest. this is the case for of the hematopoietic stem cells of the bone marrow that form the red blood cells and the white blood cells. the term unipotent refers to the progenitors, which give rise to a single type of cell, for example the hepatocyte of the liver or the cardiomyocyte of the heart. when es cells are cultivated for to days in a conventional nutritive medium, they multiply and aggregate. if the culture medium is supplemented with certain biomolecules such as insulin, retinoic acid, transferrin or fibronectin, the differentiation of the es cells is oriented towards cells of different types, such as neuron cells, glial cells or muscle cells. there are many publications about experiments concerning the grafting of differentiated stem cells in the mouse or the rat. for example, neuron precursors derived from the spinal cord or the brain are grafted into rats whose spinal chords have been injured. five weeks after the grafts are carried out, the transplanted cells have filled the area of the injury and differentiated into oligodendrocytes, astrocytes and neurons. what is more, after about twelve weeks, locomotive function has been partially restoredirradiated . other experiments involving the grafting of differentiated stem cells have been carried out on rats in which the dopaminergic neurons of the "substantia nigra" of the brain that secrete the neurotransmitter dopamine have been selectively destroyed by injection of -hydroxydopamine. the problems found in the rat as a result of this neuronal degenerescence mimic those found in man in patients suffering from parkinson's disease. dopaminergic neurons obtained by the differentiation of mouse es cells are grafted into the striatum of each of these rats, a region of the brain whose neurons communicate with those of the substantia nigra and play a fundamental role in the control of movement. this results in a significant improvement in the motor deficit, coupled with the establishment of functional synapses between the injected neurons and those of the host , . a recent publication bringing together the results of two french research teams, that of michel pucÉat (b. ) in montpellier and that of philippe menaschÉ (b. ) in paris, provides interesting information about how mouse embryonic stem cells, grafted into sheep cardiac tissue where an infarctus has been artificially induced, are able to colonize the infarct zone and regenerate cardiac contraction in a functional manner. moving from the mouse to the sheep constitutes a considerable species leap, and the absence of any immune rejection leads us to say that embryonic stem cells have an "immune privilege" . the use of es cells in regenerative medicine necessarily requires that their differentiation be regulated in an exhaustive manner into well-defined pathways, in order to produce homogeneous cell lines with a view to implanting them in damaged tissues. in fact, contamination with non-differentiated es cells is likely to cause tumors (teratomas) over the long term. the mastering of the use of es cell culture and differentiation, as well as of cloning, in such a way as to overcome problems of histocompatibility, is still in its infancy. for a long time, the mouse was the preferred animal model for experimental studies on the differentiation of es cells. in , the first es cells from mouse blastocysts were isolated and successfully cultured by two groups of researchers in great britain and the usa. it was only in that human embryonic stem cells (hes) were isolated for the first time and held in culture, on a nutritive layer of fibroblasts from irradiated mice. this delay with respect to the ability to culture animal es cells can be explained by the fact that the molecular machinery that activates replication and cell differentiation programs is not completely identical in man and the mouse . for example, a cytokine called lif (leukemia inhibitory factor), which is indispensable for the renewal of es cells in an undifferentiated state in the mouse, has no effect on human es cells. there are several other differences concerning the control of proliferation and differentiation in human and murine es cells by growth factors. briefly, the conclusions obtained from experiments carried while there is a highly promising future for the use of es cells, this future is littered with obstacles, and rigorous checks and balances need to be put in place. nevertheless, research on such cells is mandatory if we wish to move on to a regenerative medicine that aims to be a new frontier in the art of healing. after specific differentiation, hes cells could provide unlimited quantities of the tissues needed to replace damaged tissues responsible for handicapping illnesses (dopaminergic neurons in parkinson's disease, cardiomyocytes in myocardial infarction, pancreatic islets of langerhans cells in diabetes, fibroblasts in skin grafts, chondrocytes in rhumatoid arthritis). in addition, metabolic analysis of hes cells carrying defective genes whose phenotypical expression is known in human pathology should improve our understanding of the perturbed mechanisms, and could lead to pharmacological advances. as well as the technical difficulties involved, which have not yet been adequately overcome, the handling of hes cells is subject to much ethical debate in many countries, with those who object to it holding to their prejudices, which are linked to religious or cultural traditions. this is the case in france, where, nevertheless, a few timid dispensations had begun to appear at the time of writing. in contrast, in great britain, the law authorizes the isolation of hes cells for therapeutic purposes, using embryos of less than one hundred cells, produced by in vitro fertilization, and surplus to requirements. the british response to the burning question of whether an isolated hes cell may be considered as a potential human embryo is clearly "no", for, in order to be able to develop in utero, such hes cells would need to have the placental progenitor cells. an alternative to the use of es cells is to make use of adult stem cells. however, the proliferation capacity of adult stem cells is considerably lower than that of their embryonic homologues. the hematopoietic stem cell is the paradigm of the adult stem cell. it can differentiate into all known types of cells. in the last decade of the th century, several publications concerning the plasticity of the adult stem cell awakened a hope that these cells could transform the treatment of degenerative illnesses. certain of these publications stated that adult bone marrow stem cells, implanted into different types of tissues, differentiate into hepatocytes, cardiomyocytes or neurons, depending on the specific environment. careful re-examination of the techniques used revealed that, in certain cases, interpretation of the results as showing cell transdifferentiation was an erroneous one, and that the fusion of the bone marrow stem cells with cells from other tissues was a more plausible explanation. in any case, while not ignoring the use of adult stem cells, experimentation on hes cells remains a judicious choice, given our current state of understanding. in france, the law application decree that was issued on the th of february , revising the restrictive bioethical standards of , opens up the possibility of using human embryonic stem cells for scientific purposes, with certain ethical reserves being maintained. one of the obstacles to the stabilization over time of a stem cell graft in a receiver involves the phenomenon of rejection for reasons of histocompatibility. considered to be foreign by the receiver (host), grafted stem cells coming from a donor are rejected. this obstacle could be overcome by using the technique of cloning. based on experiments on several animal species, it is now accepted that the transfer of the nucleus of an adult somatic cell from a host into an enucleated oocyte makes it possible to obtain from this oocyte, which is once again nucleated, and which is the equivalent of a zygote and able to divide, es cells whose genome is identical to that of the host. because of this, the es cells are immunologically compatible with the tissues of the host. in man, such cells could be directed by differentiation towards stable cell lines creating well-defined tissues and organs (liver, muscle…) that could be used in regenerative medicine. this is the principle of therapeutic cloning. in march , korean veterinary researcher woo suk hwang (b. ) and his co-workers, who were recognized experts in animal cloning, announced in the american review science that they had succeeded for the first time in obtaining around thirty human blastocysts by cloning, i.e., by the transfer of nuclei of somatic cells into enucleated ova. this first experiment involved autologous cloning (ovum nuclei and enucleated somatic cells taken from the same woman). hwang and his team used ova, and the yield from the experiment was close to that obtained at that time for the cloning of mammals. using the inner cell mass of one of the blastocysts, they isolated a line of embryonic stem cells able to maintain a normal karyotype after several dozen divisions. this publication, which appeared in a highly prestigious scientific review, triggered an enthusiasm in the media that was in keeping with the spectacular nature of the team's exploit, tempered here and there by a few comments that were mainly linked to questions of medical ethics. in , there were numerous other articles by the same team on the same subject, reinforcing the first results with a heterologous cloning technique (ovum nuclei and enucleated somatic cells taken from different people), thus giving rise to great hopes that the era of regenerative medicine was near. at the beginning of , professor hwang's retractation of all his work, and a public confession of a spectacular fraud, were even more dramatic, offering certain media an occasion for a disproportionate level of fury against therapeutic cloning. however, despite such rear-guard combats, it is obvious that one day these technical difficulties will be overcome. human cloning, in order to obtain stem cells for therapeutic purposes, cannot escape the future. once this aim has been achieved, it will be spoken of as the outcome of a long story. the adventure of animal reproductive cloning began in . in developmental biology , two american researchers, robert briggs ( - ) and thomas king ( - described experiments involving the transfer of cell nuclei of embryos from a frog (rana pipiens), at the blastula and gastrula stages, into enucleated eggs of the same species. a high percentage of the clones obtained in this way were able to reach the tadpole stage when the transferred nuclei came from the early blastula stage, but only mediocre success was achieved when the nuclei came from the later gastrula stage. these experiments emphasized both the totipotency of the embryo somatic cells and the equivalency of the somatic cell nucleus and the nucleus of the fertilized egg in cell division and differentiation. briggs and king's publication did not arouse any particular interest. it is true that the s were dominated by the saga of molecular biology, which would reach its culmination in the deciphering of the genetic code. from the s onward, the first attempts to clone mammals (rat, mouse, pig) began. moving from the amphibian egg, which was a millimeter wide, to a mammal egg that was one hundred times smaller, presented a technical difficulty that would be overcome by a technique of cell-to-cell electrofusion. cloned embryos were thus obtained by nuclear transfer and then implanted into the uterus of a surrogate female. however, in all cases, the nucleus came from embryo cells. in february , the announcement made by ian wilmut (b. ), keith h. campbell (b. ) and their collaborators at edinburgh's roslin institute of the birth of the cloned lamb dolly had an immediate effect in the media. in fact, this was not only the cloning of a higher mammal, but, above all, the cloning by insertion of an adult somatic cell, in this case a mammary tissue cell, into an enucleated oocyte. this went far further than the experiment carried out by briggs and king, which essentially involved the transfer of embryo cell nuclei into enucleated frog eggs. the trick that gave wilmut and campbell their success was to bring the cells providing the nuclei to a quiescent state corresponding to the interphase stage of the cell cycle, by impoverishing their culture medium, before electrofusion with enucleated oocytes. although we should be aware that attempts were made before a positive result was achieved, this does not make it any less astonishing that the nucleus of a cell in its adult state, i.e., completely differentiated, was able to behave as if it were totipotent. despite being committed to a program of differentiation that is considered to be more-or-less irreversible, and which will give it a specific identity, the nucleus of an adult cell can be reprogrammed and become totipotent. since dolly, many other mammals have been cloned from nuclei of adult cells; mice, cows, goats, pigs, rabbits, cats, dogs, rats and horses. as far as ethical discussion about cloning is concerned (chapter iv. ), it is essential to note that the demarcation line between reproductive cloning and therapeutic cloning is situated where decisions are made concerning the destiny of the cloned blastocyst ( figure iv . ). the transfer of the nucleus of a somatic cell (liver, epidermis, muscle) containing n chromosomes into an enucleated oocyte gives rise to an egg ( n chromosomes) that is able to divide and to produce a blastocyst. the cells of the blastocyst inner cell mass (icm) can be used as stem cells that can differentiate into different types of cell line (therapeutic cloning). on the other hand, if the whole blastocyst is implanted into a uterus, it will produce an embryo which, after birth, will grow into an adult animal (reproductive cloning). reproductive cloning and therapeutic cloning therefore differ because of the fact that in reproductive cloning, the whole blastocyst is used, while in therapeutic cloning, only certain cells, corresponding to the inner cell mass (icm) of the blastocyst, are used. the structural and functional identity of the cells of a given tissue in an adult organism involves a basic mechanism: while each cell has the same set of genes, only some of the genes are expressed as proteins and the genes that are expressed differ according to the tissue involved. the key to the mechanism responsible is in the epigenetic type chemical modifications of cell dna, for example methylations, which repress the expression of certain genes without altering the expression of others. these modifications of the dna, which control cellular specificity (muscle, liver, brain…) are not very reversible, but, in certain circumstances, they can become so. this is what happens from time to time when the nucleus of an adult cell is inserted into an enucleated oocyte. we are thus able to assume that in the molecular arsenal of the oocyte cytoplasm there are substances that can cancel the epigenetic modifications of the dna present in the nucleus of an adult somatic cell and recreate a state of pluripotency in this nucleus, or, in other words, provoke the reprogramming of the somatic cell nucleus. in the long term, it is to be hoped that biochemical technology will be able to find and purify the molecules responsible for the nuclear reprogramming of somatic cells. the use of human oocytes for the purpose of therapeutic cloning is still subject to severe criticism. certain groups wish it to be prohibited, because of a fear of a drift towards reproductive cloning. to obviate this risk, the idea has been to make use not of human oocytes but of those of animals, transferring the nuclei of human somatic cells into them. even supposing that the technical difficulties involved could be overcome, the cells that would result, a sort of man-animal chimera, would also be the subject of an ethical debate, even if the purpose of this type of cloning were to be solely therapeutic. some japanese researchers have succeeded in creating mice according to a parthenogenetic process that involves adding the nucleus of an oocyte that is haploid ( n chromosomes) to another haploid oocyte, the result being the equivalent of a fertilized egg ( n chromosomes). this exploit is achieved by the invalidation of one of the genes (h ) involved in the control of the parental imprint. it is known that sexual reproduction in mammals involves a phenomenon called the parental imprint, which, by means of the methylation of dna and perhaps also of histones, allows the expressing or silencing of certain genes in male and female gametes. a single copy of a given gene, originating either from the oocyte or from the sperm cell, is therefore expressed, while the other is inactive. in the japanese experiment, if the mouse h gene had not been invalidated, the result would have been an anarchical development of the responsible genes involved in the parental imprint with overexpression in the case of some and an absence of expression in the case of others. these disturbances would have been incompatible with the viability of the embryo. however limited its application might be, the manipulation of the germinal genome poses the problem of the mechanism by which the parental imprint intervenes in the viability of the egg, a parameter that at the time of writing is still not completely understood, but is being actively explored. in boston, massachusetts, in , a kidney was transplanted from a healthy boy into his twin brother, who was suffering from a fatal renal anomaly. the success of this graft ushered in the era of transplantations of such organs as the heart, liver and kidney in man. in order to try to prevent the rejection of grafts, caused by an immune incompatibility between the receiver and the graft from the donor, different immunosuppressing treatments were tried, one by one, involving corticosteroids or cytostatic agents such as -mercaptopurine. in the s, a decisive step forward was made with the fortuitous discovery of the powerful immunosuppressive effect of the cyclosporin a, a cyclic polypeptide isolated from the mold tolypocladium inflatum. each year, human organ transplants into patients make it possible to save many lives. however, for some time now, organ transplantation has been suffering from penury of donors. one alternative to the homograft is the grafting of animal organs, or the xenograft, and, at the dawn of the st century, this type of graft has entered an active, promising phase, with the creation of pigs that have been partially "humanized" and are thus, as a consequence, immunocompatible. for reasons of genetic and physiological similarity, the first choice for such grafts was to use apes or monkeys. however, this idea was quickly abandoned, for several reasons; a non-negligible risk of viral infection due to the phylogenetic kinship of human and simian species; a slow growth rate; a low reproduction rate and, finally, laws that protect primates. these disadvantages are not found, or are at least minimized, in the pig: the risk of a viral infection passing from the pig to man should be low because of the species barrier (but nevertheless, it ought to be evaluated), the pig growth rate is relatively rapid, pig litters are large and pig organs are of a size close to those of man. hyperacute rejection of grafts is the critical obstacle that must be overcome before it is possible even to envisage the feasibility of xenotransplantation. hyperacute rejection is caused by the presence in man of natural antibodies (xenoantibodies) that accumulate throughout a lifetime and are directed against antigenic motifs carried by the products of the digestion of food or dust that is breathed in. xenoantibodies are mobilized when a xenograft occurs, and when they combine with xenoantigens brought by the graft this activates immune proteins such as the complement proteins. the catastrophic effect of this xenoantibody/xenoantigen combination is a vascular thrombosis followed by necrosis and rejection of the graft. the pig xenoantigen that is considered to be the one mainly responsible for the phenomenon of rejection in man is a sugar molecule, galactose α- , -galactose, located on the plasma membrane of endothelial cells. synthesis of this molecule requires the enzyme α- , -galactosyltransferase, which is present in most mammals, but absent in man and the primates. this enzyme disappeared in man around twenty million years ago, following a double mutation of the gene. in , cloning by nuclear transfer, associated with the invalidation of the gene coding for galactosyltransferase, made it possible to create pigs without galactose α- , -galactose . this performance shows that the xenotransplantation objective, although it can only be envisaged over the long term, is not based on false hopes. plant gmos, gene therapy, embryonic stem cells, therapeutic cloning, and xenotransplantation are a few of the many examples that show how far experimentation on living beings has progressed in just a few decades, from inquiries into the operating mechanism of an organ or a cell, in the interests of pure understanding, to a programmed process, planned with an objective in mind, the chances of success of this objective being analyzed and counted in terms of impact and cost-effectiveness. during the renaissance, ecclesiastical authorities, worried by the libertarian forces that were assailing them, applied the brakes to audacious questioning of dogma such as the circumterrestrial revolution of the sun that had, since ancient times, placed man at the center of the universe. nowadays, civil authorities, conscious of the potential but also of the possible misuses of genetic manipulation, insist on having the right to oversee such procedures. in truth, since the th century, governments have been interesting themselves in research on living beings and encouraging it, as long as its applications have allowed improvements in human health. this has been the case for vaccinations against infectious diseases or for prevention of microbial infections by means of aseptic or antiseptic methods. with the breakthroughs made in genetic manipulation at the end of the th century, it was more than just the results of experiments on living beings that attracted the attention of the political authorities, it was, above all, the manner in which the experimental method, with all its hazards, made use of living material, sometimes of human origin, in order to unlock mysteries. conscious of the social impact of emerging discoveries that are subject to considerable media coverage and are sensationalized in both the written and audiovisual media, the state, with the help of researchers and philosophers, has laid down a code of bioethics, applied through strict or even restrictive legislation. it remains to be seen whether the rules of this code will continue to be an inviolable absolute or will be modified according to the evolution of the moral codes and the cultures of nations. university teaching and the education of a society must now take into account not only the content of successive discoveries, but also the fallout of these discoveries, insofar as they concern man, and even the ethical justification of the methods that have allowed these discoveries. in "remaking" living beings according to imposed norms, and in scheduling, in a certain fashion, the manufacture of life according to new codes, certain questions move from the "how" to the "why", i.e., from the scientific domain that is accessible to human thought to the metaphysical sphere, with its problems of the limit of what is surmountable and tolerable in terms of ethics. in his birth of predictive medicine, jacques ruffiÉ ( - ) reminds us that medicine has evolved through three stages over the course of time: curative medicine, which has been practiced since ancient times and is still being practiced; preventive medicine, which is more recent, and is designed to prevent people from falling ill, either by vaccinating them, in the case of infectious diseases, or by recommending an appropriate diet and medication in the case of metabolic disorders such as diabetes or arterial hypertension that have been detected by means of systematic examination; and, finally, predictive medicine, a branch of medicine that is still in its early phases, and which is based on modern technology and is able to predict situations of risk because of anomalies detected in the genetic inheritance or because of exposure to environments that are reputed to be dangerous (for example, carcinogenic smoke, asbestos). about one and a half centuries ago, the publication of the introduction to the study of experimental medicine ( ) provided proof, based on scientific arguments, that the time had come to transfer the experience that had been acquired through the experimental method practiced on animal models to the ill person. after claude bernard, attentive to the progress made by ideas and techniques in the physical and chemical sciences, and making use of its own advances in the understanding of the living cell, both normal and pathological, experimental medicine was to live through a development that was without precedent in the history of humanity. to understand the causes of epidemics, nutritional deficiencies, metabolic deviations of hereditary origin and degenerative illnesses, and then to translate these causes in cellular and molecular terms, this was the process undertaken by medicine once it began to use the experimental method. in fact, for several decades, from the beginning of the th century, medicine had already undergone some major revisions of outdated practices and had inaugurated a new era in diagnosis. for example, the differential diagnosis of pulmonary ailments became possible because of the invention of the stethoscope by rené laennec ( - ) and the practice of auscultation and percussion by joseph skoda ( - ), the uncontested master of the vienna school. in france, pierre louis ( - ) used statistical methods to evaluate the efficacy of different treatments. armand trousseau ( - ), a pupil of pierre bretonneau ( - ) wrote a famous treatise on the hôtel-dieu medical clinic in paris. in great britain, chronic nephritis, with its identifying symptoms, was described by richard bright ( - ), paralysis agitans by james parkinson ( - ) and addison's disease, which affects the adrenal glands, by thomas addison ( - ). during the th century, many other famous names signaled the arrival of a medicine that was resolutely anatomoclinical in nature, in line with bernardian doctrine. "experimental medicine is thus a medicine that claims to understand the laws of the organism in sickness and in health, in such a way that it not only predicts phenomena, but also in such a way that it can regulate and modify them, within certain limits." in the introduction to the study of experimental medicine, claude bernard stigmatizes the relics of an empirical medicine that was still being practiced in his day and was forgetful of rationalism. the terms that he uses are without leniency: "i have often heard doctors who, when asked the reason for a diagnosis, reply that they don't know how they recognize such a case, but it is obvious, or who, when asked why they administer certain remedies, reply that they don't really know how to put it exactly, and that anyway they are not required to give a reason, because it is their medical tact and their intuition that guides them. it is easy to understand that doctors who reason in this way are denying science. what is more, it is impossible to be too forceful in rising up against such ideas, which are bad not only because they stifle any scientific seed in the young, but also, above all, because they favor laziness, ignorance and charlatanism." in order to evaluate the meaning of these words, it should be remembered that in claude bernard's time, the medical profession was far from considering the microscope as a useful instrument for the study of cell structures and that the cause and effect relationship between bacterial germs and infections was still to be shown. with the development of increasingly effective instruments for exploration, and of methods for microanalyses concerning a wide range of blood and humoral constants, throughout the th century, medicine, which was once empirical, has now become scientific. claude bernard's dream, experimental medicine, is now operative. this medicine is no longer content simply to determine the cause of an illness and to locate the affected organ, which was the major objective of clinical medicine, but it seeks to detect the mechanisms of pathological processes by means of histological and physicochemical explorations. this medicine is no longer willing to passively monitor the evolution of an infectious disease. after having identified the responsible germ, it tries to target this germ with the chemical weapon that is able to selectively destroy it. this medicine is no longer content simply to find remedies, it aims to understand the mode of action. it sets itself the goal of meeting challenges such as finding the genetic cause of degenerative illnesses or of cancers and developing appropriate therapies. it is supported by statistical data. when a new drug is implemented, the results are now evaluated by the double blind method: neither any of the patients (treated and non-treated) nor any of the investigators are aware of who has been administered with the drug and who has been administered with a placebo. in the surgical domain, audacious techniques have also led to considerable progress, particularly in neurosurgery and in cardiovascular surgery. thanks to robotics and to computer technology, remote surgery or telesurgery has become practicable, although up until not that long ago, it was only to be found in fiction. faced with emerging problems in public health, the task undertaken by experimental medicine is immense. in the middle of the th century, the spectacular recovery from high-incidence infectious diseases such as pneumococcal pneumonia, meningococcal meningitis or acute forms of tuberculosis, which that was brought about by antibiotics, gave rise to the idea that medicine had won a battle against the microbial world and that, from then on, it would be able to control the evolution of infectious diseases and to offer rational treatments. the gradual appearance of a microbial resistance to antibiotics has brought an end to this euphoric era. penicillin, for example, which was put on the market at the end of the s, was active on practically all strains of staphylococcus aureus. sixty years later, more than % of the strains of this same microbe are resistant to penicillin. the incidence of nosocomial infections, which are contracted in health care facilities, never ceases to rise. at present, around % of the hospitalizations that take place are complicated by the patient developing a nosocomial infection. equally worrying are the re-emergence of diseases that were once considered to be under control, such as tuberculosis or poliomyelitis in africa, and the emergence of new diseases such as aids, whose hiv virus (human immunodeficiency virus), which was identified at the beginning of the s, has generated a pandemic that has spread throughout the planet. infectious diseases are currently responsible for more than a quarter of human deaths. the koch bacillus responsible for tuberculosis and the pneumococcus kill three to four million people a year, around the world. in , hiv killed more than three million people, and more than forty million people are infected. one person is infected every seconds. in viral diseases, the role of vectors (insects, various animals) as well as the notions of contagiousness and aggressivity have been emphasized. we have only to remember the dreadful contagiousness and aggressivity of the spanish flu virus (influenzavirus ah n ) which, in - , killed more human beings around the world than the first world war that preceded it. in contrast, the sars (severe acute respiratory syndrome) epidemic of , the vector of which was doubtless the civet, a small carnivore raised in china and desired for its meat, was rapidly contained because of its low contagiousness and also because of the isolation measures that were taken. human behavior is not without its effect on the emergence of viral diseases. the growth in intercontinental travel and human migration, as well as intensive deforestation in africa and south america, which bring virus vectors into contact with man, are factors concerned in the emergence of viral diseases that risk being explosive and devastating. in this context, the history of the ebola virus and of the marburg virus, which cause violent hemorrhaging, is edifying. in , in the german village of marburg, an epidemic of unknown origin broke out, the illness manifesting itself with brutal suddenness by vomiting, diarrhea, a high fever and an increased tendency to bleed. this pathology, which was contained rapidly by means of drastic isolation measures, was found to be of viral origin. the pathogen concerned was a filovirus (filiform virus). a brief enquiry showed that the origin of the epidemic was contact between technicians of a pharmaceutical company and monkeys that had been imported from uganda and that were carrying the virus. in , two other epidemics, characterized by severe and often fatal hemorrhagic fevers, were reported in the sudan and the republic of the congo. here again, the illness was caused by a filovirus, the ebola virus. at the time of writing, only public health organizations, including the nih (national institutes of health) in the usa, have attempted to set up vaccination and therapeutic strategies. research on these dangerous viruses requires high security installations that are particularly costly, so that private companies are reticent about investing in work that is only targeted on poor regions and which concerns epidemics that have so far been contained successfully, although one day the ebola and the marburg virus could quite well escape their african niches. experimental medicine must also understand the colossal challenge of the five thousand hereditary diseases that are currently listed, the most handicapping of which are myopathies and neuropathies. given the means that are available to the contemporary clinician in order to assign each of these diseases to a genetic defect, one can only be amazed by the mass of information about them that has accumulated over a century, since the first diagnosis of a hereditary disease, alcaptonuria, which was made in by archibald garrod ( - ), a doctor at london's st bartholomew's hospital. alcaptonuria is a non-serious genetic flaw that can be detected easily by a blackening of the urine. it is the result of a blockage caused by the mutation of an enzyme involved in the catabolism of an amino acid, tyrosine, this blockage leading to the accumulation of homogentisic acid, the polymerization of which gives rise to a brownish color. the patient examined by garrod was a young boy. investigation of the family history revealed that transmission of the flaw was correlated to cross-cousin marriages and followed mendel's laws for recessive traits. garrod demonstrated other hereditary-type anomalies, cystinuria, porphyria and pentosuria. in , these observations were published in a work that became a classic: inborn errors of metabolism. in , the specific molecular defect of a metabolic anomaly linked to a mutation was identified for the first time by the german-born british biochemist vernon ingram. this was the hemoglobin defect responsible for drepanocytosis or sickle cell anemia: a glutamic acid in the β chain is replaced by a valine. the consequence of this simple change is a modification of the structure of the hemoglobin, leading to a sickle-shaped deformation of the red blood cells, the increased fragility of these cells and also a tendency towards cell lysis. this discovery made use of the electrophoresis and chromatography techniques that had just been introduced in biochemistry (chapter iii- . . ): such a discovery would not have been possible without these techniques. because of the progress made in molecular biology, the nosological framework of hereditary diseases has been greatly enriched over the last twenty years. for example, at present, more than one hundred hereditary-type myopathies have been identified by accurately locating molecular lesions in the genomic dna and characterizing the structural and functional modifications of the mutated proteins. certain health problems present real challenges for experimental research. this is the case for the spongiform encephalopathy caused by a prion (proteinaceous infectious particle) , which has all the more impact on the imagination because its etiology remains a mystery. it is also the case for degenerescence of the central nervous system correlated with aging, alzheimer's disease being a striking example, although, as far as familial forms of this illness are concerned, i.e., those of the hereditary type, it has been possible to link the invasion of the brain by a so-called amyloid peptide, which accumulates in plaques, on the one hand, and, on the other hand, the absence, due to a mutation, of an enzyme, a peptidase, which normally degrades the amyloid peptide. contemporary scientific medicine sometimes acquires a revolutionary aspect. here again, as with other disciplines involved in the study of living beings, it arises from discoveries resulting from the principle of serendipity (chapter iii- . . ). this was the case when, in january , a team in grenoble, france , led by the neurosurgeon alim-louis benabid (b. ) and the neurologist pierre pollack (b. ) discovered by accident that in patients affected by parkinson's disease a beneficial effect was achieved by deep, high-frequency electrical stimulation of the brain. the three major symptoms of parkinson's disease are muscular rigidity, a tremor when at rest and a slowing down of the execution of movements. in the s, the swedish team of arvid carlsson (b. ) , who won the nobel prize for physiology and medicine, demonstrated a relationship between the parkinson syndrome and a deficit in the secretion of a neurotransmitter, dopamine. a group of neurons that is limited to half a million (of the billion contained in the brain) produces this neurotransmitter in a small structure located in the midbrain, called the substantia nigra. the neurons of the substantia nigra have elongations that interact with different nerve formations (called nuclei) including the subthalamic nucleus. in , bergman et al. published an article that describes a curious relationship between a provoked lesion of the subthalamic nucleus and the disappearance of the signs of parkinson's disease in a monkey which had been made parkinsonian by chemical treatment. this publication led the team in grenoble to target their electrical stimulation on the subthalamic nucleus. this was completely successful. this electrical stimulation procedure, which is now well-codified, involves using stereotactic neurosurgical techniques, controlled by magnetic resonance imaging, to implant an electrode into the subthalamic nucleus. the electrode is connected to a generator that is implanted under the patient's clavicle. the generator sends brief electrical impulses of frequencies from to hz. under the effect of this stimulation, the characteristic symptoms of the illness, particularly the static tremor and bradykinesis, regress in a spectacular manner. the mechanism by which this stimulation acts is not yet understood. no doubt this has to do with complex phenomena involving the inhibition of certain neuron relays near to the substantia nigra, which remain to be deciphered. here we have a typical case of a progression from an experimental fact, discovered by accident, towards the analysis of its cause. from the point of view of the experimental method, it is interesting to make parallels between this discovery by serendipitous means of the beneficial role of electrical stimulation of the midbrain in parkinson's disease and cartesian style programmed research that aims to graft into the brain of parkinson's disease sufferers embryonic stem cells differentiated into dopaminergic neurons . civilian society and its armed force, the political authorities, have understood that experimental science has the tools, the method and the thought processes necessary to develop strategies for prevention and healing. immune cells of this person and induces the synthesis of immune proteins. finally, it seems relatively certain that hopes concerning gene therapy for hereditary diseases will be fulfilled within before too long (chapter iv- . . ). one of the traits that is characteristic of the period we live in, and which arises partly from the economic stakes involved, is the shortening of the time that elapses between a discovery being made and the application of that discovery. for example, interfering rnas, which were discovered in the s (chapter iv- . . ) are already the subject of therapeutic investigation. more than a hundred biopharmaceutical companies around the world are using them with a view to producing drugs from them . in mice, a certain number of synthetic interfering rnas have proved their efficacy in silencing genes which, following mutation, have acquired carcinogenic potential. however, the use of interfering rnas as therapeutic agents requires them to be stabilized, because they are fragile molecules. the group headed by achim aigner (b. ), at the school of medicine in marburg, germany, managed to stabilize a synthetic interfering rna by complexing it with polyethyleneimine, and this interfering rna is able to block the expression of a receptor involved in cancerization (c-erbb /neu(her- ) receptor). used in mice, such a drug appears promising. despite the undeniable progress that has been made, experimental medicine is still some way from finding solutions to some of the enigmas that it meets along the way, and which underline the complexity of living beings. some time ago, it was thought that after having invalidated a gene coding for a protein that is indispensable to a function, we would discover the secret of a cause-and-effect relationship. experimental practice has shown that, generally, this is far from being the case. another example of the complex relationships that exist in living beings is the interference of the mental and the organic. one experiment that suggests this interference was carried out on mice who had acquired a form of pathology similar to huntington's chorea, by transgenesis. mice from the same line were separated into two batches, one acting as a control, and the other being subjected to daily mental stimulation, including memorization tests. unexpectedly, the appearance of symptoms was noticeably slowed down in mice who had been subjected to mental gymnastics , as if the brain, by intentionally mobilizing its neuron activity, was able to secrete substances able to alleviate its own defects. in short, by means of possible retroactive mechanisms that are called upon by the mind, the brain appears to act as actor and spectator. at the turn of the st century, experimental medicine was being nourished by techniques inherited from experimental physics, chemistry, and even mathematics and computer technology, in the same way as the other sciences of living beings. the progress made in medical imaging techniques has been particularly impressive since the time, at the end of the th century, when the x-rays discovered by wilhelm rÖntgen made it possible to view the structure of the human skeleton. the saga of x-radiation continued through the th century (chapter iii- . . ). for the last few decades, new imaging techniques have come to the fore. they have spread rapidly, and been refined. ultrasound imaging, which is based on the principle of the reflection of ultrasound waves off of different kinds of surfaces, has become an everyday technique for viewing blood flow in blood vessels and the heart. however, it is mainly in the study of the brain that medical imaging has benefited from technical advances in the domains of physics and computer technology, and it has been innovative in assigning cognitive activities to well-identified anatomical structures. this functional neuroanatomy makes it possible, in a non-trauma-inducing manner, to monitor and locate the operation of neuron networks with great temporal and spatial precision, during various cognitive tasks such as reading and the written or oral expression of thought. the middle of the th century saw the gradual development of two methods for exploring zones of cerebral activity, electroencephalography and magnetoencephalography. at present, these techniques are being taken over by mri (magnetic resonance imaging) ( figure iv. ) . the principle of mri is based on the detection of hydrogen nuclei and their differentiation according to their environment. functional mri leads to the location of the areas of the brain that are active during calculation exercises, the perception of sounds, language and objects, and memorization, with a resolution of just a few millimeters. its power of exploration is such that it has been possible to analyze the brain response, in sleeping or awake babies who are only three months old, to auditory stimuli from language that either makes sense or does not make sense . the response, located in the left hemisphere and the prefrontal cortex, leads to the conclusion that, from the first months of life, there are zones of the brain that are potentially active before the first attempts at language appear. both in france (cea-saclay and the frederic joliot hospital at orsay) and abroad, recent mri performance has encouraged projects concerning the manufacture of instruments able to produce magnetic fields of around ten teslas, which allows an unequaled definition in the identification of areas of the brain assigned to specific cognitive functions and in the highly accurate determination of the location of pathological lesions. a technique that is complementary to mri is positron emission tomography (pet). this generally uses water labeled with oxygen ( o), a radioactive isotope of natural oxygen that has a very short lifetime ( s), produced extemporaneously in a cyclotron by bombardment of an n target with protons. the radiolabeled water is injected into the blood flow of the patient. it is found in greater concentration in the zones that are the most irrigated by blood capillaries. the positrons that it emits collide with the surrounding electrons and give rise to photons that can be detected by the appropriate apparatus. affected by a stimulus (whether this stimulus results from talking, writing or listening), the blood irrigation of the zones of the brain that have been specifically excited increases noticeably. the location of the positron emission provides information about the location of these zones. within a few dozen minutes, it is possible to locate a highly vascularized cerebral tumor. pet can use molecules other than water, such as organic molecules labeled with positron-emitting atoms, ( f) fluorine (half life min) and ( c) carbon (half life min). around twenty years ago, in canada, an analogue of l-dopa, the precursor of dopamine in the brain, f- -l-fluorodopa, was synthesized, and was found to be an excellent probe for determining the capture capability of the endings of the dopaminergic neurons in the striatum. in patients suffering from parkinson's disease, this capture capability is noticeably reduced. at present, pet involving f- -l-fluorodopa is being used to evaluate the survival of dopaminergic cells grafted into the striata of parkinson's disease sufferers , . nowadays, brain imaging techniques can be used to explore the electromagnetic anomalies of neurological or neuropsychiatric illnesses such as huntington's chorea, the different forms of alzheimer's disease or even autism, the genetic origin of which is in the process of being deciphered. a bridge has now been built between the molecular defects identified by genetics and the electromagnetic anomalies that result, analyzed by functional cerebral imaging. it was not so long ago that descartes considered that human thought was unconnected to a material support (chapter ii- . . ). we are not far from the era when broca located the language area in a specific zone of the brain after the autopsy of an aphasic patient (chapter iii- . ), thus opening the door to another scientific domain, neuropsychology, which had previously only been the subject of speculation. the consequences, from the societal point of view, were far from being insignificant. thus autism, which was once suspected of being caused by errors in the mother's behavior with respect to her child, has been shown to be a disturbance in the development of the fetal nervous system, in the temporooccipital region. while the neurosciences occupy a preponderant position in the medicine of the beginning of the st century, because of the development of techniques that aim to analyze even the functions of thought, emerging methodologies of another order, such as gene therapy (chapter iv- . ), are in the process of completely modifying our ways of treating and curing a range of previously incurable human diseases, from incapacitating immune disorders to cardiovascular diseases and cancer. "it is in the domain of thought about the future that man is singled out. we are beings who have an imagination. not content to live in the present, to profit from past experience, we remain haunted by a future that we are conscious of constantly entering. this obsession with the future has been a powerful driving force in cultural evolution. we seek to predict in order to avoid the worst and to better prepare for our tomorrows." by predicting potential dangers in subjects who are in good health, predictive medicine aims to provide the means of avoiding these dangers. these dangers can be intrinsic in nature, being written, for example, into a certain genome dna sequence, or they can be extrinsic in nature, linked to an unsuspectedly deleterious environment. in each generation, mutations occur, certain of which can lead to so-called genetic diseases; between and % of newborns are affected. besides these spontaneous mutations, there are also mutations arising from the genetic inheritance of the parents. the purpose of genetic counseling is to warn parents when the existence of a potentially serious genetic flaw is suspected. the highlighting of genes that give a predisposition for cancer (proto-oncogenes) is a convincing illustration of the power of predictive medicine. this involves genes that control the synthesis of growth factors, the activity of which is essential to embryogenesis and to the repair of damaged tissue. while they are normally subject to strict control by anti-oncogenes, proto-oncogenes are able to become active in an anarchical manner, under different influences, and to transform themselves into cancer-generating oncogenes. recently, mutations have been found in two genes, brca and brca , these mutations giving a predisposition for cancers of the breast and of the ovary. thanks to genetic exploration, it will soon be possible to predict whether a cancer of the breast will have a rapid progression leading to uncontrollable metastases or a slow progression. depending on the case patients will be subject to heavy chemotherapy or to a less aggressive treatment. in this context, targeted therapy with monoclonal antibodies is a source of great hope. while genetic inheritance has a role in cancer, the environment plays a notinsignificant role as well. this is the case, for example, in lung cancer sufferers who smoke tobacco, cancer of oesophagus in those who drink alcohol and job-related cancers in those working in factories producing colorants or materials derived from asbestos or tars. cardiovascular diseases are the primary cause of death in the more developed countries, involving either an infarctus, or a stroke. many risk factors for these diseases are known, i.e., metabolic deviations affecting cholesterol or the blood serum proteins involved in the transport of lipids. these metabolic anomalies result in a syndrome known as atherosclerosis, which is characterized anatomically by the deposit of fats in the form plaques in the arteries. while genetic factors are at the origin of these metabolic problems, the latter are clearly amplified by an inappropriate diet. the role of predictive medicine is to recognize the genes that are responsible, warn individuals of the risks they are running and to advise them about the types of lifestyle and diet that do not increase these risks. being able to predict, predictive medicine should be able to prevent by means of targeted drugs. within this context, it gives rise to reflection upon polymorphism linked to variation in a single nucleotide in the dna of the genome of an individual. known as snp (single nucleotide polymorphism), this polymorphism has proved to be a very useful auxiliary in molecular medicine. hundreds of thousands of snps are present in the human genome and several tens of thousands in genes coding for the proteins. where they are located differs according to ethnic backgrounds. among these snps, some appear to be linked to certain pathologies, such as certain forms of cancer or degenerative illnesses such as alzheimer's disease. in addition, in a small number of patients, the location of certain snps has been connected with previously-inexplicable drug incompatibilities. in line with these observations, pharmacogenomics, a branch of pharmacology that deals directly with genome sequence data, is trying to evaluate the impact of "snp variants" on the efficacy and toxicity of drugs and to understand the genetic bases that explain the differences that are observed in the responses of different individuals to the same medication . rather than using a standard drug that is not very efficacious or causes adverse side effects, it might be possible, depending on the genetic profile of the patient, to use a drug that is more appropriate to his or her genetic map. it is doubtless not just a fantasy to imagine that, in or years' time, a patient visiting the doctor will be offered a genetic map thanks to cells taken from the buccal mucosa. finding snp variants that are known to be responsible for drug incompatibilities in such a map will make a targeted prescription possible. it will allow the detection of genes for susceptibility to an illness, at the same time uncovering targets for new drugs. pharmacogenomics, which is still called new pharmacogenetics, contrasts with old pharmacogenetics in which, having found an adverse clinical response to a certain therapy, an attempt was made to identify the protein target of the incriminated drug, and then to go back to the gene coding for this protein, and to look for the mutation responsible for the aberrant response to the drug. the existence of customized predictive medicine, which would read the destinies of individuals in their genes, would not be without its consequences in the life of a citizen. by registering each citizen with a genetic map, matched with a named identity card, predictive medicine might begin to take on the aspect of a janus, with his beneficent face warning subjects of potential risks of metabolic problems, and guiding them towards the actions to be taken to lower the risks, but also with his evil face delivering each individual's intimate details to the indiscrete inquisitiveness of investigators who are operating towards their own ends (insurance companies, employers…). no less worrying would be the sly but predictable transformation of the individuality of the repaired or even doped human being within a system of imposed, docilely-accepted assistance. in the th and th centuries, the methodology for biological experimentation underwent a revolution caused by the progress made in the domain of chemistry, both analytical chemistry, with the deciphering of increasingly complex molecular structures, and also in synthetic chemistry, with the large-scale production of tens of thousands of new molecules. the effects of these molecules, which might eventually be used as drugs, were tested directly on animals. it was thus that in the german chemist paul ehrlich discovered salvarsan, a derivative of arsenic, which was active against a type of treponeme, the agent of syphilis. this was the result of a systematic analysis of the effect of synthetic products, aromatic derivatives of arsenic acid, on syphilis in rabbits. salvarsan was the th derivative that was tested, and this is why it was called for a long time before it was given the name salvarsan. sometimes, lucky chance shows surprising and unexpected properties in synthetic molecules. this was the case for chlorpromazine, which was initially used as an antihistamine. it was luck that led to its antipsychotic activity being discovered in . a new era opened up in psychiatry with the arrival of synthetic narcoleptics like chlorpromazine. a new chemical science known as combinatory chemistry, which dates from the s, has aroused an increasing amount of interest in pharmacology. this involves making two or more species of organic molecules that carry reactive functional residues react in solution or in the solid phase in such a way as to synthesize, by means of all possible combinations, a number of final and intermediary products that is situated in the hundreds or even the thousands, and which makes up chemical library or drug library. we can directly test all of the products formed on a sample of eukaryotic cells, in order to verify their effects (for example the inhibition of an anarchical proliferation of cancerous cells), or on microorganisms in order to evaluate an antibiotic capability. we can also proceed straight away with the fractioning of the reaction products and the testing of each of the fractions. if the response is positive, fractioning is continued until the molecular species responsible for the desired effect is obtained. other evaluation parameters for this molecule, such as its absorption, its toxicity and its metabolic future (distribution in the organs, chemical modifications and excretion) are then explored, first in cells, and then in animals (rats, mice), thus comprising pre-clinical tests. these screening operations, which are said to be high-throughput, require automation and robotization aided by powerful computer technology. each year, pharmaceutical companies screen tens of thousands of different molecules on hundreds of targets. complementary to combinatory chemistry, in silico chemistry works by molecular modeling and uses computer programs for the rational design of new drugs that are able to fix onto specific protein targets. the purpose of this is to provide a virtual follow up to modifications in the reactivity of a given drug molecule as a function of the modifications imposed on its structure, for example, the addition of residues that differ according to their electrophilic or hydrophilic properties, or according to the length of their side-chain. provided there is a chemical library and we know the three-dimensional structure of a macromolecule, for example an enzyme, as well as the nature of the residues that define its active site, we can hope to select and chemically modify a substance that is able recognize the active site of this enzyme and to make an almost perfect ligand out of it which is able to efficiently block the operation of the target enzyme. this method, which is based on computer-aided chemistry, is called "structure-based drug design", and has had some notable successes. it has made it possible to develop an inhibitor capable of blocking a protease involved in the replication of the aids virus. however, both in combinatory chemistry and in molecular modeling, the many successes that have been achieved remain modest in number compared to the means that have been deployed to achieve them. in terms of statistics, out of ten thousand molecules that are recognized as being efficacious for a given target in vitro, around one hundred are chosen for preclinical trials on animals, around ten are chosen for preclinical trials in man and only one will come out as a drug. the financial and economic effect is far from being negligible. it has even become a preoccupation in a system where merciless competition is the rule. in addition to synthetic chemistry, preparative chemistry, which is based on the isolation of natural molecules, is now the subject of renewed interest, due to the introduction of high-throughput techniques. high-throughput screening, which is an essential tool in combinatory chemistry, is also carried out to ensure the systematic detection and isolation of natural substances having interesting pharmacological activities such as antibiotic activities or anti-cancer activities, based on marine animals, microscopic fungi, prokaryotic organisms and various plants. for example, among the substances that have been isolated recently are cibrostatin, a specific cytostatic of melanoma cells, from a marine sponge, mannopeptimycin, a bacterial antibiotic from an actinobacterium streptomyces hydroscopicus and a whole set of alkaloids with a cytostatic activity with respect to human tumor cells from an exotic plant of the genus daphniphyllum. the molecular diversity of the living world is such that the reserves of natural products having pharmacological activities are far from being exhausted. so far, only a small percentage of the microbial species populating the earth have been listed. the depths of the oceans harbor many unknown species. thousands of insect species remain to be discovered in the canopies of tropical rainforests. exploration of the plant kingdom is far from being complete. the listing of natural molecules having a therapeutic activity has only just begun. the hunt promises to be a fruitful one, all the more so because the highthroughput screening methods that can now be used greatly increase the efficiency of the search. high-throughput screening, applied to natural molecules, has overturned the methodological procedures that were in use until recently, which progress through logical steps, using relatively simple artisanal analytical methods, from observation, often resulting from serendipity, to the isolation of the active substance. thus, in the th century, using inherited traditional knowledge that a decoction of cinchona officinalis bark calms malaria crises, pierre joseph pelletier ( - ) and joseph bienaimé caventou ( - ) decided to isolate the active substance of this bark. from the raw extract, they purified an alkaloid, quinine, which proved to be the anti-malarial substance they were looking for. more recently, the starting point of florey and chain's isolation of penicillin from the microscopic fungus penicillium notatum was the fortuitous observation made by fleming that this penicillium secretes an antibiotic factor (chapter iii- . . ). there are many examples in which serendipity has been the principle factor involved in the discovery of a drug, and this will no doubt continue to be the case. the appearance of a lucky chance, after all, is not incompatible with highthroughput practices. also, it is not impossible that in the future there will be a conjugation of the discovery of new natural substances and the use of combinatory chemistry, with the aim of manufacturing derivatives having a much greater power of action and quality of specificity from these substances . to sum up, the experimental method has caused contemporary medicine to take a giant leap forward, with the discovery of increasingly high-performance functional exploration techniques, the development of therapies using molecules that are already present in nature or are manufactured by synthesis and the more and more advanced understanding of molecular mechanisms that takes into account the basic idea of the pioneers of molecular biology was that the function of a macromolecule depended on its structure. thus, perutz's elucidation of the tetrameric three-dimensional structure of hemoglobin, and of its modifications depending on the degree of oxygenation, shed a considerable amount of light on the cooperative mechanism of the transition from the hemoglobin state to the oxyhemoglobin state (chapter iii- . . ). in the same way, an understanding of the structure of many enzymes, receptors and transporters of metabolites has shed light on their mechanisms. in a parallel manner to the exploration of the structures and functions of proteins, that of genomes has made remarkable progress. the subtle entanglements of genomics and proteomics that have become accessible to the experimental method are the order of the day. one major challenge for post-genomics is to understand how proteins, expressed by genes, interact with one another to generate functions that characterize cellular specificity. even more ambitious are attempts to understand the operation of organs or even of living organisms, based on mechanisms that are implemented at molecular level. these attempts lead straight to an integrated biology, that is, a biology that aims to understand the overall functioning of living beings. taking as its purpose the access to emerging functions, resulting from interactions between macromolecules, integrated biology first tries to invent methods that make it possible to detect these interactions. strengthened by the information obtained, it tries to integrate this information with a mathematized language into modules that attempt to simulate living beings. from the simplistic procedures of the middle of the th century, which were justifiable within the reductionist context of this period, and which involved considering each species of proteins as an autonomous functional entity, we have moved on to the idea that the different species of protein that inhabit a cell have a dialogue with one other, and that they may move from one endocellular organelle to another, depending on post-translational modifications (for example, phosphorylations) that change their conformation and, at the same time, their reactivity and their behavior. thus, an enzyme protein is not only defined by its catalytic performance with respect to a given substrate, but also by its place in a metabolic network where it interacts in a dynamic and transitory manner with a multitude of other protein species (figure iv. a) . the concept of cell signaling has also evolved. instead of considering that a cell membrane receptor, activated by fixation of an extracellular ligand (a hormone, for example), addresses the information received to an endocellular effector protein via a linear cascade of individual proteins, it has come to be postulated that communication between an activated receptor and its effector is mediated by proteins organized into interactive networks ( figure iv. b) . this machinery provides more flexibility in the addressing of messages to effectors. the diagram on the right shows that besides its catalytic function, protein a interacts with other proteins in the cell. b -case of the transduction of a signal that is external to the cell (a hormone, for example). the diagram on the left refers to the classical idea of signaling from a receptor r according to a linear cascade of protein-protein interactions inside the cell, leading to an effector z. the diagram on the right shows that the signal is spread through proteins organized into interactive networks. another subject to be considered is the density of macromolecules of all types, such as proteins, nucleic acids, lipids and polysaccharides, contained by a microorganism or a eukaryotic cell, which reaches values of to g/liter, denoting a semi-solid state or a considerable degree of compacting. however, for technical reasons, kinetic studies carried out in vitro on isolated enzymes have been carried out with solutions that are or times more dilute. conscious of this difference in scale between information obtained from in vitro studies and the in vivo reality, the biology of today is trying to re-evaluate molecular dynamics within the context of a cell. this is why we are seeing the birth of an integrated (or integrative) biology of functions, which, using modeling procedures, aims to achieve an understanding of the temperospatial dynamics of the interactive components inside cells. this holistic conception of biological systems ("systems biology ") has been made possible by progress in technological expertise in domains as varied as biochemistry, molecular biology, physical optics, electronics, nanomechanics, physical and mathematical modeling and computer technology. it is a necessary complement to the classical experimental method based on bernardian determinism which, in order to connect an effect with a cause, explores living beings in a manner that is often monoparametric and is inevitably reductionist. this signals a change in paradigm in the experimental approach to living beings. a particularly effective investigative method used to explore the dialogue between proteins is the double-hybrid by genetic construction, two proteins, p and p , whose interaction is to be tested, are expressed in the form of fusion proteins in yeast. protein p is fused with the binding domain (gal -bd) to the dna of gal , a protein that regulates the transcription of the β-galactosidase gene. protein p is fused with the activation domain of gal (gal -ad). insofar as p interacts with p (b), the gal transcription regulation activity is re-established, which is verified by the transcription of the reporter gene. if the opposite occurs, i.e., in the absence of any interaction between the two domains of gal (a), the reporter gene is not transcribed. the principle of this method is based on the modular nature of numerous transcription factors in eukaryotes. these factors contain both a dna-binding domain that includes a specific dna-binding site and a transcription activation domain that starts up the machinery for transcribing dna into messenger rna. these two domains can be dissociated and then re-associated in a functional manner, by forming hybrids with interacting proteins. a first protein, p , is fused with the dna-binding domain of a transcription factor by genetic manipulation, and a second protein, p , is fused with the activation domain of the same transcription factor. if p is able to interact with p , the transcription factor is reconstituted and the reporter gene upon which it depends can be expressed. the trapping technique, which is complementary to the double-hybrid system, was developed to make it possible to identify a set of interactive proteins within a cell. a protein that is included in this set (protein of interest) is fused by genetic engineering techniques to a short polyhistidine chain (called a tag). using this tag, the protein of interest is fixed to a solid medium containing nickel ions, a material that is reactive with respect to the polyhistidine chain. in the presence of a soluble cell extract, the protein of interest binds the cognate proteins of this extract, making it possible to retrieve a complex whose components, corresponding to interactive proteins, can be resolved after denaturing gel electrophoresis and then characterized ( figure iv . ). the techniques described above are backed up by cell imaging techniques that make use of confocal microscopy, which is more directly in keeping with living reality. the optical performance level of confocal microscopes has improved lately, with the arrival of biphotonic and multiphotonic lasers that illuminate precise points of the cell. as we have seen previously (chapter iii- . . ), it is possible to create a protein chimera made up of a protein of interest fused with a fluorescent protein, in this case gfp (green fluorescent protein), inside a cell. there are currently several variants of gfp that are able to emit fluorescent light at different wavelengths. this has allowed the development of a technique known as fret (fluorescence resonance energy transfer ) which explores the interaction between two fluorescent proteins. in practice, two gfp variants that have neighboring emission spectra are fused, by genetic engineering inside the cell, to two proteins of interest, p and p , that are suspected of being interactive. if this is the case, the fluorochromes that they carry are sufficiently close that the result is a modification in the intensity of the emission fluorescence of the donor fluorochrome (decrease) and of the receiver fluorochrome (increase), which is readily detectable. all of these studies, taken together, have given rise to the idea that endocellular proteins are organized into networks, that these networks are interactive and that their location in defined compartments of the cell is dependent on epigenetic events such as phosphorylations. two attributes can be found in integrated systems: firstly, the presence of modules, i.e., interactive motifs, which, like the pieces of a jigsaw puzzle, fit together to produce a complex, coherent structure, and, secondly, the emergence of functional properties due to the newly created interactions. the protein of interest p is expressed in the form of a protein fused to a protein "tag" t that is able to bind to a solid support with a specific affinity. the assembly is brought into contact with a cell extract. certain proteins of this extract, a, b and c, which are able to interact with the protein p, become fixed to the latter. in a second step, the tag t is freed from its attachment to protein p by means of a specific cleavage enzyme. the pabc complex that is recovered from the solid medium in soluble form is subjected to polyacrylamide gel electrophoresis, in order to separate and identify its components. given an analytical description of the basic building blocks that are used to construct living systems, and an understanding of their modes of association in defined circumstances, it is normal to try to reconstruct, in their entirety, mechanisms that show the functioning of these systems. this idea was first applied to the yeast saccharomyces cerevisiae for different reasons, such as cell homogeneity (in principle, and, in any case, statistically speaking, all yeast cells have the same genome and the same proteome), an in-depth understanding of the genome and the proteome and the presence of a vast directory of well-characterized mutants. the use of techniques for the detection of interactions between proteins has revealed the existence of a potential dialogue of unexpected richness between a multitude of proteins ( figure iv. ) , in the yeast cell. it is necessary to reflect upon this evidence, which leads to the postulate that, for a given protein, there are mechanisms that restrict and select the many partners able to react with it at a precise moment.faced with a situation in which chance has the upper hand, leading to uncontrollable anarchy, it is necessary to have regulation, which is underpinned by darwinian logic. example of an interaction network involving the yeast sup prion protein. the line of dashes refers to experimental data concerning the interaction of sup with another protein, sup . the lines in bold refer to interactions taken from experimental data; while the fine lines refer to predictions, particularly phylogenetic ones. this logic arises from a choice of the most efficient reaction path, which is first of all dictated by the speed constants involved in the association and dissociation of molecular partners, without, however, neglecting any stochastic events that may arise. chemical modifications of proteins participate in this regulation, such as phosphorylation, glycosylation and acylation. the result at cell level is a coherent channeling of the information that is carried from a molecular signal. thus, fixation of a hormone onto a receptor induces a series of modifications to the intracellular proteins that channel information towards an effector terminal, for example an enzyme responsible for the production of a metabolite with a strategic function. how can the sum of the scattered experimental data that we have concerning the catalytic capabilities of a multitude of enzymes of cellular origin be integrated into the operation of a cell? how can we envisage the gene-enzyme relationship according to current evidence concerning the complexity of the genetic message? biocomputing, or bioinformatics, a science that emerged towards the end of the th century, proposes to try to answer these questions. at the turn of the th century, with the development of increasingly powerful computer microprocessors that are able to carry out complex operations with amazing rapidity, the hope arose that it would be possible to simulate processes as varied as the regulation of the cell cycle, molecular flow in metabolic pathways and the reception of molecular signals, for example from hormones by living cells, as well as the transmission of the messages that result. the dream of an in silico virtual biology has become achievable. the first mathematical theory of simple enzyme reaction kinetics was put forward approximately a century ago, by victor henri ( - ). born in marseilles to russian parents, victor henri studied in saint petersburg and then spent time at the universities of göttingen and leipzig before becoming established in paris. having an eclectic mind, studying both psychology and physicochemistry, he had the wonderful intuition that enzyme catalysis arises from a specific mechanism, different from that implemented in a chemical reaction. the study carried out by henri concerned the cleavage of sucrose (table sugar) into fructose and glucose by the action of an enzyme called invertase (sucrase). the term invertase was used because during reaction there was a change in the rotatory power of the sucrose solution, shown by a polarimeter. analysis of the reaction suggested that an enzyme-substrate complex is formed, which breaks down to regenerate the enzyme and liberate the product of the reaction. this analysis gave rise to an equation for the speed of the enzyme reaction as a function of substrate concentration. henri published the results of his experiments both in his thesis, which he presented to the paris faculty of sciences in , and in two articles that appeared in the reports of the academy of sciences , . in , in biochemische zeitschrift (vol. , pp. - ), leonor michaËlis and maud menten ( - ) confirmed the results of victor henri and formulated an equation that became a classic, describing the speed of formation of a product from a substrate in enzyme catalysis. since the period of these first studies, the concepts involved in enzyme kinetics have evolved considerably. the first metabolic pathway be deciphered was that of the degradation of glucose (glycolysis) , either into ethanol in yeast, or into lactic acid in muscle tissue. after this, researchers became aware that the activity of enzymes could be modulated as a function of covalent modifications of amino acid residues of their protein chain (phosphorylation, dephosphorylation, acylation…). metabolic flow analysis led to the idea of the limiting reaction. in the s, the idea that there is a single limiting reaction in a chain or a cycle of reactions gave way to the idea that metabolic control is distributed over several reactions, and that each reaction has its own, more-or-less intense control force. another complexity factor came to light with the discovery of allostery . allosteric enzymes have the particularity that they can fix reversibly onto a site that is different from the active site (allosteric site) molecules that are often the terminal products of a chain of reactions: the consequence of this is a conformational modification of the structure of the enzyme that has repercussions on the geometry of the active site and modifies its reactivity with respect to the substrate. in the s, faced with the complexity of the tangle of listed metabolic and signaling networks, attempts were made to use mathematical modeling to show the progress of the traffic of molecules inside a cell in relatively simple metabolic pathways such as glycolysis. in the modeling procedure, the concentrations of the different molecular species are considered to be variables whose variations over time depend on their speed of production and their speed of disappearance, which leads to a set of paired differential equations. with this procedure, we entered the domain of virtual biology. thanks to the creation of increasingly powerful software, the aim of virtual biology is to simulate signaling and metabolic pathways. in the longer term, the aim is to understand the molecular and cellular processes that direct embryo development, or to test the effects of drugs of known target on the metabolic behavior of the cell. metabolic engineering (which is already well developed) comprises two types of models, stoichiometric models and kinetic models. stoichiometric models describe metabolic networks in the stationary state, based on analytical data. kinetic models combine stoichiometric information and that concerning the catalytic capabilities of the enzymes in a metabolic network. in canada, the cyber-cell project plans to model the overall functioning of the machinery which, in the bacterium, includes its metabolism and its proliferation. the aim of the afcs (alliance for cellular signaling), which was launched in the usa, is to understand how signaling occurs in cells such as the b lymphocyte, the macrophage or the cardiac cell in response to different types of stress. the techniques that are used range from identification of all signaling network proteins to the evaluation of the flow of circulating information and to the integration of the data acquired into theoretical models. the european nerve synapse project makes use of similar procedures, with its long-term hope of linking the functioning of nerve cells with the cognitive and behavioral functions of living beings. this is a sizable challenge. in fact, there is far from being a real consensus concerning the principle of a demarcation between, on the one hand, cognitive functions such as language or memory, which are located in precise zones of the brain, and which could be reduced to physicochemical processes, and, on the other hand, forms reflective thought that are expressed through the creative imagination or by judgements concerning ethics or esthetics, the notion of personal responsibility, or even pictorial, architectural or musical beauty. should we see the human soul as the programmer of a superb computer that never ceases to develop from the embryonic state onwards, like john eccles ( - ) and others, or should we admit, like jean-pierre changeux (b. ), stanislas dehaene (b. ), daniel dennett (b. ) and others, that thought is not transcendent, and that it is intrinsically dependent on the brain, which is considered to be a neurochemical system, and thus look for the secret of the individuation of the human being in brain information storage mechanisms with retrocontrol loops associated with subtle neuron architectures, or, in short, refer to a sort of turing machine? whatever the case, in this domain, as in others, simple animal models are used in order to identify elementary processes that are able to explain easily-tested functions such as the memory in anatomical and physiological terms. this is the case for the sea slug or sea hare (chapter iii- . ) which, despite its rudimentary cognitive capabilities, provides information that can be used to reconstruct higher cognitive functions, present in the brains of mammals. it is clear that the cognitive sciences have reached a stage in which they are emerging from their infancy (chapter iv- . ). now, ingenious computing methods and a basis for reflection that has spread beyond the confines of psychology and philosophy, are available to them. they have set themselves the goal of producing an artificial intelligence, using ultrarapid computers as well as software that is able to model the operation of neural networks and to come close to the performance of human intelligence in terms of the power of their reactivity and their memorization. at present, many other biological systems are being subjected to multiparametric exploration, with the aim of producing models. this is the case, for example, with the program of the differentiation of certain white blood cells, the neutrophils (chapter iii- . . ), from precursors located in the bone marrow, a differentiation that leads to the emergence of functions such as phagocytosis that are implemented in the fight against microbial infections . in a domain that is closer to mechanical science, hydrodynamics, the digital simulation of the cardiovascular system has already made it possible to represent the physical phenomena associated with the propagation of a wave in deformable arteries during a cardiac contraction, in the form of equations, with a good approximation . in short, from a monoparametric approach that often began as being essentially and necessarily reductionist, the experimental method, applied to living beings, has become a "globalized", or synthetic, multiparametric approach, the aim of which is to understand the dynamics of molecular interactions in defined biological systems. making use of data obtained, the hope is to use mathematical processing to simulate the overall functioning of a cell, organ or organism. this new paradigm of the experimental method ("systems biology" ) is not limited to a simple accumulation of observations concerning a given biological system and their abstraction in mathematical form. the originality of this approach is that it formulates predictions of changes in the behavior of a system as a function of the manipulation of parameters such as substrate concentration, the presence of inhibitors, and so on. the mathematical processing of experimental data, with a view to learning about the functioning of complex systems by modeling, is supported by the technosciences, particularly biocomputing. it is linked not only to the enormous sum of accumulated knowledge concerning the structures and functions of living beings in the post-genomic era, and to the notion that the life of a cell depends on multiple networks of molecular interactions and thousands of enzyme reactions located in its different organelles, but also to the information that comes to it from its environment. a first type of modeling is based on observations made or experiments carried out on an easy-to-study model system. laws are drawn up from this. this so-called "bottom-up" (or synthetic) procedure, which proceeds from the simple to the complex, makes it necessary to have a set of very precise biochemical data. this great precision is all the more imperative in that any deviation, even a minimal one, in the integration of an experimental result can generate a mathematical model that is apparently plausible but which is unconnected to the living reality. the reverse, "top-down" (analytic) procedure proceeds from the overall operation of an organ and its theoretical analysis towards the specific mechanisms of its components. it takes into account the functioning of complex integrated systems such as the nerve and endocrine systems, immune and reproduction systems and the system controlling homeostasis, descending in stages towards the cellular, molecular and genetic levels. in the end, an understanding of living beings involves the management of an amazing capital of experimental data. this makes it necessary to consider all of the genes (genome), all of the transcripts coding for the proteins (transcriptome), all non-coding rnas (rnaome), all proteins expressed in a particular cell type (proteome) and all metabolites (metabolome) as a function of the enzyme catalyzers expressed and the energetics that underlie the catalyzed reactions, and, finally, to connect upstream events (mutations of genes and interference of messenger rnas, chemical modifications to amino acid residues in the proteins) to phenotypical modifications on the scale of the whole organism (phenome) (figure iv. ) . the goal of integrated or integrative biology ("systems biology") is therefore to put living beings into equations, that is, to represent them in virtual systems for which the behavior, accessible by means of calculation, can be predicted as a function of modifying parameters. in addition to the possibilities that are opened up in terms of a deeper understanding of physiological mechanisms, such virtual systems could be used for the design of new drugs or for the manufacture of economically valuable biomolecules. the diagram illustrates the different levels of complexity in the pathway that goes from all the genes together (genome) to all of the expressed characteristics (phenome) in the living being, passing via coding rnas (transcriptome) and non-coding rnas (non-coding rnaome), all the proteins (proteome), all addressing systems in the cell compartments (localisome) and all of the metabolic pathways (metabolome). at a scientific meeting held in sheffield, england, in january of , with the theme systems biology: will it work?, an argumentative discussion of the advantages as well as the disadvantages of an integrated, mathematized biology was useful in that it included a reminder that most of the parameters used in "systems biology" come from studies that are carried out in vitro on purified enzymes, and that it is not sufficient to know the value of the michaelian parameters (v max and k m ) in order to reach biological reality. in fact, in vivo, many enzymes record variations in activity that are hard to control due to allosteric type regulation or interenzyme contact; several enzymes of a metabolic pathway being able to interact to form a metabolon. however, by compacting several enzymes that catalyze contiguous reactions in a metabolic pathway, a metabolon considerably increases the catalytic efficiency of this pathway. another element of uncertainty arises from the protein density of the cell medium, and also from the fact that covalent modifications of enzymes can introduce a change in endocellular location (nucleus, organelles of the cytoplasm…). nevertheless, an approximative approach could limit itself to dealing with biological systems in modular terms, i.e., to considering them as being made up of a number of black boxes, each black box containing a series of reactions being processed mathematically together, with an input and an output. there is still a long way to go if we place ourselves on the cellular scale, but the end of the pathway seems even further away if we envisage the organism as a whole, taking into account the remote interactions between organs involving the interplay of chemical mediators. the brain plays a critical role in the dialogue between different organs, and in the regulation of the energy equilibrium in higher animals. this equilibrium can be disturbed by fasting or intense, prolonged muscular activity, or by an overabundant diet. the corrective response comes from a deep region of the brain, the hypothalamus, via the secretion of different types of peptides, some of which stimulate the appetite and others of which suppress it . while taking into account the multitude of parameters that affect the complexity of living beings on an individual level, the theoretical approach to the study of cell function by modeling has the advantage that it produces predictions and provides information about the validity of conclusions and of theories based on experiments that are old and accepted in the absence of contradictory elements. this was the case for the theory that stated that the state of activation of a gene is determined only by the presence in its environment of transcription factors. recent studies concerning the level of gene transcription in isolated cells have shown that there are probabilistic-type factors which mean that a given gene in a given cell can be activated at any moment. a review which came out in sums up this subject. in this review, the authors use modeling to analyze the behavior of cells in the process of differentiation during embryogenesis. their darwinian model, which associates contingency and selectivity, competes advantageously with the determinist (or instructive) model, based on an all-or-nothing logic, that has been implicitly accepted up until now. the darwinian model takes into account the occurrence of stochastic events at gene expression level, events that are partially linked to the structural modifications to the chromatin that depend on covalent modifications of an epigenetic nature (phosphorylation, methylation…). by basing itself on the existence of mutational fluctuations that arise by chance, associated with a selfregulation of gene expression, the model that is obtained shows that during embryogenesis a cell has a choice either to differentiate into another cell type or to remain in its initial state. differentiated cells stabilize their own phenotype and, in their surroundings, stimulate the proliferation of foreign cell phenotypes. a harmonious equilibrium between these two processes is the necessary condition for the setting up of the steps that lead to the arrangement of different cell types during organogenesis, which take place in an apparently inescapable order, in the absence of disturbances. a break in this equilibrium leads to an anarchical cell proliferation. generally speaking, from the point of view of experimental science, the lesson that can be drawn from current modeling experiments is that the bernardian determinism that has prevailed as the essential foundation stone of the methodology applied to the study of living beings may find itself being requalified by the taking into account of stochastic phenomena. this is the case when the number of reacting molecules is low and the probability of stochastic events is non negligible. the modeling of such systems necessitates having recourse to a complex mathematical formalism. it remains true that determinist models for simulation of the dynamics of living beings, represented by classical differential equations, are more-or-less valid when the number of reacting molecules involved is high and the reactions supposedly take place in a homogeneous medium. should "systems biology" be regarded as a resurgence of a physiology that has been somewhat neglected over the last few decades, but has been reinvigorated by a salutary hybridization of biologists and model-makers? in any case, this is the intention of the "physiome" project which has recently been launched on an international scale. it is also doubtless due to this state of mind that a trend which had gone out of fashion, involving the simulation of the performance of living beings by very elaborate concrete models, robots, is being reborn. an immense distance has been covered in just over two centuries, since the time when vaucanson presented automata in the forms of human figures, moved by ingenious springs and cogs, and giving the illusion that their movements were controlled by an intelligence, to a marveling public (chapter ii- . ). in the last decades of the th century, considerable progress was made in the understanding of the operation of the nervous system and in the development of technologies in which miniaturized electronics have come to the aid of already high-performance micromechanics. the brain being considered as an information processing machine, the aim is to understand the logic of this information machine by means of simulations on computers and, based on the results obtained, to construct robots whose electrical circuits take their inspiration from the operation of animal neurons. these robots are called biorobots or animats. insects have been chosen as a reference for the construction of such creatures because of the relative simplicity of their nervous systems: several hundred thousand neurons, in comparison with the billions of neurons present in mammals ( billion in man). the fly's system of vision has been favored as a subject of study because of the possibility it offers of being able to record the electrical response of neurons that can be identified one by one. in the middle of the s, in france, this inspired the pioneering work in biorobotics carried out by nicolas franceschini (b. ) and his team , (figure iv. ) . their objective was to study how an animal can avoid obstacles by means of its ocular perception and its movement-detecting neurons, the operation of which the team just analyzed using microelectrodes and a microscope-telescope specially built for the purpose. the fly's composite eye has elementary units or ommatidia, each carrying eight light receptor neurons. the electrical signals emitted by these neurons in response to captured light (at most a few dozen millivolts) are sent to subjacent neurons that are organized into three levels that correspond to the optical ganglions called the "lamina", "medulla" and "lobula". the lobula is a strategic decoding center which, because of the small number of neurons contained in it (sixty), has been the subject of in-depth electrophysiological investigation. each of the sixty neurons of the lobula operates as a signal integrator. the neurons of the lobula send their messages to motor neurons involved in the contraction of small muscles that control the guidance and stabilization of the fly's flight. based on an exhaustive study of the neuron wiring of the fly's eye, franceschini and his colleagues were able to reconstruct a facetted artificial eye that can retranscribe the light signals received optoelectronically. this artificial eye, the electronic components of which correspond to around one hundred movement detectors in the fly, was incorporated into the head of a robot. the recorded light signals were transmitted to the moving components of the robot. a -head of the blowfly, calliphora, seen from the front, showing the two compound eyes with their multifacetted array. each eye hides , photoreceptors that drive various image processors based on a few hundred thousand neurons. b -"elementary motion detector" (emd) neuron and its evolution over fifteen years: on the left, first generation ( ), using surface mounted device (smd) technology, compared to a one franc coin from that period; on the right, the version of the highly-miniaturized hybrid (analog + digital) emd circuit (mass . grams), compared with a one euro coin. c -autonomous vehicle ( kg) able to move around in a field of obstacles that it does not know about in advance. its vision is based on a genuine compound eye, whose circuits are inspired by those of the fly. it includes a network of "motion detecting neurons", transcribed electronically according to the principle analyzed in the fly's eye by means of microelectrodes and a specially-constructed microscope-telescope. this network is arranged around a ring that is about thirty centimeters in diameter. the recently-constructed roboflies, oscar and octave, only weigh around one hundred grams. d -routing of the electronic components (resistances, condensers, diodes and amplifiers that operate in their thousands) soldered onto the six-layer printed circuit-board that provides the connection between the sensors and the steering motor on board the autonomous mobile robot shown in (c). figure iv . illustrates the neuromimetic biorobofly constructed according to this principle. completely autonomous because of its on-board power supply, this robot was able to move around at high speed ( cm/s) in a cluttered area, avoiding the obstacles. this first "terrestrial" robofly, which was completed in , was followed by several much lighter brothers and sisters: fania, oscar and octave are aerial roboflys . constructed in , oscar is a captive robot that weighs around one hundred grams. it is equipped with an eye that reproduces the retinal microscanning of the fly's eye discovered by franceschini, oscar is able to rotate around a vertical axis because of its two diametrically opposed helices and can thus orient its view towards an object. if this object moves, oscar follows it with its eye, up to an angular speed comparable to the tracking speed of the human eye. produced in , octave is another aerial robofly that is able not only to take off and distinguish a relief, but also to land automatically and to react sensibly to a contrary wind in a turbulent atmosphere. on board, it has an electronic visuomotive self-regulation system, the operation of which is based on the signal processing operations that, in the insect, carry out the automatic pilot functions . the age of biorobotics, in which robots take their inspiration from animals, has only just begun . if specimens are still so rare, this is because behaviors for which we have a good understanding of the underlying neuronal bases are also rare. at the time of writing, a robot rat named psikharpax, with artificial muscles and a vision system that enables it to perceive objects in three-dimensional space, is being developed at the university of paris vi. almost in the realm of science fiction, we find hybrid robots obtained by hybridization of the living and the non-living. this is the case for the hybrid robot produced by japanese researchers, based on the silkworm moth. control of the nervous system of this insect is spread throughout its body. if its head is cut off, it continues to fly, which gave rise to the idea of replacing the head with an electronic transistor system . using a remote measurement device, it was possible to explore certain behavioral aspects of the insect. although the construction of hybrid robots may raise ethical objections, such technology is capable of giving rise to spectacular applications in the domain of prostheses. the neurological prostheses of the future will nevertheless require that a contact be made between living neurons and the electronic chips that are able to improve the inadequate processing of the physio- logical signal. such a contact was produced recently in a german laboratory directed by peter fromherz (b. ) . a small network of snail neurons, chosen because of their large size, was cultured on the surface of a silicon chip. a signal emitted at one location of the chip was able to be transmitted to another location via the synapse connection between two neurons ( figure iv. ) . on the molecular scale, mitochondrial atpase or atp synthase, with a size of around ten nanometers (chapter iii- . . ) was used recently for the manufacture of a biorobot that made its mark in the media as the smallest known rotating molecular motor. the membrane-type enzyme catalyzes the reversible reaction atp + h o adp + pi. this enzyme therefore has a double function; hydrolysis and synthesis. for this reason it is called atpase or atp synthase depending on the physiological context in which it is involved. in the mitochondria that oxidize metabolites, the enzyme operates like atp synthase. it catalyzes the synthesis of atp coupled to oxidation reactions. in the absence of respiration or of oxidizable substrates, the enzyme operates like atpase; it catalyzes the hydrolysis of atp. for ease of language, the enzyme will be designated here by the term atpase. it should be remembered that mitochondrial atpase includes two sectors, a hydrophobic sector, fo, characterized as a proton channel located inside the mitochondrial membrane, and a hydrophilic sector, f , carrying catalytic subunits that are arranged as if they were on a turret (see figure iii . c). fo contains two master parts of the atpase motor, i.e., a rotor comprising an assembly of around ten so-called "c" subunits and a stator that corresponds to the "a" subunit. the "c" subunit assembly is attached to the "γ" subunit of the catalytic sector f , which thus functions as a rotor. in , the british biochemist peter mitchell ( - ) showed that phosphorylative oxidation in the mitochondria is associated with a transmembrane transfer of protons. the mechanism involved is said to be chemiosmotic. the most important experiment involved an almost serendipitous observation, carried out with a simple ph meter. when a current of oxygen was passed through a suspension of mitochondria in an unbuffered saline medium, in the absence of adp and phosphate, an instantaneous acidification of the extramitochondrial medium occurred, shown by means of the ph meter electrode immersed in this medium. it was concluded that the sudden switch from anaerobiosis to aerobiosis, i.e., the start up of respiration, is correlated with an ejection of protons from the mitochondrial matrix to the extramitochondrial medium. afterwards, this fact was linked with several others, the whole leading to the formulation of the chemiosmotic theory. briefly, mitochondrial respiration generates a vectorial movement of protons from the interior to the exterior of the mitochondrion. because of this, a proton concentration difference is established on either side of the mitochondrial membrane. the electrical potential that is created in this way is used by the mitochondrial atpase in order to synthesize atp from adp and mineral phosphate. this process involves two correlated events: return movement of protons towards the inside of the mitochondrion across the fo sector of the atpase; rotation of the assembly of c subunits and the γ subunit that is interdependent with it. we have therefore moved from electrical to mechanical energy. during its rotational movement, the γ subunit establishes contacts with the three catalytic subunits of the f sector, in succession. one after the other, each of the three catalytic subunits in contact with the γ subunit undergoes a change in the conformation of its active site, which is at the origin of the synthesis of atp. in the absence of mitochondrial respiration, the reverse process occurs. the atp is hydrolyzed into adp and mineral phosphate, and the energy released at each of the three catalytic subunits is used to rotate the γ subunit in the reverse direction to that which accompanies the synthesis of atp. the existence of a rotational movement of mitochondrial atpase, which had been suggested on the basis of biochemical arguments and of structural data was authenticated by masasuke yoshida and his co-workers in japan in , thanks to an imaging technique . in a first step, the molecular system was simplified by being limited to the catalytic f sector of the enzyme. a methodological trick was employed: genetic engineering was used to modify the α and β subunits of this sector by fixing polyhistidine chains to them. because of the strong affinity between polyhistidine and nickel ions, the f sector α and β subunits were immobilized on a medium covered with nickel ions (carried by an organic molecule). an actin filament labeled with a fluorescent ligand was attached to the end of the f sector γ subunit. this assembly made it possible, under a fluorescence microscope, to display a rotational movement of the actin arm carried by the g subunit affected by the addition of atp and by its hydrolysis into adp and mineral phosphate. a similar rotational movement of the γ subunit carrying a metal microbar was observed by an american research group . remarkably, in , after having fixed a metal microbead onto the γ subunit, the japanese researchers demonstrated synthesis of atp from adp and mineral phosphate by rotating the γ subunit by means of rotation of the magnetic bead, induced by magnets. thus, the experimental coupling of a mechanical force and a chemical synthesis was demonstrated. in , the japanese research team succeeded in photographing the rotational movement of the enzyme powered by atp under the microscope, this time looking at the entire atpase complex, f fo. after having attached a gold microbead onto the "c" subunits of the fo sector, to act as a probe, the researchers were able to confirm that the rotational movement of these subunits depended on the hydrolysis of atp into adp and phosphate ( figure iv. ) . the whole of the mitochondrial atpase (atp synthase) does, in fact, function as a molecular rotational motor powered by a proton flow, rather like an industrial rotational motor powered by a fossil fuel or electricity. the analogy is a striking one; the γ subunit of the enzyme corresponds to the motor driveshaft and the "c" subunits correspond to the motor itself. because of its association with non-living structures, for example metal bars or gold beads, which are carried along in the rotational movement of the enzyme, it is possible to speak of molecular biorobots. this domain, in which nanomachines use macromolecules from the living world, has only just opened up, but its future is full of promise. the story of scientific progress made with respect to the mechanisms of phosphorylative oxidation via the functioning of mitochondrial atpase, from the time of mitchell's experiment with the ph meter until the time of the manufacture of yoshida's biorobots, is an exemplary one. it is typical of the way in which a mode of thought evolves over time, from a primary discovery resulting from serendipity or an experiment "to see what happens", leading to the proposal of the existence of a mechanism, to a carefully programmed project which, because of its inventive technicity, shows the validity of the proposed mechanism, and, in addition, demonstrates its future utilitarian value. nowadays, certain biotechnologists dream of being able to "synthesize life" in terms of cells that are able to imitate the performance of living cells. the concept of the "lab-in-a-cell " is coming to the fore , . nevertheless, it would be necessary to design an artificial cell that is an authentic replica of a living cell, and which benefits from all the attributes of a living cell. this is not achievable at the moment. thus, the current aim of nanobiotechnology is limited to scheduling the construction of artificial cells that are relatively simple both in composition and in function, for example, a microvesicle edged with a lipid membrane, containing a system of protein synthesis expressed from a short sequence of dna, as well as a system of atp synthesis able to supply the energy necessary for this protein synthesis. demonstration of a rotational movement of f fo mitochondrial atpase (atp synthase) , induced by atp. atpase or atp synthase (reversible catalysis enzyme that hydrolyzes or synthesizes atp) has two sectors (see figure iii . ). the membrane sector, fo, comprises an assembly of a dozen so-called c (rotor) subunits and an a (stator) subunit. the other, extra-membrane sector, f , is catalytic. it comprises three β catalytic subunits and three α non-catalytic subunits arranged in a ring, in alternating order. at the center of the ring is the γ subunit which is attached to the c subunits of the fo sector. subunits δ, ε and b stabilize the whole of the molecular complex. in the experiment illustrated in this figure, subunits α and β of the f sector of the enzyme have been genetically modified to include polyhistidine chains (his-tag, artificial ligand). due to the interaction of these chains with nickel ions (linked to an organic molecule) covering a solid support medium, the α and β subunits are immobilized. in addition, a gold microbead is fixed onto the ring of the fo sector c subunits by means of a chemical device (streptavidin molecule, artificial ligand). following the addition of atp, rotation of the microbead attached to the ring of fo sector c subunits is observed by microscopy on a black background. this rotation is dependent on (and at the same time an indicator of) the rotation of the c subunits, itself led by the rotation of the γ subunit in contact with the catalytic β subunits. note the ejection of protons. when the enzyme functions as atp synthase, the proton movement takes place in the opposite direction. "progress in biology is possibly mainly tributary to the drawing up of concepts or principles […] . in the process of elaborating concepts, which marks scientific progress in biology, there is sometimes a crucial step, when we realise that a more-or-less technical term that we had previously considered to cover a given concept, in fact covers a mixture of two (or more) concepts." ernst mayr translated from a french translation entitled history of biology. diversity, evolution and heredity - on the margins of the modeling and the difference in mathematized systems that comprise theoretical biology, particularly in silico biology, concepts are mental representations, often image-filled and idealized ones, of fundamental mechanisms that are deduced on the basis of experimental results. from the imaginary domain of the probable, they extrapolate constructions of the mind that are in phase with the facts and experimental data, within a reflective projection that gives them their meaning and makes it possible to make certain predictions. there are premonitory concepts. this was the case for the concept of the reflex arc that associates movement with sensation. this concept was already present in the ideas of descartes (chapter ii- . ), but it took more than a century before the theory of the existence of a reflex arc was supported by bell and magendie's demonstration of the existence of relay centers for sensory and motor nerves in the spinal chord (chapter iii- ). there have been premonitory concepts that, while they were demolished at the time they were first proposed, were shown to be completely accurate a few decades later. in the middle of the th century, the german pathologist jacob henle ( - ) needed a healthy dose of imagination and audacity in order to oppose the theory of the "miasma", a theory that was taught as a dogma, with a new theory that not only explained the spread of contagious diseases by microscopic beings, but also formulated the criteria for validating this theory, i.e., isolation of the pathogenic agent and its development in culture away from the diseased organism, then reproduction of the original pathology after injection of the pathogenic agent, which has been isolated, characterized and multiplied in a culture, into a model animal. thirty years would go by before the formulation of koch's postulates, based on experimental evidence (chapter iii- ). we may ask ourselves whether or not history is currently repeating itself in the case of spongiform encephalopathies that affect humans and animals, for which, according to the thesis of stanley prusiner (b. ) , the prion, as an infectious protein, is responsible. other evocative concepts hold the keys that open doors to domains that are unknown, but are potentially rich in information. it is thus that the double helix dna structure proposed by crick and watson, based on the complementarity of adenine-thymine and cytosine-guanine bases (chapter iv- . . ), led to the concept of dna replication with reconstruction of a double strand that is identical to the original double strand. the concept of dna replication spurred on matthew meselson (b. ) and franklin stahl (b. ) to develop an experimental protocol based on the labeling of the dna nucleotide bases of the enterobacterium e. coli with a heavy isotope of nitrogen, n, and on the differentiation of monocatenary dna strands in the process of synthesis by measurement of their density, as analyzed by centrifugation in cesium chloride gradients. in the same vein, jacob and monod's discovery of regulatory genes (chapter iv- . . ) gave rise to the concept of the operon which, in the bacterium, defines a genetic unit comprising structural genes and regulatory genes. the concept of the regulation of gene expression, extended to higher eukaryotes, makes it possible to explain the phenomenon of differentiation in cells with specific activities (muscle cells, nerve cells, epithelial cells…) by the silencing of certain genes and the activation of others. within the framework of bioenergetics, the chemiosmotic theory put forward by mitchell, in order to explain the coupling of mitochondrial respiration with atp synthesis (chapter iv- . ), gave rise to consideration of the concepts of transmembrane transport of metabolites and of vectorial metabolism. some generalizing concepts that carry a unifying virtue within them are known. one such is the concept of compartmentation. the cell is no longer considered to be a bag of enzymes, as used to be the case. it is now considered to be a compartmented structure in which each type of compartment corresponds to a type of organelle delimited by a membrane and characterized by specific functions. thus, because of the genetic material that is present in it, the nucleus of the cell holds the information necessary for the manufacture of proteins. the mitochondria, which are called cell power plants, are in charge of oxidizing the products of cell catabolism and using the resulting energy for the synthesis of atp from adp and mineral phosphate. the lysosomes are the garbage collectors of the cell. among the functions carried out by peroxisomes is the partial breakdown of very long chain fatty acids. the endoplasmic reticulum and the golgi apparatus are involved in the maturation and the secretion of proteins. the ribosomes represent the machinery upon which messenger rnas are displayed in order to be decoded into proteins. a sign of the extreme sophistication of this setup is that the membranes of the endocellular compartments are not sealed common walls. they contain proteins that act as selective transporters of metabolites or highly specific ion channels, allowing the exchange of messages throughout the cell. thus, each organelle, informed of the condition of the others, is able to adjust its own activity to ensure the greatest harmony of the whole. this conditioned compartmentation at cell level may be compared to the socialization of human communities. while endocellular organelles are compartments delimited by membranes, there are non-membrane-bound compartments in the cell, such as protein complexes in which two, three or even more proteins are closely linked. often, these are enzymes that catalyze reactions that are contiguous in a metabolic pathway. being compacted into a complex known as a metabolon results in an increased efficiency of the flow of metabolites by facilitating the channeling of this flow. concepts evolve, often adjusting their representations according to accumulated knowledge. a good example of this is the evolution of the concept of the gene since its formulation at the beginning of the th century. the term "genetics" was created in by the english naturalist william bateson ( - ) . the term "gene" was introduced three years later by the dane wilhelm johannsen. this term designated a principle which, in the chromosomes of fertilized egg, and in an intentionally vague manner, was supposed to have an influence on the phenotype of the progeniture. during the same period, the term "locus" appeared out of the experiments carried out by the american thomas hunt morgan on the drosophila, a locus being defined as a region of a chromosome which, when altered by a mutation, leads to a modification of the phenotype of the living organism. based on cross-breeding experiments carried out on hundreds of drosophila mutants, morgan and his co-workers drew up the first genetic maps. by chance, the salivary glands of the drosophila have a particular characteristic; the nuclei of their cells contain giant chromosomes called polytenes, which result from the association of a hundred replicate copies of chromosomes that, after staining, are visible under the optical microscope. on these chromosomes, it is possible to distinguish colored bands separated by clear bands. it was observed that specific mutations had specific effects on the arrangement and number of these bands. the material contained in the bands was therefore the site of mutations. in the middle of the s, the listing of more than bands made it possible to construct a cytological map that was already highly detailed. the concept of the gene, the material basis of inheritance, took root. the sporadic mutagenic effect of x-radiation in the drosophila, which was shown by the geneticist and biophysicist hermann mÜller ( - ), led the austrian physicist erwin schrÖdinger to question the sporadic event which, at the level of a target of a few dozen atoms, determines a mutation. he postulated that the target is located in the chromatin of the chromosomes, organized as an aperiodic crystal. the chemical nature of this target was identified with dna, following bacterial transformation experiments (avery, macleod and mccarthy, ) and experiments concerning bacterial infection by the bacteriophage (hershey and chase, ) (chapter iv- . . ). this is how the idea that gene = dna was born. the simple and reassuring idea that one gene → one enzyme, which was deduced from mutation experiments carried out by beadle and tatum on the mold neurospora crassa (chapter iii- . ), had only a limited lifetime. a first stumbling block appeared when it was shown that the activity of a gene, and in consequence its contribution to the phenotype, depends on nucleic elements outside the gene. the definition of the term "gene" was then extended to include promoting and regulatory sequences. in the case of the lac operon of escherichia coli, these sequences are located just upstream of the site where transcription begins. however, in eukaryotes, a regulatory sequence may be distant from the gene that must be transcribed and sometimes it may be involved in the regulation of several genes (chapter iv- . . ). in the s, the idea of the existence of the mosaic gene in eukaryotes appeared. a gene was now thought of as an assembly of several exons that originally in the chromosome are separated by introns. the alternative splicing of these pieces of genes gives rise to numerous possibilities for reconstitution, i.e., many messages coding for many different proteins. thus, however useful the concept of the gene has been with respect to its ability to generate discussion and to provoke experimentation concerning the molecular machinery responsible for the transmission of the hereditary characteristics, we can see that the term itself has not ceased to be the subject of readjustments, since the time it was first formulated. certain concepts are matched with metaphors. while some metaphorical concepts, particularly those that make use of images designed to grab the imagination, and to be easy to understand, tend to take liberties with the realities of living beings, they can also shed light on unsuspected mechanisms in sectors that have been neglected. metaphorical concepts are not a current fashion. it should be remembered that in his passions of the soul ( ), descartes, when asked "how limbs can be moved by objects of the senses and by the mind without the help of the soul," responds that this takes place "in the same way as the movement of a watch is produced only by the force of its spring and the arrangement of its cogs." later on, with lavoisier's clear vision of the vital role of oxygen, and his comparison of respiration with combustion, the concept of the chemistry of life, combined with that of bioenergetics, came to the fore, and was at the heart of studies on the metabolism. chemical reactions that liberate and absorb heat were substituted for the cogs of cartesian mechanics. the second half of the th century saw the birth and development of the concept of the program, a concept with computer technology connotations, which was destined to explain the phenomena of inheritance. this concept began to fill out from the moment when it became certain that, in its nucleotide sequence, dna contains the necessary information for the construction of the protein material of cells. for a certain period of time, the passion for molecular genetics eclipsed the interest that had previously been given to metabolic chemistry. the powerfulness of the metaphorical concept may be measured according to the effect it has in pushing scientific research in particular directions, with the results this has on society. thus, during the th and th centuries, the study of human pathology was impregnated with a strong iatromechanical current. physiological chemistry and its corollary, pathological chemistry, which emerged as disciplines in their own right in the th century and achieved full expansion in the th century, are our inheritance from lavoisier and the concept of discussion about concepts necessarily leads to a brief discussion of scientific semantics, as shown by the few examples given in the previous pages. as we have just seen, the word gene that was put forward by johannsen around one century ago did not have the same meaning at that time as it has now, a meaning that still remains fluid. the gmo, an acronym meaning genetically modified organism, which has been the subject of vehement diatribes over the last few years, becomes much less of an object of passion if it is considered within the context of evolution. after all, for the last two to three billion years, living organisms have been genetically modified constantly by spontaneous mutations, which is why the human beings that we are today are able to discuss them! the term cloning is another example of a semantic misunderstanding that leads to inaccurate interpretation and arouses the passions. the primary meaning of the term cloning is the multiplication and the identical reproduction of a living cell. the simplest and most unambiguous example is that of bacterial cloning, a bacterial cell producing millions of cells that are identical to the original cell by its multiplication in a nutritive medium. the term animal reproductive cloning does not carry exactly the same semantic weight. it should be remembered that, in eukaryotes, the preliminary act of the cloning procedure involves the injection of the nucleus (with n chromosomes) from a somatic cell into an enucleated oocyte (chapter iv- . . ; see also chapter iv- . ). the somatic cell nucleus, by providing its genetic equipment, gives the being that will develop in the uterus a phenotype that is practically identical to that of the somatic cell donor, but nevertheless not completely identical, as the cytoplasm of the enucleated ovum, with its mitochondria, provides a small but non-negligible fraction of genes, the mitochondrial genes. as for therapeutic cloning (in the absence of uterine implantation), this is used for the manufacture of differentiated cells that may be grafted into the individual who has donated the original somatic cell, with no immune-related rejection occurring. this is non-reproductive cloning. the passionate argument that has arisen because the term cloning is bandied about in an ill-considered fashion illustrates the confusion that can result from a lack of precision in the use of certain terms with a high level of media impact. "all the major problems of the relations between society and science lie in the same area. when the scientist is told that he must be more responsible for his effects on society, it is the applications of science that are referred to […] . no government has the right to decide on the truth of scientific principles, nor to prescribe in any way the character of the questions investigated." the progress of science is linked to that of civilization. it is in keeping with the state of mind, the beliefs, the lifestyle and the thought patterns of societies. in ancient greece, where manual work was considered to be servile, science remained essentially theoretical, confined to logic and dialectics, and strongly attached to questions of philosophy. the birth of experimental science in the th and th centuries went hand-in-hand with the rehabilitation of manual work. the technical side dominates in modern biology, which seeks to solve problems concerning the "how", rather than to address philosophical problems concerning the "why". as ian hacking (b. ) says in representing and intervening ( ), nowadays engineering, and not theorizing, is the greatest proof of scientific realism, which leads to the minimization of philosophical thought. in a skeptical biochemist ( ) , the polish-born american biochemist joseph fruton ( fruton ( - emphasizes the contrast between the th century and the first half of the th century, when eminent scientists were still interested in the ideas of the professional philosophers of the history of the sciences concerning the progress of experimental research and, in contrast, the end of the th century, when philosophy and the experimental sciences pretended to ignore one another. this is doubtless partly because the history of biology has become the history of biotechnologies to such an extent that, according to some, the objects being explored are so familiar that they are now part of the life of society. in pandora's hope ( ), bruno latour (b. ) considers that the current confrontation between subject and object, in which the researcher-subject explores the structure and function of the object, is being transformed into a human-nonhuman dialogue, in which the nonhuman-object becomes "socialized". taking yeast as an example, latour writes that it has been "working for millenia in the brewing industry, but now it works in a network of thirty laboratories where its genome is mapped, humanized and socialized like a code, a book, or a program of action that is compatible with our ways of coding, counting and reading […] . non-humans have become automatons, admittedly without rights, but much more complex than material entities." latour visualizes the human-nonhuman associations in the form of collectives that are organized into strata that implement the technical, the political, the social, the ethical, the ecological… the technosciences correspond to one of these strata, the sociotechnical stratum that is directly linked to the stratum of political ecology. in the same spirit, the belgian philosopher gilbert hottois (b. ), in his philosophies of the sciences, philosophies of techniques ( ) remarks that "laboratories produce things that go off to live their lives in society and in nature." thus, bacteria, yeasts or genetically modified plants are able to produce drugs such as insulin, growth hormone and vaccines for human medicine. these drugs become part of and indispensable to life in society. they are evaluated according to their market value by the companies that patent, manufacture and sell them, and according to the comfort they bring to the patients to whom they are administered. the financial management that results from their consumption becomes a worry for those responsible for public health, while their manufacture by specialized companies generates industrial activity and economic growth which may be measured according to how fashionable they are and how they sell. for a long time, society, while benefiting from scientific progress, remained indifferent to the experimental method, that is to say, the way in which knowledge progresses. in the last decades of the th century, society became aware, via information concerning the occasionally demonized exploits of genetic engineering, that science can "take liberties" with the human being. populations were well informed about the effects that genetic engineering could have on the mortality rates of pathologies such as cancer and diabetes or on degenerative illnesses of the nervous system, and about the closeness of possible solutions. however, they were also warned about the risks to which science was exposing humankind. remembering certain tragic episodes concerning hiv-contaminated blood transfusions, growth hormone and mad cow disease, and certain cassandra-like predictions, such as a catastrophic epidemic of spongiform encephalopathy that has happily yet to appear, society shows reservations when the media inform its members of new feats of modern technology. political authorities, for their part, afraid of potential problems, tend to follow the principle of precaution, which in fact hides a fear of risk. however, evaluating risk involves not being afraid of it but understanding it in a lucid and courageous fashion. informed by the media, which often use sensationalism, the citizen is increasingly calling into question whether certain practices involving the biosciences, such as cloning, or certain mercantile transactions such as the taking out of patents concerning gene sequences, or even experimentation on live animals, are well-founded. "the problem of experimentation on man is no longer a simple problem of technique. it is a problem of value. from the moment that biology concerns man no longer simply as a problem, but as instrumental to the search for solutions concerning him, the question arises of deciding whether the price of knowledge is such that the subject of the knowledge is able to consent to become the object of his or her own knowledge. we have no difficulty here in recognizing the still open debate concerning man as a means or an end; an object or a person. this is to say that human biology does not contain in and of itself the answer to questions concerning its nature and its significance." knowledge of life - written at a time when people were far from imagining how molecular biology was going to expand, the prophetic words of georges canguilhem ( - ) have maintained their philosophical validity. manipulation of the human embryo, whether this involves its creation by cloning or the modification of its genetic inheritance, obviously leads to the need to consider the societal, religious and political points that arise from the domain of bioethics and are a reflection of the period in which we are living. until recently, advances made in biology left moralists indifferent. this ceased to be the case when scientific experimentation began to look at the human embryo with a view to utilitarian ends in the health domain. the specter of cloning was brandished without any clear distinction being made between reproductive cloning and therapeutic cloning. biology became demonized. however, as biologist pierre chambon (b. ) said in an interview in the french journal biofutur: "in absolute terms, biology is unable to tell us whether the cloning of a human being is moral or immoral, it simply tells us whether it is biologically possible." the birth dolly the sheep in (chapter iv- . . ) triggered a virulent debate because now that the cloning of an animal had become possible, that of a human being became envisageable. the media sensationalized this debate all the more in that it was exacerbated by debate concerning gmos (chapter iv- . ). the dolly affair became a problem of society. up until then, the biosciences had been happy just to try and understand the mechanisms that explained the functions of living beings, but now, with the advent of gmos and cloning, it became obvious that a forbidden barrier had been crossed and that man had the power not only to transform but also invent himself. faced with this desacralisation of nature, the need arose for some philosophical reflection. this was given the name of bioethics, which is the title of the book, bioethics, a bridge to the future, which was written by the american biologist van rensselaer potter ( - ) in . the term bioethics covers philosophical considerations that range from the biosphere to the human person. bioethics tries to give a wider meaning to the moral codes which, in human societies, depend on ancestral traditions. it aims to prescribe that which is desirable according to the kantian maxim of the categorical imperative. in his what is bioethics? ( ), the belgian historian gilbert hottois reminds us that bioethics are above traditional morals, the latter being a set of norms that are most often spontaneously respected as good habits, without any critical reflection being involved, while bioethics, on the other hand, arises out of critical thought, analysis, discussion and the evaluation of established mores. over the last few years, the problems that are targeted by bioethics have moved towards today's burning issues. human cloning is an example. while allegations of the transcendence of man in nature may lead to human reproductive cloning being considered as a crime, strictly scientific considerations lead to an emphasis on the lack of responsibility shown by a few zealots, given the hazards involved in cloning in animals, such as the need to use a large number of oocytes in order to achieve success in cloning, the very low viability of the cloned embryos and the development of serious functional anomalies in the clones that survive. even supposing that scientific progress will one day overcome these difficulties, human reproductive cloning will come up against an insurmountable obstacle, the cloned subject's fear of finding that he or she is identical to the relative from whom his or her genetic inheritance comes. after all, the notion of manipulation of the human ovule with the aim of serial reproduction has often haunted science fiction stories. in brave new world ( ), aldous huxley ( - ) gives an apocalyptic vision of the budding of human eggs that produce hundreds of identical twins which are conditioned into classes and subclasses while being raised in jars, depending on the quality of the nutritive substances they are given. in the artificial uterus ( ), henri atlan (b. ) predicts that the raising of human fetuses in jars could well become an alternative to uterine gestation in a distant future. let it be understood that human reproductive cloning, which is no longer part of the domain of science fiction, as it has become feasible, must be considered as being reprehensible because it goes beyond the limits of reason, and is a denial of human transcendence. man as subject cannot be considered as an object. the problem of therapeutic cloning is quite different, although it leads to reticence and prohibition because the demarcation between therapeutic and reproductive cloning depends mainly on whether a cloned embryo is implanted in a uterus. while, at the time of writing, therapeutic cloning has been prohibited in france, germany and other countries, it is tolerated in great britain. in the usa, the prohibition only applies to publicly-financed researched, while each state has its own legislation, which is relatively flexible. the objective of therapeutic cloning is to provide patients with tissues that arise from their own selves, and are therefore immunocompatible and able to be grafted without there being any risk of rejection (chapter iv- . . ). it is based on the removal of somatic cells from the subject to receive the graft and the transfer of the nuclei of these cells into enucleated oocytes. the stem cells that are obtained after the first divisions are stimulated using appropriate growth factors. depending on the factor used, the stem cells differentiate to form a type of tissue (hepatic, muscular, nerve…) that can be used as a graft. such a procedure may be envisaged for patients who have suffered a serious, invalidating trauma, for example section of the spinal chord. a graft of immunocompatible nerve cells might make it possible to re-establish nerve continuity. a similar type of therapy has been considered for parkinson's disease, the cause of which is a degenerescence of certain cells of the encephalon (chapter iv- . . ). given the hopes that are raised by the possibility of such therapies, and the fact that, after all, such therapeutic cloning is the equivalent to an autograft, even if the ways in which the graft is obtained are slightly tortuous, the demonization and rejection of such practices should be reconsidered, calmly and coolly. another option for therapeutic cloning is the correction of mutations identified in the mitochondrial genome of a woman wishing to have children. it is, in fact, the mother's ovum that provides the fertilized egg with its complement of mitochondria that are indispensable for its viability. the manipulation involves inserting the nucleus of a fertilized ovum from the mother, obtained by artificial insemination, into an enucleated oocyte taken from a woman who is not suffering from the mitochondrial defect. the cytoplasm of the enucleated oocyte provides the stock of functional mitochondria that are indispensable to normal cell function in the future embryo. in a domain of the bioethics, in which rational objectivity comes up against deliberately technophobic religious and cultural considerations, it is useful to remember certain legal and legislative paradoxes. thus, in france, after having been considered to be a criminal act that was subjected to severe repression by the law up until , the right to have an abortion before the end of the third month of pregnancy became not only authorized but also protected by law. it is interesting to note that in the th century, thomas aquinas, the father of the church, had acknowledged that a fetus only becomes "animated" by the implantation of the soul by holy will in the third month after fertilization. another subject to be considered is pre-implantation genetic diagnosis (pgd), in which human embryos that have been fertilized in vitro are sorted in order to find those that are without defects, a practice which is on the verge of being a deviation in the direction of eugenics. nevertheless, pgd is the basis of a practice that is either already legalized or is in the process of being so in several european countries, the creation of so-called designer babies. a typical example is that of a designer baby arising from an embryo whose immune profile to that of an older sibling who is suffering from leukemia. in this case, there is good reason to hope that a graft of immunocompatible blood cells from the designer baby into the sibling who is suffering from leukemia will save the latter from death. out of the disharmony of opinions that arising from cultural tradition, religious conviction or simply scientific pragmatism, the american biologist and philosopher h. tristram engelhardt (b. ) , in the foundations of bioethics ( ) proposes a lay bioethics that is based upon the principle of permission. lay bioethics advocates tolerance while admitting that this tolerance in no way prevents anyone from taking up a personal position; it means that each human being has a moral sensitivity as well as the ability to reason and to choose within a defined limit of non-harmfulness and of justice. the individual is free to modify his or her destiny, or to manipulate his or her nature by genetic interventions because, adds engelhardt, "there is no lay moral foundation to prohibit such an intervention." when a researcher or the research organization that the researcher belongs to files for a patent for an invention with a patent office, it is necessary to demonstrate the novel and utilitarian nature of this invention. if a patent is accepted, this gives the person or body that filed it the exclusive right to make use of the invention over a pre-determined period of time, generally years, which is a means of protection, or, if desired, to allow others to make use of the invention by issuing a license to do so. in the domain of living beings, there has sometimes been confusion between invention and discovery. in , craig venter, known for his participation in the sequencing of the human genome, filed a demand for a patent covering the sequences of fragments of recombinant dna (cdna) called est (expressed sequence tags) that are obtained by reverse transcription from human brain messenger rnas, in the name of the nih (national institutes of health) at bethesda (usa). the patent specified that ests could be used as probes to characterize genes that are potentially involved in neurological ailments. the resulting outcry led the nih to withdraw its patent demand. in fact, the patenting of living beings has a long history that goes back to the patent that was filed in in france by louis pasteur, and then in in the usa, for the use, in brewing, of a yeast culture that was free from pathogenic bacteria. from this historical perspective, the case of ananda chakrabarty (b. ) set a legal precedent. in , chakrabarty filed a demand with the us patent office for a patent relating to a pseudomonas type bacterium which, by genetic modification, had acquired the ability to digest crude oil. his demand was refused. after an appeal and many legal battles, the united states supreme court overturned the patent demand refusal, the basis of the judgement being that any modified microorganism is a product of human ingenuity and has a specific name, characteristics and use. thus, from onwards, the arrival of an era of patents derived from genetic engineering was indicative of how this discipline was growing. in december of that year, stanley cohen and herbert boyer, acting on behalf of the university of stanford, patented a nucleic chimera comprising a recombinant dna carried by a vector. in , a patent concerning the growth hormone gene was awarded to the university of san francisco. in , the university of california at berkeley obtained a patent for the human insulin gene. in , the american company pioneer hi-bred succeeded in patenting a variety of corn in which genetic modification has led to an increased synthesis of tryptophan, an amino acid that is indispensable for animal feed. in , the genentech company acquired a patent for the gene coding for human gamma interferon. this was followed in japan by a patent for the gene coding for beta interferon. in the same year harvard university patented the oncomouse, a transgenic mouse whose susceptibility to cancer is greatly increased. after this, several species of transgenic animals were patented for utilitarian purposes, such as the production of human alpha- -antitrypsin taken from the milk of transgenic goats and used for the treatment of cystic fibrosis. the frenetic patenting of living beings has reached the domain of natural products arising from the plant world in tropical regions, the immensely varied essences arising from these plants being full of pharmacological potential. the potential for producing drugs of a considerable commercial value from such plants is very high. here we return to the problem of the patenting of genetically modified, cultivatable plants (gmps) (chapter iv- . ). thus, the experimental method, the principle of which is to acquire pure knowledge, finds itself led astray in its applications. whatever the motives that are given, particularly for manipulations that give rise to the manufacture of marketable products, the patenting of genomes for mercantile ends shows the regrettable, but unfortunately inevitable, direction in which the very spirit of a science, molecular biology, which half a century ago wished to be at the heart of an understanding of living beings, has drifted. the suffering of animals that are being experimented upon gives rise to a moral problem. the end of the th century saw large-scale demonstrations against vivisection and repeated demands for it to be abolished. today, there is renewed vigor in the call for the abolition of vivisection, without any real coherent basis. this desire to stop experimentation on animals ignores the imperatives of contemporary medicine, which must meet the challenge of pathologies whose increasing incidence is worrying, such as cardiovascular diseases, diabetes, cancer, and the degenerative illnesses that are linked with aging or are of genetic origin. it is true that animal experimentation inevitably leads to questions. are the stakes involved in a particular experiment, in terms of the acquisition of new knowledge, worth the suffering of an animal used in that experiment? is it not necessary to ensure that the experimental protocol is well-documented, that it is not redundant, or even that it has been the subject of previous studies carried out on cells in culture? it is easy to see the size of the methodological chasm that separates contemporary physiology from that of the time of claude bernard, when cell culture techniques were not yet being used, when the main instrument used was the scalpel and the researcher, using his or her imagination and creativity, had to develop specific protocols that were able to validate or refute a working hypothesis. each period in history operates in its own way according to its moral laws and its technical capabilities. the bloody operations carried out by magendie and by claude bernard in the th century, which were tolerated at this time despite criticisms from antivivisectionists, would not be permitted today. nevertheless, it is true that the physiologists of the th century, by means of the results of their experiments, wove a tapestry of new knowledge on which contemporary biologists were going to work and without which the level of understanding the modern science would be much lower than it is. animal experimentation remains indispensable in many areas of physiological investigation, in genomics, in toxicology and in pharmacology. it is a precondition for clinical trials of any new drug, being used to test for the drug's efficacy, its metabolism and any toxicity. however, not all data arising from animal experi-mentation can be extrapolated to man. the margin of uncertainty can be reduced by means of comparative trials on several animal species. because of their phylogenetic proximity to man, primates may seem to be the solution for experimentation prior to the application of a drug in man. this was the case for the development of a vaccine against hepatitis b. it has been proposed the grafting of stem cells in man should be preceded by experimentation in apes, in order to ensure the absence of tumorization over the long term. however, the researcher is confronted with a dilemma: should he or she ensure the safety of man with respect to possible deleterious effects or respond to ethical demands that recognize the very great genomic similarities between man and the chimpanzee. a consideration of cloning, patenting and animal experimentation practices illustrates the excesses of the experimental method in domains where political authorities consider themselves able to legislate. administrative decisions, often made in the absence of any dialogue with scientific authorities, can have serious consequences. thus, given the pretext of strict obedience to the principles of bioethics, which are a matter of tradition, and while certainly respectable, are nevertheless arguable, and also given the pretext of a sickly and unconsidered fear of the risk involved in certain experimental practices, and the absence of an intelligent evaluation of this risk, research, which until recently took place in a motivating atmosphere of liberty, may, over the long term, be weighed down with a highly prejudicial handicap and a limitless sense of discouragement. in the th and th centuries, experimental research, which was still in an emergent phase, was mainly artisanal, and in the hands of rare scholars. it took form during the th century in the west, particularly actively in germany, and became operational in the th century, under the aegis of governmental authorities, with the creation of institutes, the programmed recruitment of researchers and the allocation of renewable budgets. modern science, based on the principles of the experimental method, came to the fore much later in the east than in the west. the globalization of knowledge has meant that at present experimental science, in all domains, including that of the life sciences, has spread throughout the world, with even those countries that had become relatively backward in these domains because of their isolation catching up rapidly. nevertheless, it is true that the progress of the experimental sciences in the usa and in the united kingdom has been distinguished by pragmatic management of these countries' science policies, based on the excellence and the high degree of autonomy of their universities and research institutes with respect to recruitment and choice of subjects of study. the efficacy of this policy in the life sciences may be judged by the number of researchers who have won nobel prizes since the second world war (at the time of writing, more than in the usa and twenty or so in great britain as opposed to only in france). in france, research on living beings is carried out in the laboratories of universities, in institutes connected with higher education and in laboratories that are run by large organizations such as the national scientific research center (cnrs), the national institute of health and medical research (inserm), the national institute of agronomic research (inra), the atomic energy commission (cea), the national institute of research in computer processing and automation (inria), the national center for space studies (cnes) and the french institute of research on the seas and oceans (ifremer). equivalent bodies exist in countries other than france, some of them being institutes that are dedicated solely to research, and some being university laboratories that associate research and teaching. at the beginning of the th century, the function of researcher was most often associated with that of a professor occupying a chair at a university, surrounded by a few assistants, the professor directing the research work in his area of specialization. now, within a period of a few decades, the status of researcher has been modified greatly. today we talk of research careers classified according to level of expertise and technicality. management, or the supervision of career paths and the control of financing, is carried out by an administration that is itself highly hierarchical. the scientific process has undergone a metamorphosis, shown by changes in the behavior of researchers not only within the institutions in which they work but also in their relationships with the media, the political sphere and society. the teaching of the life sciences needs to take this into account. "long ago, there was a time when scientists recounted the exact circumstances of their discoveries, without shame, even when their recital showed up the fragility of their forecasts or an indecent collaboration on the part of every bit of luck. such times are past, and the researchers of today often like to make us believe that they only find what they are looking for. the thousands of pages pasteur's lab books provide an opportune reminder to us (and to program-makers or impatient users) that it is just as difficult to ask a question as to answer it, that a scientific discovery often occurs after a long, winding path, that rather than following the fashion, it is preferable to follow one's ideas, particularly if they are good ones, and are in advance of the fashion." jean jacques molecular dissymmetry, in "pasteur, workbooks of a scholar" - current technological progress, the accumulation of the scientific knowledge, the institutionalization of the public research and many other factors are disrupting a ritual of the experimental process that had survived until the middle of the th century, and even beyond. the experimental life sciences of the st century will necessarily see themselves remodeled with respect to their objectives and procedures. faced as it is by an increasingly tough international competition, the scientific community is also subject to restrictions in terms of operation and prospectives. an organization into small teams of a few researchers gathered around a boss, working in friendly interaction, is increasingly giving way to large groupings that sometimes seem like consortiums. focused on research subjects that are deemed to be "cost-effective", these superstructures are encouraged, or even imposed, in the sadly illusive hope that the will lead to greater efficacy. the person in charge of such large groups is taken up with everyday management tasks and by maintaining good relations with the administrative bodies on which his or her organization's survival depends. he or she may become distanced from the experimentation and forget the intellectual motivations that in the past caused his or her competence to be recognized. it should be emphasized that the secret of future successes lies in situations where young researchers are in direct contact with their bosses, and where friendly interaction with a known master teaches the apprentice researcher how to learn, how to think and how to experiment in a critical fashion. preoccupied by the rapid expansion of the scientific population, accompanied by the creation of laboratories whose operation necessarily requires financing, often on a large scale, political authorities, giving way to the requirements of media-fed public opinion, are interfering more and more, via administrative relays, in the control of the objectives of experimental research. short-term objectives, considered to be "visible", are favored. a priori, the viability of a project is judged according to the scientific context of the period and its impact on society, insofar as the project looks at health problems with a high degree of media coverage (cancer, degenerative illnesses, viral infections…) and often in agreement with a consensus that avoids going against the orthodoxy of the moment. this leads to a rigid management of projects that are financed and controlled according to objectives that have been fixed in advance, and that are all the more easily accepted by state authorities when they are somewhat fantastic in character. however, fundamental research proceeds from a playful activity, and for this reason, its efficacy is dependent on the passion of the researcher for the problem that he or she is studying. in contrast to what is believed by the narrow-minded, the effectiveness of a researcher in terms of discoveries depends upon the liberty that is given to this researcher, assuming, of course, that this liberty is underpinned by criteria of confidence such as the researcher's scientific past, his or her motivation, and judgements made concerning the researcher by impartial peers. it should not be forgotten that the determination of the three-dimensional structure of hemoglobin by max perutz (chapter iii- . . ) took around twenty years of solitary, uninterrupted and untiring labor. the theoretical and technical tricks that led to this success helped to open up the domain of the structures of giant macromolecules, several dozen kilodaltons in size, which no-one had dared study before. anyone who uses the experimental method realizes that while fundamental research must be organized, it cannot be scheduled. such a person knows that the pathways to discovery are convoluted, and that an inexplicable observation that appears unexpectedly during an experiment can sometimes, if the researcher is sufficiently perspicacious, be the beginning of an adventure that leads to a discovery. it was to just such a convoluted path that the belgian biologist christian de duve (b. ) alluded in his speech when he received the nobel prize for medicine and physiology in . after working at the university of saint louis in the usa, de duve, who had taken up a post at the university of louvain, belgium, decided to look at a research theme that had received a great deal of media coverage, diabetes and insulin. it was while operating on one of the subcellular fractions obtained from ground rat's liver, and analyzing certain of the enzyme activities of these fractions, that he was surprised to find, in one of them, enriched with mitochondria, a phosphatase activity that, paradoxically, increased with time, while the enzyme activities specific to the mitochondria declined. this was an activity belonging to organelles that were contaminating the mitochondria. dropping all research on diabetes, de duve set out to identify and characterize these unknown organelles. he discovered that they were involved in the breakdown (lysis) of molecules that are undesired by the cell and, for this reason, he called them lysosomes. the discovery of lysosomes helped to open a new chapter in cell biology and to attribute a molecular cause to diseases with serious prognoses whose etiology had remained a mystery up until then. these diseases were given the label lysosomal diseases. these diseases result from the absence of a lysosomal enzyme that is responsible for the breakdown of a given metabolite. the accumulation of this non-broken-down metabolite in the lysosomes leads to cell malfunction, which causes the lysosomal disease. as de duve said jokingly, if he had carefully followed the experimental process laid down in his diabetes research project, and if he had not given way to the temptation of "playing hooky" or "playing truant" he would never have mounted the podium in stockholm. in the same way, henri-gèry hers ( - ), a cell pathologist at the internationally renowned louvain school, remarked in an article published in the review médecine/sciences: "i believe we would obtain maximum value for the money devoted to research if we were willing to distribute it to those who have been shown to be productive, according to their needs, and without asking them for a program." hers concluded, in a tone that was deliberately playful, but thought-provoking, "such a simple system would lead to unemployment for a large number of administrators, which is why i suspect that it will never be adopted." research has its own set of ethics, driven by anticonformity and the creative imagination, capable of shaking up firmly-anchored ways of thinking and established hierarchies, and of leaving the researcher the freedom to express him or herself and to experiment off the beaten paths. as eccles says in evolution of the brain and creation of the conscience, it is important to distinguish between intelligence and imagination. intelligence is measured according to the rapidity and depth of understanding and clearness of expression. it may be measured and even given a numerical value. the same is not true for the imagination, a more subtle, unmeasurable phenomenon that cannot be learned. the imagination is one of the levers that is able to lift the boulder that hides scientific truth. the imagination is the ultimate weapon of research, which shakes up the knowledge acquired by the intelligence. nevertheless, the imagination must be tempered by a good critical sense that is able to perceive potential sources of artifacts, both in sophisticated instruments that act as so many black boxes from which already manufactured information emerges and in genetic or chemical cell exploration methods whose specificity must be carefully checked. the benefits that can sometimes be gained from prospective research that is far from dogma that is rooted in sterilizing tradition, the way in which knowledge progresses, most often by moving away from any orthodoxy, the way discoveries appear unexpectedly on the fringes of carefully put together projects, all of these points are matters for reflection for those in power in the worlds of politics, economics and industry. publication is an essential tool for communicating scientific knowledge, and is the judgement criterion for committees in charge of evaluating the creativity of a researcher. in order to have meaning, a publication must provide information that is sufficiently innovative with respect to parallel work carried out in other laboratories. here again, media coverage has quietly infiltrated the scene. its role is all the more perverse in that the rating of a publication is estimated according to its impact index, or, roughly speaking, the renown of the scientific journal in which it is published. curiously, it has happened that articles that would later be considered to be of primary importance have been rejected by highly prestigious journals, simply because the facts mentioned in the article and the conclusions made have not coincided with the orthodox opinions of the period and the traditionalist spirit of the journal's editorial committee. this was the case for an article which the biochemist hans krebs ( krebs ( - submitted to the british journal nature in . in this article krebs described a series of experiments showing that an endocellular metabolite, pyruvate, product of glycolysis, is completely degraded during a cycle of enzyme reactions. this degradation cycle would later be recognized as the central pivot of the intermediate metabolism. called upon to judge revolutionary scientific considerations, and unable to perceive their importance, nature's editorial committee rejected the article. krebs then sent his article to a journal with a relatively restricted audience, enzymologia. it was accepted and published in the two months that followed. the importance of the concept that was put forward in the article ensured that its author gained international recognition, leading to his winning the nobel prize for physiology and medicine in . for the researcher, publication is a way of making his or her work known. it is also the way in which the researcher learns about the work of others. while the rhythm at which publications in the life sciences appeared increased slightly in the first half of the th century, the second half of that century saw a great acceleration in this rhythm, leading to a difficult-to-manage proliferation of reviews and books. it has been estimated that in the last thirty years the volume of publications in the biological domain has increased five-fold; in the preceding twenty years it had already doubled. this accumulation of publications makes it harder for the researcher to judge the quality of the huge mass of published articles, even in the highly targeted domains that are within his or her area of expertise. the researcher, therefore, will deliberately choose a particular article according to the prestige of the journal in which it is published, which is not an inviolable criterion of quality. in addition, any judgement concerning the pertinence of a scientific article necessitates a dissection of the subtleties of the methodology, the well-groundedness of the experimental protocol and the validity of the results, by means of a careful examination of tables of results and graphs, and, finally, the logic of the discussion. this restrictive yet absolutely necessary requirement limits the number of articles that are likely to be screened. however, this is not the worse fault of publication today. there is another problem that is much more worrying. many documentation centers have reacted to this inflation in the scientific press by equipping themselves with computing facilities that are able to find, in data banks, articles that have been selected on the basis of a key word index, and to display them on screens. while acknowledging that this constitutes an inescapable change in the transmission of scientific know-how, it should be recognized that in browsing through the pages of a highquality scientific review, it is possible to come across an article containing an innovative idea or a useful technique, an advantage that is less available when using the on-line system of scientific publication that is most prevalent nowadays. mention should also be made of the requirement to publish frequently and within short time frames, for reasons of competitivity, when aspiring to obtain jobs or promotions, or even just to obtain recognition, this requirement being another factor that is prejudicial to fundamental research. it is the cause of worrying excesses, such as experiments that are hastily published and non-reproducible, or even the falsification of experimental results, occasionally within a context of considerable media coverage. although such practices, which are the exception rather than the rule, are rapidly detected and condemned in a scientific culture where information circulates freely, the publicity that they incite, which reaches society at large via the media, leads to an overall discrediting of experimental research. at present, one of the most noticeable trends in scientific publication is that of collectivism. while, in the th century, scientific articles were usually published in the name of a single author, occasionally two authors, and very rarely more than two, nowadays publications are often co-authored by several people, and when the work involves the analysis of structures, or the sequencing of genomes, several dozen researchers may be co-authors. from being the work of individuals, research has become collective. in domains whose complexity requires a wide selection of techniques that may range from physics to genetics, the hybridization of specific areas of expertise is certainly indispensable, and this requires the collaboration on a particular project of researchers who are sometimes physically remote from one another. the downside for the researcher, particularly one who is young, is that this requires him or her to abandon individuality and creativity. both collectivism and inflation in scientific publication are facts that are an integral part of contemporary science, facts which reflect an irreversible trend that it would be difficult to obviate. over the last few years, scientific publication has been subject to a type of restraint, in that certain "sensitive" data in the domain of molecular biology might be used for the manufacture of biological weapons in a form of terrorism known as bioterrorism. thus, the means of synthesizing de novo viruses (influenza virus, poliomyelitis) and the possibility of modifying their tropism by "directed molecular evolution" (change from a sexual tropism to a respiratory tropism for the aids virus) have been the subject of publications in prestigious journals. given sufficient means, terrorist pharmacists could well make use of such data in order to carry out malicious actions with catastrophic consequences . in order to please a public that is eager for progress and the sensational, politicians favor, by means of targeted financing, the types of organization that appeal to their sensibilities, such as the technological platforms. while recognizing that such platforms are now an integral part of the landscape of research on living beings, and that they must therefore be taken into account, and while acknowledging that projects which implement the latest technologies in different domains need to be federated, it is nonetheless vital not to underestimate the potential creativity of small groups of researchers, a point that was expressed by one of the greatest of contemporary biologists, arthur kornberg ( kornberg ( - , winner of the nobel prize for physiology and of medicine, in a speech given in : "as i view the steady growth of collective science and big science, the greatest danger i see is a dampen-ing of individual creativity and reversion to the old politics -the inevitable local politics that infects every group and institution." however, conscious of the metamorphosis that is occurring in the experimental method, and faced with a particularly inventive and all-conquering technology, fundamental research in the life sciences must come to terms. a century ago, fundamental research and technological research interacted all the more directly because they were both in their infancy. this is no longer the case. management of the ever-increasing amount of knowledge in the life sciences, and the degree of sophistication achieved by bioengineering techniques and instruments, is widening a gap that makes dialogue increasingly laborious. however, dialogue appears to be a guarantee of future progress. the solution can only come from an increase in cross-disciplinarity, which should begin with university teaching and the establishment of a recruitment policy that advocates the cohabitation of talents from different educational backgrounds in the same laboratory. fortified by such hybrid expertise, while maintaining its share of originality and liberty in the choice of problems to be studied, fundamental research on living beings can only be enriched by a marriage of reason with biotechnology. convinced of the necessity for such a marriage, stanley fields, the inventor of the double hybrid method (chapter iv- . ), in an article entitled "the interplay of biology and technology" (proceedings of the national academy of sciences, usa, , vol. , pp. - ), concludes,: "it is at the interfaces of biology and other sciences that many of the future discoveries will be made, at the interfaces of biology and engineering that these discoveries will come to be exploited, and at the interfaces of biology and ethics and law that their consequences for society will be decided." the desired dialogue between biology and technology also implies the breaking down of barriers that too often isolate fundamental research and so-called applied research, and the facilitating of consistent interaction between the discoveries made in the academic institutions and their application for utilitarian ends in private companies. this is where the twin demons of money and power raise their heads. already, at the turn of the s, a. bartlett giamatti ( - ) , who was then president of yale university in the usa, commenting on american university policies, spoke of a "ballet of antagonisms" between, on the one hand, commercial companies that are interested in the rapid cost-effectiveness of any new therapeutic advance and, on the other hand, non-profit university laboratories. recently, james j. duderstadt (b. ), emeritus president of the university of the michigan, argued that the university is a "counter-hierarchical" organism. in fact, its members are free to carry out the research that pleases them and to think in the ways that they wish to think, in any case within an academic norm that considers itself as being free from the constraints dictated by private interest groups. until recently, such behavior was considered as a sort of ethic which arose out of the university conscience and dignity. the crumbling away of this ethic in the final decades of the th century coincided with the rise of biotechnologies and the large-scale filing of patents relating to molecular genetics techniques that could be applied to the manipulation of living beings, by researchers in the public sector. the intrusion of the american private sector into public research laboratories, in the form of collaborations with transfer of "sensitive" information from the public to the private, has become such a worrying problem that drastic control measures have had to be taken. within this context, the american federal government, in february , issued a certain number of prohibitions targeting the national institutes of health (nih) of bethesda, particularly with respect to the retribution of researchers for services rendered to industry . these stands call for thought concerning the place that is currently held in universities with respect to fundamental research. without arguing against the efficacy of major research institutes, it is nevertheless necessary to remember the part played by the university in this domain. the university is not only the place where knowledge, both as it is now, in its current state of advancement, and as it has been, it is also the place where knowledge must be created by fundamental research. for the last few decades, under pressure from state policies, and also as a function of an improvement in social status, the world of the university has opened up to a wider public, leading to an influx of students that is sometimes so enormous that the task of teaching them has become overwhelming. because of this, the share of their time that university researchers can, in practice, devote to their research tasks has shrunk. this situation is highly prejudicial to the mission to innovate, which should be a priority. it is, in fact, during their university studies that the thought patterns of young students are forged by contact with teachers who not only instruct them, but also educate them by inspiring in them a motivation and an enthusiasm that gives rise to hope. how could this be true if the teaching faculty did not itself participate in scientific creation? "what can teaching, ex cathedra, do to guide the researcher? nothing, obviously. the researcher is trained in the laboratory. and the first stroke of genius on the part of a future researcher is to find a good boss. such a find will open up the royal road to success. the road will be opened -but the researcher must travel along it. a researcher may be taught many things. he or she can become familiar with techniques and with equipment. she or he can be assigned a problem to resolve. however, what is essential for the researcher is to know how to understand relationships between phenomena that seem unrelated, and to be able to progress from the particular to the general. a boss may develop such qualities in a gifted young researcher, but intuition is a gift; it cannot be taught." while the bernardian style experimental method, based on a working hypothesis aroused by an observation, followed by implementation of an experimental protocol, is still extant in the life sciences, and while "serendipity" is still the origin of great discoveries, "big science" , underpinned by sophisticated biocomputing or bioinformatics procedures, is intruding more and more, while genomics and proteomics are not far behind. the methods and instruments developed by the biotechnosciences have led to profound modifications in the ways that the structures and functions of living beings are investigated. for example, by varying multiple parameters in dna chips or protein chips, at the same time, the experimenter is able to ask questions that lead to grouped all-or-nothing answers (chapter iv- . . ). in combinatory chemistry, screening makes it possible to detect a molecule that is active for a given pathology from among a multitude of molecules (chapter iv- . ). the mathematical simulation of metabolic networks or of signaling chains is already well under way (chapter iv- . ). given this new technological outlook and the hope that it can provide rapid solutions to health problems subject to considerable media coverage, the teaching of biology in universities must not be limited to a description of current advances, no matter how brilliant and promising they may be. this teaching should return to its origins, be a reminder of history, and should not hesitate to use examples to illustrate how a major discovery can arise from a long period of wandering in the wilderness. in practical terms, while being conscious of the extraordinary complexity of living nature, and carefully avoiding the dangers of simplification, it is important to remember that the reductionist method was a necessary path to an understanding of the integrated, modelized biology that is emerging nowadays. at present, certain people call reductionism naive, but this is only the case insofar as we have faith in recent advances in integrated biology . with this in mind, it should be noted that the deciphering of the protein synthesis mechanism in prokaryotic microorganisms (chapter iv- . . ) was, along with the discovery of the genetic code, a jumping-off point for an inventory of similar, but noticeably more sophisticated, mechanisms in eukaryotic organisms. the reductionist "one gene, one enzyme" dogma, formulated on the basis of beadle and tatum's experiments on the mold neurospora crassa (chapter iii- . ) was a necessary prerequisite to a considerably more elaborate understanding of the relationship between the genotype and the phenotype. the way in which the nucleic acid and protein units in the tobacco mosaic virus spontaneously organize themselves (chapter iii- . ) acted as a basis for thought concerning the self-organization of macromolecular complexes in the cell. these few examples underline the fact that it is difficult to comprehend the scientific research process if we only refer to experiments carried out in the present, and if we do not have a clear idea not only of the way in which hypotheses, even false ones, were once formulated, but also of the way in which experimental work, which may have led to failures, was once carried out, or, in brief, if we do not look back at the past. let us add that it is occasionally good for us to show some humility when we take the trouble to examine the past. thus, the processes involved in the phagocytosis of bacteria by innate immune cells (neutrophils, macrophages), which are today studied in the greatest detail with particularly refined technical facilities, had already been perceived more than a century ago by metchnikoff, and even analyzed, admittedly with the clumsy means at his disposal, but with such accuracy that none of the conclusions formulated at that time have yet been disproved (chapters iii- . . and iii- . . ). the experimental method applied to the life sciences, the history of its birth and of its development, the way in which it is regarded by political and societal authorities, and, finally, the dependencies that are developing at present between the technosciences, human medicine and the different branches of the economic sector, all of these aspects should be covered by university teaching that includes not only the pure sciences, but also the human, political and economic sciences, as well as philosophy. the student should not be saturated with book-learning, but he or she should be taught to reason, to imagine and to criticize, not to accumulate knowledge in an indigestible catalogue, but to ask questions about the way in which certain, carefully chosen, items of knowledge have been acquired, and not to deliberately accept science in its current state without knowing what it was like in the past. he or she should understand what pathways of thought led to dogmas that were established and taught as truths being refuted, and favor experimentation, with its risks and questions, rather than well-smoothed, abstract theoretical presentations without rough edges. these should be the principles of teaching that is designed to open up young minds to creativity. in anglo-saxon countries, the worlds of industry and research that welcome the graduate manage to communicate with one another, but these worlds ignore one another in france, or at least remain reserved, a situation which is prejudicial from the economic point of view. if we look at the pharmaceutical industry in particular, we see that only half a century ago the pharmacopeia was limited to plant extracts or active agents isolated from these plants, with antibiotics quietly beginning to make their appearance. in the last decades of the th century, a great technological leap forward was made, with completely new methods in bioengineering, combinatory chemistry, and the finding of therapeutic targets in macromolecules, and this created a hiatus that severely handicapped countries that were unprepared for it. france, with its biological fundamental research training that is out of phase with that of the anglo-saxon countries, fell behind, and continues to be behind, a situation that is prejudicial for its economy. the remedy for this does not lie in incantatory speeches. it requires a volontarist policy for the management of experimental research. generally speaking, the fact that the major engineering schools in france, which recruit the scientific intellectual elite, students being chosen by competitive exams that select for intelligence rather than imagination, are unable to impose upon their students an end-of-course thesis that would authenticate their engineering degree, should not be tolerated. in contrast to other countries, in france only a small percentage of engineers have received doctoral training or had to present a thesis before entering their careers. the french dual system of major engineering schools and universities, which, a century ago, made sense for the economy of that period, has become completely obsolete, and deserves a courageous revision. "there is a question, much older than modern science, which has never ceased haunting certain men of science: that of the conclusions that the existence of science and the contents of scientific theories can lead to concerning the relationships that humankind has with the natural world. such conclusions cannot be imposed by science as is, but they are an integral part of the metamorphosis of this science." the new alliance. metamorphose of science - ( nd edition) in the s - s, the hybridization of the techniques of genetics, biochemistry and biophysics gave birth to molecular biology. with the resolution of the double helix structure of dna, the demonstration of its replication, the elucidation of the mode of expression of its nucleotide sequence as a sequence of amino acids in proteins and finally the deciphering of the genetic code, biology underwent a revolution of an amplitude similar to that which, at the end of the th century, saw a blossoming of the seeds of cell biology. the last decades of the th century represented the utilitarian era of molecular biology. the introduction of genetic engineering into biological experimentation dates to the beginning of the s. it was at this time that techniques were developed that made it possible to transfer a fragment of genomic dna from one species into the genome of another species. genetic engineering now fills a predominant position in the life sciences, supported by increasingly effective biocomputing or bioinformatics techniques. it is easy to understand that expertise and a high degree of knowledge about fundamental research is necessary in order to be able to master or even invent the genetic engineering techniques that are indispensable if we are going to produce biomolecules with a therapeutic impact, such as those that are currently being used in the pharmaceutical domain: insulin, growth hormone, blood coagulation factors, vaccines, etc. the engineering sciences that make up the greater part of contemporary biotechnology have now come to the fore in many domains of the life sciences. it is thus that a modernistic and original way of investigating nature has come into being. a multiparametric model, in which biocomputing or bioinformatics and high-throughput screening reign, is added to, or even substituted for, the bernardian model for the experimental method, based on observation, an a priori hypothesis, and experimentation to verify this hypothesis by varying a single parameter at a time. the aim of this globalized approach is to integrate the multiple reactions that take place almost simultaneously in different locations of a cell into a coherent whole, to rationalize the interpretation of the dialogue that operates between the different endocellular organelles, and finally to discover how the exchanges of information between cells in an organ and between organs in multicellular organisms are set up. we are therefore witness to the emergence of an integrated biology that has been labeled "systems biology". its long-term objective is to model the functioning of living beings and to theorize them. its development is encouraged by the perspective of consequences that could revolutionize certain sectors of the human economy and of public health. today, concrete, mechanical models, in the form of biorobots and hybrid robots, and, very recently, molecular motors are added to abstract models that are based on the logic of mathematics and algorithms, ushering in the era of nanobiomachines. becoming more utilitarian, the life sciences are imperceptibly detaching themselves from traditional philosophical concepts that try to explain the modes of reasoning of the researcher, or even to impose a framework for thought that is likely to orient his or her way of doing research. looking at genetic inheritance, contemporary experimentation has shown that at all levels of the tree of nature, including man, this inheritance can be modified. aware of his or her ability to influence the functioning and the destiny of living beings, the researcher is confronted with the dilemma of a desire for knowledge versus a questioning of the use to which discoveries may be put. there has never been such a real divorce between the world of phenomena that are understood by the experimenter and the world of noumena whose intelligibility is foreign to our senses. there has never been such a wide gap between the biotechnosciences, whose possibilities are coming to be seen as limitless, and a reflective analysis of thought, which wanders between freedom of action and prohibition. as society becomes aware of the potential applications of discoveries made concerning living beings, problems of bioethics, particularly those involving reproduction, have become problems of public interest. cloning and the production of stem cells are subjects that give rise to diatribes and passions. in the near future, genotyping, which is the result of progress in pharmacogenetics, could usher in a new form of customized medicine. elsewhere, the cognitive sciences that are bringing together philosophy and psychology in the domains of computer technology and artificial intelligence, and which are tackling the processes of thought, the creative imagination and memory, will no doubt be the subject of the considerable questioning concerning research on living beings with which the experimental method will be confronted in the st century. when faced with the way in which biotechnologies have erupted into the life of society, the mind travels back to the allegorical illustration that embellishes francis bacon's novum organum (see figure ii. ) , showing vessels returning from unknown lands, loaded with precious cargoes and returning to port having sailed past the pillars of hercules. at present, the challenge has been partially met, but a great deal remains to be done. innumerable cargoes have already reached port, but what will be the destiny of this precious merchandise? after all, the seeds of the idea of technoscience were already in place in the th century, in the philosophy of francis bacon and robert boyle (chapter ii- ). bacon recommended that the governments of the time promote experimental science by the creation of laboratories equipped with high-performance instruments and libraries, by the organization of researchers into teams and by appropriate financing. the utilitarian ends of scientific research were underlined. boyle imagined a situation in which laboratories were open to society and researchers were able to accept criticism. given innovations that upset tradition, protestations arose. the pneumatic machine or vacuum pump was the subject of the fameuse diatribe between boyle and the philosopher hobbes (chapter ii- . ). hobbes criticized the validity of boyle's conclusions, drawn from experiments that he qualified as doubtful. following his words, he came to see in the discoveries of experimental science a possible threat to the power of governments and the hierarchical layout of society. such overcautious opposition to the pursuit of knowledge is in no way anecdotal, it is still a reality, with the uprooting of genetically modified plants and the veto that has been placed in certain areas on stem cell research. this type of opposition is also shown when pressures or even vetoes are in operation that take into account more the opportunism of the moment than an in-depth understanding of science and of its history and that forget that freedom of the mind is a guarantee of its creativity, because, just as in the world of arts and letters, the world of scientific research is situated outside those norms that can be modulated by state decrees. the creativity of the researcher cannot be manufactured on demand. where it exists, it still needs to be detected and encouraged. the atp-synthase -a splendid molecular machine structure at . Å of f -atpase from beef heart mitochondria direct observation of the rotation of f -atpase powering an inorganic nanodevice with a biomolecular motor mechanically driven atp synthesis by f -atpase atp-driven stepwise rotation of fo-f atp synthase key: cord- - vd mdu authors: nan title: abstracts from the th european society for animal cell technology meeting: cell technologies for innovative therapies: lausanne, switzerland. - may date: - - journal: bmc proc doi: . /s - - -x sha: doc_id: cord_uid: vd mdu nan . a schematic representation of crispr based synthetic transcription factor technology. b mrna expression levels of protein transport related genes (napg, rab a and arpc b). background an increasing number of biologics are entering the development pipelines of pharmaceutical companies [ ] . today, the preferred production host for therapeutic proteins is the cho cell line. however one of the major hurdles, especially for the production of non-antibody glycoproteins, is host cell-related proteolytic degradation which can drastically impact developability and timelines of pipeline projects. material and methods spike-in: cho cells were cultivated in a chemically defined culture medium at . °c/ % co in shake-flasks. when the cells reached their maximum viable density, they were removed by centrifugation and the conditioned medium was collected. a model mab was spiked into the conditioned medium and incubated at °c ± protease inhibitors. the amount of proteolytic degradation was analysed by western blot and lc-ms. transcriptomics: total rna was extracted after days of cell cultivation. rna sequencing libraries were constructed and processed on the hiseq platform from illumina. generation of matriptase knockout: cho-k cells were transfected with mrna encoding "transcription activator-like effector nucleases" or "zinc finger nucleases" targeting matriptase exon . the transfected cells were subsequently sorted into single cells and analysed for frameshift mutations in both alleles via sanger sequencing. cell cultivation: fed batch cultivation was performed in -ml miniaturized bioreactors (ambr ). approximately proteases are known in rodents. to reduce the number of candidate proteases we showed first that a model mab (prone to proteolytic degradation) incubated in conditioned medium of cho-k cells resulted in clipping of the mab, demonstrating the involvement of secreted/shedded proteases (fig. a) . broad spectrum inhibitors of the different protease classes revealed that only serine protease inhibitors prevented clipping. serine protease inhibitors of higher specificity highlighted the group of "s a trypsin-like proteases" (fig. a) . comparison of the proteolytic degradation profile of several therapeutic proteins between cho-k with another cho cell line (cho-a) revealed less degradation in cho-a. therefore expression of the involved protease(s) is likely lower in cho-a. gene expression profile analysis of both cell lines showed five secreted/shedded "s a trypsin-like serine proteases" more than . fold lower expressed in cho-cho-a versus cho-k (fig. b) . surprisingly, sirna knockdown experiments of these five candidates identified "matriptase" as the major protease involved in degradation of recombinant proteins expressed in cho-k cells ( fig. c upper panel) . next, we generated a cho-k matriptase knockout (ko) cell line. no proteolytic degradation product was detected when the model mab was spiked into conditioned medium of the ko cell line (fig. c lower panel) . also, stable expression of the model mab in the ko cell line resulted in no/significantly less clipping (fig. e) . the protein titer and the cell growth behaviour of the matriptase ko cells were similar to the corresponding wildtype (wt) cells (fig. d) as shown by comparative cultivation in ambr system. conclusions one major challenge for the production of recombinant proteins is cho host cell mediated proteolytic degradation which can negatively impact or even result in termination of projects [ ; ] . using a variety of techniques such as applying protease inhibitors, transcriptomics and sirna mediated knock-down we were able to identify "matriptase" as the major protease involved in degradation of recombinant proteins expressed in cho-k cells. subsequently we generated a matriptase deficient cho cell line. protein candidates of diverse formats, severely degraded in wt cho-k cell line, were not or significantly less cleaved in the matriptase ko cell line. furthermore cell growth, viability and productivity levels were comparable between the wt and the matriptase ko cell line. in summary, we have generated a superior platform-compatible cho production host cell line with the same favourable productivity properties as the parental host cell line [ ; ] , allowing expression of complex glycoproteins prone to clipping. background cho cell lines are common hosts for the production of biopharmaceutical proteins. so far, considerable progress has been made increasing productivity of cell culture to meet the rapidly growing demand for antibody biopharmaceuticals through increased cell densities and longer culture times. the downside is the increase of the process related impurities, bringing new challenges for process and harvest development. among the process related impurities such as host cell proteins (hcps) or dna the potential impact of lipids production and release during cell culture is still poorly understood due to the complex nature and diversity of this class of molecules. thanks to recent advances in analytical tools especially mass spectrometry, the advent of lipidomics offers now the feasibility to study several thousands of lipid species thus unraveling the possibility to understand and potentially control the interactions between high performance bioreactor processes, harvest conditions and purification. in order to analyze and quantify lipids, we developed a three steps method. in a first step, lipids were extracted with methyl tert-butyl ether (mtbe) according to matyash method [ ] . lipids were then separated by liquid chromatography using either hilic of reverse phase column prior to detection and quantification by mass spectrometry. all lipid classes were detected by esi-ms/ms excepted cholesterol (apci-ms/ms). finally we applied this method to analyze the lipid content of different cell lines each expressing a different recombinant protein, during a days fed batch process. lipid from cho cells were successfully extracted with a yield between % and % depending on the different lipid classes. stable isotope labeled lipids were used as internal standard in order to have comparable results between batches. the obtained results (fig. ) show that for a given cell line, lipid distribution is changing over the process. moreover, this distribution may vary significantly depending on the cell line: cl- and in a lower extend, cl- , show an accumulation of triglycerides from day to the end of the process, while cl- doesn't seems to follow this trend. conclusion interestingly, in some cell lines/experimental conditions, we highlighted an overproduction of triglycerides and cholesterol leading to the accumulation of lipid droplets known as energy storage sink. at the metabolic level, these findings suggest a relative overflow of the carbon metabolism. from a process development perspective these findings can be considered on the one hand as a resource waste since the stored energy is not used for protein/biomass biosynthesis and on the second hand as the root cause of additional process challenges especially during the harvest and the first capture steps given the hydrophobic nature of these molecules. implementation of lipidomics analysis enables us to highlight a new type of process variability and to anticipate potential problems for the downstream steps. the application of this methodology on our platform has helped us to design tailor made solutions (pretreatment selection, filter selection,…) at the clarification step which are now implemented in our harvest development platform approach. . matriptase knock-out in cho cells prevents clipping of recombinant proteins. a serine protease inhibitors protect model mab from proteolytic degradation in cho-k cell derived conditioned medium. the model mab was incubated in conditioned medium for h or h at °c, subsequently samples were analyzed by western blot. broad spectrum serine protease inhibitors (aprotinin, leupetin) were added during incubation. aprotinin and leupetin are inhibiting proteolytic degradation. the intact mab (upper band) and the clipped mab (lower band) are indicated by arrows. b gene expression profiling of cho-k versus cho-a by ngs. shown is the gene expression profile of "secreted/shedded members of the s a trypsin-like serine protease family" for cho-k and cho-a cell lines using next generation sequencing. the gene expression analysis highlights that five proteases were more than . fold higher expressed in cho-k cells (labelled with a red asterix). the y-axis shows the transcript abundance as rpkm (reads per kilobase of exon model per million mapped reads). c sirna knock-down identifies matriptase as major clipping protease and cho matriptase ko clone shows no detectable clipping activity. upper figure: sirnas directed against the five protease genes and scrambled (scr.) sirna were transfected and conditioned medium was collected three days after transfection. the model mab was incubated in fresh medium as control (first lane) and conditioned medium from the sirna transfected cells. samples were analyzed by western blot. only sirna targeting matriptase (st ) showed reduced proteolytic degradation. the intact mab (upper band) and the clipped mab (lower band) are indicated by arrows. lower figure: the model mab was incubated for h in conditioned medium collected from wt cho-k as well as the matriptase knockout clone. samples were analyzed by western blot. the intact mab (upper band) and the clipped mab (lower band) are indicated by arrows. no proteolytic degradation could be detected in the samples originating from the matriptase ko clone. d cell growth, viability and productivity in ambr (fed batch with temperature shift). cell growth, viability and volumetric productivity profiles of wt cho-k (red circles, n= ) and matriptase ko clone (blue squares, n= ) cultivated in -ml ambr. no significant differences were seen between wt and matriptase ko clone regarding cell growth and viability. comparable or slightly higher productivity was detected for the matriptase ko clone compared to the wt. e significant reduced proteolytic clipping applying matriptase ko clone. the model mab was stable expressed in cho-k (wt) as well as the cho-k matriptase ko clone. samples were analyzed by western blot. the intact mab (upper band) and the clipped mab (lower band) are indicated by arrows. significant reduced proteolytic degradation could be detected in the samples originating from the matriptase ko clone ( samples each is shown for wt and ko cells) the glycosylation of therapeutic proteins is a critical quality attribute (cqa) and needs to be analyzed during cell line and bioprocess development. the current methods for analyzing glycosylation are mainly based on the enzymatic release of glycans. they are tedious and offer only limited throughput, which makes them unsuitable for cell line development work. in this study we evaluated a novel paia assay for measuring intact glycoproteins with capture beads and fluorescence labeled plant lectins to analyze glycans in a high throughput -well plate format. material and methods analytes: erbitux © , mabthera © , arzerra © and avastin © . two glycoengineered variants of one igg were kindly provided by merck (vevey, switzerland). all analytes were spiked into cho-k cell culture supernatant or buffer, diluted : with a denaturation solution and incubated at or °c for minutes to expose the fc glycans. erbitux samples were analyzed under denaturing conditions to detect fab-and fc-glycosylation and in native conditions for fab glycosylation only. μl of pretreated sample was added to each well of the special -well paiaplate, containing labeled lectin and capture beads. the microplate was incubated for minutes at rpm on an orbital shaker at room temperature and spun down at xg. the read-out was done on a fluorescence microscope (synentec, elmshorn, germany) in less than five minutes. results figure a : lectin binding profiles of different iggs. the analysis of different igg results in lectin binding profiles which show the different degrees in glycosylation. high abundance of sugars leads to high binding rates of the lectin for the respective sugar. avastin has a very low degree of galactosylation and high mannose species compared to mabthera and arzerra (fig. a) . only arzerra is carrying glycans with - linked sialic acids. these findings are in line with results from literature [ ] . figure b : distinction between fc and fab glycosylation in erbitux. without denaturation only the fab glycans are detectable in erbitux. denaturation leads to additional exposure of the fc glycans and thus higher lectin binding rates compared to native erbitux. gna and npl only bind to denatured erbitux indicating that the high mannose glycans are only present on the fc part. the equal sna binding rates for both conditions confirm that the - linked sialic acids are almost exclusively found on the fab part. this is in agreement with published data [ ] . figure c : lectin binding rates correlate with the levels of galactosylation and fucosylation. increasing degrees of glycosylation in the mixtures of the glycan variants from merck lead to higher lectin binding rates for all galactose and fucose markers proving that quantitative analysis can be performed with these assays. the cona lectin which binds to the common core mannose glycan motive remains at the same level, suggesting that the fc glycans were similarly exposed in all samples. the results demonstrate that paia assays are capable of quickly detecting differences in glycan patterns of different antibodies. in addition it was shown that glycan variants of the same igg can be analyzed quantitatively. and finally we could confirm the differences in fab and fc glycosylation in erbitux. we believe that bead-based assays with lectins have a great potential for monitoring product quality early in the development process. background gene-and cell therapy-based medicines are experiencing resurgence due to the introduction of "next generation" transfer viral vectors, which have demonstrated improved safety and efficacy. adeno associated viruses (aav) and lentiviruses are very commonly used in therapeutics and often produced by transient gene expression, using pei-mediated transient transfection in hek- or hek- t cells [ ] . the critical raw materials needed for cgmp vector production must be sourced from approved suppliers and should have gone through a rigorous testing program to reduce the risk of introducing adventitious agents into the production process. correspondingly, the pei transfection reagent must also be sourced from a qualified supplier, and have gone through rigorous testing to ensure reliable transfection efficiencies, and hence reproducible virus production yields. here, we present peipro® and peipro®-hq, the unique pei-based transfection reagents suitable for use in process development and in cgmp biomanufacturing, respectively. unlike commercially available peis, peipro® benefits from extensive research and development in polymer chemistry and formulation for mammalian cell transfection. we further demonstrate that peipro® and peipro®-hq are the reagents of choice for virus production runs in most cell culture systems, hence facilitating the transition from initial optimization during process development up to large-scale therapeutic viral vector production in adherent or suspension cells. manufacturing process of peipro® and peipro®-hq reagents. peipro® and peipro®-hq are fully synthetic reagents, free of any animal-origin components. in comparison to peipro®, a more extensive number of quality controls are performed on peipro®-hq to enable its use as a qualified raw material in gmp processes for the manufacturing of clinical batches of therapeutic products. lentivirus and aav production. irrespective of the cell culture vessel type, transfection using peipro® was performed following our recommandations. as an example, hek- t (lentivirus) and hek- (aav) cells were thawed directly into each medium and passaged every to days before going into a liter benchtop bioreactor. cells were resuspended and cultured for days before transfection with peipro®. hek- t cells were transfected with a third-generation system (four plasmids) for lentivirus production. hek- cells were co-transfected with three plasmids for aav production. lentiviral and aav titers were measured and hours post-transfection (data kindly provided by généthon). peipro® is the reagent of choice for virus production runs in most adherent and suspension cell culture systems from process development up to large scale clinical-grade virus production. irrespective of the cell culture-based system and production scale, peipro® and peipro®-hq have led to efficient viral vector yields in standard laboratory cell systems, such as in flasks, cell factories, and roller bottles, as well as in multilayers flasks or fixed-bed culture systems that take into account time and space concerns for the scaling-up process (table ) . for example, high viral vector yields superior to ig/ml and - vg/ml were obtained respectively for lentiviruses and aavs in suspension hek- t and hek- cells cultured in one of the commercially available synthetic cell culture medium balancd® hek (irvine scientific®). conclusion peipro® and its higher quality grade peipro®-hq are the unique pei suitable for efficient and reproducible production of therapeutic viral vectors. efficient viral vector production yields can be achieved in most cell culture systems, irrespective of the production scale. with appropriate and advanced quality controls, the highest quality grade peipro®-hq is commercially available to accompany academics and biopharmaceutical companies in terms of qualified raw material for their gmp-grade viral vector production needs. b distinction between fc and fab glycosylation in erbitux. erbitux was diluted to a concentration of μg/ml in tris buffer and measured in native and denatured conditions to distinguish fab glycosylation (native erbitux) from fab and fc glycosylation (denatured erbitux). it could be confirmed that sialic acids are almost exclusively present on the fab part of erbitux and that the high mannose glycans are only found in the fc part. c lectin binding rates correlate with the levels of galactosylation and fucosylation. two glycan variants samples of the same igg from merck were mixed in different ratios to yield glycosylation rates of to % in terminal β-galactose and to % in core-fucose, based on data from -ab uplc analysis. the mixtures all contained . μg igg per well. the measured lectin binding rates for all galactose and fucose markers correlate very well with the respective degree of glycosylation in the mixtures. all measurements were performed in triplicates (irvine scientific®) . hek- t (lentivirus) and hek- (aav) cells were thawed directly into each medium and passaged every to days before going into a liter benchtop bioreactor. cells were seeded and cultured for days before being transfected by peipro® (polyplus). for transfection, four plasmids were used for lentivirus and three plasmids were used for aav. lentiviral and aav titer were measured and hours post transfection (data kindly provided by généthon) table (abstract p- ). peipro®, the reagent of choice for virus production runs in most cell culture systems in both adherent and suspension cells. irrespective of the cell culture-based system and production scale, peipro® and peipro®-hq have led to efficient viral vector yields superior to ig/ml and vg/ml, respectively for lentiviruses and aavs background continuous perfusion process is making a comeback as a competing upstream manufacturing technology for the production of biopharmaceuticals compared to the standard fed batch processes. this is primarily because of cost advantages such as reduced capital cost and increased product yield. the change in status of perfusion process from older perfusion to the new-age perfusion is due to availability of better cell retention devices leading to more efficient processes, improved cell lines, cell culture medium capable of supporting high cell densities and better bioreactor control strategies. in this work, we present the advantages, limitations and challenges of fed batch and perfusion type of processes through case studies. table results the perfusion run yielded -fold higher titer compared to fed batch run (fig. a) . considering the number of runs that could be executed in a manufacturing facility within the same calendar days, about fold increase in product output can be achieved with the perfusion process (fig. a) . this difference is attributed to higher ivcc, higher pcd and longevity of cells because of decreased level of toxic metabolite concentrations such as lactate and ammonia. case : understanding product retention in perfusion process the new-age perfusion processes utilize hollow fiber filters. this has been observed to cause retention of product within the bioreactor especially towards the end of the production run. two types of experiments were conducted to study the factors contributing to product retention: -spiking studies: -role of product titer: product was spiked into chemically defined media -role of different cell viability: different broths with varying viability spiked with same product titer -evaluation of different hollow fiber membrane (m , m and m ) on product retention. from spiking studies, it was evident that cell debris and poor quality cell broth (lower viability) were the major factors contributing to product retention (fig. c) . from the different membranes experiments, it was identified that at pilot scale, m showed much higher retention from the first perfusion cycle itself and it increased to more than % towards the end of the batch. however, with m membrane, product retention started only late (after % of batch duration) and it remained low (~ - %). on the contrary, this difference was not observed at l scale due to the usage of membranes with larger filter area ( - folds higher compared to pilot scale). when the filter area per unit volume of perfusate was decreased by half (m _batch ) for the pilot scale, even m showed retention profile similar to m (fig. d ). we presented data to show that perfusion process has -fold increase in product yield on a per-batch basis and a -fold increase when facility throughput is considered. product retention is a technical challenge that requires optimization (perfusion rates and filter membrane types). we believe it is imperative that labs that develop processes for biologics can now consider both perfusion and fedbatch based processes as both these technologies can now closely compete with each other. the choice of the process format going forward should now solely be dependent on the requirement for the biologic rather than the earlier perception that fed-batch is the preferred choice because of its simplicity. background chinese hamster ovary (cho) cell culture has been widely used for production of monoclonal antibodies in the pharmaceutical industry. previous studies have shown that the cell specific productivity in cho cells can be increased by glucose limitation [ ] . introducing a productivity enhancing effect it is possible that this also affects the quality of the product such as glycosylation or other posttranslational modifications. in this work, we are focusing on the impact of glucose limitation and increased productivity on the product quality of a monoclonal antibody produced in a fed-batch cultivation of cho cells. materials and methods cho cells were cultivated both under limiting and non-limiting nutrient conditions in fed-batch. for fed-batch cultivation the reduced range for glucose concentration was chosen between . and . g/ l. reference cultivation was performed between . and . g/l. both cultures were fed with similar volumes of a complex nutrient supplement. all cultivations were performed in chemically-defined, animalcomponent free cho growth media (xell ag). viable cell density and viability were determined using the automated cell counting system cedex (roche diagnostics), glucose and lactate concentrations were detected via ysi (ysi life sciences). amino acid were quantified using hplc-fld, vitamins were quantified using reversed phase chromatography coupled to a triple quadrupol mass spectrometer (varian , selected reaction monitoring). amounts of igg were quantified via protein a hplc, mab purified from another cho cell clone was used as a standard. the analysis of product quality was performed by intact mass analysis using reversed phase chromatography coupled to a microotof-q ii mass spectrometer (bruker daltonik). the cho cell culture cultivated under low nutrient conditions reached a % higher viable cell density than the reference culture (fig. a) . the product titer was even increased by % (fig. b) . the spent media analysis shows that some amino acids and vitamins were present at presumably limiting concentrations after day / , mostly in the low nutrient level culture (down to to μm for tyr, gln, arg, and asn, below μm for pyridoxine, data not shown). the product quality showed significant changes for the changed feeding strategy ( fig. c and d) . as expected, the glycation level decreased from % to % compared to the reference culture. the truncation level of c-terminal lysine at the heavy chain of the mab increased from % to %. the glycosylation was also significantly influenced by the low nutrient level (fig. e) : the nonfucosylated variants increased from % to % (fig. f) , the degree of galactosylation increased from % to % (fig. g) . cultivation under low nutrient level led to % higher viable cell density and a product titer increased by % when compared to reference culture grown under non-limiting nutrient conditions. the analysis of product quality reveals % less glycation of light chain for cho cells grown under low nutrient conditions ( . % vs . % in reference culture). the truncation of c-terminal lysine decreased by % (from % to %), the degree of galactosylation increased by % (from % to %, also observed by takuma et al. [ ] ) and non-fucosylated glycans increased by % (from . % to . %) under low nutrient conditions. the product quality analysis by intact mass proved to be highly robust (average cv for four replicates = %). in summary, cultivation with alternative feed led to higher igg product titer and better product quality (glycation unwanted, higher amount of non-fucosylated glycans leads to higher antibody-dependent cellmediated cytotoxicity (adcc), higher amount of galactosylation to higher complement-dependent cytotoxicity (cdc) and adcc [ , ] ). background we developed an automated, multiwell plate (mwp) based screening system for suspension cell cultures (fig. a) which is now routinely used in cell culture process development. it is characterized by a fully automated workflow with integrated analytical instrumentation. it uses shaken - well plates as bioreactors which can be run in batch and fed-batch mode with a capacity of up to reactors in parallel [ ] [ ] [ ] . a wide ranging analytical portfolio to monitor cell culture processes and also a cooperation with internal high throughput (ht) analytic groups to characterize product quality are available. in addition the use and the benefits of spectroscopic methods for cell culture automation were shown in the past [ , ] . automated cell culture systems enable broader screening within a shorter time frame for many applications in upstream process development. the higher degree of parallelization and automation helps to screen for most promising parameters in a shorter time. the use of broad doe screening design allows in addition the identification of parameters that support high titers while keeping high product quality (multiple factors at the same time). the illustration (fig. b) shows an example how this combination can speed up process development steps. main applications of the cell culture automation are for example the identification of product quality levers and media or feed optimization. the application of the cell culture automation is shown for two examples. the goal in the first application was to identify levers to reduce trisulfides. by a screening of conditions in parallel (in -fold replication, wells in sum) the reduction of trisulfides by . % (normalized to start level) was possible. in addition the levers for trisulfide reduction were identified. the best and start conditions were verified in bioreactor scale (fig. c) . the goal in the second application was to increase product concentration without an impact on product quality. by a screening of conditions in parallel (in -fold replication, wells in sum) the increase of titer from . g/l to . g/l (> factor ) was possible by media platform change and media optimization. an impact on product quality could not been shown. the best conditions were also verified in bioreactor scale (fig. d) . the benefits of using cell culture automation in late stage process development were shown based on two examples of current applications. for this purpose the experimental results of the development work of two late state projects using the in-house developed automated cell culture system were shown. the first example shows the capability of the automated cell culture system by reducing trisulfides significantly in just one experiment. for the other project the final product concentration could be increased by factor . by a media screening and changing to the in-house media platform. these two examples show the potential of cell culture automation as a routine tool in process development. the cell line development process has become faster and is simultaneously generating more clone-and product-related analytical data. in order to select the best producer cell line, extremely heterogeneous data types need to be systematically compared. the timely availability of all data needed to decide which cell line to pursue has become a bottleneck in the cell line development workflow. to ensure sound decision making, new integrated workflow support and data analysis methods are needed. we have developed a new end-to-end platform for bioprocess development, which includes a cell line development workflow system supporting seeding, selection, passaging, analyzing, cryo-conservation, and processing in (micro-) bioreactors. this platform, genedata bioprocess™, enables partially or fully automated cell line selection and assessment processes, and it increases process efficiency and quality. the system tracks the full history of all clones -from initial transfection all the way to their evaluation in bioreactor runs -and combines this information with analytics data on molecules, clones, and product quality. it can directly integrate with all instruments, such as pipetting robots, bioreactors, and bioanalyzers. the system is designed for a wide range of . a schematic illustration of the automated cell culture system. only the core system is shown with a robotic plate handler as key device connecting cultivation, processing and analytical parts. b illustration of an example how cell culture automation can speed up process development steps. c application in the identification of product quality levers. d application in titer optimization biologic molecules, including antibodies (iggs, novel formats) and other therapeutic proteins (e.g., fusion proteins). highlighted use cases describe the identification of top producer cell lines, decision making support, bioreactor data management, and full clone history report documentation (fig. ). genedata bioprocess, which was developed in collaboration with top pharmaceutical companies, can flexibly support various (non-linear) workflows and structure the collected information in a way that fosters collaboration across an organization. while increasing throughput is crucial to ensure the timely availability of optimal producer cell lines, high-throughput is only possible when automated processes in the laboratory and the resulting data collection and aggregation can be streamlined. genedata bioprocess helps to establish more productive processes by offering support and integration for automation stations and measurement devices. thanks to the comprehensive workflow support and the possibility to integrate results from cell line stability experiments, product quality assessment, and bioreactor suitability tests, genedata bioprocess provides a unique way to evaluate cell lines. comprehensive analysis of all data collected in the process helps to ensure the highest possible quality and minimize the time and resources needed for data analysis and management. integration of bioreactor data analysis and visualization with other parameters measured in cell line development, streamlines clone evaluation in micro-bioreactors and supports highthroughput operations. genedata bioprocess comprehensively tracks the full clone history from the origin of the host cell line to the generation of the validated monoclonal producer cell line. for promising clones, the clone history report can be generated with one click. besides supporting cell line development, genedata bioprocess is a comprehensive platform capable of tracking the complete bioprocess development process. in , . million people suffered from cancer [ ] making it to a major concern of our society. since common cancer treatment is limited and not effective for late stage carcinoma, alternative methods are needed to reduce the high mortality rate of cancer patients. one alternative approach is the application of the oncolytic measles virus (omv), because omv has a natural affinity against cancer cells. the major drawbacks of omv is to produce the extremely high amount of at least tcid ( % tissue culture infective dose) per dose [ ] which is needed. to solve this problem, a high titer process must be established including an efficient downstream processing (dsp). we developed an appropriate upstream processing and are able to produce - tcid ml - in a bioreactor with . l working volume [ ] . now, we focus on the dsp part. the following study tested the application of charged depth filters for the omv clarification. in contrast to common dsp schemes, a depletion of virus particles or a loss of infectivity is not desired. the aim is a reduction of protein content and dna with minimal loss of infective omv. further, we investigated the influence of the cell culture medium on the depth filtration process. to explore the influence of the surrounding cell culture medium on the depth filtration performance, omv was either produced in serum-free medium (vp-sfm) or serum-containing medium (dmem + % fcs). the production was done in a str with a working volume of . l as described in [ ] . cells and carriers were separated with an opticap xl -module (polygard-cr; μm; merck). for the depth filtration millistak+ ce filters (merck) were used. the filter material was autoclaved and rinsed with ml of mm tris-hcl (ph= . ). the virus suspension was filtered with a load of l m - using a peristaltic pump (ism c; ismatec) applying a flux of l m - h - (fig. ). samples were collected at the beginning and end of a filtration run. the omv titer (tcid ml - ) of the samples was determined according to kärber and reed [ , ] . protein content was measured with the pierce bca protein assay kit (thermofisher scientific) according to the manufacturer's instructions. dna was measured by a microtiter assay using quant-it picogreen dsdna reagent (thermofisher scientific) according to the manufacturer's instructions. we found that positively charged depth filters were suitable to clarify omv suspensions. the cell culture medium, in which the omv was produced, influenced the outcome of the depth filtration. a log reduction value (lrv) of . was determined for omv present in serum-containing medium (scm), whereas the titer of omv in serumfree medium (sfm) was reduced . log levels. this indicates that without serum in the surrounding liquid, omv will adsorb to the filter material. however, we must evaluate if the missing serum or other components present in sfm are responsible for this effect. total protein was not relevantly reduced by the clarification using charged depth filters. for omv present in scm, the residual protein content was slightly less compared to omv present in sfm (table ). in contrast, host cell dna (hcdna) was bound to the filter material. we achieved a % reduction of hcdna for an omv suspension in sfm. after clarifying an omv suspension in sfm, the remaining hcdna content was even lower being only %. conclusions charged depth filters are suitable for the first clarification step of omv downstream processing. residual protein could pass the depth scheme of the complete cell line development workflow support in genedata bioprocess. showcasing integration of data from diverse measurement instruments, data visualization for decision making support as well as, tracking of full clone history filter almost unhindered, whereas the hcdna content was already reduced to % at maximum. however, the omv titer was also reduced by the depth filtration. this undesired effect was stronger for the omv present in sfm. because the agencies require avoiding serum in clinical-grade production processes, this is disadvantageous. nonetheless, because sfm will be soon standard for omv production, further experiments have to be done preventing the omv reduction during clarification. one option can be to reduce the adsorption strength of the virus to the filter material by the addition of salt. moreover, it is important to establish a standardized protocol for the upstream processing. we determined batch-to-batch variations within the clarification indicating a strong impact of upstream processing (usp) on the outcome of the dsp. therefore, further studies must investigate the influence of usp parameter e.g. time of harvest and ph of the harvest solution on the omv. fed-batch culture is commonly employed to maximize cell and product concentrations in upstream mammalian cell culture processes. typical standard platform processes rely on fixed-volume bolus feeding of concentrated feed supplements at regular intervals. however, such static approaches might result in over-or underfeeding. to mimic more closely the dynamics of a fed-batch culture, we developed a dynamic feeding strategy responsive to the actual nutrient needs of a mab-producing recombinant cho cell line. results and discussion improvements made at different steps during fed-batch development are shown in fig. . at step , all eight cell boost supplements were added to cdm ns according to a doe approach, and batch cultures were performed. this evaluation allowed us to select only those cell boost supplements that were beneficial to the overall culture performance. non-performing cell boost supplements were removed and not considered further. at step , the selected cell boost supplements were added daily to the cultures at different ratios according to a doe approach, and fedbatch cultures were conducted. as expected, daily feed additions to replenish consumed nutrients substantially improved mab and peak cell concentrations as well as viable cumulative cell days (vccd) compared to batch cultivation. further, the results enabled us to fine-tune the feed ratio of selected cell boost supplements. at step , we further optimized the best performing feed ratio by investigation of static and dynamic feed protocols. most fed-batch protocols rely on constant feed additions on distinct days. however, these approaches often lead to substantial over-or underfeeding during bioprocessing. to improve such "static" protocols, we investigated three different "dynamic" approaches as shown in table by applying the selected cell boost supplements with the optimized feed ratio. this investigation allowed us to further improve the bioprocess performance. the best performing approaches, constant and retrospective feed, were further investigated in fully automated bioreactors under controlled conditions. in general, constant cultivation parameters in the bioreactor slightly enhanced mab titers compared to shake flask cultivation. the retrospective feed strategy yielded % higher titers than the constant strategy. overall, the established methodology for fed-batch development allowed us to obtain . × higher mab titers (batch mean: . g/l vs. fed-batch . g/l) in a short time and three simple steps. in addition, the product quality was investigated. compared to the legacy fedbatch process, fed-batches that were conducted with the newly selected basal and feed media altered the distribution of charge and glycan variants. the amount of aggregated product was not altered. the established methodology for fed-batch development is a rapid protocol to select well-performing feed supplements and optimize their ratio to the culture requirements. in three steps, mab titers were boosted . x from . g/l to . g/l. product glycosylation and charge variants could be influenced by the newly selected basal and feed media compared to a legacy fed-batch process. the amount of aggregated product was not altered. the present study investigates the beneficial effect of spiking hyclone™ actipro™ basal medium with hyclone cell boost™ a and cell boost b feed supplements on growth and productivity of a recombinant cho cell line. to evaluate the impact of feed-spiking compared with cultivation in basal medium only, the cell line was grown in bioreactors under controlled conditions to determine cellspecific metabolic rates, nutrient consumption, and byproduct accumulation over the process time. transcriptome analysis of the cultivated cells, using microarrays on four consecutive days to investigate differential gene expression, revealed the beneficial effect of feed-spiking compared with cells grown in basal medium. model cell line was a mab-expressing cho dg (licensed from cellca gmbh) cultivated either in actipro basal medium only (ge healthcare) or in actipro basal medium feed-spiked with additional supplementation with % cell boost a and . % cell boost b (ge healthcare). both cultures were grown in batch mode using dasgip™ cellferm-pro™ stirred-tank bioreactors (eppendorf). the beneficial effect of feed-spiking was analysed by transcriptome analysis using microarray technology ( × k design, agilent). both basal and feed-spiked processes lasted for seven days with viabilities above % until day . on day seven, a sharp decline in viability indicated the end of the batch process (fig. a) . in feed-spiked medium, cells initially grew slower but reached almost twice as high peak cell concentrations ( . × c/ml) than in basal medium only ( . × c/ml). remarkably, the integral of the viable cell concentration over the total process time (viable cumulative cell days [vccd] ) was similar between both process strategies (fig. c) . while mab production plateaued after day in basal medium only (final titer . g/l), a continuous increase to three-fold higher final titers ( . g/l) was observed in feed-spiked medium (fig. b) . the higher titers could be attributed to generally higher cell-specific productivities (qp), which remained rather constant (~ pg/cell/day) in feedspiked cultures. in basal medium, the qp continuously dropped by % (day to ), % (day ), and > % (day to ) from to pg/cell/day in basal medium cultures. in average, the qp was % higher in feed-spiked cultures (fig. d ). transcriptome analysis of differentially expressed genes between cells grown in basal medium or feed-spiked medium were used to identify relevant go terms that indicated a more active proliferative state for feed-spiked cultures (data not shown). the top go terms significantly related to cell cycle and primary metabolism, cellular division, as well as nucleobase formation or regulation. furthermore, gsea revealed several significantly enriched set of genes related to gene transcription, dna replication and repair, cell growth and proliferation, as well as inhibition of apoptosis in feed-spiked cultures. thus, feed-spiking increased the proliferative activity of cultivated cells. several of the identified genes appear as promising targets for cell line engineering, but have not yet been described in relation to high-producing recombinant cell lines and will need to be evaluated in future studies. feed-spiking of basal medium is a convenient and easy way to considerably increase product concentrations in a simple batch culture. differential gene expression revealed genes that appear important for high cell-specific production rates, and this knowledge can be leveraged into cell line engineering approaches or the design of high producing cho cell media. in the latter case, a maximized supply of high biosimilarity must be demonstrated by physicochemical and functional characterization for approval requirements of phase i and phase iii studies in terms of efficacy, safety and immunogenicity. in this study, rounds of upstream and downstream processes were run to reach the cqa limits of the originator molecule. after conducting many different development strategies, the mirror plot images of the intact deconvoluted mass were found to be identical corresponding to similar levels of glycoforms. the uv chromatogram of reversed phase ultraperformance liquid chromatography (rp-uplc) of tryptic peptide mapping demonstrated that the primary structure of tur is identical to the originator as shown in fig. a . post-translational modifications (ptms) such as oxidation, deamidation, n-terminal pyroglutamic acid, c-terminal lysine truncation levels were also comparable for two products. the glycosylation site (hc-asn ) was confirmed by peptide mapping analysis and % glycan site occupancy was proven for tur and originator. the glycosylation pattern for two products were highly similar in terms of major glycans (g f, g f, g f-gn and etc.). man level was lower in tur compared to the originator product which may not have any clinical effect on the molecule. the secondary structure was determined by atr-ftir spectroscopy. absorption bands (amide i and amide ii) were overlapped completely and amounts of α-helix and β-sheet structures were comparable. furthermore; size-exclusion chromatography (sec) analysis revealed that both products have the same level of purity (> %) and aggregate (< %) levels. the level of impurities were determined as below % by ce-sds. the capillary isoelectric focusing (cief) experiments showed that the charge variant profiles of two products are indistinguishable and the isoelectric point of main peak is observed at . for both products. the association/dissociation rate constants and binding affinity for both tur and originator were highly similar and similarity score was calculated greater than %, as shown in fig. b . in this study, state-of-art analytical techniques were used to assess the biosimilarity of tur to the originator adalimumab. head-tohead comparison data clearly demonstrated that tur is highly similar to the originator adalimumab in terms of physicochemical and functional characteristics. based on the analytical similarities, we . process performance of basal medium (black) and feed-spiked (red) bioreactor batch cultures: a cell concentrations and viability, b viable cumulative cell days and specific growth rate, and c antibody concentrations and cell-specific productivity. error bars indicate standard deviation from three independent experiments. the black arrows on day indicate the beginning of decreasing cell-specific productivities and lower cell-specific growth rates in basal medium cultures believe that tur will have comparable pk/pd, potency, and efficacy results to the originator adalimumab. the expanded interest in intensified continuous bioprocessing has highlighted the need to develop a small scale model for perfusion cell culture. the direction in the industry has been to increase target cell densities to ≥ x vc/ml and decrease perfusion rates to ≤ vvd. in order to increase the throughput of our perfusion media development capabilities we sought to develop a small scale model of perfusion using the ambr® instrument (sartorius, germany). we used a modified cell settling model from the previously published by kreye et. al. to achieve the cell retention necessary to reach perfusion relevant viable cell concentrations [ ] . in this work, we will show the application of this small scale model for: ( ) identification of specific productivity performance over a steady-state for tested media, ( ) identification of cspr min for a specific cell line and medium combination, and ( ) confirmation of consistent product quality profiles between the small scale model and benchtop perfusion (data not shown). a chozn® cell line producing an igg was evaluated in several proprietary chemically defined media prototypes generated during the development of the catalog excell® advanced hd perfusion medium: "fed batch medium", "early prototype", "mid prototype", "intermediate prototype" and "late prototype" [ ] . small scale simulation of perfusion experiments were run in ambr® . media exchange was performed times per day in equal amounts. agitation, gassing, and liquid handling were stopped for an optimized period of time to allow cells to settle to the bottom. spent media was removed in an amount proportional to / rd daily exchange volume. agitation, gassing, and liquid handling were resumed and fresh media was added back to the vessels. for benchtop perfusion, cells were inoculated in l applikon bioreactors (applikon, netherlands). at a concentration of~ . x vc/ml, perfusion was initiated using the atf (repligen, massachusetts). perfusion rate was limited at . vvd during steady-state. using the cell settling method described above we have been able to achieve ≥ % cell retention efficiency. all media tested in this work were able to reach and maintain the x vc/ml target cell density at vvd (fig. ) . performance of each media formulation was ranked based on specific productivity (table ) . using "intermediate prototype", minimum steady-state cspr was determined to be . pl/c/d for this cell line. n-glycan analysis of ambr® and bioreactor samples via intact mass spectrometry displayed only slight differences in product quality profile (data not shown). our work has shown a clear distinction between various prototype perfusion media and demonstrated a % increase in specific productivity over "fed batch medium" used in perfusion. additionally, we have shown the application to further characterize the process using this model to determine cspr min for a given medium and cell line. the transient process has been successfully operated at l in a sartorius biostat single use bioreactor (sub), yielding . kg of crude product from a two-week expression culture (table ) . successful scale up of the process to l creates the potential to supply transiently expressed products to support toxicology studies or even early gmp clinical supply, enabling accelerated biopharmaceutical development project timelines. the scale up from rocking bioreactors (rbr) to sub scale identified some scalability issues. lower specific productivity due to increased cell growth and decreased titres were observed in the sub ( fig. iii & iv). to improve the predictability of scale up, a new process was developed and evaluated in the sub vessels utilising a modified transfection method, which resulted in comparable expression levels and specific productivity between rbr and sub scales. two sets of expression vectors comprising heavy chain and light chain plasmids expressing a human igg kappa mab, as previously described [ , ] were used in the process optimisation study. the cell line used for transient expression and the pei mediated transfection method has been described previously [ ] . transfected cultures were run under fed batch conditions for days in l ge healthcare wave bioreactors (rbr), hyclone sub using l and l hyclone bioreactor bags (thermo scientific). the transfection process was modified to address the reduced titres and higher viable cell density (vcd) seen in the sub cultures. shake flask cultures were used to assess the standard (a) and modified transfection processes (b and c) (fig. , i & ii). process c was identified as the process to be studied at sub scale, offering the potential to mitigate the high viable cell densities (vcd) observed. scaling up process c to l and l sub resulted in cultures producing titres exceeding g/l with desired cell growth profiles. scale up of process a into sub vessels resulted in decreased productivity compared to the rbr scale. after optimisation, the sub process c yielded increased specific productivities and expression titres comparable to those seen at rbr scale (table ) . medimmune has successfully completed the first known successful cho transient culture at l scale producing > mg/l of mab at harvest. process optimisation has subsequently demonstrated reproducible titres at l to l scale exceeding g/l with comparable glycosylation profiles between sub and rbr cultures across scales. comprehensive analysis of the impact of trace elements in media on clone dependent process performance and product quality background state-of-the-art biopharmaceutical processes are accounting concomitantly for process performance and product quality. even though high yielding, robust processes are the cornerstones of any process development, product quality parameters such as structural integrity, charge variances and post-translational modifications are progressively becoming the focus of the developmental work. in conjunction with host cell line selection and process performance parameters, media components are crucial for the continued progress in rational modulation of product quality attributes affecting biological activity, immunogenicity, half-life or stability. among media components, trace elements (te) are of particular interest as they play a pivotal role in various cell metabolism pathways. based on a comprehensive doe approach, extensive process performance-and product quality evaluation combined with metabolic flux analysis, the impact of several trace elements on the biopharmaceutical process is assessed. in a comprehensive i-optimal doe approach ( fig. ) , the effect of six te in various concentration levels and combinations in serum-free media was studied for four different cho-k cells lines in an ambr® setup. a scrutiny of the process performance parameters such as cellular growth, productivity, amino acids and vitamins consumptions rates for each of the conditions was performed. the process performance evaluation was accompanied by extensive product quality analysis including size and charge variants, glycosylation patterns, oxidation and methylation. furthermore, a metabolic flux analysis was performed based on the nitrogen balance. based on extensive analytical data, the obtained response surface model provides a clear insight into the impact of particular te and their combinations on process performance and product quality. the high model quality enables discriminations between clone dependent and clone independent effects. with an elevation in titer up to % in the best condition of the cell lines clearly show, that even state-of-the-art media can be outperformed by trace element rebalancing. analyzing specific rates in combination with metabolic flux analysis improves the understanding of metabolic restructuring of the cell lines under distinct te levels and combinations. modulation of trace elements levels had a tremendous effect on the charge heterogeneity and glycosylation structure of the different proteins. this provides a toolbox for the fine tuning and control of product quality parameters. taken together, the data further paves the way to the rational fine tuning of process performance and product quality attributes. background due to regulatory concerns and economic impact, ensuring product quality and consistency is now one of the main challenge faced by the biopharmaceutical industry. for monoclonal antibodies (mab), glycosylation is one of the most important quality attributes as it impacts on mab structure integrity, and ultimately on both clinical efficacy and safety. many factors affect mab glycosylation and its inherent heterogeneity, including the host cell, the culture medium, the mode of operation and the operating conditions. in this context, the capacity to monitor and control on-line the antibody glycosylation, from early-to late-stage process development, would be of salient interest to reduce the time and cost to market. in order to address this unmet need, we have designed an improved spr biosensor assay to measure the kinetics of interaction between a mab and the extracellular domain of the fcγriiia receptor bound at the biosensor surface [ , ] . of salient interest, we also demonstrated that various binding kinetic signatures, especially different dissociation kinetics could be correlated with distinct mab glycosylation patterns and with therapeutic efficacies, as deduced from mass spectrometry and a surrogate adcc assay, respectively. in parallel, we have also harnessed a spr biosensor directly to a bioreactor, which permitted the at-line determination of the concentration of antibodies by hybridoma cells during a bioreactor culture. we now plan on combining both approaches to determine on-line the glycosylation profile of the produced mabs. our ultimate goal is to design a unique and highly innovative bioprocess control tool that can be readily applied in an industrial bio-manufacturing setting. reducing timelines and costs are key factors for bio-pharmaceutical industries to accelerate process development and drug delivery to patients. enhancing throughput of bioprocess development has become increasingly important for the screening and optimization of cell culture processes. this challenge requires high throughput tools. in a previous study [ ] , we showed that ambr® , a robotically driven mini-bioreactor system developed by tap-sartorius, could be advantageous to accelerate process development. the use of ambr® system allows us to test a large number of experimental conditions in a single experiment. therefore, the large amount of production samples to be characterized for product quality attributes (pqa) increases as well: the bottleneck has moved from the generation of samples at the production bioreactor step to in-process analysis. for product quality attribute analysis at lab scale, protein purification is generally carried out on > ml columns which is incompatible with the size of ambr® bioreactors. moreover, the applied methods are relatively low throughput. the development of a high binding capacity resin (up to mg/ml) [ ] , combined with high performing new cell lines which are able to produce up to g/l of recombinant monoclonal antibodies, allow require the development of an efficient and high throughput (hts) purification method robot. the use of robotic equipment for small scale purification purposes is a great opportunity for us to tackle this bottleneck, by enabling highthroughput sample purification at smaller scale ( μl). recombinant monoclonal antibodies were produced by a genetically engineered dihydrofolate reductase (dhfr)-/-dg chinese hamster ovary (cho) cell line. clarified cell culture fluid (cccf) was obtained from and k liter bioreactors after three filtration steps. minipurifications were performed on tecan freedom evo® robot with predictor robocol-umns® containing μl mabselect sure® resin. larger scale purification were executed using an aktaxpress using hitrap column prota. to assess monoclonal antibody purification at small scale, we first tested the repeatability of the minipurification, purifying the samples times on the same columns and using different columns, focusing on the yield of the purification and the impact on product quality attributes, especially the hmws. then, we compared those results to those obtained with the aktaxpress at larger scale purification, comparing the yield of the purification and the pqa of the protein-a eluates obtained with both purification systems. finally, we assessed the capability of the robot to perform hts of buffer and purification conditions, evaluating three different buffers at different concentrations and ph values, and also testing different loading column capacities. in this study we established that the tecan can be used as a robust platform for purification at small scale. we observed similar purification yields, intra and inter run. the analysis of the pqa a level showed there is also very high reproducibility. and the ph of the eluate showed as well strong comparability as well. table shows the coefficient of variation (cv) of the yield, hmws and eluate ph are low, demonstrating the good reproducibility of the purification. the strong reproducibility obtained between the different purifications showed that the tecan and the aktaxpress are similar in terms of purification performance and pqa (fig. a, b) . the tecan is a versatile automated liquid handler allowing the screening of huge purification conditions (fig. c) , the possibility to purify large quantities of samples, while the samples amount is limited. the tecan has the potential to purify more than samples/day, reducing timelines and allowing us to deliver faster to the patients. viable cell density monitoring in bioreactor with lensless imaging geoffrey monitoring cell density and viability of mammalian cell culture bioreactors is a necessary task that presents today a number of remaining challenges. the traditional measurement for bioreactor cell count and viability rely on using the trypan blue exclusion method once a day. while automatic cell counters have reduced the statistical manual error, sampling the bioreactor remains a contamination risk and is prohibiting process control as the sampled volume becomes siginficant. lensless imaging technology is a new method for accurately determining cell concentration and viability without staining. this technique directly acquires the light diffraction properties of each individual cells through their holograms images without any objective, lens or focus settings. living and dead cells have significant holographic patterns that can be distinguished and precisely counted. lensless imaging technique directly acquires the light diffraction properties of each individual cells through their holograms images without any objective, lens or focus settings. living and dead cells have significant holographic patterns that can be distinguished and precisely counted. we compare cell counts and viability between the reference method and our lensless imaging device, the cytonote counter. measures are performed once a day on samples from bioreactors, from the inoculation to the end of the culture. we also assessed the repeatability of our method. another lensless imaging prototype is setup as a measurement chamber directly connected to a perfusion bioreactor, for continuously receiving the bioreactor broth, and therefore reproducing an in situ measure. with a concentration range up to x cells/ml ( fig. ) and viability range at - %, we obtained a correlation factor of . between the two compared methods. the large field of view allows the analyze of several thousand cells within a single image, keeping the statistical variability of the measure as low as %. our measurement chamber prototype has demonstarted its capability for continuous viable cell density and viability monitoring. we are now working at designing a steam strerilizable probe, and we envision lensless imaging to become the future method of choice for on-line monitoring of suspension cells cultures. lensless imaging technology is capable of accurately and precisely monitoring viable cell density and viability with a combination of significant advantages starting from low sample volume use, label free detection, quick measure, simple device, to high number of cell analyzed which let us think that it is a good candidate for very small- comparison between both purification systems and the ability of the system to be used as a high throughput tools for buffers screening. a purification yield (%). b pqa .a (normalized). c impact of the ph and buffer concentration on the pqa .a scale bioreactor and high-throughput measures. its high repeatability is also a key parameters in the effort to narrow batch to batch deviations. in addition we demonstrate that this technique is potentially powerful for in-line and continuous monitoring of a lab bioreactor. we envision lensless imaging to become the future method of choice for on-line and in-situ monitoring of suspension cells and a perfect tool for process control in fed-batch or perfusion mode in single-use bioreactors or traditional steam sterilized vessels. it can certainly become the first vcd measurement technique to work from cell line engineering, to process development, pilot scale, and up to manufacturing scale. time-dependent product heterogeneity in mammalian cell fermentation processes background a consistent product quality is a major goal in the production of biotherapeutics, especially recombinant glycoproteins. whereas it is unlikely that the polypeptide chain changes during a production process, posttranslational modifications and protein folding are sensitive to fluctuations in parameters and conditions. here we focus on protein glycosylation as one important indication for product quality [ ] . during a batch process conditions change continuously. at the beginning, the supply situation for the cell is excellent, but the secreted material stays a long time in the culture fluid. later during cultivation substrate provision decreases, whereas the exposition time of the protein to the culture fluid is much shorter. altogether this leads to product heterogeneity of the secreted protein during a batch culture. four different cell lines, two of human origin and two cho clones, producing four different recombinant glycoproteins were investigated in this study. together with their respective parental cell line the clones were cultured in three replicates in shakers. supernatant from the cultures were harvested at four time points. the removed culture volume was replaced by culture supernatant of the identically cultured corresponding parental cell line. the product was isolated from the supernatant and the glycans were released. one part of the released glycans was labeled with -ab and separated by hilic-fld. the other part of the glycans was permethylated and analyzed by maldi-tof mass spectrometry (fig. a) . the investigated proteins were antibody, antithrombin iii from cho clones and α -antitrypsin, c -inhibitor from human clones. the antennarity of the glycans is quite stable in all production phases. the degree of core fucosylation is very high in all products. a low fucosylation degree of antibodies may be favorable for a higher adcc performance [ ] . some of the products showed an antennary fucosylation, which seemed to change not very much in different cultivation phases. nevertheless, this might be an issue due to an antigenic impact in the patient. the antennary galactosylation changes noticeable for the antibody and α -antitrypsin. in both cases the degree is highest in the first phase. an incomplete galactosylation leads to truncated glycans. this leads inevitable to undersialylated antennas to be seen for α -antitrypsin. the sialylation is the highest in the early phases and decreases during cultivation time. sialylation of a therapeutical protein is important for the half-life in the patient. therefore highly sialylated products are desired [ ] . in further studies the consistency of the galactosylation and the sialylation will investigated for fed batch and long term continuous cultures in comparison to batch cultures. due to the feed solution or the fresh media being present during such processes, the supply situation should be excellent for the whole cultivation time. the differences between the maldi-tof and hilic-fld data originate from complex and unresolved chromatograms (fig. b , chromatograms not shown). for that reason coupling of hilic-fld and ms is very much recommended. background novel biologics are often selected from a large library of lead candidates in the initial stage of preclinical and clinical developments. for this selection, there is a demand for high-throughput production of recombinant proteins of high quality and in sufficient quantity. transient gene expression offers a rapid approach to the production of numerous recombinant proteins for the initial-stage developments of biologics. mammalian cells are major host cells for transient gene expression, but they have the disadvantages of complicated operations and high cost of culture. insect cells are easy to handle and can be grown to a high cell density in suspension with a serum-free medium. insect cells can also produce large amount of recombinant proteins through post-translational processing and modifications of higher eukaryotes. hence, insect cells have been recognized as an excellent platform for the production of functional recombinant proteins [ , ] . in the present study, the production of an antibody fab fragment through transient gene expression in lepidopteran insect cells was examined. the dna fragments encoding the heavy chain (hc) and light chain (lc) genes of an fab fragment of mouse anti-bovine rnasea [ ] were respectively cloned into the plasmid vector pihaneo, which contained the bombyx mori actin promoter downstream of the b. mori nucleopolyhedrovirus (bmnpv) ie- transactivator and the bmnpv hr enhancer for high-level expression [ ] . trichoplusia ni bti-tn- b - (high five) cells were co-transfected with the resultant plasmid vectors using linear polyethyleneimine (pei; mw , ). before transfection, the plasmids and pei were prepared in mm nacl, ph . and incubated at room temperature for min. when the transfection efficiency was checked, a plasmid vector encoding the enhanced green fluorescent protein (egfp) gene was also co-transfected. transfected cells were incubated with a serum-free medium in a static or shake-flask culture. culture supernatants were analysed by western blotting and enzyme-linked immunosorbent assay (elisa). the numbers of green fluorescent cells and total cells in culture broth was determined using a flow cytometer. western blot analysis and elisa of culture supernatants showed that transfected high five cells secreted the fab fragment with antigenbinding activity. in static cultures, transfection and culture conditions, such as hc:lc gene ratio, a serum-free medium, dna:pei ratio, and dna amount per cell, were successfully optimized by flow cytometry of egfp expression in transfected cells and the yield of the secreted fab fragment measured by elisa. the effects of culture temperature and initial cell density were also examined by comparing the cell growth and the production of fab fragments in shake-flask cultures. under optimal conditions (medium, psfm-j (wako pure chemical industries, japan); hc:lc gene ratio, : ; dna, μg/( cells); pei, μg/( cells); initial cell density, x cells/cm ; temperature, °c), the yield of more than mg/l of fab fragment was achieved in days in a shake-flask culture ( fig. ) . transfection did not significantly affect the growth of high five cells as compared with untransfected cells. transient gene expression using insect cells may offer a promising approach to high-throughput production of candidate proteins for the development of biologics. the increasing demand for biopharmaceuticals produced in mammalian cells has led industries to increase volumetric productivity of bioprocesses through different strategies [ , , ] . in this context, fedbatch and perfusion cultures have attracted more interest than conventional batch processes. the efficient application of such alternative processes requires the availability of reliable on-line measuring tools for cell density and cell metabolic activity estimation [ ] . the comparison of different culture strategies for hek cell line producing ifn-γ are presented below: batch, fortified batch and fed-batch. in this context, a new robust feeding strategy based on the monitoring of alkali buffer addition was applied for the estimation of nutrient requirements. this method allows to increase cell density and product titer compared with the other strategies assessed. three different culture strategies were carried out in -litre biostat b-dcu ii bioreactor. first, a reference batch and a batch using fortified medium (nutrient enriched medium) were run and assessed in terms of viable cell density (vcd) and product titer, and set as initial references. then, a fed-batch was performed applying a feeding strategy based on the nutrient requirements estimation by monitoring the alkali buffer addition used for the control of ph. results vcd and product titer achieved for the different culture strategies assessed (batch, fortified-batch and fed-batch) are presented in table . in fortified batch an increase in vcd of % and also % in product titer were obtained compared with batch. in the fed-batch culture carried out (fig. ) , we observed that alkali buffer addition profile matched the vcd evolution trend. thus, the monitoring of alkali buffer addition was used for estimating the nutrients requirements (i.e. the volume of feeding medium) at any time during the fed batch phase. the feeding strategy based on alkali buffer addition enabled to maintain glucose concentration set point therein a narrow range during fed-batch phase (around mm). as a result, higher vcd ( . · cells/ml) was obtained when compared with both batch references: vcd was enhanced to % and % and an increase up to % and % in product titer in respect to batch and fortified batch respectively. the results prove that fed-batch strategy based on the alkali buffer addition is a robust on-line monitoring method that enables to optimize the feeding strategy in a fed-batch cultures. three different culture strategies have been tested in bioreactor with a hek cell line producing ifn-γ. results show as the higher vcd is reached, the higher product concentration is achieved. therefore, from bioprocess development point of view, it is very interesting to implement strategies with higher vcd outcome, such as fed-batch operation mode. in this context, a new robust method for vcd estimation in fed-batch was applied. the alkali buffer addition necessary for maintaining the ph set-point is an on-line reliable and easy measuring variable that provides information about by-products formation (mainly lactic acid). the monitoring of this variable can provide information about the cell concentration, activity and metabolism, to detect changes in culture. besides that, a relationship between alkali buffer addition and vcd can be established since the first is strongly correlated with cell growth and metabolites consumption/formation. the application of alkali buffer addition measure to implement an optimal feeding strategy in fed-batch permits to enhance vcd and product titer when comparing with batch strategies. a novel approach to high throughput screening for perfusion background perfusion systems for suspended mammalian cells raise growing interest in the biomanufacturing industry. continuous manufacturing is growing in the field and is encouraged by health authorities [ , , ] . this work addresses scale down limitations inherent to continuous media exchange and cell retention by using a semi-continuous system. data was generated with a set of different clones that were previously studied in fed-batch mode [ ] . materials and methods cho-k cell lines expressing the same monoclonal antibody (mab) and issued from the same transfection were used as models. . l bioreactors (sartorius) were used for fed-batch and perfusion production runs. the perfusion bioreactors were run using an alternating tangential flow filtration device (repligen, xcell™ atf system). the cell biomass was controlled by removing cells through a bleed line and was controlled using a biocapacitance probe (hamilton, incyte). the perfusion rate (d) was fixed to one vessel volume a day (vvd - ). the semi-continuous runs were made in ml shake tube (tpp, tubespin® bioreactor ). once a day, the tubes were centrifuged ( min, g), the supernatant removed (to mimic a perfusion rate of vvd - ), replaced with fresh media and cells were re-suspended. the clone's growth potential were preserved across the systems (fig. ). clone # always reached the highest viable cell density (vcd), followed by clone # . clone # and # showed similar growth characteristics. it is interesting to note that in the perfusion bioreactor different patterns in terms of vcd were observed although the cell biomass signals were similar for all runs. this reflects the fact that the capacitance measures the biomass and not the absolute cell count [ ] . to estimate the minimum cell specific perfusion rate (cspr min ) in the semi-continuous experiment, the perfusion rate was divided by the maximum viable cell density (vcd max ). this value was compared to the cspr obtained during the th set-point (sp ) of the perfusion runs. as expected, the bleed fraction decreased when the capacitance set-point was increased and went down to % or less of the total perfusion rate (data not shown). since the bleed removes the excess biomass, it is an indication of how close to a limitation the system is. therefore, the cspr calculated at sp was considered as the minimum cspr. the cspr min obtained in both systems were very close ( table ). the semi-continuous system can therefore be used to identify the cspr min before running a continuous bioreactor, it therefore facilitates the decision making early in the development (to define the target cell density for a defined perfusion rate). the specific productivity (q p ) of the clones was quantified at the maximum vcd (semi-continuous) or at sp (perfusion). absolute values are not representative since the cell environment is so different in both systems. nevertheless, a relative ranking proved to be indicative of the respective performances ( table ). the maximum cell growth in fed-batch, semi-continuous and perfusion were also compared, their ranking was always preserved. both indications can be used to assess a performance ranking for different clones. the performance of clones was studied in different cultivation systems: fed-batch/perfusion bioreactors and semi-continuous shake tube. the semi-continuous system was able to precisely predict the cspr min , an important process parameter for perfusion. specific productivity and maximum cell density ranking was preserved across the systems, therefore the scale down experiment can be used to assess a performance ranking for perfusion clone screening. modulating antibody galactosylation through cell culture medium for improved function and product quality jenny y. bang, james-kevin y. tan the production of therapeutic antibodies (abs) requires high titers and excellent product quality to ensure efficient manufacturing and potent drug efficacy. glycosylation, or the attachment of sugars to organic molecules, is a critical quality aspect that can significantly alter ab binding, function, and therapeutic effect [ ] . galactose is a key sugar of interest due to its significant impact on ab function and the ability to control galactosylation through cell culture medium. herein, irvine scientific assessed the ability of media components to modulate galactose levels on a model therapeutic ab. various media compositions were able to modulate galactosylation levels without compromising cell growth and ab titers. in addition, an in vitro assay was utilized to evaluate the functional ability of abs to bind and activate complement-dependent cytotoxicity (cdc). differences in galactosylation significantly altered the abs' ability to induce cell cytotoxicity. furthermore, design of experiment analysis determined the optimal ratio of supplements to maximize galactosylation. this "optimized supplement" was verified and evaluated against other suppliers' galactosylation supplements in terms of growth, titer, glycan analysis, and ab function. the optimized supplement outperformed all other suppliers' supplements and resulted in the best overall cell growth, titer, galactosylation, and ab function. ab against cd were grown in balancd® cho growth a and were fed with balancd® cho feed on days - of the cultures. viable cell density and cell viability were assessed by a beckman coulter vi-cell xr, ab titer was assessed by a pall fortébio qk e , and glycan analysis was assessed by a perkinelmer labchip gxii. for the functional cdc assay, abs were incubated with daudi b lymphoblast cells and normal human complement serum. cell cytotoxicity was assessed with a promega cytotox-glo kit. various supplements were evaluated in fed-batch cultures and resulted in - % ab galactosylation without compromising cell growth and ab titers. design of experiment analysis determined an optimal composition, deemed "optimized supplement," which was evaluated against a panel of galactose-modulating supplements from other suppliers. the optimized supplement resulted in a similar viable cell density (vcd) and cell viability compared to the fed-batch culture control which had no supplements (fig. a) . supplements from supplier resulted in similar to half the vcd of the control while supplements from supplier resulted in very low vcd and percent viability. all of the supplements except those from supplier resulted in ab titers similar to the control (fig. b) . due to the poor growth and subsequently low titer from supplier 's supplement, supplier was not further evaluated. the glycan profiles were analyzed and are presented in (fig. c ). all the evaluated supplements were able to raise galactosylation; however, only the optimized supplement and the x supplier supplement resulted in over % galactosylation. the function of the abs was further evaluated in a cdc assay (fig. d) . abs from the optimized supplement were more effective than the control abs and had a significantly lower half-maximal effective concentration (ec , . μg/ml) than the control ( . μg/ml). abs from the x supplier supplement had a similar ec to the control which may be due to the higher man % of the abs. an optimized supplement was produced through fed-batch evaluation and design of experiment analysis. the optimized supplement outperformed all other supplenments from other suppliers and resulted in the best overall cell growth, glycan profile, and functional ab activity (table ) . industry practice for mammalian cell culture media and feed development typically employs a high-throughput screening (hts) platform along with large sets of experiments [ ] . modern hts systems often include robotic liquid handlers to replace labor intensive steps. to align with advancements in the field, a semi-automated hts platform was developed to facilitate in-house media and feed development for early stage biologics projects. selecting appropriate instruments and integrating them into a seamless system are the keys to a hts platform. the developed hts platform uses deep well plates (dwps) for culture vessels, the liquid handler of the advance microscale bioreactor (ambr ) for media/ feed formulation preparation in an aseptic environment, nyone cell imager for viability and cell growth analysis, tecan freedom evo's liquid handler for activity assay sample preparation, and cedex bioht for high-throughput metabolite analysis. dwps offer comparable cell growth to shake flasks and compatible layout to ambr , which makes the dwp an ideal candidate for the platform. in addition, the user friendly design of experiments (does) interface and liquid handler function of the ambr expediates the formulation preparation of varying doe conditions [ ] . a macro program was written and developed in excel to enable the easy import of does design from major statistics software packages, such as jmp and simca, into ambr . performance qualification of each component were performed prior to implementing the hts platform. comparable cell growth profile and productivity were achived between shake flasks and dwps (fig. a ), indicating compatable cell culture environment for the cells. cell counts using nyone gave identical cell growth ranking as the traditional count from vi-cell xr (fig. b) . freedom evo's liquid handler was optimized to produce comparable activity results to manual operation while expediting the sample preparation with improved consistency (fig. c) . finally, implementing the liquid handler function of ambr to support media and feed formulation significantly reduced the labor for each experiment. summary of the capability comparison between the hts platform and the traditional method are listed in table . a case study of a complex feed screening with definitive screening design was completed using the semi-automatic hts platform. this experiment, containing more than feed formulations in duplicates, was handled by one operator and delivered a % improvement in productivity within a week period (data not shown). in addition, implementing the hts platform for this study also resulted into~ % reduction in labor while improving the traceability of formulation preparation. a semi-automated hts platform was developed to support media and feeds screening and development for early stage biologics projects. the platform utilizes dwps, nyone cell imager, ambr , freedom evo liquid handler system, and bioht metabolic analyzer to accelerate the screening process. this screening platform not only improves process throughput, operational precision, and traceability of formulation preparation, but also reduces the labor for the media and feed formulation preparation. background a perfusion medium requires high concentrations of specific nutrients while balancing other components to support intensified perfusion processes. using a combination of design of experiment (doe), multivariate analysis (mva), and spent media analysis, we developed a catalog "de novo" perfusion medium by working with multiple cho cell lines and proteins. the optimization of the medium in bioreactors using alternating tangential flow (atf) cell retention devices reduced the minimum cspr from over pl/cell/d to under pl/cell/d for most cell lines while increasing specific productivity during day steady states with stable growth rates, viability, volumetric productivity and product quality. high throughput screening (hts) was performed with seven cell lines, while four were used in bioreactors: cho-s, dg , and two chozn® gs lines, each producing different monoclonal antibodies and include a fusion protein. for hts experiments, cells were inoculated at . x vc/ml with a ml working volume in ml tpp® tubes and cultured for days in a multitron shaken at rpm, °c, % rh, and % co . for benchtop perfusion, cells were inoculated at . - . x vc/ml in l applikon bioreactors (applikon, netherlands) with a l working volume. bioreactors were operated at rpm, °c, % do, and a ph of . or . ± . depending on the cell line. oxygen was supplied through an l-sparger or microsparger as needed, and excell® antifoam (milliporesigma, germany) was added at a maximum rate of . % v/v to control foam. at a cell concentration of~ . x vc/ml, perfusion was initiated using the atf (repligen, massachusetts), with a bleed set to maintain cell concentrations at or * vc/ml. two "de novo" prototype media were developed using doe and mva in hts with tpps and an ambr® [ ] and one was chosen for further development after comparing to a basal medium enriched with feed in bioreactors. eleven components were identified as significant effectors of critical parameters for perfusion processes across evaluated cell lines. doe central composite experiments were run and component concentrations were optimized in the selected prototype. in parallel, amino acid specific consumption rates were calculated from bioreactor spent media samples and used to adjust the concentration of amino acids to target a reduced cspr. increasing specific amino acids concentrations resulted in a significant reduction of the minimum cspr across all tested cell lines -for example the cspr of cho-s was reduced from to pl/cell/day (table ). however, even at the lower cspr, spent media analysis revealed excess concentration of some amino acids, so specific accumulating amino acids were reduced and components were streamlined for the final medium: excell® advanced hd perfusion medium. using this medium, a cho-s and a chozn® gs cell line producing a fusion protein were cultured at a cspr of less than pl/cell/day with a vcd of * vc/ml. metabolic profile, productivity, and product quality were constant over the day steady state. the chozn® gs cell line was also tested at * vc/ml with a cspr of pl/cell/day (fig. ). we have developed a catalog perfusion medium from first principles, ensuring broadness of application by using seven cell lines in scaleddown systems and four in perfusion bioreactors. the final catalog medium showed significant improvements in productivity across all cell lines, with reduced csprs when compared to enriched fed-batch medium or initial prototypes (table ) . there is a rising demand for accelerated process development, increased efficiency and economics for biopharmaceutical production processes. furthermore, increased process understanding have evolved from the process analytical tool initiative (pat) and the quality by design (qbd) methodology. in contrast to one-factor-at-atime methods, statistical design of experiment (doe) methods are widely used to develop biopharmaceutical processes. even if highthroughput systems can handle these numbers of experiments in parallel, the heuristic restriction of boundaries and the high number of factors results in stepwise iterations with multiple runs. therefore, the combination of model-based simulations with doe methods (mdoe) for the development of sophisticated cell culture processes is a novel tool for process development [ ] . it is used to reduce the number of experiments during doe and the time needed for the development of more knowledge-based cell culture processes. this concept was applied to the optimization of the initial glutamine and glucose concentrations of a cho batch process. a mechanistic model was adapted and modified from [ ] and used to describe the dynamics of cell metabolism and antibody production of an il- antibody producing cho cell line (see abbreviation of fig. for cultivation details). experiments were simulated and compared to a fully experimental doe. as can be seen from table , user defined constraints were chosen to get a stable and reproducible process with the aim of maximizing the cell density but decreased lactate and ammonia production. at first, the experimental space was estimated by simulating the responses for broad concentration ranges and calculating the multiple response desirability function (fig. a) . this results in a small area (turquoise) suggested as experimental space. experiments were planned within these boundaries and responses were either simulated ( fig. b, cultivations for fitting the model) or compared with the purely experimental responses (fig. c, cultivations). optimal concentrations for glutamine and glucose with respect to the constraints are in the lower right corner and similar for both methods (red frame, fig. ). compared with the fully experimental design, mdoe results in a reduction of % in the number of experiments ( experiments for modelling vs. experiments in experimental doe). the method is intended to optimize cultivation strategies for mammalian cell lines and evaluated these before experiments have to be performed in laboratory scale. this results in a significant time and cost reduction during process development and process establishment. the strategy is especially intended for the use in multi-single-use-devices to speed up process development. . at a target cell density of * vc/ml, volumetric productivity was stable for a day steady state with excell® advanced hd perfusion medium. shorter steady states were tested at * vc/ml background for the large-scale production of therapeutic glycoproteins, fedbatch culture has been widely used for its operational simplicity and high titer. however, repeated feeding of medium concentrates and/ or addition of a base to maintain optimal ph during fed-batch culture lead to increase in osmolality. the hyperosmolality affects glycosylation in a protein-specific manner. however, the mechanism behind such osmolality-dependent variations in glycosylation in recombinant chinese hamster ovary (rcho) cells remains unclear. in this study, to better understand the effect of hyperosmolality on the glycosylation of a protein produced from rcho cells, we investigated n-glycosylation-related gene expression and n-linked glycan structure in fc-fusion protein-producing rcho cells exposed to hyperosmotic conditions. furthermore, to validate the effect of hyperosmolality on protein glycosylation, we performed hyperosmotic culture supplemented with betaine, an osmoprotectant, and then analyzed the n-linked glycan structure and mrna levels of n-glycan branching/antennary genes. after three days of hyperosmotic culture, nine genes (ugp, slc a , slc d , gcs , manea, mgat , mgat b, b galt , and b galt ) were differentially expressed over . -fold of the control, and all these genes were down-regulated. n-linked glycan analysis by anion exchange and hydrophilic interaction hplc showed that the proportion of highly sialylated (di-, tri-, tetra-) and tetra-antennary n-linked glycans was significantly decreased upon hyperosmotic culture. addition of betaine, an osmoprotectant, to the hyperosmotic culture significantly increased the proportion of highly sialylated and tetra-antennary n-linked glycans (p ≤ . ), while it increased the expression of the n-glycan branching/antennary genes (mgat and mgat b). thus, decreased expression of the genes with roles in the n-glycan biosynthesis pathway correlated with reduced sialic acid content of fc-fusion protein caused by hyperosmolar conditions. conclusions taken together, the results obtained in this study provide a better understanding of the detrimental effects of hyperosmolality on n-glycosylation, especially sialylation, in rcho cells. the identified genes, particularly mgat and mgat b, are potential targets for engineering in cho cells to overcome the impact of hyperosmolality on glycoprotein sialylation. disruptive cost-effective antibody manufacturing platform based on cutting-edge purification process v. medvedev, m. duyck, t. albano, j. castillo univercells sa, gosselies, belgium correspondence: v. medvedev (v.medvedev@univercells.com) bmc proceedings , (suppl ):p- background demand for high-quality monoclonal antibodies is growing exponentially, calling for new production capacities. overcoming current limitations of conventional manufacturing strategies, namely the high capital investment and production cost, can only be achieved through innovative process designs based on the latest technologies. this study presents a process design combining batch-fed technology with continuous multi-column capture. an advanced cell culture clarification method was introduced to simplify downstream operations and increase overall cost-effectiveness of the process, for an optimized production of recombinant proteins. this study was performed with cho cells expressing a monoclonal antibody targeted against the coronoavirus responsible for the middle east respiratory syndrome (mers), developed by organic vaccines tm and the nih, kindly provided to univercells. upstream process: -fed-batch, days culture at l scale with cd-cho chemically defined media and feeds. harvest treatment: -precipitation of impurities in the production bioreactor using organic compounds (< % v/v) and flocculation by electropositive organics (< . % w/v). -acidic ph and physiological conductivity. upstream processing and harvest treatment: culture reached . g/ l ( x cells/ml; % viability), harvest treatment was found to be very effective in terms of impurities clearance. capture: capture strategies were evaluated from the point of view of simplification of downstream operations, with hcp impurities content monitored as a key performance indicator. -protein a affinity chromatography: advanced harvest clarification enabled major improvements in affinity capture, in terms of eluate purity and reduction of host cell impurities (< ppm in all conditions tested). (fig. ). -cation exchange chromatography: cex allows higher capacities (> g/l) than protein a, whilst being more affordable (from -to -fold cheaper). low residual hcp (< ppm) was observed with all cex resins tested. without harvest treatment and clarification preceding the capture studies (either affinity or cex), results showed a lower binding capacity of the resin, a higher content of hcp in the eluate (up to ppm), a higher content of hmw species in the elution fraction (up to -fold higher) and a significant turbidity of the neutralized eluate. -continuous multicolumn chromatography: further options to increase cost efficiency include using a continuous multicolumn setup (table ) . two models were assessed based on two different static binding capacities (sbc), demonstrating that to columns of ml were able to process a l production in less than h. this method provides a great opportunity for designing simplified and low footprint mabs dsp processes, while maintaining similar or achieving superior quality profile compared to standard approaches: -harvests treatments followed by depth filtration proved to be a cost-efficient way to obtain pretreated feed and minimize the burden on downstream operations. -protein a resins exhibited advantages of extracting key contaminants during harvest treatment, while caex confirmed to be a competitive capture strategy. -switching from batch to continuous multicolumn mode allowed to process a complete batch in less than hours, requiring lower media and resins volumes. followed by a single polishing step, such process set-up strongly supports the reduction of operations required to deliver a high-quality product. analyses of product quality of complex polymeric igm produced by cho cells background immunoglobulin m (igm) antibodies are secreted by b cells as the first defense against invading pathogens during primary immune response. some igm antibodies already gained the orphan drug status, which shows their unique capability in therapy of rare diseases. potential fields for applications are discovered with increasing knowledge about these molecules. it seems that the most active forms are pentameric and hexameric igms. unfortunately, recombinant production of igms is rather difficult as secretion and correct polymer formation results in low expression yields and mixtures of polymers. we established stable producing chinese hamster ovary (cho) dg cell lines to analyze cellular and extracellular factors that influence quantity and quality of the produced recombinant polymeric igm in future studies [ ] . one quality parameter is polymer distribution, which can be measured directly in cell culture supernatant using densitometric analyses [ ] . additionally, we developed a very efficient single-step-affinity purification strategy using the poros captureselect igm affinity matrix to analyze pure igms. for more precise measurements of the igm isoform distribution we separated the purified polymers by high performance liquid size exclusion chromatography (sec hplc). our cho dg cell lines grow to peak cell concentration of . x cells/ml in erlenmeyer flasks and . x cells/ml in bioreactors. similar productivity of approximately mg/l was observed for cells cultivated in both cultivation vessels in a nonoptimized batch culture using chemically defined media. analysing how cultivation conditions affect the fraction of polymers may offer clues about the assembly of polymers and the challenges of igm production. we quantified polymeric distribution of igm directly in the supernatant using a densitometric method [ ] . cultivated under standard conditions ( °c, ph ) igm is produced as % pentamers, whereas igm _gl only consists of approximately % pentamers. the purified igm _gl was analysed with sec-hplc and contained % pentamer and % dimer, which is comparable to the results achieved with densitometry. the purification of the igm antibodies was quite challenging as the manufacturer recommend acidic elution, which led to aggregation and inefficient elution of our model igms. therefore, we screened for different elution buffers that prevent denaturation and aggregation. by combining high salt concentrations with moderate ph reduction we optimized elution conditions to - % igm recovery, which corresponded to a five to six fold improvement compared to the manufacturers' conditions. sds-page analysis and sec-hplc showed that our elution strategy resulted in a very pure product after a single chromatographic step. the purification strategy was verified with the igm , igm and igm . our model igms were produced in a ratio of approximately : pentameric to dimeric igm, measured concordantly with both analytical methods. process development on igm purification using the poros capture select human igm affinity matrix enabled the recovery of highly pure fractions. through optimization, by combining mild ph and high salt concentrations, the relatively low elution yields were increased by a factor of - . applying densitometry and sec-hplc we will investigate how culture conditions influence polymer formation in future. currently, no small scale (< . l) cell culture system is commercially available for high cell density perfusion cultivations to use in high throughput screening studies. to increase throughput for process characterization activities at janssen vaccines and prevention, a shaker flask-based scale down model was developed. though, the control possibilities of shaker flask cultures are technically very limited and different compared to a bioreactor controlled process. in addition, the sensitivity of the shaker flask model should allow the detection of the effects of process parameters on critical quality attributes (cqas) of the vaccine produced at large scale. iterative experiments were performed in shake flasks to evaluate the influence of cultivation parameters such as shaking speed, working volume, co % in the incubator and daily base additions on cultivation parameters (as cell growth, ph and do). in addition, a medium exchange was tested to mimic the perfusion mode used in the bioreactor process. the presens shake flask reader was implemented to allow for ph and do monitoring. the conditions for which the performance as reflected in specific virus titer showed the best fit were selected. at these conditions, a series of parallel shaker flask infections were conducted to demonstrate statistical equivalence of performance parameter and cqas (as cell specific iu titer and vp/iu ratio) between the production scale and reduced scale processes and thus to qualify the shake flask as a scale-down model. a daily medium exchange by centrifugation was implemented and cultivation parameters for shake flasks were identified. based on performance parameter (cell specific vp titer) and the cqas of the vaccine (cell specific iu titer and vp/iu ratio), equivalence between the production-scale and scale down systems was confirmed. the scale down model data fall into the % prediction intervals calculated on manufacturing data whereas scale down model data from batch mode experiments (using non optimized cultivation conditions) do not. the shaker flask as a scale down model for the l bioreactor perfusion process was qualified. this model is a tool to screen a subset of process parameters at a higher throughput, thereby reducing process characterization timelines. background until today, the market for therapeutic proteins, especially monoclonal antibodies, is gaining more and more importance in the pharmaceutical field. to meet the increasing demand for these products, the industry made tremendous efforts to generate highly efficient production systems. one of the pharmaceutical industry's research focuses is the improvement of the secretion process in eukaryotic cells. in mammalian cells, the efficiency of protein transportation strongly depends on the translocation of a nascent protein into the er, which is mostly conducted by the signal peptide (sp) coupled to the nterminus. through the interchangeability of signal peptides between products and even species, a large variety can be used to enhance protein expression in already existing production systems materials and methods at first the influence of four different natural sps (sp ( ), ( ), ( ) and ( )) was compared on the secreted amount of an igg model antibody (product a) in fed batches using a cho dg host cell line. in the second part, one promising sp-candidate showing improved secretion (sp ( )) was identified and the influence of this sp on four additional antibody products, which varied in their expressability from good to mid/bad, was investigated. in both approaches, the standard sp was implemented for comparative reasons. in the first approach, four signal peptides sp ( ), sp( ), sp( ) and sp( ) were screened for their potential to improve the product secretion of cho dg cells expressing a model antibody (product a). the results revealed a . -fold increase in average final fed-batch antibody titer of sp( ) when compared to the standard sp approach (standard sp = . g/l; sp( ) = . g/l). in the second approach, the enhancing capacity of sp( ) on secretion of four other igg products (named product b to e, table ) was further evaluated. an improved performance was observed for all products when comparing sp( ) and the standard sp in a fed batch process (fig. ) . with an increase in average final fed-batch titers ranging from to % and up to % in cell-specific productivities. taken together, with a positive influence on the final concentrations of all tested products, the results obtained with sp( ) contribute to -signal peptide sp( ) was identified as a promising candidate with an average . -fold titer increase during screening of four signal peptides. -sp( ) was able to improve production titers up to % compared to standard sp. -sp( ) was able to improve cell-specific productivities up to % compared to standard sp. -future usage of sp ( ) contributes to the further optimization of sartorius stedim cellca's standard cell line development process. new platform for the integrated analysis of bioreactor online and offline data lukasz gricman , milan ganguly , amanda fitzgerald , hans peter more and more experiments are used to assess bioreactor suitability and stability of clones, to evaluate media composition and other process parameters, and to start upscaling campaigns. this has resulted in a major bottleneck due to the increase in data capturing, processing, aggregation, visualization, and statistical analysis. in addition, the association of the data with the experimental context (e.g., fermentation protocols, media recipes, bioreactor control parameters) is not easily accomplished in high throughput. the data generated in the process must not only be analyzed, but also managed and stored to enable easy tracking and relating to historical records. furthermore, the processes are often developed by global teams interacting in complex enterprise it ecosystems. therefore, new and high performing systems for data capture, processing, and analysis need to be integrated in order to enable storage and correlation of experimental context information and various types of time course analytics data. we have developed genedata bioprocess™, a new enterprise platform for bioprocess development. the platform enables automatic capture and visualization of all online and offline data (e.g., ph, o , metabolic data), auto-calculations and aggregations (e.g., ivcd, qp, consumption rates) and multi-parametric assessment of any type of time-series bioreactor data in the context of experimental protocol data (e.g., process parameters, feeds). genedata bioprocess comes with dedicated interfaces for integrating with relevant laboratory instruments, control systems, statistical analysis software packages and custom enterprise solutions. it enables the modeling and tracking of complex nonlinear workflows and supports decision making in bioprocess development. the data can be analyzed in the context of upstream process development, and also be correlated to other unit operations. automation support assists the ever increasing throughput of bioprocess development operations, and the analysis of experimental data and process parameters across unit operations or even different projects. this overall integration enhances process development workflows. highlighted use cases describe the selection of the best producer clones (fig. a) , the identification of optimized media feeding strategies (fig. b) , and the comparison of clone performance across different fermentation scales (fig. c) . a special focus is on the analysis of data from micro-and bench-top bioreactors (such as the ambr ™ and dasgip™ systems) operated in parallel. these bioreactors allow for increased throughput of clone selection and process optimization studies, which in turn leads to an increase in data generation. genedata bioprocess supports integration with such systems and enables a comparison of data regardless of the instrument provider or scale. automated bioreactor data analysis allows development groups to take advantage of even richer datasets and, as data management is built-in to the system, the data can be easily tracked and associated to historical records. another focus is on cross-reactor scale comparisons. data coming from different bioreactor scales can be easily imported into the platform and analyzed to establish the best conditions for upscaling. genedata bioprocess enables the correlation of process parameters (e.g., fermentation protocols, media recipes, bioreactor control parameters), with key performance indicators of the processes (e.g., titer, qp) and the product quality attributes (e.g., aggregation, glycosylation profiles). finally, bioreactor time course data can be tracked together with clone analytics and product quality parameters, which makes the platform uniquely able to support end-to-end biopharma development. upstream bioprocesses are at particular risk of contamination from adventitious agents. the typical . μm filters used at this step protect bioreactors from bacteria and mycoplasma but offer no protection from viral contaminations. a new polyethersulfone (pes) upstream virus filter, viresolve® barrier, has demonstrated high levels of microorganism retention -full retention for bacteria and mycoplasma (> . lrv -log reduction value) and~ lrv for small viruses, such as parvoviruses. it also has improved flow and capacity as compared to virus removal filters designed for monoclonal antibody purification. given the small pore size of virus retentive filters, implementing a virus filter upstream of the bioreactor raises the question of whether critical cell culture media components are removed. therefore, it is important to evaluate the cell culture performance and protein quality attributes using virus-filtered media to ensure that filtration does not negatively impact the process. materials and methods ex-cell® cho media and corresponding feeds were processed through either viresolve® barrier filters or . μm filters (control). media composition post-filtration was evaluated by high performance liquid chromatography (hplc), inductively coupled plasma/ optical emission spectrometry (icp-oes), and nuclear magnetic resonance (nmr). recombinant cho cells were cultured in fed batch culture. cell density and viability were measured by vi-cell tm cell viability analyzer while metabolites were analyzed by bioprofile® flex analyzer. shake flasks and bioreactors were utilized to verify that surfactants, such as poloxamer, (which are essential for shear protection in stirred tank bioreactors and can be difficult to filter) have not been removed during filtration. monoclonal antibody titer was quantitated by protein a hplc. characterization of the antibody product quality was assessed via weak cation-exchange chromatography (charge heterogeneity), size exclusion chromatography (aggregate profile), and -ab fluorescent labeling with np-uplc (glycan species). media and feed compositions were unaffected by filtration through the virus barrier filter. no significant differences in concentrations were observed with icp-oes (trace metals) or hplc (amino acids and water soluble vitamins). nmr showed no change in the organic composition of the media including poloxamer. the aromatic region with vitamin and amino acid signals is shown (fig. a) . cell cultures showed no differences in cell growth or titer, in either shake flasks or bioreactors (fig. b) . cell viability was unaffected, metabolite levels were within limits, and titer was consistent. the protein quality of the secreted antibodies showed no differences in the glycosylation pattern (fig. c) , amount of aggregates or charge variants. the risk of virus contamination in the bioreactor remains a concern for biotherapeutic manufacturers as there is no universal technology that provides a reliable, cost effective solution for virus removal that can be applied to all components of cell culture media. this study evaluated the viresolve® barrier filter that provides an efficient and easy way to protect bioprocesses from adventitious virus contamination. study results demonstrated that media and feed compositions, cell culture performance, and product quality were unaffected by filtration through the viresolve® barrier filter. implementation of vire-solve® barrier filters provides efficient filtration performance, high virus retention, and minimal cell culture impact and offers a viable option to improve the overall virus risk mitigation strategy for the manufacture of biotherapeutics. b tracking of process conditions together with online and offline performance analytics. the system allows to flexibly define tracked parameters and select optimal process conditions. c comparison of process performance across different reactor scales. the open architecture makes genedata bioprocess a provider agnostic system which allows to aggregate and compare data regardless of provider background bi-and multi-specific antibodies, antibody-cytokine fusion proteins, nonimmunoglobulin scaffolds, chimeric antigen receptors (cars), engineered t-cell receptors (tcrs) and tcr-based bispecific constructs can provide significant advantages for use in cancer immunotherapy. however, as highly engineered molecules they pose new challenges in design, engineering, cloning, expression, purification, and analytics. we have thus implemented an infrastructure that addresses these challenges and enables the industrialization of these various novel therapeutic platforms. in close collaboration with leading biopharmaceutical companies, we implemented a workflow, data management and analysis support system, genedata biologics™, enabling the automated design, screening, and expression of large panels of therapeutic candidates using these novel technologies. we have also built tools for developability and manufacturability assessments of these complex molecules. we have ensured that there is a seamless integration of all data generated and that functionalities such as bulk protein and vector generation using our in silico cloning engine, configurable library of template vectors and cloning strategies, fully annotated in silico protein molecules and dna constructs, and dna synthesis verification support, can be used for the newest protein formats and molecule topologies. we implemented data structures and data handling systems, which mirror how these complex next-generation biologics molecules and cell lines are being designed, screened, and analyzed. the result successfully addresses workflows for tcr optimization and engineering. we exemplified this with the generation and evaluation of a panel of engineered tcrs with an alpha chain cdr randomization and successfully supported the analysis and selection of beneficial mutations. the system also successfully supported workflows for the design and generation of a panel of tcr-based bispecifics (tcr coupled with anti-cd ) using automated molecule registration and in silico cloning tools and subsequent capture of expression, purification, and functional and analytical characterization data. on the car-t cell front, the system is able to provide traceability of the work from antibody generation, optimization, car engineering (e.g., attachment to the scfv with cd -zeta and co-stimulatory domains to mimic the natural tcr complex) to the engineering of the t-cell. the genedata biologics platform successfully enabled automation, increased data integrity and traceability during research and development work, and will contribute towards the industrialization of these very exciting novel approaches for cancer immunotherapy. optimal selection of therapeutic antibodies and production cell lines by assessment of critical quality attributes and developability background the increasing cost of bringing a new drug to the market has put significant pressure on biopharma organizations. to increase efficiency in r&d processes and reduce costs, organizations need to evaluate potential drug candidates earlier in the r&d process, eliminate those with undesirable characteristics, and focus on the most promising candidates. after designing and thorough testing of successful candidates, efficient production of new biological entities in mammalian cell lines is necessary. the main goal here is to find a suitable cell line and optimal upstream and downstream processing conditions that not only lead to a satisfactory product yield, but also to a product with the desired biochemical properties. the evaluation of production cell lines, processes, and product quality attributes is performed earlier and in higher throughput for an increasing number of drug candidates. in addition, new methods in molecular and cell biology (e.g., novel genome engineering approaches such as crispr/cas ), in analytics [e.g., process analytical techniques (pats)], in process miniaturization, and in automation promise to make process development more efficient. however, the management and analysis of the increasing amount of experimental data during candidate selection and cell line and process development has become a bottleneck. in addition, quality-compromising steps in biopharma organizations can negatively impact the cost of goods and substantially prolong the drug candidate's time to market. therefore, systems for integrated management and analysis of wellstructured and curated data that comprehensively integrate molecule and sample information, manufacturing process parameters, and process and product quality attributes are needed. critical quality attribute (cqa) assessment should be enabled along the whole bioprocess development workflow, including cell line development, upstream and downstream process development, as well as analytical and formulation development. we have developed a comprehensive platform, genedata bioprocess™, which supports drug candidate developability and manufacturability assessment and bioprocess development. the platform captures and structures the cell line and process parameters together with analytical data for cell lines, processes, and protein products. the protein analytical data being tracked include biological data (such as bioactivity, immunogenicity), and physicochemical properties. these properties include glycosylation, chemical liabilities (such as deamidation and oxidation), aggregation, stability under different conditions (low ph, low and high temperature), solubility, and impurities. genedata bioprocess™ simplifies and streamlines laborious, manual process and supports tools for molecule, clone and process selection. furthermore, the platform allows for seamless integration with laboratory instruments, statistical software packages, and custom solutions. here, we present use cases showing how to identify and annotate liability sites prone to chemical modifications (fig. a) and how to monitor cqas of molecules allowing to assess developability more efficiently. we show how the analytical data generated in the course of a developability assessment are compiled to select the best drug candidate (fig. b) . implemented traffic-light systems indicate where molecules harbor issues such as in case of the antibody tpp- , which is compromised by low temperature and repeated freeze-thaw operations. the same assessment views can also be applied on batches and cell lines. the underlying data can be visualized graphically. as an example, we show glycan types of products obtained from different cell line clones generated in a cell line development campaign for the molecule tpp- (fig. c) . even though the selected clone cli- meets the glycosylation criteria (e. g., < % afucosylation, < % galactosylation, < % sialylation), the produced next-generation biologics molecules are composed from a number of specific subdomains. each type of molecule is composed of a specific set of domains, which must be mirrored in the registration and further research and development workflow. molecule registration and hit-selection using data from a number of assays is shown here using the example of car-t cells. the image is a screenshot from the genedata biologics™ software molecule harbors some stability issues as mentioned above. therefore, more attempts would be needed either in formulation or in reengineering of the complimentarity determining regions (cdrs) in order to provide a developable ttp- -like drug candidate. background environmental process variables are often used as tools to optimize the performance of mammalian cell cultures to achieve higher cell densities and high productivities of r-proteins (q p ). the manipulation of culture temperature in the range of mild hypothermia (mh) ( - °c) [ , ] , as well as different glucose availability scenarios [ , ] , has been shown to improve productivity in different cell lines. however, the manipulation of these variables individually or together has a concomitant effect on the rate at which cells grow, masking the net response exhibited by the cells. in order to identify the effects of these variables, we have taken advantage of the use of the chemostat culture. chemostat cultures were performed at two dilution rate (d)( . or . (h- )), two temperatures ( or °c) and three feed glucose concentrations ( , or mm). the response was analysed considering r-protein production, cell growth and key metabolites. r-tpa protein concentration was determined by immunoassay (trinilize tpa kit); cells were counted using a hemocytometer and cell viability was determined by the method of exclusion using trypan blue (t , sigma, usa); glucose, lactate and glutamate were determined by enzymatic assay using a biochemical analyser ysi (yellow spring instruments). statistical analysis of the results was performed by anova (design-expert for windows). a decrease in cell density was observed in response to an increase of glucose feeding concentration, regardless of temperature or specific growth rate (in this case μ=d) evaluated. the maximum cell densities were reached at mm, achieving . and . x cells/ml at / °c and . (h - ); and . and . x cells/ml at / °c and . (h - ) respectively (fig. a) . the increase in glucose concentration from to mm resulted in an q p increase of and . fold at °c/ . (h - ) and °c/ . (h - ) respectively. a lower increase of . and . fold was reached at °c/ . (h - ) and °c/ . (h - ) respectively (fig. b) . the highest q p s were reached at °c and . (h - ). however, a positive effect of mh was not observed, in contrast to that observed in batch culture [ , , ] . this behaviour suggests that low μ is a main factor on increased r-protein production in batch cultures exposed at mh condition. the specific consumption rate of glucose was significantly increased by the glucose increase from to mm and reduced by mh (fig. c) . at . (h - ) the specific production rate of lactate (q lac ) was increased by glucose increase, independent of the culture temperature used. while at °c/ . (h - ) the q lac decreased with increasing glucose concentration and at °c/ . (h - ) a maximum consumption was observed at mm glucose (fig. d) . the lactate-glucose yield ( fig. e ) not showed relevant changes at . (h - ), while at . (h - ) this yield showed a more efficient utilization of glucose, as glucose concentration was increased. however, this last behaviour was not reflected in an increase of r-protein production. the concentration of glucose has the greatest impact on the behaviour of the culture, and its increase affects positively the protein productivity. the mh did not improve proteins productivity of cho cells producing tpa under the different conditions evaluated; low dilution rate and at high glucose concentration impact positively the protein productivity and the metabolism exhibited by the cells. background mammalian cell cultures are the most commonly used bioprocess for the production of therapeutic recombinant proteins such as monoclonal antibodies (mabs). facing to the increasing demand of these biopharmaceuticals, the fda has initiated the process analytical technology (pat) framework in order to encourage pharmaceutical industries to use innovative technologies to monitor in real time the critical process parameters (cpps), and to ensure the final product quality [ ] . one of the most important cpps for cell culture bioprocesses is the specific growth rate (μ), which is a direct indicator of cellular physiological state. indeed, μ is sensible to culture conditions and its value decreases when cells are in the unfavourable environment for growth [ ] , which may greatly influence mab production and quality. however, until this day, the online monitoring of μ remains a great challenge for mammalian cell culture bioprocesses. igg-producing cho cells were cultured in l stirred bioreactors equipped with an in situ dielectric spectroscopy (hamilton). operating conditions were fixed at rpm, % of air saturation, ph . and °c. permittivity of cell culture was measured every min, which allowed to calculate in real time the vcd by using a previously established linear correlation. then, a model of online estimation of μ was developed based on vcd prediction and cell mass balance equations. several signal noise filters and various calculation methods were evaluated to reach better model stability. cell cultures were performed in both batch and feed-harvest modes. feed-harvest cultures consisted of sequential renewals of / volume of the culture medium by following different strategies. this study proposed an innovative methodology based on dielectric spectroscopy to monitor in real time the cellular physiological state, by online estimating the specific growth rate (μ) of cells. model of online estimation of μ was developed from cultures in batch mode, and was validated by comparing online estimated μ with the experimental ones calculated at the end of the culture. with this model, the moment when μ started to decrease significantly, which indicated that cells were no longer in the exponential growth phase, was identified as the critical moment. to demonstrate the interest of online estimation of μ, the developed model was applied to a feedharvest culture, where the medium renewals were performed at the critical moments indicated by the model. this culture was then compared with the traditional feed-harvest culture where medium renewals were performed by following offline measurements of glucose and glutamine. we found that the online strategy allowed to maintain the value of μ by renewing the medium at the right time, while the values of μ varied a lot when using offline strategy. moreover, by using the online estimation of μ, the glycosylation of igg was kept at a high level (about %) throughout the whole culture. however, for the culture using offline strategy, the glycosylation level decreased progressively and was only about % at the end of the culture (fig. ) . model of online estimation of μ was developed by using dielectric spectroscopy, which allowed to monitor the physiological state of cells in cell culture bioprocesses. implementation of this model in feed-harvest cell culture led to better mab glycosylation, which demonstrates clearly the potential of this methodology in mab production bioprocesses. background monoclonal antibodies are normally synthesised from transfected mammalian cells as heterogeneous mixtures of glycoforms [ ] . however, clinical efficacy may depend upon single glycoforms which have been difficult to isolate [ ] . we have now developed an efficient method for generating single glycoforms by solid phase re-modelling which is superior to previous methods because it allows a sequential series of enzymatic changes without the need for intermediate purification of the antibody. solidphase binding exposes the antibody glycans to enable easier access of the transforming glycosylation enzyme. the antibodies subjected to modification were a chimeric human/ camelid monoclonal antibody (eg ), a humanized monoclonal antibody (il ), a full size chimeric antibody (cetuximab) and polyclonal antibodies obtained from pooled human serum. the antibodies were bound to a protein a column using conditions typical of mab purification (fig. ). after washing out non-bound impurities by a neutral ph buffer, each antibody was subjected to enzymatic modification directed to a targeted glycan profile ( table ). the antibodies were then eluted with a low ph buffer and neutralized. the glycan profiles were analysed following glycan removal with pngase f, labelling with -aminobenzamide and separation on a hilic-hplc column [ ] . prior to enzymatic modification glycan analysis of all antibodies showed variable galactosylation and sialylation typical of human abs. this included a distribution of fg , fg , fg , fs and fs with galactosylation indices ranging from . for il to . for eg . there was minimal sialylation in il but up to % in eg . glycan modifications were made as each antibody was held on a protein a column in accordance with procedures shown in table . agalactosylated glycans were enriched by treatment with the single addition of galactosidase and neuraminidase. this resulted in - % of agalactosylated structures in the mabs and % in the polyclonal antibody. galactosylated antibodies (> % yield) were produced by a single stage reaction involving sialidase and by galactosyltransferase with udp-gal. breakdown of the glycans to a trimannosyl core was accomplished by treatment of the agalactosylated structures with hexosaminidase. this produced a yield of - % of the fm structure with a small remainder of fa . sialylated antibodies (> %) were produced by a stage reaction involving sialidase, galactosyltransferase and finally treatment with , sialyltransferase in the presence of cmp-nana. the latter reaction produced equimolar quantities of monosialylated and disialylated cetuximab and polyclonal antibodies. the results suggest that for human antibodies ( kda) there may be a limitation for sialylation given the steric constraints between the two ch domains of the dimeric structure. the ability to sialylate the smaller camelid antibody ( kda) was greater resulting in a high (> %) level of disialylated glycans. this suggests that the steric constraints for glycosylation may be lower. these sialylated antibodies have significant potential clinical importance for their ant-inflammatory activities. we have modified the glycans of antibodies following immobilization on an affinity ligand column. this allows enzymatic transformation in a solid state that has a distinct advantage over the equivalent transformation in solution because the enzymes and buffers can be washed out on completion of the modification leaving the antibody still attached to the affinity ligand. this enables repeated rounds of an enzymatic reaction or sequential reaction steps without the need for intermediate antibody purification. the antibody can be removed eventually from the column by application of an elution buffer once all desired glycan modification have been made. since affinity ligand purification of antibodies is performed routinely as an initial step of purification after cell culture, the glycan modification can easily be incorporated into this process. the enrichment of the resulting antibody for a targeted glycoform can enhance the potential therapeutic efficacy as it is known that specific glycoforms are required for certain biological effects. [ , ] . this is mainly because microvesicles can be enriched/deprived for specific proteins, based on their functional purpose and their cellular origin. recently, microvesicles purified from the supernatant of t bladder cancer cells were reported to be enriched for bcl- and cyclin d (anti-apoptotic proteins), but deprived for bax and caspase- proteins (pro-apoptotic proteins) contributing towards immunity against programmed cell-death [ ] . however, impact of microvesicles on cho-based bioprocess has not been evaluated yet. therefore, in this investigation, we aimed to evaluate their impact on cell growth and recombinant protein production from cho cells. materials and methods cho-k cells were grown in chemically-defined protein-free culture medium (life technologies- ) in shake flask (gx- p). the different fractions of spent-media (microvesicles and microvesicle-free spent media) were collected using ultracentrifugation method [ , ] . quality of different fractions was ensured using western blotting for exosomal marker, cd (sc- ) and coomassie stained gel for loading control (fig. a) . to evaluate impact on cell growth, cells were seeded with microvesicles and microvesicle-free fraction collected from log-phase of culture and cell counts were performed by vicell using trypan-blue dye exclusion method. for impact on productivity, cell-free supernatant, collected from microvesicle-treated human igg secreting cho culture from stationary-phase of culture with respective control, was evaluated using elisa (ab ). microvesicles collected from % of media (by volume) from routine maintenance cultures compared to working volume for microvesiclesupplementation were used in each experiment. the growth of microvesicle-supplemented cultures had shorter lag-phase and achieved . fold higher maximum cell density ( . x viable cells/ml) compared to untreated standard culture ( . x viable cells/ml) and maintained higher for the remaining period of batch culture (fig. b) . however, microvesicle-free fraction did not had significant impact on growth. the viability of microvesicle-supplemented cultures, similar to microvesicle-free media supplemented, was also higher compared to standard culture suggesting potential use of microvesicles for regulating cho growth in production cultures. this could be possibily because microvesicles have already been reported to be enriched with cell growth/death-regulating proteins and hence facilitating cell growth [ , ] . we have also observed abundance of cell cycle regulators including cyclin d in microvesicle-fraction compared to microvesicle-free spent-media in our laboratory (data not shown); however, further investigation are required to prove the hypothesis. the overall productivity of human igg secreting cho cells was also observed to increase bỹ fold following supplementation of microvesicles to the culture without significantly affecting per-cell productivity. since microvesicle-supplementation facilitates cell growth, increased number of viable producer cells in the culture could be expected to be the basis of observed increase in the overall productivity of the culture [ , ] . the further work is ongoing to in-depth explore the potential of microvesicles for improving recombinant protein production from cho cells. the data indicate that microvesicles secreted from cho cells can improve cell growth and hence recombinat protein production in culture. therefore, strategies need to be developed for sterile isolation of cho microvesicles from routine maintenance cultures and their supplementation into the production culture for improving the performance of cho-based production process. the glycosylation profile of a recombinant protein is one of the most important attributes when defining product quality. producing a protein with desired characteristics requires the ability to modify and target specific glycosylation profiles. traditionally the approach to modify the glycosylation profile of a protein involves supplementing a culture with components that can improve galactosylation. experimentation using this supplemental approach resulted in a dramatic increase in terminal galactosylation, but lacked the ability to easily and repeatedly target specific glycosylation profiles. using novel and proprietary technology, we have developed a feed (glycantune™) and a unique feeding process that will maximize growth and titer while being able to modulate glycan profiles. this new feed can be added as a standalone process that can result in a significant shift from g f to g f and g f (maximum galactosylation). using a unique fed-batch process, glycantune can also be used with a standard feed to dial in targeted glycosylation profiles. through process development, we created a method where a transition point is used to switch from a standard feed to a glycan modulating feed. the timing of the transition point will determine the specificity of the glycan profile. ) . n-linked glycans were digested with pngase f and quantified using pmole maltohexose/maltopentose internal standards labeled with -aminopyrene- , , -trisulfonic acid (atps) as described by laroy et al [ ] or the user guide for the glycan labeling and analysis kit (glycanassure™ user guide, thermo fisher scientific). all ce separations were performed using the applied biosystems™ xl. the timing of transition from efc+ to gtc+ made it possible to target specific glycosylation profiles. modulating g f from % down to %, while increasing g f and increasing g f (fig. ) . transitioning to gtc+ early in culture resulted in a greater shift from g f to g f and g f. transitioning midway or late in culture resulted in a greater proportion of g f compared to g f and g f. supplementation based approaches using glycosylation modulating media components to modify and target specific glycosylation profiles proved to be difficult. these approaches were able to increase terminal galactosylation (g f and g f), but lacked the ability to fine tune glycan profiles. this could result in numerous rounds of titration experiments to target specific glycan profiles that would likely remain inconsistent between cell lines, culture media and feeds, and process scale. the development of a unique process made it possible to predictably target specific glycosylation profiles. transition from standard feeding to glycantune allowed for precise targeting of glycan profiles. transition to glycantune early in culture resulted in an increased shift from g f to g f and g f. a transition late in culture resulted in increased g f and decreased g f and g f. growth performance during precultures and batch curves in plain shaking flasks did not show any differences among tested surfactants or lots thereof, and cell densities reached - · cells/ ml ( fig. a and b) . experiments with hek -f cells at elevated power input in baffled shaking flasks revealed distinct differences between pluronic® f- , f- and kolliphor® p , with f- showing the best performance. peak viable cell densities reached with lots a and b of pluronic® f- and f- were comparable to those in plain shaking flasks, while those for kolliphor® p and lots c and d of pluronic® f- were significantly lower. peak viale cell densities were of - · cells/ml (fig. c) . similar transient transfection efficiency and mean fluorescence of transfected cells independent of applied surfactant and lot thereof indicated no major impact of respective poloxamer (fig. d) . interestingly, experiments using fluorescein-labelled pluronic® showed a time-dependent uptake into hek cells. visual tracking revealed an endocytic uptake of poloxamers by the cells (> fold increase in signal after h) and its co-localisation with cell membrane and lysosomes. sec (fig. e) analyses showed differences between the tested poloxamers. especially tested lots of pluronic® f- revealed notable deviations in the low molecular weight fraction (peak , fig. e ), compared to the other poloxamers. cultures subjected to varying levels of shear stress showed distinct growth differences depending on used poloxamer. while experiments in plain shake flasks did not show any differences in growth, cultivations under elevated shear stress in baffled shake flasks resulted in lower peak viable cell densities with kolliphor® p and some pluronic® f- lots. it remaines unclear whether this can be explained by different membrane protective activities alone, or if other mechanisms, occuring during and after cellular uptake, contribute to this effect. especially for the tested lots of pluronic® f- , sec of surfactants showed differences in the low molecular weight fraction. this fraction mainly represents polyethylen oxide (peo) (revealed by nmr), which is likely to be a remnant from synthesis. these observations indicate that the use of different poloxamers and lots thereof should be carefully evaluated, especially under elevated shear stress. further experiments will focus on investigating distinct sec fractions of poloxamers. overcoming (fig. ) . aurintricarboxylic acid (endonuclease inhibitor; enhancer used in e.g. salivary gland transfection) and polyvinylpyrrolidone (polymer; beneficial in electroporation) were both found to negatively impact peimediated transfection of cho cells, while another tested polymer enhanced growth as well as transfection efficiency. the use of a strong chelator led to a high transfection efficiency, but impaired cell growth. based on the results of the independent substance testings, the medium formulation was modified by the addition of a weak chelator and further components including vitamins. different osmolalities between mosmol/kg and mosmol/kg were tested for the final formulation, but no major impact was seen neither on transfection efficiency nor on viability days post-transfection. the final cho tf medium formulation supported high cell growth of finally tested cho cell lines and with peak viable cell densities above ⋅ cells/ml in batch cultivations with an overall cultivation time of - days (fig. ) . further improvements of the process might be achieved by adapting the protocol, as the results shown are based on a simple precomplexing of dna-pei. moreover, product yields could potentially be increased by using feeds, temperature shifts or commonly used enhancers (e.g. valproic acids). scaling of a cell culture process is an essential part in its development. in a typical approach scaling [ ] is performed by keeping a (critical) process parameter constant throughout the complete bioreactor range. this can lead to non-beneficial results either on the high or the low end of the range. for instance, the specific power input [p/v] of w/m might result in a good agitation in production scale whereas it leads to a nonturbulent mixing behavior in process development scale. to overcome this issue a new approach for an easy scaling procedure was developed. this "utility function" approach for agitation scaling is based on individual functions with a value-based mapping independent of bioreactor scale. process insight information (established either from doe process investigation or existing experience with a process platform) is directly formalized into a set of mappings which transform bioprocess values into perceived benefits ( to ). at each bioreactor scale, parameters (e.g. stirring and gassing) are then chosen to maximize the product of resultant utility functions. the model cho fed-batch process in this trial comprised a cho dg cell line that was transfected to produce a humanized antibody igg . a chemically defined media system was used. the process, including cell line, medium and feeding strategy was designed and developed by sartorius stedim cellca. the aim of the gassing scale-up was to achieve similar cell densities when the addition of pure oxygen starts. for all flexsafe str® bags oxygen was sparged via the micro sparger part of the combi sparger. all other systems used a ring sparger with holes face up. the initial air flow rate was set to an oxygen transfer rate (k l a) of /h at the corresponding agitation rate and volume. all process engineering characterization parameters were determined according to dechema guidelines [ ] . with the use of the utility functions the discrete agitation rate was determined (table ) . the utility functions led to discrete agitation rates where not only homogeneous mixing but also a turbulent flow pattern and a suitable specific power input was guaranteed. the initial gassing rate of air supplied enough oxygen for x cells/ml in all bioreactors. due to the used scaling methods the growth patterns in all bioreactor scales were comparable. peak viable cell densities (vcd) of - x cells/ml were achieved and viability at the point of harvest was above % in all scales. the final product concentration was in an acceptable range of . - . g/l. product quality attributes show comparability over the complete bioreactor range (fig. ) . the harvest criteria of days gave a combination of viability and product concentration that made it easy to process the cell broth during cell removal and other downstream steps. the process implementation of the cho production system -expressing mab was successfully performed with the use of utility functions. cell growth, productivity and product quality is comparable over the complete bioreactor range. background endoplasmic reticulum (er), the central part of the secretory pathways in eukaryotic cells, is responsible for controlling the quality of secreted and resident proteins through the regulation of protein translocation, protein folding, and early post-translational modifications [ ] . a number of physiological conditions such as oxidative stress, hypoglycemia, acidosis, and thermal instability can disturb the er functions, which triggers er stress [ ] . prolonged er stress induces apoptotic cell death [ ] . oxidative stress that naturally accumulates in the er as a result of mitochondrial energy metabolism and protein synthesis can disturb the er function [ ] . because er has a responsibility on the protein synthesis and quality control of the secreted proteins, er homeostasis has to be well maintained. when h o , an oxidative stress inducer, was added to recombinant chinese hamster ovary (rcho) cell cultures, it reduced cell growth, monoclonal antibody (mab) production, and galactosylated form of mab in a dose-dependent manner. antioxidants can reduce the oxidative stress level and suppress the apoptotic cell death by scavenging oxygen free radicals, inhibiting chain reaction of oxidation, and detoxifying peroxide [ ] . however, despite the importance of mass production of mabs, studies on the effect of antioxidants on the production and quality of mabs in rcho cell cultures have not been fully substantiated. to find a more effective antioxidant in rcho cell cultures, six different antioxidants including baicalein, which have used widely in mammalian cell cultures, were evaluated as chemical supplements with two different rcho cell lines producing the same mab in -well plates. then, batch and fed-batch cultures were performed in shake flasks with the supplementation of baicalein, which showed the best effect on culture performance among the antioxidants. the reactive oxygen species (ros) and er stress levels were measured to study the effect of baicalein on mab production and quality. among these antioxidants, baicalein showed the best mab production performance. addition of baicalein significantly reduced the expression level of bip and chop along with reduced ros level, suggesting oxidative stress accumulated in the cells can be relieved using baicalein. as a result, addition of baicalein in batch cultures resulted in . - . -fold increase in the maximum mab concentration (mmc), while maintaining the galactosylation of mab ( fig. and table ). likewise, addition of baicalein in fed-batch culture resulted in . -fold increase in the mmc while maintaining the galactosylation of mab. oxidative stress negatively affected the production and galactosylation of mab in rcho cell cultures. among the various antioxidants tested in this study, baicalein showed the best mab production performance in both batch and fed-batch cultures of rcho cells. baicalein addition significantly enhanced mab production while maintaining galactosylated forms of mab. thus, baicalein is an effective antioxidant for use in rcho cell cultures for improved mab production. background the production of many biopharmaceuticals (e.g. antibodies & proteins for diagnostic and therapeutic purposes) requires the cultivation of mammalian cell lines, which is demanding with respect to various aspects such as complex cell metabolism, variabilities in cell behavior, scale dependencies, influences of changes in cultivation conditions, medium composition etc. although an increasing number of measurement parameters is available, only a part of them is routinely utilized in industrial cell culture processes and their corresponding seed trains. nevertheless, the data base grows, statistical investigation of data gains importance and process data are more easily accessible in the context of industry . . cell cultivation has to consider these complex requirements, e.g. for fed-batch control and seed train design. furthermore, cultivation strategies have to be adapted to new products, cell lines and clones as well as to different production plants when transferring processes. one approach to encounter the variabilities and to include actual information from the process and from data analysis is adaptive model-assisted control [ ] . two software tools enabling adaptive model-assisted control applying unstructured, unsegregated models have been developed and implemented using matlab © , winers and fortran, one tool for fedbatch control and another one for seed train simulation and optimization. one key element of adaptive model-assisted control is the underlying process model. in order to provide an adaptive character, model parameters should be easily identifiable from routine cultivation data, which is available during seed train and fed-batch without additional sophisticated measurements. therefore, the usage of unstructured, unsegregated models is recommended. a) example of an unstructured, unsegregated cell culture model (for adaptive model-assisted control) one example, describing cell growth, cell death, uptake of substrates and production of metabolites via a first order system of ordinary differential equations and monod-type kinetics, is shown in table . this mathematical model includes cell specific model parameters [ ] . ii) b) open-loop control sequence for seed train simulation and optimization [ ] : using model, a priori identified model parameters and starting concentration values, the temporal concentration courses can be predicted for the first scale. subsequently, points in time for passaging and starting values for the next scale can be computed by adding a passaging strategy, seed train conditions and medium concentrations. prediction for the following scales can be obtained iteratively. integrating feedback from the process in terms of cultivation data enables increasing prediction accuracy and responding to possible changes in cell behaviour. process design and optimization, e.g. regarding seed train and fed-batch, is realized by adaptive model-assisted software tools using unstructured, unsegregated models. they enable feedback from the process via routine cultivation data and allow adaptation to diverse circumstances such as different cell lines, products, cultivation conditions, plant configurations etc. ) in polyelectrolyte capsules. significant advantages, such as great mechanical stability, good biocompatibility and good mass transfer properties characterized these capsules based on sodium cellulose sulfate/poly(diallyldimethyl) ammonium chloride (scs/pdadmac) [ , ] . here, we present the possibility to cultivate human t cells, freshly isolated from blood, to high densities in similar semipermeable polyelectrolyte microcapsules within less than days. cells were encapsulated in semipermeable scs/pdadmac polyelectrolyte microcapsules or confined in . % alginate/poly-l-lysine (pll) beads, a standard approach for cell immobilization. the permeability of the microcapsules was estimated using dextran-based molecular weight standards ( and kda) and vitamin b ( . kda). gentle digestion with endocellulase allows an easy release of the cells out of the capsules. cell growth, cytokines production and phenotype were measured in non-encapsulated and encapsulated cells grown under standard culture conditions. moreover, we analyzed the interplay between the secreted cytokines and the scs within the capsules and its putative influence on cell growth. cells mixed in the cellulose sulfate solution under physiological conditions can be safely trapped within a liquid core during capsule formation. encapsulated cells can reached cell densities ≤ x cells ml capsule - , whereas cells confined in alginate/pll beads and non-encapsulated ones reached . x cells ml bead - and . x cells ml, respectively. one major advantage of these polyelectrolyte microcapsules (< mm) is the low mwco (< kda) (fig. a-b) . this restricted permeability allows for a conditioning of the capsule core by autocrine factors, which in turn permits the use of basal cell culture medium instead of expensive t cell specialized media, hence does not necessitate high amounts of rhil- and reduces the cultivation costs. moreover, co-encapsulation of rhil- had a beneficial effect on the growth kinetics in most cases (fig. c) . some evidence is presented that the scs used to form the polyelectrolyte microcapsules, specifically adsorbs il- (table ) -a cytokine which provides an essential signal for t-cell proliferation and differentiation [ ] . therefore, we postulate that the scs used for encapsulation has biomimetic properties, creating an artificial extracellular matrix mimicking heparin sulfate which in turn positively affect t cell proliferation via trans-presentation of il- (fig. d) [ ] . primary t lymphocytes can be expanded under appropriate conditions outside the body. in the latter, t cells grow/expand in specific environments where the cells are tightly packed, leading to multiple cell-cell contacts and manifold interactions with the extracellular matrix. ex vivo suspension cultures of diluted cells cannot provide such a microenvironment. in the microcapsulesbased cultivation system presented, the cells are suspended in a viscous scs-solution. the low molecular weight cut off of the surrounding polyelectrolyte membrane assures that typical signaling molecules produced by the cells are retained thus facilitates the "conditioning" of the cellular microenvironment, while nutrients and metabolites can pass. expensive additives, such as interleukin- (il- ), can be co-encapsulated. expansion then no longer requires specialized t-cell media. moreover, the scs seems to have biomimetic properties, representing an artificial extracellular matrix mimicking heparin sulfate. we consider that the described method may be an appropriate alternative to expand t cells while creating a local microenvironment mimicking in vivo conditions. - ) . equations of balances and kinetics of an employed process model including x v viable cell density, x t total cell density, μ cell-specific growth rate, μ d cell-specific death rate, t time, k s and k monod kinetic constant and monod constant for uptake, k lys cell lysis constant, q cell-specific uptake rate or production rate, respectively, y kinetic production constant, c concentration, glc glucose, gln glutamine, lac lactate, amm ammonia, f feed rate, v volume balances with fed-batch terms kinetics ;uptake if c glc ≥ . mm : q lac,uptake = if c glc < . mm : q lac,uptake = q lac,uptake,max q amm = y amm/gln • q gln background digital manufacturing (dm) is heightening the productivity and robustness of existing processes and facilities. it also enables the efficient development of previously unmanageable products or processes and provided the basis for a wave of innovations. dm is a resident and on-line source of continuous optimization of process performance. it relies upon the comprehensive, real-time interfacing of both human and machine sourced information through one centralized system. more than legacy distributed control system (dcs) and supervisory control and data acquisition (scada), it is an integral interconnection of real-time access to divergent sources of information. as such, it can promise deep analysis and predictions leading to shortened product cycle and advanced process control. this comprehensive analysis is extending beyond operations performance data from the production floor to data driving such activities as raw materials security of supply (sos) and business continuity management systems (bcms). digital biomanufacturing (db) can be viewed as yet another, larger, embodiment of digital biotechnology. db is similar to digital manufacturing in that it promotes innovations in the manufacturing of biologicals by using such things as computer aided design, manufacture, verification and deep process analysis using software sensors (fig. ) . however, the fact that there are living components (cells) involved in the processes puts a distinctly different flavor to the systems employed. it is desirable to use a distinct term here to distinguish it because, as in the terms bioproduction and biopharmacology, db addresses many unique aspects of biologically-based activities. the reasons why the biotech and biopharma industry lags behind other sectors such as the automotive regarding the transformation to digital manufacturing are (i) the complexity and dissipative nature of biological systems, (ii) distributed heterogeneous data and (iii) limited at-line or on-line data sources. however, the costs of genomic sequencing, omics data generation, and computing resources are decreasing rapidly, and at the same time process analytical technologies, computational power and predictive modeling as well as data management infrastructures are greatly improving (table ) . by removing roadblocks that used to limit approaches, these changes have paved the way to transforming the bioeconomy into an industry that is based on digital knowledge. such new and optimized manufacturing technologies as continuous biomanufacturing and d bioprinting can actually demand the interfacing of many sources of information, deep data prior to elisa, the various proteins were incubated at °c in scs prepared as for encapsulation. as control, the scs was replaced by pbs. shown are mean values ± sd, n = analysis including software sensors for metabolic fluxes, and model-based predictions of digital biomanufacturing. the application of predictive models for bioprocess optimization greatly improves established platforms and finally leads to a massively increased mechanistic process understanding. four essential benefits result from the increased bioprocess understanding, development, and control of db. first, personnel are relieved of many manual and repetitive tasks. second, strategic planning and operational efficiency are improved. third, we see real-time optimization of end-to-end manufacturing based on such high-value criteria as projected product quality and profitability. fourth, it enables previously unmanageable operations and creates innovative solutions. monitoring between-batch behavior of real-time adjusted cellculture parameters xavier lories, jean-françois michiels arlenda, mont-saint-guibert, , belgium correspondence: xavier lories (xavier.lories@arlenda.com) bmc proceedings , (suppl ):p- background cell-culture parameters (ccp), such as ph, may be continuously measured online and subject to real-time automated adjustment (e.g. automated addition of a base to prevent the ph to drop too low). this is an efficient method to maintain the parameter within specified limits. this type of control constraint the variability within the predefined limits and does not provide any information on the between-batch variability of the process. online measurements of ccp provide time-dependent curves presenting one or more transitions. different types of transition can be observed: -the process can shift from a state in which adjustment is needed to keep the ccp in range to a state in which it is not. typically, the ccp drifts away from a limit. -the process shifts from a state in which adjustment is not needed to one in which it is. for instance, a drifting ccp reaches the lower or upper limit of the accepted range. the timepoints at which those transitions take place are here called changepoints. those are aspects of the process and, as such, should be controlled. in the multiple changepoints cases, the approach allows the early termination of runs showing very early or very late first changepoint. the identification of the changepoints position is based on simple rules rather than complex statistical modeling to keep the identification methodology simple. once the changepoint are identified, a multivariate bayesian model is adjusted on the appropriately transformed data. prediction regions are obtained and used as control limits [ ] . results obtained for a -changepoint case are shown on fig. . points on the right-hand graph represent new batches. the red triangle represents a failed batch. it appears that the control strategy fails to identify the failed batch. two reasons can be considered: -the limits of the prediction region have been established based on points, such a small sample size is likely to be insufficient for the definition of such a control chart. -the tested batches were produced out of set point. a control chart should be used on a stable process, ran in the same conditions, in order to be really relevant. this work was based on available historical data, which is never an ideal situation. the suggested strategy offers a simple approach to the monitoring of between-batch behavior for cell-culture. once the limits have been defined, the approach is quite straightforward and usable by nonstatistician. however, such strategy, as any other of this type, must be based on a sufficient number of batches for the definition of the control limits in order to have a good estimation of the batch-to batch variability. fig. (abstract p- ) . intelligent software applications support digital biomanufacturing process development and control. • databases using data collected online, at-line, and offline from bioprocesses operating worldwide. • process data are used to generate metabolic network models that represent a specific host cell line in a bioprocess. • modelbased computational simulations improve process understanding and reduce experimental efforts for media design, clone selection, and metabolic engineering. • automated data import and processing allow for a streamlined and standardized metabolic process analysis. • identification of critical metabolic parameters is used for proactive steering and control of production processes background rabies is a zoonotic viral disease with a mortality close to % [ ] . as there is not an efficacious treatment available, post-exposure vaccination is recommended for individuals in contact with the virus. on the other hand, the most common source of virus transmission is saliva of infected animals, mostly dogs, whereby mass vaccination of pets is the most cost-effective way to reduce human infections. in this context, availability of both human and veterinary vaccines is critical [ , ] . our group had previously developed an effective vlp-based rabies vaccine candidate produced in high density hek cell cultures with serum free medium (sfm) [ , ] . one of the aims in vaccine production process is the achievement of a good productivity with a low cost per dose, mainly in the case of vaccines for animal use in which case the sfm is one of the principal expenses. in this work, we show the adaptation of the producer clone to a non-expensive in-house developed culture medium, in order to reduce the global cost of the process and therefore the price per dose. experimental approach first, we compared a direct and a sequential adaptation protocol of our hek rv-vlps producer clone, from % of the commercial sfm (ex-cell , safc) to a new formulation with only % of the sfm and a minimum essential medium (p g), developed in our laboratory specifically for rv-vlps production. this new formulation was called rvpm (rabies vaccine production medium). the specific productivity of rv-vlps in culture supernatants was measured by sandwich elisa, using the th international standard for rabies vaccine that quantify the glycoprotein content (nibsc, expressed in elisa units per ml). further, we evaluated both media for the production of the rabies vaccine, using stirred tank bioreactors operated in continuous mode (biostat qplus, sartorius). the production of the rv-vlps was daily evaluated by elisa and the obtained harvests analysed by the nih potency test for rabies vaccine. after the adaptation process, suspension cultures without aggregates or clumps were obtained, with the same specific growth rate. a lower maximum cell density with the rvpm was reached, achieving x cells.ml - , compared with the sfm that reach cell densities between and x cells.ml - in batch mode. the specific rv-vlps productivity per cell was maintained, obtaining values of . and . eu. cells - .day - for the clone being cultured in sfm and rvpm, respectively. taking into account that this producer clone can be changed directly from one medium to the other without lag phase or cell damage, and that in rvpm the maximum cell density reached was lower, this medium was proposed to be analysed in high cell density in perfusion mode for a continuous culture in bioreactor. therefore, we performed two cultures in parallel to compare the efficacy of each media formulation in perfusion. as shown in fig. , we obtained very similar culture performances in both bioreactors; . eu.ml - and . eu.ml - of rv-vlps for the commercial sfm and rvpm, respectively. after that, the harvests were evaluated by the nih potency test obtaining a rabies vaccine potency of . iu.ml - for both cultures (being iu.ml - the minimum potency required for animal vaccine). thus, the results obtained represent an interesting advance in the optimization of this vaccine production process since the use of this new medium formulation represents a reduction of % of the total cost which will be reflected in a considerable reduction of the price of the vaccine dose. background vaccines are one of the most powerful and effective health inventions ever developedproviding tremendous economic and societal value; yet several factors hinder comprehensive immunization coverage. traditional methods of biologics production, based on stainless steel bioreactors, allow pharmaceutical companies to achieve economy of scale, but are limited by high capital expenditures. such approaches stifle manufacturing innovation and lack long-term cost-effectiveness and sustainability. current innovations can cut biologics' production costs to revolutionize the mainstream use of biologic treatments, focusing on developing fast, potent and cost-effective vaccine production. univercells' mission to make biologics affordable to all initiated a paradigm shift, targeting an innovative single-use manufacturing platform incorporating bioprocess into continuous operations. univercells employs process intensification, using high volumetric productivity bioreactors; and unit steps integration, coupling usp and dsp into continuous operations. the objective is a down-scaled high-productivity process for a cost-effective manufacturing solution. the resulting micro-facilities are easily-deployable in developing countries, breaking entry barriers to biomanufacturing (fig. ) . manufacturing and distribution advancements, from centralized to distributed, foresee affordable treatments' obtainability via supplying local populations with local production units. -bench-scale fixed-bed bioreactor; -carriers made of % pure non-woven hydrophilized pet fibers; -vero cells grown in serum-free and serum containing media; -attenuated polio strains; -cell nuclei on carriers counted by the crystal violet method; -polio virus production estimated by elisa assay (d-antigen content). cultivation of vero cells in medium with serum and in serum-free medium, was carried out in bench-scale compact fixed-bed bioreactors, to determine which culture conditions result in the highest growth rate, the highest cell biomass by carriers and virus production. cells were inoculated at . x cells/cm and infected during the mid-exponential phase, following a complete media exchange. viral infection took place in serum-free media. in-line clarification and purification is targeted to be performed in only a few steps (maximum one of two) without intermediary diafiltration. in such configuration, we measured that vero cells can reach a cell density of - x cells/cm with pdl/day of . - . in serum-containing media. this new facility is expected to manufacture any type of viral vaccine at a very low cost and could be deployed at the site of the manufacturer in emerging countries, killing the two birds of cost of manufacturing and distribution with one stone. the presentation will feature the description of the engineering development, but also the preliminary results of cell growth, infections, and product quality, as well as a description of the cogs calculation. univercells developed a disruptive polio vaccine manufacturing technology exceeding expectations when compared to traditional methodsachieving a superior result via its all-in-one solution of a simple, scalable, and fully-disposable vaccine production platform resulting in long-term cost-effectiveness, flexibility and sustainability: -all upstream, downstream and inactivation steps take place within a closed system with all the equipment contained in a low footprint isolatorcreating a confined area for polio virus handling that facilitates the deployment of micro-facilities. -this leads to a dramatic reduction in capital investment, time required for development and increases production capacity. -in conclusion, this is a simple and elegant solution for the industrial production of human vaccines at a low cost in micro-facilities, making polio vaccines available to all. comparison of media formulations for the vaccine production in l stirred tank bioreactors operated in continuous mode. both cultures were performed in parallel using the corresponding medium for the perfusion. a feeding was performed with the commercial sfm. b the first two days of perfusion feeding was performed with sfm until the cell density reach cells.ml - and, after that, the bioreactor was fed using the rvpm formulation. (↓) on day number , % of the reactor volume was punctually bled maintaining the working volume background vectored vaccines based on modified vaccinia virus ankara (mva) are reported to stably maintain large transgenes, and to be safe, immunogenic and tolerant to pre-existing immunity. mva is usually produced on primary chicken embryo fibroblasts but continuous cell lines are being investigated as more versatile substrates. we have previously reported development of a continuous suspension cell line (cr.pix) derived from the muscovy duck and efficient production process for mva in chemically defined media [ , ] . this process allowed isolation of an hitherto undescribed genotype (mva-cr ) that induced fewer syncytia in adherent cultures and replicated to higher infectious titers in the extracellular volume of suspension cultures [ ] . replication of mva-cr remained restricted predominantely to avian cells, an important property of mva vectors. homologous recombination in cr.pix cells was used to generate viruses with various expression cassettes in deletion site iii [ ] and combinations of the differentiating point mutations of mva-cr in a backbone of wildtype virus. all recombinant viruses were plaquepurified. successful introduction of the mutations was confirmed by sequencing and specifically designed restriction fragment length polymorphisms (rflps). viruses were analyzed by serial passaging, diagnostic pcrs accross deletion sites [ ] , replication kinetics, plaque phenotype and electron microscopy. the genome was further investigated by anchored pcr and long pcr. efficiency of spread of recombinant viruses (fig. a) could be mapped to a point mutation in one of the genes, a r. however, although mva-cr carries mutations in three structural proteins we detected no obvious differences to wildtype by electron microscopy (fig. b) . the replacement of the left viral telomere by the right counterpart was the most surprising result of our new study (fig. c) . this extensive rearrangement affects % of the viral genome and has also increased the area of complementarity between the two telomeres. the recombination site was precisely located and shown via analysis of earlier and subsequent passages to be a stable property of mva-cr . various viruses, including those with larger dual (dsred and gfp) expression cassettes, were serially passaged at least -fold. although the genotype of mva-cr is advantageous for replication, all genomic and genetic markers of wildtype and mva-cr were stably maintained in all passages of the recombinant viruses, independent of wildtype or mva-cr backbone. we confirmed our previous results that suggested that mva-cr replicates efficiently in single-cell suspensions and were able to connect this property with the d y mutation in a , a structural protein on the surface of the virions. mva-cr was also found to differ from wildtype mva by a recombination between left and right viral telomere. due to this event, several genes encoded at the left terminus have been deleted whereas the gene dosis of those originally encoded only at the right terminus may have increased. we do not currently know how much the various point mutations and changes in genomic structure combine to explain the improved replication of mva-cr . as several of the affected genes have been reported in the literature to impact interaction of mva with the host we would expect that in vivo studies may reveal additional novel properties of mva-cr . an extremely important distinction between our earlier study [ ] and this one concerns the source of the viruses. here, we investigated plaque-purified viruses and confirm the high genetic and genomic stability of mva. different expression cassettes inserted into deletion site iii, all diagnostic rflps and pcrs over various sites of the genome and within the viral telomeres remained unchanged throughout at least serial passages -independent of whether recombinant viruses with wildtype or cr -derived backbones were characterized. fig. (abstract p- ) . a one hallmark of mva-cr is a significantly reduced tendency to induce syncytia and an increased dispersion of plaques in cr.pix cell monolayers. this property appears to be supported by the mutation in a r. b electron microscopy reveals no obvious differences between novel genotype and wildtype. background transient gene expression systems using polyethylenimine (pei) are considered to be fast, flexible and cost-efficient for recombinant protein production [ ] . transfection efficiency depends on different factors; one of them is the type of media. production media support cell growth and protein production but not high transfection efficiency (te) mediated by pei [ ] . therefore, media were selected for transfection followed by feeding of production media [ ] to improve te and protein production. two different transfection strategies are compared: conventional transfection by preparing polyplex of a plasmid (pdna) and pei interaction before transfection and insitu transfection by direct addition both of them to the cell suspension and the polyplex formed spontaneously [ ] . cells were seeded hr in chomacs cd media before transfection. at transfection time point an equal amount of cells were resuspended in each media type. transfection was applied either insitu or conventional (polyplex prepared in μl of mm nacl and incubated for min.), media addition was performed hours post-transfection (hpt). media type and transfection condition were illustrated in table . media screen result exhibits the highest transfection efficiency of around % transfected cells by opti-mem medium coming along with low cell growth and viability. to improve the transfection efficiency, basic parameters including cell density, pdna, and pei concentrations were varied and higher transfection efficiency was reached by reducing media or accordingly increasing cell density, pei and pdna concentration for transfection. further optimization results show that the transfection of cho-k cells in opti-mem (transfection medium) for hours followed by addition of cho-macs cd (production medium) for further enhancing the transfection, cell count, and cell viability. the transfection efficiency (te) increased up to ± . % coincide with increases in viable cell concentration (vcc) in comparsion to transfection and cultivation in opti-mem media alone fig. a . both conventional and insitu methods are successfully transfected cho-k to the same similar high te as shown in fluorescence microscope images of fig. b . insitu transfection shows super-priority for suspension cell transfection concerning the reduction of handling steps (one step) compared to the conventional way (two steps). the insitu transfection avoiding the optimization step required for the incubation period to prepare transfection polyplex but require a higher amount of pdna and pei than conventional way as shown in table . in order to deal with the growing demand of large quantities of therapeutic proteins in a timely fashion, expression systems are being optimized to reduce the time of generation of stable clones as well as to increase the levels of protein secretion. this can be achieved by a combination of expression cassette optimization, cell engineering and selection process. we have previously developed the cumate gene-switch, which is a very efficient expression system for protein production [ ] . we have shown that the cumate-inducible promoter (cr ) was the strongest promoter we had tested so far in chinese hamster ovary (cho) cells. with this promoter, we were able to generate stable cho pools capable of producing high levels of a fc fusion protein ( mg/l), outperforming by to fold those generated with cmv and hybrid ef α-htlv constitutive promoters. besides the strength of the cr promoter, we demonstrated that the ability to control both the time and the level of expression during pool generation and maintenance gave a real advantage to the inducible expression system. indeed, we observed that keeping the expression off during selection enabled the generation of pools with superior productivity compared with the pools whose expression was maintained on. moreover, preliminary results suggest that keeping recombinant protein expression down increases the frequency of high producer clones [ ] . knowing that one of the main bottlenecks of the successful bioprocessing of recombinant proteins using cho cells is the rapid isolation of a high producer, our data suggest that the cumate gene-switch system could be a valuable platform for the generation of stable clones. the main regulatory authorities and organizations demand proof of monoclonality for biotechnological producer cells. with increasing pressure to shorten timelines and to improve drug safety, technologically advanced methods have to be established to ensure that production cell lines are derived from a single progenitor cell. sartorius stedim cellca's single cell cloning approach is based on one round of fluorescence-activated cell sorting (facs) using becton dickinson (bd) facsariatm fusion cell sorter combined with photodocumentation by synentec cellavista microscopic imaging system. for the approach, critical process parameters such as different cell lines, viability and cell aggregation levels were investigated separately to assess their contribution to the probability of monoclonality. immediately after single cell cloning into -well plates ( cell/ well) the plates were centrifuged followed by imaging using the cellavista (day ). further cellavista images are taken on day , day and on one day between day and . outgrowth was defined at day . cell lines expressing different recombinant products were investigated to calculate probability of having ≥ cells/well after facs sorting p(d), the apparent probability p(i) of having ghostcells (cells that are out-of-focus and, thus, are not visible during initial microscopic imaging), and the apparent probability p(k) of having ghostcells that outgrow the -well stage (fig. ). using these results, the probability of obtaining a monoclonal cell by using sartorius stedim cellca's single cell cloning approach was determined (table ) by conservative examination: p(monoclonal, conservative) = -(p(d) x p(i)) realistic examination: p(monoclonal, realistic) = -(p(d) x p(k)) cell pools with low viability can theoretically impact the probability of monoclonality by e.g. diminishing microscopic imaging quality (cell debris). therefore, pool cell line with very low viability (≥ %) was used to demonstrate, that the probability of monoclonality is still . % in case of low viability on day of sorting: p(monoclonal, conservative) = p(d) x p(i) = . % p(monoclonal, realistic) = p(d) x p(k) = . % furthermore, cell pools with high aggregation levels can theoretically impact the probability of monoclonality by sticking together during facs sorting and therefore increase the probability p(d) of having ≥ cells/droplet. therefore, pool cell line with high aggregation levels (≥ . %) was used to demonstrate, that the probability of monoclonality is still ≥ . % in case of highly aggregated cell pools on day of sorting: p(monoclonal, conservative) = p(d) x p(i) = > . % p(monoclonal, realistic) = p(d) x p(k) = > . % conclusions in summary, there is no obvious correlation between protein product type and the determined probabilities for monoclonality. furthermore, pools with a viability as low as % and pools with an aggregation level as high as . % can be used for scc resulting in acceptable probabilities of monoclonality. background ich guidance [ ] requires that any cell line used to produce biopharmaceuticals originates from a single progenitor cell. recently, there has been increased scrutiny of the method(s) used to achieve this requirement. here, we review the suitability of the legacy capillary aided cell cloning (cacc) method in light of this changing landscape of expectations. the cacc method is based on the 'spotting' technique [ ] and relies on independent visual conformation by two scientists of the presence of a single cell in a μl droplet. this method achieves a high probability of monoclonality in one cloning round. although the method has since been replaced by facs single cell deposition for routine use, it remains a viable cloning method. -performed by trained scientists -dilute culture to ± cells/ml with ≤ % doublets -draw cell suspension into pipette tip by capillary action; tap tip against the centre of the base of each well of a well plate. -size of resulting droplet =~ μl (fig. a ) -two scientists independently view all wells using a microscope (initially use x magnification with the entire rim of the droplet visible within the field-of-view. next, examine particles using x or x magnification to confirm they are cells) and individually record the number of cells present in each well's droplet (fig. b to d) . -exclude droplet from further analysis if full visualisation is hindered (fig. e to h) . -add growth medium, and incubate plates. record all wells containing colonies; only progress colonies from wells that both scientists agree contains only one cell. -data analysis: -each scientist's observations categorised as: cells, cell or > cell -observed outcome for each well: growth or no growth -probability of monoclonality estimated from data using a statistical model cloning (ldc) increased accuracy of p(monoclonality) with cacc -ldc weakness: no visualisation after seeding (to check both well seeding and subsequent growth of colonies is well described by the poisson distribution), potentially overestimating p(monoclonality) -addressed by cacc: visual examination with colonies arising from wells seeded with cell distinguished from those seeded with > cell -visualisation step further strengthened by: using controls for exclusion of wells; measuring errors based on the presence or absence of colonies in wells where two scientists independently reported cells; and formally analysing the data using a suitable statistical model decreased time and resource requirements with cacc -high p(monoclonality) possible in single round as each well examined individually with only those containing a single cell progressed, and because the error rate for incorrect scoring is considered to be low two scientists miss a cell one cell sitting on top of another and the two thus appearing as one an experiment was performed to estimate error frequency [ ] . conclusion -scientists miss a cell infrequently (in the range . % to . %, [ ] ) -error frequency does not invalidate use of direct observation methods for cell cloning -single cell seen by both scientists is highly likely to be monoclonal -during method development, strategies established to control potential sources of error ( table ) use of a contemporaneous visualisation approach, a strict control strategy, and a suitable statistical model (which takes into account potential errors) results in: -the cacc method being at least as robust as the ldc method -the cacc method being a reliable, single-step method for cloning to achieve a high p(monoclonality) background vector design is a key step in cell line development for the expression of therapeutic biologics. it is essential that the vector design results in high, stable expression of the encoded protein. other considerations include ease of cloning, stability for propagation in e. coli as well as in the mammalian host cell line, and ease of sequence amplification for verification of vector construction and for detection of insertion site and copy number in stably expressing cells. for these reasons, use of the same promoters and polya tails in dual cassette vectors, as is common for expression of the heavy and light chains of monoclonal antibodies, can be problematic. in order to minimize sequence similarities between the two expression cassettes, we have modified the promoters, introns, and polya tails of the light chain and heavy chain expression cassettes in the dual expression vector commonly used for the expression of therapeutic antibodies in the chozn® gs -/cell line development platform. gene synthesis and vector construction of igg and fluorophore-expressing vectors was done by atum. vectors were transfected into chozn® gs -/cells via electroporation. analysis of gfp and rfp expression was achieved using a macsquant instrument. selection and generation of stable pools and single cell clones from transfections with igg -encoding vectors was performed as described in the chozn® platform technical bulletin. titer analysis was performed in static ( well plate), in a day tpp assay and in a day fed batch assay using a qk fortebio. initial screening experiments identified a lead vector, # , and a vector, # , which produced very low titers and relatively few minipools expressing detectable levels of igg . analysis of gfp and rfp expression from the modified vectors indicated relatively high expression from the rfp/hc expression cassette of vector # . a stronger promoter resulting in overabundance of hc, known to be toxic to cells, provides a possible explanation for the poor results with this vector. interestingly, swapping the positions of the lc and hc in # resulted in a vector, # , that outperformed the initially identified lead vector (fig. ). this same change was made to vector # without any resulting improvement in titers (vector # , fig. ). interestingly, vector # had a smaller difference in relative promoter strengths, based on mean channel fluorescence ratio of gfp to rfp, suggesting that overabundance of hc was not an impediment to igg expression from # . poor titers were also seen with a modified version of vector # (vector # , fig. ) in which the glutamine synthethase selection cassette was in the reverse orientation. this second screen identified vector # as the lead vector design (fig. ) . a full comparative study of vector # and the control vector was performed, cumulating in the generation and comparison of single-cell clones from each. these studies have demonstrated the equivalence of these vectors in terms of igg titer. this work has resulted in the identification and characterization of a dual expression vector with minimized similarity between the two expression cassettes, easing the cloning, propagation and analysis of vector integration in stable cell lines while maintaining the high, stable expression of the encoded protein of the original vector design. background traditional cell line engineering strategies mainly include an antibiotics resistance selection. in this process, cells are transfected with the goi (gene of interest) together with an antibiotics resistance gene and those cells are selected that survive treatment with the respective antibiotic [ ] . although the gene responsible for the survival of the cell is transfected together with the goi, resistance is not necessarily linked to high goi expression. thus, a significant proportion of resistant cells may not express the goi at all, necessitating the search for alternative, more closely linked selection systems. sirnas (silencing inducing rnas) are short, noncoding rnas that can bind to complementary mrna and inhibit their translation. this function has been used in many approaches to silence the expression of certain genes [ ] . with their short length, sirnas can be hidden in introns (non-translating regions) of genes, making it possible to couple the expression of a sirna to a gene. this way a cell produces a correlating amount of sirna when transcribing the gene, without adding any further translational burden on the cell. the co-expression of the sirna can be used as a selective marker by one of the following methods: ( ) knock-down of a suicide gene to enable a cell's survival after suicide gene mrna transfection, ( ) down-regulation of a surface marker which is used in macs (magnetic cell separation) to filter out wanted or unwanted cells, and ( ) inhibition of a fluorophore marker for selection using facs without product specific antibodies. for sirna based cell selection systems, sirnas replace the commonly used antibiotics resistance as a marker. cells that produce goi will also produce the sirna that protects the cell from a suicide gene. the selection protein (suicide genes, fluorophores, surface markers, etc.) is transfected as mrna and is only expressed during selection. the general process is outlined in fig. . (a) the traditional antibiotics resistance marker is replaced by an sirna, which is cotranscribed with the goi. unlike in antibiotic resistance, the marker here is not a protein, reducing the translational burden and providing more resources for goi production [ ] . transfection with the suicide gene proved to be % lethal within days, with no outgrowth over two weeks. protection by expression of the sirna was shown to be efficient. currently a comparison of stable cell line development programs based on sirna selection and neomycin selection is ongoing. conclusions the novel selection system should speed up cell line development, as the system kills rapidly and directly selects for cells transcribing the product gene on a high level. we expect to see more high producers earlier in the process, which will allow for an easier and faster selection in the following steps. sirna based selection offers great opportunities. by directly selecting based on goi transcription and not a proxy marker, we expect more relevant cells on a pool level. in addition, the elimination of an antibiotics resistance allows more cellular resources for goi production. the system offers multiple ways of application, either by enriching wanted, or depleting unwanted cells. background single-cell cloning is an essential step used in the upstream development of transformed cell lines for therapeutic protein production. while single-cell clones are typically used to ensure product consistency, such low cell density cultures present a survival challenge; cells grow more slowly or may even not survive at low densities in protein-free media, costing the industry time and money and limiting the pool of candidate colonies for choice of production clones [ , ] . to address this problem, we aimed to develop a highly efficient serum-free medium suitable for optimising single-cell cloning efficiency by studying a range of conditioned media (cm) samples isolated from different chinese hamster ovary (cho) cell lines. materials and methods cho-s, dg and cho-k were adapted to cho-s sfm-ii (gibco) medium for a minimum of three passages. conditioned media was then collected when the cultures reached a cell density of x cells/ml (typically day -day depending on the growth profiles of each cell line and whether they grew in suspension or attached conditions). samples were then centrifuged twice to remove cell pellet/debris and stored at - °c. the ability of conditioned media to support cho colony formation was then assayed using -well plates, seeding the cells at low cell density ( - cells/well) by diluting down cho cultures in media/conditioned media. after incubation at °c for days, cloning efficiency was assayed using a standard xtt assay. initial screening of the nine cm samples was performed using cho-k cells due to their widespread use in industrial antibody production. successful media candidates were subsequently screened using additional cho cell lines. table ) . the k -sfmii-cm product improved cell cloning efficiency for dg cells (avg. increase> . -fold) and cho-s cells (avg. increase> -fold) ( fig. ) and also the adherent cho-k cell line growing in atcc + %fbs. the ability of conditioned media to support cho growth in limiting-dilution conditions ( , and cells/ml) was investigated. from a range of nine conditioned media samples; four compelling products have been identified which improve low-cell density growth of cho-k cells, compared to sfm-ii control media. we feel that these early-stage conditioned media products may increase cloning efficiencies during upstream cho cell line development, resulting in financial savings for industry and increasing the possibilities of identifying particularly highperforming transformed clones. ( ): - . the main rate-limiting step in the upstream stages of protein biomanufacture is the isolation of stable, high producing cell clones. ubiquitous chromatin opening elements (ucoe®s) consist of at least one promoter region with associated methylation-free cpg island from housekeeping genes; they possess a dominant chromatin opening capability and thus confer stable transgene expression. ucoe®-viral promoter (e.g. cmv) based plasmid vectors markedly reduce the time it takes to isolate high, stably producing cell clones. although some ucoe®-viral promoter combinations have been tested, they have not been thoroughly evaluated in chinese hamster ovary (cho) cells. plasmid vectors containing combinations of either the human hnrpa b -cbx ucoe® (a ucoe®) or murine rps ucoe® linked to different viral promoters (hcmv, gpcmv, sffv) driving expression of an egfp reporter gene were functionally analysed by stable transfection into cho-k cells and expression analysed by flow cytometry and qpcr to determine vector copy number. the results at days post-transfection and selection clearly indicate that the rps ucoe®-gpcmv and -hcmv combinations give the highest transgene expression as shown in fig. . the a ucoe®-hcmv/gpcmv constructs were the next efficacious but -fold lower than the rps ucoe® vectors. the sffv promoter linked with either of the two ucoe®s was the least effective with expression levels -fold lower than the rps -cmv constructs. the rps ucoe®-gpcmv/hcmv constructs are now being further modified to include elements that will provide optimal post-transcriptional pre-mrna processing (splicing, polyadenylation, transcription termination, mrna stability) thereby maximising stable cytoplasmic transgene mrna levels and protein production. in the last years, growing number of innovator biologics and biosimilars have formed a competitive environment, where speed and efficiency of generating robust and highly productive cell lines needs to be improved continuously. through various advances, especially in media development and process optimization, product titers as high as g/l were achieved in the pharmaceutical industry (kim et al., ) for standard products such as monoclonal antibodies. nevertheless, other proteins e.g. bispecific antibodies, fc-fusion proteins or fab-related products are difficult-to-express (dte) in chinese hamster ovary (cho) and may result in delays or even in termination of the cell line development process. we developed a new robust pool generation approach (cld . ) addressing both, easy-and difficult-to-express molecules, while reducing timelines down to months (cld standard = months), improving reliability of cell line development as well as clearly increasing obtained titers. in order to create stable cell lines, we transfected our cho dg host cells by electroporation. cells processed using the standard approach were cultivated in selective medium or medium containing additional . nm methotrexate (mtx) for three weeks. after an amplification step with nm mtx for three weeks, stable individual cell pools were expanded and clones were generated by facs-sorting. clones were analyzed for growth performance and product concentration in fed-batch studies. in our new cld . approach, we increased mtx concentrations ( . nm, nm and nm mtx) during the first selection phase of three weeks. afterwards we omitted the nm mtx amplification step. thereby, pool generation finished four weeks earlier than in the standard approach. to evaluate the stability of cell clones derived from mini pools (mps) generated according to the cld . approach, stability studies were performed for eight weeks, including stability fed batches at t= weeks and t= weeks. altogether three different proteins of interest with six cell clones each were tested. we adapted our cell line development process by increasing the initial selection during the first selection phase, thereby allowing the omission of the nm mtx amplification step. we observed that the capacity of amplifiability varied for different products. cell lines with a protein titer ranging from > g/l to . g/l (dte) in shake flask fed-batch showed to be more susceptible to increased initial mtx levels and were thus not amplifiable with nm mtx. in contrast, cell lines with high protein titer > . g/l were observed to adapt to nm mtx easily and were amplifiable. finale shake flask fed-batch data with cld . clones of highexpressing products showed comparable titers to clones from the standard approach. cld . clone titers for dte proteins revealed in average a . -fold increase compared to clones generated in the standard approach. titers of top producing clones were in a range of . g/l to to . g/l (fig. ) . furthermore, stability data of cld . cell clones from different dte products showed a stable specific productivity in a range of +/- % over eight weeks cultivation. fed-batch titer from t= weeks and t= weeks were in a normal range of +/- % of the standard nm projects. our results demonstrate that cld . is a robust and reliable process for standard products (mab) and dte proteins. with our new process, we were able to increase titer of difficult-to-express proteins up to %. by omitting the amplification step ( nm mtx) % of generated clones were stable over eight weeks cultivation time. additionally using the cld . approach, the time line from dna to rcb was reduced to months. background cho cells have become the most popular platform for production of therapeutic proteins [ ] . however, the generation of high-producer cells is a time-consuming and labor-intensive process that requires the screening of large amount of cells to get a clone of high titer and stability. since the expression titer and stability of clone is highly dependent on the site of integration, we demonstrated a new cell line development strategy by using ngs to identify the integration site and using crispr/cas to generate the target integrated high producing cell lines [ , ] . to identify the high expression sites in the cho cells, we employed ngs to analyze the integration sites of a high producing cell line (titer > g/l). the pair-end reads with one read mapped to the vector and the other read mapped to the cho reference genome are extracted to identify the integration sites. to test the expression activity of the integration sites, we employed crispr/cas to specifically integrate the antibody gene into cho genome for expression. our data showed integration sites are in the high producing cell line. among the integration site, is integration site was tested by crispr/ cas for target integration of antibody gene for expression. the is target integrated cell pool present higher expression titer than cell pool generated by target integration into other integration sites (fig. a) . the single cell clones derived from is -target integrated cell pool had low copy number of goi (fig. b) . after normalization with copy numbers, the single cell clones derived from is -target integrated cell pool showed high titer per copy ( ~ mg/l/copy) (fig. c) . this study demonstrated the generation of high-producing cell lines by crispr/cas mediated target integration. this approach will cost less time and labor than traditional method. the active integration site will serve as a platform like a cassette player for therapeutic antibody production. background cho, hek and sp / are the dominant host cells for biologics drug production. achieving high level of recombinant protein production by these cell lines still remains a challenge. in order to understand the potential roles of lipids in protein production, secretion, vesicular transport and energy metabolism, we coupled high-throughput transcriptomics and lipidomics technologies. quantitative lipidomics is an emerging 'omics technology which can help us understand the physiological limitations of each cell line. the two types of major lipid groups in cells are non-polar and polar lipids. polar lipids such as glycerophospholipids (pls) include phosphatidylethanolamine (pe), phosphatidylcholine (pc), phosphatidylinositol (pi), phosphatidylserine (ps), phosphatidylglycerol (pg), and phosphatidic acid (pa). in this study; we integrated two dimensional high performance thin layer chromatography ( d-hptlc) and mass spectrometry (ms) lipid analysis of sp / , cho, and hek cell lines to understand the major differences in the lipid content of these hosts. bligh-dyer method was used to extract the lipids and extracts were analyzed by hp-tlc and ms. the polar lipids were separated into different categories by -d hp-tlc using a chcl -meoh-h o ( : : . , v/v/v) solvent system in the first dimension and a chcl -meoh-acetic acid-h o ( : : : , v/v/v/v) solvent system in the second dimension. non-polar lipids were separated by -d hptlc using hexane-diethyl ether-acetic acid. , -dichlorofluorescein dye was used to visualize both polar and non-polar lipids. further detailed analysis was performed on a qqq mass spectrometer (thermo tsq vantage, san jose, ca) using negative-ion and positive-ion esi modes as well as negative-ion esi mode in the presence of lithium hydroxide. in this study, quantitative lipidomics was coupled with transcriptomics to further understand the physiological pathways of hek, cho-m and sp / cells. initial hp-tlc analysis indicated that major lipids in these industrial cell lines were pe and pc. other polar lipids such as pi, ps, pg, pa, and sm were lower compared to pc and pe in exponential and stationary phases of each cell line. figure represents d hp-tlc results of hek with the relative quantitation of polar lipids. in order to investigate the lipid subgroups, shotgun ms analysis was conducted for both exponential and stationary growth phases of the three cell lines. ms analysis indicated that lysophosphatidylethanolamine (lpe) and lyso-phosphatidylcholine (lpc) amounts were - fold and - fold higher in hek cells compared to sp / and cho cell lines. sphingomyelin (sm) was another lipid subgroup that was shown to have a major difference between sp / and other mammalian cell lines. sm was - fold lower in sp / cell line compared to cho and hek. to understand these metabolic differences, transcriptomics analysis using illumina highseq and gene expression omnibus was conducted on these mammalian cells. the kyoto encyclopedia of genes and genomes (kegg) database was used to map the transcriptomics data to the lipid synthetic pathways. transcriptomics data mapping to kegg pathways demonstrated that differences in lpe and lpc pathways correlate with the expression profiles of secretory phospholipase a (spla ), lysophospholipid acyltransferase (lpeat), lysophosphatidylcholine acyltransferase (lpcat), and lysophospholipase (lypla) [ ] . the hp-tlc and lc/ms findings demonstrated that high levels of lpe and lpc existed in the hek cell line and low levels of sm were observed in the sp / cell line. coupling lipidomics with transcriptomics provides us with an improved understanding of the physiological differences across sp / , cho, and hek cell lines that could be used to guide cell engineering efforts with the goal of increasing the recombinant protein expression capabilities of these three cell lines. biopharmaceuticals are a class of biological macromolecules that include antibodies and antibody derivatives, generally produced from cultured mammalian cell lines via secretion directly into the media. manufacturing at medimmune requires the generation of chinese hamster ovary (cho) clonal cell lines capable of producing the biopharmaceutical product at commercially relevant quantities with optimal product quality. the isolation of cell clones based on random single cell deposition via fluorescence activated cell sorting (facs) provides a heterogeneous panel of expressers. we hypothesize that the application of facs to provide an additional sorting step based on desirable cell attributes that correlate with productivity, product quality or cell growth attributes could lead to the isolation of higher producing cell lines with enhanced product quality attributes. a panel of cell lines expressing a model recombinant monoclonal antibody were characterised in terms of growth, productivity, intracellular recombinant protein and mrna amounts. assays were also developed to investigate cell attributes using the commercially available imagestream instrument, an imaging flow cytometer, which enables the investigation of cellular characteristics that correlate with cell productivity at the single cell level. characterisation revealed the cell lines exhibited a range of values for productivity, growth, and intracellular (ic) antibody mrna and protein expression, ideal for further imagestream characterisation. western blot and qrt-pcr analysis demonstrated that final titre correlated with both ic heavy chain (hc) protein and mrna amounts (pearson correlation coefficient (pcc) = . and pcc = . , respectively). to assess productivity at the single cell level, assays multiplexing ic hc protein and mrna with cell attributes were therefore developed. initial assay development focusing on hc mrna and protein amounts has revealed interesting results; four cell lines displayed two distinct populations, one producing the antibody and another nonexpressing population. the ratio of these populations varied amongst the cell lines. images obtained from the imagestream have shown the cellular localization and expression of hc and lc message and protein (fig. ) . for both message and protein, hc and lc colocalize in the cell. whether there is any relationship between ic hc protein and cell attributes at the single cell level was then also investigated, as well as correlations with cell culture parameters at the population level. at the population level, correlations were found between titre and ic hc protein and mrna (pcc = . and pcc = . , respectively) confirming the data obtained by western blot and qrt-pcr analysis. a panel of cell lines has been characterised at the population level and show a wide range of antibody expression profiles at both the mrna and protein level. in parallel, assays have been developed for the imagestream to measure hc and lc message and protein amounts at the single cell level. protein and message quantification with the imagestream are consistent with more traditional approaches, such as western blots and qrt-pcr, that operate at the population level. the developed assays are now being used to investigate single cell productivity attributes and for the isolation of more productive clones. background productivity and stability are key factors for the selection of cell line in protein drugs production. large amount of target gene integrated in cell genome could lead to the instability of production. therefore, cells with low copies of target gene integrated in high yield sites could be an ideal production cells for manufacturing. it has been known that the transposon system can control the integrated copy number of target gene and can generate high yield producing cells, it could be a great approach to generate stable high yield producing cell lines carrying low copies of target gene through transposon system. we intended to develop a platform to generate high yield producing cell lines carrying - copy of the integrated target gene using transposon system. two cho cell lines, cho-s cells and dxb cells, have been applied. cells were co-transfected with transposon and target gene expression plasmids. after drug selection, the cell pool with highest productivity per target gene copy was applied to single cell cloning. the productivity and copy number of cell clones were determined, and the stability of cell clones was analysed after culture of about generations. in the stable pools of cho-s and dxb cells, the productivities per integrated target gene copy were about - mg/l/copy and - mg/l/copy in a batch culture, respectively. after single cell cloning, the integrated copy numbers in most cell clones were less than three copies per cell. in cho-s and dxb cell clones, the productivities per integrated target gene copy were - mg/l/copy and - mg/l/copy in a batch culture, respectively. the productivity per integrated target gene in cell clones developed by the transposon system was much higher than that in cell clones developed by random integration (fig. a and b) . to evaluate the productivity stability of cell clones developed by the transposon system, ten cell clones at generation , , , and were applied in the analysis. of interest, about % of cell clones were stable at generation , but lost the productivity at generation (fig. c) , implying the most cell clones could maintain the stability within months. using the optimized conditions of the transposon system to develop the stable gene expression cells, the productivity per integrated target gene was higher than random integration. these results suggested that our platform is capable to develop high yield producing cells with - copy of integrated target antibody gene and can be applied to identify high yield integration sites. background mammalian cells show an inefficient metabolism characterized by high glucose uptake and the production of high amounts of lactate, a widely known growth inhibition by-product [ ] . recently, we have observed a different glucose-lactate metabolism in some cell lines. while some cell lines are unable to metabolize lactate, others can co-metabolize simultaneously glucose and lactate under certain culture conditions, even during the exponential growth phase [ ] . these metabolic differences between different mammalian cell lines (cho, hek and hybridoma) have been studied by means of flux balance analysis (fba). three different cell lines were cultured in a -liter bioreactor: cho-s, hek sf and hybridoma kb- . . for the fba, two adapted genome-scale metabolic models were used: a reconstruction of mus musculus for cho and hybridoma [ ] , and a reconstruction of human metabolic model (recon ) for hek [ ] . in cultures where ph was not controlled, two different metabolic phases were observed for cho and hek cells. during the first phase both cell lines produced large amounts of lactate as a consequence of the high glucose consumption rates. interestingly, when ph dropped below . , due to acid lactic secretion and accumulation, a second metabolic phase was identified, in which concomitant consumption of glucose and lactate was observed even during the exponential growth phase. conversely, hybridoma cells were unable to co-consume lactate and glucose simultaneously even under noncontrolled ph conditions. therefore, the hybridoma physiological data used for the fba corresponded to only phase of phcontrolled cultures. a summary of the main cell growth and metabolic parameters obtained from the different experiments performed is presented in table . fba shows ( fig. for hek cell culture) that lactate is produced in phase because pyruvate has to be converted to lactate to fulfill the nadh regeneration in the cytoplasm and only a small amount of pyruvate can be transported into tca through acetyl-coa. cell metabolism in phase is highly inefficient, as the majority of the carbon source is not used for the generation of energy nor biomass. in phase , in which mitochondrial ldh was considered, tca fluxes could be maintained as in phase at the maximal rate encountered; hence, the energy available for cells to grow was similar in both phases, obtaining similar growth rate. two different glucose and lactate metabolism behaviors have been observed in cho and hek cultures depending on the culture conditions: phase ) glucose consumption and lactate production, and phase ) glucose and lactate simultaneous consumption. in contrast, only phase was observed in hybridoma cultures even when ph was non-controlled. fba showed that tca fluxes in phase and phase were similar, obtaining similar cell growth rate, but glucose uptake rate was much lower in phase due to the lactate co-consumption. some authors hypothesize that cells metabolize extracellular lactate as a strategy for ph detoxification [ ] . glucose and lactate co-metabolization resulted in a better-balanced cell metabolism, as can be seen from the metabolic fluxes calculated, with minor effects on cell growth. the observation of glucose and lactate co-consumption metabolic behavior and its deeper study and characterization could open the door of novel culturing strategies with the aim of increasing bioprocesses productivity. background transient protein expression in mammalian cell lines has gained increasing relevance as it enables fast and flexible production of high-quality eukaryotic protein. considerable efforts have thus been made to overcome existing limiting aspects of transient gene expression systems, in terms of cell lines, cell culture-based systems, and protein production in a cost-effective manner. milligram amounts of protein per liter can be produced within several days, allowing a significant shortening of the bioproduction process in comparison to protein production from stable clones. to ensure the robustness of the process, it is essential to have a reliable and easy-to-use transfection method. to palliate for the need of a reliable transfection reagent, we developed peipro®, the only commercially available pei optimised for mid to large-scale transient protein production during process development. peipro® is a non-polydiperse and fully-characterised polymer that has become the gold pei standard due to its reliability, reproducibility in high dna delivery efficiency and in ensuing high protein production yields. here, we present experimental data showing the benefits of using peipro® for protein production in comparison to other peis. we further demonstrate compatibility of using peipro® for recombinant protein production in most commonly used chemicallydefined media. materiel and methods suspension hek- and cho cells were cultured in shaker flasks in various synthetic media, as listed in table . hek- and cho cells were resuspended at × cells/ml of serum-free medium, on the day before transfection. cells were transfected with . - mg of plasmid dna encoding for the luciferase gene reporter using peipro®, pei "max" and l-pei kda (polysciences, warrington, pa) resuspended at mg/ml according to the manufacturer's recommendation. protein expression of the luciferase reporter gene was assayed hours post-transfection by affinity chromatography using protein g (hplc). comparison of peipro® to other commercially available peis was achieved by transfecting suspension hek- and cho cells with plasmid dna encoding for the luciferase gene reporter. luciferase production yields obtained in hek- and cho cells were at least respectively -fold and -fold higher when using a similar amount of peipro in comparison to the other peis (fig. ) . furthermore, peipro® was the only pei that led to similar luciferase production yields when decreasing the amount of plasmid dna per liter of cell culture. conversely, at least mg of plasmid dna and -fold more of pei "max" and l-pei kda were needed to obtain a similar luciferase expression range in both hek- and cho cells. we further assessed the compatibility and versatility of peipro® by measuring protein production yields obtained in most commonly used animal-free synthetic media. as shown in fig. , peipro® leads to high protein production yieds in several commercially avaialble media formulations for hek- anc cho cell lines. peipro® is the only fully characterised pei transfection reagent that is suitable for reliable and reproducible recombinant protein production, irrespective of the scale of production and of the type of adherent and suspension cell culture system. fig. (abstract p- ) . peipro® requires less reagent and similar to lower dna amount compared to other peis. suspension hek- and cho cells were seeded at × cells/ml in serum free medium and transfected with peipro®, pei "max" and l-pei kda (polysciences, warrington, pa) resuspended at mg/ml. luciferase expression was assayed h after transfection using a conventional luciferase assay fig. (abstract p- ) . peipro® is optimized for transfection of hek- and cho cells in several specific synthetic culture media. suspension hek- and cho cells were seeded following the recommended protocol in serum-free media and transfected with peipro® using the standard conditions. igg -fc production was assayed h after transfection using protein g affinity quantification (hplc) monoclonal antibodies (mabs), which are widely used in anticancer therapies, are mainly produced by mammalian cell lines. mab conjugation to biological molecules for enhancing their antitumor activity offers a new powerful tool for anticancer therapies. we have assessed the production of commercially approved anti-her therapeutic antibody trastuzumab (tzmb) [ ] and also its fusion with interferon-α b (ifnα b). two cloning strategies consisting in transfecting cho-s and hek cell lines with two bicistronic or with a single tricistronic plasmids have been assessed. the in vitro efficacy of both antibodies has been tested and compared side by side. tzmb heavy and light chains were cloned in two bicistronic plasmids (pirespuro and piresneo , clontech) and in a tricistronic plasmid derived from pirespuro . ifnα b was spliced to tzmb heavy chain by overlap extension pcr and the resulting tzmb-ifnα b fusion protein was also cloned in the expression vectors in the same way than non-modified tzmb. selected cell pools were cultured in ml shake flasks containing sfmtransfx supplemented with % v/v of cell boost (hyclone), mm of glutamax (gibco) and μg/ml of puromycin and also with μg/ml neomycin in the case of the cells transfected with pires-neo . cells were cultivated in the same conditions as described elsewhere [ ] . purified products (using protein a chromatography (hitrap mabselect sure, Äkta avant )) were quantified by both elisa and sds-page. antigen binding test was performed in sk-br- breast cancer cell line by means of flow cytometry analysis. the biological activity of the different candidates was tested with mtt assay. both tzmb and the fusion protein tzmb-ifnα b have been successfully expressed in cho-s and hek , which use for heterologous protein expression have previously been optimized in prior works [ ] . the tricistronic strategy resulted in the most efficient, showing a . fold increase in terms of productivity with respect to the bicistronic double-transfection for tzmb in cho-s cells and a -fold increase in hek cells (fig. a) . in the case of tzmb-ifnα b, the tricistronic strategy also allowed to achieve higher productivities than the bicistronic one (fig. b) . regarding the differences of specific productivity between both cell lines tested, hek emerged as the best production host candidate, for the two tested strategies (tricistronic and bicistronic) and for the two produced proteins, showing a . -fold increase in terms of productivity with respect to cho-s cells for tzmb using the tricistronic strategy. tzmb and tzmb-ifnα b were analysed in terms of their antigen binding capacity, and both were find to efficiently bind to her + skbr- cells (fig. c) . thus, the antibody affinity to her antigen has not been affected when fused to inf-α b. finally, antiproliferative activity of tzmb and tzmb-ifnα b were assessed on the same sk-br- cells. at a concentration of nm of tzmb, and after a -hour incubation, sk-br- cells presented a % growth with respect to the untreated control. however, no antiproliferative effect was observed for tzmb-ifnα b (fig. d) . the tricistronic strategy provides higher productivity yields in hek and cho-s cell lines for both recombinant proteins (trastuzumab and tzmb-ifnα b). regarding which cell line is the best production host candidate, hek achieved higher productivity than cho-s cells for the two proteins tested. all constructions performed preserved the binding affinity to its antigen, trastuzumab and tzmb-ifnα b bind efficiently to the her antigen present in skbr- cells. finally, tzmb-ifnα b does not present an improved antiproliferative effect with respect to trastuzumab when compared by means of an in vitro assay. the genetic engineering of patient-specific t cells with lentiviral vectors (lvv) expressing chimeric antigen receptors (car) for late phase clinical trials requires the large-scale manufacture of high-titer vector stocks. the state-of-the-art production of lvv is based on -to layer cell factories transiently transfected in the presence of serum. this manufacturing process is extremely limited by its labor intensity, open-system handling operations, its requirements for significant incubator space plus costs and patience risk due to presence of serum. to circumvent these limitations, this study aims to develop a stable and serum-free process to produce lvv with pei-mediated transfection. in addition, this study also focuses on the development of a a c b d fig. (abstract p- ) . expression of trastuzumab (a) and trastuzumab-ifnα b (b) from bicistronic strategy (bc) and tricistronic strategy (tri) with cho-s and hek cells. relative specific productivity units are used for comparing the different strategies. c antigen binding analysis of trastuzumab and trastuzumab-ifnα b. d antiproliferative activity of trastuzumab and trastuzumab-ifnα b on sk-br- cells production system not only using a gfp marker but also a therapeutically relevant transgene (cd -car) [ ] . therefore, three different cell lines (hek , t, ft) were investigated concerning their productivity of lvv and their growing behavior in the in-house serum-free medium transmacs. as part of this, design of experiment was used to investigate the optimal conditions for pei/ dna-transfection. furthermore, this statistical approach was used focusing an ideal ratio between the rd generation plasmids (transfer plasmid cd -car or gfp, envelope plasmid, packaging plasmids). in addition, different enhancers (sodium butyrate, lithium acetate, caffeine, trichostatin a, cholesterol, hydroxyurea, valproic acid) were investigated concerning their effects on productivity comparing hek cultures producing lvv encoding for gfp-marker or cd -car. concerning productivity and growing behavior, hek t was the favored cell line for our serum-free lv manufacturing process. in addition, an additive screen revealed that sodium butyrate alone had the most promising effect on both gfp-lvv and cd -car-lvv production. after pei/dna titration, we finally could increase lvv productivity by lowering pei/dna amount at higher cell densities referred to our standard transfection protocol. furthermore, the titration for the optimal plasmid ration revealed, that for large transfer constructs higher amounts of transfer plasmid are required than for smaller constructs to achieve a high productivity (fig. ) . the outcome of these experiments enabled the development of a robust hek t based process to produce clinical relevant lvv under serumfree conditions. furthermore, it provides an insight how therapeutic genes and the expression of its transgene can influence cell productivity. led to a vast increase in productivity, cho cells yield less than other expression systems like yeast or bacteria [ ] . to improve yields and find beneficial bioprocess phenotypes, genetic engineering plays an essential role in recent research. the mir- cluster with its genomic paralogues (mir- a and mir- b) was first identified as differentially expressed during temperature shift, suggesting its role in proliferation and productivity [ ] . the common approach to deplete mirnas is the use of a sponge decoy which, requires the introduction of reporter genes. as an alternative this work aims to knockdown mirna expression using the recently developed crispr/cas system which does not require a reporter transcript. this system consists of two main components: the single guide rna (sgrna) and an endonuclease (cas ) which induces double strand breaks (dsbs). these dsbs can result in insertion or deletion (indels) of base pairs which can disrupt mirna function and processing [ ] . a cho-k cell line stably expressing an igg was used for knockdown experiments. sgrnas were designed to target the seed region of each mirna member and stable mixed populations were generated (fig. a) . total rna form each mixed population was reverse transcribed into cdna using mirna specific stemloop primers. the expression was quantified by rt-qpcr. to further analyse the range of indels the mir- a and mir- b clusters were amplified by a standard high-fidelity pcr. amplicons were cloned into pcr tm -topo® vector and positive clones were analysed by sanger sequencing. cell growth was monitored using viacount tm viability stain on a guava tm benchtop flow cytometer. productivity was assessed by elisa. students t-test was used for statistical analysis. it was shown that mirna expression was significantly reduced in mixed populations. a knockdown up to % was achieved for mir- a, mir- b and mir- . the knockdown in mir- a and mir- b expression was considerably less -between - % (n= , * p ≤ . , ** p ≤ . , *** p ≤ . ) (fig. b) . furthermore, it was shown that various sizes of indels were generated by targeting the seed region. smaller indels (+ / + /- /- bps) seemed to be more common but larger deletions were detected as well (fig. c) . mir- a, mir- b and mir- b showed increased viability in late stages of the culture. depletion of mir- a reduced growth significantly whereas knockdown of mir- showed increased proliferation as well as boosting igg titers (table ) . in this work, we have shown that crispr/cas can be successfully applied as a tool to knockdown mirna expression in cho cells. the data was generated using mixed pools and it remains to be established if both alleles can be successfully targeted e.g. using nextgeneration sequencing of individual clones. background chinese hamster ovary (cho) cells are the most widely used host cell line for the production of therapeutic antibodies. pre-and posttranslational modifications and optimization of culture methods contributed to increase the productivity, resulting in a very high titre [ , ] . however, it has been pointed out that the intracellular secretion process is a bottleneck in the production of therapeutic antibodies [ ] . in addition, the details of the process of secretion of humanized recombinant antibodies from cho cells have not been well investigated. in this study, we thus analysed the detailed process of secretion of therapeutic antibodies using cho cell lines, which have already been established as high producers, with the aim of obtaining information for the more rational and efficient establishment of high-producer cells. we performed ) chase assay, ) immunofluorescent microscopy observation, and ) size exclusion chromatography (sec) analysis to investigate the duration of secretion, bottleneck position, and formation of recombinant igg, respectively. high-producer cho cells expressing humanized igg [ ] and igg were used. for the chase assay, cells were cultivated in shake flasks with serum-free medium containing μg/ml cycloheximide (chx) to stop nascent peptide synthesis. the amounts of igg both remaining in the cell and secreted into the medium at each time point were measured by quantitative western blotting. for immunofluorescent microscopy observation, cells were cultivated on coverslips with chx for h. immunofluorescent staining against the recombinant igg, endoplasmic reticulum (er), and golgi apparatus was performed after chemical fixation. for sec, cells cultured with chx were re-suspended in a buffer containing tritonx- and injected into a column. the amount of igg in each fraction was measured by quantitative western blotting. the amount of igg in the supernatant increased until - h after the inhibition of protein synthesis by chx; however, it hardly changed thereafter (fig. , upper panel) . at this point in time, however, around % of igg still remained in the cells (fig. , lower panel) , meaning that all of the synthesized igg could not be secreted into the medium and remained in the cells for several hours. this result was almost the same as that of studies using igg -expressing cells [ , ] . the localization of igg in the cells was checked before and after the addition of chx, with the results showing that igg remained in the er and was hardly seen in the golgi apparatus [ ] [ ] [ ] ; igg did not seem to be efficiently transported to the golgi apparatus. the sec experiment showed that most of the igg remaining in the cell seemed to form full-sized antibodies [ , ] , but it could not be secreted despite this. the high-producer cells could not secrete all of the synthesized igg, and around % of igg remained in the cells for several hours. this incomplete secretion is a common phenomenon among cho cells producing different types of recombinant igg. the igg could not be transported from the er to the golgi despite its formation of fullsized antibodies. solving this bottleneck in the transportation of igg from the er to the golgi and/or achieving more efficient glycosylation of igg after the formation of full-sized antibodies might be the next target to improve productivity. background humanized monoclonal antibodies (mabs) are among the most promising drugs, but defined strategies for their modification are still not available. our work deals with humanization of murine mab / h . the superhumanization approach leads to a loss of binding affinity which was partially restored by a single human-to-mouse backmutation (t hr). [ ] this residue was selected by synergistic combination of sequence analyses of antibody framework regions and structural information using novel in silico simulations. for structural stabilization, a conglomeration of tyrosine residues surrounding t hr was identified, the so called "tyrosine cage". [ ] analysis of the "tyrosine cage" was done by alanine scanning mutations with a double mutation variant t hr + y ha (bm ) and a triple mutation variant t hr + y ha + y ha (bm ). in a recent series of experiments we tried to enhance binding affinity by three new variants with backmutations in the variable light chain (vl). originating from t hr, residues in the vl were selected based on their spatial proximity to the cdr loop of the variable heavy chain. affinity improvement of t hr was evaluated by vl-double backmutation variants t hr + f ll (su ) and t hr + q ls (su ) and a triple backmutation variant t hr + f ll + q ls (su ). all five variants were expressed transiently in hek - e cells and binding affinities were investigated in two individual settings with bio-layer interferometry. in the first approach concentrated cell culture supernatants were directly applied and mabs were captured on protein a tips, blocked with d scfv-fc and the association and dissociation of f igg was measured. for the second approach, the culture supernatants were purified and the affinity was determined with streptavidin biosensors. first, biotinylated f igg was bound and then the association/dissociation of the purified h variants was measured. affinity evaluation of concentrated culture supernatants with protein a sensor tips showed a decrease of binding affinity of bm and a loss of binding of bm . the protein a measurement showed an increased binding strength of su , su and su compared to su h and bm . su and su result in a higher binding affinity compared to su . these results can be confirmed with purified variants by the streptavidin bio-assay (fig. ) . alanine scanning of the tyrosine cage demonstrated a reduction of binding affinity (bm ) and a severe loss of binding (bm ), concluding that the tyrosine cage plays an important role for supporting a correct cdr loop conformation. further affinity improvement of the single mutation variant t hr could have been reached via mutations in the vl. it demonstrates the underestimated role of the vl for the interaction with its binding partner. although cho cells are a major expression system for production of recombinant biopharmaceuticals, the molecular and cellular background characterizing a high producer is largely unknown. it has been observed that in producer cell lines important signaling pathways like the akt-signaling are altered in characteristical ways. thus analyzing according signaling events should lead to identification of key elements characterizing high producer cells. to investigate this, our emphasis lies on the phosphorylation status of involved proteins as reversible switches in all signaling pathways. we aimed to establish a workflow for cho-specific phosphoproteomics and focused on igf signaling, as cell culture media often are supplemented with this growth factor. two producer cell lines and the according parental cells were cultivated in a stable isotope labeling with amino acids in cell culture (silac) experiment, followed by quantitative ms phosphoproteomic analysis including chospecific data evaluation. the chosen cho cell lines were cultivated in triplicates in silac media containing isotopically-labeled lysine/arginine (hlys/harg) and in parallel in identical standard media (llys/larg, tcx d, xell). cell density, viability, metabolism and cell cycle distribution were monitored during ml batch culture for - days. at day . igf was added into hlys/harg cultures. min later a part of the cells was harvested. for ms analysis igf-treated (hlys/harg cultures) and control cultures (llys/larg cultures) were combined. the following ms sample preparation workflow included digestion of whole protein lysate and phosphopeptide enrichment via tio beads. nanolc-esi-orbitrap ms (q exactive plus, thermo fisher scientific) of phosphopeptides was excecuted with subsequent identification and quantification in maxquant [ ] . in addition to silac quantification of h/l ratios for investigation of igf effects, aquired data was also used to perform label-free quantification (lfq) in maxquant [ ] for comparison of cell lines. statistical significance was calculated via t-test (p< . ) or anova (permutation-based fdr< . ) in perseus [ ] . results igf effects on growth and production the igf treatment resulted in a prolonged viability for all cell lines. however, an increased vcd was only observed for producer cell line , yielding in an enhanced integral of vcd (ivcd). for the parental cells growth was inhibited by igf, although s-phase cells were enriched at least temporary (fig. a) . regarding antibody production igf led to a decreased qp and product titer, concomitantly with an increase in s-phase cells (fig. a) . this inverse correlation of proliferation and cell specific productivity is known from different productivity enhancing molecules, like butyrate [ ] . ms investigation of signaling events the phosphoproteomic experiment resulted in the identification of . class i-phosphorylation sites. statistical evaluation of phosphopeptide abundances in perseus showed up significant differences between the cell lines and led to producer vs. parental classifications (fig. b) . the quantitative evaluation via silac yielded in about . quantifiable phosphosites in at least biological replicates. rapid phosphorylation changes after growth factor treatment indicated signaling towards protein synthesis, cell cycle and regulation of actin cytoskeleton amongst others. for phosphosites significantly different h/l ratios were calculated between the two groups parental vs. producer, four of them are listed (table ) . the workflow to study phosphorylation states revealed differences in the related cell lines and gave insights into signal transduction as a response on igf. on the one hand, igf-treatment resulted in a fast and widespread upregulation of phosphorylation sites within aktand mapk-signaling. on the other hand, a different phosphorylation status for producer compared to parental cell lines uncovered distinctions in biological processes like rna-and dna-binding and regulation of cytoskeleton. in sum, our sucessfully established phosphoproteomic approach allows to detect important signaling key players in cho cells that subsequently can be targeted through cell engineering or small molecule treatment. to improve antibody production in the cho cell expression system, it seems to be useful to up-or downregulate gene expression including antibody folding, secretion, and cell metabolism. many cell engineering approaches, including gene introduction, knockout and knockdown, have been employed to enhance recombinant antibody production [ ] . however, identifying production enhancer genes is the rate-limiting step for cho cell engineering, because the conventional method requires a series of experiments including genomic integration of the tested genes, selection of stable cell clones and cell culture experiments of several clones. in this study, we propose an approach for rapid evaluation of production enhancer genes based on an episomal expression system. plasmid vector carrying the epstein-barr virus (ebv) encoded nuclear antigen (ebna ) was transfected into cho cell line producing igg antibody. after g selection and single colony isolation, ebna expression was checked with capillary electrophoresis system wes (proteinsimple). ebv ebna -antibody ( eb ) was used for detection as the primary antibody. the expression vector for the gene of interest was prepared by inserting bp of an orip dna sequence into a plasmid vector carrying cag promoter, resulting in the potc vector. pei max (polysciences, inc.) and balancd transfectory cho (irvine scientific) were used for the transfection. the number of viable cells and gfp-positive cells were counted using countess ii fl automated cell counter (thermo fisher scientific). the transfected cells were cultured in cellstar cellreactor tubes. the tubes were incubated in a climo-shaker isf -x (kuhner). antibody production was measured using biolayer interferometry with an octet qk system (fortebio). we constructed four cho cell lines stably expressing ebna , termed igg -eb to eb . in capillary electrophoresis analysis, we observed a clear peak corresponding to the ebna expression in all four cell lines. we tested the transfection efficiency by potc-gfp plasmids. in the best transfection condition, pei/dna ratio of / , igg -eb cell showed the highest gfp-positive cell number ( . × cell/ml) and transfection efficiency ( %) among the four cell lines. therefore, igg -eb cell lines were selected for further study. after the transfection, the number of gfp-positive cells continued to increase even after the passage (fig. ) , suggesting that the potc-gfp plasmid was stably retained and replicated by ebna /orip system in igg -eb cell lines. in preliminary experiments, we introduced three genes, mdh , gss and gclm, into igg -eb cell lines. cotransfection of these three genes led to an increase in igg production from ± mg/l (control) to ± mg/l at day (p< . , t-test, n= ). this result suggests that these three genes work as production enhancer genes. conventional methods based on stable cells take up to months to determine whether the gene of interest is beneficial for recombinant igg production. in contrast, identification of production enhancer genes is achievable within days by our proposed method based on ebna /orip system. the proposed method makes it possible to evaluate production enhancer genes in a rapid manner. the proposed method is a promising approach to identify genes enhancing recombinant antibody production. background g unic™ ( gun) technology comprises a set of protected genetic elements that improve protein production by acting on transcription as well as on translation. the elements can either be inserted into existing (platform) vectors or be provided as complete ready-to-use vectors. the technology can be used in stable and in transient transfection to boost protein production for product development and is being applied in cld for pharmaceutical proteins. in combination with antibiotic selection or dhfr selection, gun technology routinely results in - fold increase in expression of client antibodies or fusion proteins, both in pools and after clonal selection. previously, we have successfully combined gun technology with glutamine synthetase (gs) selection and the cho gs null cells of horizon discovery, resulting in clonal cell lines producing > g/l of a biosimilar mab in fed-batch assay. here we present data on the successful application of the gun technology for the enhanced expression of a large (> kda) human heterotrimeric glycoprotein, a renowned difficult-to-express (dte) protein. all expression vectors comprised a hcmv promoter and bgh polyadenylation sequence in the expression cassettes for the gene of interest, and a selection marker gene with sv promoter and sv polyadenylation sequence. g unic™ vectors also contained genetic elements ( g unic™ technology, proteonic). cho gs null cells (horizon discovery) were transfected in duplicate with reference or gun expression vectors and selected in media lacking glutamine and containing the appropriate antibiotics. the bulk pools were seeded at equal viable cell density after obtaining maximum viability and cultured for days without feeding (batch). expression of the target protein in cell culture supernatants of stable bulk pools was measured by elisa. the three protein subunit genes were expressed from vectors with different selection markers. in the reference constructs (without gun), the α, β, and γ chains were expressed from vectors with marker genes for zeocin, blasticidin, and gs, respectively. a similar vector combination was also generated with gun elements integrated in each vector. in addition, vectors with subunits (γ-α and α-γ), each with a separate gun element, promoter and polyadenylation signal, were generated with a gs marker gene. cho gs -/cells were transfected with the appropriate vector combinations in equimolar ratios and selected in bulk in medium lacking glutamine and or antibiotics. the -vector transfected cell pools recovered first, due to the presence of only antibiotic in the medium (fig. a) . the pools transfected with three gun vectors recovered to maximum viability just a few days after the -vector gun pools. recovery of the reference pools took up to a week longer than the gun pools. production of each pool was assessed in a batch production run in shaker flasks. all -vector gun pools which recovered first produced titers around . g/l, which is almost -fold higher as compared to the production by reference pools (fig. b) . the highest titers of . g/l were obtained in the -vector gun pools. these data show that the g unic™ genetic elements can be successfully used to obtain a significant increase in the titer of difficult-to-express proteins. similar results have been obtained with other dte proteins, including fc-fusion proteins and bi-specific antibodies (not shown). the expression of a large, glycosylated multimeric difficult to express protein can be increased more than ten-fold in cho gs pools by application of g unic™ genetic elements. the highest expression of is obtained using a separate vector for each subunit. characterization of antibody-producing cho cells with chromosome aneuploidy noriko yamano , , sho tanaka background chinese hamster ovary (cho) cells are commonly used as host cells to produce biopharmaceuticals. however, the number of chromosomes in cho cells varies. previously, dg -sc and dg -sc cell lines with modal chromosome numbers of and were isolated from parental cho-dg cells, from which igg -expressing cell lines named igg -sc and igg -sc were established, respectively. the igg -sc cell pool showed a higher specific igg production rate than the igg -sc cell pool [ ] . even though all of the igg -sc clones and half of the igg -sc clones contained the same number of vector integration sites (single integration site), igg-sc cell clones produced more igg following the culture of single-cell clones than any of the igg -sc clones [ ] . in this study, we performed transcriptome analysis to investigate the characteristics of high-producer cells with chromosome aneuploidy. transcriptome analyses using amplified fragment length polymorphism (aflp)-based high-coverage expression profiling (hicep) and de novo mrna-seq were performed on dg -sc , dg -sc , igg -sc and igg -sc . to compare cell lines with different numbers of chromosomes, transcriptome data from mrna-seq were adjusted for cell number using rna reference materials (nmij crm -a; national institute of advanced industrial science and technology) mixed at equal amounts per cell. pathways related to differentially expressed genes were searched using keymolnet (km data). high-chromosome-number cho cells showed larger cell diameters, as determined by vi-cell (beckman coulter) measurement. the predicted volume ratios, based on these diameters, are . (dg -sc :dg -sc ) and . (igg -sc :igg -sc ). the levels of β-actin and the products of most other genes that were detected by mrna-seq differed by approximately % in the comparison between sc and sc (sc > sc ). based on the analysis of gene expression levels per cell volume, approximately % of detected genes showed lower expression in both dg -sc and igg -sc compared with the levels in dg -sc and igg -sc , respectively. in addition, the number of genes whose expression level was decreased in igg -sc compared with that in dg -sc was larger than those showing the opposite pattern. the results of the comparisons between igg -sc and igg -sc indicate that differentially expressed genes were mainly related to cell growth (e.g. myc, smad), apoptosis (e.g. caspase), lipid metabolism (e.g. srebp, pparγ) and epigenetic histone modification (e.g. brca, hat) pathways. the mrna levels of myc, smad, caspase, brca and hat related genes were lower in igg -sc , while those of srebp and pparγ related genes were higher in igg -sc . the effects of these pathways on antibody production should be examined in future. in this study, we found that high-chromosome-number cho cells have lower amounts of mrna relative to their volume. a reduction per unit volume in the expression of genes that are required for survival might generate additional energy for recombinant protein production in high-chromosome-number cells. from an evolutionary perspective, an increased set of chromosomes underlies rapid evolutionary adaptation. although there are issues to be considered, such as stability, there may also be advantages to using high-chromosome-number aneuploid cho cells as a production host cells of recombinant proteins. background human growth factors have an enormous therapeutic potential. among them, the bone morphogenetic protein- (bmp- ) can induce de novo bone formation endowing the protein a high therapeutic potential. however, finding a suitable recombinant production system for such a protein still remains a challenge. recombinant expression of hbmp was investigated in transiently transfected hek- cells and in stable clones established in cho-k cells cultivated in excell and pro-cho medium, respectively. protein stability and interaction of the hbmp with the producer cells were investigated in vitro using commercially available rhbmp . in addition, we investigated a cell-free protein synthesis system harboring translocationally active microsomal structures, hence having the potential to perform post-translational modifications, as an alternative production method. we showed that growth rates and viabilities of the rhbmp producing cells were similar to those of the parent cell line, while entry into the death phase was delayed in case of the recombinant cells. the maximum rhbmp concentration detected in the culture supernatant was low for stable clones but can be greatly improved combining the hek- cells transient expression system and batch reactor cultivation which reflects a better compatibility of the codon usage in the human cells (table ) . hbmp protein is sensitive to slightly acidic ph and to a lesser extend to proteases (fig. a ) and binds to both producers cell lines (fig. b) -all this could incidentally contribute to the low product titers. cell-free protein synthesis has been proposed as alternative for "difficult-to-express" proteins. since native hbmp is glycosylated, a cell-free system based on eukaryotic cell lysates is required for its production. cho cell lysates were chosen, since they had previously been established as the most productive eukaryotic system in our hands [ ] , while concomitantly enabling a direct comparison to the production of hbmp in stable clones established in cho-k . the ability to perform post-translational modifications is a major advantage of eukaryotic systems. the cho lysates prepared by the protocol used here have previously been shown to contain significant amounts of endogenous microsomes derived from the endoplasmatic reticulum during lysis [ ] . to enforce translocation of the target protein into the microsomal structures, a melittin signal peptide was fused to the hbmp cdna. the glycosylation of the protein was assessed by enzymatic treatment (pngase, endoh) and confirmed using c-mannose for the de novo protein synthesis. upon cell-free protein synthesis, the hbmp yield was -fold higher than the best one in the hek- cells. the difference becomes even more dramatic, when productivities are considered (table ) , i.e. the fact that maximum product titers are reached within h in the cellfree system compared to h in the cell-based ones. this demonstrates that the cell-free expression system is most suitable compared to mammalian cell expression method for the production of glycosylated human bmp (table ) [ ] . human growth factors are complex molecules, which make their production in mammalian cells desirable. however, low product titers caused by a variety of both cell and process related effects may hinder the development of highly productive processes. in such cases, cell-free protein production using cho cell lysates containing endogenous microsomes for posttranslational processing, may eventually present an attractive alternative. in particular since these lysates can be used under tightly controlled conditions assuring a higher degree of reproducibility, than, e.g. transient transfection systems. cell-free systems are known to circumvent typical bottlenecks of cellbased ones, e.g. metabolic regulation and cell maintenance mechanisms. in consequence, the production of a recombinant protein is neither inhibited by its accumulation nor by any interaction with the cells, e.g. through the activation of inhibitory signaling pathways. core. preliminary studies showed that the corresponding polyplexes, but also some of the cells that came into contact with them, became magnetic and were manageable by magnetic fields [ ] [ ] [ ] . here, we present a characterization of the influence of structure and composition on the function of these polymers using a library of highly homogeneous, paramagnetic nano-stars with varied arm lengths and densities [ ] . the paramagnetic nano-stars library was synthesized by coating maghemite nanoparticles (γ-fe o ) with a thin silica-shell functionalized with an atomic transfer radical polymerization (atrp) initiator. pdmaema arms were grown from the core particles via atrp. in one case, the pdmaema arms was end-capped with pdegma blocks produced during a second atrp step. all nanostars were characterized by size exclusion chromatography and thermogravimetry to calculate number and length of the pdmaema arms. the core diameter was determined by transmission electron microscopy and dynamic light scattering (dls). the different variants (table ) were analyzed for their ability to complex pdna (pegfp-n ) using various physicochemical methods (dls, zeta sizer). transfection efficiency/cytotoxicity in cho-k cells were determined by flow cytometry. transfected cells were placed in a magnetic field and the influence of the polymer architecture on the magnetic separation was investigated. nonparametric spearman analysis was used to correlate between arm length/arm densities, magnetic properties of the cells and transfection efficiency. based on the hydrodynamic radii of the polyplexes, the investigated nano-stars could be divided into three subgroups (table ) . middle, but also high arm density nano-stars formed smaller polyplexes with hydrodynamic radii ≤ nm, a size that is considered suitable for endocytosis and transfection. transfection efficiencies and cytotoxicities varied systematically with the nano-stars architecture, with viability showing a more pronounced dependency on the characteristics of the transfection agent than the transfection efficiency itself. the arm density was particularly important, with values of approximately . arms/ nm yielding the best results (fig. a) . the end-capping the polycation arms with pdegma significantly improved the serum compatibility (fig. b) . the gene delivery potential of a given nano-star and its ability to render the cells magnetic did not correlate. although, compared to the non-separated cells, egfp-expressing cells were consistently more frequent in the magnetic cell fraction, while the non-magnetic fraction was slightly depleted. when the egfp-expressing cells were further divided into low, middle and high producers, a statistically significant shift towards the high producers was observed in the magnetic cell fraction (fig. c) . a nonparametric spearman correlation analysis was used to statistically evaluate possible links between the molecular characteristics of the nano-stars, the physicochemical properties of the corresponding polyplexes, the transfection conditions, and the cellular reactions. the resulting correlogram is shown in fig. d . transfection agents with magnetic properties enlarge the toolbox for studying non-viral gene delivery, since cellular magnetism is added as a new parameter. this allows, inter alia, a distinction between mere cellular interaction and actual uptake, which is otherwise difficult. viability showed a much more pronounced dependency on the characteristics of the transfection agent/polyplex than the transfection efficiency itself, which should be taken into account during method optimization. end-capping the polycationic pdmaema-arms with pdegma-blocks improved the compatibility of the polycationic nano-stars with serum components. in future optimized, blood-compatible, nano-stars, which can be retained/directed by magnetic fields, could become options for non-viral gene delivery in vivo. the increasing demand for monoclonal antibodies has necessitated the need to increase the productivity of current industrial cell lines. in our earlier study [ ] , we had shown that treatment with er-stress inducer, tunicamycin significantly increased the titers and productivity in recombinant cho cell lines with a simultaneous upregulation of many genes from the unfolded protein response pathway (upr). however the loss in cell viability prevented a sustained increase in titers. in the current study we explore the effect of varying concentrations of tunicamycin and treatment times, such as to modulate the increase in protein folding capacity while preventing induction of apoptosis. anti-rhesus igg-secreting cho cells [ ] were cultured in sf-cdm in ml shake flasks. the cells were treated with varying concentrations ( - ng/ml) of tunicamycin in a batch culture. further, the effect of treatment with tunicamycin for short periods of time ( hrs) was also evaluated. igg titers and mrna expression levels were quantified using elisa and qrt-pcr (illumina), respectively. results cho cells were treated with different concentrations of tunicamycin and cultured in a batch for days (referred as continuous treatment/cte). figure a presents the maximum vcd and % drop in viability under treatment. a dose-dependent inhibitory effect is observed on growth and viability of cells in cte-cultures, with minimal inhibition as lower concentrations. contrastingly, igg titers (fig. b) were higher in treated cultures w.r.t. control in initial phase of the cultures at all the concentrations of tunicamycin. the per-cell productivity (fig. c) also showed a significant increase w.r.t control at all the concentrations of tunicamycin. however, the increased productivity due to tunicamycin was not sustained and levels become similar to control after day (data not shown). to prevent loss of viability due to tunicamycin, the effect of short-term treatment (ste) with tunicamycin was explored. cells treated with tunicamycin for hours were harvested (corresponding to day of cte cultures) and inoculated in fresh media. the ste-cultures showed improved viability and higher maximum vcd as compared to cte-cultures (fig. a) . the fold increase in igg titers was not sustained beyond day - in stecultures ( fig. d ) but significant increase in productivity was seen in the initial phase (fig. e) . further, the cells were adapted over continuous generations under ng/ml tunicamycin. the adapted cells had overall . -fold higher productivity, as compared to control (fig. f) , in a batch culture. to understand the molecular basis of increase in productivity, mrna expression level of key genes was determined. xbp s is a transcription factor involved in activation of chaperones (like grp , calreticulin) and apoptotic genes (such as chop). significant increase in the levels of calreticulin was seen on treatment with tunicamycin (fig. g) . both xbp s and grp were marginally induced when treated with ng/ml of tunicamycin in both cte-and ste-cultures (fig. h) , and significantly up-regulated when treated with ng/ml of tunicamycin. the chop mrna levels also increase with increasing tunicamycin concentrations, with levels in ste-cultures lower than cte-cultures (fig. h) . the results suggest that upr induction may be important to increase productivity in these cte/ste-cultures. note that, tunicamycin had no effect on the expression levels of igg heavy-chain, thus eliminating the involvement of igghc-mrna in increasing productivity (fig. i) . tunicamycin induced er-stress increased productivity in the initial phase of the culture and enhanced upr-mediated folding capacity can be attributed as one of the reasons for it. at lower concentrations of tunicamycin, a fine balance between optimum upr induction and apoptosis can be achieved, as seen in ng/ml tunicamycin ste-cultures. in summary, this study demonstrates an alternate approach to enhance productivity of current industrial cell lines. background chinese hamster ovary (cho) cells have been widely used for the large-scale production of biopharmaceuticals [ ] . to construct antibody-producing cho cells, exogenous genes encoding antibodies are usually integrated into unspecified regions of chromosomes (random integration). however, the chromatin structure differs depending on the location of the chromosomal region, which affects the expression level of the gene of interest [ ] . recently, gene-targeting methods that enable site-specific integration of expression vectors have been developed. however, the regions that are most efficient for exogenous gene expression have not been clarified. we previously constructed a cho genomic bacterial artificial chromosome (bac) library generated from the recombinant cho-dg cell line. it was expected to cover the entire cho genome five times. the chromosomes in cho-dg cells were aligned in decreasing order of size and assigned letters from a to t [ ] . three hundred and four bac clones were mapped to every chromosome of cho-dg . among the karyotypes of cho-dg , cho-k and primary chinese hamster cells, chromosomes a and b are considered as the sole paired chromosomes corresponding to chromosome in primary chinese hamster cells. hence, chromosomes a and b are considered to be stable [ ] . in this study, we constructed antibody-producing cells by using a gene-targeting method, which focused on the stable chromosomes. a gene map of chromosome was constructed by combining the bac-fluorescence in situ hybridization (fish)-based chromosome physical map and sequence data of mapped bac clones. the sequences of bac clones were searched by blast with ncbi and chogenome.org databases. three different regions on chromosomes a and b were selected based on cho genomic bac library sequences as target sites. cho-k cells were stably transfected by lipofection. the target sequences were broken using the clustered regularly interspaced short palindromic repeats (crispr)/crispr-associated protein (cas ) system and humanized igg genes were integrated by non-homologous end joining recombination. transfection without using the crispr/ cas system was also performed. these cell pools were cultivated for six days with serum-supplemented medium, and their levels of antibody productivity were evaluated by elisa. copy number analysis was also performed using real-time pcr. results and discussion construction of gene map of chromosome : eighty-three bac clones were mapped onto chromosomes a and b (each clone contained - kb of the cho genome sequence). as a result of annotations of bac clone sequences, genes were mapped on chromosome . investigation of the differences of productivity among antibodyproducing cells that were constructed by chromosome targeting and/or random integration: cell growth was not affected by the gene targeting site. the specific production rates of antibodyproducing cell pools constructed by gene targeting of chromosome were higher than those of the cell pool constructed by random integration. all cell pools constructed by gene targeting showed lower copy numbers of heavy chain and light chain in genomic dna than those in the cell pool constructed by random integration, despite showing high productivity. our results indicate that high productivity of the cells constructed by gene targeting of chromosome does not depend on the increase of the antibody copy number, and that the environments around these target regions are suitable for exogenous gene expression. the approach of using gene targeting to chromosome may be promising for constructing antibodyproducing cells. retroviral vectors have been widely used as gene delivery tools in various biotechnology fields. however, the random integration feature of retroviral vectors seems to cause problems such as insertional mutagenesis and gene silencing. we previously demonstrated cre-mediated retroviral transgene insertion into a pre-determined site of the founder cells using integrasedefective retroviral vectors (idrvs), where a cre expression plasmid was transfected into the cells prior to retroviral transduction [ ] . recently, we reported novel hybrid idrvs (cre-idrvs) incorporating bioactive cre recombinase protein, and validated site-specific gene integration of an scfv-fc antibody expression unit into the chinese hamster ovary (cho) cell genome [ ] . we also developed an accumulative site-specific gene integration system, which enables repeated integration of multiple transgenes into a pre-determined locus of the cell genome [ ] . here, we attempted repeated integration of transgenes using cre-idrvs. a viral vector plasmid (pqmscv/hd[scfv-fc]) encoding reporter genes and an scfv-fc expression unit flanked with wild-type and mutant loxps was constructed for the production of idrvs. cre-idrvs were produced described previously [ ] . results and discussion figure a shows a schematic drawing of each round of targeted transgene integration using cre-idrvs harboring an scfv-fc expression unit. (fig. b) . genomic dna extracted from the cells were subjected to pcr using specific primer pairs α and β, and γ and δ to confirm site-specific integration. dna fragments with expected sizes were amplified in each cell clone (fig. c) . these results indicate that site-specific repeated integration was achieved using cre-idrvs. in contrast, scfv-fc productivity in cho/hd[scfv-fc]× cells was slightly decreased compared with that of cho/ne[scfv-fc]× (data not shown). although the reason remains unclear, repeat-induced gene silencing might occur due to tandem repeat structure of expression units. we reported improved recombinant antibody production using a production enhancer element [ ] . such a cis-regulatory element might be a feasible approach to enhance the productivity. we demonstrated site-specific repeated transgene integration into a pre-determined chromosomal locus using cre-idrvs for the production of an scfv-fc antibody. if lipids role in the cell have been reduced for a long time to cell membrane formation, it is now understood that lipids plays also a role into energy metabolism, vesicular transport, membrane structure, dynamics and signaling. however, the exact mechanism of how compositional complexity affects cell homeostasis remains unclear. thanks to recent advances in mass spectrometry, it is now possible to study a wide range of lipids, providing a better understanding of lipid homeostasis in high performance cell culture processes. the purpose of this work was to develop a robust lipidomics method applied to mammalian cell cultures in a three step method: extraction, separation and detection (fig. ). both matyash [ ] and folch [ ] extraction method were performed on our cells to reach the highest yield. two separation techniques were also tested: hydrophilic interaction liquid chromatography (hilic) and reverse phase chromatography. finally lipid classes' identification was achieved by tandem mass spectrometry analysis thanks to structure-specific fragmentation ions. the yield obtained with matyash extraction method was higher than with folch method for each lipid class tested. besides, matyash method presents also the advantage to be less toxic and suitable for high throughput analysis since the organic layer is above the aqueous layer. lipids separation by hilic is based on their polar head. since lipid classes are defined by polar head, the lipids are eluted class by class, making their identification easier. the separation of lipids by reverse phase was correct but the method is longer and we observed a massive carryover of triglycerides on the column. finally each lipid class was screened in ms/ms parent ion mode. target daughter ion was set according to the lipid class structure and fragmentation pattern. this detection technique enabled the identification of different lipids. to ensure the absolute quantification of the detected lipids and to guarantee comparable results between batches labeled internal standard were added prior to extraction. this method was optimized in a stepwise process to ensure a sensitive and selective measurement of the lipids. lipids were extracted by matyash method, separated by hilic and detected by tandem mass spectrometry. this method is suitable for both in process sample lipid analysis providing information on the cell lipid content, and for harvest samples, enabling to follow the lipid release during the different harvest steps. this non-targeted lipidomic quantitation method will enable us to better control lipid synthesis during biopharmaceutical fed batch production through clone selection, metabolomics studies and harvest development. background human mesenchymal stem/stromal cells (hmsc) can easily be isolated from e.g. bone marrow, fat tissue or umbilical cord blood and are therefore a central player in regenerative medicine, gene therapy and cell therapy [ ] [ ] [ ] . the necessary gene shuttle is mainly provided by viruses associated with diseases, like retrovirus or adenovirus [ ] [ ] [ ] [ ] . these possible pathogen viruses demand for high safety standards. also, they are prone to genomic alterations and there is the possibility of virus inactivation, triggered due to pre-existing immunity in the patient [ ] [ ] [ ] . in this context, the autographa californica multicapsid nucleopolyhedrovirus (acmnpv) is a safe alternative. the virus replication is hostspecific for insects [ ] , but it is known since the mid- s, that a temporary transduction of mammalian cells is possible [ ] . some modifications of the virus increased the applicability in stem cells. pseudotyping the virus with the vesicular stomatitis glycoprotein (vsv-g) led to an expansion of the transducable cell [ , ] and the integration of the woodchuck hepatitis virus post-transcriptional regulatory element (wpre) prolonged the recombinant protein expression [ , ] . for achieving a baculovirus-induced differentiation of hmscs, the promotor and the expression strength of the recombinant protein are crucial factors. still, there are still few comparative promotor studies [ , ] . however, a successful virus uptake is the prerequisite for a successful protein expression. we therefore investigated factors significantly influencing the transduction process by applying design of experiments (s. fig. a ). the experimental design comprises a two level factorial screening, set-up using design expert v .for the transduction , c/cm were seeded in -well plates with dmem + % fcs and incubated overnight at °c, % co and humidified atmosphere. the recombinant baculovirus using an integrated ef α promoter to control gfp expression, described elsewhere [ ] , was diluted to the respective concentrations in the different surrounding fluids. after discarding the cultivation medium of the hmsc-tert, ml of virus containing solution was added to the cells. the following incubation was varied in duration before replacing the virus solution with growth medium and an incubation overnight. h post transduction (hpt) the cells were washed with pbs, trypsinized with μl trypsin/edta and incubated for min at °c. trypsination was then stopped applying μl soybean trypsin inhibitor and the cells were analyzed using flow cytometry. as shown in fig. a , the virus concentration and incubation time exert the highest influence on the transduction efficiency. obviously, a higher concentration of viral particles and longer incubation of cells with virus increases the probability for hits between cells and virus particles. additionally, the surrounding fluid can have a negative impact on the transduction. this is due to the interaction of medium components with the baculovirus. therefore, pbs containing ca + & mg + is recommended as surrounding fluid for transduction experiments. in fig. b , the transduction conditions resulting in the highest percentage of gfp+ cells are displayed: virus particles per cell (ppc) and an incubation time of h with hmsc-tert. the experiments show, that especially the virus concentration and the incubation time of cells with virus influence the transduction efficiency. based on the results of the screening, further optimization of the transduction conditions will be done using a face centered central composite design with pbs containing ca + & mg + as surrounding fluid and at an incubation temperature of °c. background breast cancer is the second main cause of cancer related deaths for women worldwide and among them the triple negative subtype (tnbc) represents a clinical challenge by being associated with high mortality and having no effective therapies against it [ ] , [ ] . accordingly, there is an urgent need to design new and more effective drugs to treat breast cancer. notch signaling is an evolutionary conserved cell-to-cell communication pathway crucial during embryonic and breast development and tissue homeostasis. this pathway is often hyper-activated by overexpression of notch receptors and/or its ligands in several types of cancers, such as breast cancer (tnbc included), where it contributes to its development, progression and drug resistance [ ] , [ ] , [ ] . our aim is to generate a function blocking antibody against the notch delta-like- (dll ) ligand with therapeutic efficacy against breast cancer. materials and methods dna of human dll full length extracellular domain (dll -ecd) and a truncated version, containing the minimal binding region to the notch receptor (dll -egf ), were cloned into pfuse-fc -igg , and expressed in hek e cells. recombinant proteins were purified from culture media by protein-a affinity and size exclusion chromatography. the human scfv phage display tomlinson i+j library was used to select specific scfv against peptides targeting dll binding regions to notch. the binding ability and specificity of the selected scfv clones was evaluated by scfv-on-phage elisa. our strategy allowed us to obtain mg of pure (> %) and stable dll -ecd-fc as confirmed by sds page and thermofluor assay. dll -egf -fc yield was very low and buffer screenings are ongoing to optimize protein stability. functional studies performed in human breast cancer mcf cells showed that both ligands are biologically active as they increased the expression of the notch-dependent genes hes- , hey-l and hey- . recombinant dll and peptides were used to select for monoclonal antibodies by phage display. after three rounds of panning with dll peptides we identified scfv positive clones, of which presented high affinity to dll -ecd-fc. currently we are performing more phage display selections to increase the number of positive clones. scfv with higher affinities will be reformatted into iggs and their ability to inhibit the notch pathway will be evaluated. the anti-oncogenic effects of anti-dll iggs will be assessed in breast cancer cells in viability/apoptosis, proliferation, migration, and invasion assays. an anti-dll igg with therapeutic efficacy against breast cancer will demonstrate that targeting dll could be one of the key factors for successfully targeting breast cancer. recombinant adeno-associated virus (raav) approaches have an outstanding reputation in gene therapy and are evaluated for cancer therapy [ ] . advantages include long-term gene expression, targeting of dividing and non-dividing cells, and low immunogenicity. established raav production utilizes triple transfection of adherent hek cells, which hardly meets product yield requirements for clinical applications. we transferred the aav production system to hek -f suspension cells. this process is scalable and uses serum-free media streamlining downstream procedures. after optimization of transfection efficiencies and shaker cultivations, we produced titers of × viral genomes per cell in a l bioreactor. the suspension adapted hek-freestyle -f cell line was used for the experiments in chemically defined animal component free media (hek-tf, hek-gm (xell ag), freestyle f (thermo fisher scientific)). samples for viable cell density and viabilities were taken daily and analyzed using an automated cell counting system (cedex, roche diagnostics). transient transfection of × cells/ml was carried out with polyethylenimine max in a : dna-pei ratio (w/w) with μg dna. three plasmids (pgoi, prepcap, phelper) were applied in a molar : : ratio (fig. a) . pretests were performed in orbital shaking tube spin bioreactors. for scale-up, batch processes were carried out in ml shake flasks as well as in l stirred bioreactors at % air saturation and ph . . transfection efficiencies and raav production were quantified by flow cytometry using a goi coding for a fluorescent protein and qpcr of genomic copies, respectively. by optimizing the dna amount for transfection of -f cells more than % of the cells were reproducibly transfected. batch cultivations in shaker flasks revealed that raav were produced in the first - h after transfection. figure b shows viable cell densities and viabilities in relation to the genomic titer. genomic titers were determined from raw cell extracts and up to copies/ml were repetitively achievable. a decrease in viability marked the decline in genomic copies per ml showing that a prolongation of the process e.g. by addition of a feed would probably not increase yield. in a first scale-up, the raav production was transferred to a l bioreactor (fig. c) . transfection efficiencies in bioreactors of up to % were comparable to that obtained in a simultaneous shaker flask experiment. transfection efficiencies were lower compared to prior experiments due to controlled conditions in the bioreactor. nonetheless the titer with up to × genomic copies per cell was elevated compared to that of shaker flasks. first experiments with -f cells in hek tf medium showed promising results of transferring raav production from the adherent system to suspension. after improvement of transfections by the adjustment of dna amounts in small scale experiments, aav production was analyzed in shaker flasks. the batch process showed an expected increase in cell density with low variability between biological replicates (fig. b) . the genomic titer increased according to the viable cell density until day four where a sudden drop started. this observation was made for aav productions in hek-tf, hek-gm and freestyle f medium. for optimal yields, we assume that a slight decrease in viability marks the point in time for harvest. from optimized protocols, a batch process in a l bioreactor was carried out. interestingly the bioreactor cultivation resulted in lower overall viable cell densities but in higher genomic copies per cell compared to shaker flasks (fig. c) . these results are comparable to already published data for suspension cells [ ] . subsequent optimization of the bioreactor protocol will lead to further increase in raav yield. genethon and pall have collaborated to assess pall's single-use icellis fixed-bed bioreactor for viral vector production. clinical use of gene therapies to treat formerly incurable genetic diseases is advancing rapidly. viral vectors are an important tool for introducing genes into target cells. many gene therapies have been developed using adherent cells in -dimensional flatware or roller bottles but using these technologies to reach commercial-scale production represents a significant challenge. the icellis bioreactor enables large-scale viral vector production by providing a -dimensional matrix for cell growth in a compact configuration (fig. ). up to m of surface area is available in a compact bioreactor measuring mm in diameter in a total volume of l with ph, do and temperature control. a key feature of the icellis bioreactor is that it scales by increasing the diameter of the fixed-bed while keeping the height constant with no change in aspect ratios. the height of the fixed-bed can be varied ( , and cm) as well as density of carrier packing ( gm/l or gm/l). the icellis system comes in two formats, the icellis nano bioreactor ( . - . m ) and the icellis bioreactor ( - m ). processes developed in the bench top icellis nano bioreactor can be directly transferred to the corresponding icellis system. the icellis nano bioreactor enables an efficient platform for process optimization. the genethon raav- process was transferred to an icellis nano bioreactor . m ( cm bed height, gm/l density) bioreactor using freestyle media. the initial icellis nano process was established as ( ) seed on day , ( ) transfect at day , ( ) harvest at day and yielded < x vp/cm (n= ). media exchange, cell density at transfection, pdna/cell ratio, and lysis method were then changed to determine the effect on productivity. the modified process was then scaled from . m to . m ( cm bed height, gm/l density) icellis nano bioreactor. -media: a media exchange at hours post transfection with dmem substituted for freestyle medium resulted in an x increase in specific productivity. (abstract p- ) . a schematic overview of raav production in hek cells with triple-transfection system. b viable cell densities (vcd), viabilities and genomic copies per ml (gc) of a raav production with -f batch cultivations in shaker flasks. genomic copies per ml refer to the titer determined in ml culture volume. error bars represent biological and technical duplicate measurements of samples. c viable cell densities and genomic copies per cell of a raav production with -f batch cultivation in a l bioreactor. for reasons of comparability between shaker and bioreactor data genomic copies are given per cell. error bars represent technical duplicate measurements of samples -cell density at transfection: cells were seeded at , cells/cm and reached , cells/cm at day which was determined to be the optimal cell density for transfection. -pdna/cell ratio: reducing pdna by % had no significant effect on productivity. -lysis: use of trion x- at . % with mm nacl at ph resulted in > % virus recovery compared to sampled carriers. -scaling: specific productivity was maintained as the system was scaled from . m to . m . -overall, an average yield of x vg/m was achieved. the icellis technology is being adopted widely for viral vector production. transferring a process to the icellis nano bioreactor can be easily achieved and once in place can be optimized to provide significant productivity increases and cost savings such as reduced pdna. the icellis nano bioreactor is an efficient bench-top system the results of which can be readily scaled to the icellis system. background tissuse multi-organ-chip (moc) platform contributes to the ongoing advancement in systemic substance testing in vitro. current in vitro and animal tests for drug development are failing to emulate the systemic organ complexity of the human body and, therefore, often do not accurately predict drug toxicity. especially, cardiotoxicity is one of the main reasons why new compounds are failing in clinical trials. therefore, we aimed to establish an autologous dynamic multiorgan-device integrating cardiomyocytes for substance testing. generic d monolayer and d suspension ipsc derived cardiomyocytes differentiation protocols were established. beating cardiomyocytes were first seen on day in monolayer as well as in spheroid culture. cardiomyocytes show up to % cardiac troponin t positive cells and % myosin heavy chain positive cells by flow cytometry (fig. g, h) . myosin ii heavy chain, α-actinin, myosin / , myosin and caldesmon expression was shown by immunohistochemistry ( fig. a-d) . due to the exclusion of a lactate enrichment of cardiomyocytes, cardiac fibroblasts are also expressed in the spheroids shown by vimentin staining. those cardiac fibroblasts lead to a physiological heterologous cell population similar to the human heart. beating spheroids were cultivated for days under dynamic culture conditions in the multi-organ-chip. the integrated on-chip micropump provides physiological-like pulsatile circulation at a microliter scale and leads to better nutrition and oxygen supply. the next significant step is to combine multiple autologous d organ equivalents in our multi-organ-chip using ipsc differentiation technology. differentiating all cell types from one ipsc donor is crucial to overcome source and rejection problems. combining our multi-organ-chip platform with ipsc differentiation technology will eventually lead to a personalized system for drug and substance testing. lab as a service -automated cell-based assays lena schober, moriz walter, andrea traube laboratory automation and biomanufacturing engineering, fraunhofer ipa, stuttgart, germany correspondence: lena schober (lena.schober@ipa.fraunhofer.de) bmc proceedings , (suppl ):p- background the use of cell-based assays in pharmaceutical industry and academic research is a growing trend that is a driving force to reduce costs for drug development. academic research is gaining information about intracellular targets or functional mechanisms through the variety of different assays. these benefits can be used in preclinical studies and furthermore costly late-stage drug failures may be reduced by the use of cell-based assays. the use of automated systems is also in great demand and will change the testing of substances and research activities. nevertheless, there are a lot of barriers at the moment limiting the successful application of automated systems in this field. by the lack of flexibility and the demand for skilled computer scientists & engineers just the two main aspects stated by experts shall be mentioned. our strong background on automated cell culture technologies and expertise, gained in several projects, let us rethink the overall process chain and overcome established principles. a new service orientated platform for the execution of cell-based assays that are commonly used will be introduced. the main idea is to give access to automated infrastructure for academic research or spin-offs which cannot afford the special infrastructure. nowadays it is known that the development of inhibitory antibodies by hemophiliac patients is closely related with immunogenic epitopes present in the coagulation factors. these proteins are produced in hamsters cells [ - ] which insert a different posttranslational modification profile when compared with the human profile. patients with high-titer/high-responding inhibitors must be treated with bypassing agents that can achieve hemostasis. activated factor vii (fviia) is an attractive candidate for hemostasis, independent of fviii/fix, making this coagulation factor an alternative for hemophilia patients with inhibitory antibodies. however recombinant factor vii is produced in bhk- cells (baby hamster kidney cells) and as well as the others coagulation factors, it may contain immunogenic epitopes [ - ] . in this context, becomes extremely important to produce recombinant proteins with complex posttranslational modifications in a cell line not yet used [ - ] . we have been using the sk-hep- human cell line for the production of recombinant fvii. to generate the recombinant cell line we have used a bicistronic lentiviral vector, -gfp, containing a fvii gene and the gfp selection marker gene. a master cell bank and a work cell bank were generated in gmp conditions. the rfvii analyses were made by elisa assay, western blot, gene expression quantification and biological activity using the prothrombin time (pt) assay. rfvii purification by affinity chromatography using viiselect (ge) column. after purification the rfvii was formulated and dry froze to be used in in vivo experiments. in static conditions sk-hep- cells showed, for a period of months, a stable fvii production with an average of , iu/ml of fvii, % of cell viability and % of cells expressing the gfp gene. after purification with viiselect column it was possible observe a recover of % of the purified protein with % degree of purity (fig. ) . this recombinant purified fvii is being used in in vivo experiments to determine the pharmacokinetics parameters and to evaluate the posttranslation modifications profile. in conclusion, this study reports the use of sk-hep- cell line for high-level production of recombinant factor vii. these cells have proven to be effective in the production of recombinant protein and can be used as a new platform for the production of recombinant proteins. fig. (abstract p- ) . a determination of protein yield of egfr (epidermal growth factor receptor) synthesized in a cho cecf system. analysis of egfr protein yield obtained in a various batches of cecf formatted reaction. cecf synthesis was performed in the presence of c leucine for radio labeling of target proteins. radio labeled proteins were precipitated using tca followed by scintillation measurement. b detection of radio labeled egfr by autoradiography. a no template control (ntc) was prepared containing no egfr dna template background emergence of stem cell-based regenerative medicine recently leaded to the necessity to reach a sustained production of such cells [ ] . hence, new bioreactors and carriers were designed for cell expansion. however, to meet this increasing demand, improvement of both quality and quantity of stem cells remains necessary. soft biocompatible microcarriers mimicking extracellular matrix in term of structure and stiffness should be of valuable utility as substrate stiffness strongly influence in vitro stem cell fate and differentiation [ , ] . our expertise in the field of microbeads design using jetcutting technology [ ] enabled us to engineer +/- μm alginate beads of various g/m monomer ratio. we used jetcutter (genialab gmbh) with μm nozzle at max speed rpm. alginate solutions with concentrations % to % were gelifyed in % cacl etoh % solution. alginates with estimated viscosity (@ %) from to mpa were tested. a further surface treatment with gelatine ( , %, %) and poly-l-lysine ( , %) was carried out to reach an optimal cell anchoring of human adiposederived mesenchymal stem cells (atcc-psc- - ) in mesempro rs medium (gibco). jetcutter technology allowed us to obtain alginate microcarriers with a good homogeneity in size around μm and sphericity comparable to commercial carriers (table ) . best adhesion of human adipose-derived mesenchymal stem cells was obtained on , % gelatine coated alginate carriers (fig. ) . we observed limited apoptosis and human adipose-derived mesenchymal cells stemness was conserved after days in culture (data not shown). cellular bioassays developed with functionally immortalized cell lines aileen bleisch , aleksandra velkova , tom wahlicht , dagmar wirth , tobias may inscreenex gmbh, braunschweig, germany; msys, helmholtz centre for infection research, braunschweig, germany; greiner bio-one gmbh, frickenhausen, germany bmc proceedings , (suppl ):p- background a major challenge of current research is the limited availability of physiologically relevant cells [ ] . thus the development of relevant cellular bioassays that are robust, reproducible and scalable is hindered. to overcome current limitations we developed an immortalization strategy allowing the efficient and reproducible establishment of novel cell lines showing an in vivo-like phenotype. the main feature of our ci-screen technology is the ability to combine the advantage of cell linesthe unlimited cell supplywith the advantage of primary cellsthe physiological relevance. using this technology we have immortalized, amongst others, a human osteoblast cell line (ci-huob) [ ] . in the present study, the in vivo-like phenotype and functionality of the novel ci-huob was examined. therefore, ci-huob cells were used to develop a d cell culture model by using the magnetic d bioprinting technology (nano d biosciences, houston, tx, usa) [ ] . the ci-huob cell line was recently described and cultivated in huob maintenance medium (inscreenex, germany). for spheroid creation ci-huobs were grown in a monolayer, magnetized by adding a magnetic nanoparticle assembly (nanoshuttle, ns, nano d biosciences, houston, tx, usa) at a concentration of μl ns/cm growth area. after an overnight incubation magnetized ci-huob were detached and seeded into cellstar® cell-repellent -well plates (greiner bio-one, frickenhausen, germany). with the help of mild magnetic forces cells were printed into spheroids within h. these consist of . - . cells and were cultured for a period of up to days. the cell viability was analyzed by a propidium iodide (pi) and calcein am staining. to improve spheroid functionality spheroids were cultivated with huob differentiation medium (inscreenex, germany). "mini bone" tissue functionality and thus mineralization was analyzed by an alkaline phosphatase (alkaline phosphatase activity) and an alizarin red s staining (ca + deposits). the combination of ci-huob cells with the magnetic d bioprinting technology enabled the establishment of reproducible and consistent d spheroids. single spheroids per well were formed independent of the amount of cells ( . - . cells) (fig. a) . formed spheroids were stable for a culture period of up to days (fig. b) . neither cell death nor cell proliferation were observed in the bioprinted spheroids which is indicated by the stable size of the spheroids throughout the cultivation (fig. c) . after treatment with a differentiation stimulus the d bioprinted spheroids became fully functional "mini bones". this was highlighted by the alkaline phosphatase activity and the ca + deposits within the d bioprinted spheroids (fig. d,e) . taken together, these results demonstrated that the functional immortalization technology provides physiologically relevant cells in sufficient numbers and that the magnetic d bioprinting technology enabled a fast, consistent cell aggregation and the formation of stable uniform spheroids. importantly, these immortalized cells are capable to differentiate when a suitable stimulus is provided. for differentiation into mini bones, d spheroid cultivation and additional stimulation by small molecules are required. the combination of physiologically relevant cell systems with three dimensional culturing will help to generate in vitro test systems which closely resemble the in vivo physiology and thereby supporting future drug discovery approaches. fig. (abstract p- ) . characterization of spheroid "mini bones". a different number ( . - . cells) of ci-huob cells were printed into spheroids. b . ci-huobs were printed into spheroids and cultivated for indicated time points. c for analyzing spheroid sizes, pictures were taken and quantified by imagej. (d/e) . ci-huob cells were printed into spheroids and cultivated with (huob differentiation medium) or without a differentiation stimulus for two weeks. afterwards, bioprinted spheroids were sectioned by a cryo microtome and d stained for ca + deposits (alizarin red s) or e stained for alkaline phosphatase activity crispr/cas , a novel genomic tool to knock down microrna in vitro and in vivo degrontagged dcas /cpf effectors for multi-directional drug-inducible control of synthetic gene regulation assessing the variability of an innovator molecule n-glycan profile correct primary structure assessment and extensive glyco-profiling of cetuximab by a combination of intact, middle-up, middle-down and bottom-up esi and maldi mass spectrometry techniques d-dige screening of high productive cho cells under glucoselimitation -basic changes in the proteome equipment and hints for epigenetic effects dependence on glucose limitation of the pco influences on cho cell growth, metabolism and igg production fcgalactosylation modulates antibody-dependent cellular cytotoxicity of therapeutic antibodies fc glycans of therapeutic antibodies as critical quality attributes development of an automated, multiwell plate based screening system for suspension cell culture global cancer statistics remission of disseminated cancer after systemic oncolytic virotherapy screening different host cell lines for the dynamic production of measles virus beitrag zur kollektiven behandlung pharmakologischer reihenversuche a simple method of estimating fifty per cent endpoints webinar: ambr as a sedimentation-perfusion model for cultivation characteristics and product quality prediction de novo" high density perfusion medium: increased productivity and reduced perfusion rates a high-yielding cho transient system: co-expression of genes encoding ebna- and gs enhances transient protein expression an integrated vector system for the eukaryotic expression of antibodies or their fragments after selection from phage display libraries towards the development of a surface plasmon resonance assay to evaluate the glycosylation pattern of monoclonal antibodies using the extracellular domains of cd a and cd biotinylation of the fc gamma receptor ectodomains by mammalian cell co-transfection: application to the development of a surface plasmon resonance-based assay ambr™ mini-bioreactor as a high-throughput tool for culture process development to accelerate transfer to stainless steel manufacturing scale: comparability study from process performance to product quality attributes maximizing binding capacity for protein a chromatography protein glycosylation and its role in protein folding afucosylated antibodies increase activation of fcγriiia-dependent signaling components to intensify processes promoting adcc sialic acids and other nonulosonic acids production of antibody in insect cells suitability and perspectives on using recombinant insect cells for the production of virus-like particles cloning of cdna and characterization of anti-rnase a monoclonal antibody a production of functional antibody fab fragment by recombinant insect cells optimization of hek- s cell cultures for the production of adenoviral vectors in bioreactors using on-line our measurements enhancing heterologous protein expression and secretion in hek cells by means of combination of cmv promoter and ifnα signal peptide hek cell culture media study towards bioprocess optimization: animal derived component free and animal derived component containing platforms comparison of control strategies for fed-batch culture of hybridoma cells based on on-line monitoring of oxygen uptake rate, optical cell density and glucose concentration continuous bioprocessing: the real thing this time? mabs white paper on continuous bioprocessing fda perspective on continuous manufacturing. ifpac annu. meet screening and assessment of performance and molecule quality attributes of industrial cell lines across different fed-batch systems amanullah: quantitative modeling of viable cell density, cell size, intracellular conductivity, and membrane capacitance in batch and fed-batch cho processes using dielectric spectroscopy optimal and consistent protein glycosylation in mammcalian cell culture journal of laboratory automation. designs and concept-reliance of a fully automated high content screening platform mini-bioreactor as a highthroughput tool for culture process development to accelerate transfer to stainless steel manufacturing scale: comparability study from process performance to product quality attributes perfusion media development using cell settling in automated cell culture system model-based design of process strategies for cell culture bioprocesses: state of the art and new perspectives model-based strategy for cell culture seed train layout verified at lab scale hyperosmotic stimulus study discloses benefits in atp supply and reveals mirna/mrna targets to improve recombinant protein production of cho cells effects of osmoprotectant compounds on ncam polysialylation under hyperosmotic stress and elevated pco evaluating the bottlenecks of recombinant igm production in mammalian cells igm characterization directly performed in crude culture supernatants by a new simple electrophoretic method effect of culture ph on erythropoietin production by chinese hamster ovary cells grown in suspension at . and . °c enhancing protein expression in hek- cells by lowering culture temperature relationship between tissue plasminogen activator production and specific growth rate in chinese hamster ovary cells cultured in mannose at low temperature a quantitative proteomic analysis of cellular responses to high glucose media in chinese hamster ovary cells process analytical technologies in the pharmaceutical industry: the fda's pat initiative effects of ammonium and lactate on growth and metabolism of a recombinant chinese hamster ovary cell culture glycosylation in cell culture structural mechanism of high affinity fcgammari recognition of immunoglobulin g inhibition of glycosylation on a camelid antibody uniquely affects its fcγri binding activity investigation of cho secretome: potential way to improve recombinant protein production from bioprocess bladder cancer cell-derived exosomes inhibit tumor cell apoptosis and induce cell proliferation in vitro exploring packaged microvesicle proteome composition of chinese hamster ovary secretome glycome mapping on dna sequencing equipment iron (iii) citrate inhibits polyethylenimine-mediated transient transfection of chinese hamster ovary cells in serum-free medium efficient high-throughput biological process characterization scale-up of a stirred single-use bioreactor family quality control: er-associated degradation: protein quality control and beyond pharmacological targeting of endoplasmic reticulum stress signaling in cancer when er stress reaches a dead end oxidative stress and antioxidant defense the antioxidant edaravone attenuates er-stress-mediated cardiac apoptosis and dysfunction in rats with autoimmune myocarditis advanced process and control strategies for bioreactors. book chapter model-based strategy for cell culture seed train layout verified at lab scale seed train optimization for cell culture. chapter high cell density cultivation of human leukemia t cells (jurkat cells) in semipermeable polyelectrolyte microcapsules. eng. life sci cell retention by encapsulation for the cultivation of jurkat cells in fixed and fluidized bed reactors the role of interleukin- during homeostasis and activation of the immune system creating a biomimetic microenvironment for the ex vivo expansion of primary human the era of digital biomanufacturing the multivariate normal distribution recovery from rabies: a call to arms the role of vaccination in rabies prevention eliminating canine rabies, the principal source of human infection: what will it take? rabies virus-like particles expressed in hek cells immunogenic virus-like particles continuously expressed in mammalian cells as a veterinary rabies vaccine candidate an avian cell line designed for production of highly attenuated viruses a chemically defined production process for highly attenuated poxviruses a genotype of modified vaccinia ankara (mva) that facilitates replication in suspension cultures in chemically defined medium easy and efficient protocols for working with recombinant vaccinia virus mva large-scale transfection of mammalian cells for the fast production of recombinant protein recombinant protein production by large-scale transient gene expression in mammalian cells: state of the art and future perspectives high density transfection with hek cells allows doubling of transient titers and removes need for a priori dna complex formation with pei regulation of recombinant protein expression during chobri/rcta pool generation increases productivity and stability qc, canada; department of microbiology, infectiology and immunology the cumate gene-switch: a system for regulated expression in mammalian cells rapid protein production from stable cho cell pools using plasmid vector and the cumate gene-switch christoph zehe sartorius stedim cellca gmbh, laupheim animal cell technology: from target to market recent advances in mammalian protein production molecular mechanisms of rna interference. annual review of biophysics ribosome profiling-guided depletion of an mrna increases cell growth rate and protein secretion ubiquitous chromatin-opening elements (ucoes): applications in biomanufacturing and gene therapy the art of cho cell engineering: a comprehensive retrospect and future perspectives a novel bxb integrase rmce system for high fidelity site-specific integration of mab expression cassette in cho cells high-troughput lipidomic and transcriptomic analysis to compare sp / , cho, and hek- mammalian cell lines single cell characterisation of chinese hamster ovary (cho) cells eva pekle , , guglielmo rosignoli generation of stable chinese hamster ovary pools yielding antibody titers of up to . g/l using the piggybac transposon system comparison of three transposons for the generation of highly productive recombinant cho cell pools and cell lines effects of ammonia and lactate on hybridoma growth, metabolism, and antibody production lactate and glucose concomitant consumption as a self-regulated ph detoxification mechanism in hek cell cultures flux balance analysis of cho cells before and after a metabolic switch from lactate production to consumption reducing recon for steady-state flux analysis of hek cell culture trastuzumab -mechanism of action and use in clinical practice hek cell culture media study towards bioprocess optimization: animal derived component free and animal derived component containing platforms enhancing heterologous protein expression and secretion in hek cells by means of combination of cmv promoter and ifnα signal peptide production of lentiviral vectors references . wurm f m: production of recombinant protein therapeutics in cultivated mamammalian cells initial identification of low temperature and culture stage induction of mirna expression in suspension cho-k cells small indels induced by crispr/cas in the ' region of microrna lead to its depletion and drosha processing retardance production of recombinant protein therapeutics in cultivated mammalian cells improved antibody production in chinese hamster ovary cells by atf overexpression the vesicle-trafficking protein munc b increases the secretory capacity of mammalian cells rapid evaluation of n-glycosylation status of antibodies with chemiluminescent lectin-binding assay intracellular secretion pathway analysis for constructing highly producible engineered cho cells. th annual peptalk intracellular secretion analysis of therapeutic antibodies in engineered high-producible cho cells analysis of intracellular recombinant igg secretion in engineered cho cells. the th annual and international meeting of the japanese association for animal cell technology humanization strategies for an anti-idiotypic antibody mimicking hiv-i gp antibody humanization by molecular dynamics simulations-in-silico guided selection of critical backmutations maxquant enables high peptide identification rates, individualized p.p.b.-range mass accuracies and proteome-wide protein quantification the perseus computational platform for comprehensive analysis of (prote)omics data hoffrogge: label-free protein quantification of sodium butyrate treated cho cells by esi-uhr-tof-ms the art of cho cell engineering: a comprehensive retrospect and future perspectives gs system for increased expression of difficult-to-express proteins the netherlands correspondence: maurice van der heijden (heijden@proteonic.nl) bmc proceedings increased recombinant protein production owing to expanded opportunities for vector integration in high chromosome number chinese hamster ovary cells ires-mediated translation of membrane proteins and glycoproteins in eukaryotic cell-free systems cell-free protein expression based on extracts from cho cells comparison of cell-based vs. cell-free mammalian systems for the production of a recombinant human bone morphogenic growth factor. eng dual-responsive magnetic core-shell nanoparticles for nonviral gene delivery and cell separation pdmaema-grafted core-shell-corona particles for nonviral gene delivery and magnetic cell separation influence of polyplex formation on the performance of starshaped polycationic transfection agents for mammalian cells systematic study of a library of pdmaema-based, superparamagnetic nano-stars for the transfection of cho-k cells systems biology of unfolded protein response in recombinant cho cells dynamics of unfolded protein response in recombinant cho cells cells were transfected with np@(pdmaema ) (n/p ), separated h post transfection (t = ) by magnetically-assisted cell sorting and placed into separated cultures. the bars represent the overall transfection efficiency. distribution of low (light green), middle (green), and high (dark green) producers within the egfp-expressing cell fraction. data represent one experiment carried out in duplicate, with random experimental error shown. d correlogram between the molecular characteristics of the nano-stars (core diameter, arm density, arm length, number of monomeric units per nano-star), the physicochemical properties of the corresponding polyplexes (hydrodynamic radius, zeta potential), the transfection conditions (n/p ratio, amount of polymer), and the cellular reactions (transfection efficiency, magnetism, viability) production of recombinant protein therapeutics in cultivated mammalian cells position effects on eukaryotic gene expression bacterial artificial chromosome library for genome-wide analysis of chinese hamster ovary cells construction of bac-based physical map and analysis of chromosome rearrangement in chinese hamster ovary cell lines suguru imanishi , akira ito , masamichi kamihira , department of chemical engineering cre recombinase-mediated sitespecific modification of a cellular genome using an integrasedefective retroviral vector targeted transgene insertion into the cho cell genome using cre recombinase-incorporating integrase-defective retroviral vectors an accumulative site-specific gene integration system using cre recombinase-mediated cassette exchange improved recombinant antibody production by cho cells using a production enhancer dna element with repeated transgene integration at a predetermined chromosomal site lipid extraction by methyl-tert-butyl ether for high-throughput lipidomics a simple method for the isolation and purification of total lipids from animal tissues using baculovirus as a gene shuttle in hmsc: optimization of transduction efficacy gundula sprick clinical applications of mesenchymal stem cells concise review: mesenchymal stem cell treatment of the complications of diabetes mellitus wharton's jelly-derived mesenchymal stromal cells as a promising cellular therapeutic strategy for the management of graft-versus-host disease gene therapy: twenty-first century medicine state-of-the-art gene-based therapies: the road ahead viral vectors: a look back and ahead on gene transfer technology basic biology of adeno-associated virus (aav) vectors used in gene therapy biosafety challenges for use of lentiviral vectors in gene therapy adenoviral vector-mediated gene therapy for gliomas: coming of age manufacturing of viral vectors for gene therapy: part i. upstream processing the complete dna sequence of autographa californica nuclear polyhedrosis virus efficient gene transfer into human hepatocytes by baculovirus vectors efficient transduction of mammalian cells by a recombinant baculovirus having the vesicular stomatitis virus g glycoprotein recombinant baculoviruses as mammalian cell gene-delivery vectors post-transcriptional regulatory element boosts baculovirusmediated gene expression in vertebrate cells baculoviral vector-mediated transient and stable transgene expression in human embryonic stem cells systematic comparison of constitutive promoters and the doxycycline-inducible promoter baculovirus-induced recombinant protein expression in human mesenchymal stromal stem cells: a promoter study triple-negative breast cancer: an unmet medical need the therapeutic monoclonal antibody market. mabs a monoclonal antibody against human notch ligand-binding domain depletes subpopulation of putative breast cancer stem-like cells notch activation stimulates migration of breast cancer cells and promotes tumor growth notch-out for breast cancer therapies aav production in suspension: evaluation of different cell culture media and scale-up potential modular adeno-associated virus (raav) vectors used for cellular virusdirected enzyme prodrug therapy production of recombinant adenoassociated virus vectors using suspension hek cells and continuous harvest of vector from the culture media for gmp fix and flt clinical vector development of a cost-efficient scalable production process for raav- based gene therapy by transfection of hek- cells simon arias , mustapha hohoud , roel lievrouw , fabien moncaubeig b- brussels, belgium; généthon, rue de l'internationale cell-free systems based on cho cell lysates: optimization strategies, synthesis of "difficult-to-express" proteins and future perspectives cell-free protein expression based on extracts from cho cells comparison of cell-based vs. cell-free mammalian systems for the production of a recombinant human bone morphogenic growth factor ires-mediated translation of membrane proteins and glycoproteins in production of recombinant factor vii in sk-hep- human cell line zip - , brazil; department of clinical, toxicological and food science analysis, faculty of pharmaceutical sciences of ribeirão preto human cell lines for the production of recombinant proteins: on the horizon production of recombinant protein therapeutics in cultivated mammalian cells recombinant protein therapeutics from cho cells - years and counting establishment of a cell line expressing recombinant factor vii and its subsequent conversion to active form fviia through hepsin by genetic engineering method expression and fast preparation of biologically active recombinant human coagulation factor vii in cho-k cells implications of the presence of n-glycolylneuraminic acid in recombinant therapeutic glycoproteins uniquely human evolution of sialic acid genetics and biology production platforms for biotherapeutic glycoproteins. occurrence, impact, and challenges of non-human sialylation human cells: new platform for recombinant therapeutic protein production therapeutic glycoprotein production in mammalian cells masthering industrialization of cell therapy products tissue cells feel and respond to the stiffness of their substrate matrix elasticity directs stem cell lineage specification continuous cider fermentation with co-immobilized yeast and leuconostoc oenos cells eternity and functionality -rational access to physiologically relevant cell lines generation and characterization of two immortalized human osteoblastic cell lines useful for epigenetic studies biocompatibility of nanoshuttletm and the magnetic field in magnetic d bioprinting publisher's note springer nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations we accept pre-submission inquiries • our selector tool helps you to find the most relevant journal • we provide round the clock customer support • convenient online submission • thorough peer review • inclusion in pubmed and all major indexing services • maximum visibility for your research submit your manuscript at www submit your next manuscript to biomed central and we will help you at every step authors thankfully acknowledge the biotechnology and biological sciences research council for funding this research work. sns thanks esact for providing her with the opportunity to present her work at the meeting. we would like to thank moritz frei for his support for the generation of the ngs transcriptomics data. many thanks to valentine chevallier for her precious advices, to stefanos grammatikos for his support and to the whole upstream process sciences team. we thank david bruehlmann and thomas vuillemin from merck (vevey, switzerland) for providing the igg glycan variants and the -ab-uplc glycan data. polyplus-transfection would like to thank généthon for their kindly provided data.acknowledgements asmita mukerji, reetesh pm, sasi kumar k, pilot plant team acknowledgment to cedric allier from cea leti, grenoble, france. we would like to thank a. schemel and a. ehrlich for technical assistance. the authors would like to mention that this research was supported by the fi-dgr ( ) from spanish government and the project was led by prof. jordi joan cairó badillo. the authors want to thank the biotech process sciences team at merck in corsier-sur-vevey for their support and also the members of the morbidelli group at eth zürich for their input and collaboration. authors would like to thank dr. benjamin youn in manufacturing science and technology (msat) at biomarin for his help on coding the excel macro program for ambr , and dr. donald l. traul from tap biosystems (now part of the sartorius stedim biotech group) for his assistance on ambr operations. thanks to the bioreactor team of dustin davis, amer al-lozi, and jana mahadevan. we organic vaccines tm and the nih, who kindly provided to univercells. we thank polymun scientific immunbiologische forschung gmbh for providing the antibodies igm , igm and igm as a kind gift. this allison kurz and gian andrea signorell, genedata ag, basel, switzerland allison kurz, gian andrea signorell, genedata, basel, switzerland. allison kurz, gian andrea signorell, genedata ag high glucose concentration and low specific cell growth rate improve specific r-tpa productivity in chemostat culture of cho acknowledgements r. boraston for photographs in fig. . we acknowledge atum for their contributions to vector design and construction. austrian bmwfw, bmvit, sfg, standortagentur tirol, government of lower austria and business agency vienna through the austrian ffg-comet-k . this research is supported by sfi grant /ia/ . financial support of the austrian science fund (fwf; grant number p ) is gratefully acknowledged. we would like to thank the australian institute for bioengineering and nanotechnology, university of queensland-brisbane, australia (aibn) for providing the cho clones. this research is partially supported by the developing key technologies for discovering and manufacturing pharmaceuticals used for next-generation treatments and diagnoses both from meti and amed, japan. this work was supported in part by grants for developing key technologies for discovering and manufacturing pharmaceuticals used for next-generation treatment and diagnoses, both from the ministry of economy, trade and industry (meti), japan and from the japan agency for medical research and developments (amed). many thanks stefanos grammatikos for his support and to the whole upstream process sciences team. the authors thank the hessen state ministry of higher education, research and the arts within the hessen initiative for scientific and economic excellence (loewe program) for the financial support. the authors thank xell ag, bielefeld, for providing hek serum-free media (hek gm and hek tf) and for fruitful discussions. the authors would like to thank dana wenzel for cho lysate preparation (fraunhofer izi, potsdam-golm, germany). this work is supported by the european regional development fund (efre) and the german ministry of education and research (bmbf, no. b a). the authors acknowledge são paulo research foundation -fapesp ( / - ), centro de pesquisa, inovação e difusão (cepid), and national institute of science and technology in stem cell and cell therapy -inctc for financial support. this work was supported by grants from the niedersächsisches ministerium für wissenschaft und kultur ( ) and the german ministry for economic affairs and energy (igf n). the infrastructure which was modularly built up, consists of automated liquid handling robots, plate and tube handling robots as well as incubators, refrigerator and analysis systems as for example an imaging system. the aim is to address the need on reproducibility and reliability of results and to offer access to a maximal controlled and automated environment. with the help of a web-based configurator assay selection as well as parameterization of the assays can be done in an easy way. after the order process, test items can be shipped to the lab. assays will be executed on the fully automated platform. by capturing in process data as well as environmental conditions, a real complete data set is leading to comprehensively results. as soon as results are available during the process, the view and analysing can be done in a secure cloud. the service can be used for single experiments in low throughput applications and is therefore a benefit for labs which cannot afford automated infrastructure or the staff for the maintenance for such platforms. extensive monitoring and data capturing during the run leads to a gapless data trail and the possibility of detailed result analysis. due to automated processing the reproducibility is increased associated with direct reduction of costs and time. the centralized service paired with specific know-how allows up-scaling of processes at any time. the web-based interface provides a flexible guidance for the user and the online order gives / access on the infrastructure, leading to a fast reliable result generation. furthermore the secure interaction with additional services e.g. other specific data analysis tool is possible. this dynamic access to automation offers high flexibility for low throughput experiments and will push high quality research and drug development in early stage. development of alternative animal cell technology platforms: cho based cell-free protein synthesis systems for the production of "difficult-to-express" proteins lena thoring , background nowadays, animal cell technologies are commonly used for a broad range of medical and pharmaceutical applications. one main topic of these technologies is the production of proteins used for therapeutical purposes. these in vivo production processes are often time consuming and limited in production of so called "difficult-to-express" proteins including the pharmaceutical relevant class of membrane proteins. to overcome these issues, novel cell-free protein synthesis platforms were developed based on the industrial working horse cho cells [ ] . cell lysates provide a basis for this technology by including all components of the translational machinery and enabling protein production within a few hours. microsomal structures present in cho cell lysates enable posttranslational modification of target proteins and insertion of membrane proteins into lipid bilayer. in this study a cell-free protein synthesis platform was developed based on a combination of cho cell lysates and a continuous exchange reaction format. the continuous exchange reactor consists of a twochamber system, a reaction and a feeding chamber, separated by a semipermeable membrane. due to concentration gradients, energy components can diffuse to the reaction chamber, while inhibitory byproducts are continuously removed. different classes of proteins were selected to evaluate the quality of the cho cecf system including a transmembrane receptor, a single chain variable fragment and an ion channel. cell-free protein synthesis was performed in the presence of c leucine for radio labeling of synthesized proteins. protein yield was quantified by tca precipitation of radio labeled proteins followed by scintillation measurement and molecular mass was detected by autoradiography. posttranslational modifications and activities of proteins were estimated by kinase assays, elisa, endoglycosidase treatment and electrophysiological measurements. the demonstrated results showed a protein production of up to around g/l while detecting correct molecular weights by autoradiography. analysis of the productivity using different lysate batches by the production of the membrane protein egfr revealed only minimal batch-to-batch variations (fig. a) . posttranslational modifications of proteins, including phosphorylation and glycosylation, were detected using western blot and autoradiography (fig. b) . evaluation of localization of membrane embedded eyfp fusion proteins by confocal laser scanning microscopy resulted in the detection of proteins in the microsomal fraction of cho cell lysate. produced single chain variable fragments showed binding specificity in elisa experiments. the activity of synthesized ion channels was underlined by electrophysiological measurements and detected single channel activities. a cell-free system based on cho cell lysates for high yield production of proteins was developed that provides a platform for efficient production of "difficult-to-express" proteins. the combination of a cho lysate based cell-free system and a continuous exchange cell-free system leads to be a highly efficient production system for various classes of "difficult-to-express" proteins. this approach opens up a fast and cost-effective process pipeline for the production of "difficult-to-express" proteins and shows a high potential for industrial applications including screening technologies, protein structure determination and just-in-time protein production processes. key: cord- - ggaqf authors: nan title: abstracts of the papers presented in the xix national conference of indian virological society, “recent trends in viral disease problems and management”, on – march, , at s.v. university, tirupati, andhra pradesh date: - - journal: indian j virol doi: . /s - - - sha: doc_id: cord_uid: ggaqf nan patients showed rashes on face, hand and foot. ev detection carried out in vesicular fluid, stool, serum and throat swab specimens by rt-pcr of ncr gene. serotyping was carried out by using rt-pcr of viral protein of vp / a junction region followed by sequencing and phylogenetic analysis using neighbor-joining-algorithm and kimura- parameter model of mega- software. overall ev positivity detected in hfmd patients from kerala, tamil nadu, west bengal and orissa states was found to be . %, . %, . % and . % respectively. typing of vp gene sequences indicated presence of ca- , ev- , echo- strains in kerala and ca- in west bengal, orissa and tamil nadu. phylogenetic analysis indicated ca- , ev- , echo- strains showed . - . % and - . % homology with japanese, australian and french strains. however, ca- strains were closer to malaysian strains with . - . % nucleotide homology. the present study documents the association of multiple types of ev's i.e., ca- , ev- , echo- and ca- strains contributing as prime viral pathogens in hfmd epidemics in the reported regions with new emergence of ca- circulating strain in kerala, india. tasgaon september . sera were collected from suspected hepatitis cases and there contacts and tested for anti hev igm/igg antibodies (elisa) and liver enzymes like alanine aminotransferase (alt). anti hev igm antibodies were detected in . % ( / ) of the suspected cases. the overall attack rate was . %. male to female ratio was : . majority ( . %) of the cases were in the age group - years and recovered without any clinical complications. weekly distribution of cases showed that the majority ( . %, / ) cases occurred between nd and rd week of june. dark urine ( . %), jaundice ( . %), fatigue ( . %), abdominal pain ( . %), anorexia ( . %), vomiting ( . %), fever ( . %), giddiness ( . %), diarrhoea ( . %) and arthalgia ( . %) were the prominent symptoms. sera collected from antenatal cases (ancs) showed anti hev igm antibody in . affected pregnant women had a normal outcome. a death of year, male hepatitis e case was reported during the outbreak period that had cirrhosis of liver with oesophageal varices. sanitary survey revealed that water pipelines were laid down in close proximity of sewerage system, and water posts were without tap. these are the likely sources of faecal contamination of water supplies. among water samples collected from various places, were found to be unfit for drinking based on the routine bacteriological tests conducted at state public health laboratory, pune. no case occurred after the pipelines were repaired. this typical outbreak of hepatitis e re-emphasizes need for proper water supply/sewage disposal pipelines and adequate maintenance measures. jayanthi shastri, nilima vaidya, sandhya sawant, umesh aigal department of molecular biology, kasturba hospital for infectious diseases, mumbai, india dengue and dengue haemorrhagic fever are amongst the most important challenges in tropical diseases due to their expanding geographic distribution, increasing outbreak frequency, hyperendemicity and evolution of virulence. the gobal prevalence of dengue has grown dramatically in recent decades. who estimates - million cases of dengue virus infections worldwide every year resulting in , to , cases of dhf and , deaths each year. public health laboratories require rapid diagnosis of dengue outbreaks for application of measures such as vector control. laboratory diagnosis of dengue virus infection can be made by the detection of specific virus, viral antigen, genomic sequence and/or antibodies. currently basic methods used by laboratories for diagnosis of dengue virus infection are virus isolation and characterisation, detection of genomic sequence by nucleic acid amplification technology assay and detection of dengue virus specific antibodies/antigen. molecular diagnosis based on reverse transcription (rt)-pcr s.a. one step or nested pcr, nucleic acid sequence based amplification (nasba), or real time rt-pcr, has gradually replaced the virus isolation method as the new standard for the detection of dengue virus in acute phase serum samples. several pcr protocols for detection have been described that vary in the extraction method, genomic location of primers, specificity, sensitivity and the methods to determine the products and the serotype. pcr-based dengue tests, due to the specificity of amplification, enable a definitive diagnosis and serotyping of the virus. in addition dna sequencing of the amplification product enables the virus to be genotyped, providing important information on the sources of infection. more recently tests have incorporated flurogenic probe, so called taq man technology for the specific real time detection of dengue - amplicons. product is detected by a specific oligodeoxy nucleotide probe that is labelled with carboxy-fluorescein (fam). this technology offers the advantage of being both rapid and potentially quantitative. second, the detection of product by hybridisation of flurochrome labelled probes increases specificity. third, as the product is detected without the need to open the reaction tube, the risk of contamination by product carry over is minimised. the advantages of speed, contamination minimisation and reduced turn around time justify application of this assay over the currently used nested pcr assay. during the period january to october , molecular laboratory received samples from patients presenting with acute onset fever for dengue . %) samples were tested positive by this method. the disease peaks in the monsoon season with a percentage of . %. rapid tests, igm and igg capture elisa are popularly used tests for diagnosis of dengue infection. its utility is limited for diagnosing dengue in convalescecce ( - days) . specificity is also compromised due to infections with flaviviruses: japanese encephalitis and chikungunya. dengue ns ag elisa with its cost effectiveness, specificity and sensitivity should be considered as the test of choice for diagnosing dengue in the acute phase of illness in the developing countries. molecular diagnosis enables confirmatory diagnosis of dengue in the acute phase of the illness and is suitable for further typing methods. assistant general manager and r&d coordinator, division of quality control and r&d, bharat immunologicals and biologicals corporation ltd., village chola, bulandshahr, up vaccine development in india, though slow to start, has progressed by leaps and bounds in the past years. it was dependent on imported vaccines but now it is not only self-sufficient in the production of vaccines conforming to international standards with major supplier of the same to unicef. the role of drug authorities is to enhance the public health by assuring the availability of safe and effective a indian j. virol. (september ) (suppl. ):a -a vaccines, allergenic extracts, and other related products. vaccine development is tightly regulated by a hierarchy of regulatory bodies. guidelines provided by the indian council of medical research (icmr) set the rules of conduct for clinical trials from phase i to iv studies as well as studies on combination vaccines. these guidelines address ethical issues that arise during a vaccine study. a network of adverse drug reaction (adr) monitoring centers along with the adverse events following immunization (aefi) monitoring program provide the machinery for vaccine pharmacovigilance. genetic modifications have been developed to develop effective and cheaper vaccines by the use of recombinant technology. to ensure safety of consumers, producers, experimental animals and environment, governments all over the world are following regulatory mechanisms and guidelines for genetically modified products. as with other industrializing countries undergoing rapid shifts, india clearly recognizes the need to restructure its regulatory system so that its biopharmaceutical industry can compete in international markets. genetic engineering approval council (geac), recombinant dna advisory committee (rdac), review committee on genetic manipulation (rcgm), institutional biosafety committees (ibsc) are responsible for development, commitment for parameters and commercialization of recombinant vaccines. to centralize and coordinate the whole system, government has taken to form two agencies to regulate the regulation laws to develop recombinant pharmaceuticals products including vaccines. the first is the creation of the national biotechnology regulatory authority (nbra), under the department of biotechnology (dbt), as part of india's long-term biotech sector development strategy. the second major initiative will affect the entire indian pharmaceutical industry. this is the replacement of most state, district, and central drug regulatory agencies with a single, central, fda-style agency, the central drug authority (cda). the cda is expected to have separate, semi-autonomous departments for regulation, enforcement, legal, and consumer affairs; biotechnology products; pharmacovigilance and drugs safety; medical devices and diagnostics; imports; quality control; and traditional indian medicines. it will set up offices throughout india and will be paid for inspection, registration, and license fees. its enforcement powers will be strengthened by a new law increasing the criminal penalties for illegal clinical trials. in the manufacturing area, though, the country has been tightening the rules and enforcement. an amendment to the regulations, ''schedule m'' of the drug and cosmetics act, now specifies the good manufacturing practice (gmp) requirements for factory premises and materials. these requirements were modeled after us fda regulations, to improve regulatory coordination between indian and us regulators. india has realized the importance of regulations in pharmaceutical specially in vaccine field but it will take several years to implementation of these. india has coordinated some of its regulatory functions with western organizations. the us pharmacopoeia established an office in hyderabad in . a representative of the indian pharmaceutical lobby also recently has expressed openness to an expansion of the fda's oversight of indian manufacturing. as india expands its global drug and biologicals production, us and europe, as the world's largest drug importers, will likely expand their regulatory support in the development of the country's regulatory systems. rapid diagnosis of japanese encephalitis virus (jev) infections is important for timely clinical management and epidemiological control in areas where multiple flaviviruses are endemic. however, the speed and accuracy of diagnosis must be balanced against test cost and availability, especially in developing countries. an antigen capture enzyme-linked immunosorbent assay (elisa) for detection of circulating jev specific nonstructural protein (ns ) was developed by using monoclonal antibodies (mabs) specific to recombinant (ns ). the applicability of this jev ns antigen capture elisa for early clinical diagnosis was evaluated with acute phase serum/ cerebrospinal fluid (csf) specimens collected from different epidemics during [ ] [ ] [ ] . jev ns antigen was detected in circulation from day to . the sensitivity and specificity of jev ns detection in serum/csf specimens with reference to reverse transcriptase pcr was %, and . % respectively. no crossreactions with any of the other closely related members of the genus flaviviruses (dengue, westnile, yellow fever and saint louis encephalitis (sle) viruses) were observed when tested with either clinical specimens or virus cultures. these findings suggested that the reported jev specific mab-based ns antigen capture elisa will be a rapid and reliable tool for early confirmatory diagnosis as well as surveillance of je infections in developing countries. manmohan parida the recent emergence of a novel human influenza a virus (h n ) poses a serious global health threat. the h n virus has caused a considerable number of deaths within a short duration since its emergence. a two-step single tube accelerated rapid real-time and quantitative swine flu virus specific h rtlamp assay is reported by targeting the h gene of the novel h n hybrid virus. the feasibility of swine flu h rtlamp for clinical diagnosis was validated with a panel of suspected throat wash samples comprising confirmed positive and confirmed negative cases of ongoing epidemic. the comparative evaluation of h specific rtlamp assay with real-time rt-pcr demonstrated exceptionally higher sensitivity by picking up all the h n positive and additional positive cases amongst the negatives that were sequence confirmed as h n . none of the real-time rtpcr positive samples were missed by rtlamp system. the comparative study revealed that rtlamp was -fold more sensitive than rtpcr with a detection limit of copy number. these findings suggested that rtlamp assay is a valuable tool for rapid, real-time detection as well as quantification of h n virus in acute phase throat swab samples without requiring any sophisticated equipments. because of its recurrent nature. despite considerable progress in understanding of the virus at cellular and molecular levels, the proper management of the disease in its different stages is still a dilemma particularly whether to use antiviral or steroids or both. the risk of using steroids with its attendant complications has to be weighed against the risk of progression of the disease if avoiding the use of steroids. this dilemma can be reduced to a considerable extent if basic principles of virology and pathogenesis are kept in mind. this article reviews current concepts of virological and clinical aspects of hsv keratitis to enable a broad understanding of the disease process. it is recognized several influential host factors including the fact that hsk is more common in men than women. it is observed that the ability of hsv to establish latent infection in sensory neurons and possibly cornea, but have as yet been unable to use this knowledge to prevent the disease limitations. acknowledging limitations may further stimulate application of laboratory knowledge in coping with hsk which constitutes to present major challenge in terms of management. mvo- study on effect of human bhsp in immunity of hcv core protein and hbv hbsag there are more than million individuals with hepatitis b and c in the world. in spite of vaccination in the different areas there are several reports about patients who got vaccine before. also there is not efficient vaccine against of hepatitis c and one of the important problems in vaccine project is development of effective and suitable adjuvant in human vaccines. at present research we applied human bhsp protein as adjuvant and chaperon. this protein injected to balbc mice as adjuvant together with recombinant proteins of hcv core and hbv hbsag. then humoral and cellular immune systems of the mice were studied. core and hbsag genes were cloned into petduet- vector and thermal vector of pgp - was used for human heat shock protein expressions. the different combination of these three proteins was injected to mice and we evaluated the total igg and igg a of mice serums after a week. two weeks after booster injection, we studied the proliferation and cytokine secretion of spleen, inguinal and popliteal lymph nodes lymphocytes in vitro and ex vivo conditions. so the core/hbsag + hsp and core + hbsag + hsp complexes induced total igg and igg a secretion. the spleen lymphocytes proliferation were increased equal to serum igg a level that was constant in second time bleeding with significant different to complexes with freund's adjuvant. at first il- and il- cytokines were increased and then decrease of il- meaned no hypersensitivity. the chaperon effect of hsp on structure of core and hbsag proteins was studied by cd and flourometer. it could fold the proteins after heating and unfolding. hepatitis b virus (hbv) infection is vaccine preventable global public health problem. all commercially available vaccines contain one or more of the recombinant hepatitis b envelope protein or surface antigen (hbsag). measurement of antigen responsible for immunogenicity of vaccine is central to quality assessment. the problems associated with the use of a polyclonal antibody in an assay with regard to its poorly defined nature and batch-to-batch variation has been mitigated by the use of mabs as described in this paper. the initial capture of hbsag by the mab could orientate it such that the same antibody could bind to it as a detection antibody after labeling with out steric hindrance. the development of an immuno-capture elisa (ic-elisa) to measure the hbsag content using a monoclonal antibody (mab) specific to determinant ''a'' of hbsag in the experimental vaccine formulations is being discussed. murine mabs developed against hbsag, subtype adw were found to cross-react with the other subtypes viz. ad and ay too. the mabs have been characterized following which, one mab hbs was chosen for developing ic-elisa format for the quantification of the hbsag in the final algel adsorbed vaccines. the unadsorbed hbsag was used to establish the standard curve of hbsag/a. the elisa had a sensitivity of ng/ml of hbsag. the recovery rate of hbsag/a was found to be around % in the vaccines treated to desorb the antigen from algel. twenty seven experimental batches of monovalent hepatitis b vaccines were analyzed for the hbsag content, both by ic-elisa and a commercial kit (axsym kit, abbott laboratories, usa). the statistical analysis of ic-elisa results indicated that an experimental equation f(x) = . (x) + . , could precisely estimate the amount of hbsag in the adsorbed vaccines. the amounts of hbsag recovered from the adsorbed vaccines as estimated by the ic-elisa format had a good correlation with the estimates derived from a commercial kit, which is being used by several vaccine manufacturers in india for the quality control of vaccine antigen. the varying amounts of vaccine antigens that could be recovered seemed to depend on the quality of the hbsag and the methods of hbsag adsorption to the alum gel during vaccine manufacture. epidemiology of the spread of h n virus. children of school going age have become victim of this deadly virus as evident from the reporting data generated in the past few weeks. the mortality rate has also been slightly increased. the disease spread in wave pattern and presently the world is passing through the second wave of pandemic with more severity in young and otherwise health people with a predilection for lungs leading to viral pneumonia and respiratory failure. now the pandemic gained hold in the developing world affecting more severely as millions of people live under deprived conditions having multiple health problems, with little access to basic health care. current data about the pandemic from developed counties need to be very closely watched in relation to shift in virus sub type, shift of the highest death rate to younger populations, successive pandemic waves, higher transmissibility than seasonal influenza, and demographic differences etc. presently the world appears to be better prepared. vaccine is available in market in many countries. even vaccine trials are actively going on in indian population. effective antivirals are available. although till now h n diagnostic centers worked with cdc/who recommended h n specific primer, probes with taqman chemistry by real time pcr, efforts on the development of indigenous diagnostics, vaccines and chemoprophylaxis is going on to have a better combat against this highly infectious virus. were positive for rotavirus infection by either page or elisa methods. the available data highlights the importance of rotavirus as a cause of diarrhea in children, which is severe enough to deserve specialized care. the observed proportion of . % of all diarrhea cases being associated with rotavirus falls within the range of values reported by other workers. the reported positivity varies from . to . %. in our study a complete concordance of elisa and page results were observed in ( %) of the tested specimens. this finding closely correlates with the findings of other authors who found a . - . % concordance results between elisa and page methods. some authors found rna-page method that is as sensitive and rapid as elisa for detecting rotavirus in stool samples of cases of diarrhea and some others proposed elisa is more sensitive than page method fond to be % specific. the remaining ( %) samples showed conflicting results. in a lone sample in which the od value of elisa test was . , this value was almost at the cutoff level, the possibility of this sample being positive by elisa test is doubtful. negative result of the same sample in page method is difficult to explain, the possibility of presence of lot of empty virus particles or due to low concentration of viral rna in the fecal specimen and insufficient extraction of viral rna could be possible. on the other hand, of the samples which gave positive results by page method were negative by elisa test. these samples had a typical - - - rna pattern. the reason for their being elisa negative thus remains unexplained, however blocking factors or the presence of inhibitory substance in stools might have been responsible. the samples containing predominantly complete particles can also give false negative results. since, the group antigen is not exposed. earlier studies have also reported page to be the most sensitive technique although some are of view that it is laborious procedure. how ever, the page system used in this study was very simple to perform and the results were available on the same day. the main requirement was of trained personnel and proper standardization of the technique. most reports states that the greatest advantage of page and silver stain method are its lack of ambiguity and the fact that it provides information about viral electropherotypes. the modified page system was thus found to be reliable, simple and rapid, no expensive reagents were required. locally available reagents from hi media were used. the cost of the chemical for page per specimen was rs. approximately as compared to rs. per test by confirmatory elisa. a locally produced slab gel electrophoresis system with power pack was the only equipment required. this method could be used for the routine diagnosis of rotavirus infection in the laboratory. vaccine, rapid diagnosis plays an important role in early management of patients. in this study a qc-rt-pcr assay was developed to quantify chikungunya virus rna by targeting the conserved region of e gene. a competitor molecule containing an internal insertion was generated, that provided a stringent control of the quantification process. the introduction of -fold serially diluted competitor in each reaction was further used to determine sensitivity. the applicability of this assay for quantification of chikungunya virus rna was evaluated with human clinical samples and the results were compared with real-time quantitative rt-pcr. the sensitivity of this assay was estimated to be rna copies per reaction with a dynamic detection range of to copies. specificity was confirmed using closely related alpha and flaviviruses. the comparison of qc-rt-pcr result with real-time rt-pcr revealed % concordance. these findings demonstrated that the reported assay is convenient, sensitive and accurate method and has the potential usefulness for clinical diagnosis due to simultaneous detection and quantification of chikungunya virus in acute-phase serum samples. in india, measles vaccine was introduced as part of expanded programme of immunization in . measles, mumps and rubella (mmr) vaccine is still not part of the national immunization schedule of india. the indian association of paediatrics (iap) recommends measles vaccine at months of age and mmr vaccine at - months. however, in a recent policy update, iap committee on immunisation opined that there is a need for a second dose of mmr vaccine for providing adequate immunity against mmr. the aim of the present study was to assess the extent of sero-protection against mmr at - years of age in children who have received one dose of mmr between and months of age. an attempt has also been made to assess the sero-response to the second dose of mmr vaccine in - years old children. a total of consecutive children between the ages of - years who had received mmr vaccine between and months of age and attending the immunization clinic of gtb hospital, delhi were enrolled. the vaccination status, anthropometry and physical examination findings were recorded. three ml of venous sample was again withdrawn for estimation of post vaccination antibody titre. it was observed that . %, . % and . % children were seroprotected for mmr respectively after . - . year of receiving first dose of mmr vaccine. seroprotection rose to . %, % and % for mmr respectively after - weeks of receiving second dose of mmr vaccine. geometric mean concentration of antibody also rose significantly in all three diseases. in view of low seroprevalence of mmr and hence high susceptibility to infection at - years of age, who have already received mmr vaccine, there is need to boost the immune responses against these three diseases by giving a second dose of mmr vaccine. baseline information on the epidemiology of viral agents causing stis and types of risk behaviour of affected persons are essential for any meaningful targeted intervention. the present study documents the pattern of viral stis in patients attending a tertiary care hospital, correlating the syndromic approach and the laboratory investigations to determine the aetiology. three hundred consecutive patients attending the sti clinic were diagnosed and categorized according to the syndromic approach of the who along with detailed history and demographic data. majority of the patients were men ( . %) with a mean age of years. men received education up to middle school. half of the female subjects were illiterate. sixty percent of the patients were married and among these, % were regular condom users. first sexual contact at or before years of age was more in men ( % vs. . % in women). promiscuity was more among male patients who had contact with csw. genital herpes was the commonest viral sti ( / ) followed by genital wart ( / ). concomitant infection with more than one virus was seen in % of patients. hiv was prevalent in . % of sti patients. hepatitis b, hepatitis c, herpes simplex type and molluscum contagiosum were the other viral agents seen in sti clinic attendees at our centre. this disease currently prevalent in more than countries world wide and annually - million people are infected with dengue virus among which . - lakhs cases were dengue hemorrhagic fever (dhf) and dengue shock syndrome (dss) which are serious forms of dengue virus infection and due to this condition , deaths might occur annually world wide and approximately million children were hospitalized for the fast decades. this disease is characterized by sudden onset of high fever with sever headache, pain in the back and limbs, lymphadenopathy macuolo-papulur rash over the skin and retro-bulbar pain. early diagnosis can be established with simple and rapid lgg/ gm antibodies detection in the blood samples of the patients based on the bi-directional immunoassay system for its management and control to reduce morbidity and mortality. details will be presented. myocarditis and dilated cardiomyopathy (dcm) are common causes of morbidity and mortality both in children and adults. the most common viruses involved in myocarditis are coxsackievirus b or adenovirus. recently, the coxsackievirus and adenovirus receptor (car), a common receptor for coxsackieviruses b , b and adenoviruses , has been identified. increased expression of car has been reported in patients with dcm suggesting utilization of car by these viruses for cell entry. the present study was designed to study the expression of car in myocardial tissue of patients with dcm. formalin fixed myocardial tissues were obtained from autopsy cases. a total of cases of dcm and cases of controls which included non-cardiac (group-a) and cardiac disease other than dcm (group-b) were included in the study. expression of car was studied by immunohistochemical staining of myocardial tissue using car specific rabbit polyclonal antibody and biotin conjugated secondary antibody. the tissue sections were considered positive when[ % of the cell showed brown color staining by immunohistochemistry (ihc). the car positivity in dcm cases was found to be % ( / ) as compared to % in control group a and % in control group b respectively. the car positivity was significantly higher in the test group as compared to both the control groups. further car positivity in all the cellular types (myocytes, endothelial cells and interstitial cells) was found significantly higher in test group as compared to both the control groups. the expression of car was significantly higher in myocytes as compared to both endothelial and interstitial cells in all the groups. however, no significant difference was observed in car positivity between endothelial and interstitial cells. the present study highlights the increased expression of car in dcm cases with further significance of car expression in myocytes and endothelial cells. this may help further in understanding the tropism of viruses or cellular susceptibility, which in turn will help in appropriate diagnostic and therapeutic approach in management of viral myocarditis and dcm cases. food security and safety vary widely around the world, and reaching these goals is one of the major challenges, raising public concern for the wellbeing of mankind, in particular. industrialized production and processing as well as improper environmental protection have clearly shown severe limitations such as worldwide contamination of the food chain and water. contaminated water and food during the processes of production, processing and handling are essentially responsible for food and water borne viral infections/diseases. the cases of viral food borne outbreaks are on the rise, creating a threat to human health. recent researches indicate that epidemiological studies are meager to focus the frequently contaminated foods and food borne viral diseases. current paper projects the etiology of select food borne viral diseases, probable reasons for non availability of appropriate methods to detect the viruses responsible for the diseases, routes of water and food borne transmission of enteric viral infections, currently available methods of detection of select viruses and bio safety measures to prevent food borne viral infections. dietary/nutritional management in food borne viral diseases is crucial to control weakness and gastro enteric intolerance due to disease condition and antibiotic therapy. it will principally improve food intake, resulting in better nutritional status leading to optimum immune response. food borne viruses are mainly belong to rotaviruses, enteropathogenic viruses, astroviruses, adenoviruses and caliciviruses, causes acute gastroenteritis (ag) which is an important health problem. the frequency of rotavirus as a cause of sporadic cases of ag ranges between . % and . %. astroviruses cause ag, with a frequency ranging between and %: outbreaks have been described in schools and kindergartens, but also in adults and the elderly. the frequency of identification of adenoviruses and as causes of sporadic ag in non-immuno suppressed children ranges between . % and . %, although there is probably underreporting because the sensitivity of conventional techniques is low. caliciviruses are separated phylogenetically into two genera: norovirus and sapovirus. norovirus is frequently associated with food-and water-borne outbreaks of ag. it is estimated that % of cases of ag due to norovirus are food borne. in sweden and some regions of the united states, norovirus is the first cause of outbreaks of food borne diseases. sapovirus outbreaks due to person-to-person and food borne transmission affecting both children and adults have recently been reported in countries such as canada and japan. it has been predicted that the importance of diarrhoeal disease, mainly due to contaminated food and water, as a cause of death will decline worldwide. evidence for such a downward trend is limited. this prediction presumes that improvements in the production and retail of microbiologically safe food will be sustained in the developed world and, moreover, will be rolled out to those countries of the developing world increasingly producing food for a global market. sustaining food safety standards will depend on constant vigilance maintained by monitoring and surveillance but, with the rising importance of other food-related issues, such as food security, obesity and climate change, competition for resources in the future to enable this may be fierce. in addition the pathogen populations relevant to food safety are not static. food is an excellent vehicle by which many pathogens (bacteria, viruses/prions and parasites) can reach an appropriate colonization site in a new host. although food production practices change, the well-recognized food-borne pathogens, such as salmonella spp. and escherichia coli, seem able to evolve to exploit novel opportunities, for example fresh produce and even generate new public health challenges, for example antimicrobial resistance. in addition, previously unknown food-borne pathogens, many of which are zoonotic, are constantly emerging. awareness and surveillance of viral food-borne pathogens is generally poor but emphasis is placed on norovirus, hepatitis a, rotaviruses and newly emerging viruses such as sars. it is clear that one overall challenge is the generation and maintenance of constructive dialogue and collaboration between public health, veterinary and food safety experts, bringing together multidisciplinary skills and multi-pathogen expertise. such collaboration is essential to monitor changing trends in the well-recognized diseases and detect emerging pathogens. it is also necessary to understand the multiple interactions between these pathogens and their environments during transmission along the food chain in order to develop effective prevention and control strategies. to analyse the effectiveness of these sirnas targeting rabies virus l gene, the bhk- cells expressing sirnas in shrna form were produced by transduction of cells with radv-l. the transduced bhk- cells expressing sirna were infected with rabies virus pv- strain. there was reduction in rabies virus multiplication as analysed by reduction in fluorescent foci forming unit (ffu) count by . % ( ffu in bhk- cells expressing sirna-l compared to ffu in bhk- cells expressing negative sirna). the expression of l gene mrna was reduced by . fold in rabies virus infected radv-l transduced cells compared to radv-neg transduced cells (negative control) as detected using real-time pcr. after analyzing the effectiveness of radv-l in vitro, its effectiveness was also evaluated in vivo in mice after virulent rabies challenge. the mice were inoculated with plaque forming units (pfu) of radv-l in masseter muscle (i/m route) and challenged with ld rabies virus challenge virus standard (cvs) strain. the results indicated % protection with improved median survival from to days compared with group of mice treated with radv-neg. the results of this study indicated that sirnas targeting rabies virus polymerase (l) gene delivered through adenoviral vector inhibited rabies virus multiplication in vitro and in vivo. and were successfully produced and purified from the infected spodoptera frugiperda (sf- ) cells using these recombinant baculovirus. the morphology of the vlps was validated by electron microscopy in comparison to the authentic bt virions. the vlps produced here were stable and were highly immunogenic with intact outer layer which is rapidly lost during normal infection of btv. these btv-vlps elicited long lasting protective immunity in vaccinated sheep against virulent virus challenge. with the use of btv-vlps it was also possible to differentiate the infected and vaccinated animals (diva). vlp-based btv vaccine has potential advantages with regard to controlling the spread of btv with multiple serotypes. it is possible to produce milligram quantities of correctly folded and processed protein complexes using this baculovirus expression system and hence it is a more promising system for producing new generation vaccines like vlp subunit vaccine against any viral diseases in large scale. peste des petits ruminants (ppr), goatpox and orf are oie notifiable diseases of small ruminants especially goat and sheep. these diseases are economically important, in enzootic countries like india and cause significant loss and are major constraints in the productivity. considering the geographical distribution of ppr, goat pox and orf infections and prevalence of mixed infection, in the present study, safety and potency of the experimental triple vaccine comprising attenuated strains of thermostable-ppr virus (pprv jhansi, p- ) grown at °c, high passaged goat poxvirus (gtpv uttarkashi, p ) and attenuated orf virus (orfv mukteswar, p ) was evaluated in sub-himalayan local hill goats. goats simultaneously immunized with ml of vaccine consisting of either tcid or tcid of each of pprv, gtpv and orfv were monitored for clinical and serological responses for a period of - weeks post-immunization (pi) and post challenge (pc). specific immune responses i.e., antibodies directed to pprv, gtpv and orfv could be demonstrated by ppr competitive elisa kit and capripox indirect elisa, snt, respectively following immunization. all the immunized animals resisted infections when challenged with virulent strains of either gtpv or pprv or orfv on day dpi, while in contact control animals developed characteristic signs of respective disease. further, ppr viral antigen could be detected by using ppr sandwich elisa kit in the excretions (nasal, ocular and oral swab materials) of unvaccinated control animals after challenge but not from any of the immunized goats. triple vaccine was found safe at dose as higher as tcid and induced protective immune response even at lower dose ( tcid ) in goats, which was evident from sero-conversion as well as challenge studies. the study indicated that these viruses are compatible and did not interfere with each other in eliciting immune response, paving the feasibility of use of this triple vaccine in combating these infections simultaneously. toll like receptors (tlrs), primary sensors of microbial origin, plays a crucial role in the innate immunity. till now mammalian tlrs have been identified, while there is no information available on tlrs of yak. this study is part of world bank funded-icar project. yak, named bos grunniens for its distinctive vocalization and relationship with cattle, is natural habitant of extremely cold environment. when these animals comes to a lower altitude grazing land, adjacent to villages, become susceptible to the diseases of cattle, buffalo etc. thus, present study was undertaken to with genetic characterization and evolutionary lineage analysis of yak tlrs. we worked on tlr gene, which plays an important role in recognition of ssrna viruses. total rna was extracted from mitogen stimulated pbmcs of yak. the rt-pcr conditions were standardized for full length amplification of tlr gene using specific self designed primers. the expected amplicon of bps was obtained. it was cloned in pgemt-easy vector followed by transformation in e. coli top strain. the recombinant clones were screened, picked up for plasmid isolation and release of tlr was confirmed by restriction digestion. the cloned tlr product was sequenced and analyzed for the nucleotide and deduced amino acid sequences, and d structure analysis. the results revealed that yak shows more than % sequence homology with other bos indicus breeds and bos taurus breeds. however, identity was less than % with other animal species (equine, murine, feline, canine etc.). the evolutionary lineage findings cluster yak more closely with bovine species. point mutations revealed changes at nucleotide positions with corresponding amino acid change at positions. smart analysis of yak protein domain architecture revealed toll-interleukin i receptor (tir), leucine rich repeats (lrr) and signal peptide region. the variations in yak mainly lie in the lrr region. homology modeling revealed horse shoe shaped structure with alpha helix. the additional alpha helix present in bos indicus was not detected in yak. the present study shows existence of genetic variability in tlr gene of yak, in particular the lrr region, which plays an important role in the pathogen recognition and the evolutionary lineage analyses shows its closeness with other bovine species. a.p. aquaculture and fisheries, tirupati in this new millennium, aquatic animal health management strategies in asia expanded and adjusted to the current disease problems faced by the aquaculture sector. this presentation will briefly discuss some of the most serious trans-boundary pathogens affecting asian aquaculture including a newly emerging disease and highlight recent regional and national efforts on responsible health management for mitigating the risks associated with aquatic animal movement. a regional approach is fundamental since many countries share common social, economic, industrial, environmental, biological and geographical characteristics. capacity and awareness building on aquatic animal epidemiology, science-based risk analysis for aquatic animal transfers, surveillance and disease reporting, disease zoning and establishment of aquatic animal health information systems to support development of national disease control programs and emergency response to disease outbreaks are needed. molecular diagnostics with emphasis towards standardization and harmonization, inter-calibration exercises and quality assurance in laboratories, accreditation program and utilization of regional resource centres on aquatic animal health will also be needed. whilst most of these strategies are directed in support of government policies, implementation will require pro-active involvement, effective cooperation and strategic networking between governments, farmers, researchers, scientists, development and aid agencies, and relevant private sector stakeholders at all levels. their contributions are essential to the health management process. generally, aquaculture plays an important role in economy as harvests from natural waters have declined or, at best, remained static in most countries. fish and shrimp, the main aquaculture product sources, have gained the most attention. many factors can cause losses in yields of fish products and infectious disease in fish and shrimp is the biggest threat to the fishery industry. shrimp and fish aquaculture has grown rapidly over several decades to become a major global industry that serves the increasing consumer demand for seafood and has contributed significantly to socio-economic development in many poor coastal communities. however, the ecological disturbances and changes in patterns of trade associated with the development of shrimp and fish farming have presented many of the pre-conditions for the emergence and spread of disease. shrimp and fish are displaced from their natural environments, provided artificial or alternative feeds, stocked in high density, exposed to stress through changes in water quality and are transported nationally and internationally, either live or as frozen product. these practices have provided opportunities for increased pathogenicity of existing infections, exposure to new pathogens, and the rapid transmission and trans boundary spread of disease. not surprisingly, a succession of new viral diseases has devastated the production and livelihoods of farmers and their sustaining communities. this review examines the major viral pathogens of farmed shrimp and fish, the likely reasons for their emergence and spread, and the consequences for the structure and operation of the shrimp farming industry. in addition, this review discusses the health management strategies that have been introduced to combat the major pathogens and the reasons that disease continues to have an impact, particularly on poor, smallholder farmers in asia. btv isolates from the same geographic region have been termed as 'topotypes' and initial observation on segment nucleotide sequences identified a correlation between topotypes and genetic information. later topotyping was proposed based on segment , on the premise that the encoding protein ns , which is involved in virus egress from insect cells, would lead to evolutionary fitness in parallel with the geographic distribution of the different culicoides species. further studies attempted to extend this to nucleotide sequence homology in segments and , but failed to identify clear cut correlations or any evidence for positive selection. for example, south african isolates were found not to cluster into separate african lineage. in this study, we carried out a more extensive analysis of segment sequences. our analysis showed no segregation of isolates into topographically distinct groups. instead we observed topological clustering of the clades, and we attribute this to genetic bottleneck resulting in genetic drift and founder effect leading to homogenous gene pool in a geographical area. we hypothesize that when a new virus enters a geographical area where local btv strains are already circulating, the new genes/segments would enter into a bigger gene pool. consequently, the newer incursions into a heavily endemic area tend to get diluted and disappear from the population because the rate of drift is inversely proportional to the population size, unless they are positively selected. use of live attenuated vaccine in israel, europe, south africa and usa also led to more homogenous population similar to the vaccine strains due to continuous infusion of the vaccine type genes into the gene pool. we conclude that restriction of specific strains to certain geographical areas could generate uniquely imprinted genotypes which would not only indicate origin but also predict movement of viral strains to new areas. vvo- viral diseases of zoonotic importance: indian context k. prabhudas pd-admas, ivri, campus, hebbal, bangalore zoonoses are generally defined as animal diseases that are transmissible to humans. they continue to represent an important health hazard in most parts of the world, where they cause considerable expenditure and losses for the health and agricultural sectors. the emergence of these zoonotic diseases are very distinct, hence their prevention and control will require unique strategies, apart from traditional approaches. such strategies require rebuilding a cadre of trained professionals of several medical and biologic sciences. the article discusses virus infections that have significant zoonotic implications for india. buffalopox is a contagious viral disease affecting milch buffaloes and rarely, cows, with a morbidity rate up to % in the affected herd. although the disease is not responsible for high mortality, it adversely affects the productivity of the animals, resulting in large economic losses. furthermore, the disease has zoonotic implications, as outbreaks are frequently associated with human infections, particularly in the milkers. the causative agent, buffalopox virus (bpxv), is closely related to vaccinia virus. the outbreaks of febrile rash illness among humans and buffaloes were investigated in the villages of districts solapur and kolhapur of western maharashtra. clinico-epidemiological investigations of humans and buffaloes were carried out and representative clinical samples were collected respectively. the samples include vesicular fluid, scab, and blood. laboratory investigations for buffalo-pox virus (bpxv) was done by pcr on blood samples, scabs and vesicular fluid. in vitro virus isolation attempts were carried out by using vero e- cells. negative staining electron microscopy was also employed for detection of virus particles. a total of human cases with pox lesions on hand and other body parts from village kasegaon, district-solapur and cases from different villages of kolhapur district were reported. besides pox lesions patients were having fever, malaise, pain at site of lesion and axillary and inguinal lymphadenopathy. in kasegaon village, attack rate in human cases was . % and in buffaloes . % ( / ). whereas in kolhapur area attack rate in buffaloes was . % ( / ). bpxv was confirmed in blood, vesicular fluid and scab specimens from human cases and scab specimen from buffalo by polymerase chain reaction (pcr) method. the bpxv was also isolated from different clinical specimens and further identified by pcr and electron microscopy. clinical manifestation of the disease in buffaloes from solapur district was as reported earlier like common pox lesions on teats and udders whereas the buffaloes from kolhapur district had lesions on hairless parts of ears and on the eyelids with purulent discharge. bpxv from human and buffalo cases showed similarity. vaccines have been made against several diseases and used for controlling the afflictions. however a few of them were not effective for successfully controlling the disease. the reasons for the failure are many, the major being, either the pathogen is not completely cleared from the vaccinated animal or it reemerges after changing its antigenic structure, thus making the vaccination programme less effective. in addition to this, emergences of newer diseases such as hiv the development of suitable vaccines have become a challenging task. this is especially true in the case of viral diseases. these challenges have warned the researchers ''that protection by vaccination is not that simple and strait forward approach'', and lot need to be understood in terms of host virus interaction and role of environment in perpetuating the disease. so the immediate step that was considered was the environmental safety by way using non infectious materials as vaccines. with the understanding that has been developed in molecular immunology and molecular biology and with the availability of molecular tools that have been developed through recombinant dna technology the field of vaccinology has changed dramatically to emerge as modern vaccinology. this presentation deals with the modern approaches that are being used to produce effective vaccines in the case of foot and mouth disease of cloven footed animals. the similar approach may be worked out for other viral diseases also. despite the availability of an inactivated vaccine that is noted to provide solid immunity against the disease over a short period of time, the search for an ideal vaccine, the criteria for which are; safety of the vaccine for environment, easy in its preparation, does not require a cold chain for its storage, provides longer lasting immunity, economically viable and may be able to clear the virus in case of persistent infection is on. the advent of recombinant dna technology together with the information available on the molecular biology of viruses has enabled to design the development of newer vaccines that can induce strong cellular and humoral responses. the underlying principal in the present vaccine development strategy world over is the virus antigen gene has to be expressed in the tissue and the vaccine backbone has to trigger the immune system for eliciting desired immune response. bangalore campus of ivri has been vigorously pursuing research to develop ideal vaccines for foot and mouth disease keeping above principal in mind to achieve the previously mentioned criteria. the approaches selected are to see that the virus antigen/s replicate transiently in the host. the self replicating vaccines that have been developed are pox virus vectored vaccines, alpha virus replicase based vaccines and fmdv vectored vaccines. the approach and the result obtained so far will be discussed. silkworm, bombyx mori is affected with various diseases caused by viruses viz., nuclearpolyhedrosis (bmnpv), densosnucleosis (bmdnv) and infectious flacherie (bmifv). silkworm viral diseases form major constraints for the silk cocoon production in all the sericultural countries. the losses due to silkworm diseases is estimated about - % and among them viral diseases are most common. in sericulture, prophylactic measures play a vital role in the management of silkworm diseases. these include disinfection of silkworm rearing house and appliances, rearing area, rearing surroundings, silkworm egg and body, and rearing bed disinfection associated with maintenance of general hygiene and personnel hygiene. all these activities are generally carried out as rituals by using general disinfectants often with partial success. recent trends in complete management of silkworm diseases include development of silkworm hybrids evolved from disease resistant/tolerant breeds, effective eco-and user-friendly disinfectants, anti-microbial feed-supplements and use of transgenic silkworms. biotechnological breakthrough in this regard is through rna interference (rnai) approach involving dsrna mediated nuclear polyhedrosis management and this is presently pursued by apssrdi, hindupur in collaboration with centre for dna fingerprinting and diagnostics (cdfd), hyderabad. nadu and karnataka. the disease appears to be more severe in rural flocks than organized farms. our investigations revealed the morbidity, mortality and case fatality rates among rural and organised farms as . %, . %, . % and . %, . %, . % respectively. higher morbidity and mortality in rural areas may be due to stress factors like poor nutrition, parasitic burden, fatigue due to long walks and non availability of veterinary aid. kulkarni et al. also reported the severe bt outbreaks in rural areas of maharashtra with overall morbidity, mortality and case fatality of %, % and % respectively. all the south indian sheep breeds were found to be susceptible and clinical farm of the disease is evident in all of them though saravanabava ( ) reported variations in susceptibility among the indigenous sheep. trichy black and ramnad white sheep were found to be more susceptible than the vambur and mecheri sheep of tamil nadu. prevalence of bluetongue in sheep, goat and cattle appears to be high in the region. serological surveys conducted in andhra pradesh during revealed the prevalence of btv antibodies in sheep ( . %) goats ( . %) cattle ( %) and buffaloe ( %). similar high prevalence of btv antibodies in sheep and goats were also reported from the other states in the region. clinical disease has not been recorded in kerala though btv antibodies were recorded in sheep ( . %) and goats ( . %) (ravi sankar ) . culicoides are the known biological vectors of btv. all the culicoides species are not capable of transmitting the btv. the occurrence of the disease is related to the presence of the competent vectors in the area. jain et al. ( ) established the involvement of the culicoides in transmitting the btv by isolating the virus from culicoides at haryana, the north indian state. c. imicola and c. oxystoma were found to be prevalent in andhra pradesh and tamil nadu. narladakar et al.( ) reported the presence of c. schultzei, c. perigrinus and c. octoni in marathwada region of maharastra. culicoid vectors are significantly affected by the climate and annual variations in the climate reflects the outcome of the disease. the monsoon season (june to dec) with the temperature ranging from . to . °c appears to be favourable period for the multiplication of culicoides. the maximum no of outbreaks were recorded during the north east monsoon period (oct-dec) followed by south west monsoon period (june to sep) in the region. however, details on the distribution of the competent vectors, feeding habits and their dynamics in the region is lacking multiple btv serotypes were found to be circulating in the region. (kulkarni and kulkarni ; janakiraman etal. ; mehrotra et al. ) a total of serotypes viz. - , , , , , and were identified based on the virus isolations. sreenivasulu et al. isolated btv serotype from an outbreak of bt in native sheep of andhra pradesh. btv serotype , and were also isolated from the outbreaks occurred in andhra pradesh. some of the isolates need to be serotyped. deshmukh and gujar ( ) isolated btv type from maharashtra. following is the summary of the distribution of btv serotypes in this region. clinical picture of bt in native sheep appears to be slightly different, the major difference being that swelling of lips and face was less conspicuous. mucocutaneous borders appeared to be very sensitive to touch and bleed easily upon handling. the classical signs of cyanosis of tongue and reddening of coronary band are not the common features of the disease in native sheep. the disease was also confirmed by the virus isolation and identification. clinical disease has not been reported in cattle, buffaloes and goats in spite of high seroprevalence. in conclusion bt is established in native sheep and causes severe economic losses to the farmers. the disease is concentrated in the southern peninsula of the country. the disease is seasonal and is associated with the rain fall. multiple serotypes appear to be circulating in this region. the btv serotypes were of virulent in nature as evident by severe outbreaks. s. janardana reddy*, d. c. reddy department of fishery science and aquaculture, sri venkateswara university, tirupati in less than three decades, the penaeid shrimp culture industries of the world developed from their experimental beginnings into major industries providing hundreds of thousands of jobs, billions of u.s. dollars in revenue, and augmentation of the world's food supply with a high value crop. concomitant with the growth of the shrimp culture industry has been the recognition of the ever increasing importance of disease, especially those caused by infectious agents. in india viral diseases have become an important limiting factor for growth of shrimp aquaculture industry. although more than different viral pathogens have been identified in different species of shrimp world wide, only a few viruses have identified which are causing disease problems in cultured tiger shrimps in india, east coast of andhra pradesh, in particular. diagnostic methods for these pathogens include the traditional methods of morphological pathology (direct light microscopy, histopathology, and transmission electron microscopy), enhancement and bioassay methods, traditional microbiology, and the application of serological methods. while tissue culture is considered to be a standard tool in medical and veterinary diagnostic labs, it has never been developed as a useable, routine diagnostic tool for shrimp pathogens. the need for rapid, sensitive diagnostic methods led to the application of modern biotechnology to penaeid shrimp disease. the industry now has modern diagnostic genomic probes with nonradioactive labels for viral pathogens like infectious hypodermal and hematopoietic necrosis (ihhnv), hepatopancreatic virus (hpv), taura syndrome virus (tsv), white spot syndrome virus (wssv), monodon baculo virus (mbv), and bp. highly sensitive detection methods for some pathogens that employ dna amplification methods based on the polymerase chain reaction (pcr) now exist, and more pcr methods are being developed for additional agents. these advanced molecular methods promise to provide badly needed diagnostic and research tools to an industry reeling from catastrophic epizootics and which must become poised to go on with the next phase of its development as an industry that must be better able to understand and manage disease. within this field, shrimp immunology is a key element in establishing strategies for the control of diseases in shrimp aquaculture. research needs to be directed towards the development of assays to evaluate and monitor the immune state of shrimp. the establishment of regular immune checkups will permit the detection of shrimp immunodeficiencies but also to help monitor and improve environment quality. for this, immune effectors must be first identified and characterised. in the end, however, the assumption may be made that the sustainability of aquaculture will depend on the selection of disease-resistant shrimp, i.e. to develop research in immunology and genetics at the same time. the development of strategies for prophylaxis and control of shrimp diseases could be aided by the establishment of a collaborative network to contribute to progress in basic knowledge of penaeid immunity. however, to improve efficiency, it appears essential also to open this network to complementary research areas related to shrimp pathology, physiology, genetics and environment. bluetongue is an important viral disease of sheep causing severe economic losses to the farmers. lack of effective vaccine is the major impediments in controlling the disease. multiple serotypes were found to be circulating in the state. attempts are being made to develop the vaccine employing the available serotypes to control the disease. hence, it is essential to identify the antigenic relationship among the serotypes to identify the candidate vaccine strains to be incorporated in the preparation of vaccine. reciprocal cross neutralization test was employed to find out the r% values between btv- , - and - which indicated the extent of antigenic relationship between the serotypes. r% value between btv- and btv- was recorded as . r% value of . and . were observed between btv- and - and btv- and - respectively. the r% values recorded in the present study revealed a weak antigenic relationship between the btv serotypes. the extent of antigenic relationship between the btv serotypes was also determined by multiple sequence alignment of the nucleotide and amino acid sequences of the reference btv serotypes , and . the sequence analysis of the vp gene revealed a homology of - % and - % at the nucleotide and amino acid levels respectively. r% values obtained using reciprocal cross neutralization test with the btv- , and serotypes isolated in native sheep of andhra pradesh and the genomic analysis of the reference serotypes of btv- , and revealed very weak antigenic relationship and were highly divergent. diseases especially those by viral pathogens cause greater economic losses in most horticultural crop species throughout the world as compared to agricultural crops. non-genetic methods of management of these diseases include quarantine measures, eradication of infected plants and weed hosts, crop rotation, use of certified virus-free seed or planting stock and use of pesticides to control insect vector populations implicated in transmission of viruses. however, none of these measures is likely to provide an enduring solution against these diseases especially those caused by viruses due sometimes to the huge expenditure involved, but mostly to the questionable effectiveness and reliability of those methods. as key control pesticides are getting increasingly abandoned, development of alternative methods to control diseases has been a felt-need in the recent past. though breeding for disease resistance generally provides a reliable security in a long run, introgression of host plant resistance did not materialise in most important crops. non-availability of an appropriate source of resistance in inter-fertile relatives, linkage to undesirable traits, or often times polygenic nature of such sources of resistance are the stumbling blocks in breeding programs. the limitations of conventional breeding and routine cultural practices prompted the need for the development of other approaches of virus control that could be fully incorporated into traditional methods. in this perspective, the concept of pathogen-derived resistance offers an attractive strategy to evolve newer methods of virus management, by transforming crop plants with nucleotide sequences derived from the pathogen's genome. an increasing number of molecular characterisation of plant virus genomes and the stable transformation of a number of horticultural crop species have in fact opened an avenue for molecular breeding against virus pathogens. successful field-testing of genetically modified crop cultivars renders proof of their supremacy over existing cultivars. it also contributes to demonstrate their capability with regard to environmental safety with a view to winning over public concern and scepticism. in general, the eventual commercialisation transgenic lines expressing virus resistance will rely upon a host of factors including their field performance, genetic stability, public acceptance and the resolution of environmental concerns and patent related issues. as such, elaborate field trials and allied studies are now required to adapt genetically engineered horticultural crops expressing virus resistance for their implementation into practical agriculture. a few examples from current research at tnau, in india or elsewhere will be discussed in this presentation. virology unit, division of plant pathology, iari, new delhi in recent times there has been greater emphasis on vegetatively propagated crops in india to help diversify the indian agriculture. fruit, flower, spice and plantation crops are important vegetatively propagated horticultural crops, which have become a driving force for economic development in several parts of india. however, most of the vegetatively propagated crops are threatened by biotic stress caused by plant pathogens in general and plant viruses in particular. plant viruses produce specific and non specific symptoms and in some cases no symptoms are produced. correct identification and diagnosis of viral diseases is first step in the management of any disease including viral diseases. there have been two major breakthroughs in virus diagnostics during last four decades. the first one was serological assay using monoclonal or polyclonal antibodies in enzyme linked immunosorbent assay (elisa) and the other one was the use of in vitro amplification of dna in polymerase chain reaction (pcr). a significant development in serological assays has been its simplification in form of user's friendly quick strip/dip stick method. the one-step lateral-flow (lf) tests have been developed for the on-site detection and identification of several plant viruses. rapid advancement in virus genome characterization has led to the development of novel approaches of nucleic acid based diagnostics which include conventional pcr, real time pcr, multiplex pcr, micro/macro arrays and biochips. pcr protocols already exist for many plant viruses of citrus, banana, apple, papaya, vegetables, ornamental and spice crops. a further advancement has led to development of realtime pcr assay which is relatively easy but requires training for diagnosticians. in real-time pcr assays, results can be available within min. the nucleic acid template preparation in pcr has been simplified. membrane based dna template protocol and co-isolation of nucleic acid template preparation are novel approaches in pcr detection of virus and virus like pathogens. since many of the horticultural crops are often infected by more than one virus, their individual detection by pcr is not only expensive but also time consuming. therefore, multiplex pcr has been developed where in genome of more than one virus could be amplified and detected in the same reaction mixture. development of nucleic acid based chip is now one of the fastest and recent growing areas in the field of pathogen detection. these nucleic acid based chips have been named as dna/rna chips, biochips, genechips, biosensors or dna arrays. when it comes to applications of microarray technology for plant viruses, it is not too difficult to see the value of a method that could potentially detect a whole range of viruses using a single test. however, microarrays are unlikely to become the only method in use in a diagnostic laboratory. processing of germplasm including transgenic planting material imported for research purposes into the country. during the last two decades, a total of , samples of wheat including transgenics were imported from cimmyt (mexico), icarda (syria) and many other countries. these were grown in post-entry quarantine nursery each year at nbpgr, new delhi and the transgenic samples were grown in national containment facility of level- (cl- ) since its inception to ensure that no viable biological material/pollen/pathogen enters or leaves the facility during quarantine processing of transgenics. in addition, post-entry quarantine inspections of the transgenic wheat grown by indenters are also undertaken by nbpgr quarantine scientists. virus-induced gene silencing (vigs) is a technique in which viral genomes are used, usually after appropriate modifications, for transient gene silencing in plants. the mechanism behind vigs is the phenomenon called rna-interference (rnai), which is widespread in many organisms and is believed to be form of inherent defence system against intracellular pathogens, such as viruses and transposons. double-stranded rna or rna containing strong secondary structures, commonly produced during viral infections, are believed to cause triggering of rnai, which employs a battery of proteins and nucleoprotein complexes to identify and degrade specific viral transcripts. in vigs, viral genomes not causing severe symptoms, but which can accumulate and spread efficiently in the host plant are used as vectors in which a host gene is cloned and introduced into the plant. upon replication, the viral vector triggers rnai response in the host plant, which also targets the host gene, leading to its silencing and subsequently, the silenced phenotype revealing gene function in vivo. vigs has been used extensively to study gene functions in dicot plants, such as tobacco, tomato, pea, soybean, etc., using vectors derived from reference genes are commonly used as an/the endogenous normalisation measure for the relative quantification of target genes. the expression (characteristics) of seven potential reference genes was evaluated in tissues of healthy, physiologically stressed and barley yellow dwarf virus (bydv) infected cereal plants. these genes were tested by rt-qpcr and ranked according to the stability of their expression (characteristics) using three different methods (two-way anova, genorm and normfinder tools). in most cases, the expression (characteristics) of all genes did not depend on the abiotic stress conditions or on the virus infections. all the genes showed significant differences in expression (characteristics) among plant species. glyceraldehyde- -phosphate dehydrogenase (gapdh), beta-tubulin (tubb) and s ribosomal rna ( s rrna) always ranked as the three most stable genes. on the other hand, elongation factor- alpha (ef a), eukaryotic initiation factor a (eif a), and s ribosomal rna ( s rrna) for barley and oat samples; and beta-tubulin (tubb) for wheat samples were consistently ranked as the less reliable controls. the bydv titre was determined in two oat varieties by rt-qpcr by three different quantification approaches. statistically, there were no significant differences between the absolute and the relative quantification, or between quantification using gapdh + tubb + tuba + s rrna and ef a + eif a + s rrna. the geometric average of gapdh, s rrna, tuba and tubb is suitable for normalisation of bydv quantification in barley and oat tissues. for wheat samples, a combination of gapdh, s rrna, tubb, eif a and e fa is recommended. department of microbiology, yogi vemana university, vemanapuram, kadapa large scale production and import of propagative material poses potential risk of introducing several destructive pathogens particularly viruses and mycoplasma like organisms in our country. this demands adequate quarantine safe guards such as growing them under approved post entry quarantine facility for specific period so as to facilitate virus detection, thereby curtailing risk. when such facilities are coupled with propagation by tissue culture will ensure virus free propagative plant material. the requirement of nationwide network of post entry quarantine facility working in close collaboration with crop institutions are very much emphasized for considering import of high risk plant genera for agriculture development. present paper discusses about virus disease of quarantine importance affecting ornamental and fruit plants such as chrysanthimum, dahlia, dianthus, rosabengalensis, cattleya, cymbidium, dendrobium, lilium, citrus, vitis etc. the paper also discusses on immunodiagnostic methods of detection and methods of obtaining virus free propagative material. rice tungro occurs as epidemics in regular cycles and has been reported in the last years from all the major rice growing regions of india, especially prevalent in the southern and eastern states. development of the durable resistant varieties to tungro is crucial for the management of the disease. molecular breeding, involving the use of dna markers linked to the resistant gene(s) for selection, can overcome the difficulties encountered in conventional resistant breeding programs. for successful marker-assisted selection (mas), the identification of closely linked markers through the process of gene tagging and mapping is a prerequisite. attempts have been initiated for identification of tungro resistance genes through molecular mapping and their introgression into the target varieties using marker-assisted selection at drr, hyderabad. the inheritance of resistance to rice tungro virus disease was studied in seven resistant rice cultivars with field evaluation at hot spot locations. the microsatellite markers linked to rice tungro resistance in utri merah was studied and found that resistance genes were linked to rm on chromosome . through molecular mapping two qtl were identified controlling rtv resistance on chromosomes and in 'utri rajapan' explaining . % and . % of the phenotypic variance. in variety 'vikramarya', another two qtl for rtv resistance were detected on chromosomes and explaining . % and . % of the phenotypic variance. the closely linked markers identified in this study flanking the gene of interest through mapping will improve the efficiency and precision of introgression programs in marker assisted breeding for rtv resistance. functional characterization of these qtl for rtv resistance is under progress. there is only a limited pool of natural virus resistance in cassava against cassava mosaic geminiviruses and cassava brown streak ipomovirus hence the development of transgenic resistance in this significant crop might present an option. rna mediated resistance through the expression of inverted-repeat dsrna sequences derived from the virus genome and the modification of plant microrna to produce antiviral artificial microrna are strategies that have recently been proven very effective for induction of virus resistance (immunity) against a number of rna viruses. results from rna interference strategies against geminiviruses never resulted in immunity of transgenes. however, it suggest that viral mrna are targets of rna silencing and that the success of the strategy depends on the relevance of the target gene in the systemic spread of the virus. we have generated a number of rna silencing constructs to induce resistance against cbsv and the indian cassava mosaic viruses icmv and slcmv. due to the serious problems inherent with transformation of cassava and subsequent resistance screening, these constructs were tested for efficiency either by transient-or by transgenic expression in n. benthamiana. complete immunity was reached in transgenic n. benthamiana against cbsv using inverted repeat or amirna constructs. using different species of cbsv for resistance screening, immunity was broken, to show the minimum context for broad spectrum resistance. similarly, highly specific resistance was reached in expression of amirna. in contrast, virus resistance against icmv/ slcmv using single amirna constructs was not successful. results from the experiments to generate virus resistance against cbsv and icmv/slcmv will be shown; methods to evaluate efficiency of rnai gene constructs by transient gene expression in n. benthamiana and strategies to develop efficient resistance against rna and dna viruses in cassava will be discussed. bitter gourd (momordica charantia l.) which is also called bitter melon, balsam apple and balsam pear belongs to family cucurbitaceae. it is an important traditional vegetable of nutritive and medicinal value that is cultivated in tropical and sub-tropical asia, but is considered as a weed host reservoir for viruses in jamaica. viral disease-like symptoms were observed occurring naturally on the crops of bitter gourd grown in the fields of northern india during - . an incidence of . % of diseased plants was recorded which showed chlorotic spots and mosaic ranging from mild mottling to green blisters along with leaf smalling, leaf and fruit deformations, bud necrosis and stunted growth whereas . % plants exhibited leaf curling alone or in combination with mosaic-type disease. a reduction of . % in fruit yield was recorded in mosaic-like disease which could be attributed to lesser fruit setting due to bud necrosis, smaller fruit size and stunted plant growth. such plants produced deformed, notched, irregularly shaped fruits wherein pre-mature yellowing and necrosis on the anterior and posteriors ends made . % fruits unfit for marketability. the dwindling yield and production of unmarketable fruits posed a major constraint for profitable cultivation of this economically important crop, thus warranting for studies on etiology and management of these diseases. the mosaic-like disease was transmitted to healthy seedlings of bitter gourd at -leaves stage by sap inoculation as well as by aphid viz., myzus persicae sulz. and aphis gossypii glov. initially studies were carried out to optimize protocols for efficient plant regeneration and agrobacterium-mediated transformation for nagpur sweet orange, which is a popular and elite citrus cultivar in india. organogenesis was induced in etiolated epicotyl explants of one-month-old axenically raised polyembryonic seedlings by culturing them in mt medium supplemented with g/l sucrose with varying concentrations of plant hormones. it was found that bap at mg/l without auxin was best for efficient shoot regeneration in citrus using epicotyl explants. a % regeneration frequency was obtained and multiple shoot formation was obtained from both the cut ends of all the explants. an average of . well-differentiated shoots per explant were obtained, all of which rooted normally under the influence of mg/l iba. this improved regeneration protocol was utilized in standardizing agrobacterium-mediated transformation of citrus using a. tumefaciens strain eha , containing binary plasmid pcambia that harbors gus reporter gene and npt-ii plant selection marker gene. one-month-old epicotyl explants infected with over-night grown agrobacterium (a . - . ) for min and co-cultured for days were found to be optimum for transformation as assessed on the basis of pcr analysis and gus activity displayed by the stem and leaf sections of putative transgenics. overall transformation frequency ranged from to %. current study focuses on the generation of citrus transgenics for ctv resistance using a. tumefaciens strain eha containing binary plasmid pbinar harboring portion of coat protein gene of ctv and npt-ii gene employing the standardized protocols. several putative transgenic shoots were recovered on selection medium and they are being utilized for molecular analyses and resistance against ctv. work is also in progress on the generation of citrus transformants using rnai construct harboring ctv cp and p genes, singly and in conjunction. our lab was also involved in developing rice transgenics for resistance against rice tungro disease, which is one of the most important and widespread virus diseases of rice in south and southeast asia, causing an annual estimated loss in crop yield of economic losses worth millions of rupees are caused due to these diseases annually. virus diseases are frequently less conspicuous than those caused by other plant pathogens and last for much longer. this is especially true for perennial crops and those that are vegetatively propagated. one further problem with attending to assess losses due to various diseases on a global basis is that what most of the data are from small comparative trials rather than wide scale comprehensive surveys, even the small trials do not necessarily give data that can be used for more global estimates of losses. this is for several reasons, including: ( ) variation in losses by a particular crop from year to year; ( ) variation from region to region and climatic zone to climatic zone: ( ) differences in loss assessment methodologies; ( ) identification of the viral etiology of the disease; variation in the definition of the term 'losses' and ( ) chilli is the major vegetable and spice crop grown in thar desert areas of rajasthan. leaf curl disease (chlcd) is one of the major constrains in chilli cultivation faced by farmers and cause yield loss up to %. a survey was conducted in major chilli growing areas of thar desert; bikaner, nagur, jodhpur and jalore districts of rajasthan during november, to understand the present status of leaf curl disease in chilli. among the four district surveyed for chlcd, the disease incidence was recorded maximum (up to %) in jodhpur district followed by jolore district (up to %). no relation was found between the disease incidence and varieties. the major varieties grown in these area are; mehsana, rch (mandoria), haripur raipur, mathania and local cultivars. the number of whitefly was also counted in top, middle and bottom leaf of chilli grown in these areas. the average number of whitefly per plant ranged from . to . . more number of whitefly ( . ) was recorded in jodhpur district and lowest ( . ) in jalore district. total dna was extracted from three leaf curl infected samples from each district and tested for the presence of begomovirus using coat protein (cp) and dna-b specific primers. all the samples were positive for cp and dna-b amplifications by pcr. the cloning and sequencing of selected cp gene and dna-b fragments are in progress. the preliminary investigations shows that the leaf curl disease of chilli is widespread in the arid region of rajasthan and may be caused by begomovirus associated with satellite dna-b. bittergourd (momordica charantia) is an important vegetable crop of kerala. the crop is affected by several diseases of which mosaic is a prominent one. a field experiment was conducted to evaluate the efficacy of potentised resistance inducing substances (ris) viz., mosaic affected bittergourd plant tissue, ash of mosaic affected bittergourd plant tissue, plumbago and salicylic acid for control of bittergourd mosaic in march . ris were applied as drench and foliar spray at three potency levels twice, before flowering of the crop. the experimental crop was grown as per the package of practice recommendations in split plot design with five replications per treatment. the disease incidence, disease severity and yield of the crop were recorded. the result of the experiment shows that spraying was more effective than drenching of treatments for reducing mosaic incidence and severity. among treatments, infected plant extract at potency was the most effective one for reducing mosaic incidence and it showed the maximum incubation period and minimum disease severity. the spray application of treatments produced significantly higher yield than drenching. among the treatments, ash of infected plant at and potency and infected plant extract at potency were on par and produced comparatively higher yield. elephant foot yam (amoprhophallus paeoniifolius), colocasia (colocasia esculenta) and tannia (xanthosoma sagittifolium) are the major edible aroids cultivated in india. the elephant foot yam cultivation is gaining importance due to its high production potential, nutritional and medicinal values and good economic returns. all these aroids are vegetatively propagated and viral diseases are spreading through planting materials. ctcri has the mandate of producing healthy planting materials of these edible aroids. accurate diagnosis and identification of the virus is essential for production of healthy planting material and effective management of the disease. though occurrences of viral diseases on edible aroids in india were known in s, not much attention was given for detection and identification of the virus involved. in case of elephant foot yam - % mosaic incidence was observed with varying symptoms of mosaic, puckering, filiformy etc. in colocasia and tannia, - % incidence was noticed. rt-pcr amplification with potyvirus group specific primers and subsequent cloning and sequencing of the amplified product has confirmed the association of dasheen mosaic virus (dsmv) with all the three edible aroids cultivated in india. the complete full length coat protein gene of dsmv infecting elephant foot yam was cloned in pgem-t vector and sequenced. further sequence analysis revealed that the cp of dsmv consisted of nucleotides and the utr comprised of nucleotides. blast and phylogenetic analysis showed highest similarity of % with that of dsmv isolate af , reported from usa. the deduced amino acid sequence of cp had . - . % identity with other dsmv isolates. blast analysis of the partial cp gene sequences of colocasia and tannia also confirmed that the virus involved is dsmv. rt-pcr analysis of large number of samples from all the three crops confirmed that the potyvirus group specific primers (mj and mj ) are good for rapid detection of dsmv in these crops. dsmv specific biotinylated cdna and digoxigenin labelled crna probes were also prepared and dsmv in elephant foot yam was detected through nucleic acid spot hybridization. yellow leaf disease (yld) caused by sugarcane yellow leaf virus (scylv) is a recently recorded disease in india and is found wide spread throughout country. in popular varieties, the disease incidence varied from to . % and attained epidemic levels under field conditions. detailed studies on the impact of yld on sugarcane revealed that the virus infection significantly reduces various cane growth parameters, cane yield and juice quality. sequence comparisons of the coat protein (cp) and movement protein (mp) of scylv isolates from india and database sequences showed a significant variation between indian isolates and the database sequences both at nt and aa level in the cp/mp coding regions. the significant variation in our isolates with the database isolates, even in the least variable region of the scylv genome showed that the population existing in india is different from rest of the world. further, comparison of partial sequences encoding for orf and revealed that yld in sugarcane in india is caused at least by three genotypes viz., cub, ind and bra-per, of which a majority of the samples were found infected with cuban genotype (cub). the genotype ind was identified as a new genotype and this was found to have significant variation with the reported genotypes. we have identified specific primers from cp region of the virus and optimized rt-pcr conditions to diagnose the virus. this assay has been found efficient in detecting the virus in asymptomatic plants and tissue culture derived seedlings. elimination of the virus through meristem culture has been demonstrated to purify the virus from the infected planting materials and this technique needs to be adopted to supply disease-free planting materials for effective management of the disease. studies are also in progress to identify the yld-resistant sources in sugarcane germplasm to initiate breeding for yld-resistance in sugarcane. mycoviruses are viruses that infect fungi. they have been identified in all major fungal families. in the present scenario, mycoviruses are the important means of biocontrol of plant fungal pathogens. most identified fungal viruses have double stranded rna genomes, often with more than one dsrna present per virus particle, and have been spherical in shape. these viruses are mostly vesicle bound, as other viruses have protein coatings. to be a true mycovirus, they must demonstrate an ability to be transmitted-in other words be able to infect other healthy fungi through anastomosis and spores. mycoviruses lead 'secret lives', reduce the ability of their fungal hosts to cause disease in plants. this property, known as hypovirulence (hypovirulence is a term used to describe reduced virulence found in strains of pathogens), this phenomenon was first observed in cryphonectria (endothia) parasitica (chestnut blight fungus) on european castanea sativa in italy, where naturally occuring hypovirulent strains were able to reduce the effect of virulent ones. these slower growing hypovirulent strains of c. parasitica contain a single cytoplasmic element of double-stranded rna (ds rna) similar to that found in mycoviruses that was transmitted by anastomosis in compatible strains through natural virulent populations of c. parasitica. hypovirulence has also been reported in many other fungal plant pathogens, including rhizoctonia solani, gaeumannomyces gramini var. tritici, ophiostoma ulmi, sclerotinia homoeocarpa, diaporthe ambigua alternaria alternata, and fusarium sp. etc. hypovirulence has attracted attention owing to the importance of fungal diseases in agriculture and the limited strategies that are available for the control of these diseases. it reduces the use of toxic fungicides which also affect the plant growth. the symptoms resulted by the mycoviruses are reduction in growth, reduction in pigmentation and sporulation, excessive sectoring and aerial mycelial collapse. these are the consequences of alteration in complex physiological and biochemical processes involving interaction between host and virus. cassava (manihot esculenta crantz.) is the major tuber crop in peninsular india, it is grown in an area of . lakh hectares with the annual production of . million tonnes both for direct consumption and the starch grain (sago) producing industries, mainly in the southern states of tamil nadu, kerala and andhra pradesh (fao ) . in tamil nadu, cassava primarily produced for sago producing industries where it is considered as an industrial crop rather than food crop, so the resource rich farmers are cultivating the cassava as irrigated crop in their fertile land and the poor farmers are raising the crop under rainfed conditions. in south india in addition to cassava there is a practice of intercropping important vegetable crops like, tomato, brinjal, legumes and gourds in cassava fields since all the above mentioned crops are short duration and are money spinners for the farmers. unfortunately, the major production constraint in these vegetable crops including cassava is the geminiviruses belonging to the family of in recent years there has been growing concern regarding the standard of scientific researches in india. the strengths, weaknesses, opportunities and threats (swot) analysis on indian scientific research reviewed the progress of science during the last six decades. although the 'strengths' were highlighted in good measure, it was the list of 'weaknesses' that called for attention to upgrade the standard of research and 'opportunities' that provide scope for overall scientific growth. a comparison between india and other countries in terms of research papers published revealed that india's contribution to science has come down enormously. what ails indian science? should we compare the growth of indian science with other developed countries? what criteria should be adopted to judge the quality and standard of scientific research? how to motivate the scientists to improve their scientific output? how do motivate the scientists to improve their scientific output? how do indian journals perform in maintaining quality? this paper analyses critically the scientific journals around the world, based on the scores allotted by the national academy of agriculture sciences (naas) in and for and journals respectively. in general, the indian journals performed poorly irrespective of the disciplines with only - % in the high standard. the paper dealt with the reasons for low impact factor, the anomalies in the allotment of scores to wide spectrum of the journals and the disadvantages the scientists face with the scoring system. a case study was presented of an institute with over scientists whose publications were analyzed to discuss the merits and demerits of the system. the performance of the journals published by prestigious academics, societies and councils was also projected. the paper concluded with the need for enhancing the image of the country through research publications in high standard journals and the role of various scientific bodies with shore and long term measures. poster session herpes simplex virus (hsv) keratitis is a leading cause of corneal blindness throughout the world. the infection can be diagnosed by clinical manifestations but in case of atypical ocular cases, laboratory diagnosis is more helpful in timely management of disease. collection of corneal scrapings in all cases of stromal and epithelial keratitis may not be possible, but collecting tear fluid is a convenient procedure causing less discomfort to the patients. therefore, the present study was intended to evaluate the suitability of tear specimens for detecting hsv by polymerase chain reaction (pcr) and immunofluorescence (ifa). tear fluid and corneal scrapings were collected from patients of suspected herpetic keratitis. hsv- antigen was detected by ifa using rabbit anti-hsv antibodies. pcr was performed to amplify bp region of thymidine kinase (tk) coding gene and bp region from dna polymerase coding gene of hsv. out of patients hsv antigen was detected in ( . %) of corneal scrapings and ( . %) of tear specimens and in ( . %) patients from both the specimens. hsv gene could be amplified in ( . %) of corneal scrapings and ( . %) of tear fluids and in ( . %) patients from both the specimens. although, corneal scraping seemed to be marginally superior material for detection of hsv, tear fluid may also serve as an appropriate alternative clinical specimen, due to ease of collection and least discomfort to the patients. in either cases pcr detected higher number of hsv cases than ifa. therefore if and when feasible, both ifa and pcr should be used simultaneously on each specimen to obtain best results. cytokines play a key role in the regulation of immune responses. in hepatitis c virus infection (hcv), the production of inappropriate cytokine levels appears to contribute to viral persistence and to affect response to therapy. il- is produced by a variety of cells including t cells, phagocytes and fibroblast. cytokine genes are polymorphic at specific sites, and certain mutations located within coding/regulatory regions have been shown to affect the overall expression and secretion of cytokines in patients with hcv infection. to correlate the serum levels and polymorphism of il- gene in chronic hepatitis c patients and healthy controls. forty patients positive for hcv rna attending the medicine out patient department and wards of lok nayak hospital, new delhi as well as forty healthy controls were enrolled for the study. the serum level of il- was detected by using elisa. genomic dna was extracted from whole blood of hcv infected patients and healthy controls by using accuprep genomic dna extraction kit according to manufacture's instruction. the genotyping of il- promoter (- variant) was carried out by pcr and direct sequencing using the method of patricia woo et al. . the serum level of il- was significantly down regulated in hcv infected chronic patients as compared to the healthy controls. genotyping of - promoter variant of il- was performed by pcr and direct sequencing. il- polymorphism in the g/g, g/c and c/c allele was non significant when compared to hcv patients and healthy controls. the il- serum levels were significant among hcv infected patients when compared to healthy controls. the polymorphism in the promoter region of il- (- ) was found nonsignificantly associated in hcv patients compared to healthy controls. in conclusion, the present study suggests that the host il- polymorphism alone may not play a significant role in the outcome of hcv infection. acute gastroenteritis (age) is a global health problem and has been associated with multiple etiological agents, which include bacteria, protozoa and viruses. viral gastroenteritis is considered as the second most common illness in children after upper respiratory tract infection. among enteric viruses, rota, noro, enteric adeno, astro and enterovirus are found to be associated with gastroenteritis. although, association of enteric viruses has been established in children hospitalized for age no such data is available from hospitalized children other than enteric infections. to determine the prevalence of enteric viruses circulating in hospitalized children. fecal samples, n = ( symptomatic and asymptomatic for age) were collected from children \ year of age from three different hospitals across the city of pune from june to feb. . detection of group a rotavirus was carried out by using antigen captured elisa. rt-pcr and pcr was carried out for the detection of norovirus, enterovirus, astrovirus and enteric adenovirus detection by using primers targeted to rdrp gene, ncr gene and consevered gene for serine protease and hexon gene respectively. out of fecal samples tested for enteric viruses in age cases, the prevalence of rota, entero, noro, enteric adeno and astrovirus were . % ( ), . % ( ), . % ( ), . % ( ) and . % ( ) respectively. however, the presence of these viruses in the asymptomatic cases (n = ) was detected at . % ( ), . % ( ), . % ( ), . % ( ) and . % ( ) levels respectively. mixed infections of enterovirus and rotavirus were found in both symptomatic . % ( ) and asymptomatic cases . % ( ). however, mixed infection of enterovirus with adenovirus were found only in asymptomatic cases . % ( ). no marked difference was observed in the seasonal pattern of all viruses in the patients with or without gastroenteritis. the findings of this study document highest circulation of rotaviruses in patients symptomatic and asymptomatic for age. the entero and noroviruses remain second most important enteric viruses in these patients. influenza in humans is a major public health concern and the understanding of its evolution in the light of its ''antigenic drift'' helps prediction of epidemics and update of yearly influenza vaccine. to antigenically characterize influenza a (h n ) isolates and study antigenic drift during to in pune city. patients with influenza like illness were identified using a strict case definition from dispensaries located in different areas in pune and clinical samples (ns/ts) were collected after obtaining informed consent. these clinical samples were processed in vivo (in fertile eggs) and in vitro ( overall, an additional ( . %) positive cases of dengue could be detected when ns antigen assay was also used in the study. highest ns antigen positivity was encountered among the samples collected on the rd day of fever whereas mac elisa for anti igm antibody was positive after th day and gradually there was an increase in the positivity towards the convalescent phase of the disease. the results of this study indicate that ns antigen based elisa test can be an useful tool to detect the dengue virus infection in patients during the early acute phase of disease since appearance of igm antibodies usually occur after fifth day of the infection. concurrent use of both diagnostic assays namely ns antigen as well as mac elisa will improve the overall detection of dengue infection. early detection of acute dengue virus infection is crucial to provide timely information for the management of patients. human parvovirus b , a member of the parvoviridae family, is a pathogen associated with a wide variety of diseases. most commonly, it causes childhood rash erythema infectiosum, but in some cases more serious symptoms such as persistent arthropathy, critical failures of red cell production causing transient aplastic crisis, this infection in pregnancy causes hydrops fetalis and myocarditis. traditional immunosuppressive therapy being unsuccessful, anti-viral therapy might be worthy of consideration. functional annotation would provide role of viral proteome in its survival and pathogenic mechanisms. svmprot functional family annotations of vp protein had deciphered its zincbinding, coat protein, outer membrane, chlorophyll biosynthesis, dna repair and calcium-binding nature. vp protein is having a key role in viral assembly of b virus and being non-homologous to human proteome, it was identified as an attractive molecular target for structure based drug discovery. the vp protein crystal structure was energy minimized using charmm. a structure based virtual screening method was applied using ligandfit to identify potential inhibitors of vp protein from chembank database and ten potential human parvovirus b vp inhibitors were proposed. prism genetic analyzer. the drafting of the sequences was performed using bioedit software and were submitted in genbank. for phylogenetic interpretation denv representing the full extent of genetic diversity in denv- , denv- and denv- were collected from genbank. neighbor joining algorithm was implemented with bootstrap value of , replicates for phylogenetic inference using mega . . . the genomic region to (c-prm gene junction) of denv were amplified directly from patient serum. twelve of samples were positive for dengue viral rna. of these were dengue type , was dengue type and were dengue type . for molecular epidemiological survey and genotyping of the sequences more than sequences from different geographical areas including sequences form previously reported north indian isolates were compared with our present data set. the critical analysis of the sequences revealed: dengue type sequences were clustered within sub-type of genotype iii and all the sequences of den- clustered along with genotype iii. thus, among the dengue types , and currently circulating in north india, dengue type , genotype iii, being the predominant one followed by, genotype iii of dengue type . although there is no specific treatment or vaccine available currently, the confirmative rapid diagnosis based on detection of viral nucleic acid or igm antibodies in serum, an indication of recent infection, helps in epidemiological monitoring, symptomatic treatment of patients and determining prognosis. serological detection of anti-cgv igm antibodies was performed using rapid immuno-chromatographic assay (rica) and igm-antibody capture enzyme linked immunosorbant assay (mac-elisa). eighty convalescent sera were tested by rica and of them were found positive for anti-cgv igm antibodies. twenty-five anti-cgv igm antibody rica positive sera were further assayed using mac-elisa. more sera from the patients are currently being tested to compare the sensitivity of these two serological assays in anti-cgv igm antibody based early serological diagnosis of cgv infection and the findings will be presented. thus the present study was designed to evaluate the utility of multiplex pcr (mpcr) for simultaneous and rapid detection of dengue and chikungunya viral infections. seventy-two acute phase blood samples from clinically suspected dengue cases were subjected for dengue and chikungunya uniplex pcr using dengue genus specific primers and e gene specific primers for chikungunya virus as well as multiplex pcr was developed for simultaneous detection of dengue and chikungunya infection. standard strains of dengue and chikungunya virus were used as controls. of the clinically suspected dengue samples were found to be positive for dengue viral rna by dengue uniplex pcr as well as dengue chikungunya mpcr whereas none of the samples were positive for chikungunya virus infection by both uniplex chikungunya pcr and dengue chikungunya mpcr. the result of dengue and chikungunya uniplex pcr was found to be % concordant with dengue chikungunya multiplex pcr. dengue chikungunya multiplex pcr was found to be a potential rapid test to detect dengue and chikungunya viral infections simultaneously in clinical samples. sheetal malhotra, neelam marwaha, karan saluja, ratti ram sharma department of transfusion medicine, pgimer, chandigarh transmission through blood and blood products can be reduced to a great extent by efficient and reliable testing of the blood. the newer fourth generation elisa assays simultaneously detect antibodies against hiv- and and the presence of p antigen and thus shorten the window period to about days, as compared to days with third generation elisa. to compare the hiv seroprevalance among blood donors using fourth generation elisa (antigen-antibody) versus third generation elisa (antibody) assay. this was a prospective study involving blood donors of which were voluntary donors ( being students and being non students) and were replacement donors. sex workers are one of the core group for transmission of sti/hiv and as a ''bridge group'' to the general population. accordingly, highest priority is given to this group in targeted intervention for prevention of hiv/aids. here we are describing one such female sex worker who was harbouring concomitant sti including viral sti. a year old female sex worker was brought to the sti clinic of a tertiary care hospital by ngo with complaint of genital discharge for days. on per speculum examination, cervix was slightly erythematous, tender with mucopurulent discharge. there was no vaginal discharge or ulcer in anogenital area. however, there was a wart at lateral wall of vagina. as per naco syndromic management guideline, treatment was given for n. gonorrhoeae, c. trachomatis and hpv. cervical swab was taken and subjected to various microbiological investigation for the detection of sti viz n. gonorrhoeae, c. trachomatis, t. pallidum, candida spp., t.vaginalis, hsv- , hsv- , hiv, hbv, hcv, hpv and m. contagiosum. saline wet mount showed pus cells, but no yeast cells or trophozoite of trichomonas vaginalis. gram stained smear showed more than four polymorphonuclear leucocytes in the absence of gramnegative intracellular diplococci and a presumptive diagnosis of non gonococcal urethritis was made. no organism was isolated on any culture media after appropriate incubation. cervical swab was negative for antigen of c. trachomatis. serum was tested positive for hbv, hcv, hsv- and t. pallidum though it was seronegative for hiv. in the present case, the female sex worker was harbouring four viral sti viz hsv- , hbv, hcv and hpv alongwith t. pallidum. however clinically she was diagnosed and treated accurately only for genital wart while cervical discharge due to hsv- was misdiagnosed. it is necessary to try to test alternative approaches such as periodic presumptive therapy of viral sti, because this will not only boost up the efforts of sti control in the target group but also help in hiv control. alternatively, regular clinical and laboratory screening for viral sti may be tried. densonucleosis viruses (dnv) belong to parvoviridae family. they are the etiological agents of insect's disease known as densonucleosis, which leads to death or loss of vital functions of the infected insect. densonucleosis virus of mosquitoes has generated lot of scientific interests because of its tremendous potential in biological control and its application as a transducing vector. earlier, we have reported the isolation and characterization of a dnv from aedes aegypti mosquitoes and its prevalence among different ae. aegypti populations from india. there are reports suggesting that when aedes albopictus mosquitoes co-infected with dengue- and dnv, the multiplication of den- is suppressed. the present study focus on the effect of coinfection of ae. aegypti mosquitoes with dnv and chikungunya virus (chik). the first instar mosquito larvae were infected with dnv and the emerging dnv infected females were then infected with chikv by oral feeding. thus obtained chik infected female mosquitoes were analyzed by real time pcr for both dnv and chikv on alternate days post-infection, up to the th day. the data showed no significant difference in the multiplication of either of the viruses after co-infection. results suggest that chikv neither stimulates the replication of dnv nor is its own replication suppressed due to co-infection. this study forms an initial step in understanding the role played by such endogenous viruses on the vector dynamics. chandipura virus pathogenesis is manifested as encephalitis in young children with a very high mortality rate. this damage could be due to direct replication of the virus in brain parenchymal tissue or immune system mediated. this study aims at elucidating the role of brain infiltrating lymphocytes in pathogenesis using mice as the model system. mice were inoculated intracerebrally with the virus and the perfused brain tissue was used to isolate the lymphocytes. control mice were inoculated with an equal amount of media. in order to standardize the procedure for isolation of lymphocytes from brain tissue, splenocytes were processed to isolate the lymphocytes using histopaque density gradient method. methods to isolate lymphocytes from brain tissue as described by earlier workers were tested for the ease and efficiency of procedure using known suspension of lymphocytes from spleen. percoll density gradient method provided optimum yield of lymphocytes with an ease of handling. in this, brain cell suspension used to prepare % percoll is layered over % percoll prepared using media in : ratio. density gradient centrifugation is carried out at g for min at °c to obtain lymphocyte layer at the interface. leishman staining was performed to analyze the morphological characteristics of isolated lymphocytes. normal lymphocytes showed dark blue stained nucleus. some bigger sized cells with diffused nucleus characteristic of atypical lymphocytes were observed and some of the cells were surrounded by hair like structures. phenotypic characterization was carried out using flow cytometry. the presence of cd + , cd + and cd + cells was observed. the percentages of cd + , cd + and cd + cells was found to be . %, . % and . % respectively in the lymphocytes isolated from infected animal and . %, . % and . % respectively from control animal. hence, cd + cells showed maximum infiltration after infection. (santosh et al. ; pradeep et al. ). in the present study chikv suspected blood samples were collected and the acute phase samples were subjected to rt-pcr for the presence of virus specific rna by using the primer pair dvrchk-f/dvrchk-r as described by us earlier (naresh kumar et al. ). the convalescent phase samples were screened for chikv specific antibodies by using sd bioline chikungunya igm rapid test. six sets of primers were designed to amplify the complete nsp and complete structural genes of chikungunya virus. the products were further gel purified, cloned in ptz r/t vector and the recombinant clones were sequenced and submitted to the genbank. the complete ns gene and structural genes were compared with other available sequences in the genbank. sequence analysis results will be presented. the present study discusses these aspects in detail. . some of these phages (viz. v , v ) showed plaques at °c but not at °c. thus they seem to be lysogenic. for propagating and increasing the titre of all the above isolates, various previously described methods were attempted, but none of these methods were satisfactory. but when siliconized glassware and plastic-ware were used, propagation was successful. we showed that siliconization of glassware and plastic-ware was essential for the propagation of our mycobacteriophage isolates v , v , v , v and v . also, phage dilution medium (pdm) as described by chaterjee et al. ( ) was found to be effective for picking out of the plaques made by the phages. in this way, the phage isolates were propagated up to p . the various passages of the phage isolates v , v , v , v and v (i.e. original, p , p and p ) were stored at - °c. the four major routes of transmission are unsafe sex, contaminated needles, transmission from an infected mother to her baby at birth (vertical transmission) and breast milk. screening of blood products for hiv has largely eliminated transmission through blood transfusions or infected blood products in the developed world. in , globally, about million people died of aids, . million were living with hiv and . million people were newly infected with the virus. hiv infections and aids deaths are unevenly distributed geographically and the nature of the epidemics vary by region. more than % of people with hiv are living in the developing world. there is growing recognition that the virus does not discriminate by age, race, gender, ethnicity, socioeconomic status-everyone is susceptible. however, certain groups are at particular risk of hiv, including men who have sex with men (msm), injecting drug users (idus), and commercial sex workers (csws). the present study indicates the prevalence of hiv infection among the people residing in the northern region of india predominantly among the foothills of the himalayas. the study was carried out on the patients visiting herbertpur christian hospital (a unit of emmanuel hospital association) under the integrated counselling and testing centre scheme at the respective hospital during the - . the study indicates the screening of people groups residing in the respective area through community health schemes. the diagnosis of the hiv infection is done by three types of assays namely. the tridot method which is the rapid method of diagnosis followed by the. hiv coombs test which involves the dot immunoassay principle. the third assay is the enzyme linked immunosorbent assay (elisa). the number of patients screened during the period of september to march is which include patients coming from four different states namely haryana uttarakhand uttarpradesh and himachal pradesh. the number of people who were tested positive are and the number of people who were tested negative are . the people tested positive are sent to the higher centre for other confirmatory tests such as pcr and western blot analysis. these patients are sent for treatment and prophylaxis at a respective recognised centre in dehradun. the present study determines a consistent community hiv screening and treatment approach through diagnostics counselling and awareness programmes. classical swine fever (csf) also known as hog cholera is a highly contagious and fatal disease of swine. csf became rapidly a major issue of pig industries. it still causes important economical losses worldwide. it is considered as a major health problem of swines in india. during the month of august to october there was an outbreak of classical swine fever in bihar. from three districts darbhanga, patna and supol, total numbers of different infected tissue samples like kidney, spleen and lymphnode were collected from the dead morbid/pigs. total rna was isolated from % homogenate of infected tissues in sterile pbs by tri-reagent (sigma, usa) according to the manufacturer's instructions and cdna was prepared by using commercial available kit. the cdna was stored frozen at - °c until used. for the molecular detection of classical swine fever virus specific nested pcr amplification of e and ntr was done along with ns b and e rns amplification. primarily these samples were found positive with these primers. further confirmation by sequencing was done by cloning of these pcr products in pgem-t easy vector. e and ntr sequences were considered for phylogentic analysis along with complete available sequences of csfv. nucleotide sequence alignments were carried out using the clustalw program (dnastar) and phylogenetic tree analysis (dnastar) showed that ntr have close proximity with taiwan strain (accession no. ay ) and e shows close proximity with chinese isolate csfv- (accession no. af ). peste des petits ruminants (ppr) and sheeppox are oie notifiable diseases of small ruminants especially sheep and goat. both the diseases are economically important, in enzootic countries like india and are major constraints in the productivity of animals. considering the geographical distribution of both ppr and sheep pox infections and prevalence of mixed infection, in the present study, safety and potency of the experimental duel vaccine comprising attenuated strains of thermostable-ppr virus (pprv-revati, p- ) grown at °c and attenuated sheep poxvirus (sppv-srinagar, p ) was evaluated in local non-descript sheep. experimental animals were grouped into four groups and each group was comprising six animals, received doses ( tcid ), dose ( tcid ) and / th dose of vaccines and normal saline as control in ml volume subcutaneously, respectively. serum samples were collected on , , , and th day post vaccination. sheep simultaneously immunized with ml of vaccine consisting of either or doses of each of pprv and sppv were monitored for clinical and serological responses for a period of - weeks post-immunization (pi) and post challenge (pc). specific immune responses i.e., antibodies directed to both pprv and sppv could be demonstrated by ppr competitive elisa kit and capripox indirect elisa, respectively following immunization. all the immunized animals' resisted infection when challenged with virulent strain of sppv (srinagar isolate at p- ) on day dpi, while in contact control animals developed characteristic signs of sheeppox. the challenge of the sheep against ppr was not carried out, however, the antibody titre after immunization determined by snt and elisa, indicated that protective titre, as per earlier report on the goats. dual vaccine was found safe at higher dose and induced protective immune response even at lower dose ( tcid ) in sheep, which was evident from sero-conversion as well as challenge study with sppv. the study indicated that both the viruses are compatible and did not interfere with each other in eliciting immune response, paving the feasibility of use of this dual vaccine in combating both infections simultaneously. goatpox is one of the highly contagious, oie notifiable and economically important viral diseases of goats. the disease is caused by goatpox virus (gtpv) is classified of the genus capripoxvirus in the family poxviridae. the disease incurs severe economic losses in terms of high morbidity in adults and heavy mortality in young kids and is a major constraint in goat farming in india. considering the enzotic nature and economic impact of the disease, it is all important to control the infection by developing an effective vaccine. recently, vero cell based a live attenuated goat pox vaccine; using gtpv uttarkashi isolate (p ) has been developed in authors' laboratory and evaluated in goats. the vaccine was found safe, potent and immunogenic experimentally and even at field trials. the vaccine has been evaluated at large-scale at different regions of the country and found suitable for mass vaccination. however, the longevity of potency was not evaluated. therefore, a long term potency trials were studied for a period of years with annual challenge by using virulent goatpox virus and sero-monitoring. a sufficient number of hill goats has been vaccinated with dose of vaccine ( . tcid /ml) and monitored for clinical and serological response. every year, significant number of vaccinated (n = ) and control animals (n = ) were used for challenge with virulent strain ( . srd /ml, gtpv mukteswar). sera of pre-and post-challenged ( dpc) animals including controls have been collected and monitored for serological response in the form of specific antibody production by snt and indirect elisa. all the vaccinated animals were protected on challenge, whereas, all unvaccinated controls developed infections. the same has been reflected in sero monitoring of collected sera. so the developed live attenuated goat pox vaccine was found safe, immunogenic and potent for a period of years of immunization and suitable for mass scale vaccination in control and eradication of goat pox along with a/are suitable diagnostic tool/s in goatpox enzootic country like india. rotavirus infection in avian species varies from subclinical infections to outbreaks of diarrhea. the economic significance of rotaviral enteritis to the poultry industry has not yet been defined, but by analogy to the situation in mammals, it is likely to be significant. unlike the extensive studies performed on rotavirus infection in humans and animals, limited studies have been carried out to determine the extent of exposure of poultry birds to rotaviruses. to determine the prevalence of avian rotavirus antibodies in commercial broiler chickens. a total of chicken serum samples were collected from the lairage of a poultry slaughter house where birds from four different broiler farms in and around pune city were supplied to. the serum samples were tested by an igg antibody capture elisa wherein purified chicken rotavirus ch was used as coating antigen. sera from specific pathogen free (spf) chick (n = ) served as negative control in the test. cut off was calculated as mean negative control ? sd (standard deviation). s/co (mean sample od /cut off) values above ( . - . ) in % ( / ) serum samples were indicating positivity to rotavirus antibodies. the result of the study indicates exposure of the birds to avian rotavirus or similar agent that is circulating in pune city. bluetongue has become established in south india causing regular outbreaks in sheep. btv serotypes , , and were isolated from native sheep of andhra pradesh. the other serotypes circulating in the state need to be identified. however the major constraint is the serotype identification. to overcome the difficulties of traditional serotyping methods (neutralization tests), nucleic acid based tests are being tried. rt-pcr for serotyping was standardized using primers specific to vp gene of btv- , and serotypes. rt-pcr resulted in bp product of btv- , bp product of btv- which was defined by specific primers. however non specific amplification at two different sites i.e. bp and bp was noticed for btv- . specificity of rt-pcr was evaluated. btv- and btv- specific primers could amplify only btv- and btv- respectively where as btv- type specific primers amplified not only btv- but also btv- and btv- . nucleic acid sequence data obtained from btv- pcr product and btv- cloned products were specific to vp gene of btv- and btv- respectively. however, and bp products of btv- were identical to vp gene of btv- , , , , and and vp gene of btv- , and respectively, indicating the non specific amplification of btv- . foot and mouth disease is the most contagions and highly economically impotent disease of cloven footed animals. the disease is controlled by regular vaccination using the vaccine produced from the virus grown in the cell culture. the vaccine strain used for vaccine production is selected from the field isolates based on the adaptability and growth kinetics in bhk cells and antigen coverage. however the field viruses need to be passaged several times to adapt in tissue culture. passage of field viruses in tissue culture may results in development of mutants whose genetic makeup may differ from the field samples also some of the field strains may fail to adapt or may grow poorly in the tissue culture, thus the efficiency of the vaccine gets affected. structural proteins of fmdv carry the sequences which determine the serotype specificity and immunogenicity. thus one may replace the gene coding for structural proteins from the full length cdna copy of the vaccine strain that has been adapted to the tissue culture with the poly-structural protein gene (pi) so that the chimeric virus gets the serotype specificity of the field strain besides retaining the other characteristics that are needed for a vaccine virus. we have made replication competent fmdv asia i full length genome and cloned under t and cmv promoter separately in plasmid vectors. bam h sites were created for inserting pi- a gene of other field strains. the p - a of type 'o' vaccine strain was amplified directly from the cattle tongue material, cloned in plasmid vector and studied the specificity by sequence analysis and gene expression. we have introduced 'o' p - a gene into the full length construct devoid of asia structural protein gene, p - a. the in vitro transcribed rna in case of t promotered construct and plasmid dna in case of cmv promotered construct were transfected into the bhk cells. after the passaging the virus obtained was studied for the speciality. this approach may be used not only for rapid selection of vaccine strain and also as a repository of the cdna copy of the virus. the p is composed of a, b, c and d (vp , vp , vp , and vp ) respectively of which the vp is the most immunogenic and subunit vaccine produced with vp alone was able to induce high level of neutralising antibodies. thus to control the disease in india polyvalent vaccine consisting of the inactivated virus of all the three serotypes are in use. however the conventional vaccines have several drawbacks which include safety and temperature sensitivity. hence alternatively sub-unit vaccines consisting of vp protein has been tried. however this showed limited success due to the antigenic variations occurring in the field viruses thus escaping the neutralization from the antibodies generated from single cloned protein. hence the present study was undertaken with an objective to include all the neutralizing epitopes present in the three serotypes by linking vp ( d) genes and produce a poly valent protein for using as poly subunit vaccine. in this study we have constructed a cassette by linking the genes of three serotypes 'o' ( bp), 'a' ( bp) and 'asia ' ( bp). these genes were cloned individually in commercially pbsk vector and confirmed by sequence analysis before linking in pc dna vector. the linked gene construct was sub-cloned in pet expression vector. the expression of the protein gene from the pet vector was induced with iptg and analysed by sodium dodecyl sulphate polyacrylamide gel electrophoresis (sds-page). a fusion protein of size kda was observed in page gels. since the protein contains his residues from the vector at the n-terminal end, affinity purification was carried out using nickel nitrilo-tri-acetic-acid (ni-nta) agarose matrix. the immunoreactivity of the purified protein was assayed by western blot with the anti fmdv type 'o' and 'asia ' specific sera. the may be used as a subunit vaccine. silkworm diseases caused by viruses, bacteria, fungi and protozoans form major constraints for the silk cocoon production in all the sericultural countries and among these silkworm viral diseases viz., nuclear polyhedrosis and infectious flacherie caused by bmnpv and bmifv cause severe crop loss. the traditional disease management strategies include prophylactic measures and use of disease free silkworm eggs. the prophylactic measures such as disinfection of silkworm rearing house and appliances, egg surface, silkworm bed disinfection and rearing surroundings. the disinfectants used presently in sericulture are either formaldehyde or chlorine based products, but these chemicals are neither eco-nor user-friendly. the awareness about health hazards caused by formaldehyde and environmental pollution caused by cl necessitated the development of eco-and user-friendly disinfectant products for use in sericulture. these include alternative disinfectant products developed using biodegradable chemicals and plant based ingredients by apssrdi, hindupur and central silk board for the management of silkworm diseases in india. the ideal disinfectant for sericulture would be the one which can inactivate silkworm pathogens of diverse origin and economical for sericulture. the paper discusses on the disadvantages of hcho and cl based disinfectants and advantages of eco-and user-friendly disinfectant for the management of silkworm diseases especially the ones caused by viruses. the baculovirus expression vector system (bevs) is widely used for the production of high levels of properly post-translationally modified, biologically active and functional recombinant proteins and has facilitated basic biomedical research on protein structure, function, drug discovery and the roles of various proteins in disease. bevs is based on the introduction of a foreign gene into nonessential for viral replication genome region via of homologous recombination with a transfer vector containing target gene. the resulting recombinant baculovirus lacks one of nonessential gene (polh, v-cath, chia etc.) replaced with foreign gene encoding heterologous protein which can be expressed in cultured insect cells and insect larvae. insect cell-bev system is widely used to produce recombinant proteins. bevs also eliminates concerns regarding pathogens that could potentially be transmitted to humans as it is non-infectious to vertebral animals. these features make silkworm system an ideal expression and delivery package for producing proteins of medicinal importance. the efficiency, low cost and large-scale production of proteins using bevs represents breakthrough technology that is facilitating highthroughput proteomic studies. the bevs has become a core technology for cloning and expression of genes for study of protein structure, processing and function; production of biochemical reagents; study of regulation of gene expression; commercial exploration, development and production of vaccines, therapeutics and diagnostics; drug discovery research; exploration and development of safer, more selective and environmentally compatible biopesticides. utilization of silkworm larvae and pupae as bioreactor with recombinant bmnpv producing foreign proteins extends the usages of silkworms. due to its large-size and high protein synthesis ability as well as the expediency in mass culture, silkworm is considered as good candidate for producing recombinant proteins. wssbv is the causative agent of a disease, which has recently caused high shrimp mortalities and severe damage to shrimp culture. wssbv has been found across different penaeid shrimp species. in order to develop a effective diagnostic tool, a wssbv genomic library was constructed by cloning wssbv genomic dna extracted from purified virions. in the present study wssv disease free (confirmed by pcr analysis) were collected from hatcheries from different areas of guntur and prakasam districts and analysed to study the effect of various physical parameters like temperature, p h , salinity and turbidity on the prevalence of above disease. the studies on the surface water temperature revealed fluctuations in the ponds ranging between to . °c in diseased ponds and . to . °c in healthy ponds. these results show definite influence of temperature on the prevalence of wssv. present day strategy in vaccine development is to include marker facility that helps in distinguishing antibody response due to vaccination vis-à-vis infection in vaccinated animals. such information becomes relevant for effective disease control programmes especially when using inactivated virus vaccines like foot and mouth disease (fmd). the antibodies generated in the animals, only through vaccination, is the measure of vaccine efficacy and safety. presently inactivated fmd virus (fmdv) vaccines are used to control the disease in the endemic countries like india. the quality assurance of the vaccine depends on the efficacy of the vaccine in generating protective antibody without causing subclinical disease due to improper inactivation. since protective antibody response in vaccinated animals can not be distinguished from that of infected animals one needs to assay the antibody response against non structural proteins (nsps) and the vaccine must be free of contaminated nsps. production of vaccine free of nsps requires the cumbersome method of virus purification which adds to the cost of the vaccine. alternatively one may develop a positive marker vaccine by including a foreign protein or epitope which is not expected to be present in the vaccine and the antibodies generated against which helps in detecting the vaccine related response. here we report a molecular approach by which we introduced a immuno-dominant epitope of green fluorescent protein (gfp) into the structural protein gene of foot and mouth disease virus vaccine strain asia ( / ). our laboratory has produced a mini-genome of fmdv asia that lacks structural protein gene (p - a) coding for all the structural proteins (vp - ) of fmdv asia as a vector (pcfl dasia ). the p - a of the asia vaccine strain was cloned separately into a plasmid vector and by successive pcr mutagenesis and cloning we have introduced nucleotide sequence corresponding to amino acid epitope of gfp into p - a gene. gfp epitope was inserted by replacement at n-terminal region of vp- which is not immunogenic. the modified p - a was expressed in e. coli and studied. the modified p - a gene with gfp epitope was inserted into the pcfl dasia to get full length replication competent cdna cloned under cmv promoter in pcdna (pcflasiagfp). this can be used to produce synthetic virus with gfp epitope that can generate antibodies not only against neutralizing epitopes but also against gfp epitope. presence of antibody against gfp epitope in the vaccinated animal will reveal vaccine efficacy. elisa against gfp can be used as a companion test not only for safety evaluation but also for quick evaluation of efficacy. further absence of nsp antibodies in the serum may reveal the quality of the vaccine in respect of safety. self replicating dna vaccines are developed to achieve robust immune response through enhanced antigen production and gamma interferon expression in vaccinated animals. since self replicating dna vaccines induce gamma interferon expression which helps in viral clearance such vaccines are expected to be useful to cure even the carrier and persistently infected animals. understanding the events that help in the elicitation of both the arms of immune response in vaccinated animals is necessary to understand the effectiveness of the vaccine. the work presented here deals with the immunological evaluation of a sindbis virus replicase based dna vaccine carrying linked fmdv vp genes in vaccinated guinea pigs. we have constructed self replicating dna vaccine vector and to the down stream of a sub genomic promoter we have inserted secretory signal followed by linked-vp genes of fmdv serotypes (o-a-asia ) with glycine and proline bridge in between. guinea pigs were vaccinated with the construct and the sera at days post vaccination were evaluated both for cellular response by studying the cd levels and by mtt and cytokine profiles by real time assays. the humoral response was evaluated by studying cd levels in the whole blood by facs analysis and serum antibody levels by snt and elisa. the animals were challenged with gp infective dose of fmdv type 'o' virus lesions were scored. further the replicative efficiency of the challenge virus was studied by ab elisa. the results showed that all the assays except antibodies against ab protein have positive correlation with the protection. as expected the titre of the antibodies against ab protein was lower indicating that the challenge virus replication was inhibited in the vaccinated animals. the limited studies conducted by us showed that self replicating vaccine has a potentiality to emerge as potent vaccine for fmd. ganjam virus (ganjv) belongs to the genus nairoviruses (family bunyaviridae). these viruses cause diseases in livestock. it has been isolated from different animal hosts and tick vectors from india. genus nairoviruses includes a total of tick-borne viruses, classified into serogroups. the important serogroups are crimean congo hemorrhagic fever (cchf) and the nairobi sheep disease (nsd). the main members of the nsd group are nsd and dubge viruses. their genome consists of three segments of single stranded rna, viz. s, m and l that encodes viral nucleocapsid protein, viral glycoprotein g and g and the viral polymerase respectively. ganjv is very closely related to (nsdv). nsdv is found in east and central africa, causes very high morbidity and mortality in livestock. the present study involves phylogenetic comparison of ganjv isolates from india with other nairoviruses based on complete n gene. it will help to understand the kind of nucleotide (nt) and amino acid (aa) changes that have occurred in ganjv strains from different geographical areas. eight strains of ganjv isolated at niv during - from different parts of india were used in this study. virus stocks were prepared in vero e cell line these were used as the source of viral rna. the n gene was amplified either as a complete gene in one reaction or in fragments whenever necessary. thus obtained sequences were analyzed; annotated to get a consensus sequence, aligned against the sequence of prototype strain of ganjv and other representative nairoviruses. the nt sequences were converted to aa sequences and analysis was done at both nucleotide and amino acid levels. based on what nt or aa phylogenetic tree was constructed and compared with other nairoviruses (cchf, dugv, hazv, kupv and nsdv) where complete s segment sequences were available gen-bank database (ncbi). the phylogenetic data at both the nt and aa levels showed that all the strains of ganjv form monophyletic lineage with the nsdv. cchfv and hazv together formed another clade, whereas dugv and kupv made a separate branch in the tree. the different ganjv strains showed - % difference with nsdv at the nucleotide level and - % difference at the amino acid level. hazv showed - % difference at the nt level and % difference at the aa level with ganjv as well as nsdv. the present data obtained suggests that ganjv and nsdv are minor variants of the same virus. diarrhoeal syndrome is one of the major concerns of the livestock industry. most of the diarrheic cases in animals go unnoticed and limited attention is paid on viral etiology. presence of large amount of fecal matter in animal shed acts as a source of infection for calves via drinking water, feed, or contaminated soil. keeping this in view, investigation was planned to detect the association of rotaviruses with diarrhea in dairy calves and to observe the genomic diversity among the circulating viruses in tarai area of uttarakhand. a total of diarrheic fecal samples collected from instructional dairy farm, nagla, pantnagar, uttarakhand were screened during the study. samples were collected from both cow calves and buffalo calves in - months of age. for the diagnosis of rotavirus, all the fecal samples were subjected to rna-electrophoresis after nucleic acid extraction. viral genome segments were visualized by silver staining. out of the total samples tested, seven were found positive in rna-page showing typical genome segments migration pattern of bovine rotavirus. in the given samples prevalence of bovine rotavirus was . % and % in cow and buffalo calves, respectively. on the basis of migration patterns of rotavirus in rna-page, group a were identified with typical : : : pattern. variation within movement of various genome segments among isolates of bovine rotaviruses was observed during the study that may be indicative of emergence of mutants in the circulating isolates. the vp gene based group a specific rt-pcr was standardized and all the isolates in this area were confirmed to be of group a type. work is in progress to genotype the bovine rotaviruses of this region based on vp and vp genes. this study emphasizes the need to explore the prevalence of bovine group a rotaviruses in different places of uttarakhand and their genetic characterization which could help in selection of control strategies for rotavirus infections. foot-and-mouth disease (fmd) is endemic in india causing enormous economic loss to the animal keepers and trade embargo with fmd free countries in livestock and animal products. rapid diagnosis of fmd is of immense importance in prevention and control of the disease. fmd is initially diagnosed clinically and confirmed by laboratory tests. virus isolation in cell culture and sandwich elisa for antigen detection are commonly practiced in laboratories. the virus isolation though is very sensitive but it can be slow and analytical sensitivity of the elisa is lower and can not be used with certain sample types. the use of molecular techniques in the diagnostic laboratory has greatly increased the speed, specificity and sensitivity of fmd diagnostic tests. molecular techniques like rt-pcr, pcr-elsa and dot hybridization can be used with more success for detecting carrier animals and animals harboring sub-clinical infection and can be applied in a wide range of clinical sample types. these techniques can be used as genus and serotype specific test including detection of particular lineage/genotypes with in the serotype. multiplex pcr has been used to differentiate serotypes of fmdv and the technique is sensitive, experimentally simpler, cost effective and less time consuming. the assay can be used for serotyping on elisa negative samples. the molecular techniques not only help in diagnosis but also useful for epidemiological studies. lineage differentiating rt-pcr has been useful in identifying different lineages of serotype asia (lineage b, c and d) before proceeding with sequencing of d region. similarly genotype differentiating rt-pcr has been developed and used in differentiating two different genotypes of serotype a (genotype vi and vii). these assays have the potential to be applied on clinical samples directly, thereby saving much time needed for sample processing and nucleotide sequencing. recent development of real time rt-pcr methodology has allowed the diagnostic potential of molecular assays to be realised. advancement in real time pcr technology made it possible to combine several assays within a single tube which is in the progress in our laboratory. integration of these assays onto automated high throughput platforms provides diagnostic laboratories with the capability to test large numbers of samples. microarray technology was provided greater screening capabilities for pathogen detection. the microarray allows the addition of large number of oligonucleotide probes for identification of mutant pathogen and also for subtype determination. the combined properties of high sensitivity and specificity, low contamination risk, and speed has made realtime pcr and microarray technology a highly attractive alternative to conventional methods in increasing percentage of outbreaks confirmed and analyzed and for tracing the origin of fmd virus responsible for outbreaks. dna vaccines are expected to elicit both humoral and cellular responses, cellular response being long lasting. however the approach has several limitations like poor stability of dna, poor expression and risk of integration. poor expression becomes the major limitation in the case of fmd as fmdv proteins are poor immunogens. also dna vaccine vectors carrying only eukaryotic promoters elicit strong cmi response and weak humoral response. the methodology to achieve humoral response involves the expression and secretion of the expressed protein so that the antigen presenting cells will be able to process the antigen and produce humoral response. in case of fmd humoral response is as important as cellular response. the present project aims at addressing these issues; achieving higher expression and getting the protein secreted out by constructing self replicating gene vaccines for fmd and studying their efficacy. the vector for humoral immune response contains eef promoter, sindbis virus polymerase gene and secretory and anchoring signals. the integrity of the vectors was confirmed by sequence analysis. the linked polyvalent protein genes of fmdv serotype a, o and asia were cloned into the vectors and the presence of the insert was confirmed by restriction enzyme digestion. the functionality of the constructed dna vaccine vector (pvac self rep ) was assayed by transfecting the dna into bhk cell monolayer and studying the s labeled proteins in immuno-precipitation assays. the studies showed high level of expression in case of constructed vector as compared to infected virus for the specific protein. the secretion of the expressed protein was assayed by immuno-fluorescence assay and found to be positive. encouraged with these studies the preliminary studies were conducted on vaccine efficacy studies in guinea pig model. the immunized guinea pigs showed high antibody titres by snt and elisa, as compared to conventional dna vaccines (pup cd) even at / th of the dose. this approach of constructing self replicating dna vaccine for humoral response is the first report. genetically engineered microorganisms are important sources of industrial and medicinal proteins. over the past decade, plant host system has been investigated as potential host system for expressing proteins of therapeutic and diagnostic use. however concerns regarding the stability and environmental safety need to be addressed. chloroplast engineering is expected to resolve some of these issues since, plastids/chloroplasts are inherited maternally and are not disseminated through pollen. this makes plastid transformation a valuable tool for transgenic creation besides offering biological containment. since foot and mouth disease (fmd) of cloven footed animals is a major concern in the world over. foot and mouth disease (fmd) is the most feared, viral disease of the cloven footed animals causing heavy losses to the live stock industry. the disease is enzootic in many parts of the world including asia. the conventional vaccines for fmd have several limitations which include safety, temperature sensitivity and duration of immunity. attempts have been made to overcome these limitations using recombinant dna technology. amongst the newer vaccines, edible vaccines are cost effective and easy to administer. since the stability of the gene of interest is the major concern in the case of plant transgenics, marker genes are used for regular selection. the detection methods based on the available marker proteins like b glucoronidase (gus) protein/antibiotic selection are cumbersome and cost intensive. however selection based on herbicide resistance is much simpler and easy. hence in the present study, the -enolpyruvylshikimate- -phosphate synthase (epsp) gene was used as a marker along with the immunogen gene of fmdv. epsp is the key enzyme in the shikimate biosynthesis pathway necessary for the aromatic amino acids production. in order to investigate the mechanism of long term immunity and the effect of protective immunity induced by cationic plg micro particle coated dna vaccination. we constructed the expression plasmid containing a foot-and-mouth disease virus (fmdv) id gene sero type a. intramuscular vaccination of guinea pigs with the micro particles coated plasmid dna induced a strong antibody response and neutralization antibodies, cellular mediated immune response which lasted year. we further analyzed the persistence and expression of id gene by polymerase chain reaction and reverse transcriptase polymerase chain reaction and quantitative pcr. the results showed that id gene was present and expressed in the muscle cells up to year after days post vaccination. furthermore, guinea pigs vaccinated with micro particles coated plasmid dna were protected against a challenge with fmdv virus. therefore the micro particles coated plasmid dna vaccination dose induce a protective immunity and long term humoral, cellular immuno responses against fmdv, which could be maintained by persistent expression of id gene in muscle cells. foot and mouth disease virus (fmdv) causes a highly contagious viral disease of cloven hoofed animals, which has a considerable socioeconomic impact on the countries affected. interleukin- (il- ) enhances the il- driven th immune response that is important in immunity against intracellular pathogens. the multiple roles of il- in many physiological and pathological processes have generated a great deal of interest in recent years. antiviral effects of il- have been reported. we evaluated the effects interleukin- (il- ) on the replication of fmdv in vitro in bhk- cells. bovine il- mature protein coding sequence was amplified from the bovine pbmc cells and cloned into prokaryotic expression vector pet a. protein expressed was purified and specificity was confirmed by immunoblotting. bhk- cells were treated with purified expressed il- protein with ( lg/ml) h prior to fmd infection. cell culture supernatants were collected at h post infection were subjected for elisa and virus titration assay. rna extracted from the cells was subjected to real time pcr for viral rna quantification. log titer reduction was observed in the fmd virus titer in il- treated cells compared to the untreated cells where as virus antigen quantified by elisa has shown a reduction of -folds. -fold reduction in the fmd viral rna copy number was observed in the il- treated cell compared to the untreated measured by qpcr. current study demonstrated the potent anti viral activity of il- on fmdv by inhibiting the viral replication. these results further suggests that il- has the potential role of il- as molecular adjuvant in fmd vaccine development and development of therapeutic for fmd. foot and mouth disease is the most contagious viral disease of farm animals. control of the disease in animals is by vaccination and slaughtering of infected animals. conventional oil adjuvant vaccine has its own limitation. alternate to this genetic vaccines where the dna encoding viral antigen may be a promising approach. naked dna vaccine has limitations like poor uptake of dna by cells and more importantly by nucleus. as a result delivery of naked dna through calcium phosphate nanoparticle was attempted. calcium phosphate nanoparticle is a potential delivery agent which proved to enhance the immune response. fmdv p - cd ''o'' vaccine gene constructs in pcdna . ? entrapped by the nanoparticles was prepared by using different molarity of calcium chloride and disodium hydrogen orthophosphate. the nanoparticles entrapping fmdv p - cd ''o'' and naked dna were presented to the guinea pigs through intramuscular injection to study the mrna expression of antigen by rt-pcr. animals were sacrificed at defined time to collect different organs and total rna was extracted. each time blood was collected to analyse the fmdv specific serum antibodies. dna vaccines presented through calcium phosphate produced transcripts in the injected muscle up to days whereas naked dna up to days. serum antibody levels of naked dna vaccine showed antibody titre till days. whereas nanoparticle injected animals showed serum antibody till days. serum neutralization titres of . were observed in calcium phosphate dna vaccines at about - days, where as naked dna sn titers were observed for short period of - days. the study clearly showed calcium phosphate nanoparticle entrapping fmdv vaccine dna may be a better delivery system for dna vaccines as it confirms availability of the antigen and persistence of antibody for longer duration than naked dna. capripox is highly infectious, contagious, and oie notifiable disease of small ruminants, caused by sheeppox and goatpox viruses which are members of capripoxvirus genus of the family poxviridae. in the present study, we analyzed the partial gene sequences of p protein, an immunogenic envelope protein of capripox viruses (capv) to assess the genetic relationship among different sheep pox and goat pox virus isolates from different geographical areas of the country. product of this gene has been shown to be important in attachment of capv to host cell surface receptors during viral entry and host immune response. the following virus isolates have been used in the analysis: gtpv-uttarkashi, p , vaccine virus; gtpv mukteswar, p , challenge virus; gtpv (akola), gtpv bareilly/ , gtpv ladakh/ and gtpv sambalpur/ , field isolates and sppv srinagar, p ; sppv ranipet, p ; sppv-rf, p , vaccine viruses and sppv makdhoom/ , sppv cirg/ , sppv pune/ , sppv bareilly, sppv / and sppv / , field isolates. in this study, all virus isolates were confirmed by pcr amplification and analysed in pcr-restriction fragment length polymorphism (pcr-rflp) using ecori enzyme to confirm their specificity. further, the amplicons were cloned and sequenced commercially. nucleotide and the deduced amino acid (aa) sequences were compared with published sequences of the members of the genus capripox virus. sequence analysis of partial bp sequence has shown high sequence identity among all indian sppv and gtpv isolates at both nt and aa levels. it revealed a . - % and . for gtpv field isolates where as, % for sppv field isolates at both the nt and aa levels. in general, capv isolates in this study shown . - . and . % homology between gtpv and sppv at nt and aa levels as reported earlier. further, it revealed a unique change of g a in all gtpv isolates resulting in formation of drai site in place of ecori and possible development of restriction enzyme specific pcr-rflp for differentiation of sppv and gtpv from field isolates. orf or contagious ecthyma is considered as non-contagious, proliferative disease and is caused by orf virus of the genus parapox virus of the family poxviridae. it is reported most commonly in sheep and goats and also a zoonotic agent. camels are also infected by orf virus and reported in camel rearing countries as a mixed infection with camel pox, the later is caused by an orthopox virus. in india, there are few reports of the orf virus infection in camels and identified by clinical signs and pcr. in this study, we identified the presence of orf virus from clinical samples of suspected case of sporadic infection in camels by serological and molecular techniques. viral dna isolated from processed scabs used initially in nested polymerase chain reaction as diagnostic pcr which successfully amplified bp fragments and also sequenced to check the fidelity of the product. after confirming the infection by pcr, some of the structural and non-structural genes were amplified for sequence analysis. out of the five genes characterized, the major important one selected for sequence and phylogenetic analysis is b l gene which is homologous to a major envelope protein p k of vaccinia virus. full open reading frame of bp from orf b l was amplified by pcr, cloned and sequenced commercially. nucleotide and deduced amino acid sequences of b l were compared with other published sequences of the members of the genus papapox virus. sequence analysis shows a maximum percent identity of . and (indian orf virus isolates); . and . (other orf isolates); . and . (orf-camel/jodhpur/ ); and . (bovine popular stomatitis virus) and finally . and . (pseudo cowpox virus) respectively at nt and aa levels. phylogenetic analysis of the isolate was also performed using the neighbour joining method in mega program to know the phylogeny relatedness of the virus, which revealed that the isolate is well grouped with the jodhpur isolate and closely related to pseudo cowpox virus. it warrants further analysis of other potential genes to confirm the causative agent of the contagious ecthyma in camels as pseudo cowpox virus. chikungunya an arboviral disease is transmitted through the bite of an infected aedes mosquito. it causes a self limited febrile illness along with arthralgia and myalgia. in some cases neurological and severe hemorrhagic manifestations has been observed. chikv epidemic has been reported in africa, india, south east asian countries and during the current out break imported cases of chikv has been encountered in most of the european countries. the causative agent belongs to the genus alphavirus family togaviridae. human beings serve as the chikungunya virus reservoir host during epidemic periods. outside these periods the main reservoirs are monkeys, rodents, birds, and other unidentified vertebrates. antibodies to chikv have been detected in domestic animals. in the present study we surveyed madanapalli, palamaner, b. kotta kota and tirupati and collected a total of rodent samples, bovine samples; sheep samples and canine samples. total rna was isolated from all these samples and subjected to rt-pcr using a primer pair dvrchk-f/dvrchk-r which could amplify a bp e gene product specific to chikungunya virus (naresh kumar et al. ). all the serum samples were further screened for chikv specific igm antibodies using commercially available ctk biotech strips. none of the samples were found positive either for chikv specific rna or chikv specific igm antibodies. more number of samples from domestic animals as well as rodents are being screened to study their possible role if any in the maintenance of chikv in nature and during the inter epidemic periods. the present study discusses these aspects in detail. petunia hybrida is widely used as experimental host plant for begomovirus identification and its characterization. hitherto, natural infection of begomovirus on petunia has not been reported in india. recently, petunia hybrida grown in and around ludhiana were found to be depicting typical symptoms caused by begomovirus. the symptoms include severe reduction in leaf size, downward curling and distorted leaves. severely infected plant became bushy, stunted and produces no flower. total genomic dna was extracted from the plants showing symptoms of begomovirus, by ctab method. the presence of virus was confirmed by using degenerated primers, designed to identify all the begomovirus prevailing in the world. to identify the strain associated with the disease, the positive samples along with healthy control were tested against different strain specific primers of tomato leaf curl virus, so far reported in india i.e. tomato leaf curl new delhi virus, tomato leaf curl palampur virus, tomato leaf curl banglore virus, tomato leaf curl karnataka virus and tomato leaf curl gujarat virus. among these, only tomato leaf curl new delhi virus specific primer was able to give the desired amplicon of * bp. hence, it is confirmed that the leaf curl disease of petunia hybrida is associated with tomato leaf curl new delhi virus. this disease of petunia can become a sever production constraint in coming years. from last years ( and ) it was observed that some varieties of brinjal grown in rainy season, showed typical leaf curl type of symptoms. the symptoms include upward curling of the leaves, cupping, vein thickening, reduction in leaf size and distortion of leaves. the severely infected plant remains stunted and bushy, became unproductive or produces only few fruits. the disease was experimentally transmitted from naturally infected brinjal to healthy seedlings by whiteflies (bemisia tabaci) and grafting, but not by mechanical or aphid transmission. to detect the begomovirus associated, total genomic dna was extracted from the plants showing disease symptoms. the presence of virus was confirmed by using pcr based begomovirus geneus-specific primers designed by deng et al., wyatt and brown and rojas et al. these degenerated primers give the expected product size of * , * and * bp, respectively. core coat protein (cp) gene and dna-b was also amplified in the samples using specific primers. to identify the strain associated with leaf curl virus, dna was subjected against primers of different indian tomato leaf curl virus strain i.e. tomato leaf curl new delhi virus, tomato leaf curl palampur virus, tomato leaf curl banglore virus, tomato leaf curl karnataka virus and tomato leaf curl gujarat virus, using pcr. among these, only tomato leaf curl new delhi virus primer was able to show the desired product size of * bp. therefore, it was confirmed that leaf curl disease of brinjal is caused by tomato leaf curl new delhi virus in association with satellite b-dna. to identify the strain associated with the disease, all samples were further subjected to the specific primers, designed to amplify all the tomato leaf curl virus strains, so far reported from india i.e. tomato leaf curl new delhi virus, tomato leaf curl palampur virus, tomato leaf curl banglore virus, tomato leaf curl karnataka virus and tomato leaf curl gujarat virus, using pcr. among these, only tomato leaf curl palampur virus specific primer was able to give the expected product size of * bp. this shows the association of tomato leaf curl palampur virus with leaf curl disease of calendula and marigold. thus, calendula and marigold can act as a reservoir for the tomato leaf curl palampur virus and may cause severe constrain in the production of these important ornamental plants. groundnut bud necrosis virus (gbnv) belongs to serogroup iv of the genus tospovirus in bynayaviridae family and infects several economically important crops all over india. the nucleocapsid protein (np) encoded by the small rna of gbnv encapsidates the viral rnas. apart from this structural role, the np has also been implicated in the replication, transcription, maturation and cell to cell movement. with a view to study the structure and function, the np of gbnvtomato isolate from karnataka was over expressed in e. coli and purified by ni-nta chromatography. the purified np was present as ribonucleoprotein complex and as heterogeneous mixture containing monomers, tetramers and higher order multimers. in order to determine the regions involved in oligomerization and nucleic acid binding, mutational approach was taken. n-and c-terminal deletion clones were generated (n np, n np, c np and c np), over expressed in e. coli, and were purified by a procedure identical to that used for the wild type protein. initial studies on oligomeric status suggested that in addition to n-and c-terminal regions there may be additional regions or residues which contribute to multimerization of np. the amount of rna bound to the truncated proteins was reduced in case of n np, n np and c np. interestingly removal of amino acid residues (natively unfolded region) from the c terminus resulted in complete loss of nucleic acid binding suggesting that the rna binding domain was located in c-terminal region of np. further np was observed to get phosphorylated in in vitro kinase assays by a kinase present in the soluble fraction of tobacco plant sap. both atp and gtp were utilized as phosporyl donors and mn ? was the preferred metal ion which suggests that np might be phosphorylated by a ck -like protein kinase. phosphorylation studies with n-and c-terminal truncated proteins revealed that the site of phosphorylation lies within the amino acid residues - . by mass spectrometric analysis of the protein threonine- and serine- were identified as possible phosphorylation sites. a naturally occurring isolate of virus infecting gherkin (cucumis anguira l.) showing mosaic symptoms of mosaic, leaf distortion and dark green islands in the lamina was identified in the export cultivars of gherkin grown in commercial fields of kuppam rural, chittoor district, andhra pradesh. the virus infection was deadly prevalent among the field that caused a lot of economic damage to the crop that resulted in yield losses and reduced quality of fruits meant for export. symptoms of the infected fruit included blistering and malformation of the fruit. the virus infected leaf samples were collected and initial host range tests were conducted with different cucurbit species showed that the host range include propagation hosts like cucumis anguira (gherkin), cucumis sativus, cucurbita pepo, cucumis melo, langeneria vulgaris, momordica charantia and local assay host like chenopodium amaranticolor. the virus host range was only restricted to cucurbit species and chenopodium. the virus was maintained for further studies on cucurbita pepo by sap or mechanical inoculation. the virus induced mosaic, vein clearing symptoms on pumpkin. electron microscopy of the leaf dip preparations stained with % uranyl acetate from the pumpkin leaves showing symptoms revealed the presence of a long flexuous filamentous particle measuring nm. the virus positively reacted to the polyclonal antisera of papaya ringspot virus-w, potato virus y, tobacco etch virus and also strongly reacted with the polyclonal antiserum of zucchini yellow mosaic virus in direct antigen coated-enzyme linked immunosorbent assay (dac-elisa). because of very strong reaction to polyclonal antisera of zucchini yellow mosaic virus, we tried to amplify the partial nib and cp genes of the virus along with the utr by using two primers zy gctccatacatagctgag acagc and zy taggctttttgcaaacggagtcta at c . total rna from gherkin infected leaves was isolated using trizol ls reagent (sigma). rt-pcr was performed to obtain an amplicon of * . kbp, cloned into fermentas ptz r/t vector and sequenced at mwg biotech, bangalore. sequence analysis revealed that the virus was isolate of zucchini yellow mosaic virus and was showing % of homology to that of the zucchini yellow mosaic virus strain b genome ay and zucchini yellow mosaic virus nat genome ef which were strains reported from israel. the sequence of the present study was submitted to the genbank gq . the results state a suspicion that the virus could have been mobilized by some infected source brought by the commercial israeli based companies into india due to poor quarantine regulations as the gherkin cultivation in these regions is chiefly supported, purchased, exported and marketed by these private companies that are based from israel. this is the first report on molecular characterisation of zucchini yellow mosaic virus infecting cucumis anguira (gherkin) from india. they also exhibited synergism with other virus which was region specific. fifty percent of the total symptomatic plant population was found be positive only for carla while remaining showed mixed infection of carla with tospo in some regions while in others carla virus was found to be associated with cmv. presence of only carlavirus was up to - % incidence, without association of tospo, cmv, poty or tobamo viruses was also observed in some fields. avijit tarafdar, raju ghosh, k. k. biswas plant virology unit, division of plant pathology, indian agricultural research institute, new delhi citrus tristeza virus (ctv), a brown citrus aphid (toxoptera citricidus) transmitted closterovirus under family closteroviridae, is one of the major limiting factors in cultivation of citrus worldwide. ctv is a longest known plant virus having flexuous particle of nm in size. ctv genome is a positive sense ssrna of about kb nucleotide containing open reading frames (orfs) encoding proteins. several biological as well as genetic variants of ctv are reported in all the citrus growing countries in the world. ctv causes decline and death of millions of citrus trees in the world. in india, ctv is a century old problem, and has killed an estimated one million citrus trees till today. in molecular and genetic level, ctv isolates from india were not fully characterized. genetic diversity and sequence divergence in ctv isolates of india are not fully established. further, evidence of recombination and causes of evolution of ctv variants in india have not been studied till date. therefore, in the present study, effort has been made to characterize several indian ctv isolates in genetic level, examine their genetic diversity, identify recombination events and analyze evolution of divergent ctv. a total number of ctv isolates from different regions of india ( from darjeeling hills, five from bangalore, from delhi and from vidarbha) were under taken for genetic study. two genomic regions of ctv, i.e., entire cp gene (cpg) ( nt) and a gene fragment of orf a (orf a) ( nt) were amplified, cloned, sequenced and nucleotides were analyzed. based on cpg, indian isolates shared - % nucleotide identity, and based on orf a they shared - % identity, among them. incongruence of phylogenetic relationship was observed as on sequence analysis five phylogenetic clades based on cpg, and eight clades based on orf a, were generated suggesting the recombination events have been occurred between the sequences of indian ctv isolates. thus, to identify the potential recombination events, and determine the parental sequences in ctv isolates, six recombination detecting algorithms, namely, rdp, genconv, bootscan, maxchi, chimera and siscan were used. out of indian ctv, cpg of and orf a of isolates of ctv showed recombination events suggesting orf a was more prone and fragile to rna recombination as compared to cpg. this findings indicated that high degrees of genetic diversity and incongruent relationships of indian ctv isolates are due to genetic recombination occurred, which may be the important factors in driving evolution ctv variants in india, that was also supported by a splittree decomposition analysis. b. v. bhaskara reddy, y. sivaprasad, k. rekha rani, k. raja reddy department of plant pathology, regional agricultural research station, acharya n.g. ranga agricultural university, tirupati, andhra pradesh sunflower (helianthus annus l.) is one of the most important oil seed crops in the world which ranks third in area after soyabean and groundnut. the sunflower necrosis disease (snd) is characterized by necrosis of leaves, necrosis streaks on petioles, stem, floral parts and stunted growth. the causal agent of the disease has been identified as tobacco streak virus (tsv) which belongs to genus ilarvirus of the family bromoviridae. the suspected tsv infected sunflower samples collected from chittoor district in andhra pradesh were found positive for tsv-dac elisa. total rna was extracted from sunflower using rnaeasy isolation kit (qiagen). the tsv coat protein (cp) gene, movement protein (mp) gene and replicase (rep) gene were amplified by rt-pcr with specific primers, cloned in ptz r/t vector, sequenced and deposited in genbank (gu , gu and gu ). the size of cloned cp gene was bp and codes for amino acids. the cp gene sequence analysis revealed that the tsv-tpt infecting sunflower has - % homology at nucleotide level with soybean, tagietus-tpt and okra-tn isolates and - % homology at amino acid level. the movement protein gene was bp and codes for amino acids. the mp gene sequence analysis showed that it has - % homology at nucleotide level and - % at aminoacid level. chilli (capsicum annuum), the important commercial vegetable/spice of himachal pradesh, is affected by several viral diseases; of them cucumo, tospo, poty and gemini viruses are the most common genera. however, these viruses are not identified clearly and characterized fully, which are foremost needed to formulate the management strategy. therefore, in the present study, effort has been made to identify and characterize the important viruses causing diseases in chilli. in this study, several farms in major chilli growing areas of bilaspur and kangra districts in himachal pradesh were surveyed and infected plant samples were collected randomly. virus infection in these samples were detected by das-elisa using antisera to cucumber mosaic virus (cmv) and potyvirus (group specific) and through slot-blot hybridization (sbh) using cmv, iris severe mosaic poty virus (ismv), tomato spotted wilt tospo virus (tswv) and chilli leaf curl gemini virus (clcuv). based on das-elisa and sbh, the incidence of disease was estimated and ranged from . to . % by cmv and . to . % by potyvirus. to detect tospo and geminivirus in the infected chilli, sbh test was carried out. infected samples showed maximum virus titer in both das-elisa and sbh test were further confirmed by pcr using specific primers. desired sizes of amplicons; * bp, * bp, * bp and * bp of cmv, poty, gemini and tospo viruses, respectively, were obtained. as the present study clearly indicated that cmv appeared as a major one among the viruses infecting chilli in the hilly region of himachal pradesh, two isolates of cmv were characterized in genetic level. thus the amplified products (* bp) of cmv, palampur and palampur were cloned in pgemt cloning vector, sequenced and the sequences were submitted to ncbi database (palampur : acc-fm and palampur : acc-fm ). the sequences were then analyzed and compared with other sequences available in the data base. based on sequence analysis, it was found that present cmv isolates shared % nucleotide identity between them, are closely related with australian cmv isolate cmv-ly (acc-af ) by % nucleotide identity. in phylogenetic tree analysis, it was observed that indian cmv isolates formed same cluster along with cmv-ly. as it is known that cmv subgroup ii comprises cmv-ly, it is concluded that the cmvs of this hilly region of himachal pradesh belong to subgroup ii. chilli is essentially a crop of the tropics and grows better in hotter regions. chlii (capsicum annuum), a member of family solanaceae is an important vegetable and spice crop of immense commercial importance. the pungency in pepper is due to an alkaloid known as capsaicine and peppers are characterized as sweet, hot or mild depending on capsaicine content. the present investigation were conducted to find out the highly resistant cultivars of capsicum annuum against cmv and tylcv among ten cultivars of chilli in agroclimatic condition of aligarh. the highest ( and ) percentage of infection was observed in hc- and kalyanpur type- by showing the positive reaction to cmv by elisa test. no symptoms was recorded in case of bc- , lca- and jca- and showed negative reaction to cmv by elisa. bc- and lca- also showed negative reaction to tylcv by elisa and these were symptomless. maximum infection ( and ) was registered in hc- and c , cultivar. so, the bc- , lca- and jca- has proved highly resistant varieties against cmv and tylcv and these may be used in breeding programmes against viruses. cotton leaf curl virus belongs to the family geminiviridae, genus begomovirus. the members of this family contain circular single stranded dna molecules as their genomes. there are two kinds of begomoviruses-bipartite viruses with genomes consisting of two dna molecules designated dna-a and dna-b and the monopartite viruses which contain only dna-a but not dna-b. in monopartite viruses, the dna-a is accompanied by a small circular dna molecule called dna-b which is essential for the development of typical disease symptoms. cotton leaf curl virus is a monopartite virus and causes the cotton leaf curl disease which has emerged as a major disease of cotton in the indian subcontinent. the non-structural protein ac of cotton leaf curl kokhran virus-dabawali isolate (clcukv-dab) was cloned into pgex x vector and overexpressed in bl (de )plyss e. coli cells. the overexpressed gst-ac protein was purified by glutathione sepharose chromatography. the purified gst-ac protein was found to possess atpase activity. the optimum temperature and ph for the activity were °c and . respectively. the atpase activity was inhibited in presence of edta, showing that it is dependent on divalent metal ions. the activity was supported by magnesium, manganese and zinc ions but inhibited in presence of calcium ions. it was also inhibited by the non-hydrolyzable atp analogue adenosine-b, c-imido triphosphate and in the presence of other nucleotides like ctp and gtp. the k m and the v max of the reaction for atp as the substrate are . mm and . nmol/min/ mg of the protein respectively. the enzyme could also utilize gtp as the substrate. the fact that ac is specifically an ntpase and not a general phosphatase is revealed by the finding that it does not hydrolyze p-nitrophenyl phosphate to yield yellow colour while a similar reaction carried out in parallel with alkaline phosphatase readily yields the colour. it has been suggested earlier that ac may be involved in cell to cell movement of the virus (rojas et al. ) . it is possible that by its ability to hydrolyze atp, ac serves to power viral movement in the plant. thirteen sugarcane yellow leaf virus isolates causing yellow midrib and irregular yellow spot pattern from six states of india were characterized by rt-pcr assays. scylv- f and scylv- r primers were used as forward and reverse primer pairs and the amplified products were cloned and sequenced. comparative coat protein sequence analysis confirmed that all the scylv-indian isolates were clustered into two major groups confirming the existence of two strains of scylv affecting sugarcane crops of india. in a separate experiment, the member of both of the phylogenetic groups were found to be transmitted by the sugarcane aphid, melanaphis sacchari from infected to healthy sugarcane suggesting its secondary spread in nature. the symptoms produced by the virus causing cotton mosaic disease were little bit different in both sap inoculation and under natural field condition. in natural field condition it has shown clear chlorosis type of symptoms on major leaves of plants but in sap inoculated plants veinal chlorosis and mosaic type of symptoms are found to be common. in field conditions infected plants grows erect and have less boll formation. there is no effect found on seed shape or seed size. the initial symptoms produced on cotton leaves after inoculation were wonderful. local lesions observed in second week from inoculation and then they changes to chlorotic type of symptoms and some are necrotic symptoms also. the plants at early stage are found to be affected, has less lateral branch development and hence reduction in yield production. the naturally field infected plants showing good symptoms are also difficult to identify in lateral stage of plant. because they disappear with time. the virus is very easily sap transmissible. the virus is found to be transmitted by thrips palmi and thrips tobacci in persistent manner. no seed transmission is observed. virus showed same physical properties as it shows in stem necrosis of peanut or sunflower necrosis disease. the physical properties are found to be thermal inactivation point (tip) - °c, dilution end point (dep) - to - and longevity in vitro (liv) h, virus infecting nineteen different host plants are identified belonging to five different types of families viz. malvaceae, chenopodiaceae, compositae, leguminaceae and solanaceae. however they found to produce same types of symptoms as in most of the host that have been tested before. in elisa test report it is found that the virus showing positive test only with anti serum of tsv of a cowpea and cotton but negative reaction with pbnv of cowpea and cotton which clearly denied possibility of presence of pbnv in cotton producing these kinds of symptoms elisa report clearly shows that tsv antiserum of cowpea is showing positive results with clear chlorotic types of symptoms. a powerful approach to functional genomics, and an alternative to the massive generation of transgenic plants, is the use of the recently described virus induced gene silencing (vigs) process, which allows viral vectors to knock out the function of a gene-of-interest. vigs is based on a silencing mechanism that regulates gene expression by the specific degradation of rna. as a tool for reverse genetics, vigs has many advantages over other common ways to study gene function because of the ability of viruses to replicate and move systemically within a plant. vigs can generate a phenocopy of a mutant without all the troubles of traditional methods of mutagenesis. geminiviruses with their small dna genomes and ease of inoculation through agrobacterium, are excellent candidates for vigs vector development. as a first step, the geminivirus bhendi yellow vein mosaic virus, characterized in our lab (jose and usha, virology : [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] ) has been chosen. the satellite b dna associated with this virus has a single open reading frame (bc ). bc is essential for symptom development but not for replication. therefore, bc has been replaced by a multiple cloning site harbouring sali, xbai, bamhi, bsrgi and xhoi, initially in a cloning vector and then in the binary vector containing the partial tandem repeat of the b dna. in the place of the bc orf, the plant phytoene desaturase gene has been cloned and the resulting construct was used for agroinfiltration along with the partial tandem repeat clone of the begomovirus (dna a component chilli (capsicum annuum l.) plants exhibiting prominent symptoms of begomovirus like: leaf curl, vein swelling, shortening of petioles, crowding of leaves and stunting of plants were collected from rorkee, uttarakhand and dhaulpur, rajasthan, india. total genomic dna was isolated from naturally infected chilli samples and pcr was carried out with coat protein (located in dna-a) gene specific primers. as expected to the primers, * bp dna fragments were amplified from the infected chilli samples. to know the bipartite nature of the virus isolates, nuclear shuttle protein (located in dna-b) gene specific primers were employed which also resulted in positive amplification of * bp dna bands with all the coat protein tested positive samples. to ascertain the association of dnab component with the virus isolates, a set of dna-b specific primers were used which resulted in positive amplification of full length (* . kb) dna bands in the chilli samples collected from rorkee, uttarakhand, however, multiple sizes bands were resulted with the samples collected from dhaulpur, rajasthan. these findings confirmed that both the virus isolates under study are bipartite begomovirus associated with dna-b satellite. the sequencing of the pcr products is under progress which analysis will be discussed. groundnut bud necrosis virus (gbnv) belonging to the genus tospovirus, which is a unique member of the family bunyaviridae, infects several economically important crops. the virus has three genomic ssrna segments namely s (ambisense), m (ambisense) and l (negative sense). the s rna codes for nucleoprotein (np) and non-structural protein (nss) from viral complimentary and viral strands respectively. many viral nonstructural proteins such as ns of hepatitis c virus, yellow fever virus, dengue virus, sv large t antigen and cytoplasmic inclusion protein of tamarillo mosaic potyvirus are known to exhibit rna/dna stimulated ntpase, dntpase and helicase activity. nss of gbnv does not have any sequence similarity with any of the above mentioned viral rna/dna helicases but has a ntp binding domain. however, it has been implicated as suppressor of gene silencing in vivo. with a view to elucidate the mechanism by which nss could act as a suppressor of gene silencing and examine the other potential roles of nss in the life cycle of the virus, the gbnv (to) nss was over-expressed in e. coli and purified by ni-nta chromatography. in vitro studies with the purified rnss suggest that it exhibits an rna stimulated ntpase activity. many of the proteins that possess the rna/ dna stimulated ntpase and datpase activity, are also shown to have atp dependent nucleic acid unwinding activity. it was therefore of interest to examine whether nss has the nucleic acid unwinding activity. the helicase assays revealed that nss has dna/rna helicase activity. helicase activity of nss was absolutely dependent on atp and mg ? ion. nss could unwind dsdna substrate with overhang, or overhang. mutation of the crucial lysine in walker motif a (k ) severely affected the unwinding activity where as mutation of aspartate residue in walker motif b (d ) resulted in only % loss of activity. in this regard, rnss is a unique enzyme which does not have the canonical helicase motifs but can catalyze dsdna/dsrna unwinding in an atp and mg ? dependent manner. the rnss might act as a suppressor of by unwinding the dsrna, the substrate for dicer. in addition to being a suppressor of ptgs, nss may also regulate the viral replication and transcription by modulating the secondary structure of the viral genome. this new research finding on nss might pave way for further studies on its role in viral replication and transcription. yellow vein mosaic disease of pumpkin (cucurbita moschata) poses a serious threat to the cultivation of this crop in india. the disease was found to be associated with whitefly-transmitted bipartite begomoviruses were detected in varanasi field using polymerase chain reaction (pcr) with primer design through coat protein conserved region of begomoviruses from ncbi database. all plant samples showing symptoms were infected with begomovirus. the virus species were provisionally identified by sequencing * bp of the viral coat protein gene (av ageratum conyzoides is commonly known as billygoat-weed, chick weed, goatweed and whiteweed. in india it is popularly known as bill goat weed. it is an annual herbaceous plant with a long history of traditional medicinal uses in several countries of the world and also reputed to possess varied medicinal properties including the treatment of wounds and burns. in cameroon and congo, it is used traditionally to treat fever, rheumatism, headache, and colic. during survey in and around gorakhpur in , ageratum plants were found affected with the symptoms of leaf curling, mosaic mottling and leaf yellows. the infected leaf samples were processed for virus identification and association with pcr assays. total dna was extracted and pcr were performed with begomovirus specific primers (tlcv-cp). a * bp band was consistently amplified on % agarose. the pcr products were directly sequenced and sequence was submitted in genbank with the accession no. gq . the blast search analysis showed highest similarity of % with the ageratum enation virus. vernonia cinerea leaves with yellow vein symptoms were collected around crop fields in madurai. a bp product amplified from total dna extracted from symptomatic leaves with degenerate primers designed to amplify a part of the av gene from begomoviral dna a component was cloned and sequenced. based on the above sequences, specific primers were designed and the full length dna a of nucleotides with typical genome organization of begomoviral dna a was obtained and was submitted to embl data base (acc no: am ). the sequence comparison with other begomoviruses revealed the closest identity ( %) with emilia yellow vein virus from china and less than % with all known begomoviruses. the international committee on taxonomy of viruses (ictv) has therefore recognized vernonia yellow vein virus (vyvv) as a distinct begomovirus species. conventional pcr could not amplify the dna b or dna b from the infected tissue. however, the b dna ( bp) associated with the disease was obtained (acc no: fn ) by the rolling circle amplification-restriction fragment length polymorphism method (rca-rflp) using phi dna polymerase. sequence analysis shows that dna b of vyvv has the highest identity ( %) with dna b of ageratum leaf curl disease and - % with the b dna associated with other begomoviruses. infectious clones of vyvv dna a and dna b as dimers were made using the products of rca-rflp. these infectious clones will be used for agroinfection of vernonia and the results will be discussed. this is the first report of the molecular characterization of vernonia yellow vein virus (vyvv) from vernonia cinerea in india. production of bulb and seed crop of onion (allium cepa l.) is hampered by onion yellow dwarf virus (oydv) and iris yellow spot virus (iysv) with an incidence of . % and . % in bulb crop and . % and . % in seed crop, respectively in the popularly grown cv. hisar- . four symptom-based variants of oydv designated as grade a, b, c and d produced varied types of symptoms in onion crop incurring heavy losses in bulb and seed production. iysv caused tiny hay coloured spots of different shapes and sizes on leaves and scapes which later coalesced and led to drying and lodging of scapes. the plant height, bulb weight and bulb size were . cm, . g and . cm in plants infected with oydv, . cm, . g and . cm in iysv infection, . cm, . g and . cm due to their combined infection, as compared to . cm, . g and . cm respectively, in healthy plants of bulb crop. in plants infected with oydv grade a the plant height was minimum ( . cm) whereas the number of umbels was maximum ( . umbels/pl.) but other yield parameters viz., weight/umbel ( . g), number of seeds/umbel ( ), seed weight/umbel ( . g) and seed yield/plant ( . g) were recorded to be the lowest. the minimum reduction in plant height ( . cm), weight/umbel ( . g), number of seeds/umbel ( ), seed weight/umbel ( . g) and seed yield/plant ( . g) were recorded in oydv grade d. the plant height was . cm with . umbels per plant, . g weight/umbel, seeds/umbel, . g seed weight/umbel and . g seed yield/plant in iysv infected plants. the plant height ( . cm), umbels/plant ( . ), weight/umbel ( . g), number of seeds/umbel ( ), seed weight/umbel ( . g) and seed yield/plant ( . g) were found to be the lowest in combined infection of oydv and iysv diseases in comparison to higher values in healthy controls ( . cm, . , . g, , . g, . g, respectively). a minimum reduction in the test weight, germination and seed vigour index were found ( . g, . % and ) due to oydv grade a infection, whereas these were . g, . % and in iysv disease infected plants and . g, . % and in combined infection of oydv and iysv diseases in comparison to . g, . % and in healthy plants. the maximum hampering of seed vigour parameters was recorded due to iysv infection. lodging of scapes caused by this disease was responsible for heavy losses in seed production and seed quality. cotton leaf curl disease is one of the major threats to cotton cultivation from northern india. survey conducted during , observed the disease incidence ranged from to % from bhatinda, abohar, fazilka, sri ganganagar, hanumanghar. in order to study genetic variability in the virus, twelve clcuv isolates were partially characterized ( bp common region, full length av gene and partial sequences of ac and av gene). full length characterization of representative isolates from bhatinda, abohar, fazilka, sri ganganagar, hanumanghar is under progress. partial sequence analysis of clcuv isolates revealed that, the virus isolates collected during cropping season are closely related to cotton leaf curl burewala virus from pakistan and results were discussed. pratibha singh, h. s. savithri department of biochemistry, indian institute of science, bangalore tospoviruses, belonging to the family bunyavirideae, infect economically important plants such as groundnut, tomato, watermelon etc. they have a tripartite genome, with l, m and s segments of rna, in pseudo circular (panhandle) form. the viral genomes encode four structural proteins (l, n, g and g ) in the antisense orientation, and two non structural proteins nss and nsm in the sense orientation. the nsm is the only protein unique to tospoviruseses that infect plants in the bunyaviridae family and hence is proposed to be important for cell to cell movement. ground nut bud necrosis virus (gbnv), a member of the tospovirus genus, is the most prevalent virus infecting several species of leguminosae and solanaceae plants in india. total rna was isolated from gbnv infected tomato leaves and rt-pcr was performed using appropriate primers to amplify the nsm gene. the pcr product was cloned in pgex x vector. the recombinant nsm clone was transformed into bl (de ) e. coli cells and over-expressed by induction with . mm iptg. sds-page analysis of induced and uninduced fraction revealed the presence of overexpressed protein of expected size. the soluble gst-nsm was purified by gsh sepharose affinity chromatography. purified gst-nsm was shown to interact with in vitro transcribed rna transcript by electrophoretic mobility shift assay. further nsm was shown to interact with viral encoded proteins np and nss using elisa and yeast two hybrid system. nsm was also shown to be phosphorylated in vitro by pellet fraction of plant sap. thus the recombinant gbnv nsm possesses the characteristic features of a movement protein such as nucleic acid binding, interaction with nucleocapsid protein, and ability to undergo posttranslational modification. solanum melongena, commonly called as egg plant is one of the most important vegetable crop in the world. it is cultivated widely in the tropical and sub tropical regions. several viruses such as cucumber mosaic cucumo virus (cmv), potato virus-y (pvy), potato virus-x (pvx) and tobacco ring spot virus (trsv) infect egg plant under natural conditions. in india major crop losses due to cmv infection in brinjal is % (fao stat- ) . in the present study the infected leaf samples were collected from local fields of ramapuram, chandamama palli, chandragiri, madanapalli, yadhamari, durgasamudram villages in and around tirupati, were tested for cmv infection by dac-elisa with cmv antisera. the resulting positive samples were further inoculated to the raised brinjal seedlings of selected varieties through mechanical sap inoculation. different varieties of brinjal like mullabadhine, ankhur, ravya, mattigulla, casper and easter egg were used for monitoring the susceptibility to cmv infection. the mosaic symptoms were observed after weeks of inoculation in all varities of brinjal except mullabadhina. among all these susceptible varities ankhur variety is selected to study induced biochemical changes such as chlorophylls, carbohydrates, proteins, nucleic acids and polyphenol oxidases in cmv infected brinjal leaves. in the infected leaves considerable reduction in chlorophyll and starch and increase in total proteins, sugars, rna and polyphenol oxidases was observed when compared to healthy leaves. the amount of total starch, protein and dna decreased to about , and lg/g respectively in infected leaves, where as sugars ( lg/g), rna content ( lg/g) and polyphenol oxidase activity was increased as compared to healthy leaves. the above results suggests that there is an altered concentrations of chlorophyll, proteins, nucleic acids, carbohydrates and polyphenol oxidase activity in the brinjal leaves due to the effect of cucumber mosaic cucumo virus infection. leaf analysis was found to be used as widely accepted diagnostic tool to assess the nutritional status of the vegetables. the present study deals with these aspects in detail. the total rna and dna was isolated from infected leaf samples. rt-pcr assays were performed using sugarcane yellow leaf virus (scylv) specific primers (scylv- f and scylv- r). the infection of scylv was detected in all the collected samples, which showed the expected size (* bp) amplicon during rt-pcr. in another experiment with nested pcr analysis, a phytoplasma characteristic . kb rdna pcr product were amplified from dnas of all infected samples but not in healthy sugarcane plants tested using phytoplasma universal primer pairs p /p and fu /ru . dna extracts from plants with yellow mid rib and leaf yellows produced products of bp, which gave typical phytoplasma profiles when digested with hae iii and hha i. no pcr amplifications were produced using dna from symptomless plants. our results suggest that the yellow mid rib and leaf yellows symptoms on sugarcane varieties in uttar pradesh and uttarakhand states of india exhibiting midrib yellowing and leaf yellows symptoms is mainly caused by mixed infection of scylv and scylp. the affected clumps showed reduction in stalk height as compared to healthy fields. thirty-one sugarcane mosaic isolates belonged to sugarcane mosaic virus (scmv) and sugarcane streak mosaic virus (scsmv were collected from china and india), confirmed in indirect elisa and rt-pcr amplification with scmv and scsmv-specific primers. the amplicons ( . kb) from the coding region of coat protein (cp) were cloned, sequenced and compared to each other as well as to the sequences of scmv isolates from sugarcane (australia, usa, china, brazil, mexico and south africa), maize (australia, china, iranian) and one scsmv isolate from sugarcane (india) in genbank. maximum likelihood and maximum parsimony analyses robustly supported two major monophyletic groups that were correlated with the host of origin: the scmv subgroup that included isolates from china and only isolates from india, and the scsmv subgroup that contained all isolates from india. maize dwarf mosaic virus (mdmv) and johnsongrass mosaic virus (jgmv) were not detected in any of the samples tested. a strong correlation was observed between the sugarcane groups and the geographical origin of the scmv isolates. the millable sugarcane samples from china contained a virus tentatively described as sorghum mosaic virus (srmv). three isolates from nine chewing canes in fujian, yunnan and guizhou provinces of china also contained srmv, and the other samples including five isolates from india was found infected with scmv. no srmv infection has been detected in sugarcane mosaic samples from india. sequence comparisons and phylogenetic analysis indicated that srmv can be considered as the most common and prevalent potyvirus infecting sugarcane in china, however in india sugarcane streak mosaic virus is dominant in causing mosaic symptoms on sugarcane. dig-labeled dna probe complementary to coat protein (cp) region of tobacco streak virus (tsv) sunflower isolate was designed for the sensitive and broad-spectrum detection of tsv isolates, the most devastating virus in india. dot-blot and tissue print hybridizations with the digoxigenin labeled probe were performed for the tsv detection at field levels. here, dot-blot hybridization was used to check a wide number of tsv isolates with a single probe and sensitivity with different sample extraction methods. the probe with cp conserved region prepared from sunflower pcr amplicon was hybridized with the tsv field isolates of gherkin, pumpkin, sunflower, marigold and globe amaranth samples because of highly conserved with little variability in cp region. the sensitivity limits were decreased from total nucleic acid to partially purified and crude extract preparations. in particular, tissue blot hybridization offers a simple, reliable procedure as dot-blot, but requires no sample processing. because there is minimal sample preparation, tissue-print hybridization could be an important component of tsv management programs. thus, the above non-radioactive labeled probe techniques can facilitate in screening the samples during tsv outbreaks and in quarantine services. savita patil, rupali sawant*, k. banerjee virology group, agharkar research institute, macs, g.g. agarkar road, pune two mycobacterium smegmatis strains (ari lab nos. v and v ) were employed for the isolation of mycobacteriophages from soil and sewage samples. mycobacteriophages were isolated from soil samples collected from an area surrounding the tuberculosis (tb) ward, naidu hospital, pune, against m. smegmatis strain v . these were numbered as v , v and v and were isolated by using washed-cell preparation method. the bacteriophages against the other m. smegmatis strain, i.e. v , were isolated from soil samples (collected from around tb ward, sassoon hospital, pune). some of these phages (viz.v , v ) showed plaques at °c but not at °c. thus they seem to be lysogenic. for propagating and increasing the titre of all the above isolates, various previously described methods were attempted, but none of these methods were satisfactory. but when siliconized glassware and plastic-ware were used, propagation was successful. we showed that siliconization of glassware and plastic-ware was essential for the propagation of our mycobacteriophage isolates v , v , v , v and v . also, phage dilution medium (pdm) as described by chaterjee et al. ( ) was found to be effective for picking out of the plaques made by the phages. in this way, the phage isolates were propagated up to p . the various passages of the phage isolates v , v , v , v and v (i.e. original, p , p and p ) were stored at - °c. pvp- effect on pigments due to geminivirus infection on cowpea (vigna unguiculata) shail pande*, naveen pandey, k. shukla mahatma gandhi p. g. college gorakhpur, d.d.u. gorakhpur university, gorakhpur geminiviruses are one of the most important group of viruses causing economic losses in tropics. the symptom produced are yellowing of leaves which directly affect the pigments of diseased plants it in turn affects productivity and yield of diseased plant. cowpea vigna unguiculata is one of the important crop cultivated throughout india for its green pods which are used as vegetables and seeds are used as pulse. cowpea is affected by many viruses amongst them geminiviruses are one of the important virus on the cowpea plant. in the present study total chlorophyll content was studied in leaf of cowpea of diseased and healthy plants using arnon's method. carotenoids were also studied using ikan's method. it was found that chlorophyll content in diseased plants were lower compared to healthy plant similar results were found with carotenoids so the geminivirruses infection lowers the chlorophyll and carotenoid content in diseased plants which reduces yield of diseased cowpea plant. shweta sharma , amrita banerjee , j. tarafdar , r. rabindran , indranil dasgupta * department of plant molecular biology, university of delhi, south campus, new delhi; bidhan chandra krishi vishwavidayalaya, kalyani, nadia, west bengal ; tamil nadu agricultural university, coimbatore, tamil nadu rice tungro disease is an important disease of rice, caused by a joint infection by two viruses: rice tungro spherical virus (rtsv) and rice tungro bacilliform virus (rtbv) in south and southeast asia. the complex of rtbv and rtsv is transmitted by an insect vector green leaf hopper (glh). previously we reported complete genomic sequences of two geographically distinct isolates of rtbv; rtbv-wb (west bengal) and rtbv-ap (andhra pradesh) collected from the field in mid- s. both the sequences showed high homology all along the genome but showed divergence from previously reported southeast asian isolate i.e. rtbv-phil (philippines). to check whether a time period of a decade has resulted into variability in the genomic sequence of different isolates of rtbv in india, we cloned and sequenced the complete genome of rtbv from two geographically distinct regions of india i.e. west bengal and kanyakumari collected from the field in . the complete nucleotide sequence of the dna fragments covering the whole genome of rtbv was determined using universal primers m f and m r and by primer walking, without any ambiguities remaining. the nucleotide sequences of overlapping clones were assembled and analyzed using the dna analysis software generunner and blastn program of ncbi. homology search at the nucleotide and amino acid level were performed using the blastn and blastp (respectively) programs of ncbi. multiple sequence alignments were performed using clus-tal-w software. sequence analysis results thus obtained showed that both the recently obtained complete genomic sequences of rtbv from two geographically distinct regions of india i.e. west bengal and kanyakumari showed very high homology (both at the nucleotide and amino acid levels) with the two previously reported rtbv isolates from india i.e. rtbv-wb (west bengal) and rtbv-ap (andhra pradesh) all along the genome. as observed earlier both the sequences diverged significantly from the southeast asian isolates. this suggests that even after the spatial and temporal difference (a time gap of approx years) between the two previously reported rtbv isolates and the recently reported one, there is very little sequence variability between them. this further strengthens the earlier reports that the rtbv genomes in india are highly conserved. homology search at the nucleotide level using blastn program with the previously existing rtbv isolates revealed a very high percentage identity of % with the rtbv west bengal isolate and % with the rtbv andhra pradesh isolate. this further strengthens the earlier reports that there is not much genetic variability in the rtbv genomes in indian subcontinent. complete genomic rna sequences of two geographically distinct isolates of rice tungro spherical virus (rtsv), a member of the genus waikavirus, family sequiviridae, were determined from india. out of the two previously reported sequences, the indian isolates were closer to the resistance breaking strain rtsv-[vt ] than rtsv- [phila] . between them, the indian sequences showed nucleotide as well as amino acid identities of %. a moderate homology was observed between the leader peptide and a putative helper component protein involved in insect transmission of the maize chlorotic dwarf virus, a closely related waikavirus, indicating its possible transmission-related function. unlike rice tungro bacilliform virus, which causes rice tungro disease jointly with rtsv, and is significantly different between isolates from india and philippines, rtsv genomes were observed to be much more conserved between isolates from the two countries. rice tungro bacilliform virus (rtbv) are believed to be the joint causative agents for the devastating tungro disease of rice prevalent in south and southeast asia [ ] . rice tungro disease has become the major cause of production losses in rice during last three decades in several rice growing states of india. here, we report, for the first time the complete sequence analysis of two geographically distinct indian isolates of rtsv. we analyze the deduced protein sequences and their phylogenetic relationship with the two complete rtsv sequences from philippines as well as with other members of sequiviridae family. we provide molecular evidence that the indian isolates of rtsv are closely related to those from the philippines. we had earlier reported that rtbv isolates between india and philippines differ significantly from each other [ ] . this study was undertaken in order to see whether rtsv isolates from india also show similar difference from those reported from the philippines. frequent outbreaks of tungro were reported near kanyakumari in the last - years. the present work was undertaken to clone and sequence the full-length rtbv and rtsv genomes from the infected rice plants collected from above region and to analyze the similarity of its genetic material with the existing indian isolates of rtbv and rtsv. a . kb dna fragment encoding the reverse transcriptase gene of rtbv genome was amplified and cloned in t/a vector and was sequenced commercially. homology search at the nucleotide level using blastn program with the previously existing rtbv isolates revealed a very high percentage identity of % with the rtbv west bengal isolate and % with the rtbv andhra pradesh isolate. this further strengthens the earlier reports that there is not much genetic variability in the rtbv genomes in indian subcontinent. similarly, the cp region of rtsv was amplified by rt-pcr and was cloned in t/a vector. recently, rice tungro disease has been reported from kanyakumari district of tamil nadu. it is important to determine the genetic nature of this isolate in order to develop resistance strategies. it is thus necessary to clone and characterize the viruses from kanyakumari and to determine the mechanism of virus resistance in transgenic lines. rice tungro disease is an important viral disease of rice. rice tungro is caused by infection by two viruses: rice tungro bacilliform virus (rtbv) and rice tungro spherical virus (rtsv). rtsv is a plant picornavirus with a kb single stranded rna genome. it belongs to genus waikavirus in the family sequiviridae and is necessary for transmission of the two viruses by the leafhopper vector nephotellix virescens. rtsv rna is translated to form a large polyprotein, which is then self cleaved to form the viral proteins, including the three coat proteins, replicase, protease. studies have been conducted on rtsv from philippines. correct information of sequence variability of viral isolates to check whether different geographical conditions like those present in india select for genotypically variable strain and to design for transgenic resistance strategy, information on rtsv from india is absolutely essential. the objective of this study was to clone rtsv isolates from india and compare the genetic diversity of indian isolates from other southeast asian isolates and amongst each other. also develop strategy to impair the attack of virus-complex on rice. the achieve this, complete genomes of two isolates from india were cloned by amplifying different genes by rt-pcr and subsequently cloned in ta vectors, followed by sequencing. subsequently constructs containing cp - , antisense replicase, sense replicase and double stranded replicase were cloned in plant transformation vector. these constructs were used to transform aromatic rice variety from indian-pusa basmati (pb ). pcr analysis of the above plants was done to check the stable insertion of insert in the transgenics. jatropha (jatropha curcas) of the family euphorbiaceae is being grown in india as a major commercial fuel (bio-diesel) crop. jatropha is cultivated in districts of potential states of india. unfortunately, the cultivation of jatropha is limited by the severe mosaic disease. recently, a severe mosaic disease with significant disease incidence was observed in - on j. curcas grown in experimental plots of nbri and j. gossypifolia, a weed growing road side around lucknow and kathaupahadi, madhya pradesh. the disease consisted of the symptoms of severe mosaic, blistering, leaf distortion and stunting of whole plant and no fruit/seed production in severely affected plants. symptomatology and whitefly population observed on them suggested the occurrence of begomovirus infection. to detect the begomovirus infection, the total dna from leaf samples of infected jatropha plants was extracted and polymerase chain reaction (pcr) were performed using three sets of begomovirus genus specific (cpit-i/cpit-t, paliv /paric and paliv /palic ) primers and the expected size * bp, . kb and . kb amplicons were obtained which confirmed the begomovirus infection. further to identify the begomovirus/es and investigate the genetic diversity among them exists if any, the * . kb amplicons were cloned and sequenced. the sequence data were deposited in the genbank database under accession nos.: gq and fj (from j. curcas) and eu and fj (from j. gossypifolia). during blast analysis gq and fj shared highest % sequence identity with each other and - %% with sri lankan cassava mosaic virus (aj , aj , aj , aj and aj ) and indian cassava mosaic virus from india (ay ) therefore, designated as two strains of jatropha mosaic india virus-lucknow. blast analysis of eu showed maximum % similarities with croton yellow vein mosaic virus (aj ), % with tomato leaf curl new delhi virus (dq ) and - % with papaya leaf curl virus (aj and y ), therefore, identified as strain of croton yellow vein mosaic virus. blast analysis of the virus isolate (fj ) showed highest % identities with tomato leaf curl virus-bangalore ii (tolcv-b ii-u ) and - % with tomato leaf curl karnataka virus (tol-ckv, ay , fj ), therefore, considered as new begomovirus species ''jatropha yellow mosaic india virus''. the phylogenetic analysis of gq and fj (from j. curcas) and eu and fj (from j. gossypifolia) was performed along with some selected isolates of begomovirus which showed [ % sequence identities during blast analysis. the isolate eu showed closest relationship with croton yellow vein mosaic virus while fj showed separate clustering of all the four begomovirus from jatropha species. during phylogenetic analysis these isolates formed three separate clusters, therefore, they were considered as three distinct begomoviruses. the above data clearly show that some genetic diversity exists among the begomoviruses infecting jatropha species in india. bitter gourd (momordica charantia l.) of the family cucurbitaceae, also known as bitter melon is extensively cultivated in north eastern region of uttar pradesh, india. it is regarded as one of the world's major vegetable crops and has great economic importance. a severe yellow mosaic disease on bitter gourd (momordica charantia) with a significant disease incidence was observed during the survey of different locations of eastern up, india in the year . the whitefly (bemisia tabaci) population was also observed in the vicinity. the characteristic disease symptoms and whitefly population indicated the possibility of begomovirus infection. total dna were isolated from infected as well as healthy leaf samples. two primer pair (tlcv-cp and roja's primer) were used to study, which resulted * bp with tlcv-cp in / samples and * . kb amplicons with roja's primer in / samples. for further identification of the begomovirus, the pcr amplicons were cloned and sequenced (genbank accession no. eu and eu , respectively). the blastn search analysis of eu indicated - % identity with several isolates of tomato leaf curl new delhi virus (tolcndv). the phylogenetic analysis also showed closest relationships of the isolate (eu ) with tolcndv isolates. based on highest sequence identity and closed relationships with tolcndv the virus isolated from bitter gourd was considered as an isolate of tomato leaf curl new delhi virus. while, blastn search analysis of eu isolate, shared highest - % identites with pepper leaf curl bangladesh virus (peplcbv) isolates. the phylogenetic analysis of the virus isolate with selected begomovirus isolates revealed a closest relationship with peplcbv. these results confirmed the association of peplcbv on bitter gourd. study revealed the variability of viruses on bitter gourd in eastern up, india. tobacco streak virus groundnut isolate was characterized biologically by taking six cultivars (jl , tmv , k , k , k ) and one pre-release culture (k ) using seedlings of - days old under glasshouse conditions. there were clear differences were observed among cultivars tested regarding incubation period, percent seedling wilt and time taken to death of seedlings. k- was least susceptible among all the cultivars tested and it supported least virus titer (a nm: . - . ). both localized (necrotic lesions on leaf, veinal necrosis, leaf yellowing, wilting) and systemic (petiole necrosis, necrotic lesions on young leaves, death of top growing buds not only on main stem but also on all primaries (side shoots), followed by stem necrosis, stunted growth, axillary shoot proliferation with small leaves having general chlorosis, peg necrosis, pod necrosis, pod size reduction, wilt of plants) symptom were observed in all cultivars tested. biological differentiation of tsv and gbnv was made by sap inoculation of both viruses separately using susceptible groundnut cultivar jl under glasshouse conditions. there were certain similarities and differences were observed between these viruses infecting groundnut. seed infection of tsv ranged from . to . % in seeds collected from naturally infected and sap inoculated groundnut cultivars/pre-releases (jl , tmv , k- , k- , k- and k- ) belonging to spanish and virginia types. tsv was detected both in pod shell and seed testa from pod samples produced by sap inoculation under glasshouse conditions. however, seed transmission of tsv was not observed in groundnut. coat protein (cp) gene of three groundnut tsv isolates (gn-ap- - ; gn-ap - ; gn-ap - ) were sequenced and all the three isolates contained a single open reading frame (orf) of bp nucleotide and could potentially code for amino acids (aa). cp gene of tsv isolates originating from different hosts shared high degree of sequence identity both at nucleotide ( . - %) and amino acid ( . - %) levels respectively. tones grown in an area of . . ha (fao stat ). in india papaya is grown in nearly , ha with an annual production of , , tones (fao stat ) and occupies fourth place in the world. the crop is severely affected by a number of viruses. papaya ring spot virus (prsv-p) is the most important virus. the detection of virus infection in plants has traditionally involved either bioassay on indexing plants and or immunological methods (hill , torrence and jones ) . use of nucleic acid probes has improved the detection and sensitivity of viruses. the most common non-radioactive probes are biotynilated probes, which are very specific and sensitive. papaya ring spot virus (prsv-p) is a positive sense ssrna virus belonging to the genus potyvirus family potyviridae and transmitted by aphids. prsv-p coat protein gene region was used as template cdna for probe preparation. dot-blot hybridization with the biotin labeled probe were performed for prsv-p detection. the clarified sap of healthy and infected plants were serially diluted and spotted onto the nitrocellulose membrane, hybridized to biotin labeled probe. biotin labeled rna's are employed as probes, with a subsequent detection based on streptavidin-alkaline phosphatase conjugates. the sensitivity for viral detection of the biotin labeled probe was found to be sensitive than enzyme linked immunosorbent assay (elisa). in recent years tospovirus is causing devastating damage to the yield of vegetables in india. it infects economically important crops viz., tomato, chilli, peppers, groundnut, watermelon and various legumes. now it is emerging as severe disease in brinjal also. in order to monitor the natural occurrence and distribution of tospovirus in vegetable, surveys were conducted in the predominant brinjal growing areas of gujarat, karnataka, maharashtra and andhra pradesh during - incidence ranging from to %, to %, to %, and to . % respectively. samples collected from different places of india were found positive to pbnv in direct antigen coating-enzyme linked immunosorbent assay (dac-elisa). pbnv infected brinjal plants showed mosaic mottling of leaves with leaf distortion, longitudinal streaks on the stem and necrotic rings on leaves and fruits. early infection led to severe stunting and abnormal fruiting. biological and molecular characterization of pbnv-brinjal isolates were compared with other isolates and results are discussed. for identification of virus causing mosaic symptoms on soybean various host plants were tested. plants species belonging to the different families viz. caricaceae, graminae, leguminosae, malvaceae and solanaceae were tested. the virus produced symptoms on diagnostic plant species like chenopodium album, c. quinoa, helianthus anus, phaseolus vulgaris and vigna ungiculata. among tested families the leguminosae that were the host of virus included arachis hypogea, the virus causing mosaic symptoms in soybean is inactivated between and °c and between dilution of - to - . all the inoculated plants of assay host showed the symptoms at °c but not at °c. similarly local lesions produced at - but not at - . the virus in crude sap was infectious up to h but not at h at room temperature. however, the percentage infectivity decreased progressively as the aging of the sap was increased at room temperature. on the basis of reactions on diagnostic hosts pvp- identification and characterization of potyvirus infected chilli (capsicul annum l the virus under study caused mild mosaic and severe mottling symptom in leaves of infected plants. the dilution end point (dep) of the virus was found to be - to - , longevity in vitro (liv) - days at room temperature ( °c), thermal inactivation point (tip) - °c. electron microscopy of purified virus preparation revealed the presence of flexuous particle of size nm long and nm in width with characteristic cytoplasmic inclusions: pinwheels and scrolls. the virus was transmitted by sap and by aphid myzus persicae. the host range study revealed that the host species were restricted to family chenopodiaceae and solanaceae. on the basis of above characteristic, the virus under study was identified as potyvirus associated with mild mosaic and severe mottling symptom in capsicum. phytoplasma causing grassy shoot disease and sugarcane yellow leaf viruses are important pathogens of sugarcane. these pathogens are causing severe losses in sugarcane productivity. with a view to producing virus and phytoplasma free planting material of sugarcane, experiments were undertaken using infected varieties of sugarcane growing at the farms of sugarcane research institute. apical meristems measuring about mm in length, were dissected out, surface sterilized and cultured on agar gelled murashige and skoog's (ms) medium containing growth regulators for shoot induction. the established shoot cultures were multiplied through repeated subcultures on fresh media at - days interval. elimination of gsd and scylv was confirmed through molecular analysis of regenerated plants using specific primers of scylv and gsd. results revealed that apical meristem culture technique is effective in eliminating the pathogens like scylv and phytoplasma (gsd) from the infected clones. this is probably the first report on elimination of grassy shoot disease in sugarcane through meristem culture. papaya ringspot virus (prsv), which causes the most widespread and devastating disease in papaya, isolates originating from different geographical regions in south india were collected and maintained on natural host papaya. the entire coat protein (cp) gene of papaya ringspot virus-p biotype (prsv-p) was amplified by reverse transcription-polymerase chain reaction (rt-pcr). the amplicon was inserted into pgem-t vector by t-a cloning method, sequenced and sub cloned into a bacterial expression vector prset-a using directional cloning strategy. the prsv coat protein was over expressed as fusion protein in e. coli. sds-page gel revealed that cp expressed as a * kda protein. the recombinant coat protein (rcp) fused with his-tag was purified from e. coli using ni-nta resin. the antigenicity of the fusion protein was determined by western blot analysis using antibodies raised against purified prsv. the purified rcp was used as an antigen to produce high titer prsv specific polyclonal antiserum. the resulting antiserum was used to develop an immunocapture reverse transcription-polymerase chain reaction (ic-rt-pcr) assay and compared its sensitivity levels with elisa based assays for detection of prsv isolates. ic-rt-pcr was shown to be the most sensitive test followed by dot-blot immunobinding assay (dbia) and plate trapped elisa. key: cord- -jxbhdch authors: koroleva, olga; mckeown, peter; pendle, alison; shaw, peter title: proteomic analysis of the plant nucleolus date: journal: plant proteomics doi: . / - - - - _ sha: doc_id: cord_uid: jxbhdch the nucleolus is a prominent sub-nuclear structure found in all eukaryotes. it is where the ribosomal rna genes are transcribed and ribosomes are synthesised. however, much evidence has now accumulated that the nucleolus is involved in many other nuclear processes. nucleoli are of moderate protein complexity, comprising a few hundred proteins, and can be isolated for proteomic analysis. in this chapter we describe the purification and analysis of plant nucleoli by proteomic methods and summarise the current results. we also discuss more specific tagging methods that have been used to analyse individual protein complexes, as well as methods for analysing post-translational modifications of nucleolar proteins. finally we discuss the assessment of the reliability of such proteomic data, and the presentation and curation of this type of data. the nucleolus is a prominent structure found in the nuclei of all eukaryotes. it is where ribosomal rna genes are transcribed by rna polymerase i and where these transcripts are processed to form pre-ribosomes. like all such nuclear substructures, the nucleolus lacks any bounding membrane, and must therefore be held together by intermolecular interactions as well as by the continuity of the rdna within it with the rest of the nuclear dna. it has been suggested that the biochemistry of ribosome biogenesis itself is responsible for the existence of the nucleolus as a distinct compartment (melese and xue ; hernandez-verdun et al. ) and nucleolar volume correlates with transcription levels. however rrna transcription occurs without nucleolus formation in archaea (omer et al. ) , whereas nucleoli are formed in organisms lacking rna polymerase i ( conrad-webb and butow ) or multiple rrna repeats (matsuzaki et al. ) , suggesting that the formation of a nucleolus is not essential to rrna transcription, but is an adaptation to create a compartment in which pre-ribosomes may be formed with high efficiency (raska et al. ) . however, much evidence now suggests that there must be more to nucleolar structure and function than making ribosomes. in the first place, nucleolar structure varies considerably between species, between cell types and between individual cells. second, a number of unexpected, unconventional activities have been localised, at least in part, to the nucleolus (raska et al. ) . in most cases these unconventional activities have been investigated in only one or two species, so that it is not yet clear whether they are general nucleolar functions or specific adaptations. for example the nucleolus has been implicated in many other aspects of rna biology, including biogenesis of snrnas and snornas, of the signal recognition particle, of trnas and rnase p, and of telomerase, as well as in mrna surveillance. the nucleolus has also been linked to viral infections, to cell cycle regulation, to cancer and to stress responses (hiscox ; rubbi and milner ; maggi and weber ; mayer and grummt ; yuan et al. ) . in many cases the involvement of the nucleolus was discovered by localising factors involved in the various processes to the nucleolus. however, recent approaches using high throughput localisation studies and proteomic analysis of purified nucleoli have put these unconventional activities on a more systematic basis (scherl et al. ; andersen et al. ; pendle et al. ) . for the most part, proteins are recruited to the nucleolus as a consequence of their intermolecular interactions rather than by recognisable nucleolar targeting sequences. this has frustrated attempts to predict nucleolar localisation in silico. nucleolar localisation sequences (nols) exist in viral proteins, such as the plant umbravirus orf protein (kim et al. ) and both potato leaf-roll virus (plrv) capsid proteins (haupt et al. ) . indeed, the nucleolar location of one plrv capsid protein was predicted through nols identification (haupt et al. ) , and comparison of coronavirus n-protein sequences allowed similar predictions for components of the sars virus (reed et al. ). however, these reflect binding to existing nucleolar proteins (nucleolin, in the latter case) and the interacting sequences are not well conserved. similarly, a native nucleolar protein may include sequences important for localisation, as in the basic c-terminal domain of yeast nop (fujiwara et al. ) but even this is not widely conserved. hence, nucleolar localisation remains difficult to predict. without reliable prediction, direct determination of the composition of nucleoli is the best option, and it has recently become possible to use high-throughput proteomic techniques to identify large numbers of nucleolar proteins from complex mixtures derived from purified nucleoli. direct visualisation of subcellular localisation has also been used in mice to identify nucleolar proteins amongst other nuclear proteins via an enhancer-trap system, although this study took as a starting point a broad range of predicted nuclear proteins of which only % were exclusively nucleolar (sutherland et al. ) . in pilot screens aimed at systematically localising proteins in mammalian (simpson et al. ) and plant cells (cutler et al. ; escobar et al. ; tian et al. ; koroleva et al. ) by systematic green fluorescent protein (gfp) open reading frame fusion expression, - % of gfp fusions were highly enriched within the nucleolus. as expected, these included many rna processing functions as well as protein kinases and phosphatases that may regulate various aspects of rrna metabolism (koroleva et al. ) . perhaps more surprising was the finding that certain transcription factor-like proteins were preferentially located in the nucleolus, whereas related family members were found mainly in the nucleoplasm (tian et al. ; koroleva et al. ) . the two techniques commonly used for proteomic purposes are d polyacrylamide gel electrophoresis ( -de) and liquid chromatography / mass spectrometry (lc/ms) methods. although d gel approaches do have some advantages (see chap. by hurkman and tanaka, this volume), such as quick visualisation of the distribution of major protein groups, accessibility and relatively low basic costs, and ease of sample and data storage, they are applicable only to the major protein components of the proteome, and not suitable for proteins with extreme values of mr (relative molecular mass) or pi (isoelectric point), hydrophobic membrane components or low-abundance proteins. more advanced uses of this methodology, such as difference in gel electrophoresis (dige) (unlu et al. ; lilley and dupree ) significantly increase the costs -fluorescent dyes and image analysis software are expensive -but the dynamic resolution is still not very high. furthermore, since many plant proteins are extensively modified, plant protein mixtures do not generally resolve very well on -de gels. it has been estimated that single gel-based analyses allow the identification of approximately % of expressed cellular proteins (heazlewood and millar ) . only an estimated nucleolar proteins had been identified by such techniques prior to the use of high throughput ms methods . therefore, there seems be more future in non-gel-based approaches for analysing the plant proteome, such as recently developed techniques like lc-maldi ms; lc esi ms/ms and mudpit (aebersold and mann ) . nevertheless, there are several examples where a combination of d gel separation with subsequent identification of proteins from excised spots have allowed analysis of the plant nuclear proteome. in arabidopsis, - spots were detected on d gels, and analysis using maldi-tof (matrix assisted laser desorption / ionisation-time of flight) ms led to the identification of spots corresponding to different proteins (bae et al. ) . in rice nuclei isolated from suspension culture cells, from a total of proteins resolved on -de, proteins were identified by maldi-tof ms from major protein spots (khan and komatsu ) . maldi-tof is the most widespread type of ms analysis. this method simply measures the mass of each ionised peptide, providing a "fingerprint" of protein composition, with protein identification based on matches between measured masses of peptides and predicted masses of proteolytic cleavage products of the proteins present in databases. the caveat is that with the increasing length of the peptide chain, many peptide fragments with different sequences will have very similar masses. the same problem sometimes results from protein modifications. besides maldi, another common ionisation method is electro-spray ionisation (esi). the multiply charged ions generated by esi often produce ms/ms spectra that are cleaner and potentially easier to interpret (bodnar et al. ) . in general, thorough protein separation/purification is required for esi, since this technique is very intolerant of any contaminants. a more complex analytical approach is tandem mass spectrometry (ms/ms), which has two ms stages, often a combination of quadrupole and time-of-flight mass detectors (q-tof). in this technique, peptide fragments of a given mass are resolved by the first ms stage, then fragmented in a collision chamber and the mass fragmentation patterns are recorded by the second ms stage. the advantage of tandem ms over single ms fingerprinting is that the precise sequence of amino acids in each peptide can be determined, which allows much more reliable identification. an important principle for the preparation of samples for proteomic analysis is to reduce sample complexity by protein fractionation, therefore increasing the possibility of detecting proteins with lower abundance in the complex protein mixture. very commonly, lc separation precedes the ionisation and ms stages. maldi tandem mass spectrometers allow a preliminary lc separation "off-line", in which the eluted fractions are spotted onto a maldi sample plate for later ms analysis. therefore, lc-maldi techniques do not suffer from the time constraints imposed by the transient presence of peptides eluting from a column; if necessary, each sample can be analysed more than once. on the other hand, with lc-esi techniques the sample eluted from the lc is sprayed directly into the esi source and data is acquired immediately. because in lc-maldi the lc is decoupled from the ms analysis, the data is not available immediately, as it is with esi analyses, and chromatography problems can be present but undetected until too late. mudpit (multi-dimensional protein identification technology) is a development of an online lc approach, in which two sequential microcapillary columns (an ion exchange column followed by a reverse phase column) are used to separate very complex mixtures of peptides for ms/ms analysis. this type of analysis allows the identification of very large numbers of proteins/peptides from complex mixtures with the minimum of pre-treatment. nucleoli are not bounded by membranes and, since they contain rdna, they are connected by these dna strands to the rest of the nuclear chromatin. for these reasons, all methods for preparing nucleoli have relied on mechanical fragmentation procedures. in the case of plant cells, the cell wall must be digested by degrading enzymes, such as cellulases and pectinases, to produce protoplasts. this is necessary to avoid the nuclei and nucleoli being trapped inside the cell wall residues after cellular fragmentation. pendle et al ( ) used arabidopsis suspension cultures as a convenient, reproducible and abundant source of cells. the cells were protoplasted, and then the protoplasts were carefully disrupted by a small number of strokes of a stainless steel homogeniser, with a clearance between the piston and chamber of µm, observing the state of the preparation after every few strokes. in general the first few strokes released mostly intact nuclei, while further homogenisation then disrupted the nuclei to release nucleoli. nucleoli constitute the densest cellular component with the exception of starch granules, which may be an unavoidable contaminant, and can be quickly and efficiently purified from the other cellular debris by differential centrifugation. one of the most critical factors is the mg + concentration. magnesium ions causes chromatin to cross-link into an unworkable network, and centrifugation steps then bring down the chromatin with the nucleoli enmeshed in it. to solve this problem, pendle et al ( ) simply left mg + out of the homogenisation buffer. the nucleoli could then be separated from the chromatin. however, a small concentration of mg + was added immediately after centrifugation, since lack of mg + caused a gradual disintegration of the nucleoli. similar methods have been used for the purification of nucleoli from human cell culture scherl et al. ) , although in this case nuclei were purified first, with sonication being necessary to release the nucleoli. this may indicate that human nucleoli are more tightly associated with the chromatin and the rest of the nucleoplasm. we have extracted nucleoli from arabidopsis seedlings on a pilot scale, although not so far on a preparative scale, by chopping tissue with a razor blade to release nuclei, which can them be homogenised in the same way as culture cells and nuclei. studies during the past few years using live cell imaging of fluorescently tagged proteins have caused a reappraisal of the dynamic nature of the nucleolus, and indeed all other nuclear compartments. it has become clear that virtually all nuclear and nucleolar proteins are rapidly exchanged with the nucleoplasm, and that what distinguishes 'nucleolar' components is a mean nucleolar residence time of the order of seconds rather than of fractions of a second (phair and misteli ; raska et al. ). this raises the interesting questions of how nucleoli can be isolated as distinct structures over the timescale of minutes or hours necessary for the purification, and how the structures isolated relate to nucleoli seen in living cells. it would be expected that most of the associated protein and other mobile factors would be lost to the medium during extraction and purification, leaving only the most strongly attached factors. the fact that most of the expected proteins are present in purified nucleolar fractions suggests that the situation is more complicated. one possibility is that, along with the rapidly exchanging population of proteins, there is a more stable population. another possibility is that the changes in medium, ionic strength, metabolite concentrations, etc., that occur during nuclear fragmentation and nucleolar purification cause the dynamic exchange of many of the proteins to stop, either because active processes necessary are halted or because changes in the medium cause an effective precipitation of nucleolar contents into a less soluble state. it is important to bear in mind that these processes may affect different nucleolar constituents to different extents, and that 'purified' nucleoli are unlikely to have exactly the same composition as their in vivo counterparts. however, extracted arabidopsis nucleoli remain transcriptionally competent (p. mckeown and p. shaw, unpublished data), which gives some confidence in the validity of the purified fractions. these approaches were successfully applied in the two first published proteomic analyses of human nucleoli purified from hela cells, in which proteins and proteins (scherl et al. ) , respectively, were identified. recently, a new study identified proteins within nucleoli prepared from hela cells . so in total, the results obtained from the three independent analyses of hela nucleoli provided a list of individual proteins ). the latest study by andersen et al ( ) employed both an lc ms/ms q-tof instrument and a linear ion trap fourier-transform ioncyclotron resonance mass spectrometer (ft-icr-ms), which provided very high resolution and mass accuracy. the development of hybrid mass spectrometers employing ft-icr potentially presents new opportunities for proteomics analysis of the nucleolus . the arabidopsis nucleolar preparation obtained as described above from suspension cell cultures was subjected to high throughput proteomic analysis as described by pendle et al ( ) , which identified proteins. this has been subsequently increased to over proteins by a mudpit approach using twostage microcapillary lc linked to a q-tof instrument (p. mckeown, p. shaw, a. bottrill, unpublished data). many of the proteins that were identified were expected: known nucleolar proteins, ribosomal proteins, proteins involved in rdna transcription, and other rna-interacting proteins involved in ribosome biogenesis. however, many unexpected proteins were also found in the nucleolus, including for example, spliceosomal proteins, small nuclear rnp (snrnp) proteins and translation factors (see fig. . ). these results reinforce the results of several previous studies, implicating the nucleolus in a variety of functions in addition to ribosome biogenesis, including the biogenesis or transport of a range of rnas and rnps, and roles in mrna maturation, cell cycle control and stress responses (rubbi and milner ; andersen et al. ; olson and dundr ; pendle et al. ; pontes et al. ) . it could be argued that many of the proteins identified were contaminants -a large and poorly defined structure such as the nucleolus is likely to be impossible to fully separate from other nuclear and cellular components. nevertheless, the proteomic analysis has been confirmed by systematic protein expression studies using gfp-tagged proteins. for example, pendle et al ( ) showed that the vast majority ( %) of the proteins identified by proteomic analysis were also found to be nucleolar-located by the structural criterion of expressing gfp fusions of the identified proteins. many of the unexpected human proteins have also subsequently been confirmed as being in the nucleolus by structural methods such as gfp tagging and immunofluorescence . it is also notable that a large proportion of the nucleolar proteins identified in these proteomic analyses are currently completely uncharacterised, further supporting the view that we have as yet a very incomplete picture of the biochemistry that is taking place in the nucleolus. the following websites host current databases of identified nucleolar proteins: http://lamondlab.com/nopdb/; http://bioinf.scri.sari.ac.uk/cgi-bin/atnopdb/home; http://www.expasy.org/ch d/. post-translational modifications (ptms) are very common in nucleolar proteins, and several hundred specific modifications, many of which are likely to be functionally important in nucleoli, are known. for example, arginine dimethylation has defined roles in nucleoplasmic shuttling, and unknown roles on the abundant nucleolar proteins fibrillarin, nucleolin and gar (lapeyre et al. ; najbauer et al. ; xu et al. ) . histone modifications are implicated in the control of rdna transcription (earley et al. ) and both ubiquitination and sumoylation pathways are active in the nucleolus (mo et al. ; song and wu ; panse et al. ) . modifications of histones include phosphorylations, methylations, acetylations and deaminations, and the complex patterns of histone modifications have been proposed to constitute a 'histone code' (nightingale et al. ) . a detailed analysis of nucleolar histone modifications could in principal explore the potential histone code as it applies to the rdna. it is also likely that ptms of other non-histone proteins have been underestimated, and will be revealed by detailed ms analyses. the use of tandem ms/ms processing to sequence covalently modified peptides was first reported by olsen and mann ( ) , and the method is amenable to highthroughput techniques by using a quadrupole as the first ms stage. as with identification of unmodified peptides, it is possible to study either a pre-selected group of proteins of interest, or to sample the entire proteome. more focussed approaches have been used in several organisms for different modifications, and may allow a greater level of detail. nevertheless, full determination of all modification sites is still technically difficult. for example, analysis of phosphorylation sites of the nucleolar rent complex in yeast failed to detect all known sites, even when several different techniques were focussed on just these proteins (chen et al. ) . when extracted arabidopsis nucleoli were studied with q-tof ms/ms, a number of modifications were detected, including a ribosomal protein found in an acetylated form (p. mckeown and p. shaw, unpublished work). in the same study, eef a ( g ) was found acetylated at two sites, and the mammalian equivalent has also been detected in an acetylated form (kim et al. ). the total number of acetylations found was fewer than in mouse, partly because the majority of acetylated proteins in mouse are mitochondrial or cytoplasmic, and many acetylated nuclear proteins are probably not enriched in the nucleolus (kim et al. ). kim et al. made use of affinity purification with an anti-acetyl lysine antibody, but were still unable to detect many known acetylations, especially those associated with rare transcription factors. combining fractionation with affinity tagging may increase the number of nucleolar covalent modifications detected, but restricts the number of modifications that can be searched. for example, our survey detected seven sites of methionine acetylation that would not have been detected had an acetyl lysine-specific antibody been used. care is needed, however, in the interpretation of this type of result. some small modifications have very similar masses: methylation and formylation are close enough in mr for methylated peptides to also be detected in a screen for formylations, although with lower confidence ratings, which may allow them to be rejected. dimethylation and citrullination are isobaric and hence cannot be distinguished by this method. additionally, it may not be possible to distinguish between the modifications of different residues within peptides, a particular problem with highly modified proteins such as histones. modifications have to be actively sought in search engines such as mascot, precluding the detection of novel modifications, although known modifications can be found on novel substrates or at positions not previously identified. in studies of large modifications such as ubiquitination or sumoylation, which themselves fragment, it has been found to be necessary to isolate the modified proteins before carrying out ms analysis. plasma membrane-bound proteins modified with glycosylphosphatidylinositol (gpi)-anchor proteins were identified by a 'shave and conquer' technique: isolated membranes were treated with phospholipase d to liberate any gpi-anchored protein, and this protein fraction was analysed (elortza et al. ) . covalent modifications have also been determined by silac (stable isotope labelling by amino acids in cell culture), with one population of hela cells fed with labelled tyrosine and the other fed with labelled arginine and lysine. again, this allowed identification of unknown gamma-phosphorylation sites, and could be applicable to nucleoli (amanchy et al. ). a typical proteomic analysis of a protein mixture produces a list of identified components but without any quantification of the relative amounts of the different proteins. the detection and successful identification of a particular peptide will depend on a several other factors apart from the relative abundance of the original protein in the mixture: first, on effective digestion by the proteolytic enzyme used to produce the peptide; second, on how easily the peptide is volatilised and ionised; third, on the way the peptide fragments in the collision cell; fourth, on the presence of modifications, which may not have been predicted. finally, it will also depend on a presence of the peptide sequence in the database or databases to be searched subsequently. the last factor is a particular problem for species without completely sequenced genomes, since mascot and similar search engines search for a sequence match that is based on accurate mass data and therefore cannot rely on homology. for many proteomic applications, for example, cataloguing a particular tissue/ organellar proteome or getting a list of binding partners in a protein complex, simple identification of proteins present in the sample is all that is required. on the other hand, quantitative analysis is essential for many other proteomic applications, where the research involves comparison between different tissues and/or developmental stages of an organism, specific conditions or treatments, mutants or transgenic organisms with over-expressed or silenced genes. therefore, methods have been developed for both relative and absolute quantification of protein amounts in proteomic analysis. most biological applications require relative analysis, to compare two or more datasets, and in this review we will consider current proteomics techniques for relative quantification. one possibility for quantification is -de comparison. although significant progress has been achieved recently with the development of fluorescent stains and the dige technique (unlu et al. ; lilley and dupree ) , -de still allows only a subset of the proteome to be analysed, as certain groups of proteins, such as membrane proteins, co-migrating, low abundance proteins and those with extreme values of molecular mass and pi, cannot be clearly separated and/or visualised. in our experience this means that the majority of plant nuclear and nucleolar proteins either do not run cleanly in d gels or do not enter the gel at all. the low dynamic range of dige is also a major limitation of this technique. protein identification from excised gel spots also has limitations because the spots are often contaminated with other proteins. alternative approaches, non-gel-based quantitative proteomic methods, are based on comparison of peptide abundance, determining the isotopic composition of one or more elements in a compound after stable isotope labelling of a sample, either in vivo or in vitro. we will describe here a selection of isotopic labelling techniques that have been used on plant material. the labels can either be incorporated in vivo as substituted metabolites such as amino acids, or after purification and digestion by reaction with the resulting peptides. silac uses the in vivo incorporation of isotopically substituted amino acids to shift the molecular masses of the resulting peptides. two or more cell cultures are grown in parallel using different substituted amino acid mixtures, and then the cultures are subjected to different conditions or treatments, such as drug treatments or stress. after treatment, the cultures are pooled and used for biochemical purification. at the ms stage the relative amounts of each peptide can be quantified, since the origin of the peptide can be determined by its isotopic composition. in general, isotopically substituted lysine and arginine are used together, since all tryptic peptides should then be labelled. it is usually necessary to grow the cell culture in the substituted amino acids for a few days to get a good level of incorporation. this approach was used by andersen et al ( ) to determine the effects of different drug treatments on human nucleoli. other typical applications are analysing dynamics events by adding isotopically substituted amino acids to a cell culture at a given time, and analysing samples during a time course (ong et al. ; blagoev et al. ) . a large scale proteomic study tested the feasibility and technical challenges associated with silac to uncover quantitative changes during apoptosis in the nuclear proteome, resulting in the identification and quantification of , putative nuclear proteins (hwang et al. ) . another recent example of the application of this technique is selective isotope labelling of proteins from arabidopsis cell cultures by growing cells in the presence of a single stable isotopically labelled amino acid (gruhler et al. ) . a potential problem in applying this method to plants is that plant cells are better at inter-converting amino acids than animal cells, and so there is a risk that other amino acids may eventually be labelled. in isotope coded affinity tagging (icat), free cysteines in a protein are reacted with a special affinity tag. labeled proteins are enzymatically digested and labelled peptides are separated from the bulk mixture, first using affinity chromatography and then, in a second round, using ion-exchange chromatography prior to ms. the tag has three functional elements: a biotin tag, used during affinity capture (avidin chromatography); the isotopically encoded linker chain (with either eight hydrogens or eight deuteriums); and the reactive group, which will bind to and modify cysteine residues of the protein (gygi et al. ; li et al. ) . the tag is marketed by applied biosystems (foster city, ca). however, icat has limitations, because it only labels cysteine-containing peptides; - % of every genome codes for proteins with no cysteine, and many proteins contain only a single cysteine, providing only a single peptide that can be potentially quantified. localisation of organelle proteins by isotope tagging (lopit), a variation of the icat technique, was developed for discovering novel proteins in endomembrane organelles (dunkley et al. b; lilley and dupree ) , and uses analytical centrifugation in combination with differential isotope labelling. the method involves partial separation of organelles by density gradient centrifugation, thus producing over-lapping fractions, followed by the analysis of protein distributions in the gradient by icat labelling (dunkley et al. a (dunkley et al. , b and ms. multivariate data analysis techniques are then used to group proteins. a good correlation was observed between identification lists of proteins clusters in lopit and previous experimental evidence of protein locations within sub-cellular structures, and it has been shown that the lopit technique can be used to discriminate golgi, endoplasmic reticulum (er), plasma membrane, and mitochondrial/plastid proteins (lilley and dupree ) . in the latter paper the more versatile itraq method (see below) was used instead of icat. isobaric tag for relative and absolute quantification (itraq) involves chemical tagging of the n-terminus of peptides generated from protein digests; the reagent used reacts with primary amines (zieske ) . fragmentation of the tag attached to the peptides generates a low mass unique reporter ion. there are four tags available, all with an identical mass of da. the advantages of this approach are that all tryptic peptides are labelled, resulting in higher quality data; up to four labels can be used for multiple experiments; improved ms/ms fragmentation results in better confidence identification; and post translational modifications can be detected. the subsequent data analysis requires specialised software, and apart from the proquant software supplied by applied biosciences, non-commercial i-tracker software is also available for quantitative proteomics using itraq (shadforth et al. ) . disadvantages of itraq include the increased ms time required because of the increased number of peptides, and the fact that samples must be prepared according to strict guidelines. what is the best ms quantification technique to use? recently a comparative study was carried out on three proteomic quantitative methods, dige, icat, and itraq, using either -de or lc-maldi tof/tof (wu et al. ) . all three approaches yielded quantitative results with reasonable accuracy when the same protein mixture was used. in dige, accurate quantification was sometimes compromised due to the full or partial co-migration of proteins. the itraq method was more susceptible to errors in precursor ion isolation, especially with increasing sample complexity. the global-tagging itraq technique was more sensitive than the cysteine-specific icat method, which in turn was as sensitive as, if not more sensitive than, the dige technique. the tandem affinity purification (tap) method (rigaut et al. ) was developed to improve the purification of protein complexes, particularly for subsequent proteomic analysis. the most common previous purification methods for protein complexes used antibody pull-down, and this caused a high background of peptides arising from the antibodies used. the original tap tag consists of tandem protein a domains, linked via a tobacco etch-virus (tev) protease site to a calmodulinbinding protein (cbp) domain. in practice, the protein of interest is expressed as a tap-tagged fusion protein, and a cell extract is made after expression of the fusion. the tagged protein, along with complexed proteins, is absorbed on to igg beads via the protein a domains, washed, and then released by tev protease treatment. a second round of purification involves binding to calmodulin beads in the presence of ca + , followed by washing and release by a buffer containing egta to remove the ca + , and dissociate the calmodulin complex. subsequently, the tap tag has been made more general by removing a nuclear localisation signal, and has been adapted for use in plant cells by removing a cryptic splice site, polyadenylation sites and at-or gc-rich regions, and by the inclusion of castor bean catalase intron for improved expression in plants ( rohila et al. rohila et al. , . two gateway-compatible binary vectors, ntapi and ctapi, were constructed and initially tested using agrobacterium-mediated transformation for transient expression in nicotiana benthamiana leaves. recently, in a project studying protein kinase signaling networks in cereal leaves, transgenic stable transformants expressing tap-tagged rice protein kinases were produced and used for subsequent analysis of interacting proteins in rice plants (rohila et al. ) . in total, all rice kinases were purified, and of these were isolated as complexes with one or more interacting proteins (rohila et al. ) . however, the ntapi and ctapi tags also have some disadvantages. first, many plant proteins have cbds and therefore will be co-purified with the tagged protein (reddy et al. ) . we regularly observe co-purification of elongation factor (ef) -alpha proteins, from a small ef-tu/ef- a subfamily of identical genes at g , at g , at g , at g (an example present in table . ), which have a cbd, in protein mixtures pulled down by a number of different ntapi-and ctapi-tagged proteins. two rice ef-homologous proteins were reported to be among the recurring cellular proteins in the tap procedure by rohila et al. ( ) . second, the proteolytic treatment by tev protease involves incubation at °c, which may lead to proteolysis by endogenous proteases, because at this stage general protease inhibitors cannot be added to the reaction buffer because they would inhibit the tev protease. the exception is e- , but this inhibits only cysteine proteases. these problems have been addressed recently by development of an alternative tandem affinity purification tag for the isolation of plant protein complexes, called pc-tapa (rubio et al. ). this construct is also a gateway-compatible vector, allowing convenient recombination of orfs from pre-existing gateway entry clones. the pc-tapa tag consists of two protein a binding domains, a c hrv protease site, six histidine repeats ( -his) and nine myc epitope repeats (myc tag). an advantage of c hrv protease is that this enzyme is active at a wide range of temperatures from as low as °c (although this requires an increase of either the amount of the enzyme in the reaction mixture, or in incubation time, compared to the relatively fast cleavage at °c). the second purification step can use either the -his tag or the myc tag, but -his tag ni affinity chromatography has been reported to be more efficient than myc-tag epitope immunoprecipitation (rubio et al. ) . we have used ntapi-fusion for purification and proteomic analysis of proteins interacting with nuclear transport factor p h (at g ) using transient expression in arabidopsis cell culture, as described previously (koroleva et al. ) . this produced relatively high levels of expression of this particular protein. another significant change made to the original protocol was using tca protein precipitation and tryptic digestion of the pellet, instead of separating proteins on a gel and cutting out bands with subsequent digestion. this modification of the procedure avoided losses of yield at the stage of gel separation and extraction of peptides from the gel, which can be % or more. it also minimised the time required for ms analysis, as proteins (table . ) were identified simultaneously during a single h lc run followed by nano esi q-tof analysis. several fractions were set aside from the affinity purification steps for subsequent western blot analysis, which showed the molecular mass for the fusion protein band to be much higher than expected for this small kda protein (fig . ). this suggested that the protein was isolated in its homo-or hetero-dimeric form. another nuclear transport factor, p h (at g ), was present among the copurified proteins (table . ), and so it is likely that these proteins form a stable hetero-dimer or higher order complex. half of the identified proteins (table . ) were components of the large and small subunits of the ribosome, which would be expected for a nuclear transport factor. we expressed a gfp fusion of p h orf in arabidopsis suspension culture and observed a striking pattern of localisation on the surface of nuclear envelope, which probably corresponds to nuclear pores with which p h is associated (fig . ) . from our experience with several other tap-tagged proteins, the most essential factors for the successful isolation of protein complexes are the abundance and stability of the complex, a conclusion also drawn by rohila et al ( ) . fig. . purification steps of p h -ntapi and interacting proteins for proteomic analysis. western blot with rabbit anti-cbp primary antibody and anti-rabbit secondary antibody. lanes: protein relative molecular mass (mr) markers, initial cell extract, x extract after igg bead adsorption, wash from igg beads, tobacco etch-virus (tev) wash, tev eluate, wash through from calmodulin column, - fractions eluted from calmodulin column numerous small foci at the nuclear periphery are labelled, consistent with localisation to the nuclear pores and with the function of p in nuclear export. a a single optical section showing the foci labelled at the nuclear periphery. b d projection from the entire focal section stack through the nucleus shown in a proteomic analysis is an exciting new technique that can provide large data sets in short periods of time. however, when data from different large-scale projects are compared, certain groups of proteins appear to be ubiquitous despite pre-proteomic sample fractionation techniques. so an important challenge is how to set up criteria to distinguish between true positive identifications and false positives. a consortium has been established to develop general criteria for the publication and exchange of ms data and database search results (kaiser ) , and guidelines have been developed to assist researchers in the publication of protein identification from ms data (carr et al. ) . a method to estimate the rate of false positive identifications using a reverse database (peng et al. ) has provided a technique for reporting the stringency of the search parameters (cargile et al. ). this approach was applied to the analysis of nucleolar proteins in the most recent publication of the nucleolar proteome . the criterion used in this study was set up as at least two matching peptides per protein, a mass accuracy within ppm, a mascot score for individual peptides of more than , and a delta score of more than . the threshold of statistical significance in mascot searches is a generally accepted criterion for protein identifications. however, researchers working with unsequenced or incompletely sequenced genomes have to perform searches against expressed sequence tag (est) databases and sometimes have to consider single peptide matches. although some aberrant proteins can be distinguished by the use of replicates, this is insufficient to deal with 'persistent contaminants' -high abundance proteins, or those with a tendency to aggregate with nucleoli during extraction (cytoplasmic metabolic enzymes seem especially prone to this). in fact, as far as proteomic analysis is concerned, these proteins are genuine components of the starting preparation and it is only from the cell biological point of view that they are contaminants. proteomic analysis is unlikely to resolve this problem, although comparison with other studies and databases of common contaminants can help to identify suspect proteins. the best solution is to use a completely different technique to confirm the sub-cellular location of proteins identified. the most direct technique is visualisation of cellular location by fluorescent tagging or immunofluorescence, either by sampling a small sample of proteins whose localisation is unconfirmed or by the use of high-throughput approaches such as the gateway system to analyse a large proportion of the proteins identified (pendle et al. ) . this also allows a distinction to be drawn between proteins that are largely nucleolar, and those that are also found in other cellular compartments. the great advantage of high-throughput techniques is that single sets of experiments produce large volumes of data, but this can be a mixed blessing, especially in the absence of generally agreed strategies for ensuring data quality. hence, the utility of proteomic data to the scientific community depends on curation of data into easily navigable databases, and the use of regularly updated formats that allow an assessment of the biological relevance of any given protein or proteins within a proteome. following the identification of the proteins from arabidopsis nucleoli, the results were classified by probable protein function and placed online at the arabidopsis nucleolar protein database (http://bioinf.scri.sari.ac.uk/cgi-bin/ atnopdb/home; brown et al. ) . potential human and yeast orthologues of the arabidopsis nucleolar proteins were identified by reciprocal blast search, and comparisons with the human nucleolar proteome were performed. a database of the human nucleolar proteome is also available, and is sufficiently well characterised for use as a test system for novel ms techniques (vollmer et al. ) . over proteins are listed, dynamically characterised, with orthologues annotated from yeast, drosophila and caenorhabditis elegans (http://www.lamondlab.com/ nopdb/; leung et al. ) . both databases are cross-referenced to the pubmed systems and the lamond nopdb database also includes raw data, i.e. sequenced peptides from tandem ms/ms. as coute et al. ( ) note, the data on the basis of which such identifications were made may need to be provided to ensure valid comparisons between different experiments or systems (such as the full list of peptides made available following recent analysis of the human acetylproteome; kim et al. ) in a manner analogous to the 'miame' standards (http://www.mged. org/workgroups/miame/miame_ . .html), which allow direct comparisons between microarray data. proteomic analysis of isolated organelles cannot determine whether a protein is specific to the organelle in question, or whether it is also located in other parts of the cell. neither can it determine the distribution of that protein within the organelle. all of this information provides important guides to function. for these reasons, and also to assess the rate of false positives, about half of the proteins identified in our original arabidopsis nucleolar proteome (those for which we could obtain orfs at the time) were transiently expressed as gfp-fusions (pendle et al. ) . these data are also provided in the atnopdb database, including specificity for subnucleolar compartments and presence in other parts of the nucleus and cytoplasm. attempts to understand the complexities of protein localisation could be aided by the use of databases that compile and compare localisation data from different sources. unfortunately, neither the cross-species organelle db (http://organelledb. lsi.umich.edu; wiwatwattana and kumar ) nor the arabidopsis-specific gfp/ ms-based "sub-cellular location database for arabidopsis proteins (suba)" at www.suba.bcs.uwa.edu.au (heazlewood and millar ) or gfp-based database at http://aztec.standford.edu/gfp/ ) specify sub-nuclear regions. even after reciprocal blast search, arabidopsis nucleolar proteins are of unknown function due to the lack of characterised orthologues. the presence of protein domains of known function within such proteins can help both in suggesting potential functions and in the broader study of the nucleolus in evolution. for example, an analysis of protein domains of the human nucleolar proteome concluded that the core functions of the nucleolus (i.e. those connected to ribosome biosynthesis) were of archaeal origin, but that the many eukaryote-specific domains suggested that the nucleolus had undergone a massive subsequent enlargement, driving the evolution of, for example, rna helicases (staub et al. ). accordingly, the human proteome database is searchable by protein domain, as well as by gene ontology. in the examples cited above, genome sequences can be used to identify the proteins detected by ms. this is not possible in organisms with as yet unsequenced genomes, such as wheat, and alternative strategies will be needed to present ms data in a useable manner. full use will have to be made of est databases, and it will be necessary to ensure that such libraries are as amenable to automated searching by mascot or related software as are genome databases. blast search against the genomes of sequenced relatives such as rice will also be important. although considered the 'model grass', rice is only distantly related to wheat (diverging - million years ago; crane et al. ) , and the proposed sequencing of the brachypodium genome may be important for searching for homologues of wheat proteins within a more closely related species. the presumed goal of proteomics is to catalogue all the different proteins that can be expressed in a given organism along with all the ptms of each protein, to quantify the expression and modification levels and to determine the way these levels change in different cell types and cell stages. cell biologists would add to this the determination of the sub-cellular and sub-organellar location and the dynamics of all the cell's proteins. we are a long way from achieving these ambitious goals, if indeed they are achievable or if they are still considered worth achieving when they become technically possible. one major problem is the sheer number of possibilities; most mammalian genes are subject to alternative splicing, and alternative splicing is likely to be much more widespread in plants than currently expected. each resulting polypeptide is then potentially subject to hundreds of ptms. in addition to the enormous number of possible molecular species arising from each gene, the dynamic range in concentrations of proteins is also enormous, ranging from as low as one or two molecules to many billions per cell. this is currently one of the most significant limitations in proteomics technology; low abundance species are simply swamped by the overpowering concentrations of common proteins. nevertheless, huge improvements have been made recently both in the experimental methods and in the instrumental flexibility and sensitivity for proteomics. each new generation of ms machines has greater sensitivity, speed and throughput. the result is that current technologies are proving highly productive for samples of medium complexity, such as nucleoli, which contain a few hundred different proteins. in many cases, these 'partial proteomes' seem more accessible than total cellular proteomes and, at least in the short term, the information that this type of analysis can provide seems more useful. many practical problems of displaying, interpreting and curating the information remain, as well as issues of reproducibility and control of artefacts, but these technical problems are likely to be alleviated as the technology matures. as with other types of large-scale bioinformatics data, accessibility and interoperability of different types of data is still a problem. in our experience, the power of proteomics analysis has to be seen in the context of the complementary cell biology techniques for localisation of the proteins identified. proteomics will clearly be most powerful when closely integrated with other techniques in multi-disciplinary approaches to biology. mass spectrometry-based proteomics phosphoproteome analysis of hela cells using stable isotope labeling with amino acids in cell culture (silac) directed proteomic analysis of the human nucleolus nucleolar proteome dynamics analysis of the arabidopsis nuclear proteome and its response to cold stress temporal analysis of phosphotyrosinedependent signaling networks by quantitative proteomics exploiting the complementary nature of lc/maldi/ms/ms and lc/esi/ms/ms for increased proteome coverage arabidopsis nucleolar protein database (atnopdb) potential for false positive identifications from large databases through tandem mass spectrometry the need for guidelines in publication of peptide and protein identification data: working group on publication guidelines for peptide and protein identification data mass spectrometrybased methods for phosphorylation site mapping of hyperphosphorylated proteins applied to net , a regulator of exit from mitosis in yeast a polymerase switch in the synthesis of ribosomal-rna in saccharomyces cerevisiae deciphering the human nucleolar proteome the origin and early diversification of angiosperms random gfp::cdna fusions enable visualization of subcellular structures in cells of arabidopsis at a high frequency the use of isotope-coded affinity tags (icat) to study organelle proteomes in arabidopsis thaliana localization of organelle proteins by isotope tagging (lopit) erasure of histone acetylation by arabidopsis hda mediates large-scale gene silencing in nucleolar dominance proteomic analysis of glycosylphosphatidylinositol-anchored membrane proteins high-throughput viral expression of cdna-green fluorescent protein fusions reveals novel subcellular addresses and identifies unique proteins that interact with plasmodesmata paraspeckles: a novel nuclear domain mapping a nucleolar targeting sequence of an rna binding nucleolar protein stable isotope labeling of arabidopsis thaliana cells and quantitative proteomics by mass spectrometry quantitative analysis of complex protein mixtures using isotope-coded affinity tags nucleolar localization of potato leafroll virus capsid proteins integrated plant proteomics -putting the green genomes to work emerging concepts of nucleolar assembly the nucleolus -a gateway to viral infection? systematic characterization of nuclear proteome during apoptosis: a quantitative proteomic study by differential extraction and stable isotope labeling proteomics. public-private group maps out initiatives rice proteomics: recent developments and analysis of nuclear proteins substrate and functional diversity of lysine acetylation revealed by a proteomics survey involvement of the nucleolus in plant virus systemic infection high-throughput protein localization in arabidopsis using agrobacterium-mediated transient expression of gfp-orf fusions protein and cdna sequence of a glycine-rich, dimethylarginine-containing region located near the corboxylterminal end of nucleolin (c and kda) bioinformatic analysis of the nucleolus protein profiling with cleavable isotope-coded affinity tag (cicat) reagents: the yeast salinity stress response systematic analysis of arabidopsis organelles and a protein localization database for facilitating fluorescent tagging of full-length arabidopsis proteins methods of quantitative proteomics and their application to plant organelle characterization nucleolar adaptation in human cancer genome sequence of the ultrasmall unicellular red alga cyanidioschyzon merolae d cellular stress and nucleolar function the nucleolus -an organelle formed by the act of building a ribosome nucleolar delocalization of human topoisomerase i in response to topotecan correlates with sumoylation of the protein peptides with sequences similar to glycine, arginine-rich motifs in proteins interacting with rna are efficiently recognized by methyltransferase(s) modifying arginine in numerous proteins histone modifications: signalling receptors and potential elements of a heritable genetic code improved peptide identification in proteomics by two consecutive stages of mass spectrometric fragmentation the moving parts of the nucleolus homologs of small nucleolar rnas in archaea stable isotope labeling by amino acids in cell culture, silac, as a simple and accurate approach to expression proteomics formation and nuclear export of preribosomes are functionally linker to the small-ubiquitin-related modifier pathway proteomic analysis of the arabidopsis nucleolus suggests novel nucleolar functions evaluation of multidimensional chromatography coupled with tandem mass spectrometry (lc/lc-ms/ms) for large-scale protein analysis: the yeast proteome high mobility of proteins in the mammalian cell nucleus the arabidopsis chromatin-modifying nuclear sirna pathway involves a nucleolar rna processing center structure and function of the nucleolus in the spotlight genes encoding calmodulin-binding proteins in the arabidopsis genome delineation and modelling of a nucleolar retention signal in the coronavirus nucleocapsid protein a generic protein purification method for protein complex characterization and proteome exploration improved tandem affinity purification tag and methods for isolation of protein heterocomplexes from plants protein-protein interactions of tandem affinity purification-tagged protein kinases in rice disruption of the nucleolus mediates stabilization of p in response to dna damage and other stresses an alternative tandem affinity purification strategy applied to arabidopsis protein complex isolation functional proteomic analysis of human nucleolus ) i-tracker: for quantitative proteomics using itraq systematic subcellular localization of novel proteins identified by large-scale cdna sequencing identification of a novel nucleolar localization signal and degradation signal in survivin-deltaex : a potential link between nucleolus and degradation insights into the evolution of the nucleolus by an analysis of its protein domain repertoire large-scale identification of mammalian proteins localized to nuclear subcompartments high-throughput fluorescent tagging of full-length arabidopsis gene products in planta difference gel electrophoresis: a single gel method for detecting changes in protein extracts multidimensional hplums of the nucleolar proteome using hplc-chip/ms organelle db: a cross-species database of protein localization and function comparative study of three proteomic quantitative methods, dige, cicat, and itraq, using d gel-or lc-maldi tof/tof in vivo analysis of nucleolar proteins modified by the yeast arginine methyltransferase hmt /rmt p genetic inactivation of the transcription factor tif-i! leads to nucleolar disruption, cell cycle arrest, and p -mediated apoptosis a perspective on the use of itraq reagent technology for protein complex and profiling studies we are grateful to andrew bottrill and mike naldrett (john innes centre) for their help with proteomic analysis. financial support for this research was provided by the uk biotechnology and biological sciences research council (bbsrc). key: cord- -pj pal authors: ouyang, bo; dong, ying; chou, james j. title: structural and functional properties of viral membrane proteins date: - - journal: advances in membrane proteins doi: . / - - - - _ sha: doc_id: cord_uid: pj pal viruses have developed a large variety of transmembrane proteins to carry out their infectious cycles. some of these proteins are simply anchored to membrane via transmembrane helices. others, however, adopt more interesting structures to perform tasks such as mediating membrane fusion and forming ion-permeating channels. due to the dynamic or plastic nature shown by many of the viral membrane proteins, structural and mechanistic understanding of these proteins has lagged behind their counterparts in prokaryotes and eukaryotes. this chapter provides an overview of the use of nmr spectroscopy to unveil the transmembrane and membrane-proximal regions of viral membrane proteins, as well as their interactions with potential therapeutics. we will focus our discussion on two classes of transmembrane (tm) proteins encoded by viruses, viroporins (or viral channels) and membrane fusion proteins, as these proteins have been sought after as antiviral targets and they often exhibit peculiar structural features not seen in other membrane proteins. as implied by their name, the viroporin proteins can form channel-like structures in lipid bilayer that permeate ions or solutes. now over a dozen viroporins from various sources have been characterized, covering a wide range of ion substrates including h + , k + , na + , ca + , and cl − , as well as larger substrates such as rna. the exact function of ion channel activity in many viruses is not yet known, though their roles have been implicated in entry, virus assembly and virus release. the function of membrane fusion proteins is much better defined, that is, to enable viral entry by mediating virus-host membrane fusion. the viral fusion proteins have large extramembrane domains for receptor recognition and for grabbing onto the host cell membrane, but the function of their tm and membrane-proximal regions have been elusive. functional mutagenesis studies have suggested that, at least in the cases of hiv- and influenza a viruses, the tm domains (tmds) of fusion proteins are not merely membrane anchors, but play important roles in membrane fusion and viral infectivity. apart from the channels and fusion proteins, some viruses have developed enzymatic domains anchored to the membrane, e.g., the polymerases of the hepatitis c virus and the neurominidase of the influenza viruses. in these cases, the tmds are believed to only serve as membrane anchors and thus will not be discussed in this chapter. viroporins serve highly diverse functions in viral infection cycles (illustrated in fig. . ). early studies of viroporins have focused on only a few systems including b of poliovirus, k of togavirus, and m from influenza a virus (am ). the b is a well-known viroporin that can modify membrane permeability and allow the passage of ions and small molecules during later stages of viral infection [ ] [ ] [ ] . the k is a small, hydrophobic acylated protein that has been shown to form cation-selective channels when inserted into lipid bilayers [ ] . the k is proposed to be involved in virus budding, but the exact role of k in the viral life cycle is poorly characterized [ ] [ ] [ ] . the am proton channel is the most studied viroporins because ( ) it is the drug target of the first anti-flu drug (amantadine or rimantadine) and ( ) it is one of the smallest proton channels with proton selectivity motif not found in any of the proton channels from other organisms [ ] [ ] [ ] . since the definition of viroporin has been put forward, more and more viroporins have been classified (table . ). the influenza b virus encodes the bm proton channel that is a functional homolog of the am but cannot be blocked by the adamantane family of drugs [ ] . the human immunodeficiency virus type (hiv- ) vpu and the hepatitis c virus (hcv) p have been shown to form oligomeric channels to conduct cations [ ] [ ] [ ] [ ] [ ] . the b protein from the enterovirus (ev ), however, displayed altered channel activity, as it conducts anions (e.g., cl − ) instead of cations [ ] . paramecium bursaria chlorella virus (pbcv- ) has a viral k + channel named kcv that contains the canonical potassium selectivity filter [ ] and was proposed to adopt similar structure as kcsa [ ] . in addition to the ion channels, the vp from rhinovirus can form pores that ( ) . virion attaches directly host cell receptors on the membrane ( ) . and subsequently enters the cell by receptor-mediated endocytosis ( ) . acidification of the endosomal vesicles triggers conformational changes in the virion, resulting in fusion between the viral and the endosomal membranes, allowing the release of the nucleocapsid into the cytoplasm. the viral rna is then released into the cytoplasm and presented to the endoplasmic reticulum (er) ( ) . at the er, viral rna is translated into protein that is processed by viral and host proteases ( ) . after the viral replication complex is synthesized, rna synthesis begins by the transcription of an antisense viral rna followed by the amplification of viral rna ( ) . the newly synthesized rna is subsequently packaged by capsid protein, forming a nucleocapsid ( ) . virus assembly occurs on the surface of the er when the nucleocapsid buds into the er lumen, then transported through the golgi, where acidification induces virus maturation ( ) . mature viral particles are released in the neutral ph of the extracellular millieu (a) viral entry, which acidifies the virion interior immediately after endocytosis and facilitates rna release (b) dissipation of the proton gradient in the golgi and the trans-golgi network. the viroporins influenza a virus (iav) matrix protein (m ) and hepatitis c virus (hcv) p equilibrate the proton concentration with the cytosol to reduce the acidification of vesicular acidic compartments (c) viroporins play an essential role in assembly, budding and release of the viral particles (d) alteration of cellular ca + homeostasis. calcium uptake by the mitochondria can lead to dissipation of the inner-mitochondrial-membrane potential (ΔΨm), permeabilization of the outer mitochondrial membrane and, finally, the release of cytochrome c. in the cytosol, cytochrome c promotes the subsequent formation of apoptosome that is involved in apoptosis allow single-stranded viral rna to cross the membrane [ , ] , further expanding the list of substrates that viroporins transport. probably the best-defined roles of viroporins are that of the m proton channels of influenza a and b viruses, both of which involve equilibrating ph between compartments. one important role of the m is to equilibrate ph across the viral membrane during viral entry, which acidifies the virion interior immediately after endocytosis and facilitates rna release [ , ] (fig. . a ). another role of m is to equilibrate ph across the trans-golgi membrane of the infected cells during viral maturation, which preserves the pre-fusion form of the hemagglutinin fusion protein through de-acidification of the golgi lumen [ ] (fig. . b) . the roles of other cation-selective viroporins are less clear. one of the general roles proposed for viroporin is to depolarize either the er or plasma membrane to facilitate membrane curvature formation during virus budding [ , ] (fig. . c ). in the case of hcv, for example, the p -mediated channel activity has been reported to facilitate virus release [ , ] . another general role for viroporin is simply to cause cation leakage, which would induce cellular stress and programmed cell death [ ] (fig. . d ). in addition to ion permeation, several viroporins are known to participate in viral assembly through interaction with other proteins. for example, the influenza am and bm proteins both have significant cytoplasmic domains that are believed to recruit matrix proteins m during virus assembly [ ] [ ] [ ] . the hcv p protein has also been reported to interact with other non-structural proteins such as ns , and this interaction appears to be crucial for the production of infectious hcv particles [ , ] . pore formation indicated but a defined oligomeric state either does not exist or is not characterized enveloped viruses infect host cells by the fusion of viral and cellular membrane. depending on the type of the virus, fusion can occur either with the host plasma membrane or with the endosomal membrane. the membrane fusion must occur without consuming energy; it is catalyzed by major conformational changes of specialized envelope proteins generally known as viral fusion proteins. despite their diversity, all known viral fusion proteins convert from a prefusion state (dimers or trimers, depending on the class), through a key intermediate state (extended conformation), to a postfusion state (a trimer of hairpins) that brings the fusion peptide, attached to the target membrane, and the tm domain, attached to the viral membrane, into close proximity thereby facilitating the fusion of viral and target membranes ( fig. . a) [ ] . three distinct classes of viral fusion proteins (exemplified in fig. . b) have been defined based on structural criteria [ ] . the class i includes fusion proteins from some of the best characterized human pathogens, such as influenza and hiv- . these proteins are trimers of a single chain precursor that requires a proteolytic cleavage to generate two fragments. for example, the influenza a hemagglutinin (ha) contains the n-terminal fragment ha and c-terminal fragment ha [ ] , and the hiv- envelope protein is cleaved into the gp and gp fragments [ ] . the fusogenic fragment gp bears a hydrophobic fusion peptide capped by the outer receptor binding fragment gp [ ] . receptor binding accompanies cap protein release, allowing major conformational changes in the gp that ultimately leads to fusion [ ] . in the case of influenza, the cap ha is released by low ph of the endosome after endocytosis [ ] . the class ii fusion proteins are found on flaviviruses, alphaviruses, and bunyaviruses (e in flaviviruses, e in alphaviruses, gn and gc in bunyaviruses) [ , ] . although these fusion proteins are architecturally different from the class i fusion proteins, i.e., class ii fusion proteins are mostly made of β-sheets as opposed to the helical structures of the class i fusion proteins, they operate on the same physical principle. their fusion peptides are formed as loops at the tips of β-strands that serve as an anchor to insert into the host membrane after endocytosis. unlike class i proteins, which remain trimeric, their conformational change involves a change in oligomeric state from pre-fusion dimers to post-fusion trimers [ , ] . the reduced ph of the endosome induces a cascade of subsequent refolding and trimerization events that achieve membrane fusion [ ] . members of the class iii include g protein of vsv, gb of herpes simplex virus and epstein-barr virus (ebv), and gp of the insect-cell baculovirus [ ] . the class iii fusion proteins combine some of the features of class i and ii fusion proteins [ ] . each protomer is composed of five domains that contain a central trimeric coiled-coil, a hallmark of class i proteins in their postfusion states, three of the domains are predominantly made of β-sheets and their fusion peptide more closely resemble those of class ii proteins in that their fusion loops are found at the tips of extended β-strands [ ] . due to the essential roles of viral fusion proteins during infection, they have long been pursued as targets for antiviral intervention. for example, the approved small molecule drug arbidol is used as a broad-spectrum inhibitor of influenza a and b virus, as well as hepatitis c virus [ , ] . unlike many other broad-spectrum antivirals, arbidol has an established mechanism of action against the has in influenza a and b viruses that involves the inhibition of virus-mediated membrane fusion and thus viral entry [ ] . in addition, enfuvirtide (t , fuzeon) is an approved peptide drug that blocks the hiv- entry [ ] ; it is derived from the c-terminal heptad repeat region of the hiv- gp envelope glycoprotein, designated the c-peptide, that disrupts membrane fusion by competing with an intra-molecular interaction that is important for the refolding of gp during membrane fusion [ ] . apart from being therapeutic targets, another important medical use of the fusion proteins is vaccine development. viral fusion proteins are usually presented on the virion surface with high copy numbers, and thus are popular antigens for eliciting broadspectrum neutralizing antibodies. for example, the native conformations of the the n terminus of ha and the c-terminus of the ha ectodomain are labeled. blue arrow: position of fusion peptides inserted near threefold axis in prefusion form. only ha is shown on the bottom. the n-terminus (blue arrow; note that the fusion peptide is not part of the structure shown) and c-terminus of the blue-colored subunit are indicated class ii: dengue virus type e protein ( oan and ok ). the radial ("top") view shows just the "stem-less" ectodomain ( oan). colors: domain i: magenta; domain ii: cyan; domain iii: yellow; stem: cyan; colors for domains i, ii and iii are the same in the postfusion representation. a dashed cyan arrow on the postfusion trimer shows where the stem emerges from domain iii. red asterisks: fusion loops class iii: vsv g ectodomain ( j j and cmz). the three chains are in gold, sea green, and magenta. dashed lines show the location of a disordered, c-terminal segment that connects the folded protein to the transmembrane anchor. only the magenta-subunit c-terminal segment is shown on the bottom. the curved gold arrow indicates that in the transition from the conformation on the top to the conformation on the bottom, the domain bearing the fusion loops flips over by about ° to engage the host-cell membrane. red asterisks: fusion loops hiv- envelope glycoprotein have being pursued intensely as immunogens for b-cell based vaccine development [ , ] . for most single-pass tm proteins, the role of their tmds beyond membrane anchoring is unresolved, partly because of the difficulty in structural studies: they are notorious for resisting crystallization and due to their small sizes, unfeasible for visualization by cryo-electron microscopy (em). there have been only a few examples of crystal structures of the small tmds, including the tmd of the am [ , ] and the more recent crystal structure of the tm helix dimer of the glycophorin a protein [ ] . as cryo-em is rapidly approaching atomic resolution, it is becoming increasingly used to examine the structures of the much larger viral fusion proteins containing the tmd. most notably, a recent cryo-em study of the hiv- envelope glycoprotein (env) including the tmd achieved a high resolution structure of the prefusion state of the hiv- env [ ] . this study, however, showed that the tmd and the membrane-proximal regions of the env are disordered, possibly due to incompatibility between the membrane-associated regions of the env and the detergent used to solubilize the env. in this chapter we mainly focus on the use of nmr to study the tm and membrane-proximal regions of viral membrane proteins, including methods for structure determination and for investigating drug binding. the first challenge of nmr studies is preparing sufficient quantities of isotope labeled proteins and this is not trivial for small and hydrophobic proteins. a robust approach is to force highlevel expression of the hydrophobic peptides, which are toxic to cells, into inclusion bodies. in our experience, the most effective implementation of this approach is to express the peptide as a c-terminal fusion to the trple protein (which drives inclusion body formation) in the pmm-lr vector [ ] [ ] [ ] (fig. . a). the trple has an n-terminal ×his tag to permit nickel affinity purification. a methionine between the trple and the peptide is added for cleavage by cyanogen bromide so that the peptide can be released from the trple. purification of the peptide involves ( ) purifying the trple-peptide fusion protein from the inclusion bodies by nickel affinity, ( ) cleavage by cyanogen bromide of the fusion protein in formic acid, and ( ) separation of the peptide from trple or fusion protein by reverse-phase hplc. while the above protocol is quite general for hydrophobic peptides, it requires that the peptides have no methionine in the middle of the peptide sequence. if the native methionines cannot be mutated, an alternative method is to use soluble fusion domains (e.g., the maltose binding protein) with specific protease cleavage site for releasing the peptide. the purified hydrophobic peptides can be reconstituted in any membrane mimetic media, including detergent micelles, bicelles, and lipid nanodiscs. while nanodiscs are the closest mimic of native membrane, we find that the lipid/detergent bicelle system is a good compromise between having the capacity to provide a near lipid bilayer environment and generating good nmr spectra. previous studies on the bicelle system with , -dimyristoyl-sn-glycero- -phosphocholine (dmpc) as lipid and ( , -dihexanoyl-sn-glycero- -phosphocholine (dhpc) as detergent have shown that when the molar ratio of lipid to detergent (q) > . , the assembly reaches the ideal bicelle condition in which the lipid and detergents are well segregated [ , ] . at q = . , for example, the estimated diameter of the lipid bilayer region of the bicelle disc is ~ Å according to the equation describing bicelle assembly [ ] [ ] [ ] , and this size is sufficiently large to accommodate small tmds. remarkably, even bicelles of such size could generate nmr spectra of sufficiently high quality to enable full-scale structure determination, as was demonstrated for the structure fig. . a schematic of hydrophobic peptide sample preparation using hcv p as an example . trple plasmid with targeted peptide is transformed into and bl (de ) competent cells for expression . the cells were grown in m media . cells were collected by centrifugation and lysed by sonication . after lysis, the inclusion bodies and membranes were collected by centrifugation and dissolved in m guanadine buffer. insoluble aggregates were removed by centrifugation. the supernatant was then subjected to ni-nta purification. . the protein was eluted using mm imidazole . cyanogen bromide cleavage followed by dialysis in water . the protein was further separated by reverse-phase hplc or ni-nta . the lyophilized peptide was then dissolved in detergent and refolded by dialysis against the nmr buffer . nmr experiments were performed at spectrometers determination of the trimeric tmds of the fas death receptor [ ] and hiv- env [ ] . here we provide an example of bicelle reconstitution from a previous study on the tmd of hiv- env [ ] . the hydrophobic peptides (lyophilized) are first completely dissolved in strong organic solvent (e.g., hexafluoro-isopropanal) with calculated amount of dmpc. the solution is slowly dried to a thin film under nitrogen stream. the dried film is then dissolved in m urea containing calculated amount of dhpc. the denaturant is removed by dialysis, during which the dmpc: dhpc ratio is monitored by d nmr, and the loss of dhpc during dialysis is compensated by further addition of dhpc. owing to the small size of most of the tmds of viral membrane proteins, structure determination by nmr is normally straightforward. probably the biggest challenge is measuring inter-protomer restraints in cases of oligomer complexes. in particular, structure determination of symmetric oligomers faces the challenge of measuring noe-based h- h distances between structurally equivalent protomers having the same nmr peaks, which are needed as inter-protomer restraints. a proven way to solve this problem is using a mixed sample in which half of the protomers are ( n, h)-labeled and the other half c-labelled, for measuring, exclusively, noes between the n-attached h of one protomer and c-attached h of the neighboring protomer [ , ] (example from the tmd of the fas receptor shown in fig. . b). once the trimer solution is established, conventional n-and c-edited noe experiments can then be used to collect self-consistent backbone-sidechain and sidechain-sidechain noe restraints. in addition to structure determination, an important aspect of small tmds is interaction with membrane. the bicelle system allows accurate determination of the position of tmd in the bilayer region of the bicelles using a solvent paramagnetic relaxation enhancement (pre) analysis [ ] . in this method, titration of the water-soluble and membrane-inaccessible paramagnetic agent, gd-dota (gadolinium (iii) , , , -tetraazacyclododecane- , , , -tetraacetate), is used to generate the solvent pre. this approach is based on the notion that if the bicelle is sufficiently wide, the lateral solvent pre becomes negligible, thus allowing the use of measurable solvent pre to probe residue-specific depth immersion of the protein along the bicelle normal. at each titration point, a d h- n correlation spectrum is recorded to measure residue-specific pre. for each of the residues, the pre titration curve is fitted to exponential decay to derive the residue-specific pre amplitude (pre amp ). the pre amp vs. (residue position) along the trimer symmetry axis (parallel to the bilayer normal) is then analyzed using the sigmoidal fitting method [ ] to determine the tmd region that resides at the center of the bilayer as well as the thickness of the bilayer around the protein. for studying small molecule drug binding to viral membrane proteins, chemical shift perturbation (csp) induced by drug titration is a convenient probe for identifying the approximate binding site. but, cautions need to be taken due to complications associated with membrane-mimetic media. first, many small molecule inhibitors are hydrophobic and preferentially partition into detergent micelles or lipid/detergent bicelles, and thus the csp can be induced simply by alteration of the micelle/bicelle environment. second, in addition to csp caused by close ligand contact, csp could arise from changes in conformation and/or dynamics. hence, the more direct noe data is preferred wherever feasible. the simplest and the most sensitive approach for collecting protein-drug noe is using proteins that are n-labeled and completely deuterated so that noe between the protein backbone amide protons and drug protons could be measured unambiguously (fig. . c ). once the binding site has been unambiguously spotted, further contacts between the drug and protein sidechains can be measured using conventional c-edited methyl or aromatic noesy. details of these experiments have been described in studies that identified the drug binding sites in the influenza am and hcv p channels [ , , ] . structural studies of viroporins have been challenging because these small membrane proteins are typically dynamic and very hydrophobic. in the past years, multiple biophysical techniques including solution nmr, solid-state nmr, x-ray crystallography, and em have been used to gradually fill the structural gaps. taking the influenza am for example, there is now structural information in crystal and solution states, in lipid bilayer, under different phs, and bound to different small molecules. these complementary structural data allow elucidation of the functional mechanism from different view angles. the am of influenza a and the bm of influenza b are -and -residues single-pass membrane proteins, respectively, that form homotetramers in membrane [ ] [ ] [ ] [ ] [ ] . the sequences consist of three domains: an extracellular n-terminal domain, a transmembrane domain (tmd) and an intracellular c-terminal domain. these domains arrange into different structures. the only homologous sequence between the two proteins is the hxxxw sequence motif of the tmds that is essential for channel activity. the tm region of the am contains residues - . the unstructured extracellular segment of am is relatively conserved and has been sought after as a vaccine epitope [ ] [ ] [ ] [ ] [ ] . bm has a similar sized tmd (residues - ), but a much larger c-terminal cytoplasmic domain [ , ] . the bm cytoplasmic region also assembles into oligomers which are important for recruiting the matrix proteins to the cell surface during the viral assembly [ , , , ] . the first high-resolution structures of the am channel were reported concurrently by solution nmr spectroscopy and x-ray crystallography in [ , ] . subsequent x-ray and solution nmr studies also determined structures under different conditions [ ] and with different drug resistance mutations [ , ] . moreover, structural characterization by solid-state nmr studies generated the channel structures in the lipid bilayer environment [ ] [ ] [ ] . the above am structures solved under different conditions show significantly different conformations ( fig. . a) , suggesting that the tetrameric assembly of the am is quite dynamic and is sensitive to the reconstitution environment. the structural plasticity observed in am is likely present in many other viral membrane proteins, which could explain why structural study of this type of proteins has been so difficult. despite the differences, the experimental structures converged to a common mode of channel assembly: a left-handed four-helix bundle forms the channel pore, and that tetramerization of the four tm helices is further stabilized by intermolecular contacts between c-terminal amphipathic helices flanking the tmd [ , ] . this mode of assembly places the "h" and "w" of the hxxxw sequence motif inside the channel (fig. . a) . four imidazole rings of his , which are ph-gating features and are essential in transporting protons, are packed closely inside the pore (fig. . b) . moreover, packing of the trp indoles creates a channel gate, which occludes the c-terminal end of the pore. the structure of the bm channel was solved using solution nmr methods [ ] . although the overall assembly of the bm tmd is similar to that of the am tmd, i.e., both show left-handed helical packing, the two differ substantially in details ( fig. . c) . unlike the am , the bm channel shows strong coiled-coil characteristics with regular heptad repeats [ ] . in the bm structure, the two heptad repeats show that positions a and d are occupied mostly by hydrophilic residues such as serines and his ; positions g and e are occupied by hydrophobic leucines and phenylalanines, respectively, to allow for peripheral hydrophobic interactions between the leucines and phenylalanines (fig. . d ). this arrangement for coiled-coil assembly in membrane allows the tm segment to form a stable tetramer by itself and is the opposite to that of water-soluble coiled-coil tetramer, in which positions a and d are typically hydrophobic residues and positions g and e are polar residues [ ] . the histidine (his ) and tryptophan (trp ) of the hxxxw motif are also pore-lining in the inverse coiled-coil assembly in membrane. the cytoplasmic domain of the bm also oligomerizes into a left-handed coiled-coil tetramer, but it is water-soluble and shows strong bipolar distribution in the surface charges. this charged domain supports the interaction with the m matrix protein [ ] . the structural arrangement of the histidine and tryptophan inside the pore of the am and bm suggests that the two residues play an essential role in the channel selectivity and channel gating (fig. . e ). functional mutagenesis [ ] and nmr measurements [ ] showed that proton transport across the membrane through the am channel involves cycles of histidine protonation and deprotonation, and that the histidines serve as proton shuttling devices. not all histidines can be protonated at the same time in the narrow channel, as protonation of one histidine would increase the energy barrier for the protonation of another histidine. indeed, multiple pka values ( . , . and one below . ) have been detected in the am [ , ] . therefore, our current understanding of the proton conduction mechanism is that the minimally required unit for ph-dependent proton conduction in the am and bm is the his-trp complex. it was proposed in ref. [ ] that in the non-conductive state two pairs of histidines in the tetramer each share one proton, which explains structures of the influenza proton channels and mechanism of proton conduction. the many structures of the influenza m channel. the pdb codes rlf and kwx represent the solution nmr structures of the wildtype and the v a mutant determined using residues - . the c j and lbw are crystal structures of the tm domain (residues - ) determined at ph . and ph . , respectively. the structures l j and kqt were obtained using solid-state nmr using protein constructs that encompass residues - and residues - , respectively the high pk a ~ . . lowering ph results in the protonation of the third histidine from the n-terminal side, which, in turn, disrupts the two histidine dimers and leads to the proton conductive state. it was also proposed that protonation of histidines leads to cation-π interactions between the histidine and tryptophan [ , ] . the remaining question to be addressed is how does the third protonation affect the tryptophan conformation, allowing proton to be relayed to the c-terminal side of the tryptophan gate [ ] . the viroporin p encoded by the hcv genome is a -residue protein that oligomerizes in membrane to form cation-selective channels [ , ] , with higher selectivity for ca + than k + /na + [ , ] . the channel activity of p is important for the assembly and release of infectious viruses, although the molecular mechanism of this function remains unknown [ , ] . as in the case of am , structural characterization of p was confronted with challenges of coping with the hydrophobic and dynamic nature of the protein. earlier nmr studies of p under conditions that support the monomeric state of the protein showed that p has three tm helical segments: two in the n-terminal half of the sequence and one near the c-terminus [ , fig. . b isolated view of the pore-lining histidine (magenta) and tryptophan (cyan) sidechains in m and bm channels. images are from the high resolution crystal structure ( lbw) and nmr structure ( rlf) of m and nmr structure of bm ( kix) ] . although the monomeric state should not conduct ions, it could be involved in interacting with the ns protein during virus assembly [ , ] . the oligomeric form of p was first examined using single-particle em, which showed that p from hcv genotype a (jfh- strain) assembles into hexamers in , -diheptanoylsnglycero- -phosphocholine (dh pc) micelles and the complex adopts a flowerlike shape that does not resemble any of the known ion channel structures in the database [ ] . later, a more detailed structure of the p hexamer was determined by solution nmr using p from genotype a (euh strain) reconstituted in dodecylphosphocholine (dpc) micelles [ ] . consistent with the flower-shaped em images, the nmr structure shows a funnel-like architecture with six minimalist chains, each containing three helical segments: h , h , and h . the h and h form the narrow and wide regions of the funnel-shaped cavity, respectively, and the h helices wrap the channel peripheral by interacting with h and h (fig. . a) . the assembly strategy adopted by p differs significantly from those known channels from bacteria and eukaryotes. ion channels typically have two essential features: ( ) pore elements that support selective ion dehydration; ( ) a gate or constriction that prevents non-specific per- meation, but can open in response to regulating factor such as ph, voltage, ligand, or the ion of selection itself [ , ] . the channel interior of p has a number of strongly conserved residues that are likely candidates to serve the above functions. one suspect is asn , which forms a ring of carboxamide near the narrow end of the channel (fig. . b) . residue is asparagine in all strains except being substituted with histidine in genotype viruses. formation of a ring of carboxylates or carboxamides has been a recurring theme in prokaryotic and eukaryotic channels that have selectivity for divalent cations. for examples, the cora mg + channel has a pentameric ring of asparagines [ ] , and the calcium release-activated calcium (crac) channel orai has a hexameric ring of aspartic acids [ ] . these channels all have strong selectivity for divalent cations, although they can also conduct monovalent cations such as na + and k + . in addition to the asn ring, residues near the hinge between h and h (ser , asn , trp ) are in an arrangement that may also bind in addition to what appears to be the cation selectivity ring near the narrow, n-terminal exit of the channel, the wider, c-terminal entrance of the channel is decorated with a conserved ring of arginines or lysines (fig. . b) . placement of a positively charged ring at the entrance was anticipated because it may repel cations. but an earlier study reported that a designed tm barrel with internal argininehistidine dyads forms efficient cation selective channels [ ] because the immobile arginines can recruit mobile anions, which in turn facilitate cations to diffuse through the pore. it is interesting to note that a highly basic region containing arg , lys and lys in the pore was also found in the crac orai structure [ ] . one possible role of these basic residues in cation selective channels is binding and obstructing anions while allowing cations to diffuse into the pore. this mechanism would be consistent with the observation that replacing arg with negatively charged aspartic acid largely abrogated conductance [ ] . there are still many unanswered questions. does the nmr structure, solved in the absence of ca + and inhibitors, represent the open or closed state of the channel (if the two states exist)? how strong is the ca + selectivity of p ? or was p developed as a general, unspecific cation channel for the purpose of dissipating membrane potential? for example, another recently study reported p activity in dissipating proton gradients within cell membrane compartments [ ] . it is unclear what ion flux mediated by p plays a dominant role in the hcv life cycle. from a structural perspective, the funnel-like architecture is formed with multiple helical segments connected by hinges and short loops and we believe this flexibility can afford the dynamic opening and closing of the tip of the channel. probably the most intriguing aspect of small molecule interaction with viroporin is the finding that the adamantane derivatives amantadine and rimantadine have inhibitory effect on multiple viroporins including the influenza am and hcv p . amantadine (symadine) or rimantadine (flumadine) was the first licensed drug for treating influenza infections [ ] . in fact, the compound also played critical roles in the early days of functional characterization of the am channel [ ] [ ] [ ] . the bm channel is a functional and structural homolog of am but is not sensitive to the adamantane family of drugs [ ] . remarkably, the hcv p channel structure is completely different but showed detectable, though not strong, sensitivity to rimantadine [ , ] . the mechanism of how amantadine/rimantadine inhibit the am channel has been elusive for quite some time. previous confusion came mainly from the multiple binding sites that have been observed experimentally. in a crystallographic study of the am tmd (residues in the presence of amantadine, the drug density was found inside the channel near residue ser , but at structural resolution of . Å, it was difficult to confirm the position of amantadine binding [ ] . at the same time, however, a solution nmr study of a longer version of am (residues showed that rimantadine binds to an external, lipid-facing pocket around residue asp between adjacent tm helices [ ] . the two different binding sites obviously suggest very different mechanisms of inhibition. one is the drug directly blocking the channel passage, and the other is the drug binding to the external site allosterically favors the closed state of the channel. subsequent solid-state nmr measurements of the am in lipid bilayer showed that both sites exist, with higher affinity for the internal site [ ] . the strongest evidence that the internal site is the primary site of drug action came from functional studies using an am -bm chimera protein [ , ] . in this study, the authors constructed a chimera of m variants from influenza a and b viruses that contains only internal site showed that the chimera channel is still amantadine sensitive, indicating that the internal site is the primary site of drug inhibition. later, the complex structure of the tmd of the chimera with rimantadine was determined by solution nmr, providing a detailed view of the drug binding inside the channel pore [ ] . the drug binding site consists of eight methyl groups of the m tetramer (two from each subunit: val c γ h and ala c β h ) that form a deep internal hydrophobic pocket surrounding the adamantane cage of the drug (fig. . a ). the structure also shows that the nitrogen of the rimantadine amino group may form a hydrogen bond with the backbone carbonyl oxygen of ala of one of the four subunits. the terminal methyl group of the rimantadine is in the middle of the pore, facing the open space in the channel around the gly position. the structure also shows that rimantadine binding is slightly tilted: its vertical axis is on average ~ ° from the c symmetry axis of the channel. the tilt angle is consistent with the amantadine tilt in the m channel observed with solid-state nmr spectroscopy [ ] . in addition to blocking the am channel, the amantadine and its derivatives have also been shown to pose some inhibitory effects on the p channel conductance [ , fig. . the amantadine and rimantadine binding sites in the m and p channels (a) the precise nmr structure of the am -bm chimeric channel with rimantadine determined in dhpc micelles and at ph . . left panel: detailed illustration of the methyl groups (in green) that interact with the adamantane cage. right panel: surface representation for showing the hydrophobic pocket that fits the drug snuggly. one of the four subunits is omitted for drug visibility (b) the amantadine or rimantadine binding site of the p channel determined by nmr in dpc micelles and at ph . . left panel: the drug binds to six equivalent hydrophobic pockets of the p channel. right panel: a close view of amantadine docked into the binding pocket as determined using nmr noe restraints (c) comparison between adamantane binding sites of influenza m and hcv p channels top: the internal pocket that wraps around rimantadine in the am -bm chimeric channel. the am -bm chimeric channel is a well-behaved model system wth its n-terminal half is from influenza a m protein (sensitive to amantadine or rimantadine inhibition) and its c-terminal half is from influenza bm protein (insensitive to amantadine or rimantadine). on the right panel, one subunit of the tetrameric complex is removed to unveil the channel interior bottom: amantadine binds to the peripheral pockets between the h and h helices; and a representative pocket among six equivalent pockets in the p hexamer ]. the physical binding sites of amantadine and rimantadine have been identified in p of genotype a using intermolecular noe experiments [ ] . the noe data revealed that amantadine or rimantadine binds to six equivalent hydrophobic pockets (due to the six-fold symmetry of the p channel) between the pore-forming and peripheral helices (fig. . b) . in each site, leu , leu , and leu from h and val , val , and phe from h appear to form a hydrophobic pocket that wraps around the adamantane cage of the drug. the amino group of amantadine or rimantadine is facing the largely hydrophilic channel lumen. an important property of the drug binding site is that it consists of elements from different helical segments and from different monomers. as rationalized above, permeation of cations through the p channel may depend on opening the narrow end of the funnel, which in turn depends on the reorientation of the helical segments. the binding of adamantane derivatives to the pocket may inhibit channel activity allosterically by causing the channel to close. indeed, an nmr relaxation dispersion study showed that residues at the h -h hinge (phe ) and the narrow end of the cavity (val , leu ) experienced substantial chemical exchanges (k ex~ ± s − and ~ % excited state). this data is consistent with movements of the h helices that cause the tip of the funnel to open and close. more importantly, addition of rimantadine slowed down motion at the tip of the channel, as relaxation dispersion curve for val , which has significant chemical exchange in the apo state, is completely flat in the drug-bound state [ ] . the dispersion curve of phe is also significantly flatter, and individual curve fit yielded k ex value of ± s − . clearly, rimantadine binding makes the channel less dynamic. therefore, the rimantadine may thus act as a "molecular wedge" that prevents the dynamic "breathing" of the channel required for ion conduction. comparing the amantadine or rimantadine binding mode of hcv p to that of influenza am shows two fundamentally different mechanisms of drug inhibition. in the case of am , one drug binds to one channel. drug binding inhibits proton transport by directly blocking the channel passage; it also prevents channel from opening. in the case of p , amantadine and rimantadine are clearly too small to block the channel. they instead bind to six equivalent sites outside of the channel cavity, which can afford up to six drugs per channel. if rimantadine binding to this site is relevant to inhibition, as crudely suggested by previous functional mutagenesis, drug binding to these sites inhibits cation conduction with an allosteric mechanism, possibly by stabilizing the closed state of the channel. although the mechanisms of drug inhibition may be completely different, the structural bases that govern drug-binding affinity for am and p are actually similar, and they involve hydrophobicity and size of the pocket, and position of the drug amino group (fig. . c ). in the case of the am tetramer, the drug adamantane cage fits snuggly in a hydrophobic pocket formed by eight methyl groups from val and ala (two from each subunit), while the drug amino group forms polar contact with the backbone oxygen of ala and points to the polar region of the channel cavity. for the p channel, the adamantane cage is in contact with ten methyl groups and an aromatic group from the protein. these hydrophobic groups form a deep hydrophobic pocket that also matches closely the size of the adamantane cage. the amino group of amantadine or rimantadine points to the channel lumen; it is in position to form polar contacts with the electronegative groups such as backbone carbonyl of residues - . hence, having a greasy pocket for the adamantane cage and a nearby electronegative group to interact with the amino group may be a general requirement for amantadine or rimantadine binding. previous functional mutagenesis studies have suggested that the tmds of viral fusion proteins are not only limited to the function of membrane anchoring, but also involved in other functions such as membrane fusion or assembly of the fusion protein on the viral membrane. for example, sequence analysis of the ha from mutant viruses and site-specific mutagenesis of the fusion peptide identified a group of mutations in the n-terminal half of the influenza ha tmd severely affected membrane fusion (the hemifusion to pore formation) [ ] [ ] [ ] . in the case of hiv- , multiple lines of evidences suggest that the tmd of gp is not merely a membrane anchor, but plays critical roles in membrane fusion and viral infectivity [ ] [ ] [ ] [ ] [ ] . the amino acid sequence of gp tmd is also highly interesting. there is a gly rich motif in the tmd, which suggests some sort of oligomerization. even more peculiar is the presence of a conserved arginine in the middle of the predicted tm region. unlike the structural biology of viroporins, there is essentially no structural information of tmds of viral fusion proteins except for that of the tmd of hiv- env. hence, we focus on the discussion on the trimeric membrane anchor of the tmd of the hiv- envelope spike. hiv- envelope spike [env; trimeric (gp ) , cleaved to (gp /gp ) ] is a type i membrane protein that fuses viral and host cell membranes to initiate viral infection [ ] . the gp and gp are the receptor recognition and membrane fusion proteins, respectively. conformational changes in gp when triggered by binding to receptor (cd ) and co-receptor (e.g., ccr or cxcr ) lead to a cascade of refolding events in gp (similar to those illustrated in fig. . a) , and ultimately to membrane fusion [ , [ ] [ ] [ ] . the mature and functional env spikes, (gp /gp ) , are the sole antigens on the virion surface and thus important candidates for vaccine development [ , ] . the native prefusion conformation of hiv- env is recognized by most broadly neutralizing antibodies (bnabs) [ ] [ ] [ ] and it is generally believed to have the potential to induce such antibody responses. thus, the native conformation of env spikes on the surface of virions is extremely important to immunogen design in b-cell based vaccine development. a vast amount of structures of the ectodomain (ecd) of the gp /gp complex have been determined [ , , , [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] , but relatively little is known about the tm and membrane proximal regions of gp due to challenges of preserving the native-like folding of these regions in membrane mimetic media. it has been shown that truncations in the cytoplasmic tail (ct) of gp could alter the antigenic surface of the env ecd on the opposite side of the membrane [ ] , suggesting that the ecd, the tmd, and the membrane proximal regions are conformationally coupled, and thus the conformational stability of the tmd is an important consideration for immunogen design in b-cell based hiv- vaccine development. recently, the nmr structure of the gp tmd has been solved in bicelles (made of dmpc lipid and dhpc detergent; q = . ) [ ] using a gp fragment (residues - ) from a clade d hiv- isolate ug . , designated gp hiv d ( - ). the tmd forms a well-structured trimer, almost helical all the way from the n-to the c-terminal end. it shows two peculiar features not seen with other known oligomeric tm helices (tmhs) (fig. . a) . one unusual feature is that the tmd trimer appears to be stabilized by two separate packing modes (fig. . c) . the n-terminal half encompassing the gxxxg motif forms a coiled-coil trimer, whereas the c-terminal half is held together by a network of polar contacts, which we named the hydrophilic core. for the n-terminal coiledcoil, the helical wheel representation of the trimer clearly indicates packing of hydrophobic residues such as ile and val at the "a" and "d" positions of the heptad motif, forming a hydrophobic core. the gxxxg is a well-known motif that drives tmh dimerization [ , , ] . in the classic example of the glycophorin a tmd dimer structure, the two glycines of one tmh allow close packing with the gxxxg face of another tmh, resulting in a very strong tmh dimeric complex [ , ] . there has been no previous report, however, of the gxxxg involvement in tmh trimerization. in the coiled-coil region of the hiv- env tmd, only g is involved in the trimer assembly, i.e., its small sidechain allows a close vdw contact with v of the adjacent tmh. the other glycine, g , is on the periphery of the trimer facing outwards and its mutation to alanine or valine has essentially no effect on tmd trimerization, as well as env functions [ ] . therefore, the key difference in the structural role of the gxxxg motif between the glycophorin a tmd dimer and hiv- env tmd trimer is that both the glycines are required for forming intermonomer contacts in the dimer, while only one glycine is important for the trimeric assembly. tm segments of many viral fusion proteins contain a gxxxg motif or "smallxxxsmall" motifs ("small" refers to residues with a small side chain, such as, glycine, alanine, serine or cysteine) [ ] [ ] [ ] , suggesting that oligomerization of their tmds may be a common property. for example, recent biochemical evidence has shown that the tmds of hepatitis c virus envelope glycoproteins e and e form stable dimers or trimers that are also resistant to sds [ ] . no highresolution structure of any tmd oligomer from other viral fusion proteins has been reported, due to technical challenges for structural studies of such constructs in the context of lipid bilayer. the nmr structure of the hiv- env tmd may provide some clues for how other viral fusion proteins oligomerize in the membrane. another peculiar feature is the presence of three copies of arginine (r ) near the middle of the tmhs (fig. . b) , suggesting three unbalanced charges in the hydrophobic core of the membrane if the arg remains protonated. in the nmr structure, the tips of the long sidechains of these arginines are facing lipids and each of them is surrounded by three hydrophobic residues (l , l , and i ). it is also interesting to note that r occupies a "d" position in the coiled-coil, with its cβ facing inwards to the trimer interface. precise physical basis of how r is accommodated in the highly hydrophobic environment remains unclear, but the underlying mechanism must tolerate a lys residue, which is present in some viral isolates at this position. it is interesting to mention that the noe experiment for the r epsilon protons showed clear water noe even with a short noe mixing time of ms, suggesting that the arg is somehow hydrated in the membrane. a positively-charged arg or lys in the tm segment of viral fusion protein is present in some related enveloped viruses, including simian immunodeficiency virus, caprine arthritis and encephalitis virus, equine infectious anemia virus, visna virus, and foamy virus, as well as in hepatitis c virus [ ] [ ] [ ] [ ] [ ] [ ] , but absent in many others. it is therefore not a prerequisite for viral membrane fusion in general [ ] . functional mutagenesis indicated that the r a mutant of hiv- env showed some defect in cell-cell fusion, but it could be fully compensated by high env expression [ ] . moreover, this mutant has wildtype viral infectivity. in vitro infectivity and cell-cell fusion do not, however, mimic all the conditions under which the virus moves from one cell to another in an infected individual, nor are those assays particularly sensitive to physiologically relevant kinetic parameters. the structures of viroporins discussed above are all substantially different from any of the channel structures found in prokaryotes and eukaryotes, and are thus clean structural targets for developing antiviral compounds. the available structures also suggest that viroporins generally adopt minimalist architecture and can possess structural features compatible with selective ion transport. the viral channels, however, may not be as functionally robust or specifically regulated as some of their counterparts in prokayrotes and eukaryotes, because viruses often do not need intricate regulation of channel activities during their infection cycles. having minimalist structure also makes viroporins fragile and sensitive to the membrane environment. this is reflected by the conformational variations observed for the am channel in different reconstitution media. therefore, it remains important to examine these viroporins in more native environments, e.g., lipid bilayer and full-length proteins. as solid-state nmr continues to improve spectral resolution [ , ] , obtaining detailed structures of viroporins in liposomes should in principle be feasible in the near future. alternatively, the ideal bicelle system can be explored with solution nmr to revisit some of the viroporin structures determined previously in detergent micelles. future establishment of the nmr systems for viroporins under more native conditions would certainly provide versatile and effective platforms for investigating inhibitor binding. apart from the progress in structure determination, major challenges lie ahead in the functional aspects of viroporin research. the precise functional roles of many viroporins are still unclear. moreover, due to the lack of robust single-channel recording setups suitable for viroporins, the ion conductance properties of most viroporins have not been fully characterized. therefore, better definition of the functional roles and channel properties of viroporins will certainly draw greater enthusiasm for developing therapeutics that target this interesting family of membrane channels. the example from hiv- env suggests that tmds of viral fusion proteins can adopt very interesting structures, and the distinct structural features certainly allude to their roles in viral fusion protein assembly and incorporation into the envelope, as well as in the process of membrane fusion. the nmr structural study of the hiv- gp tmd in bicelles with q = . [ ] demonstrates that the structures of most of the viral fusion protein tmds can in principle be determined in essentially lipid bilayer environment using modern nmr techniques. moreover, the combined use of ideal bicelles (q ≥ . ) and solvent paramagnetic relaxation enhancement measurements can be used to determine the membrane partition of the tmds in the bilayer region of the bicelles [ ] . we believe the next major challenge lies in understanding the roles of the fusion protein tmds in the fusion mechanism. in the cases of the flu ha and hiv gp , for example, new experiments need to be designed to address questions such as, "does the tmd interact with the fusion domain in the hemifusion stage in which the two domains are presumably in close proximity?" and "does the tmd play a role in facilitating the conversion from the hemifusion to the pore formation?" understanding the structural properties and conformational stability of the tmd may also have far reaching implication to vaccine development and this has been recognized at least for hiv- . as mentioned above, truncations in the ct of the env could drastically alter the sensitivity of the env ecd to the known trimer-specific bnabs [ ] . the continuous structure from the n-to c-terminal ends of the env tmd, as observed by nmr, suggests that the env ecd can be structurally coupled via the tmd. indeed, it has also been shown that the env tmd can also modulate the antigenic structure of the ecd in a cell-cell fusion assay and a pseudovirusbased neutralization assay [ ] . the results show direct correlation that more disrupted tmd trimer led to less inhibition or neutralization by the trimer-specific bnabs that target only the ecd. the results suggest that the stability of the tmd trimer is an important consideration when designing immunogens for hiv vaccines. the hiv env tmd is, however, not directly linked to the env ecd. another important segment known as the membrane-proximal external region (mper) is the direct link that connects the tmd to the ecd, and thus it could in principle also affect the antigenic properties of the env ecd. the mper sequence is extremely conserved with five absolutely conserved tryptophans, suggesting that the mper probably also has interesting structural features. the conformation of the mper attached to the tmd trimer in a lipid bilayer environment is still unknown (fig. . d) . previous nmr studies of an isolated mper peptide in detergent micelle suggested that the mper is monomeric and folds into a kinked helix with many hydrophobic residues embedded in the micelles [ , ] . this of course would have a rather negative implication for the mper as a vaccine epitope because antibodies that bind to the mper must dig it out of the lipids, which could be accompanied with polyreactivities. the env tmd appears to be well structured and assembled strongly to trimers. hence, the structure of the mper when connected to the trimeric tmd and in the context of a lipid bilayer remains to be characterized. plasma membrane-porating domain in poliovirus b protein. a short peptide mimics viroporin activity cell permeabilization by poliovirus b viroporin triggers bystander permeabilization in neighbouring cells through a mechanism involving gap junctions poliovirus b insertion into lipid monolayers and pore formation in vesicles modulated by anionic phospholipids alphavirus k proteins form ion channels semliki forest virus k protein modifies membrane permeability after inducible expression in escherichia coli cells discovery of frameshifting in alphavirus k resolves a -year enigma the alphavirus k protein activates endogenous ionic conductances when expressed in xenopus oocytes influenza m proton channels structural and dynamic mechanisms for the function and inhibition of the m proton channel from influenza a virus m protein from influenza a: from multiple structures to biophysical and functional insights influenza b virus bm protein has ion channel activity that conducts protons across membranes identification of an ion channel activity of the vpu transmembrane domain and its involvement in the regulation of virus release from hiv- -infected cells correlation of the structural and functional domains in the membrane protein vpu from hiv- channel activity of a viral transmembrane peptide in micro-blms: vpu( - ) from hiv- the p protein of hepatitis c virus forms an ion channel that is blocked by the antiviral drug the hepatitis c virus p protein forms an ion channel that is inhibited by long-alkyl-chain iminosugar derivatives dids blocks a chloride-dependent current that is mediated by the b protein of enterovirus a potassium channel protein encoded by chlorella virus pbcv- the viral potassium channel kcv: structural and functional features rna transfer from poliovirus s particles across membranes is mediated by long umbilical connectors capsid protein vp of human rhinovirus induces membrane permeability by the formation of a size-selective multimeric pore nuclear transport of influenza virus ribonucleoproteins: the viral matrix protein (m ) promotes export and inhibits import unpacking the incoming influenza virus influence of amantadine resistance mutations on the ph regulatory function of the m protein of influenza a viruses viroporins: structure and biological functions chlorovirus-mediated membrane depolarization of chlorella alters secondary active transport of solutes hepatitis c virus p and ns proteins are essential for production of infectious virus hepatitis c virus p protein is crucial for assembly and release of infectious virions viroporins from rna viruses induce caspase-dependent apoptosis the influenza virus m protein cytoplasmic tail interacts with the m protein and influences virus assembly at the site of virus budding cytoplasmic domain of influenza b virus bm protein plays critical roles in production of infectious virus solution structure and functional analysis of the influenza b proton channel identification of specific regions in hepatitis c virus core, ns and ns a that genetically interact with p and co-ordinate infectious virus production subcellular localization and function of an epitope-tagged p viroporin in hepatitis c virus-producing cells mechanism of membrane fusion by viral envelope proteins structures and mechanisms of viral membrane fusion proteins: multiple variations on a common theme structure of influenza haemagglutinin at the ph of membrane fusion hiv- envelope glycoprotein biosynthesis, trafficking, and incorporation core structure of gp from the hiv envelope glycoprotein mechanism of membrane fusion by viral envelope proteins intermonomer interactions in hemagglutinin subunits ha and ha affecting hemagglutinin stability and influenza virus infectivity class ii virus membrane fusion proteins class ii fusion proteins structure of the dengue virus envelope protein after membrane fusion crystal structure of dengue virus type envelope protein in the postfusion conformation and its implications for membrane fusion characterization of a membraneassociated trimeric low-ph-induced form of the class ii viral fusion protein e from tick-borne encephalitis virus and its crystallization class iii viral membrane fusion proteins characterization of ebv gb indicates properties of both class i and class ii viral fusion proteins class iii viral membrane fusion proteins structural basis of influenza virus fusion inhibition by the antiviral drug arbidol arbidol as a broad-spectrum antiviral: an update improved pharmacological and structural properties of hiv fusion inhibitor ap over enfuvirtide: highlighting advantages of artificial peptide strategy enfuvirtide: the first therapy to inhibit the entry of hiv- into host cd lymphocytes b-cell-lineage immunogen design in vaccine development with hiv- as a case study b cell responses to hiv- infection and vaccination: pathways to preventing infection structural basis for the function and inhibition of an influenza virus proton channel structure and mechanism of proton transport through the transmembrane tetrameric m protein bundle of the influenza a virus crystal structure of the glycophorin a transmembrane dimer in lipidic cubic phase cryo-em structure of a native, fully glycosylated, cleaved hiv- envelope trimer unusual architecture of the p channel from hepatitis c virus the structural basis for intramembrane assembly of an activating immunoreceptor complex structure and mechanism of the m proton channel of influenza a virus optimal bicelle size q for solution nmr studies of the protein transmembrane partition structural evaluation of phospholipid bicelles for solution-state studies of membrane-associated biomolecules characterization of magnetically orientable bilayers in mixtures of dihexanoylphosphatidylcholine and dimyristoylphosphatidylcholine by solid-state nmr magnetically-oriented phospholipid micelles as a tool for the study of membrane-associated molecules structural basis and functional role of intramembrane trimerization of the fas/ cd death receptor structural basis for membrane anchoring of hiv- envelope spike the structure of phospholamban pentamer reveals a channel-like architecture in membranes structural investigation of rimantadine inhibition of the am -bm chimera channel of influenza viruses structural basis of interaction between the hepatitis c virus p channel and its blocker hexamethylene amiloride influenza virus m protein is an integral membrane protein expressed on the infected-cell surface structural characteristics of the m protein of influenza a viruses: evidence that it forms a tetrameric channel influenza virus m integral membrane protein is a homotetramer stabilized by formation of disulfide bonds influenza b virus bm protein is an oligomeric integral membrane protein expressed at the cell surface the m proton channels of influenza a and b viruses a universal influenza a vaccine based on the extracellular domain of the m protein influenza a vaccine based on the extracellular domain of m : weak protection mediated via antibody-dependent nk cell activity an influenza a vaccine based on tetrameric ectodomain of matrix protein enhanced influenza virus-like particle vaccines containing the extracellular domain of matrix protein and a toll-like receptor ligand prokaryote-expressed m e protein improves h n influenza vaccine efficacy and protection against lethal influenza a virus in mice solid-state nmr investigation of the conformation, proton conduction, and hydration of the influenza b virus m transmembrane proton channel distinct domains of the influenza a virus m protein cytoplasmic tail mediate binding to the m protein and facilitate infectious virus production influenza b virus bm protein is a crucial component for incorporation of viral ribonucleoprotein complex into virions during virus assembly mechanism of drug inhibition and drug resistance of influenza a m channel solution nmr structure of the v a drug resistant mutant of influenza a m channel structure of the amantadine binding site of influenza m proton channels in lipid bilayers insight into the mechanism of the influenza a proton channel from a structure in a lipid bilayer structure and mechanism of the influenza a m - dimer of dimers a switch between two-, three-, and four-stranded coiled coils in gcn leucine zipper mutants kinetic analysis of the m proton conduction of the influenza virus mechanisms of proton conduction and gating in influenza m proton channels from solid-state nmr histidines, heart of the hydrogen ion channel from influenza a virus: toward an understanding of conductance and proton selectivity comparison of the activities of bm protein and its h and w mutants of influenza b virus with activities of m protein and its h and w mutants of influenza a virus interactions between histidine and tryptophan residues in the bm proton channel from influenza b virus cation-selective ion channels formed by p of hepatitis c virus are blocked by hexamethylene amiloride nmr structure and ion channel activity of the p protein from hepatitis c virus secondary structure, dynamics, and architecture of the p membrane protein from hepatitis c virus by nmr spectroscopy the -dimensional structure of a hepatitis c virus p ion channel by electron microscopy principles of selective ion transport in channels and pumps structural basis for the coupling between activation and inactivation gates in k(+) channels crystal structure of the cora mg + transporter crystal structure of the calcium release-activated calcium channel orai synthetic multifunctional pores: deletion and inversion of anion/cation selectivity using pm and ph two different conformations in hepatitis c virus p protein account for proton transport and dye release antiviral activity of -adamantanamine (amantadine) the molecular basis of the specific antiinfluenza action of amantadine influenza virus m protein has ion channel activity ion channel activity of influenza a virus m protein: characterization of the amantadine block genotype-dependent sensitivity of hepatitis c virus to inhibitors of the p ion channel functional studies indicate amantadine binds to the pore of the influenza a virus m proton-selective ion channel an amantadine-sensitive chimeric bm ion channel of influenza b virus has implications for the mechanism of drug inhibition a conserved basic loop in hepatitis c virus p protein is required for amantadine-sensitive ion channel activity in mammalian cells but is dispensable for localization to mitochondria transverse relaxation dispersion of the p membrane channel from hepatitis c virus reveals conformational breathing fusion mutants of the influenza virus hemagglutinin glycoprotein a point mutation in the transmembrane domain of the hemagglutinin of influenza virus stabilizes a hemifusion intermediate that can transit to fusion studies of the membrane fusion activities of fusion peptide mutants of influenza virus hemagglutinin changes in the transmembrane region of the human immunodeficiency virus type gp envelope glycoprotein affect membrane fusion mutations in the membrane-spanning domain of the human immunodeficiency virus envelope glycoprotein that affect fusion activity role of the membrane-spanning domain of human immunodeficiency virus type envelope glycoprotein in cell-cell fusion and virus infection conserved arginine residue in the membrane-spanning domain of hiv- gp is required for efficient membrane fusion the transmembrane domain of hiv- gp inhibits t-cell activation by targeting multiple t-cell receptor complex components through its gxxxg motif viral membrane fusion atomic structure of the ectodomain from hiv- gp structure and immune recognition of trimeric pre-fusion hiv- env hiv entry and its inhibition antibody neutralization and escape by hiv- rapid evolution of the neutralizing antibody response to hiv type infection hiv- envelope. effect of the cytoplasmic domain on antigenic characteristics of hiv- envelope glycoprotein hiv- envelope trimer elicits more potent neutralizing antibody responses than monomeric gp a next-generation cleaved, soluble hiv- env trimer, bg sosip. gp , expresses multiple epitopes for broadly neutralizing but not non-neutralizing antibodies structure of an unliganded simian immunodeficiency virus gp core structures of hiv- gp envelope glycoproteins from laboratory-adapted and primary isolates structure of an hiv gp envelope glycoprotein in complex with the cd receptor and a neutralizing human antibody structure of a v -containing hiv- gp core atomic structure of a thermostable subdomain of hiv- gp three-dimensional solution structure of the kda ectodomain of siv gp crystal structure of a soluble cleaved hiv- envelope trimer cryo-em structure of a fully glycosylated soluble cleaved hiv- envelope trimer a transmembrane helix dimer: structure and implications spatial structure of the dimeric transmembrane domain of the growth factor receptor erbb presumably corresponding to the receptor active state interaction and conformational dynamics of membrane-spanning protein helices hepatitis c virus envelope glycoprotein e forms trimers at the surface of the virion the transmembrane domain in viral fusion: essential role for a conserved glycine residue in vesicular stomatitis virus g protein mutations within the putative membrane-spanning domain of the simian immunodeficiency virus transmembrane glycoprotein define the minimal requirements for fusion, incorporation, and infectivity structure and genetic variability of envelope glycoproteins of two antigenic variants of caprine arthritis-encephalitis lentivirus synthesis and processing of the transmembrane envelope protein of equine infectious anemia virus nucleotide sequence of the visna lentivirus: relationship to the aids virus an evolutionarily conserved positively charged amino acid in the putative membrane-spanning domain of the foamy virus envelope protein controls fusion activity contribution of the charged residues of hepatitis c virus glycoprotein e transmembrane domain to the functions of the e e heterodimer hiv- broadly neutralizing antibody extracts its epitope from a kinked gp ectodomain region on the viral membrane antibody mechanics on a membrane-bound hiv segment essential for gp -targeted viral neutralization the active oligomeric state of the minimalistic influenza virus m ion channel is a tetramer the oligomeric state of the active bm ion channel protein of influenza b virus evidence for the formation of a heptameric ion channel complex by the hepatitis c virus p protein in vitro oligomerization state and supramolecular structure of the hiv- vpu protein transmembrane segment in phospholipid bilayers the human immunodeficiency virus type vpu protein enhances membrane permeability severe acute respiratory syndrome-associated coronavirus a protein forms an ion channel and modulates virus release analysis of sars-cov e protein ion channel activity by tuning the protein and lipid charge viral weapons of membrane destruction: variable modes of membrane penetration by non-enveloped viruses viroporin-mediated membrane permeabilization. pore formation by nonstructural poliovirus b protein the hpv- e protein represses expression of stress pathway genes xbp- and cox- in genital keratinocytes hpv- e oncoprotein upregulates lipid raft components caveolin- and ganglioside gm at the plasma membrane of cervical cells the human papillomavirus type (hpv- ) e protein sensitizes human keratinocytes to apoptosis induced by osmotic stress viral encoded potassium ion channel is a structural protein in the chlorovirus paramecium bursaria chlorella virus- (pbcv- ) virion key: cord- -ad i wgj authors: nan title: th international congress on amino acids and proteins : vienna, austria, august – , date: journal: amino acids doi: . /s sha: doc_id: cord_uid: ad i wgj nan the raised concentration of protein bound homocysteine in homocystinuric (hcu) patients displaces protein bound cysteine and increases the free/bound cysteine ratio in plasma. this ratio is independent of albumin concentration. results from hcu patients were compared to controls. free cystine concentrations in hcu were poorly discriminated from the control range but the total cysteine results were almost invariably lower than control data. this appears to result from an increased free/bound cysteine ratio in hcu [mean (range) for control . ( . - . ) and for hcu . ( . - . ); p ϭ . ]. ex vivo protein binding experiments in albumin solution revealed the free/bound cysteine ratio to be linearly related to the amount of homocysteine bound (r ϭ . , p Ͻ . ). we conclude that measurement of total cysteine is essential for assessment of the true cysteine status in hcu. however, any cysteine deficit, or alteration to free/bound cysteine ratios, does not obviously effect glutathione synthesis as assessed by measurement of plasma total glutathione. ( nmol/g; . %); sprague-dawley rat (rattus norvegicus, rodentia) (n ϭ ; - nmol/g; . - . %); rabbit ( nmol/ g; . %); pig (sus scrofa f. domestica, artiodactyla) ( nmol/ g; . %); bovine (bos primigenius f. taurus, artiodactyla) ( nmol/g; . %); seal (phoca vitulina, carnivora) ( nmol/g; . %), and rob (halichoerus grypus, carnivora) ( nmol/g; . %). from the date it is concluded that d-aas are common in body fluids and certain tissues of vertebrates. in order to determine the quantity of cyst(e)ine and methionine, the oxidation of cyst(e)ine and methionine (Ϫ) refers to not detected or not determinable; asx ϭ asp ϩ asn; glx ϭ glu ϩ gln; his, arg, trp, cys not determined; a) feed fortified with dl-met; b) nmol/g lyophilized serum. mentation and subsequent purification. the separation of the enantiomers of cysteic acid, methionine sulphone, aspartic acid and glutamic acid is displayed on the chromatogram. into cysteic acid and methionine sulphone with performic acid is often applied before hydrolysis of protein. the authors examined the applicability of this process in case of quantification of cyst(e)ine and methionine enantiomers. the rp-hplc analytical method was developed for the determination of the amount of cysteic acid and methionine sulphone enantiomers. the rate of conversion during oxidation from cyst(e)ine into cystic acid and from methionine into methionine sulphone was determined. the racemisation of l-cyst(e)ine and l-methionine was negligible during oxidation with performic acid, therefore this process can be applied before hydrolysis during quantification of cyst(e)ine and methionine enantiomers. after the performic acid oxidation and the m hcl hydrolysis of the protein, opa/tatg (o-phthaldialdehyde/tetra-o-acetyl- -thio--d-glucopiranoside) precolumn derivatisation method was used, and the enantiomers of sulphur containing amino acids were separated by rp-hplc (lichrosphere rp- e, ϫ mm, µm column, merck-hitachi lachrom hplc). the resolution of the peak of cysteic acid and methionine sulphone enantiomers was better than , . the method was used to determine the amount of l-and d-cyst(e)ine and land d-methionine containing preparations prepared by fer- d/l rate of aspartic acid and the individual age of specimens. a method for age determination based on d-aspartic acid content and on the racemisation of l-aspartic acid of teeth was developed. d-glutamic acid, beside d-aspartic acid, was found to be eminently suitable for the estimation of individual age, as it showed a sufficiently high sensitivity. calibration curves based on these investigations were used for the age estimation of adults ( males and females) of unknown individual age from the avar period series of kereki-homokbánya (hungary). the age distribution of the sample was the following: individuals ( %) belonged to the adult age group, persons ( %) to the mature and ( %) to the senile one. the correlation between our results and those obtained using standard paleoanthropological methods was over . . quantitative determination of free and bound -nitrotyrosine in rat plasma and tissues using isotope dilution liquid chromatography-electrospray tandem mass spectrometry t. delatour, p. a. guy, j. richoz, j. vuichoud, and nestlé research centre, nestec ltd, vers-chez-les-blanc, lausanne, switzerland since -nitrotyrosine was reported to be readily formed in proteins by reactions with nitrite or nitrogen dioxide, it has been postulated to be a possible marker for investigating peroxynitrite-mediated nitration of proteins. thus, several methods were developed to assess nitration of tyrosine in proteins and determine -nitrotyrosine in physiological fluids. methods based on hplc or gc/ms techniques were described to quantify -nitrotyrosine within tissues or biological fluids. unfortunately, it has been demonstrated that an artifactual nitration of tyrosine occurs with gc/ms assays leading to an overestimation of the response. in the present work, lc-esi-ms/ms methods for quantification of free -nitrotyrosine in rat plasma as well as bound -nitrotyrosine in tissue samples are reported. plasma samples were spiked with , , -d - -nitrotyrosine and the following steps were applied prior to injection into the lc-esi-ms/ms system used in selected reaction monitoring (srm) mode (m/z ae for the analyte and m/z ae for the internal standard): protein precipitation, solid phase extraction on aminopropyl cartridge and derivatization in nbutanol in hcl n. -nitrotyrosine butyl ester has lead to a dramatic increase of the sensitivity (ca. -fold) by comparison with -nitrotyrosine. under such conditions, calibration curves exhibited excellent linearity (r Ͼ . ) within concentration range . to . nm (equivalent to . - , fmol on column) and recoveries above %. inter-and intra-assay precision was determined below % over the concentration range . to . nm. no artifactual nitration of tyrosine occurring during sample clean-up was observed. this was unambiguously established by plotting experimental ratio of analyte response/ internal standard response versus expected within the range . - . nm. this curve strongly correlated with a linear model (r Ͼ . ) and slope was . Ϯ . (mean Ϯ sd). basal level of -nitrotyrosine in rat plasma was measured to be within concentration range Ͻ lod to . nm. -nitrotyrosine basal level in rat plasma, kidney and liver proteins was established by performing enzymatic hydrolysis in order to avoid artifactual nitration of tyrosine which may occur under strong acidic conditions (hcl n at °c). resulting hydrolysates were analysed by lc-esi-ms/ms and nitrotyrosine was monitored in srm mode (m/z ae for the analyte and m/z ae for the internal standard). t. guszczynski , r. b. kapust , d. s. waugh , and t. d. copeland basic research laboratory, and macromolecular crystallography laboratory, national cancer institute at frederick, maryland, u.s.a. the set-can fusion gene was first detected as associated with acute undifferentiated leukemia. set (also called phap ii) is a nuclear phosphoprotein with a long acidic tail. set has been shown to inhibit phosphatase pp a and is a substrate of human granzyme a. in order to determine any zn(ii) binding properties of set, we utilized affinity capillary electrophoresis (ace) to detect shifts in mobility as zn(ii) ions bind to the protein. we have earlier employed ace to measure the binding constants of zn(ii) to the nucleocapsid protein of hiv- . with a constant concentration of recombinant set as a receptor and varying concentrations of zn(ii) as ligand in the sample buffer, we observed changes in electrophoretic mobilities of set when complexes were formed with zn(ii). scatchard analysis of the mobility provided the stoichiometry and binding constant of zn(ii) to set. interdisciplinary research center, institute of nutritional science, department of food sciences, university of giessen, germany peptaibols are defined as fungal polypeptides containing a high proportion of aib (α-aminoisobutyric acid) and a cterminal bound amino acohol. the mold trichoderma aureoviride (strain imi ; commonwealth mycological institute, kew, uk) was cultured in complex medium consisting of casein peptone, g; soy peptone, g; yeast extract, g; dglucose, . g; nacl, g; dipotassium hydrogen phosphate, . g in l demineralized water adjusted to ph . . fermentation was conducted in nineteen -l shake flasks, each containing ml medium, for d at °c. mycelia were obtained by filtration and extracted with meoh and meoh/chloroform. extracts were evaporated to dryness and subjected to sephadex and silica gel chromatography (eluent chloroform/meoh/acoh/water : : : ) yielding . g and . g, respectively, crude peptaibol mixture named trichoaureocins. the peptide mixture was uniform on tlc but could be separated by analytical (fig. ) and semipreparative hplc (nucleosil c- ; ϫ mm id; µm). six peptides could be isolated each of which was subjected to sequencing using on-line hplc (fluorocarbon stationary phase) esi-ms/ ms (lcq, thermoquest, finnigan mat) as described for peptaibols trichovirins and antiamoebins. sequences are presented in fig. . the -residue peptaibols represent a natural peptide library and cause hemolysis of sheep erythrocytes and exert antibiotic activity against bacillus subtilis and staphylococcus aureus. national institute of chemistry, ljubljana, slovenia wine consists of several hundred components present at different concentrations. the dominant ones are water, ethanol, glycerol, sugars, organic acids, and various ions, while amino acids are present at much lower concentration. the composition of amino acids is of great importance in wine production. they act as a source of nitrogen for yeast during fermentation, they influence the aromatic composition of wine and their composition can be used to differentiate wines according to vine variety, geographical origin, and year of production. among already established analytical methods high-field nmr has been shown to be a promising method for the nondestructive analysis of low-molecular mass compounds in complex mixtures like wine due to its selectivity and capability of simultaneously detecting a great number of compounds. h and c one-dimensional nmr spectra of wine are very crowded and many signals are overlapped. due to a great difference in concentration levels the signal intensities of particular compounds may vary for the factor of . the tails of the dominant frequencies of water, ethanol and glycerol obscure weak signals of minor compounds like amino acids in the near surroundings. the use of d homo-and heteronuclear experiments and the suppression of strong signals are a prerequisite for a successful h and c signal assignment. a complete assignment of h and c nmr resonances of seventeen amino acids commonly present in wine and of γ-aminobutyric acid at ph was accomplished using gradient-selected cosy, tocsy, gradientselected hsqc and hmqc experiments with incorporated wet pulse sequence for the supression of large signals. unambiguous assignment of h and c nmr resonances of amino acids is necessary for the selection of appropriate signals in fast and simple one-dimensional nmr that can serve as parameters in the chemometric classification of wines according to the provenance, vine variety, and year of production. institute of medical biochemistry, jagiellonian university collegium medicum, kraków, poland highly sensitive colorimetric method for determination of aldehydes in the reaction with n-methyl benzothiazolone hydrazone (mbth) turned out to be not very specific for such carbonyl compounds. namely, it has been found that tryptophan and to higher degree its n-derivatives (n-acetyl-trp, ala-trp, gly-trp) and also tripeptides (gly-trp-gly and leu-trp-leu) in the reaction with mbth and fe ϩ are converted to coloured products, with maximum wavelength at nm. the properties of the products and the kinetics of the reaction under defined conditions are described in the spectrophotometric procedure. proteins containing tryptophan are also substrates in the reaction with mbth. comparison of molar extinction coefficients of mbth-fe ϩ -treated various proteins with those of simple n-derivatives of tryptophan shows, that not all molecules of tryptophan in proteins are accessible to the reagents, and in order to determine all tryptophan moieties partial unfolding of protein has to be performed. it should be emphasized that aldehydes cannot be detected and accurately determined in the presence of tryptophan derivatives and protein, and also aldehydes interfere with determination of tryptophan derivatives. natural product laboratory, department of chemistry, the university of burdwan, w. bengal, india detection of protein amino is of utmost importance for the evaluation of protein structure and also their presence in numerous natural products. several specific and non-specific reagents have been used for their detection using thin-layer chromatography, an important tool for such purpose. of the reagents in general use, ninhydrin is the most popular for its high sensibility, however, nihydrin produces same purple color with most of the amino acids (only proline and hydroxproline produce yellow color). an endeavour has been made to resolve this color problem with a reagent which is capable of developing various distinguishable colors with many of the protein amino acids and also shows its high sensitivity comparable to ninhydrin. a probable mechanism for such color formation has also been proposed. measuring enrichments below the sensitivity range of conventional gc-ms. the gc-c-irms technique combines the resolution capabilities of gc with the accuracy and precision of irms. at low abundance gc-c-irms analysis it is superior in terms of time, labor, and sample requirement as compared to the conventional off-line analysis. we discuss some latest advancements and applications of gc-c-irms amino acid analysis related to nutrition research. plasma amino acids in omnivorous human subjects show a characteristic n-isotopic pattern with phenylalanine and threonine showing the lowest abundance, whereas e.g. alanine and leucine are higher by ‰ δ n. in rats fed diets containing intrinsically labeled c casein or the corresponding amino acid mixture labeled with c leucine and n lysine whole-body protein homeostasis is better supported by casein-bound than free amino acids. there is no adaptation to a low lysine diet by an enhanced bioavailability of intestinal microbial lysine to extra-splanchnic tissues in minipigs. highly selective hplc determination of tyrosine, tryptophan and their related compounds based on precolumn derivatization followed by intramolecular fluorescence resonance energy transfer detection h. nohta , m. yoshitake , h. yoshida , t. yoshitake , and m. yamaguchi faculty of pharmaceutical sciences, fukuoka university, nanakuma, johnan-ku, fukuoka, and chemical evaluation and research institute, ishii machi, hita, oita, japan we have developed highly selective hplc method for the determination of tyrosine, tryptophan and their related compounds (l-dopa, catecholamines, -hydroxytryptamine, etc.). the compounds were precolumn-derivatized with a commercially available fluorogenic reagent for amines by usual manner. each derivative afforded intramolecular fluorescence resonance energy transfer (fret) from the tyrosyl or tryptophoryl moiety (donor) to the labeled fluorophore (acceptor); the acceptor fluorescence was observed with the excitation of the donor at nm. the derivatives were separated on a reversedphase column and then effectively detected by monitoring their fret. through the screening study of fluorogenic reagents, o-phthalaldehyde (with -mercaptoethanol) and dansyl chloride gave the best results for the purpose. the fret detection method was highly selective and sensitive by comparison with the previous methods detecting native fluorescence of the compounds or typical fluorescence of the acceptor. the presented study was devoted to determination of the energetic effect of interactions in aqueous solutions between urea and neutral amino acid derivatives. the principal reason for studying of interactions of peptides with urea is the hope that such investigations will give insight into the factors affecting protein denaturation in aqueous solutions. the enthalpies of solution of n-acetylglycinamide, n-acetyl-l-alaninamide and n-acetyl-l-leucinamide were measured in water and in aqueous solutions of urea of molality . to . mol·kg Ϫ using the "isoperibol" type calorimeter at . k. from the obtained standard dissolution enthalpies ∆ sol h ϱ m the enthalpic pair interaction coefficients h xy for urea-nacetylamino acid amide pairs in water were calculated. these parameters derived from mcmillan-mayer theory are regarded as a measure of effect of interactions between solute molecules in solution. the h xy values for the systems investigated suggest that the interactions between urea and amide molecules dominate the effects of dehydration of nonelectrolyte and of peptides. the replacement of the hydrogen atom in the hydrocarbon chain with a methyl group causes a positive change in the value of the enthalpic pair interaction coefficient. the obtained results were compared with those of earlier studies of interactions between electrolytes, namely sodium chloride, potassium chloride and sodium iodide and the same n-acetylamino acid amides. the effect of the solute type on the magnitude of the interaction parameter was also analysed. the side chains of amino acids in solution react in various ways with the water molecules which surround them as well as with other components of solution depending on the fact whether they possess non-polar, polar or ionic groups. many research laboratories carry out studies intended to describe precisely the intermolecular interactions with the participation of amino acid side chains. such a description may allow one to describe better the spatial structures of protein and the mechanisms of folding its surface area. the present work reports the results of calorimetric measurements of the dilution enthalpies of l-α-amino acids in water. using modified mcmillan-mayer's theory, these results served to calculate the enthalpic homogeneous interaction coefficients which characterise interactions between the amino acid zwitterions with the competitive participation of water molecules. thus, these coefficients illustrate the differences in amino acid molecules interactions both with the homogeneous amino acid molecules and water molecules around them, and consequently they may play the part of a parameter which differentiates the hydrophobic/hydrophilic properties of amino acid side chains. the enthalpic interaction coefficients of the homogeneous pairs of l-α-amino acids were compared also with the hydrophobicity parameters obtained by fauchere et al., which describe the side substituents of natural amino acids as well as aminobutiric acid (aba). based on the above statement, one may conclude that the obtained enthalpic homogeneous pair interaction coefficients of l-α-amino acids in water make it possible to systematise amino acid side chains according to their affinity to water or their hydrophobic-hydrophilic properties. thus the enthalpic homogeneous pair interaction coefficients may play the role of parameter describing the lipophilicity (hydrophobicity) of amino acid side chains. compounds (iii) with amino acid ligands. in this work we present results of x-ray investigation of fourth amino acid complexes of rhenium (iii), which have different coordination of amino acids around binuclear complexforming center -re ϩ . substances (glyh) . h o -in inner, but gaba has cisposition according to re -re bond. influences of fatty radical length in the amino acid ligand on week interaction between binuclear anion [re cl ] Ϫ and protonized amino acid are discussed. role of hydrogen bonds in formation of crystal unit cell of investigated substances is shown. these two factors are the reason of formation of staggered conformation of an anion [re cl ] Ϫ in the substance (glyh) [re cl ]cl together with existence of quadruple re -re bond that is described first. in the substance [re (gaba) cl (h o)]cl . h o axial position of re ϩ fragment are substituted by ligands of different kind: h o and cl Ϫ -that says about possibilities to coordinate a substrate of biological nature exactly to these position. a precise, sensitive and reliable rp-hplc/uv method was developed to enable determination of α, and k caseins in cow's milk. the optimised method using a chrompack p- -rp column allowed separation of caseins in min. this column differs from conventional alkyl-bonded silica rp matrices in that it is an underivatised polystyrene-divinylbenzene matrix, a material which proved excellent chemical and ph stability. gradient elution was carried out at a flow rate of ml/min and a temperature of °c, using a mixture of two solvents. solvent a . % trifluoroacetic acid in water and solvent b was % acetonitrile- % water- . % trifluoroacetic acid. the effluent was monitored by a uv detector at nm. the determinations were performed in the linear range of . - . mg/ml for k-casein, . - . mg/ml for α-casein and . - . mg/ml for -casein. the detection limits were . , . and . mg/ml, respectively. the validity of the method was verified. the recoveries ranged from to % for cow's milk. the precision of the method was also evaluated, the % cv being less then . %. the developed methodology was also applied with success to the separation of caseins in ewe and goat milks. different chromatographic profiles were obtained for the three kinds of milk. department of aquatic biosciences, the university of tokyo, bunkyo-ku, tokyo, japan several aquatic crustaceans and bivalve molluscs accumulate a large amount of free d-alanine ( - µmol/g wet wt.) in their muscle tissues. during seawater acclimation from freshwater to % seawater, red swamp crayfish procambarus clarkii largely accumulated d-and l-alanine by . -and . fold, respectively, together with l-glutamine, l-proline, and glycine. the percentage of d-alanine to total alanine increased from % in freshwater to % in % seawater. these data indicate that d-and l-alanine are the major compatible osmolytes responsible for the intracellular isosmotic regulation of this species as well as other crustaceans. under anoxia stress for h in freshwater, and % seawater, crayfish increased d-and l-alanine in muscle and hepatopancreas in addition to the increase of lactate. the increase was much higher in seawater than in freshwater. thus, d-and l-alanine may be anaerobic end products during prolonged anoxia of this species. alanine racemase [ec . . . ] has been proved to catalyze the interconversion of d-and l-alanine in crustaceans and bivalve molluscs. this enzyme was isolated to homogeneity from the muscle of black tiger prawn penaeus monodon. the purification was , -fold with % yield. the molecular weight of the enzyme was estimated to be kda on sds-page and kda on gel filtration, suggesting the dimeric nature of this enzyme. the amino acid sequences of the peptide fragments obtained from the isolated enzyme showed low homology below % with those of microbial enzymes. syntheses and immunological effect of thymic humoral factor-γ analogues research laboratory, global shinwa pharmaceutical co. ltd., yoriki, matsuomura, iwate-gun, iwate-ken, japan nine analogues of thymic humoral factor (thf)-γ , were prepared by the solid-phase method and their in vitro restoring effect on the impaired blastogenic response of phytohemagglutinin (pha)-stimulated t-lymphocytes of uremic patients with infectious diseases were examined. the results were as follows: [arg ]-thf-γ exhibited higher restoring activity than that of our synthetic thf-γ . phylos has developed a powerful combinatorial biology platform for peptide and protein selections. phylos' proprietary profusion tm technology enables the selection of peptides and proteins with desired properties. the fundamental advance represented in this unique platform is the in vitro covalent linkage of a peptide or protein (phenotype) to the encoding messenger rna (genotype). this linkage permits the selection of a protein based on its characteristics and allows the recovery and amplification of that protein through pcr, an efficient means of bring the desired proteins to easily detectable levels. profusion tm technology has routinely selected peptide and protein binders with affinity constants in the nanomolar to picomolar range. the starting library size of randomized peptide or protein profusion tm constructs is typically . linear and constrained loop peptide libraries, for ligand generation, enzyme: substrate interaction, peptidomimetic design, and epitope mapping have been successfully used. randomized constrained loops have also been incorporated in a betasandwich scaffold, resulting in the successful selection of binders against targets of therapeutic interest. antigenic properties of three biological active de novo proteins were investigated by peptide scanning approach, using noncleavable multipin technology. a de novo protein albebetin (pid caa ) was engineered to attain a pre-designed d structure and later modified by grafting short peptide fragments from human α interferon (aag ), and insulin molecules (aag ). such protein constructs carrying important biological activities may be used in future as potential protein pharmaceuticals. despite artificial proteins are investigated for more than years, immunological properties of these substances are not known. in our experiments we applied an innovative approach of raising antibodies in yolks of egg-laying hens. three continuous antigenic determinants with different immunogenic potentials have been revealed in two proteins with partially overlapping sequences. it was shown that the octapeptide interferon fragment is the immunodominant site in albeferon and albeferon-insulin molecules. on the contrary, the hexapeptides, corresponding to the insulin fragment displayed low immunogenic activity. thus we recognise that the fragments attached to the de novo frame could essentially govern immunological properties of resulting construct. no preference of any type of secondary structure was observed in antigenic determinants. nevertheless, all of them are located at the boundaries of the secondary structure elements and on the predicted surface-located sites of albebetin molecule. peptide fragments from human α -interferon and insulin corresponding to the functionally important sites of their molecules were grafted into de novo protein albebetin (pid caa ) engineered to attain a pre-designed tertiary structure with a unique topology that has not been observed in natural proteins. by means of genetic engineering the dna fragments corresponding to these peptides were inserted into the albebetin gene to obtain two variants of albebetin with antiviral fragment of human α -interferon and two variants of albebetin with insulin-like peptide. the chimerical genes were expressed in escherichia coli in a fusion expression system with thioredoxin based on the plasmid pet- (novagen). the fusion proteins were digested by highly specific protease factor xa and the target chimerical proteins were purified and tested for their structure and biological activity. according to the cd spectroscopy study the chimerical proteins maintained the pre-designed structural properties of albebetin. toxicological testing of the proteins in the mtt-test did not reveal their cytotoxicity. antiviral activity of de novo proteins with human α -interferon fragments was studied in vitro using human fibroblasts cell line l- and simian cell line vero. treatment of these cell lines with the proteins revealed the dose-dependent stimulated antiviral activity on fibroblasts and direct dose-dependent antiviral activity on the vero cells. one of two de novo proteins including insulin-like fragment (pid aag ) acquired ability to stimulate glucose uptake by l- cells although the efficacy of stimulation was lower than that for the synthetic peptide and insulin. these results demonstrated that albebetin can be used as a scaffold for constructing of the functionally active de novo proteins possessing the pre-designed tertiary fold of albebetin and various biological activities. the identification of genes encoding unique tumor associated antigens (taas) has facilitated the development of novel immunotherapeutic strategies in cancer patients. clinical investigations have focused on targeting these cancer antigens for the generation of anti-tumor t-cell responses. taa epitopes come from differentiation antigens, from embryonal reexpressed or overexpressed proteins, from mutated proteins and from viral proteins in viraly associated tumors. we have recently developed a novel screening system for identification of immunogenic and antigenic ctl peptide epitopes using d b-/-x m -/double knockout mice, transgenic for a single-chain hla-a - m molecule (hhd mice). specific ctl were derived by immunization of hhd mice with tumor peptide extracts loaded on antigen presenting cells and with hhd transfected human tumor cell lines ctl induced against peptides from various tumors recognized tumor peptides more effectively than peptides extracted from normal tissues and also reacted with a serie of peptides derived from overexpressed candidate proteins, identified by differential display methods (sage, microarrays) comparison of ctl derived from hhd mice to ctl induced from patient's pbmc showed overlapping recognition of many candidate peptides. using these hhd mouse derived ctl we identified novel peptide sequences from prostate, bladder, breast and colon carcinomas, antigens pap and steap, from breast carcinoma antigens muc and ba - . analysis of tumor differentially expressed genes by the sage method in colon, followed by screening for hla-a binding peptides resulted in candidate peptides for immunogenicity screening. we have identified antigenic peptides of which peptides were found to be immunogenic in hhd mice. interestingly of these peptides are derived from the same protein. differential expression studies, using "dna chips" were performed on prostate and bladder tumors versus normal tissues. ten new candidate genes from tcc were analysed for expression and potential immunogenic peptides. novel peptides from uroplakins and from mage- were identified. surface plasmon resonance biosensing in the study of viral antigenic sites mimicked by synthetic peptides p. gomes , e. giralt , and d. andreu centro de investigação em química da universidade do porto, portugal department de química orgànica, universitat de barcelona, spain antigen-antibody binding has been regarded as one of the most representative examples of specific molecular recognition in nature. the simplistic view of antigenic recognition in terms of a lock-and-key mechanism is superseded, since it is now evident that both antigens and antibodies are flexible and can undergo substantial mutual adaptation. this flexibility is the source of complexities such as degeneracy and non-additivity in antigenic recognition. we have used surface plasmon resonance to study the effects of combining multiple amino acid replacements within the sequence of the antigenic gh loop of foot and mouth disease virus. our aim was two-fold: to explore to what extent can antigenic degeneracy be extended in this particular case, and to search for potential non-additive effects in introducing multiple amino acid replacements. combined analysis of one such multiply substituted peptide by spr, solution nmr and x-ray diffraction shows that antigenic degeneracy can be expected as long as residues directly interacting with the paratope are conserved and the peptide bioactive folding is unaltered. structural properties of creatine kinase from amphioxus, branchiostoma belcheri gray f. inoue, s. obase, t. suzuki, and t. imai department of physiological chemistry, faculty of sciences, toho university, funabashi, japan to further our knowledge of creatine kinase (ck) in the fields of molecular evolution and comparative enzymology, we analyzed the ck gene of the protochordate amphioxus. amphioxus is thought to be the phylogenetic predecessor of vertebrates and thus possesses characteristics, such as enzymological properties, that are associated with ancestral vertebrates. the results clarified the sequence of bases including the active site. the homology of the active site and the surrounding bases for the amphioxus ck gene to that of the human and electric ray ck-m gene was . % and . %, respectively. the amino acid sequence of this region of amphioxus ck was also identical to that of human and electric ray ck-m. in addition, the estimated secondary structure of amphioxus ck was compared to that of human and electric ray. there were no marked differences in the relative ratio of the α helix, sheet and turn structures for the peptide structure of ck consisting of amino acid residues. there was a high degree of homology in the sequence of amino acid residues (met ϳhis ) near the active site of ck between amphioxus and other organisms, suggesting that this region of ck is functionally essential for transphosphorylation. gelsolin is a ca ϩ -activated and phosphoinsitide-regulated cytoskeletal actin-binding-and-severing protein, its fragments - : ksglkykk (g - ) and - khvvpnev vvqrlfqvkgrr (g - ), are responsible for the binding of this protein to actin and the cellular messenger phosphatidylinositol , -bisphosphate (pip ). the binding of peptides g - and g - to a cluster of four pip molecules in a dimyristoyl-phosphatidylcholine lipid was in vestigated by means of molecular-dynamics (md) simulations of , ps. the binding of the pip molecules to the peptides g - , g - showed both electrostatic and hydrophobic nature: lysine residues of the peptides formed salt bridges with the phosphate groups of the pip molecules, while hydrophobic interactions occurred between the nonpolar residues of the peptides and the fatty-acid tails of pip . during the binding some of the pip molecules were dragged out of the lipid, thus disrupting the bilayer. after the binding dissociated a draggen-out pip molecule tend to incooporate back to the lipid. division of applied physiology, institute of veterinary physiology, university of zürich, switzerland chemical modification of the proteins: bovine serum albumin, α-lactalbumin, -lactoglobulin and chicken egg white lysozyme by -hydroxyphthalic anhydride ( hp) yielded compounds which exerted antiviral activity in vitro as compared with the native unmodified proteins. of the three enveloped viruses tested: human herpes simplex virus (hsv- ), bovine parainfluenza virus (pi- ) and porcine respiratory corona virus (prcv), the hp proteins were shown to be active against human herpes simplex virus only indicating that a perturbation of the viral envelope is unlikely. pre-incubation of vero cells with hp-albumin, hp--lactoglobulin and hplysozyme resulted in protection against hsv- infection whereas pre-incubation with hp-α-lactalbumin had no antiviral effect. however, all hp modified proteins showed a more significant inhibition when present during or after the viral infection step. thus multiple mechanisms appear to be involved in the inhibition of hsv- infection. the blocking of cell receptors may contribute to the antiviral activity as shown by the preincubation data. however, a direct interaction between the modified proteins and the hsv- glycoproteins responsible for viral entry and spread, seems to play a more important role, as indicated by the smaller ec values obtained during and after the infection. a dummy or a protagonist on the stage of inflammation? r&d department, zambon group, bresso, milan, italy amino acids are usually present in large excess in healthy and the excess is used as source of calories. however, metabolic alterations are observed in ill patients and preferential retention of sulphur amino acids (saa) occurs during the inflammatory response. the metabolism of cysteine is modified during the acute phase of sepsis in rats. sulphate production is lower, whereas the higher liver production of taurine seems to play a protective role; glutathione concentration is greater in liver, kidney and other organs and cysteine incorporation into proteins was higher in spleen, lung and plasma (acute phase proteins) while albumin level decreases. another important phenomenon is the impairment of methionine conversion to cysteine during stressed condition. premature infants or hiv patients synthesise cysteine from methionine at a much lower rate. thus, the metabolic flow through the trans-sulphuration pathway may be insufficient to meet the glutathione and cysteine requirement in critical conditions. the pro-inflammatory cytokines, interleukin- , interleukin- and tnf-α are the main initiates that alter protein and amino acid metabolism. in this complex picture, saa supply may contribute to the immune system regulation. our previous investigations showed some biological activity of newly synthesized cluster rhenium compoundtetrachlorodi-µ-(γ-aminobutirato)dirhenium(iii) chloride -i such as antitumour activity, cell-stabilizing activity against osmotic hemolysis, changing of morphology of cells, and other. there exists some information about stabilizing effects of some metal-organic substances with antitumour properties on the isolated ishaemic-reperfused rat heart (leperre a. ) throughout decrease of malonaldehyde (mda) production. some new investigations showed the influence of metal-organic substances on apoptotic processes (winter b. , syrkin a. , that are considered now as the main mechanism of such tissue damages as ishaemia, myocardial infarct, etc. thus we tried to analyze such activity of i. two models of hemolytic anemia was used: a -on rabbits by introducing of pbac -solutions; this model permits to investigate dynamics of anemia in one experimental animal; bon rats by introducing of phenylhydrazine chloride. i was administrated as in solution as in lyposomic (lyp) forms. all measurements and models were accomplished according to described procedures. administration of i led to: increase of hemoglobin and resistance of erythrocytes and to prolonging of life for hemolytic animals; significant decrease in quantities of mda and increase in quantities of reduced glutathion (gsh), glutathionreductase (gsr) and glutathionperoxidase (gsp) in myocardium, blood, brain, liver, splenic and entherocites of anemic animals. the most effective was i in lyposomic form. mechanism of antioxidant action of rhenium cluster compound is speculated and experiments with some well-known antioxidants to compare with i are working out. at present problem of finding remedies against the mostly dangerous human disease -aids is one of higher interest. the aim of this work was the investigation of inhibiting effect of high-pure l-lysine-α-oxidase (lo) e.c. . . . , extracted from trichoderma sp., on hiv-virus reproduction, comparatively to azidotymidin (azt), being now in use for treatment of aids-patients. for studying of inhibiting effect of lo, the mt- cells, sensitive to citopathical action of virus, were used. the experimental studying has shown, that the enzyme at concentration - ng/ml suppresses hiv reproduction and synthesis of virus' proteins, not exerting toxical effect on mt- cells. toxical dose of lo has been determined preliminary. a comparison with standard preparation -azidotymidin, which causes suppression of virus reproduction at concentration mkm ( , mg/ml) not exerting toxical effect on mt- cells. the same effect is attained having used lo in doses - ng/ml. using lower concentrations of enzyme leads to partial increasing of virus' titre comparatively to control cultures. obtained data allow to conclude that lo from trichoderma sp. is more high specific agent than azidotymidin, because it needs times lower concentration for the same action. comparison of azt and lo action on synthesis of virus' antigens presenting in cultural media of mt- cells infected with virus, leads to conclusion, that lo has inhibitory action both on virus' reproduction and virus' protein synthesis. department of microbiology, dokkyo university school of medicine, mibu, tochigi, japan our previous studies showed that the cellular amino acid composition obtained by amino acid analysis of whole cells, differs in various organisms. these results suggest that the difference in the cellular amino acid composition reflects biological evolution. however, the basic pattern of cellular amino acid composition is relatively constant in all organisms, and the cellular amino acid compositions of the archaeobacteria are quite similar to those determined from codon usage data, based on the complete genomes. in the present study, the free amino acid compositions in archaeobacteria, eubacteria, protozoa, blue-green alga, green alga, slime mold, plants and mammalian cells were analyzed, to investigate whether changes in their free amino acid compositions reflected biological evolution. cell homogenates were treated with - % ethanol to separate cellular proteins and free amino acids contained in the cells. rat hepatoma cells (r-y b) were cultured in eagle's minimum essential medium (mem) containing % serum or in a modified mem lacking arginine, tyrosine and glutamine. no significant difference in the free amino acid composition was observed between the two cell groups cultured under two different conditions. the patterns of the free amino acid compositions differed completely from those of the cellular amino acid compositions, and from each other in various organisms. characteristic differences were observed between plant and mammalian cells, and between archaeobacteria and eubacteria. the patterns of the free amino acid composition in blue-green alga, green alga, protozoa and slime mold differed from each other and from those of eubacteria and archaea cells. it has been suggested that the free amino acid composition reflects apparent biological changes as the result of evolution. ) catalyzes the hydrolysis of gamma-glutamyl compounds such as glutathione, and the transfer of their gamma-glutamyl moieties to amino acids and peptides. we previously developed enzymatic methods for the synthesis of various gammaglutamylamino acids using the transfer reaction of ggt from e. coli k- as a catalyst. it has been reported that gamma-lglutamyltaurine has a potent and long-lasting antiepileptic action, and its chemical synthesis has also been reported, but it required protecting and deblocking of reactive groups. thus, the purpose of this study was to develop an enzymatic method for the synthesis of gamma-l-glutamyltaurine using ggt. the optimum reaction condition was mm l-glutamine, mm taurine and . unit/ml ggt, ph , and -hr incubation of °c. forty-three mm gamma-glutamyltaurine was obtained and the yield was .%. gamma-glutamyltaurine was purified by dowex ϫ column and c column, and then identified with gamma-l-glutamyltaurine by nmr and polarimeter. in this study the yield of gamma-l-glutamyltaurine was comparatively low because synthesized gamma-lglutamyltaurine was promptly converted into the by-product, gamma-l-glutamyl-gamma-l-glutamyltaurine. the production of antimicrobial peptides is an important aspect of host defense in animals ranging from insects to mammals. they do not target specific molecular receptors on the microbial surface, but rather assume amphipathic structures that allow them to interact directly with microbial membranes, which they can rapidly permeabilize. they are thus perceived to be one promising solution to the growing problem of microbial resistance to conventional antibiotics. insects express a battery of potent antimicrobial proteins in response to injury and infec-tion. until now, approximately immune peptides have been characterized from insects and other invertebrates. an antimicrobial gene (md-cecropin) belonging to cecropin family was cloned from the bacteria-charged adult house fly, musca domestica. expressed in the vector pgex- t . mrna was isolated and degenerated primers were designed according to the conserved sequences of cecropins. the full-length cdna encoding md-cecropin was cloned by rt-pcr and Ј, Ј-race and sequenced. the deduced amino acid sequence indicated that a prepeptide with amino acid residues is first translated and then processed to a mature peptide with amino acids. the dna encoding the mature peptide was subcloned into expression vector pgex- t , and expressed efficiently in e. coli bl as a fusion protein. the fusion protein was purified and specifically digested and the md-cecropin was further purified to homogeneity and the activity spectrum was investigated. escherichia coli with metabolic engineering methods l. yun, x. zhang, s. wang, q. xu, and l. ma biotechnology laboratory, institute of beijing radiation medicine, beijing, p.r. china a bioengineering escherichia coli strain was obtained by metabolic engineering method. three genes related to the biosynthesis of phenylalanine, arog, phea, and tyrb encoded key enzymes: -deoxy-d-arabino-heptulonate- -phosphate synthetase (ds), a bifunctional protein-chorismate mutase (cm)/prephenate dehydratase (pd) and aminotransferase (at), respectively. in this work, the feedback inhibition of ds and cm/pd were relieved by site-directed mutagenesis on bases of homology comparison of related sequences of the key enzyme. the feedback inhibition resistant genes encoding ratelimiting enzymes in the main and terminal pathways were amplified by co-expressed in order of arog-phea-tyrb on the plasmid by their own operator plpr, pl, and pr. in the recombinant strain showed great resistant to the l-phenylalanine analogues, the specific activities of ds, cm, pd and at were increased by . , . , . and . folds, respectively. as the result, the amount of phenyalalnine biosynthesis of the bioengineered strain was increased greatly compared with that of the host strain. an enzymatic approach for the mapping of phosphoproteins resolved on two-dimensional polyacrylamide gels hiroshima proteome laboratory, regional science promoter program, kagamiyama higashihiroshima, japan an enzymatic approach for high-throughput mapping of phosphorylated proteins resolved on two-dimensional ( -d) polyacrylamide gels is presented. proteins of cultured rat skin fibroblasts were divided into two aliquots, one of which was dephosphorylated using recombinant protein phosphatase. the two aliquots were then subjected to -d electrophoresis. the phosphoproteins could be mapped on the -d gel by com-paring the gels of the phosphatase-and non-treated samples, because the dephosphorylated proteins shifted to more basic positions on the gel. this technique revealed that approximately % of the detectable proteins were phosphorylated. fifteen phosphoproteins were identified by mass spectrometry, including proteasome component c and small glutamine-rich tetratricopeptide repeat-containing protein. furthermore, the extent of phosphorylation of two actin-modulating proteins, destrin and cofilin, was found to be significantly reduced when the cells were chemically or enzymatically detached from the culture dishes. the presented technique can be applied to all biological materials because it requires no protein-labeling step, and is therefore useful for high-throughput mapping of phosphoproteins in proteome research. with the completion of the human genome sequence maldi-tof-ms is increasingly becoming an established method for identification of proteins separated by d gel electrophoresis. mono-isotopic peptide mass fingerprinting (pmf) has been previously shown to be amenable to full automation encompassing the process of acquisition, data processing and databank searching under full software control. until now the throughput of maldi-tof-ms for proteomics has been limited to several hundred samples in a working day and this represents approximately - % of the total proteins resolved by a large format d gel. to reduce the number of proteins to be identified the d gels are imaged and analysed to determine differences in expression levels within a set of gels. although much of the image processing is semiautomated the comparison is labour intensive as manual pattern matching has a role in the gel alignments (land marking). increased ms sample throughput allows the possibility of identifying every protein spot in a d gel within a day. this could eliminate the potentially erroneous step of human gel image alignment, whereby land marking could be achieved using the ms data. increased sample throughput requires greater capacity and robust unattended instrument operation. in this poster we describe an integrated robotic multiple plate loader that allows overnight unattended ms operation. other improvements include an increased laser repetition rate that allows the data capture rate to increase four fold. sample tracking, data archiving and data reporting are essential attributes of this new technology and these aspects are outlined in the presentation. the proteinchip tm biology system for ciphergen biosystems: a novel proteomics platform for rapid biomarker discovery, validation and identification ciphergen biosystems ltd., surrey technology centre, the surrey research park, guildford, surrey, u.k. the proteinchip system uses seldi (surface enhanced laser desorption/ionization) proteinchip technology to perform the separation, mass detection and analysis of proteins at the femtomole level directly from biological samples. surfaces are based on either chromatographic based chemistry (ion exchange, reverse phase, imac etc.) that bind large classes of proteins or biologically defined surfaces (antibodies, dna, receptors, etc.) that are used to investigate specific proteininteraction events. as with conventional elution chromatography each type of surface is designed to bind a different subset of proteins from a crude mixture. sample complexity is reduced on the surface by washing with standard biological buffers compatible with the chosen proteinchip array. unlike elution chromatography, proteins are detected directly from the stationary phase using laser based mass spectrometry greatly increasing throughput whilst reducing sample loss and improving reproducibility. multiple proteinchip surface and wash conditions are explored with a small sample set to resolve hundreds of proteins and establish assay conditions that reveal candidate biomarkers or diagnostic protein profiles. the resulting custom built assay is then used to monitor disease processes or drug toxicity profiles by screening large banks of samples such as tissue extracts or physiological fluids (serum, urine, csf, etc.). pharmaceutical research, genomics technologies, f. hoffmann-la roche ltd., basel, switzerland to the present, samples representing the total protein mixture have been usually analyzed by proteomics technologies mainly only the abundant, hydrophilic components have been visualized. these proteins could be solubilized with reagents compatible with isoelectric focusing, for example urea and chaps. such an analysis provides us with a limited image of the proteome, which is insufficient for the detection of the majority of the proteins. in a -d gel, where about mg of protein amount has been resolved, , - , protein spots can be detected, using coomassie blue staining. the spots represent the products of only - different genes. other gene products, not visualized, are most likely expressed at too low levels for detection or they can not be identified because of limitations of the current technology, they are too small, too large, basic or hydrophobic. here we will discuss protein enrichment approaches prior to the analysis, which we have applied for the enrichment of bacterial and eukaryotic proteins. proteomic analysis of the rat liver mitochondrial proteins m. fountoulakis , j.-f. juranville , and l. suter genomics technologies, and drug safety, f. hoffmann-la roche ltd., pharmaceutical research, basel, switzerland subcellular fractionation increases the probability of detection of low-abundance proteins. we prepared mitochondrial, microsomal and cytosolic protein fractions from total liver of male rats. the proteins of the three fractions were analyzed by two-dimensional electrophoresis using broad and narrow ph range immobilized ph gradient strips. the proteins were identified by martix-assisted laser desorption ionization mass spectrometry. in the mitochondrial fraction, different gene products were detected. approximately % of the identified mitochondrial proteins are enzymes with a broad spectrum of catalytic activities. most of the identified proteins had been detected before in other samples as well, analyzed in our laboratory. eight gene products were detected for the first time. these were represented by one spot each, whereas most of the frequently detected proteins were represented by multiple spots. in average, approximately spots corresponded to one gene product. centre for molecular medicine, university college london, u.k. three kinds of experiments have been carried out successfully in our labs. ( ) identification of post-translational modifications of the endothelin a and b receptors (etar and etbr) including both phosphorylation and acylation. we have developed new, very efficient methods for single step isolation of highly pure etar and etbr from cells. this has allowed us to obtain evidence that the post-translational modifications are very complex and result in multiple phenotypes showing different forms of modification for receptor. as with other systems, e.g. insulin-like growth factors, it is probable that these multiple phenotypes of the et receptors correspond to different forms of signalling dependent on cellular state, e.g. the cell cycle. it is, for example, already clear from the phosphorylation of the receptor that a series of different kinases must be involved. ( ) following stimulation of fibroblasts with endothelin, phosphorylation/dephosphorylation signalling cascades involving several hundred proteins have been observed by use of high resolution d electrophoresis and detection of phosphorylated proteins labeled with p by autoradiography or immunological methods. the large number of proteins involved are being identified by mass spectrometric methods such as mass fingerprinting or sequencing by mass spectropmetry. ( ) differential gene expression has been followed by using s met pulse chase labelling concurrently with endothelin stimulation. at least proteins showed significant changes in expression of d gels and these proteins are also being identified. these experiments demonstrate that it is now possible to use proteomics methods to investigate the integration of response to an extracellular signal at the levels of the receptor itself, the subsequent signalling cascades and the ensuing gene expression. the proteomics technology permits concurrent monitoring of large numbers of protein phenotypes (the forms and amounts of individual proteins and is therefore able to provide a global overview of signalling processes which greatly augments more traditional investigations of individual proteins or pathways. furthermore, these new methods will allow quantitative determination of the changes in protein phenotypes, which is very important in view of the highly non-linear amplification properties of such signalling processes. an integrated approach to automated high throughput protein identification by d gel electrophoresis and mass spectrometry d. gostick , s. cohen , p. young , b. karol , j. langridge , j. randell , t. slyker , and a. jacobson micromass, manchester, u.k. waters corporation, milford, massachusetts, and bio-rad laboratories, hercules, california, u.s.a. establishing the function of gene products is the major challenge of the post genomic era. the rate-limiting step in this endeavour is the speed with which proteins can be isolated and identified. separation of proteins from cell lysates or sub-cellular domains by d gel electrophoresis is an established method of visualising these complex systems. recently mass spectrometry has proved to be a powerful method of further characterising these proteins. from the mass spectrum of the enzyme digest of a d gel spot, the resulting digest map is compared with the theoretical maps from the databases and the protein identified when these correlate. maldi-tof is of great benefit in these studies since it requires a minimal amount of sample, is relatively tolerant to salts and other contaminants arising from the gel and may be configured for automated sample analysis. high sample throughput with automated analyses including data processing and client-server database searching are already available. our system automatically acquires the data and processes the maldi mass spectrum into a monoisotopic peak list. this peak list is then automatically sent to a networked database for protein identification. when proteins are not identified from the maldi analysis or an ambiguous result is obtained, then further analysis of the sample by electrospray caplc-ms-ms is required. the development of a hybrid quadrupole orthogonal acceleration timeof-flight mass spectrometer (micromass, q-tof) has facilitated the generation of unambiguous amino acid sequences from the ms-ms analyses of tryptic peptides. these ms-ms spectra can be automatically searched against protein, nucleotide or est databases. thus enabling protein identification from gel spots, despite non-specific enzymatic cleavage, protein co-migration and post transitional modifications. for organisms who's genome sequences are poorly represented in the data bases de novo amino acid sequencing may be required. inferring de novo peptide sequences from ms-ms data is complex and is often the rate-determining step in this method. however, it is now possible to interpret the ms-ms spectrum automatically. in our approach the raw ms-ms spectrum is reduced to the plausible single-charge, monoisotopic mass spectrum. sequence interpretation is achieved by generating "trial sequences" consistent with the experimentally determined molecular weight. a probabilistic fragmentation model is used to transform the trial sequences to predicted spectra for comparison to the single-charge, monoisotopic spectrum and to calculate the likelihood that the trial sequence would account for the observed data. the possible number of trial sequences for any peptide is large, for example there are possible sequences for a peptide containing any of the naturally occurring amino acids and having residues. to reduce the scale of the problem a terminated markov chain monte carlo algorithm is used to produce sequences. this bayesian method simulates an exhaustive search of all sequences having the correct mass. the huge increase in genomic sequence information available, combined with the increased sensitivity and selectivity provided by mass spectrometry, has allowed large-scale protein identification. however the analysis of the post translational modifications present on the identified proteins is a more challenging problem. currently the approach that offers the most expedient and specific solution, to determine modified peptides, is precursor ion scanning. this approach has primarily been performed on a triple quadrupole mass spectrometer where the rear quadrupole, (ms ) is set to transmit only the fragment ion of interest. the ms quadrupole is then scanned across the appropriate mass to charge range. in this paper we describe a method that allows specific post translationally modified peptides to be identified and sequenced during the course of an hplc experiment on the q-tof mass spectrometer. during the hplc run the instrument is switched alternately at one-second intervals between low and high collision energy with argon in the collision cell. the quadrupole, ms is not mass selective, operating in the rf only mode. the first data set at low energy ( ev) shows only the normal pseudo molecular ions. the second at higher energy shows their fragments. wherever a product ion of interest occurs in the high-energy data all its possible precursors are revealed by the corresponding ev data. since the two data sets contain the entire set of precursor and product ions that can be formed it is clearly possible to generate the equivalent of a constant neutral loss scan. this is invaluable in the case of phosphorylated peptides where the neutral loss of da (h po ) occurs via -elimination from the phosphoserine and phosphothreonine residues. this allows the q-tof mass spectrometer to switch from the ms mode to the ms/ms mode of operation when a potential pseudo molecular ion exhibits a neutral loss of da between the high energy and low energy data sets. the product ion ms/ ms spectrum can then be acquired on the phosphorylated precursor ion. in the case of phosphotyrosine, neutral loss of the h po moiety is not observed, however a low mass immonium ion at m/z can be detected. this characteristic ion (from the high energy data) is used to direct the mass spectrometer to fragment potential phosphopeptide precursor ions, which are selected from the low energy data. in this case several precursor ions may require ms/ms interrogation at one decision making time-point. with the first draft of the human genome completed largescale protein identification by mass spectrometry, even for samples originating from higher organisms has become relatively straightforward. this requires a high throughput facility to identify proteins that have usually been separated by d page. the approach providing the highest level of automated sample throughput, in terms of samples per hour, is currently maldi-tof-ms. this technique provides a peptide mass fingerprint of the protein digests and allows the rapid and accurate identification of the parent protein by comparison to a databank. however, under some circumstances, for example if the number of peptides detected is small or if the sequence coverage is poor, it is advantageous to be able to include even a short piece of sequence information to provide added specificity. in a conventional maldi-tof-ms instrument post source decay (psd) can be used to try and generate sequence information, however this approach is notoriously unreliable in producing good quality ms/ms data. one reason for this is that the peptide ions do not undergo fragmentation in a controlled environment such as a gas cell with selected collision gas and collision energy. an alternative approach is to use the predictable fragmentation obtained from a hybrid quadrupole ortho maldi source has been fitted to a hybrid quadrupole orthogonal acceleration time-of-flight (q-tof) mass spectrometer. in contrast to a conventional maldi-tof-ms instrument the resolution and mass measurement accuracy of the data is comparable between the ms and ms/ms modes. this allows superior data acquisition in the ms-ms mode compared to conventional maldi-tof-ms. a number of modifications have been made to optimise the system for high throughput proteomics. the maldi source has been configured with a high-density target plate, compatible with a well microtiter plate. the acquisition software has been modified for automated data acquisition in both the ms mode and the ms to ms/ms switching mode. dedicated processing software has been developed to fully automate the post acquisition and databank searching. this software has been optimised to consider the unique nature of the data acquired from this configuration of instrument. in this paper we demonstrate the how an maldi-q-tof instrument can be used for high throughput proteomics. we also compare and contrast is functionality in comparison with alternative strategies for high throughput proteomics, namely conventional maldi-tof-ms and electrospray lc-ms/ms. pseudomonas putida is an ubiquitous, metabolically and physiologically extremely variable soil bacterium. it is kown to be a good colonizer of plant roots and a plant growth promoter. now, after the sequencing of the total genomic dna has been finished we have focused on the functional analysis of this strain. plant growth promotion is achieved in different ways. one is the inhibition of fungal and bacterial phytopathogens, which is known to be a multifactorial mechanism. an important factor of this mechanism is the production of siderophores (iron-transport-agents), small linear or cyclic peptides, which are synthesized in a ribosomal-independent manner by special synthetases. the siderophore production is induced by iron limitation. the regulation of this process was investigated by pulselabelling with [ p] inorganic phosphate. d-protein patterns generated from cells grown with and without fesupplementation were compared. proteins which were phosphorylated under iron limitating conditions were analysed by maldi-tof peptide mass fingerprint. for the identification of the proteins we used an in-house peptide mass database which has been built based on the genomic sequence data. bio-rad laboratories, inc., hercules, california, u.s.a. worksbase software for proteomics is a platform independent information management system encompassing laboratory experimental workflow and bioinformatics for protein and biochemical research. the worksbase system is designed to allow direct internal integration between laboratory experimental data and background biological knowledge found in reference and in-house data, such as gene, protein and functional annotation databases. worksbase provides a crossdisciplinary research infrastructure for drawing together multiple lines of evidence for characterization of proteins, and integration of this data with domains such as gene expression, pharmacological screening, structure and related areas. while the focus is on the biology underpinning the experimental work, the system is also designed with the capability of providing a sample and workflow tracking system for use in the wet lab, effectively a proteome lims (laboratory information management system). as experimentation proceeds in the laboratory, worksbase software can be used for development of hypotheses on protein, biochemical pathway, and post-translational processing involvement in biological systems and disease processes. as such, identifications that are derived from lab work and user observation can be used to augment the reference data repository. however, unlike databases ands systems where the methods and reasoning for assignment of annotations are obscure, by maintaining the link between the source data and the biological roles derived from them, the accuracy and integrity of any information stored in the worksbase system can be directly ascertained. changes in the brain protein levels following administration of kainic acid k. krapfenbauer , , m. berger , g. lubec , and m. fountoulakis f. hoffmann-la roche ltd., pharmaceutical research, genomics technologies, basel, switzerland institute of cancer research, and department of pediatrics, university of vienna, austria kainic acid (ka), a potent neurotoxin and excitatory amino acid, leads to derangements and modulation of brain proteins. no global brain protein expression pattern induced by ka-treatment has been reported yet. we studied the effect of systemic ka administration on the levels of brain proteins. rats were injected placebo or ka intraperitoneally and brain was taken after one week. the mitochondrial and cytosolic fractions of the brain proteins were analyzed by proteomics technologies. heat shock protein hsp was exclusively detected in brains of animals treated with ka. the levels of neurofilaments and alpha-internexin were significantly decreased and a fragment of tubulin alpha- chain was manifold increased in ka-brains. the mitochondrial enzymes dihydrolipoamide dehydrogenase, atp synthase beta chain and isocitrate dehydrogenase were reduced and pyruvate kinase m was increased following ka treatment. the results indicate altered regulation of heat shock proteins, neuronal death, cytoskeletal disruption and mitochondrial derangement by systemic ka administration. this report confirms and extends previous studies on the effect of ka on the expression of brain proteins and suggests that our analytical system can serve as a model for neurotoxicological, neurobiological and neuropathological proteome studies. the rat brain mitochondrial proteins genomics technologies, f. hoffmann-la roche ltd., pharmaceutical research, basel, switzerland we constructed a two-dimensional database for rat brain mitochondrial proteins. rat is a useful model of human diseases of the central nervous system. in order to detect alterations in the levels of the low abundance brain proteins, the mitochondrial, microsomal and cytosolic fractions were prepared. the proteins of each fraction were analyzed by two-dimensional electrophoresis, followed by martix-assisted laser desorption ionization mass spectrometry. approximately proteins were identified in the mitochondrial fraction, which were the products of different genes. about % of the identified proteins were detected in the mitochondrial fraction only and the rest were detected in the cytosolic and about % were found in the microsomal fraction as well. of the proteins had not been detected before in our laboratory. the identified proteins were in the majority enzymes or enzyme subunits with a broad spectrum of catalytic activities and heat shock proteins. whilst lc-ms/ms has been utilised for the identification of proteins from complexes and cell lysates (qualitative proteomics), the quantitative study of gene expression using differential display has until recently been the preserve of a d gel based proteomic experiment. however, recently a great deal of interest has been generated on the use of isotope coded affinity tags (icat) for the quantitative study of gene expression at the proteome level. the technique is based upon chemically modifying the cysteine residues of proteins isolated from cells in two different states with light and heavy isotopically labeled reagents. the two cell states are then combined, digested with trypsin and the cysteine containing peptides preferentially selected by binding to an avidin column, prior to analysis by mass spectrometry. the eluent from this column is then analysed by capillary lc esi-ms/ms. interrogation of the eluting peptides by tandem mass spectrometry and databank searching results in the identification of the associated protein. we describe how icat data analysis has been automated within a software environment. the ms and msms data acquired using the qtof instrument are processed and analysed using a new algorithm which recognises related isotope clusters and quantifies their relative intensities. based on a user defined ratio threshold the software will automatically carry out an lc-ms/ms experiment and databank search in a client-server mode and provide a report of the identified proteins and their expression ratio in the two cell states. deterioration of the transcriptional, splicing and elongation machinery in brain of fetal down syndrome b. lubec and m. fountoulakis department of neonatology, university of vienna, austria gene technologies, cns research, f. hoffmann la roche, basle, switzerland perturbation of brain development i.e. regulation of gene expression, differentiation, growth and migration in down syndrom (ds) has been reported to occur early in life pointing to impairment of the complex system of transcription and or translation and indeed, altered expression of transcription factors has been reported in adult ds brain. we therefore decided to compare the transcriptional and translational machinery in cortex of brains of controls and fetuses with down syndrome in the second trimenon of gestation. we determined a series of transcription/translation factors by d-electrophoresis followed by maldi -identification and quantification with specific software. the protooncogene c-crk, crk-like protein, elongation factor -alpha , elongation factor , elongation factor tu and two out of four spots representing ptb-associated splicing factor psf were significantly downregulated in brain of fetal ds fetuses as compared to controls. the finding of reduced transcription and translation factors may indicate deranged protein synthesis. the underlying cause for individual reduced transcription, splicing and translation factors may be explained by chromosomal imbalance or by posttranslational modifications as e.g. phosphorylation, known to be aberrant in ds. reduced expression of transcription factors in fetal ds during early life may be responsible or reflecting impaired brain development and deficient wiring of the brain in ds. r. mazzoli , m. g. giuffrida , e. pessione , g. dellavalle , c. barello , e. griva , and c. giunta dipartimento di biologia animale e dell'uomo, università di torino, and csaapz-cnr. c/o bioindustry park canavese colleretto giacosa (to), italy a fast phenol degrading acinetobacter radioresistens strain was isolated in our laboratories and selected for bioremediation applications. this bacterium is also able to grow on benzoate and catechol as sole carbon-energy sources, metabolizing them via the ortho route. in previous researches we detected, by means of proteome analysis, some marker enzymes of the phenol and benzoate degradative pathways. in the present work we extend the identification of the proteins involved in the aromatic-ring opening (the different components of the phenol hydroxylase and benzoate dioxygenase, the catechol dioxygenase isozymes) together with other satellite proteins specifically induced by the aromatic growth substrate. of these last proteins some are probably related to the cellular uptake of benzoate and phenol while others are ascribed to the groel family of heat-shock chaperonines, involved in proteins processing and folding. aromatic substrates may thus act as stress-agents like heat or cold. proteomic studies on rat body fluids i. miller , r. wait , l. sironi , i. eberini , m. gemeiner , e. tremoli , and e. gianazza veterinärmedizinische universität, wien, austria imperial college school of medicine, hammersmith, london, u.k. universita' degli studi, milano, italy previously, we have characterized rat serum proteins, both under "normal conditions" and during experimental inflammation, using two-dimensional electrophoretic separation, densitometric quantitation and identification by mass spectrometry and immunological procedures (http://linux.farma.unimi.it/ homeframed.html). we have now extended these studies to the protein composition of cerebrospinal fluid (csf) and urine, and have identified several proteins specific to these fluids, including major urinary protein, uromodulin, and prostaglandin d synthase. these baseline data provide a useful comparison to the biological fluids of stroke-prone spontaneously hypertensive rats, an inbred strain, which develops cerebrovascular abnormalities following high blood pressure. our studies have detected signs of an inflammatory condition several weeks prior to stroke. we have confirmed the sharp rise in proteinuria preceding stoke onset, and have identified the excreted proteins. following stroke we observe a massive increase in csf protein concentration as serum proteins, even those of large molecular size, cross an impaired blood-brain barrier. as a first step to discover useful disease markers from the urinary proteome, we have developed a unique and systematic approach for detection of low molecular weight urinary proteins by using high resolution two-dimensional ( d) electrophoresis and mass spectrometric methods. unlike previous studies on urinary proteins, and most importantly as observed in present study, our results show that a large number of low molecular weight protein spots can be visualized in the d electrophoresis pattern. it was observed that protein concentration and fractionation methods were critical for our ability to detect many proteins in the gel pattern. therefore, several approaches were carefully considered to concentrate and fractionate proteins in urine samples. initially, urine specimens from normal individuals were concentrated by using centrifugation and ultrafiltration methods. the concentrated samples of urine proteins were then fractionated by size exclusion and immunoaffinity chromatography. the size exclusion method was used to generate two fractions of proteins based on their native molecular weights. further, this method allowed us to enrich concentrations of less abundant proteins for each fraction. the immunoaffinity method was used to specifically remove well-known abundant urinary proteins (such as albumin) from the above mentioned two fractions. that the d pattern includes many native low molecular weight proteins was confirmed by analyzing both protein fractions from size exclusion chromatography. a detailed mass spectrometric analysis of the protein spots is carried out to identify the proteins observed in d pattern. since urine is an ultrafiltrate of plasma, many factors in urine are present in proportion to their rate of synthesis in the body. these factors include many low molecular weight proteins that remain undiscovered due to their low abundance. therefore, the present analysis of urinary proteins would serve as the most useful guide for the discovery of novel diagnostic markers in urinary proteins. i. pucci minafra , , s. fontana , p. cancemi , g. alaimo , and s. minafra , centre of experimental oncobiology, department of cell biology and development, and institute of histology and embryology university of palermo, italy breast cancer is one of the leading causes of death for cancer among women. there are different types of breast cancers, grouped as invasive and non-invasive types. among the invasive types "infiltrating ductal carcinoma" (idc) accounts for about % of all breast cancers. in order to study some biological properties related to this type of cancer, we have developed and well characterized an "in vitro" system, consisting of an idc-derived cell line, -bc (minafra et al., br. j. cancer, , - , ) and some of its cloned cell lines, selected for their high and low invasive activity in matrigel. using this model we are producing proteomic maps to compare with that of non-tumoral breast epithelial cells and with breast tissue fragments, existing in our collection or available at the expasy proteomics server. protein identification is currently done by means of gel matching, edman-microsequencing and immuno-detection. to rationalize data we grouped proteins into functional categories: a) cytoskeletal proteins, b) metabolic enzymes, c) chaperonins and other functionally related proteins, d) peptides and enzymes with regulatory functions. a fifth group consists of peptides with unknown identity. among these sets of proteins we found that glycolitic enzymes and some chaperonins are overexpressed in cancer cells. in addition, new isoforms of potential interest as biomarkers for breast cancer, were identified by means of microsequencing. a. santucci , l. trabalzini , d. soldateschi , e. ferro , a. paffetti , and p. martelli dipartimento di biologia molecolare, sezione di chimica biologica, universita' degli studi di siena, and diesse diagnostica senese srl, siena, italy human cytomegalovirus (hcmv) is an ubiquitous virus, belonging to the herpesviridae family, betaherpesvirinae subfamily, able to induce morbidity in immunocompromised patients and congenitally infected new-borns. hcmv has the largest genome among the herpes-viruses ( kbp): ad strain genome was completely sequenced, containing about open reading frames encoding polypeptides, most of which are not characterized. the viral genes are activated in a cascade fashion: ) alpha, immediate-early genes, coding for regulatory proteins necessary for the activation of ) beta, early genes, needed for dna replication, and, finally ) gamma, late genes, coding for structural proteins of the mature virions. this latter category includes the virus surface antigenic proteins responsible for the main immune response during hcmv infections. although the sequencing of hcmv genome has been completed, very little is known about the actual nature of the viral proteins. the most appropriate approach to characterize hcmv phenotype is to study its protein expression as it is carried out within the host cell. for this purpose, we analyzed by two-dimensional electrophoresis ( d-page) the protein phenotipic repertoire of human fibroblasts and compared it with that of the same cell type following infection with hcmv strain ad . the phenotypic d map of human fibroblasts dramatically changes following infection with hcmv. a relevant amount of newly appeared spots is attributable to hcmv proteins, mainly of the structural category, since we analyzed host cells at the - th day of infection, when the late, gamma genes are supposed to be the only to be activated. on the other hand, a marked decrease of protein synthesis can be easily evidentiated in the infected fibroblasts respecting to uninfected cells. a temptative mapping of the main structural viral proteins (those against which patients sera are directed) was carried out by immunoblotting, microsequencing and mass spectrometry. comparative proteomics of cultured cells: identification of genetic defects and molecular mechanism of apoptosis regulation v. seyrantepe , k. landry , s. taurin , s. n. orlov , and a. v. pshezhetsky sainte-justine hospital research centre, and research centre, chum, university of montreal, montreal, pq, canada we employed a comparative proteomics of cultured cells to study mechanism of genetic disorders and for identification of key proteins involved in cell proliferation, differentiation, and death. in particular, this technology proved to be very useful to understand molecular basis of severe inherited diseases resulting from deficiency of lysosomal membrane transporters, and a role of programmed cell death (pcd) of vascular smooth muscle cells (vsmc) in cardiovascular disorders. to increase sensitivity of the identification of cellular proteins we have either have isolated cellular organelles such as lysosomal membranes or performed the differential extraction of soluble, membrane and cytoskeletal proteins. by comparison of pro-teomic cell maps from normal controls and individuals affected with lysosomal transport disorders we have selected and identified several candidate disease-causing proteins, which have to be further studied by mutation analysis and functional expression. for the second group of disorders we identified proteins, which de-novo synthesis could result in survival of vsmc including a two members of hsp family, a molecular chaperone grp , and so-called mortalin (grp ) highly expressed in non proliferative tissues and associated with mortal cell phenotype. two-dimensional polyacrylamide gel electrophoresis ( d-page) is the established technology employed for the separation of proteins from a cell lysate, sub-cellular organelle or tissue sample prior to identification of the excised protein spots by mass spectrometry. in the order of several hundred to several thousand proteins, can be separated and visualised on a d gel by conventional staining or utilising fluorescent labelling techniques. the advantage of performing a two dimensional gel based separation is the ability to obtain quantitative information by comparing and contrasting two samples in a differential display experiment, for example, between a healthy and diseased state. the last stage however stipulates that the gels are reproducible which can be both difficult and time consuming to achieve. the relativity poor dynamic range that the gels exhibit also limits quantification. other restrictions include the under representation of certain classes of proteins, such as membrane proteins, large or small proteins and very acidic/basic proteins. for these reasons, amongst others, alternatives to d-page are being investigated. advances in both lc and mass spectrometry instrumentation have allowed the analysis of protein complexes, which have not been separated on a d gel. in this case protein identification is achieved via database searching of esi-ms/ms data. this provides qualitative information on the proteins that are present and has recently been coupled with isotope dilution experiments to provide relative quantiative information. these experiments normally involve separation of the complex digest mixture by microcapillary liquid chromatography connected to an instrument capable of data dependant switching between the ms and ms/ms modes. using this approach it has been demonstrated that hundreds of ms/ms spectra can be acquired in a fully automated fashion, resulting in the identification of significant numbers of proteins, including low copy number proteins, from a single lc-ms/ms experiment. if, however, a complex protein mixture is to be investigated then a fractionation step prior to separation of the peptides on the basis of their hydrophobicity would be advantageous. we have, therefore, adopted a d lc-ms/ms approach using a capillary lc system (caplc) operating at nanoliter per min flow rates coupled to a q-tof mass spectrometer. by replacing the standard sample loop within this system with a strong cation exchange (scx) cartridge followed by a c trap cartridge it is possible to pre-fractionate the peptides before separation on a c column. after loading the sample, discreet fractions are sequentially eluted from the cation exchange cartridge using a salt step gradient; the eluted peptides are then retained on the trapping c cartridge whilst they are desalted. finally the peptides are eluted from the c pre-column, at nl/min, onto a um id ϫ cm waters symmetry analytical column for separation and elution into the mass spectrometer. this analytical approach will be discussed with examples where this methodology has been used for the analysis of standard protein mixtures and also for the analysis of cell lysates and sub-cellular fractions. monoclonal igg are commonly observed in various b cell disorders, the most clinically relevant being multiple myeloma. in a series of serum samples, immunofixation identified igg , igg , igg , and igg in , , , and cases, respectively. their light-chains were k in cases and λ in cases. these monoclonal igg were further characterized by high resolution two-dimensional polyacrylamide gel electrophoresis ( -de) with various isoelectric focusing conditions as well as by -de ( -de of the proteins extracted from agarose after serum protein agarose electrophoresis). after -de or -de, the monoclonal γ-chains were not visualized in out cases, whatever the isoelectric focusing conditions that were tested. in cases, γ-chains were only detected using alkaline ph - gradients. monoclonal γ-chains and light chains were highly heterogeneous in terms of pi and mr. however, a good correlation (p Ͻ . ) was observed between the index of migration of the monoclonal igg in agarose gels and the pi of their γand of their light-chains (r ϭ . , multiple linear regression). because of the extreme diversity of the different γ-chains as well as of the k-and γ-chains, it appears that a classification of monoclonal igg based only on their electrophoretic properties is not possible. alzheimer's disease (ad) is one of disorders caused by protein conformational changes and recent studies have shown that several chaperone proteins are involved in this process. as information of chaperone expression in ad brain is limited, we aimed to study the expressional pattern of chaperones in several brain regions as this may be essential to understand how folding defects can lead to disease. we studied the concomitant expressional patterns of molecular chaperones in seven brain regions of adults with ad using two-dimensional polyacrylamide gel electrophoresis ( -de) and matrixassociated laser desorption ionization mass spectroscopy (maldi-ms). we unambiguously identified and quantified nine different chaperone proteins. six chaperone proteins, heat shock protein (hsp ), hsp ry, heat shock cognate (hsc) , alpha crystallin b chain, glucose regulated protein (grp) and grp showed aberrant expressional patterns depending on brain region. hsp . , grp and t-complex (tcp- ) epsilon subunit did not show any significant expressional change. these findings are compatible with neuropathological and biochemical abnormalities in ad brain and this report presents the first approach to quantify nine different chaperones simultaneously at the protein level in individual ad brain regions providing evidence for the relevance of aberrant chaperone expression to ad neuropathology. the mainstream approach to protein separation, visualisation and identification has been to use two-dimensional gel electrophoresis coupled to mass spectrometry for the identification of the separated proteins. however this approach is limited with the level of protein that may be loaded onto the d gel and the nature of the proteins that may be incorporated onto the first dimension (ipg strip). an alternative approach for the qualitative analysis of complex protein mixtures is the use of tryptic digestion followed by electrospray lc-ms/ms. this approach is dependent on a high degree of chromatographic separation prior to the mass spectrometer, such that ideally individual peptides are eluted into the source. if this is the case then the dynamic range of protein identification can be increased and low copy number proteins can be identified. often, however there is a large degree of redundant sequence information acquired, as in theory one peptide ms/ms spectrum is sufficient to identify a protein from a sequence databank. if a protein identification is obtained from a databank search of an ms/ms spectrum, it is potentially valuable to exclude the rest of the theoretical tryptic peptides to "mine" deeper into the protein complex being studied. we have introduced a new protein databank search engine capable of matching a tryptic peptide from the swissprot/ trembl databank to an ms/ms spectrum in one second. using this search engine we are able to generate dynamic tryptic peptide exclude and include lists, based upon the theoretical tryptic peptides from the identified protein, which can be passed to the acquisition software of our q-tof mass spectrometer in real time. thus, we are able to automatically steer the q-tof, during acquisition, to select and switch to the ms/ms mode only on those peaks that meet the modified selection criteria. experiments can be designed in which peaks that belong to a protein already identified during acquisition can be avoided. this exclusion list is based upon m/z, charge state and a user definable mass tolerance. the mass measurement of the data from the q-tof mass spectrometer is typically better than ppm and as a consequence of this a tight mass tolerance can be selected, thus making the exclude list extremely specific. alternatively, in the case of samples derived from d gel spots, the mass spectrometer may abandon the current sample, re-equilibrate the lc column and move on to the next sample. to illustrate this methodology we show examples, both on standard samples and complex protein mixtures where q-tof data acquisition has been directed based upon the results from a databank search. this data will be compared and contrasted to data acquired in the normal automated lc-ms/ms mode. the specific anti-cancer activity of green tea (؊)-epigallocatechin- -gallate (egcg) department of molecular biology, hebrew university-hadassah medical school, jerusalem, israel the effect of the green tea polyphenol-(Ϫ)epigallocatechin- -gallate (egcg) was tested in cultures of normal and transformed nih-patmras fibroblasts. in this system transformation can be induced at will by the addition of dexamethasone, which induces the expression of h-ras by activating the mammary tumor virus long terminal repeat (mmtv-ltr) promoter. this facilitates a reliable comparison of the susceptibility of normal and transformed cells to egcg. it has been shown that egcg inhibited the growth of transformed but not of the normal fibroblasts. in an attempt to elucidate the mode of the preferential inhibitory activity of egcg, its effect on growth promoting factors has been examined. the level of ornithine decarboxylase (odc, ec . . . ), which is a signal for cellular proliferation, was reduced by egcg in the transformed but not in the normal cells. egcg also showed strong inhibition of tyrosine kinase and mitogen-activated protein kinase (mapk) activities, without affecting the kinases in the normal cells. similarly, egcg also preferentially decreased the levels of the oncogenes ras and jun in transformed cell. egcg preferentially induced apoptosis in the transformed fibroblasts. in vitro chemosensitivity tests demonstrated that egcg inhibited the proliferation of leukemic cells. these findings suggest that egcg has a therapeutic potential in the combat against cancer. objectives: to develop a safo, affordable immune supportive therapy for hivϩ patients. design: a randomised, double blind, placebo-controlled study, testing an internationally patented l-methionine combination (lmc), in approximately hivϩ patients; not yet on anti-viral treatment (cd count to ). methods: parameters measured included: cd count, total lymphocyte court, viral load, several clinical, as well as mechanistic parameters. the difference in the change from the baseline (active -placebo) was determined for each parameter. the study is ongoing. results: within months, significant trends are noted. the cd count of the patients on the active therapy, presented with a slower rate of decrease, compared to the placebo group, mean difference (md) in this change from baseline; . /cmm and % confidence interval (c ), this was confirmed by the total lymphocyte court values. after months the placebo group was placed on active, causing the difference to disappear. conclusions: although further trials are needed, these results already indicate t-methionine as an important role player in the immune system of patients with impaired immune function. c. chiarla , i. giovannini , j. h. siegel , g. boldrini , and m. castagneto centro di studio per la fisiopatologia dello shock cnr, catholic university, rome, italy department of surgery, umdnj, newark, new jersey, u.s.a. in critical illness and sepsis, changes in amino acid plasma levels (aapl) have been assessed extensively, while little is known about the relationship with changes in other plasma components, such as those involved in fluid-electrolyte and osmotic balance; their investigation is also limited, in large clinical samples, by inter-patient variability. we analyzed the relationships between plasma sodium (na ϩ pl, meq/l) and aapl (µm/ l) in eighty consecutive measurements performed in one single patient with post-traumatic sepsis and severe, prolonged illness. unique feature of plasma taurine (tau) was maintenance of a highly significant inverse correlation with na ϩ pl (r ϭ . , p Ͻ . ). all other aapl were correlated directly, or unrelated, to na ϩ pl, the only exception being a weak inverse correlation between tryptophan and na ϩ pl. tau was correlated, strongly and directly, also to phosphoethanolamine (pea), glutamate (glu) and aspartate (asp): tau ϭ . ϩ . (pea) Ϫ . (na ϩ pl) r ϭ . , p Ͻ . tau ϭ . ϩ . (asp) Ϫ . (na ϩ pl) r ϭ . , p Ͻ . tau ϭ . ϩ . (logglu) Ϫ . (na ϩ pl) r ϭ . , p Ͻ . and unrelated, or weakly and inversely related, to other aapl (measurements of beta-alanine were not included). co-variation of na ϩ pl and these aapl (particularly tau and pea) was influenced by severity of illness, and more complex regressions were needed to quantify this effect. these results provide useful information on interdependency of tau, na ϩ pl and other aapl in critical illness. the central nervous system (cns) shows an exceptionally high degree of vulnerability to reactive oxygen species. considerable evidence suggests that free radical formation and oxidative stress might play an important role in the pathogenesis of parkinson's disease (pd). moreover, it has been reported that the levels of glutathione and vitamin e increase in the brain of patients with pd as a compensatory mechanism to deal with oxidative stress. since vitamin e is an effective free radical scavenger in the brain, its neuroprotective function is the issue of new therapeutic approaches in neurodegenerative diseases. to elucidate the possible role of vitamin e in the pathogenesis of pd, we assessed the plasma levels of vitamin e, measured by high-performance liquid chromatography, in patients with pd. vitamin e concentrations were also assessed in age and sex matched normal individuals. the mean plasma levels of vitamin e did not differ significantly between these two groups ( . Ϯ . mmoli/l for pd patients and . Ϯ . mmoli/l for controls). the results of our study suggest that plasma vitamin e concentrations do not play a major role in the pathogenesis of pd. vitamin e and cardiovascular disease: nutritional and intervention approaches f. galli , institute of biological chemistry, university of urbino, italy department of cardiovascular research, st thomas' hospital, london, u.k. vitamin e is represented by a family of eight natural vitamers ( tocopherols and tocotrienols) of which αtocopherol (α-t) form has the highest biological activity. this vitamin accounts for most of the lipid-soluble, chain-breaking antioxidant activity in mammalian tissues and plasma. in addition, it shows nonantioxidant properties through which it modulates cell signaling and the expression of specific enzyme in cell models playing a role in atherogenesis (e.g. endothelial and inflammatory cells). the preventive effect of vitamin e on acdv is still a matter of debate. the largest epidemiological investigations and out of main intervention studies at yet available have suggested a correlation between levels of vitamin e and incidence of atherosclerotic cardiovascular disease (acvd) and related mortality. an overall conclusion rising from these studies is that the major effect (if any) of vitamin e is to be found with intakes higher than iu ( mg all-rac α-tocopheryl acetate) per day. however, other investigations have failed to demonstrate a beneficial effect of vitamin e against acvd, suggesting the need for more studies on its metabolism and function. recently a family of tocopherol binding and transport proteins has been identified. they play a key role in the selective uptake and delivery of tocopherols to lipoproteins and tissues. genetic abnormalities of these proteins have been demonstrated to be responsible for conditions of vitamin e deficiency in humans. their tissue distribution and regulation are now under investigation. the information available on vitamin e metabolism and its response to supplements or diet changes are at yet poorly characterized. the synthesis of stable isotopes and the characterization of major metabolites of main vitamers provide important advances in this research. in the last years, both plasma levels and urinary excretion of relevant metabolites of α-t have been characterized. little information is available on metabolites formed by other vitamers. the emerging role of γ-t and its main catabolite , , -trimethyl- -(b-carboxyethyl)- hydroxychroman (γ-cehc) in the defense against nitrogen oxide species formed during the activation of inflammatory cells is now well established and suggests the need for further studies on the bioavailability and transformation of this homologue of vitamin e in humans. at the same time, an oxidation byproduct of α-t found in human plasma, namely α-tocopherylquinone, has been proposed to be also de novo synthesized from phenylalanine with a role in the genesis of a defective polyunsaturated fatty acid metabolism observed in phenylketonuric patients. this suggests a possible, and at yet unexplored relationship between vitamin e and phenylalanine/fatty acid metabolism which might have also a role in atherosclerotic process. r. gaspari , s. mensi , g. mercurio , c. callà , l. colacicco , e. sacco , and s. lippa department of anaesthesiology and intensive care medicine, and department of biochemistry and clinical biochemistry, catholic university of rome, italy four patients ( females, male; aged from to years) affected by severe liver failure, were treated by a new blood purification method, namely molecular adsorbent recycling system (mars). mars removes albumin-bound toxins using a specific membrane with a dialysate solution containing albumin. in the patients the plasma levels of methionine (meth), branched chain and aromatic amino-acids and liposoluble antioxidants were measured. the fischer's index did not show any significant variation, whereas the plasma levels of meth were well correlated with the levels of liposoluble antioxidants (vitamin e and coq ). in fact, in the patients receiving just branched chain amino-acids, the plasma levels of both meth and antioxidants progressively decreased. on the contrary, if meth and branched chain amino-acids were administered, the plasma levels of coq and vitamin e showed a positive correlation with the plasma meth levels (p Ͻ . ; r ϭ . and p Ͻ . ; r ϭ . , respectively). since vitamin e and coq are mutually dependent-molecules, the administration of meth, essential substance for coq synthesis, may be effective to maintain a good antioxidant status in patients with severe liver failure undergoing mars treatment. we obtained new synthetic peptide preparation epitalon to be widely applied as a pharmaceutical due to its properties important in medical care. epitalon was found to stimulate repair processes in retinal diseases via restoring the retinal functions, in particular its photoreceptors. this promising peptide drug is a linear tetrapeptide of formula h-ala-glu-asp-gly-oh (alanyl-glutamyl-aspartyl-glycine). the substance was obtained by classic peptide synthesis in a solution (scheme: ( ϩ ) ϩ ) with n-oxysuccinimide activated esters. coohgroups of lateral radicals of glutamic and aspartic acids were defended as benzyl esters, benzyloxycarbonyl (ala) and tert.butyloxycarbonyl (glu) n-defending groups were employed, deblockade conducted by trifluoroacetic acid and catalytic hydrogenolysis. preparative hplc on a reverse phase was applied for purification. the product was fully characterised by the data of analytical hplc (substance content - %), amino acid analysis, ir-and hmr-spectra. the ready drug form is ampoules containing µg of the substance in ml of isotonic solution. epitalon application in patients with pigmented retinal degeneration stopped the pathology development in % and increased visual functions in % of the cases. in % of the patients visual acuity raised by . - . . electroretinography confirmed the retinal functional activity increase. an increasing number of proteins are implicated in apoptosis and several of them have been shown to be altered in alzheimer's disease (ad) brain. because of this apoptosis is thought to be the underlying mechanism of neuronal cell loss in ad. to further substantiate this hypothesis we investigated the expression of a recently identified apoptosis related proteins and other apoptosis regulators in frontal cortex and cerebellum of ad by western blot and elisa techniques. quantitative analysis revealed unaltered levels of bax and raidd (receptor interacting protein associated ich- (caspase- )/ ced- (caenorhabditis elegans death protease- )-homologous protein with death domain) in both regions. zip (zipper interacting protein) kinase, bim/bod (bcl- interacting mediator of cell death/bcl- related ovarian death gene) and p were significantly increased only in ad frontal cortex (p Ͻ . , in all cases). cerebellar bcl- levels were significantly increased in ad (p Ͻ . ) while in ad frontal cortex, although the levels tended to increase did not reach significance level. the results indicate that apoptosis indeed account for the neuronal loss in ad. however, it does not seem to involve bax and raidd. a. magyar , m. brózik , r. tóbi , t. szabó , j. szakonyi , b. rojkovich , p. gergely , and f. hudecz research group of peptide chemistry hungarian academy of science, budapest, central laboratory of immunology, semmelweis university, budapest, and national institute of rheumatology, budapest, hungary rheumatoid arthritis (ra) is a systemic autoimmune disease of unknown etiology. it is the most common of the inflammatory joint diseases, affecting - % of the world population. anti-filaggrin antibodies (afa) directed against the epidermal protein, filaggrin, belongs to the most specific markers of ra. epitopes, containing citrulline within the sequence of filaggrin, have been recently identified as major antigenic sites recognised by afa. the aim of our study was to identify these epitopes of filaggrin derived-peptides targeted by ra specific antibodies to provide further information about the nature of the initial autoantigenic substance. the most immunogenic six sequences of filaggrin and further, on the n-and c-terminal, shortened version of the original peptide ( shqestrgrsrgrsgrsgs ) were synthesized. we used conventional solid-phase peptide synthesis (fmoc strategy) carried out on "multipin ncp" noncleavable kit. in elisa experiments the presence of afa was deter-mined using serum samples of ra patients and healthy blood donors. in conclusion our results provide further evidence that not simply the presence of citrulline but also the nature of its surrounding amino acids have important role in the creation of autoantigenic epitope reactive with anti-filaggrin antibodies. the autoimmune nature of multiple sclerosis (ms) has introduced cytokine genes as logical candidates for the loci determining susceptibility to the disease and/or influencing disease progression. interleukin (il)- alpha and beta are major proinflammatory cytokines that have been related with several chronic inflammatory diseases such as ms. the il -receptor antagonist (il- ra) is a protein structurally related to il- beta that effectively inhibits the proinflammatory effects of il- . a polymorphism in the Ј-flanking regulatory region at Ϫ of the il- alpha gene, which may cause an overexpression of il- alpha and a variable number tandem repeats (vntr) polymorphism in the il- ra gene have been also associated with several inflammatory diseases. two biallelic base change polymorphisms in the il- beta gene have been reported to influence the protein production: one is located in the promoter region at position Ϫ and the other is in exon at position ϩ . to analyze the contribution of il- alpha, il- beta and il- ra genes in the genetics predisposition to ms, we have examined four polymorphic genetic markers in italian patients with clinically definite ms and healthy controls. in summary, no significant differences in genotypes and allele frequencies were found between ms patients and healthy controls. fibronectin -the extracellular matrix protein is oxidatively modified with oxygen reactive species (ros) in inflammation site. activated neutrophiles release the hypochlorite acid (hocl) and chloramines as products of myeloperoxidase/ h o /cl Ϫ system. these reactive chlorine species chlorinate in turn matrix proteins. the resulting changes of tertiary protein structure could be evaluated by monitoring the antigen/antibody complex formation. the formation of the complexes between native/chlorinated fibronectin and igg class antibodies were examined by means of elisa with luminol chemiluminescence detection. the degree of fibronectin modification was monitored with spectroscopic methods. since the oxidation leads to the fibronectin aggregation -the tryptophane contents in resulting aggregates were evaluated with stern-volmer approach (acrylamide quenching). moreover, the aldehydes influence on the ag/ab complex formation was examined -since aldehydes are known products of amino acids n-chloramines deamination. also the native and modified fibronectin adherence to the matrix proteins was monitored with use of hrp labeled antifibronectin antibodies. the preliminary results suggest that chlorination impairs the ab/ag complex recognition but also prove that igg bounded chlorinated fibronectin promotes igg clusters formation. it was found also that mm concentration of the serine derived glycoaldehyde decreases the fibronectin/igg recognition and the effect could be attributed to the igg aggregates formation. we demonstrate also that hrp-labeled iggs detect the collagen and fibrynogen adherent fibronectin in a dose dependent manner-details of the elisa method are discussed. in subjects with rheumatoid arthritis (ra) oxidized low density lipoproteins (ldl) are supposed to serve as mediators for joint damage, further exacerbating the inflammatory process. to better understand mechanisms of ldl oxidation in ra a specific marker of oxidative modification of apolipoprotein (apo) b- proline and arginine residues, hydroxy- -aminovaleric acid (hava), had been measured in plasma and synovial fluid ldl subfractions (ldl , svedberg units (s f ) - and ldl , s f - ) by gc-ms. paired knee synovial fluid and plasma samples were collected from subjects with ra. additionally, plasma samples were collected from healthy controls. the ldl hava content in plasma was not different between the groups (ra, . Ϯ . vs controls, . Ϯ . mol/mol apob- , p ϭ . ). the ldl hava content in plasma was significantly higher in ra ( . Ϯ . vs . Ϯ . mol/mol apob- , p ϭ . ). furthermore, synovial fluid ldl and ldl in ra contained elevated hava levels when compared with plasma concentrations (ldl syn , . Ϯ . mol/mol apob- (p Ͻ . ) and ldl syn , . Ϯ . mol/mol apob- (p Ͻ . )). results suggest that proline and arginine residues of apob- are highly reactive toward oxygen radicals in both plasma and synovial fluid in ra. furthermore, susceptibility of apob- to oxidative modification increases along the lipoprotein metabolic cascade. particularly small dense ldl were prone to direct oxidation of apob- . correlation between hava content in plasma and synovial fluid ldl and ldl in ra may allow the use of hava as a clinical marker of antioxidant barrier impairment in ra. vascular collagen accumulation contributes to development of hypoxic pulmonary hypertension (ph). we have shown that injections of a polymer of the proline analogue cis- -hydroxy-l-proline (chyp) in liposomes attenuated acute ph in rats (ajrccm ; : ) . we now treated rats with established ph with a new polymer containing an increased "payload" of chyp. chyp was conjugated to a low mw poly(ethylene glycol)-lysine carrier {poly (peg )-lys-chyp} to increase the % by wt of the analogue. rats were exposed to % o for da to induce ph. on da , and after da of hypoxia, animals were injected iv with chyp polymer in liposomes (hc) or bioinactive trans-hyp polymer in liposomes (ht). air controls received thyp polymer in liposomes (at). at and da, we measured mean right ventricular pressure (rvp) and hydroxyproline (hyp) content in main pulmonary arteries. on da , rvp (mmhg) was Ϯ and hyp (µg/vessel) was Ϯ in at. rvp and hyp increased to Ϯ * and Ϯ *, respectively, in hypoxic animals (n ϭ ; *p Ͻ . vs. at). on da , rvps were at Ϯ , ht Ϯ *, hc Ϯ * †; hyps were at Ϯ , ht Ϯ *, hc Ϯ * † (n ϭ ; *p Ͻ . vs. at; †p Ͻ . vs. ht). from da to , rvp did not increase and hyp decreased in the hc group vs. ht. we conclude that weekly injections of polymeric chyp prevented progression of established hypoxic ph and reversed hyp accumulation. targeted delivery of antifibrotic polymers may prevent and reverse the progression of ph. (support: phs, barbara cornwall foundation). glucosinolates are amino acid-derived natural plant products found throughout the capparales order, which includes agriculturally important crops such as oilseed rape, brassica vegetables and the model plant arabidopsis. glucosinolates and their degradation products have a wide range of biological activities, e.g. in plant defense as deterrents against insect and fungi and as attractants to insects that are specialized feeders in brassicaceae. the conversion of amino acids to oximes is a key step in glucosinolate biosynthesis. we have recently shown that cytochromes p belonging to the cyp family catalyze the conversion of aliphatic, aromatic as well as indole amino acids to the corresponding oximes. cyp e catalyzes the oxime-metabolizing step in the biosynthesis of the cyanogenic glucoside dhurrin. we have recently shown that the oxime-metabolizing enzyme in the glucosinolate biosynthetic pathway is a cytochrome p homologous to cyp e . the post-oxime enzymes in the glucosinolate pathway have high substrate-specificity for the functional groups, and low substrate-specificity for the side chain. therefore, we have been able to metabolically engineer new glucosinolate profiles into arabidopsis by altering the level of endogenous cyp s and by introducing new cyp s. the approach has great potential for design of "biotech crops" with improved pest resistance and increased nutritional value. hypercalcemia as a potential threat in the dietary treatment of maternal phenylketonuria f. eyskens and s. beernaert pediatrician, metabolic diseases and dietitian, azm-koningin paola childrens hospital, metabolic lab pcma, antwerp, belgium over % of infants born to mothers with blood phenylalanine (phe) concentrations above µmol/l exhibit evi-dence of foetal dammage, low birth weight, microcephaly, dysmorphic facies, slow postnatal growth and development and long-term intellectual impairment. keeping maternal phe concentrations below µmol/l before conception and throughout pregnancy reduces significantly the risk of abnormalities in the offspring of women with phenylketonuria (pku). we describe a woman, years old, who showed phe blood levels of - µmol/l under a strict diet (total protein content of . g/kg body weight/day with . g/kg natural proteins and . g/kg proteins provided by the aminoacid mixture pku (milupa, germany); , cal/day) at the beginning of her first pregnancy. the first weeks she developed vomiting which gradually increased in severity. at weeks of pregnancy, she had diarrhea, severe bouts of vomiting and manifested a deficient nutritional status with intake of . g/kg bw proteins and , cal/day. she was hospitalized to start refeeding using continue drip feeding administered by nasogastric tube. after days on this regimen she developed vomiting, heart palpitations and mental confusion. her serum calcium level, that was normal at admission in the hospital, showed an elevation to . - meq/l (ref. value . - . meq/l). the feeding was stopped immediately and under an intravenous infusion and gradually introducing a feeding composed of pku , carbohydrates and mct fats the serum calcium and the blood phe levels dropped to normal values. and volatile components of caramel obtained by heating commercial maltose solution for different time intervals. one sample containing maltose only was used as control, the caramelization was conducted at c° for total time period minutes and subjected to sensory analysis and isolation of volatile components. the odour and colour sensory tests were evaluated according to the international standard methods (iso). the results showed that addition of lysine as a catalyst gave rise to a significant (p Ͻ . ) increase in intensity of the whole flavour in comparison with the control sample. the sweet and caramel notes, the most characteristic attributes of caramel, showed remarkable increase. on the other hand the increase in heating time in presence of lysine as a catalyst resulted in high significant increase in browning rate of caramel solution. the volatile components of each sample were isolated by using the new technique, solid phase microextraction (spme) and subjected to gc and gc-ms analysis. over volatile components were separated, however only the most important component for caramel flavour were reported. maltol and hydroxymethyl- -furfural (hmf) and . h-pyran- -one, , dihydro- , -dihydroxy- -methyl (dihydro dihydroxy maltol), the main characteristic caramelization products were present in high concentration in samples contaning lysine heated for minutes. in addition one pyrazine was only identified in the samples contaning lysine. a comparative study between the present results and those of our previous study concerning addition of alanine as a catalyst was carried out. short-term exposure of human umbilical vein endothelial cells (huvecs) to hyperglycemia increases l-arginine transport (system y ϩ /cats) and nitric oxide (no) production (via enos). it has been reported that enos could also be activated by a ca ϩ -independent mechanism involving phosphorylation of ser by a phosphatidylinositol -kinase (pi -kinase) dependent pathway. we investigated the involvement of pi kinase on the stimulatory effect of acute hyperglycemia on enos and l-arginine transport in huvecs. l-arginine transport, no synthesis and phosphorylation of ser in enos were increased by d-glucose ( mm, min). similar results were obtained in huvecs exposed to insulin. incubation of cells with wortmannin (pi -kinase inhibitor) prevented the effects of d-glucose and insulin. no changes in the intracellular ca ϩ and enos protein levels were detected. thus, acute hyperglycemia increases l-arginine transport and enos activity through a pi -kinase dependent, ca ϩ independent mechanism in huvecs. [ the hypercalcemia in this patient was due to a very high content in calcium of the feeding administered ( - times the adh value) associated with a high vitamine d concentration (see table) and a clinical state of dehydratation. the further pregnancy was uncomplicated and a healthy girl was born who developed normal. • the aminoacid mixtures used in the treatment of pku contain a high level of calcium, phosphate, magnesium and iron. they also contain a high concentration of vitamine d. • nutritional monitoring of pregnant pku patients should include the calcium, phosphate, iron, zinc and vitamins status. • vitamins a and d suppletion is contraindicated in these patients based on the high concentrations of these vitamins in the aminoacid mixtures used in the dietary treatment. flavour and aroma chemistry department, national research centre, dokki, cairo, egypt caramelization of various carbohydrates leads to product with a high tinctorial strength provided by different additives catalyzing the process. the present study was conducted to evaluate the catalytic effect of lysine on the sensory attributes administered pku ( g ϭ adh , g/kg/bw) lipoic acid is a prosthetic group of h-protein of the glycine cleavage system and e components of the pyruvate, oxoglutarate and branched-chain -oxoacid dehydrogenase complexes. in mammals, attachment of lipoic acid to these proteins requires two enzymes. lipoate-activating enzyme (lae) catalyzes the activation of lipoate to lipoyl-nucleoside monophosphate. then, lipoyltransferase transfers the lipoyl moiety to the specific lysine residue of the proteins. we purified lae from bovine liver mitochondria. lae activated lipoate with gtp at a -fold higher rate than with atp. the reaction absolutely required lipoate and mggtp, and the reaction product was lipoyl-gmp. lae activated both r-and senantiomers of lipoate to the respective lipoyl-gmp although preference for r-lipoate was observed. lipoyltransferase equally transferred both r-and s-lipoyl moiety from respective activated lipoate to apoh-protein. however, only h-protein carrying r-lipoate was active in the glycine cleavage reaction. cdna clones encoding a precursor lae with a mitochondrial presequence were isolated. amino acid sequence of lae was identical with that of xenobiotic-metabolizing/medium-chain fatty acid : coa ligase-iii, but an amino acid substitution due to snp was found. these results indicate that the medium-chain acyl-coa synthetase in mitochondria plays a novel function with gtp, the activation of lipoate. instituto di chimica biologica "g. fornaini", università di urbino, italy nitric oxide (no) can modulate red blood cells (rbc) glycolysis by translocation of the enzyme glyceraldehyde- phosphate dehydrogenase (gapd) [e.c. . . . ] from the cytosolic domain of the membrane protein band (cdb ) in the cytosol. in this study we have investigated which no-reactive thiols might be involved influencing gapd translocation, and which is the role of glutathione (gsh) in this context. two highly reactive cys residues (k ϭ . m Ϫ s Ϫ and . m Ϫ s Ϫ , respectively) were identified by transnitrosylation with nitrosoglutathione (gsno) of cdb and gapd. the cys in the catalytic site of gapd is exclusively involved in this gsno-induced nitrosylation. reassociation experiments carried out at equilibrium with preparations of rbc membranes and gapd revealed that different no-donors may form Ϫsno on, and decrease the affinity between, gapd and cdb . in intact rbc, both the no-donors -morpholino-sydnonimine (sin- ) and peroxynitrite (onoo Ϫ ) significantly increased gapd activity in the cytosol and glycolysis measured as lactate production and energy charge levels. however, we obtained data suggesting that onoo Ϫ is the main no-derivative able to cross the rbc membrane leading to gapd translocation and Ϫsno formation. both in cell-free experiments and intact rbc, diamide (a thiol oxidant able to inhibit gapd activity) was observed to reverse the effect of sin- on gapd translocation. the results demonstrate that cdb and gapd contain reactive thiols that can be transnitrosylation mainly by means of gsno, these can ultimately influence gapd translocation/ activity and the glycolytic flux. abteilung für allgemein-viszeral-und gefässchirurgie, kliniken dr. erler gmbh, nürnberg, germany new surgical procedures like minimal-invasive-surgery brought many advantages for the surgical patient: less pain and shorter hospitalization. regarding nutrition, patients gets normal food on the ward still on the operation-day and need only saline-infusions overnight for fluid and electrolyte substitution but no hypocaloric parenteral nutrition. hypocaloric parenteral nutrition had been developped as a peripheral intravenous nutritional concept for patients with a normal body mass index over a period not longer than - days. multiple clinical studies showed that bowel movements increase earlier after an early postoperative enteral feeding which allows an earlier discharge of the patient. the result is a remarkable decrease of costs and an increase in patient benefit. still some years before surgeons preferred in visceral surgery parenteral nutrition over a period of - days under the opinion not to stress an anastomosis. this opinion changed in the last years under the aspect that about , - , ml of bile fluid, , - , ml pancreatic juice and , - , ml gastric juice per day are passing a small intestine anastomosis without any complications. concerning colon-anastomoses, the colon is preoperatively washed out, so it lasts until days until defecation. multiple studies also showed a benefit for the patient regarding immunostimulation by early postoperative enteral feeding. conclusion: in our hospital with surgical patients we recommend postoperatively either early normal enteral feeding or a high caloric parenteral nutrition if parenteral nutrition is needed for longer than days. if artificial nutrition is necessary for more than days we recommend enteral nutrition given by a tube or peg (percutaneous endoscopic gastrotomy). department of food technology, national research centre, dokki, cairo, egypt in the near east, "frekeh" has been known for many centuries as a stable food made from wheat. it is generally claimed that "frekeh" is better than wheat regarding its storage stability. the protein quality of parched immature durum wheat (frekeh) produced from variety was evaluated. frekeh from four maturing levels during the dough stage of the seed development, were analyzed for approximate analysis. results showed that "frekeh" produced at the beginning of the dough stage was of better nutritional value than that produced at the following maturity levels, since the former was higher in protein, fat, minerals and crude fiber as well as in reducing sugar content. in addition, it was shown that these results confirm well with the sensory quality evaluation of the cooked product. further more, it was found that the cooking time was suitable to produce a "freqkeh" meal with high levels of acceptability. the observed decrease in protein content with increasing maturity level raised the question of how the protein quality of "frekeh" versus that of nature wheat grains varied. in this investigation, the amino acid of "frekeh" was determined. dietary treatment and carnitine supplementation has greatly improved long-term outcome of patients with ppa and (vitamin b unresponsive) mma. however, metabolic decompensation may be frequent and final outcome in most patients show various handicaps. to investigate the usefulness of measuring free carnitine and acylcarnitines in dried blood by tandem mass spectrometry, we investigated patients with ppa and with mma in a period of months by weekly capillary blood punctures performed by the parents. age of the patients were from . until years. clinical status at the time of blood drawing was evaluated by regular phone calls. free carnitine in all patients substituted by oral carnitine treatment ( - mg/kg/day bw) was normal. the parameter best reflecting clinical status was the c /c -acylcarnitine quotient. mean value in mma and ppa patients showed a range of . - . (normal . ϩ/Ϫ . , n ϭ ), there was no difference between ppa and mma patients. individual mean values of the patients significantly increased when the patient was ranked higher in the clinical score system or during decompensation. since measurement of acylcarnitines in dried blood by tandem mass spectrometry is easy to perform, this method may be used for home monitoring of patients with mma and ppa. influence of acute treatment with , , , tetrahydroisoquinoline on the levels of glutathione and reactive oxygen species, and on the enzymatic activity of γ-glutamyl transpeptidase in dopaminergic structures of rat brain e. lorenc-koci , m. sokoĺowska , m. zapaĺa , and l. wĺodek institute of pharmacology, polish academy of sciences, kraków, and institute of medical biochemistry, collegium medicum, jagiellonian university, kraków, poland , , , -tetrahydroisoquinoline (tiq) and its derivatives generated considerable interest as molecular species that may be implicated in the pathogenesis of parkinson's disease (pd). in pd, apart from the lack of dopamine in the striatum, a decreased concentration of glutathione (gsh) is found in the substantia nigra (sn). it is also known that gsh depletion potentiates the toxicity of mptp and -hydroxydopamine. however, there are no data available on the tiq influence on gsh metabolism. the aim of the present study was to exemine the effect of acute tiq administration on the levels of gsh and reactive oxygen species (ros), and on the enzymatic activity of γ-glutamyl transpeptidase (γ-gt) in dopaminergic structures of rat brain. the investigation was carried out h after a single dose of tiq ( mg/kg i.p.). at that time, a marked increase in the tissue gsh level and simultaneous significant inhibition of γ-gt were found in the structures studied. in tiq-treated rats, the production of ros was reduced in the sn, but it was markedly enhanced in the striatum. our results suggest that the increase in gsh level in dopaminergic structures stems from inhibition of γ-gt and refers to the extracellular pool of this peptide. apparently, the tiq-mediated alterations in the levels of gsh and ros may have some implications for the etiology of pd. tetrahydrobiopterin-responsive phenylalanine-hydroxylase deficiency with mutations distant from the tetrahydrobiopterin binding site z. lukacs , r. steinfeld , a. kohlschütter , j. zschocke , and k. ullrich department of pediatrics, university of hamburg, and university-children's hospital, heidelberg, germany phenylalanine hydroxylase (e.c. . . . ) catalyzes the hydroxylation of phenylalanine to tyrosine in the presence of oxygen and the cofactor ( r)-l-erythro- , , , tetrahydrobiopterin (bh ). mutations in the phenylalanine hydroxylase gene may cause phenylketonuria or hyperphenylalaninemia. alternatively, disorders in bh metabolism also result in an increase in phenylalanine concentrations but simultaneously affect other bh -dependent enzymes, consequently, causing a severe neurological disorder. recently, several patients with a phenylalanine-hydroxylase deficiency but with normal bh -metabolism were reported who showed a significant decrease in blood phenylalanine concentrations upon treatment with bh . indeed, two such patients in our hospital were also sensitive to daily oral doses of - mg bh /kg. the subsequent molecular genetic analysis revealed that patient was homozygous for the widespread mutation y c and patient was compound heterozygous for the mutations a d and k n. it is striking, that all mutations are located distant from the known bh -binding site and thus, should not be associated with bh -sensitivity. additionally, further patients who share the same genotype are not sensitive to bh . therefore, it must be concluded that factors independent of the phenylalanine hydroxylase gene, like e.g. individual chaperone proteins, influence the three-dimensional structure of the enzyme and thereby, enhance enzymatic activity in the presence of elevated concentrations of bh . retrotransposons are structurally similar to retroviral gag and pol which are required for their replication via reverse transcription, and seem to be an ancestral form of specialized retroviruses. reverse transcription of retrotransposons was assumed to occur in virus-like particles as well as in retroviruses. rna-packaging in this particles suggests a possibility of infection. presumably, the formation of functional virus-like particles requires the interaction of gypsy rna with a protein encoded by gypsy first open reading frame (orf ) or a product of its processing. the objective of this work was to study whether the protein by this frame can bind with nucleic acids similarly to retroviral gag-protein and how phosphorilation of that protein may influence to this interaction. then gypsy orf was cloned and expressed in escherichia coli, and its protein product was purified by ion-exchange chromatography on deae-cellulose and affinity chromatography on heparin-sepharose and tested electrophoretically. it was shown that recombinant protein bound with its own mrna and with dna. the affinity for ssdna bing higher than for dsdna. the binding constant was estimated with rna. the method utilizes the ability of nitrocellulose to bind proteins but not nucleic acids. binding of % gypsy rna was achieved with about ng of the protein in ml of the reaction mixture. the binding constant was × m- , which is consistent. the structure of the putative nucleic acid-binding domain suggests that the protein is more similar to the core proteins of spumaviruses of the family retroviridae that to those of other retroviruses. phosphorilation of gag-like protein encoded by first open reading frame of retrotrasposon gypsy (mdg ) affects to interaction with nucleic acid. tryptophan (trp) in humans is catabolized by several pathways leading to various metabolites of kynurenine and indolic compounds formation. a number of diseases are connected with abnormalities in its excretion, but relationship of cause and effect is usually unclear. we introduced a two-step procedure for the detection of defects in metabolism of trp: ) tlc is employed when starting the investigation, ) two hplc methods were proposed and used at the next step, when pathological findings are to be proved and the individual metabolites quantified. the first hplc procedure enables the assessment of tryptophan, indolylacry-loylglycine (iag) and other five indolic compounds. the second method is intended to the monitoring of kynurenine and seven of its catabolites. the same sep-pack pre-treated sample of plasma and urine is used for all methods. the reference values and the excretion pattern in some groups of patients ( in total) were assessed. hepathopathy, gastrointestinal defects, myopathy and seizures with other neurological symptoms were the conditions connected with changes in the excretion of some metabolites of trp. significant decrease of iag excretion was found in burn patients early after the injury. urine analyses were performed at patient with hartnup disease and benign xanthurenic aciduria, inherited metabolic defects of trp. in other experiments, trp effect on the decarboxylation of other aromatic amino acids in the liver was investigated; only weak inhibition under physiological conditions was recognised. ( two hypothetical proteins of escherichia coli, ybbq and yhae, show high sequence similarity to d-threonine dehydrogenase from pseudomonas cruciviae ifo . we cloned each gene encoding ybbq and yhae into e. coli jm . both ybbq and yhae showed no d-threonine dehydrogenase activity and showed significant activities for d-serine in the presence of nad. ybbq and yhae were purified to homogeneity from the e. coli clones. ybbq consisted of two identical subunits with a molecular mass of kda, whereas yhae was a tetramer (native molecular mass, kda). ybbq showed the maximum activity at ph . for the oxidation of d-serine. whereas optimum ph of yhae was ph . . they catalyzed oxidation of glycerate and -hydroxyisobutyrate. d-glycerate was the best substrate for both enzymes. both enzymes also catalyzed reduction of tartronate semialdehyde in the presence of nadh. at physiological ph, the rate of tartronate semialdehyde reduction was much higher than that of d-glycerate oxidation. the ybbq gene is in the operon of glyoxylate utilization and the yhae gene is in the operon for d-glucarate/galactarate utilization. these results suggest that both ybbq and yhae are dglycerate -dehydrogenases and function physiologically in conversion of tartronate semialdehyde into d-glycerate. a serine protease inhibitor model: synthesis and biology z. mucsi , Á. bódi , l. gráf , a. perczel , a. patthy , and g. orosz department of organic chemistry, biochemistry, eötvös university, budapest, agricultural biotechnology centre, gödölló´, and research group of peptide chemistry, hungarian academy of sciences, budapest, hungary sgci is structurally related to the pmpd- family of canonical serine protease inhibitors. in these peptides, there is a p -p Ј position which is responsible for reversible binding to chymotrypsin. their structure is characterized by structural compactness: the molecule contains three -sheets and three disulfide bonds. in the sgci molecule the p -p Ј corresponds to lys-leu bond, which is cleaved by chymotrypsin extremely slowly. the question arises why an excellent substrate behaves at the same time as inhibitor. it was assumed that the threedimensional structure of the molecule is responsible for the inhibitory activity. a model was designed to include all the known features of the inhibitor: the structurally necessary -sheet structure and the fragment containing the p -p Ј environment. three model peptide were synthesized. two model peptides had no inhibiting effect and were cleaved by chymotrypsin. one of the cleavage points is the expected p -p Ј position, while the other positions found to be chymotrypsin preferred positions after the first cleavage. the three-dimensional structures of the model peptides were mapped by nmr. on the basis of nmr structures obtained it has been shown that the cyclopeptide part is more flexible in the models than in sgci. the initial process in the reaction mechanism of a bisubstrate enzyme, rat mercaptopyruvate sulfurttransferase: inactivation study by using chloropyruvate n. nagahara , t. nakagawa , and m. minami department of hygiene and public health, nippon medical school, sendagi bunkyo-ku, tokyo, japan institute for organic chemistry, darmstadt university of technology, darmstadt, germany to investigate the reaction mechanism of a bisubstrate enzyme, rat mercaptopyruvate sulfurtransferase (ec . . . , mst), inactivation kinetics with -chloropyruvate (chloropyruvate) was studied; each inactivation reaction was completed in a preincubation procedure. chloropyruvate is an analog of -mercaptopyruvate (mercaptopyruvate) and irreversibly inhibits mst. the inactivation depended on incubation time and the concentration of chloropyruvate and showed saturation kinetics. the plot for the logarithm of % activity remaining versus preincubation time showed pseudo-first-order. the kinact is . ϫ Ϫ min Ϫ and k is . mm. these suggest that chloropyruvate serves as a mechanism-based inactivator. mercaptoethanol , so that chloropyruvate can approach cys via the donor substrate route and acceptor substrate one, and a ternary complex may be formed prior to the inactivation. these findings suggest that a donor substrate enters the catalytic cavity prior to an acceptor one in the initial process of the mst reaction: mst follows an ordered sequential mechanism. polyketides are natural products of bacteria, fungi, marine organisms and higher plants, many of which have clinical usage. actinorhodin ( ) is an antibiotic produced by streptomyces coelicolor via an iterative type ii polyketide synthase (pks) system. this consists of a multi-enzyme complex with a single catalytic function for each enzyme. departments of chemistry and pediatric surgery, school of medicine, fujita health university, toyoake, japan it has been reported that, in rats with a single intoxication of α-naphthylisothiocyanate (anit), acute liver injury develops with enhanced lipid peroxidation and neutrophil infiltration in the liver tissue. melatonin functions as an antioxidant. melatonin is known to inhibit neutrophil infiltration into damaged liver tissues. therefore, we examined whether melatonin exerts a protective or preventive effect on anit-induced acute liver injury. male wistar rats received a single i.p. injection of anit ( mg/kg) and oral administraton of melatonin ( mg/kg) at or h after anit injection. animals administered with melatonin at and h after anit injection were sacrificed and h, respectively, after the injection. liver injury appeared h after anit injection and developed at h. melatonin administered at h after anit injection prevented liver injury formation with attenuation of increases in hepatic lipid peroxide level and myeloperoxidase activity, an index of neutrophil infiltration. melatonin administered at h after anit injection prevented liver injury development with attenuation of further increase in hepatic lipid peroxide level. thus, melatonin protects against and prevents anit-induced acute liver injury in rats possibly through its antioxidant action and/or its inhibitory action against neutrophil infiltration in the liver tissue. k. okamura-ikeda, s. katayama, k. fujiwara, and y. motokawa t-protein is a component of the glycine cleavage system and catalyzes the tetrahydrofolate-dependent reaction. mutation in human t-protein (ht) gene results in clinical nonketotic hyperglycinemia (nkh). eight point mutations have been identified so far in nkh patients with t-protein deficiency. to understand the structure and function of ht, the wild-type (wtht) and three mutant t-proteins (g r, g d and r h) were expressed in escherichia coli with chaperons groel and groes which facilitated the recovery of the expressed proteins as a soluble form. levels of expression of these proteins were similar but the recovered soluble forms of mutants were about one-third of wtht. g r showed comparable specific activity to wtht, whereas g d and r h mutants exhibited remarkable reduction in specific activity. since homoallelism for g d mutation and heteroallelism for g r and r h mutation were identified in typical and atypical nkh, respectively, these results suggest that g and r h mutations are highly deleterious in the aspects of not only protein folding and/or stability but also catalytsis. on the other hand, g r mutation might affect mainly on the protein stability. detailed characterization of these mutants is now in progress. laboratory of animal nutrition and biochemistry, miyazaki university, miyazaki-shi, japan the minimal actinorhodin pks, shown in black below, consists of the ketosynthase (ks), chain length factor (clf) and acyl carrier protein (acp) and is the minimum set of enzymes required for polyketide production. we have investigated the stoichiometry of the ks-clf complex and the ks-clf:acp minimal system using three methods: . native gel electrophoresis. . cross-linking of proteins using dibromoacetone. . radical cross-linking of proteins. this new method has also been used with wild type s. coelicolor cell free extract with % ks-clf in order to elucidate which proteins are in close proximity to ks-clf during in vitro actinorhodin production. in ruminant animals, essential amino acids have never been completely established, because of the difficulty of its estimation due to the presence of microorganisms such as bacteria and protozoa in the first stomach called rumen. in our previous paper, histidine was shown to be the first limiting amino acid in the rumen contents when evaluated by chemical score. recently we have also reported that rumen microorganisms cannot synthesize histidine from histidinol. on the other hand, there have been some reports which showed that nitrogen balance of ruminants was not improved by supplementation of histidine to rumen microbial protein together with methionine, lysine and threonine which had been known to improve. based on these facts, we have a hypothesis that histidine may not be an essential amino acid for ruminants. in the present paper, we will report about the abilities of cattle liver and kidney to synthesize histidine from histidinol comparing with those of swine liver and kidney. the ability was demonstrated by examining the activities of histidinol dehydrogenase (crude enzyme) by means of direct measurement of an increase in histidine and decrease in histidinol. the amount of histidine produced from histidinol by the enzyme seemed sufficient for meeting the histidine requirement of cattle. the browning reaction is the sequence of events which begins with the reaction of amino group in amino acids, peptides or proteins with glycosidic hydroxyl group of sugars; the sequence terminates with the formation of brown nitrogenous compounds or melanoidines. this reaction gives rise to tremendous number of components such as volatile alcohols, ketones, aldehydes, esters, ethers and sulfur and nitrogen containing heterocycles in addition to nonvolatile amadori compounds and complex brown pigments of medium to high molecular weights. the present study was designed to choose a currently occurring system (aspartic acid -fructose) as a model system, since aspartic acid was found to be one of the most important amino acids in many kinds of food varieties. the reaction was done under controlled conditions of reactants ratios, temperature and time. the reaction mixtures were subjected to successive extractions with suitable solvents where the obtained corresponding flavour concentrates were thoroughly investigated. the results indicated different classes of compounds such as aldehydes, furans, alcohols and alkylated pyrazines varying in quantities depending on the reaction conditions. these products were also investigated concerning their toxicological effects. so, such products of nonenzymatic reactions showed different chemical and biological properties. purification and characterization p. piyarat , s. nagata , h. misono , and k. packdibamrung department of biochemistry, faculty of science, chulalongkorn university, bankok, thailand department of bioresources science, faculty of agriculture, kochi university, nankoku, kochi, japan nad ϩ dependent alanine dehydrogenase was purified fold to homogeneity from aeromonas hydrophila. molecular mass of , daltons was estimated for alanine dehydrogenase by sephadex g- chromatography. sodium dodecyl sulfate-polyacrylamide gel electrophoresis of the purified en-zyme showed polypeptide band with molecular mass of , daltons, indicating that the enzyme is hexamer. the enzyme is highly specific for alanine and nad ϩ . sulfhydryl group of the enzyme plays an important role in the catalysis. the enzyme retained its activity on heating at °c for h. optimum ph for reductive amination and oxidative deamination were . and . , respectively. the steady state kinetic studies including product inhibition on the enzyme reaction indicated that the oxidative deamination proceeds through a sequential ordered binary-ternary mechanism in which nad ϩ binds first to the enzyme followed by l-alanine and products are released in the order of pyruvate, ammonia and nadh, respectively. the k m values for nad ϩ , l-alanine, pyruvate, ammonia and nadh were . , , . , and . mm, respectively. an elevation of apolipoprotein (apo) b- concentrations is a particular feature of several metabolic disorders, such as type diabetes (t d), impaired glucose tolerance (igt), and familial combined hyperlipidemia (fchl). to further understand the in vivo turnover of apolipoprotein b- of very low density lipoprotein subfractions (vldl , svedberg units (s f ) - and vldl , s f - ) kinetic studies were performed in subjects with t d, igt, fchl, and healthy controls using a tracer of either l-[ring- c ]-phenylalanine or l-[ , , - h ]leucine. these studies showed direct hepatic vldl apob- secretion to be increased in patients with t d and igt when compared with controls. in contrast, patients with fchl showed a discrete increase in hepatic vldl apob- secretion. in all patients vldl catabolism is not essentially impaired. vldl apob- secretion is associated with plasma insulin and free fatty acid (ffa) concentrations, resp., whereas vldl apob- secretion is correlated with plasma mevalonate and lathosterol levels. in conclusion, vldl overproduction is supposed to be completely responsible for higher triglyceride (tg) levels found in patients with t d, igt, and fchl. vldl overproduction seems to be regulated by tg and ffa substrate and appears to be an indicator of decreased insulin sensitivity. in contrast, vldl overproduction is more likely to be regulated by the availability of cholesterol substrate. these data give further in vivo evidence that vldl and vldl secretion is regulated independently. arabidopsis resulted in enhanced production of cysteine and glutathione graduate school of pharmaceutical sciences, chiba university, chiba, japan serine acetyltransferase (satase) catalyzes the formation of o-acetyl-l-serine (oas) which is the key intermediate of cysteine biosynthesis. oas is not only a dominant limiting factor but recently suggested as a possible signal molecule for gene expression in cysteine biosynthesis. in has been shown that the activity of cytosolic satase from watermelon was feedback inhibited by l-cysteine. to enhance the ability of cysteine biosynthesis in plants and to reveal the role of oas in the regulation of sulfur assimilation, we made the point-mutated watermelon satase gene (satg c) whose product was not inhibited by cysteine, and introduced satg c into arabidopsis. the contents of oas, cysteine, and glutathione in transgenic arabidopsis were increased significantly as compared to the wild-type arabidopsis. we are currently dealing with the expression analysis of sulfur-related genes in transgenic arabidopsis accumulating oas due to the overexpression of satase. certain amino acids as source of specific branched chain fatty acids in fish sauce manufacture n. g. sanceda , e. suzuki , and t. kurata institute of environmental science for human life, and department of human biological studies, ochanomizu university, tokyo, japan the source of some branched volatile fatty acids (vfa) during the fermentation process in the manufacture of fish sauce was investigated. we previously reported that straight chain volatile acids seemed to have been derived from fish fats but unlikely for branched fatty acids which was believed to be derived from other sources. to clarify the source of branched volatile acids, specific amino acids, alanine, leucine, iso-leucine and valine were used in this study. these amino acids were first mixed with salt and added to fish. the fish mixtures were then aerobically and anaerobically incubated for one and a half months. results showed that addition of valine significantly increased the production of iso-butyric and iso-hexanoic acids and leucine increased that of iso-valeric in the aerobically fermented fish mixtures. a similar tendency was observed in the anaerobically fermented fish mixture except that an increase in the amount of iso-hexanoic acid was observed in the leucine added mixture, which was not observed in the aerobically fermented one. it seemed that specific branched volatile fatty acids were derived from certain amino acids. glutathione (gsh) is an important component of the cellular defense mechanisms that protect cells from oxidative injury. in the retina, the glial (müller) cells have been shown to synthesize and transport gsh, and thus are likely to be involved in regulating gsh levels. in the present study, we have characterized gsh transport system in a müller cell line using s-gsh uptake. the results showed that gsh was taken up in a na ϩ -and concentration-dependent manner with a k m of . mm. moreover, cellular gsh had no effect on the rate of gsh uptake. in related studies, we found that oxidative stress induced the expression of γ-glutamylcysteine synthetase (gcs) subunits, and that gcs mrna levels were correlated with the degree of gsh depletion. because organic anion transporters (oatps) have been implicated in glutathione cotransport, we examined expression of oatp members using rt-pcr. we found that the müller cell line expressed transcripts for oatp , oatp and oatp . these studies indicate that the müller cell plays important role in gsh homeostasis in the retina. in the active site of human porphobilinogen synthase (ec . . . , pbgs), two zinc ions are coordinated by cys , cys and cys , and his and cys , respectively. the fomer zinc ion, closer to catalytic site lys , plays an important role in catalysis. on the other hand, a role of the latter (distal) one has not been clarified. interestingly, in human hep b cell, his was replaced with arg (h r). to elucidate the role of his in catalysis, the kinetic properties of wild type and h r mutant enzymes were studied. these cdnas were cloned by rt-pcr with total rna from human peripheral lymphocyte and hep b cell, respectively. each cdna encoding pbgs with Ј non-coding region was inserted into pet- b(ϩ) vector and then the constract was transformed into e. coli strain bl (de ). the cells were cultured in lb medium containing mg/ml ampicillin and µm zn ion for h at °c. after addition of mm isopropyl--d(Ϫ)-thiogalactopyranoside, cells were further cultured for h at °c. the highly purified pbgss were obtained by ultora centrifugion, fractionation with ammonium sulfate and column chromatographies with deaecellulose, hydroxylapatite and superdex , serially. we are now investigating molecular properties of these pbgss. agriculture and agri-food canada, lacombe research centre, lacombe, alberta, canada handling and management procedures such as capture and restraint can be significant stressors for recently domesticated animals such as elk (cervidae elaphus). the objective of the current study was to investigate the use of pre capture nutritional therapy in attenuating hpa response and improving animal welfare. fourty eight adult male elk stags ranging in age from - years and raised on pasture were used in the study with as control and as nutritionally treated. twenty four hours prior to capture the elk were offered either kg of a cereal grain based dietary supplement or kg of a cereal grain based nutritional therapy product containing specified amino acids (usa patent # ). the amino acid content of the nutritional therapy product was minimally . g per kg animal weight of ala, lys, phen, meth, thre, isoleu, val and tryp plus g per kg weight of leu and g/ kg weight of glut. the animals were subsequently captured and held in appropriate facilities designed to handle elk. saliva samples were collected on all animals immediately following capture and salivary cortisol was monitored by ria. animals offered the nutritional therapy product containing the amino acid mixture displayed lower cortisol levels ( . nmol/l) compared to the untreated controls ( . nmol/l; p Ͻ . ). the data suggest that amino acid therapy can be used to attenuate hpa response to a stressor in captured elk. department of bioengeneering and technology, delhi, new delhi, india resistance to analogues of methionine by corynebacterium lilium results in the partial de-repression of methionine biosynthetic enzymes. the levels of enzymes involved in methionine biosynthesis also increased step-wise by successive endowing the resistant markers, resulting in the overproduction of methionine. moreover, the repressibilities of the enzymes were also reduced by the addition of methionine analogue resistance. analogue resistant mutants were developed by uv induced mutagenesis of corynebacterium lilium (wild type) strain. the single analogue (norleucine) resistant mutant c. lilium nl- produced µg/ml methionine in shake flasks with methionine yield at . g methionine/g glucose and specific methionine production at . mg/g dcw, while double analogue (norleucine and triazole) resistant mutant c. lilium nt- produced µg/ml methionine. a triple analogue (norleucine, triazole and ethionine) resistant mutant c. lilium nte- produced . g/l methionine. the methionine yield was . g methionine/g glucose and its specific productivity was . g methionine/g dcw. clinical biochemistry, laboratory , luxemburg, grand duchy of luxemburg blood plasma glucose level was compared on fast and minutes after oral administration of mg of acetylcysteine. in the group of healthy persons the plasma glucose level feel by . % over the minute period. in the diabetics on the contrary, the plasma glucose level observed minutes after administration of acetylcysteine was . % higher than in blood plasma taken on fast. similar tests were carried out "in vitro" to interpret these different results. the control group consisted of ml of distilled water ϩ . ml % glucose ϩ . ml god pad (boringer mannheim gmbh). in the acetylcysteine group the distilled water was replaced by ml . % solution of acetylcysteine. in the glucagon group the distilled water was replaced by . % solution of glucagen hypokit novo nordisk. spectrometric determination was carried out after minutes of incubation. a % diminution of glucose was observed in the acetylcysteine group in comparison with the control group. a . % increase in glucose was observed in the glucagon group in relation to the controls. the results with healthy persons and the tests "in vitro" indicate that acetylcysteine lowers the level of glucose. but it elevates the level of glucose in the blood plasma of diabetics. it may be presumed that acetylcysteine modifies the insulinglucagon balance in favour of glucagon. the objective of this study was to fortify yogurt with three oilseed protein hydrolysates prepared from soybean (glycine max), sesame (sesanum indicum) and rice bran (oryza sativa) flours. hydrolysis was carried with two enzymes one of plant origin (papain) and the other of microbial origin (alcalase). a yogurt fortification experiment was then carried using the previous hydrolysates. the hydrolysates were added to yoghurt at , and % levels of fortification and the fortified yoghurt was analyzed fresh, and after and days of consuming period. fortified yogurt was chemically examined for fermentation activity (ph values, acidity and proteolysis) as well as its organoleptic properties. results of this experiment indicate that the addition of soybean hydrolysates with papain ( . units/g) for minutes (tb) and rice bran hydrolysates with alcalase ( . units/g) for minutes (te) to yoghurt can ex-ceed - %, while fortification with sesame hydrolysed with papain ( . units/g) for minutes (td) and soybean hydrolysed with papain ( . units/g) for minutes (tc) can not reach up to %. it is well known that dna is fragile to reactive oxygen intermediates (rois) damage. evidences that dna fragmentation and apoptosis occur in cardiomyopathies, in the failing heart and in cultured cells under hyperbaric oxygen (hbo) stress, demonstrated that oxygen free radicals also play a critical role in heart failure. as a consequence, myocardial cell survival depends on response to oxidative stress. experimental data obtained in vitro suggested that polyamines, by acting as rois scavengers, play a role in prevention of endonucleasemediated dna fragmentation and inhibition of alkylating agents-mediated damage, potentially exerting a protective role against rois damage. thus we studied polyamine metabolism and superoxide dismutase (sod) expression in an in vivo model of heart oxidative stress, such as rats subjected to hbo. four experimental groups were used: ) controls; ) rats subjected to hbo for min once and immediately sacrificed; ) rats treated as group but for consecutive days and immediately sacrificed; ) rats treated as group but sacrificed h later (recovery). northern blot analyses showed that odc mrna accumulation increased immediately (paralleled by activity) in groups - , while ssat mrna decreased remarkably, thus leading to higher polyamine concentration in rois-stressed hearts. contrariwise, sod mrna level decreased rapidly in groups - . this suggests that hbo-induced compensatory mechanism in rat heart is based on specific and rapid boosting of polyamine concentration, caused by coordinate induction of biosynthesis and inhibition of catabolism, and not of enzymes known to metabolise rois such as sod. amino acids oxidation was greater in tumor-bearing rats muscle. leucine is an important ketogenic amino acid that proves energy to the skeletal muscle. leucine supplemented diet was used to analyze the effects produced by walker growing in pregnant rats which were distributed into six groups. three groups received normal diet ( % protein): control (c), tumor-bearing (w), pair-fed rats (cp). three groups were fed with diet supplemented with % leucine ( % protein plus % leucine): pregnant fed with leucine (l), tumor-bearing with leucine (wl) and pair-fed with leucine (lp). after days, the animals were submitted to intestinal perfusion to measure leucine, methionine and glucose absorption. leucine absorption increased in w and wl groups. glucose absorption reduced in tumor-bearing. in pregnancy with cancer, metabolic changes provided both reduced fetal and tumor development. tumor-bearing rats showed increase in methionine and leucine absorption, probably diverting this nutrients to tumor cells. glucose absorption reduced in w and wl. leucine supplemented diet group promoted high leucine absorption which could be used by neoplasic cells, and mainly by fetus and host. probably, the transamination of the branch long chain amino acid provided energy substract for the skeletal muscle, keeping the nitrogen offered to host carcass. ( undernutrition cause several changes as body weight loss, in biochemical parameters, even microscopic alteration in absorptive epithelium. this means the nutrients absorption process has been harmfully and consequently increase the damages caused by malnourished. knowing leucine is used as a ketonic and oxidative amino acid our main propose was to recovery the malnourished young rats with normal (rc) and leucine supplemented diet (rl, % of leucine) for days. it was measured body, liver, and muscle weight, intestinal absorption of glucose, methionine and leucine, and body chemical composition. the body weight gain in rc and rl was higher than control group, suggesting that nutritional replacement for these groups could provided nutrients to support the body weight recovery, reaching as the same weight as the control. methionine and glucose absorption was reduced in malnourished group, but it was recovered (glucose, methionine and leucine) after nutritional replacement. leucine supplemented diet promoted a good recovery of carcass collagen nitrogen, keeping the carcass structural nitrogen. further studies are necessary to investigate this mechanism. [financial support: fapesp ( we diagnosed the very rare autosomal recessive disorder hyperprolinaemia type ii (deficiency of ∆ -pyrroline- carboxylate dehydrogenase, ec . . . ) in a girl aged months presenting with seizures and encephalopathy. l-∆ pyrroline- -carboxylic acid accumulates in this disorder and there is a - -fold increase in plasma proline. surprisingly, she also had vitamin b deficiency. this was an unrecognised association, which was not explained by her diet or medications. we hypothesised that pyridoxal phosphate (vitamin b coenzyme) was de-activated by l-∆ -pyrroline- -carboxylic acid. with high resolution h nuclear magnetic resonance spectroscopy and mass spectrometry, we have shown that these two compounds react at ph . and °c in vitro to form three novel adducts, which we characterised. they are products of a claisen condensation (or knoevenagel type of reaction) of the activated c carbon of the pyrroline ring with the aldehyde carbon of pyridoxal phosphate. if this previously unreported interaction occurs in vivo, pyrroline- -carboxylic acid is a unique endogenous vitamin antagonist. preliminary observations show that pyrroline- -carboxylic acid also condenses with other biologically important aldehydes and ketones. some of these reactions may contribute to the brain disturbances in hyperprolinaemia type ii. we have already identified adducts with acetoacetic acid in urine from our child, which is evidence that condensation can occur in vivo. the kidneys are characterized by a high activity of γglutamyl transpeptidase (γ-gt), as well as by a high cysteine level. the present paper was aimed to obtain information on how the activity of γ-gt and the levels of non-protein sulfhydryl compounds (npsh) changed with age in rat kidneys. simultaneously, protein-bound cysteine (pb-cys) and sulfane sulfur compounds were estimated. the kidneys were from following rats groups: young ( -month-old), middle-aged ( month-old) and old ( -month-old). the obtained results showed that the activity of γgt and npsh levels in the kidneys fell with age. at the same time, a significant increase in the level of protein-bound cysteine was observed. on the other hand, the content of sulfane sulfur compounds was elevated in the group of the oldest animals. these findings indicate that -due to disturbances in the γ-glutamyl cycle -the capacity for extracellular glutathione degradation and, in consequence, the availability of cysteine for intracellular gsh biosynthesis may be impaired. the increased pb-cys level indicates potentiation of the thiolation reaction, i.e. development of protein-mixed disulphides, cysteine, sulfane sulfur compounds, oxygen reactive species. national research centre, dokki, cairo, egypt in the past few years, many attempts have been made to prepare a synthetic insulin. the biological activity of insulin is known to be closely related to the c-terminal octapeptide fragment of its b-chain. this does not necessarily mean, however, that each of the amino acid residues of the octapeptide fragment is essential for its activity. it was found that b gly and b phe were present in all insulins so far obtained from various animal species indicating the significance of these two residues. it would therefore seem desirable to study the effect of each of these two amino acid residues or both on biological activity of the octapeptiede fragment of the b-chain. weitzel et al. found that the substitution of arginine b with another amino acid resulted in a very large decrease in biological activity, which indicates that it participates in the action of insulin. also it was found that the aromatic amino acid residues (b -b ) participate in the action of insulin. a heptapeptide arg-phe-tyr-thr-pro-lys-ala-och , corresponding to (b -b ) insulin des gly -phe , and an octapeptide arg-phe-phe-tyr-thr-pro-lys-ala-och , des gly were synthesized using the solid phase method. the c-termenal ends of both peptide were converted to methyl ester by transesterification cleavage from the resin. the side chain protecting groups were removed by hf. manual counter current distribution method was used for purification of the free peptides. the way to solve the evaluation of tyrosine containing peptide was studied. the free methyl ester peptides were administrated for insulin-like activity test by glucose metabolism in the rat fat cells technique in vitro. nitric oxide synthase inhibitors influence dynorphin immunoreactivity in the rat brain following hyperthermia p. alm and h. s. sharma department of pathology, university hospital, lund university, lund, and laboratory of neuroanatomy, department of medical cell biology, biomedical centre, uppsala university, uppsala, sweden nitric oxide (no) is a free radical gas that influences neuronal communication in the central nervous system (cns). recent reports suggest that no can influence dynorphin neurotransmission in the normal brain as well as in several pathological states. previous reports from our laboratory show that the enzyme nitric oxide synthase (nos) responsible for no formation is upregulated in several brain regions following hyperthermia. the present investigation was carried out to find, whether hyperthermia can influence dynorphin immunoreactivity in the brain, and if so, whether inhibition of nitric oxide synthesis will alter its distribution in heat stressed rats. rats subjected to hyperthermia at °c for h in a biological oxygen demand incubator (bod) resulted in marked redistribution of dynorphin immunoreactivity in several brain regions e.g., cerebral cortex, hippocampus, cerebellum and brain stem. pretreatment with two potent nos inhibitors, l-name ( mg/kg, i.p.) and l-nmma ( mg/kg, i.p.) min before heat stress significantly altered the dynorphin immunoreactivity in the brain. these drugs alone however, did not influence the peptide expression in normal rats. the results suggest that (i) hyperthermia has the capacity to influence dynorphin immunoreactivity in the brain, and (ii) inhibition of nitric oxide synthase considerably influences the dynorphin immunoreaction in hyperthermia, not reported earlier. the functional changes induced by uncompetitive and competitive nmda antagonists, memantine, amantadine and mk- , and cgp , respectively, were studied in both saline-pretreated and mptp-pretreated c bl/ mice. the nmda antagonists were administered acutely by themselves or in combinations of either: nmda antagonist plus subthreshold l-dopa dose or nmda antagonist plus suprathreshold l-dopa dose, to either the mptp-pretreated or the salinetreated mice. activity-enhancing or functional restorative effects of the nmda antagonists were variable with memantine and mk- distinguished from amantadine and cgp . in the study of long-term effect of nmda antagonists mk- was administered postnatally and spontaneous motor behaviour and motor activity in response to several pharmacological interventions was assessed. marked alterations associated possible with apoptotic penchance are discussed. t. archer and a. fredriksson department of psychology, university of göteborg, and department of psychiatry, university of uppsala, ulleråkers hospital, uppsala, sweden synergistic antiparkinsonian actions of different classes of putative therapeutic agents co-administered with a subthreshold dose of l-dopa ( mg/kg) in drug-naive mptp-treated mice as well as the restorative actions of those compounds in suprathreshold l-dopa-tolerant mptp-treated mice subjected to "wearing-off" of l-dopa efficacy were assessed in a series of experiments. the classes of compounds studied included the noncompetitive nmda antagonists, memantine, amantadine and mk- , the anticonvulsive and putative anticonvulsive agents, lamotrigine, fce , phenytoin, the monoamine oxidase inhibitors, l-deprenyl, amiflamine, α-ethyltryptamine, clorgyline and phenelzine, and the α -adrenoceptor agonists, clonidine and guanfacine. in this final case, the restorative effects of clonidine and guanfacine were antagonised by the α -adrenoceptor antagonist, yohimbine, but not the α adrenoceptor antagonist, prazosin. within each class of potentially therapeutic agents a differential restorative efficacy was obtained, but the combination of different doses of apomorphine with clondine failed to restore motor activity. in vivo proton mr-spectroscopy of the human brain: assessment of n-acetylaspartate (naa) reduction as a marker for neurodegeneration w. block , f. träber , s. flacke , f. jessen , ch. pohl , and h. h. schild department of radiology, department of psychiatry, and department of neurology, university of bonn, germany proton magnetic resonance spectroscopy ( h-mrs) is a well accepted non-invasive method to investigate changes in brain metabolite composition in different types of cerebral disease. we performed proton spectroscopy in patients with dementia of the alzheimer's type (ad) and in patients with motor neuron disease (mnd) with the aim to detect a specific metabolic pattern for each of these two neurodegenerative disorders. overall, more than spectroscopic data sets of patients with mnd and more than data sets of ad patients were acquired within the last years. in the mnd group we found a significant reduction of naa/tcr metabolite ratios in the central region, which correlates with the disease severity and the clinical lateralisation of neurological symptoms and increases in the time course of the disease. in ad patients a similar reduction in relative naa contents was observed in the medial temporal lobe. the observed regional metabolic alterations correlate well with the characteristic neurological symptoms in ad (dementia) and mnd (muscular palsy) and seem to follow the disease process over time. since naa is exclusively expressed in neurons as shown by immunohistochemical studies, reduced naa levels suggest neuronal loss or dysfunction in the observed regions. center for molecular imaging research, massachusetts general hospital, boston, massachusetts, u.s.a. non-invasive measurement of hemodynamic parameters and imaging neovasculature architecture during angiogenesis is highly important in determining tumor prognosis and in assessing treatment efficacy. we suggested a technique to map the tumor vascular (vvf) and interstitial volume fraction (ivf) noninvasively in vivo. a poly-l-lysine based macromolecular probe (mpeg-pl-gddtpa) with extended circulation in the bloodstream designed to shield chelated paramagnetic lantanide with poly(ethylene glycol) chains. we hypothesized that a magnetic resonance signal after intravenous administration of a vascular paramagnetic probe can be maximized so the signal change after administration of a second comound (gddtpa) reflects the ivf but not the vvf. the method and its assumptions were verified in animal models of cancer. tumoral vvf and ivf values were consistent with histology data and literature values. imaging showed heterogeneity of both parameters at submillimeter pixel resolution. this technique was used for characterizing differential angiogenesis in human mammary adenocarcinoma lines as well as for imaging anti-angiogenic drug effects. anti-angiogenesis was induced using synthetic d-reverse peptides derived from thrombospondin- . this study showed that peptide treatment results in slower brain tumor growth due to inhibition of de novo blood vessel formation and synergistic anti-proliferative effect on tumor cells. in conclusion, in vivo mr imaging can be used for non-invasive treatment assessment of novel antiangiogenic drugs. wallenberg neuroscience centre, lund university, lund, sweden we have recently found that -hydroxydopamine lesioned rats gradually develop dyskinetic-and dystonic-like movements upon repeated administration of a therapeutic dose of l-dopa. such movements simulate the time course of peak-dose dyskinesia in parkinson's disease. in this rat model, the severity of l-dopa-induced dyskinesia is strongly correlated with an upregulated expression of the prodynorphin gene in striatal neurons. using antisense technology and gel-shift assay analyses, we have addressed the role of transcription factors which may mediate this response. we have found that the camp response-element binding protein (creb) is essential in maintaining a basal expression of prodynorphin mrna in the intact striatum, but it is not required for l-dopa to induce the prodynorphin gene in dopamine-denervated striatal neurons. we have thus addressed the role of fos-and jun family tran-scription factors, and found very high levels of fosb-and jundlike proteins in the striata of dyskinetic animals. these proteins could bind to both ap and cre sites in the prodynorphin promoter. moreover, intrastriatal fosb knockdown could inhibit both the upregulation of prodynorphin gene expression and the development of dyskinesias under chronic l-dopa treatment. we propose that dimers of fosb-and jund-like proteins mediate abnormal changes in striatal gene expression which are linked to the development of l-dopa-induced dyskinesia. department of pharmacology, grünenthal gmbh r&d, aachen, germany glutamate plays important roles in both normal and pathophysiological nocieception. upon physiological conditions, glutamate release from primary afferents in the spinal cord activates largely ampa receptors. as those are ubiquitously involved in fast transmission in the cns, ampa antagonists have a broad side-effect profile. prolonged activation of nociceptors by tissue damage, inflammation or nerve injury evokes a long-lasting release of glutamate and neuropeptides, activating nmda receptors in the spinal cord. this mechanism appears to play a key role in pain chronification. the nmda receptor is, therefore, an important target for chronic pain treatment. both animal and human studies confirm the efficacy of nmda antagonists in chronic pain, however, clinically available compounds are weak or have unacceptable side-effects. glycine b antagonists and compounds selectively blocking nr b-containing receptors appear to be safer, the reasons for this remain unclear. central side-effects could potentially be avoided by using nmda antagonists with restricted central access. peripheral nmda receptors (as well as some other subtypes of glurs) could be activated by glutamate released from the site of injury, thus contributing to peripheral hyperexcitability. some other subtypes of glurs can also contribute to peripheral sensitisation. of ionotropic glurs, kainate receptors appear important in inflammatory and neuropathic pain. they can be activated by high intensity stimulation of nociceptive afferents, and may act as autoreceptors controlling release of glutamate. group i metabotropic glurs are also present on primary afferents and on second order neurones in the spinal cord, and may play a similar role. antagonists of these subtypes of glurs are active in some models of chronic pain. specific upregulation of group ii metabotropic glurs in some pain-relevant structures could reflect a possible adaptive role of these inhibitory receptors under chronic pain conditions; their selective agonists also have a potential for the treatment of chronic pain. we have performed a series of studies of the distribution and function of mglur subtypes in the basal ganglia that suggest that members of this receptor family could serve as targets for novel therapeutic agents that would be effective in treatment of pd. for instance, two group ii mglurs (mglur and mglur ) are localized on presynaptic terminals of striatal neurons in the globus pallidus where they could reduce gaba release. furthermore, activation of group i mglurs results in a depolarization and increased cell firing of neurons in the subthalamic nucleus (stn) and projection neurons of the substantia nigra pars reticulata (snpr). interestingly, this effect is mediated by mglur in snpr projection neurons and mglur in stn neurons. finally, activation of group ii mglurs results in inhibition of glutamate release from stn terminals in the snpr. furthermore, selective agonists of group ii mglurs inhibit haloperidol-induced catalepsy in rats, suggesting an antiparkinsonian effect of these compounds. the rich distribution and diverse physiological roles of mglurs in basal ganglia raises the possibility that these receptors may provide targets for novel therapeutic agents that could be used for treatment of pd and related disorders. a. cupello , m. parodi , and m. balestrino centro di neurofisiologia cerebrale, cnr, genova and dipartimento di scienze neurologiche, university of genova, italy in vitro rat hippocampal slices are commonly used to study the effects of hypoxia in the central nervous system, because they allow to differentiate the effects of hypoxia in the brain from that of systemic (e.g., respiratory and cardiac) failure that may accompany hypoxia. we used electrophysiology to monitor and evaluate the damage caused by transient hypoxia to the nervous tissue. a few minutes after oxygen deprivation brain tissue suddenly depolarizes. this event, which is termed ìanoxic depolarizationî is accompanied by dramatic metabolic changes: transmembrane ionic gradients disappear (na ϩ enters, k ϩ exits the neurons), neurons swell, there is intra-and extra-cellular acidosis. this event is caused by functional inactivation of (na ϩ / k ϩ )atpase caused by decreased atp content, as it is suggested by the fact that it is mimicked by ouabain treatment. one of us has contributed to show that if this event is not promptly reversed by reoxygenation it causes irreversible damage, mainly by determining a massive entry of ca ϩ into neurons. pretreatment of tissue with creatine ( mm or more) both increases neuronal energy store by increasing neuronal phosphocreatine and protects brain tissue from irreversible damage. in vivo increase in phosphocreatine has been shown using lower ( . mm) creatine concentration, injected directly into the lateral ventricle. a different type of hypoxia-induced damage is observed when hypoxia is of shorter duration. in this case upon reoxygenation one does not observe disappearance of evoked potentials but their increase. this phenomenon, originally described as ìpost-hypoxic hyperexcitabilityî has been later called ìanoxic long-term potentiation (ltp)î. we showed that this event can be prevented by inactivating the nuclear protein s- . while this damage is milder than that induced by anoxic depolarization, it may explain stroke-induced epileptic fits. we are currently investigating what role, if any, pretreatment with creatine may have in preventing also this type of damage. cytoskeleton is subject to continuous modification to yield changes in cell shape and function of plasmamembrane proteins linked to the cytoskeleton. gelsolin (gsn) depolymerizes filamentous actin and thus causes dynamic uncoupling of membrane ion channels. we have studied alteration of neuronal ca ϩ influx by the absence of gsn and its pathophysiological consequences during cerebral ischemia. cytosolic ca ϩ concentrations were determined ratiometrically in synaptosomes preloaded with fura- am. glutamate release from synaptosomes superfused with krebs' buffer was measured by hplc. transient focal cerebral ischemia was induced by h occlusion of the middle cerebral artery (mca). in gsn deficient mouse brain cortical synaptosomes [ca ϩ ] i increase in response to k ϩ ( mm) depolarization was % higher than in wild-type. ω-agatoxin iva . µm decreased ca ϩ -influx in neocortical wild-type synaptosomes by %, and abolished differences between gsnϩ/ϩ and Ϫ/Ϫ genotype. k ϩinduced release of glutamate in neocortical synaptosomes was % higher and lesion size after mca occlusion was % higher than in wild-type. it is concluded that presynaptic ca ϩ influx is increased in gsn deficient nerve terminals which, together with subsequently increased glutamate release, increases neuronal vulnerability. in vivo assessment of tissue alteration in cerebrovascular and neurodegenerative diseases s. flacke, w. block, f. träber, p. mürtz, h. urbach, and h. schild the combined used of perfusion imaging (pi) and molecular diffusion imaging (dwi) are opening a new window into the processes that occur during the first hours of ischemia. dwi detects early changes of proton diffusion associated with cytotoxic edema. pi has the potential to characterize the degree of regional hypoperfusion. mismatches between dwi and pi, i.e. hypoperfused areas with normal adc are considered potentially salvageable. we present results of patients with an angiographically defined thrombembolus in the middle cerebral artery and a spontaneous stroke evolution. whereas the infarct core was clearly visible on both dwi and pi, tissue at risk of infarction could only be detected by an increased blood volume and transit time. however only in a subgroup of patients (n ϭ ) these areas were incorporated into the final infarct. in these patients perfusion parameter of tissue at risk of infarction were more pronouncedly altered than in those where the tissue at risk was spared from infarction (ratios of tissue at risk vs normal (rcbv . Ϯ . , mtt . Ϯ . , ttp . Ϯ . , p Ͻ . ). these human data show that a detailed analysis of diffusion/ perfusion mismatches allow the identification of tissue at risk of damage. glucose deprivation enhances the sensitivity of cerebellar granule cells to die by excitotoxicity. we have previously reported that neither min of glucose deprivation, a treatment that depletes cell energy resources, nor exposure to µm glutamate (glu) for min, induce significant cell death in cerebellar granule cell cultures, h after treatment. in contrast, the combined treatment with µm glu and glucose deprivation induces both cell death and choline (cho) release. we investigated whether the neurotoxic effect of this treatment is related with inhibition of phosphatidylcholine (pc) synthesis. we found that exposure to µm glu alone for min, glucose deprivation for min, and the combination of both treatments inhibited pc synthesis when measured at the end of treatment by %, % and %, respectively. furthermore, we found that exposure to either µm glu, glucose deprivation or µm glu ϩ glucose deprivation decreased incorporation of [ h]cho into phosphocholine and increased the intracellular content of free [ h]cho, indicating that all these treatments inhibit the synthesis of pc by inhibiting choline kinase activity. since only the combined treatment with µm glu plus glucose deprivation evoked cho release and excitotoxic cell death, the present results indicate that other factors in addition to inhibition of pc synthesis are required to induce cho release and excitotoxic cell death in cerebellar granule cells. (supported by cicyt, saf - .) the role of striatal metabotropic glutamate receptors in degeneration of dopamine neurons k. goĺembiowska, j. konieczny, k. ossowska, and institute of pharmacology, polish academy of sciences, kraków, poland the present study was undertaken to characterize the effect of blockade of the mglu receptor subtype by -methyl- -phenylethynylpyridine (mpep), as well as the effect of stimulation of the mglu / receptor by (Ϫ)- -oxa- aminobicyclo[ . . ]hexane- , -dicarboxylic acid (ly ) on spontaneous and stimulated dopamine (da) release in rat striatum using an in vivo microdialysis. mpep ( - µm), perfused through a microdialysis probe affected neither the basal nor the veratridine ( µm)-stimulated striatal da release. however, mpep given intraperitoneally ( mg/kg) diminished either the basal or the veratridine-evoked da release. ly ( - µm) administered locally also inhibited the veratridine-evoked da release in rat striatum. antagonists of mglur-i and agonists of mglur-ii have been shown to have neuroprotective properties in several models of neurotoxicity in animals. we have approached this issue using a selective mglu antagonist in an animal model of neurotoxicity induced by methamphetamine. in our preliminary experiments, methamphetamine ( ϫ mg/kg sc every two hours) decreased the tissue content of striatal da and its metabolites dopac and hva.mpep ( ϫ mg/kg ip) given before every methamphetamine injection reversed its action. the effect exerted by the mglu antagonist mpep seem to be mediated by sites located outside the striatum due to relieving da neurons of the facilitatory influence of glutamate. in turn, the attenuation of da release from nigrostriatal terminals by ly may be a consequence of activation of striatal mglu / receptors. reversal of the methampetamine-induced da depletion suggests a potential for neuroprotective activity of mpep. o. golubnitschaja , h. h. schild , and j. flammer department of radiology, university of bonn, germany university eye clinic, basel, switzerland glaucoma remains a major eye illness with unknown etiology. although elevated intraocular pressure has been shown to be the major risk factor, there is a cohort of relatively young patients developing normal-tension glaucoma (ntg). assymptomatic ischemic events in brain have been shown to be often attributable to galucoma. perfusion of the retina and optic nerve head suffering from observed vasospastic dysfunction may be further reduced by changes in the intraocular pressure. ocular ischemia developed due to these blood flow deficits may play a major role in initiation of glaucoma. possibly secondary to ischemia the autoimmunogenic capacity is activated by ntg patients having an increased prevalence of systemic autoimmune diseases. therefore, the determination of potential molecular markers in blood lymphocytes could be useful for early diagnostics of ntg. our recent study using "gent hunting"-techniques showed indeed altered gene expression in lymphocytes of ntg patients. the demonstrated downregulation of xpgc gene expression which subsequently leads to the accumulation of damaged dna and an elevated p expression, together with the upregulation of a new abc-transporter seem to be specific for the pathogenesis of ntg. molecular imaging of ntg provides insights in mechanisms of disease initiation and allows the early diagnostics and preventive treatment. (supported by "bio-rad" and "amersham pharmacia aggregate cell cultures prepared from fetal rat telencephalon were used to study neuronal amino acid consumption during glucose restriction. to that end, both mixed (neuronglia) and neuron-enriched cultures were grown in chemically defined medium and tested at an advanced maturational stage. it was found that h of exposure to reduced glucose ( . mm instead of mm) caused significant increases in the consumption of several amino acids and the accumulation of ammonia. it also greatly changed the intracellular level of several amino acids in neurons, particularly of aspartate and glutamate. irreversible neuron-specific damage was observed one week after the insult. elevated glutamine media concentrations ( mm instead of . mm) during glucose restriction further increased ammonia production and neuronal damage, although the overall rate of glutamine metabolism remained practically unchanged. taken together, our findings suggest that glucose deficiency caused (i) the dysfunction of crucial transamination pathways; (ii) a shift towards the oxidative deamination of glutamine and several other amino acids used by neurons as alternative energy substrates; and (iii) the accumulation of neurotoxic ammonia levels. institute for brain research, university of vienna, austria the racemic (d,l) mixture of the naturally occurring neutral aromatic amino acid , -dihydroxy-l-phenylalanine (l-dopa) was first synthetized in . in , the natural levorotatory isomer was isolated from vicia faba beans and declared to be biologically inactive. however, in l-dopa was observed to lower the blood pressure in the rabbit, an effect opposite to the vasopressor effect of adrenaline. following the discovery, in , of the enzyme l-dopa decarboxylase, ldopa's conversion in tissues (by decarboxylation) to dopamine (da), the first biologically active substance in the biosynthetic pathway of catecholamines, was demonstrated. subsequent pharmacological studies, done between and , showed that the biological actions of l-dopa were, in principle, due to da formed from it in the body. in , the central antireserpine effect of d,l-dopa was described in mice and confirmed in with l-dopa in humans. following the demonstration of da's occurrence in the brain in the years / , d,l-dopa was found (in rabbits) to restore brain da levels, reduced by reserpine. in , the severe brain da deficit, confined to patients with parkinson's disease (pd) was reported and a year later l-dopa's superior anti-akinesia effect in patients with pd demonstrated. finally, in the high-dose oral l-dopa regimen was successfully introduced into clinical practise. in contrast to these supreme achievements, two related early studies remained, for different reasons, without consequence. despite some initial doubts about its mechanism of action, there is now convincing evidence for l-dopa therapy being a classic example of a central neurotransmitter replacement therapy, with the severe brain da deficit furnishing a rational basis for the amino acid's clinical use and high efficacy in patients suffering from pd. b. jakobsen , a. tasker , and j. zimmer anatomy and neurobiology, sdu-odense university, odense, denmark department of anatomy and physiology, university of prince edward island, canada the toxicity of domoic acid (dom) was studied in rat hippocampal slice cultures, prepared from -days old rats and grown on semipermeable membranes for - weeks before exposure. dom ( . - µm) was added to the medium, alone or together with the glutamate receptor antagonists ns- , nbqx or mk- , for hrs followed by hrs in normal medium. neuronal degeneration was monitored and ec values estimated by densitometric measurements of the cellular uptake of the propidium iodide (pi) at , , and hrs. the lowest ec values, obtained at hrs, were: ca ( µm), dentate granule cells (dg) and ca ab ( µm),ca c ( µm). protective effects of µm nbqx at hrs were seen against µm dom in dg, ca and ca c and against µm dom in ca and ca c. µm ns and mk only displayed protective effects together with nbqx. mk thus significantly increased the protective effect of µm nbqx in ca against µm dom in combination with µm nbqx and µm ns . we can confirm that dom neurotoxicity primarily involves ampa/kainate receptors, but also nmda receptors at high concentrations (glutamate release). department of physiology/neuroscience, medical university of south carolina, charleston, south carolina, u.s.a. although dopamine has been most clearly tied to the development of addiction to drugs of abuse, recent studies indicate that once addiction has been established the expression of addictive behaviors, such as drug craving, is mediated more by long-term neuroadaptaions in glutamate transmission. data will be presented which supports and extends this hypothesis. repeated cocaine injections were given for one week and three weeks after the last drug injection a number of molecular, neurochemical and behavioral neuroadaptations were measured. it was found that in the nucleus accumbens there is a increase in the expression of genes encoding mglur / and glur , and a decrease in the expression of mglur and its accompanying scaffolding proteins homer bc. this was accompanied by an increase in the capacity of mglur / receptors to regulate presynaptic glutamate release and a blunting in the effects of stimulating mglur / receptors. in addition, there is reduced activity in the cystine/glutamate exchanger wks after repeated cocaine. as a result of these changes there is a decrease in the basal release of glutamate, and a relative increase in the releasibility of glutamate upon stimulation. by using the reinstatement model of drug seeking behavior, it was shown that glutamate transmission in the projection from the prelimbic cortex to the core of the nucleus accumbens was particularly affected by the cocaine-induced changes in gene expression. taken together, these findings support the use of glutamate autoreceptor agonists as possible therapeutic adjuvants in treating the cravings associated with addiction. dopamine (da), a catechol that autoxidizes to an oquinone, is implicated as an endogenous pro-toxin, however, the following studies suggest that da has dual neurodegenerative and neuroprotective roles. in rats treated as neonates with -hydroxydopamine ( -ohda; :g icv), there was a % reduction in striatal tissue da content in adulthood, and a to fold increase in spontaneous hydroxyl radical (ho*) formation (indirect salicylate trapping method: dihydroxybenzoic acid analysis). additionally, systemic l-dopa ( mg/kg i.p.) suppressed ho* formation. however, when glutamate ( mm) was added to an in vivo microdialysate, ho* formation was increased substantially more in the microdialysate from dainnervated striatum. these findings indicate that da innervation is inherently neuroprotective, but in the presence of a high level of an excitatory amino acid, da innervation predisposes to formation of reactive oxygen species. ongoing neuronal activity is likely to interact with and to determine the role of da as a neurotoxic or neuroprotective substance. (supported by ns .) the glutamate hypothesis of schizophrenia along with the dopamine hypothesis was intensively discussed in the past. the last years however suggest more and more that neither a hypofunction of the glutamatergic system alone nor a hypofunction of the dopaminergic system alone is responsible for symptoms found in schizophrenia. the basal ganglia (bg) as the critical structures mediating symptoms of schizophrenia are innervated by dopaminergic fibers from the mesencephalon as well as by glutamatergic fibers from limbic structures; like prefrontal cortex, hippocampus, entorhinal cortex and amygdala. thus, limbic input is able to modulate information processing in each structure of the bg and by this way control dopaminergic functions through feedback mechanisms. dysfunction in limbic structrues may result in an imbalance of information processing via the bg and terminates in behavioral symptoms of schizophrenia. we showed in recent neurochemical studies in combination with behavioral analysis that a simple, generalized hypofunction of limbic glutamatergic input on bg nuclei is not the key mechanism inducing schizophrenic behavior. a dysfunction of a particular limbic structure or pathway seems to be responsible for an imbalanced information processing via the bg and imbalanced behavioral adaptation terminating in schizophrenic symptoms. [ there is a need to identify subtype-specific ligands for mglu receptors to elucidate the potential of these receptors for the treatment of nervous system disorders. to date, most mglur antagonists are amino acid-like compounds acting as competitive antagonists at the glutamate binding site located in the large extracellular n-terminal domain. we have investigated novel subtype-selective mglur antagonists which are structurally unrelated to competitive mglur ligands. using a series of chimeric receptors and point mutations we demonstrate that these antagonists interact with novel allosteric binding sites in the tm domain via a noncompetitive mechanism of action. recent studies in animal models implicate mglu receptors as a potentially important therapeutic target particularly for the treatment of pain and anxiety. vascular endothelial growth factor (vegf) is a major mediator in angiogenesis and vascular permeability. in central nervous system (cns) vegf plays pivotal roles such e.g., inductor of endothelial cell proliferation, migration and inhibition of apoptosis, as well as mediator of blood brain barrier (bbb) breakdown and subsequently of brain edema formation. these ubiquitous epiphenomena are major complications in several cns pathologies, including head trauma and stroke. reduced tissue oxygen tension (hypoxia) and hypoglycaemia triggers vegf expression that occurs in ischemic regions around postraumatic or postinfarct necrosis. after brain injury, the expression of vegf is increased contributing to disruption of the bbb. vegf increases the permeability of bbb via the synthesis/release of nitric oxide and subsequent activation of soluble guanylate cyclase. the immunohistochemistry shows an increase of stained astrocytes around a cortical micronecrosis. vegf participates in the response of the cns to injury in a dose dependent way. immunostaining correlates with infarct volume and clinical disability. vegf-antagonists reduce ischemic brain edema and injury, involving vegf in pathogenesis and eventually in treatment of stroke and related disorders. this cytokine also exerts a neuroprotective effect mediated by its receptor flk- . functions related to the inflammatory response, co-expression with proteins of the ecm and interaction with the two main receptors, flk- and flt- , will be discussed. n-methyl-d-aspartate (nmda) receptors can mediate excitotoxic or neuroprotective responses. one of the molecular mechanisms responsible for nmda neuroprotection involves the release of brain-derived neurotrophic factor (bdnf) which in turn binds to and activates its cognate receptor trkb. bdnf levels in the neuronal culture medium increased -fold when cells were preincubated for three hours with nmda. at three hours, the increase in bdnf protein levels in the medium was accompanied by a concomitant increase in bdnf mrna. thus, nmda elicited two temporally distinct responses: an early release of bdnf protein followed by a later transcriptional activation of dbnf mrna and protein release. these results suggest that nmda activates the trkb receptor via a bdnf autocrine loop resulting in neuronal survival. in addition, extracellular regulated kinases (erk / ) were rapidly activated, which peaked within six hours of nmda treatment. erk / activation is completely blocked by mk- and partially blocked by k a, suggesting the nmda and trkb receptors act in a coordinated fashion to activate erk / . as an extension of this work, we discovered a single nucleotide polymorphism in the human nr gene that, when transfected into hek cells, alters the electrophysiological properties of the nmda receptor complex. possible consequences of this nmda receptor variant in signaling will be discussed. this overview summarizes our recent knowledge of the role that tyrosyl radical (tyro • ) can play in neurochemical systems of brain and thereby lead to neural disorders (pd, ad, als). these could involve the interactions of tyrosine and tyro • with reactive oxygen species (ros) and reactive nitrogen species (rns), via radical mechanisms and chain processes in the presence of o and endogenous brain antioxidants. concentrations of tyro • , ros and rns can increase dramatically under conditions of generalized stress: oxidative, nitrative or reductive. this in turn can directly damage (by lipid peroxidation) or indirectly damage (by protein oxidation and/or nitration) cellular substructures which ultimately can lead to apoptotic neuronal cell death or autoschizis. enzymatically (classical peroxidase mechanisms) or non-enzymatically formed tyro • can react with no • and this reversible and intrinsic "combination" acts to "buffer' tyro • concentrations. the reaction of tyro • with superoxide (o •Ϫ ) is a scavenging reaction which proceeds rather by addition, not by electron transfer; and major resultant products are tyrosine hydroperoxides (tyrooh). however, the decay of tyro • can be also terminated by self-termination (dimerization) resulting in dityrosine (dt) formation. tyro • can catalyze ldl oxidation, although the precise mechanisms of this reaction in vivo remain unknown. nitration of tyrosine to -nitrotyrosine ( -nt) requires a one-electron oxidation as a primary step, with formation of tyro • , followed by addition of the nitrogen dioxide radical (no • ). the promoting effect of carbon dioxide on peroxynitrite-mediated tyrosine nitration (via radical mechanisms) (tyro • /no • /o •Ϫ /no • system) is due to the selective reactivity of the putative carbonate radical anion, as compared to that of the oxidizing hydroxyl radicals ( • oh). moreover, once formed, -nt may act to promote repetitive redox cycling; it may be reduced to the corresponding nitroanion radical, which is then oxidized by molecular o to o •Ϫ and parent -nt. one-electron oxidation of -nt can result in catalytically active imminoxyl radical. dt formation can outcompete tyrosine nitration at lowsteady state concentrations of peroxynitrite. it is unquestionable that very high fluxes of no • and o •Ϫ are requisite intermediates of peroxynitrite, a tyrosine nitration agent formed via tyro • . evidence for the existence of generalized stress within neurons includes the presence of protein peroxides (tyrooh), dt, and -nt. the nitration/denitration processes can be pathologic, but these also may play: ) a signal transduction role; ) a role of "blocker" for radical-radical reactions (scavenging of no • , no • and co •Ϫ by tyro • ); or ) a role of delimiting factors for peroxynitrite formation. it is still unknown whether oxidation/nitration of tyrosine (as dopamine precursor or protein residue) via tyro • formation, is a footprint of generalized stress and neuronal disorders, an important part of o •Ϫ and no • metabolism, or just a part of integral processes for maintaining neuronal homeostasis. the complete answer of these questions should be the first priority task of our recent search, wherein the problem of increased free radical formation in the brain and/or the imbalance of ratios: ros/rns/tyro • may be all important in determining neural cell and tissue injuries under pathological conditions resulted from generalized stress. [acknowledgements. this work was supported in part by kbn ( more than % of patients with type diabetes have coronary heart disease, related to silent ischemia, caused by an autonomic denervation of the heart in diabetic patients. oxidative damage to dna has been well documented in cardiac cells isolated from diabetic patients and rats with streptozotocin-induced diabetes mellitus (dm) . this dmmodel shows already seven days after onset of disease structural changes in vascular tissue typical for the development of atherosclerosis. this study evaluates possible molecular mechanisms for early events in the development of dm-induced cardiomyopathy. methods: using "expression array" we examined the activation of cardiac cell death in heart of dm-rats. ms-pcr was used to examine a differential dna methylation. results: an increased expression of genes encoding renin, angiotensinogen and p was detected in heart of dm-rats. substantial changes in the methylation status of the p dependent p waf /cip -gene and the cyclin d -gene were detected in dm-rats. conclusions: the renin-angiotensin system is upregulated with diabetes, and this may contribute to the development of cardiomyopathy via oxidative damage and p -dependent activation of cardiac cell death. this pathway includes de novo methylation of the p -inducible p waf /cip -gene encoding a protein which binds to and inhibits a broad range of cyclincyclin-dependent kinase complexes. (supported by "bio-rad" and "amersham pharmacia biotech") department of pharmaceutical biosciences, uppsala university, uppsala, sweden during the past decade studies have indicated that growth hormone (gh) may exert effects on the central nervous system (cns). for instance, gh replacement therapy was found to improve the psychological capabilities in adult gh deficient (ghd) patients. furthermore, beneficial effects of the hormone on certain functions, including memory, mental alertness, motivation and working capacity have been reported. likewise gh treatment of ghd children has been observed to produce significant improvement in many behavioural problems seen in these individuals. studies also indicated that gh therapy affects the cerebrospinal fluid (csf) levels of various hormones and neurotransmitters. further support that the cns is a target for gh emerges from observations indicating that the hormone may cross the blood-brain-barrier (bbb) and from studies confirming the presence of gh receptors in the brain. it was previously shown that specific binding sites for gh are present in discrete areas in the cns of both humans and rats. in peripheral tissues gh is shown to elicit its effects through a second mediator insulin-like growth factor (igf- ). igf- is well recognized as a protective agent against neural injury in the cns. the neuroprotective effect of this peptide has a broad spectrum affecting many brain regions and acts through its antiapoptopic effect. the production of igf- is upregulated in areas of brain damage and the igf- system may be an important part of an endogenous neuroprotective system. in spinal cord injuries, however, the content of igf- is reduced. we recently observed a neuroprotective effect of topical application of igf- in animals subjected to spinal cord trauma. the observed effect may be mediated via a mechanism involving nitric oxide. in the same animal model we have very recently observed a neuroprotective effect of gh. recent reports suggest that the level of gh is drastically reduced in patients with spinal cord injury. in victims of spinal cord injury the secretion of gh and igf- , as well, is known to be decreased. therefore, exogenous substitution of gh and igf- might be a promising approach in the future therapy of spinal cord injury victims. in fact, there is one report indicating that prolonged treatment with synthetic gh of spinal cord injured rats attenuates some of the neurological motor dysfunction seen in these animals weeks following trauma. in our animal model we observed that topical application of rgh significantly reduced traumainduced disturbances in the fluid micro-environment. we also noted that gh was capable of attenuating the trauma-induced depression of spinal cord evoked potentials. the mechanism by which gh exerts it neuroprotective effects will be discussed. chronically administered levodopa in parkinson's disease (pd) treatment is ultimately associated with alterations in motor response. in -hydroxydopamine lesioned hemiparkinsonian rats, chronic twice-daily administration of levodopa progressively shortens duration of contralateral turning and augments the period of turning at or below % of peak turning rate. the pathogenesis of the response alterations involves in part sensitization of the corticostriatal glutamatergic synaptic activity. characteristic changes involving interactions between striatal kinase and phosphatase signaling now appear to contribute to sensitization of spiny-neuron glutamatergic receptors. glutamate-mediated striatal dysregulation, subsequently, modifies basal ganglia output system in ways that favor the appearance of parkinsonian motor response complications. at a molecular level, transcriptional activation of striatal creb contributes to the persistent expression of the levodopa-induced motor response alterations. conceivably, a safer and more effective therapy for all stages of pd can be provided by drugs that target intracellularly on striatal kinases or phosphotases, or by agents that interact extracellularly on non-dopaminergic striatal receptors such as ampa and nmda, adenosine a , adrenergic a , opiod, and serotonergic b. the primary cause of parkinson's disease is a loss of dopamine in the corpus striatum. it has been postulated that this effect leads to disinhibition of the striopallidal pathway and, secondarily, to a functional shift towards glutamatergic stimulation. the aim of the present study was to find out whether inhibition of glutamatergic transmission at a level of metabotropic glutamate receptors (mglurs) in the striatum may alleviate parkinsonian-like symptoms in rats. the non-competitive antagonist of receptor subtype (mglur ), mpep ( . - mg/kg ip), or the agonist of group ii mglurs, ly ( - mg/kg ip), reduced the haloperidolinduced muscle rigidity and catalepsy. intrastriatal injections of the antagonist of mglur , (rs) aida ( . - µg/ . µl), but not of the agonist of group ii mglurs, r, r-apdc ( . - µg/ . µl), inhibited the muscle rigidity induced by haloperidol. in order to search for an influence of mglurs on the striopallidal pathway, the effect of mpep or of the agonist of group ii mglurs, dcg-iv, on the preproenkephalin mrna expression in the striatum was examined. the obtained results suggest that blockade of group i mglurs, or stimulation of group ii mglurs may be important to the amelioration of parkinsonian symptoms. striatal mglurs may contribute to at least some of these effects. several lines of evidence suggest an important role of glutamate in depression. the involvement of group i mglurs in depression has also been proposed. thus, we decided to evaluate whether group i mglurs antagonists have antidepressantlike effects. we also investigated if antidepressant treatment influences group i mglu receptors in the brain. the experiments were performed on male wistar rats ( - g) and male c bl/ mice ( - g). aida (group i mglurs antagonist) given i.v. in the dose of µg, decreased the immobility time in the despair test in rats. mpep (noncompetitive, systemically active mglur antagonist) given i.p., was not effective in the despair test in rats. however, in doses of . , and mg/kg, it significantly decreased the immobility time of mice in the tail suspension test. moreover, the deficit in passive-avoidance learning, which was observed in bulbectomized rats, was reversed by chronic, but not acute mpep ( mg/kg) treatment. prolonged imipramine treatment resulted in significant increase of the level of expression of mglu receptors in the ca field of the hippocampus, while prolonged electroconvulsive shock treatment (ect) enhanced significantly the chemiluminescence of mglu receptors in the ca field. the results indicate that group i mglu receptors are modified by chronic antidepressant treatment and that group i metabotropic glutamate receptors antagonists may play a role in the therapy of depression. (this study was supported by kbn grant no. .po a. . ) institute of pharmacology, polish academy of sciences, krakow, poland chronic exposure to nicotine, alcohol, opioids, sedatives, and cannabis results in development of drug dependence that becomes evident upon a cessation of drug administration and expresses itself as a withdrawal syndrome (with its physiological and motivational manifestations). adaptations at the nmethyl-d-aspartate receptor (nmda-r) complex have been observed in different brain areas during chronic exposure to, and upon withdrawal from, opioids, ethanol, benzodiazepines and barbiturates. behavioral studies employ the assessment of the effects of nmda-r antagonists on: a) the development of dependence (nmda-r antagonists are co-administered with the drug), b) the maintenance of dependence (nmda-r antagonists are administered to animals with pre-established dependence, and -most relevant to the clinical situation -c) on the expression of drug dependence (assessment of the withdrawal severity in subjects with nmda-r antagonists administered just before the expected emergence of withdrawal). the development of dependence to opioids and benzodiazepines is significantly retarded by nmda-r antagonists. studies from this laboratory demonstrate similar inhibition by nmda-r antagonists of the maintenance of opioid dependence. both in rodents and humans, the expression of opioid antagonist-precipitated as well as spontaneous (natural) withdrawal is inhibited by nmda-r antagonists, and animal data demonstrate similar inhibition of the expression of dependence produced by ethanol, barbiturates and benzodiazepines. the involvement of the excitatory amino acid, glutamate and the inhibitiory amino acid, gamma-amino butyric acid (gaba) in the pathophysiology of spinal cord trauma is not known in details. this investigation is focused on the involve-ment of glutamate and gaba in a rat model of spinal cord injury using immunohistochemistry. spinal cord injury induced by an incision into the right dorsal horn of the t - segments resulted in profound edema formation and cell damage in the adjacent t and t segments at h. pretreatment with h- / ( mg/kg, p.o.), a potent antioxidant compound, effectively reduced the edema formation and cell injury following trauma. at this time period, untreated traumatised rats exhibited a marked increase in glutamate immunoreactivity and a distinct decrease in gaba immunostaining in the t and t segments compared to the control group. the changes in glutamate and gaba immunoreactivity in traumatised rats were considerably attenuated by pretreatment with h- / . these results suggest that (i) oxidative stress contributes to alterations in glutamate and gaba in spinal cord injury, (ii) glutamate and gaba are contributing to edema formation and cell damage and (iii) the antioxidant compound h- / has a potential therapeutic value in the treatment of spinal cord injuries. dov pharmaceutical, inc., hackensack, new jersey, u.s.a. both preclinical (i.e., behavioral despair models) and clinical studies indicate that compounds reducing transmission at nmda receptors are antidepressant. conventional antidepressants may be viewed as "monoamine-based", increasing the synaptic availability of serotonin, norepinephrine, and/or dopamine. however, chronic administration of of conventional antidepressants alters both mrna levels encoding nmda receptor subunits and radioligand binding to this family of ligand-gated ion channels in circumscribed areas of the cns indicating that nmda receptors may be a downstream target of these monoamine-based agents. we have recently reported (li, et al., neuropharmacology, in press ) that a class of ampa receptor potentiators also exhibits antidepressant-like actions in preclinical models. in this presentation, i will describe how these two distinct, and (at a cellular level) seemingly diametric approaches employing glutamatergic mechanisms converge on intracellular targets that are also impacted by chronic treatment with biogenic amine-based agents. kainic acid is an essential pharmacological tool for many forms of neurobiological research. until several years ago, all commercially available kainic acid was derived from a single biological source (digenia simplex). commercial isolation of kainic acid in japan ceased in , creating a void in the marketplace. recently several different companies have become providers of kainic acid, but each uses a different source of the compound ( biological and synthetic) and different isolation procedures. our objective was to use three common assay systems to evaluate the comparative pharmacological and neurotoxicological properties of these three sources of kainic acid. dose response curves, both alone and in the presence of receptor selective antagonists, were constructed for each kainate formulation using (a) cerebellar granule neurons in culture, (b) isolated hippocampal slice preparations, and (c) whole animal behavioural toxicity studies. preliminary results reveal many similarities, but also distinct differences between the three formulations, especially when challenged with antagonists for different eaa receptors. full results will be presented and discussed with respect to their implications for both extending the known kainite literature and for future studies employing kainic acid as a ligand in both mechanistic investigations and in animal models of neurodegenerative disease. our results from in vitro studies further elucidate the role of cell-cycle related proteins in neuronal apoptosis induced by excitotoxins. exposure of primary cerebellar neurons to toxic concentrations of glutamate was found to produce a significant, short lasting increase in the expression of p and cdc . transcriptional activity of p was shown by increased p dna binding activity and by the concomitant induction of the cdk inhibitor p , the cell cycle regulator gadd and the apoptotic induced bax. cell-cycle proteins are also expressed concomitantly to dna damage in neurons undergoing excitotoxic degeneration. we found that excessive activation of glutamate receptor by nmda results in the formation of -oh-deoxyguanosine, which is a marker of oxidative dna damage. in addition, the expression of the dna repair factor msh increases in cultured cerebellar neurons or in ca pyramidal cells that have been challenged with excitotoxins. excitotoxicity may thus provide a further example of how re-expression of cell-cycle proteins might be tightly connected to dna damage and repair in neurons. rush-presbyterian-st. luke's medical center, chicago, u.s.a. patients with parkinson's disease by definition benefit from levodopa therapy. however, after years of therapy % of patients experience motor response complications (mrc's): the benefit from each dose becomes shorter (wearing-off), more unpredictable (on-off) and associated with involuntary movements (dyskinesias). when dyskinesias first arise, they are associated with high levodopa levels and may be prevented or minimized by lowering levodopa intake. later on, the therapeutic window of levodopa narrows progressively and dyskinesias occur at doses equal to those needed to induce an antiparkinson effect. while the pathogenesis of motor complications remains incompletely understood, recent clinical studies implicate mechanisms downstream from the degenerating nigrostriatal dopamine system, possibly involving glutamatergic projections to the basal ganglia. in a rat model of pd, blockade of striatal glutamate receptors of the n-methyl-d-aspartate (nmda) subtype reverses levodopa-induced motor fluctuations. similarly, in -methyl- -phenyl- , , , tetrahydropyridine (mptp) lesioned primates, several nmda-antagonists reduce levodopa-associated dyskinesias. in parkinsonian patients the nmda-antagonists dextromethorphan, dextrorphan and amantadine improve dyskinesias as well. these findings have lead to the suggestion that hyperfunction of nmda receptors on striatal efferent neurons, as a consequence of chronic non-physiologic dopaminergic stimulation, contributes to the pathogenesis of motor response complications. protein misfolding and aberrant polymerization are salient features of virtually all central neurodegenerative disorders, including alzheimer's disease (ad), parkinson's disease, polyglutamine diseases, tauopathies, and prion diseases. in many instances, a single amino acid change can predispose to disease by increasing the production and/or changing the biophysical properties of a specific protein. possible pathogenic similarities among the cerebral proteopathies suggest that therapeutic agents interfering with the proteopathic cascade might be effective against a wide spectrum of diseases. however, testing compounds preclinically will require diseaserelevant animal models. numerous transgenic mouse models of ad-like pathology have now been produced. our studies have found that tg mice overexpressing human -amyloid precursor protein (huapp k n/m l) produce copious deposits of diffuse and compact -amyloid as they age, and that females are more susceptible than are males (callahan et al., am. j. pathol. , - , . recently, we also found that the overexpression of p protein, an activator of the kinase cdk , results in tau hyperphosphorylation, axonopathy and severe motor deficits in transgenic mice, in the absence of neurofibrillary tangles. none of the existing transgenic models of -amyloidosis or tauopathy fully recapitulates the pathology of ad. in an attempt to more authentically model the human disease, we infused dilute ad-brain extracts into tg mice at -months of age (i.e. - months prior to the usual onset ofamyloid deposition). we found that infusion of ad brain extracts results in: ) earlier and more abundant deposition of -amyloid in app-transgenic mice (kane et al., j. neurosci. , - ); ) evidence for the spread of pathology to other brain areas, possibly by neuronal transport mechanisms; and ) tau hyperphosphorylation (but not neurofibrillary pathology) in axons passing through the injection site. the seeding ofamyloid by ad brain extracts suggests pathogenic similarities between -amyloidoses such as ad and other cerebral proteopathies, and could provide a new model for studying the proteopathic cascade and its neuronal consequences in neurodegenerative diseases. supported by warner-lambert/pfizer. purpose: the effects of essential amino acid deficiencies on function of cornea and lens were investigated. methods: dietary deficiencies of tryptophane and methionine were studied in young rats over months. transparency of cornea and lens were evaluated using slitlamp microscope and scheimpflug camera. after sacrifice, lens fresh weight and crystallin patterns were determined to evaluate effects on lens growth and protein synthesis. results: methionine deficiency had no effect on the parameters investigated. tryptophane deficiency caused severe loss of body weight in both rat strains (brown-norway, bn; sprague-dawley, sd), sd rats also lost their hair. they developed corneal neovascularisations and cortical cataracts. bn rats developed faint neovascularisations and a discontinuity zone in the lens. diet intermission arrested pathological processes restarting when feeding diet again. this observation is supported by lens fresh weight data. dna staining evidenced that tryptophane deficiency arrested lens fiber maturation. conclusion: a difference has been found for essential amino acids in their effects on transparency of cornea and lens. tryptophane deficiency stimulated corneal neovasculariseration, but arrested lens fiber cell maturation. the difference in reaction of cornea and lens to tryptophane deficiency between bn and sd rat eyes remains to be elucidated. dynorphin is a neuropeptide that is present in the dorsal horn of the spinal cord. the peptide is actively involved in pain processing pathways. however, its involvement in spinal cord injury is not well known. alteration in dynorphin immunoreactivity occurs following a focal trauma to the rat spinal cord. infusion of dynorphin into the intrathecal space of the cord results in ischemia, cell damage and abnormal motor function. antibodies to dynorphin when injected into the intrathecal space of the spinal cord following trauma improves motor recovery and reduces edema and cell changes. however, influence of dynorphin on trauma induced alteration in spinal cord bioelectrical activity is still not known. spinal cord evoked potentials (scep) are good indicator of spinal cord conduction that are altered following trauma. therefore, in present investigation, influence of dynorphin antibodies on trauma induced changes in scep was examined in our rat model. in addition, spinal cord edema formation and microvascular permeability disturbances were also investigated. our results show that intrathecal administration of dynorphin antiserum prior to injury has a beneficial effect on trauma induced electrical activity, microvascular permeability disturbances, and edema formation. these observations indicate that dynorphin is somehow involved in the altered bioelectrical activity of the spinal cord and participates in the pathophysiological processes leading to cell injury. fatty-acid binding proteins (fabps) are involved in the intracellular binding, targeting and transport of long-chain fatty acids (fas) to modulate cell growth and/or differentiation. fabp form a family of proteins displaying tissue-specific expression. the expression of brain type fabp (b-fabp) is spatially and temporally correlated with neuronal differentiation during brain development. heart type fabp (h-fabp) is widely distributed and present in skeletal muscles, kidney, lung, brain and endothelial cells. it is neuron-specific in postnatal brain and participates in neurite formation and synapse maturation. epidermal type fabp (e-fabp) is expressed at high levels during neurogenesis, neuronal migration, and terminal differentiation. although all three fabps could be involved in normal brain function in prenatal and postnatal life, a neurobiological role of fabps in neurodegenerative diseases has not been reported yet. these made us evaluate the protein levels of fabps in brains from patients with down syndrome (ds) and alzheimer's disease (ad) and fetal cerebral cortex with ds using two-dimensional ( -d) gel electrophoresis with subsequent matrix-assisted laser desorption ionization mass spectroscopy (maldi-ms) identification and specific software for quantification of proteins. in fetal brain, b-fabp and e-fabp levels were comparable between control and ds. in adult brain, b-fabp was significantly increased in occipital cortex of ds, and h-fabp was significantly decreased in ds (frontal, occipital, parietal cortices) and ad (frontal, temporal, occipital and parietal cortices). we conclude that aberrant expression of fabps, especially h-fabp in neurodegenerative diseases could be involved in impaired neurite outgrowth and synapse maturation. in our previous paper, it was shown that gaba-a receptor antagonist picrotoxin suppressed etoh (ethanol) selfadministration. recently, several authors indicated that systemic injection of dopamine or serotonine agonists reduced ethanol drinking in rats. therefore, in the present study we investigated the effects of thip ( , , , -tetrahydroizokasazolo, , -c pyridin- -ol) gaba-a receptor agonist in naive and long-term ethanol-experienced rats on etoh selfadministration and on cardiovascular system. adult - week-old male, normotensive wistar-kyoto (wky) and spontaneously hypertensive rats (shr) were used. naive rats were examined according to smith method. long-term ethanolexperienced rats were studied according to boyle method. thip was injected in naive rats at a dose of and mg/kg i.p. metabotropic glutamate receptors are coupled to phospholipase c stimulation and adenylyl cyclase inhibition through g-proteins. c glioma cells, that endogenously express the phospholipase c coupled metabotropic glutamate receptor type, were treated with different specific agonists of these receptors and the effect of these treatments on different components of metabotropic glutamate receptor pathway was studied by radioligand binding, phospholipase c activity and rt-pcr assays. agonists treatment caused a decrease in l-[ h]glutamate binding to intact cells and membranes in a time dependent manner being maximum at - hours and recovered at - hours. this decrease was associated with a significant increase in the mrna level coding mglurs. no changes on g q/ mrna level were detected in any case. however, a significant decrease in l-glutamate stimulated phospholipase c activity was detected after agonist treatments in both membranes and intact cells. this decrease was not associated to significant variations in mrna level coding phospholipase c isoform. all these results suggest that agonist exposure causes a desensitisation of glial metabotropic glutamate receptor decreasing not only receptors number but its functionality. in this study the interaction between these two nuclei were investigated by means of microinjection and microdialysis techniques in sprague-dawley rats. steroetaxic surgery was performed by placing intracerebral parenchymal microinjection cannula into the right dmh and microdialysis probe into the left pvn. iliac artery was also cannulated to monitor the pulsatile blood pressure and heart rate by means of pressure transducer connected to a polygraph microinjection of pmol nmda into the dmh was performed and microdialysis perfusates were collected simultaneously from the pvn in conscious rat model. γ-aminobutyric acid (gaba) and l-glutamic acid levels were analyzed by an isocratic hplc (high pressure liquid chromatography) method with the aid of a fluorescent detector. microinjection of pmol nmda into dmh produced significant increases in mean arterial pressure and heart rate. nmda microinjection into the dmh produced significant increase in l-glutamic acid release in the pvn, but no significant change in gaba release was observed. these results suggest that stimulation of dmh by nmda results in subsequent stimulation of the pvn. [this study was sponsored by marmara university research foundation (project no: /sag/ ).] and in long-term ethanol-experienced rats only at a dose mg/ kg i.p. control group (cg) received saline ml/kg i.p. as can be seen in fig. and table the lower consumption of ethanol in shr in comparison to wky rats was observed. systemic injection of thip decreased dosedependently etoh intake in naive rats of both strains. this effect was more pronounced in shr (fig. ) . similar phenomenon was observed after thip injection in long-term ethanolexperienced rats. there were no effect on systolic blood pressure and heart rate after thip treatment. born-bunge foundation, university of antwerp, and university of ghent, belgium increased neuronal excitability may underlie some of the neurological complications in uremic patients. in an effort to identify candidate neuroexcitatory compounds, different uremic retention solutes, including several amino acids and amino acid derivatives, were applied to mouse spinal cord neurons in primary dissociated cell cultures. using the tight-seal whole-cell technique, a few of the candidate toxins were shown to evoke whole-cell currents in cells clamped at Ϫ mv. in a first survey, each of the solutes was briefly applied in a concentration of mm. significant inward whole-cell currents were evoked by guanidinosuccinate, spermine, and -indoxyl sulfate, whereas phenol evoked an outward current. further experiments indicated that guanidinosuccinate-evoked whole-cell currents were due to activation of nmda-type glutamate receptors in concentrations similar to those found in uremic patients. high (mm) concentrations of spermine activated voltage-gated calcium channels, whereas low (µm) concentrations were found to potentiate guanidinosuccinate-evoked currents through its action on the nmda receptor-associated polyamine binding site. whole-cell currents evoked by indoxyl sulfate or phenol seemed to be due to complex interaction with several different ion channels. we conclude that guanidinosuccinate-evoked nmda receptor activation, possi-bly potentiated by the neuroexcitatory effects of polyamines and other putative uremic neurotoxins, could be an important mechanism underlying the increased neuroexcitability in uremic brain. glutamine (gln) is one of the key metabolites in the cns (energy metabolite, precursor of neurotransmitter amino acids, end product of ammonia detoxication, osmolyte), and as such is a routine supplement of cns cell culture media. c glioma cells relatively easily adapt to culturing in a gln-deprived medium. the present study investigated the effects of gln deprivation on the characteristics of the different systems that mediate gln cell membrane transport in the cells. in contrast to a variety of cns and non-cns cells, the absence of gln did not derepress the methyl-amino-isobutyric acid (meaib)-sensitive ("system adependent") uptake. system asc became relatively more-, and system n less active than in cells grown in the presence of gln, but the ion -and substrate specificity of the uptake remained unaltered. system asc in c cells grown in a glnsupplemented medium shows two features distinct from most other cell types: a) strong ph sensitivity and b) partial tolerance of lithium substitution, pointing to domination of system asct -an asc variant strongly expressed in cultured astrocytes. cells grown in gln-deprived medium lost lithium tolerance, but not ph-dependence of the uptake, their properties thus resembling system glnt (sat ), a neuron-specific variant of system a. by contrast, transport of threonine, a standard asc system substrate, was not affected by gln deprivation and showed neither ph dependence nor lithium tolerance, which is typical of an asc in all the non-cns tissues. (supported by scsr grant no. p a .) the classical the hypothalamic-neurohypophysial system (hns) is comprised of neurons originating within the supraoptic nucleus (son) which project to the neurohypophysis to release the nonapeptides oxytocin (oxt) and vasopressin into the blood after appropriate stimulation. previous experiments have shown that a single social defeat experience triggers the release of oxt from somata and dendrites into the extracellular fluid of the son, but not from axon terminals in the neurohypophysis. to further investigate the regulatory mechanisms underlying this dissociated release, we exposed male wistar rats to a -min social defeat experience and monitored the release of the inhibitory amino acids gamma amino butyric acid (gaba) and taurine into the son using microdialysis. social defeat caused a significant increase of the intra-son pre-stress basal release). to reveal the physiological significance of the intrahypothalamically released gababicuculline, a specific gaba a -receptor antagonist -was administered into the son by retrodialysis. this treatment increased significantly the release of oxt both within the son ( %; p Ͻ . vs. pre-stress basal release) and -as measured via chronically implanted jugular venous catheters -into blood under basal and stress conditions (up to %; p Ͻ . vs. prestress basal release). however, bicuculline did not affect plasma vasopressin. these data demonstrate that gaba is released within the son during social defeat to act as an inhibitor of both, central and peripheral oxt secretion during emotional stress. the mechanism described here contributes to the regulatory capacity of the hns to ensure the appropriate involvement of oxt in the stress response of the animal (supported by dfg, en - ). down syndrome (ds) is the most common human chromosomal abnormality caused by an extra copy of chromosome and characterized clinically by somatic anomalies, mental retardation and precocious dementia. the phenotype of ds is thought to result from overexpression of a gene or genes located on the triplicated chromosome or chromosome region. reports that challenge this notion, however, have been published. to add to this body of evidence, the expression ofamyloid precursor protein (app), ets- and collagen α (vi) chain precursor, encoded on chromosome , was investigated in fetal brain by western blot and two-dimensional electrophoresis ( -de). western blot detected app and ets- that migrated at ϳ and kda, respectively. subsequent densitometric analysis of app and ets- immunoreactivity did not produce any significant change between controls and ds. since the metabolic fate of app determines the propensity of amyloid production, the expression of the secreted forms of app (sapp) had been examined. neither the expression of sappα nor sapp showed any detectable changes among the two groups. collagen α (vi) chain precursor, a protein resolved as a single spot on d gel was identified by matrix associated laser desorption ionization mass spectroscopy. quantitative analysis of this spot using the d image master software revealed a significant decrease in fetal ds (p Ͻ . ) compared to controls. linear regression analysis did not show any correlation between protein levels and age. the current data suggest that overexpression per se can not fully explain the ds phenotype. apoptosis is the mechanism by which cells are programmed to die under a wide range of physiological and developmental stimuli. accumulating evidence indicates that enhanced apoptosis (programmed cell death) in down syndrome (ds) may play a role in mental retardation and precocious neurodegeneration of the alzheimer-type. in this regard, alteration of several apoptosis related proteins have been reported in adult ds brain. fetal ds neurons exhibited increased reactive oxygen species leading to early apoptosis, however, expression of apoptosis related proteins in fetal ds, has never been considered. to address this issue, we investigated the expression of proteins involved in apoptosis including fas (cd , apo- ), caspase- , bcl- and annexins in the cerebral cortex of control and ds fetal brain by western blot and two dimensional electrophoresis. here, we report that no detectable changes were obtained in fetal ds brain in the expression of fas, caspase- , bcl- and annexins (i, ii, v, and vi) compared to controls. in parallel experiment, we also examined the expression of neuron specific enolase (nse), a neuronal marker found to be decreased in adult ds brain, to see if there is any neuronal loss and no difference was observed between the two groups. protein expression did not correlate with age. the unchanged levels of fas, bcl- and annexins together with unaltered caspase- expression, a predominant caspase that executes apoptosis in the developing nervous system, suggest that enhanced apoptosis may not be apparent in fetal ds brain as demonstrated for adult ds brain. introduction. among the various metabolites indicating neuronal damage, amino acids are regarded particularly important. detection of amino acids by microdialysis is currently introduced as a neuromonitoring tool in patient care. here, we present changes in the extracellular concentrations of various amino acids in stroke patients and in experimental stroke in cats. method. cat focal ischemia was produced by occlusion of the middle cerebral artery (mca) for h followed by h reperfusion. glutamate, aspartate, gaba, taurine, glycine, serine, glutamine, methionine, threonine, tyrosine, asparagine, valine, phenylanaine, isoleucine and leucine were sampled by microdialysis in the ischemic core and subsequently analyzed by hplc. human microdialysis was performed in patients with large mca infarction. the microdialysis probes were inserted into primarily non-infarcted tissue in the border zone of the ischemic territory. results. transmitter amino acids rose immediately after occlusion in the cat model. correspondingly, these substances increased sharply in the human brain, when the tissue around the probes became infarcted, as shown by positron emission tomography (pet) and ct scan. in contrast, structural amino acids did not show marked increases or even decreased during severe ischemia in both, experimental ischemia and stroke patients. these substances did increase, however, when the brain tissue was only slightly ischemic, i.e. after reperfusion of the cat brain, when brain swelling occurred, or in human brain, when tissue did not show any infarction in the ct scan but hypoperfusion in the pet image. conclusion. extracellular amino acids detected by microdialysis can serve as markers for secondary ischemia. severe ischemia is reflected by rapid increases of transmitter amino acids, due to various mechanisms including synaptic release and reversal of reuptake systems. oligemia seems to be reflected by slow increases of structural amino acids, possibly due to a reduction in cerebral protein synthesis. apoptosis has been implicated in the selective neuronal loss of down syndrome (ds). apoptosis activates a family of cysteine proteases with specificity for aspartic acid residues referred to as, caspases that play a key role in dismantling a cell committed to die. caspases activity is regulated by a variety of proteins that possess a domain resembling the prodomains of caspases. little is known, however, about the changes of caspases and their regulatory proteins in ds. here, we investigated levels of nine such different proteins by western blot technique in frontal cortex and cerebellum of control and ds subjects. the protein levels of dff (dna fragmentation factor ), and flip (fadd like interleukin- -converting enzyme inhibitory proteins) were significantly decreased whereas that of rick (rip-like interacting clarp kinase) increased in both regions of ds. in contrast, cytochrome c, apaf- (apoptosis protease activating factor- ), procaspase- and arc (apoptosis repressor with caspase recruitment domain) were unchanged. procaspase- and - were significantly decreased in frontal cortex but no significant change was observed in cerebellum. regression analysis revealed no correlation between postmortem interval and levels of the investigated proteins. however, inconsistent correlation was found between age and levels of proteins as well as amongst the density of individual proteins. these findings demonstrate that dysregulation of apoptotic proteins does exist in ds brain and may underlie the neuropathology of ds. the study further suggests that apoptosis in ds may occur via the death receptor pathway independent of cytochrome c. hence, therapeutic strategies that target caspase activation may prove useful in combating neuronal loss in this disorder. in order to examine the differential roles of nitric oxide (no) induced by either endothelial no synthase (enos) or neuronal no synthase (nnos) after transient cerebral ischemia, we investigated the effects of the relatively selective cnos inhibitor, l-n -( -iminoethyl)ornithine (l-nio), the relatively selective nnos inhibitor, -nitroindazole ( -ni) and the no scavenger, -( -carboxyphenyl)- , , , tetramethylimidazole- -oxyl -oxide (ptio) on hippocampal dysfunction caused by cerebral ischemia. we measured no concentration, mean arterial blood pressure (mabp), hippocampal blood flow, direct current potential, ca population spike (ps) and release of amino acids from rat hippocampus after transient forebrain ischemia, which was induced by vessel occlusion for min. l-nio ( mg/kg), -ni ( mg/kg) and ptio ( mg/kg) were administered intraperitoncally min before ischemia. ptio, -ni and l-nio reduced ischemiainduced no production in the hippocampus during the early period of reperfusion. the rank order of inhibitory potency was ptio Ͼ -ni Ͼ l-nio. l-nio, but not -ni, reduced hippocampal blood flow during ischemia and increased mabp before, during and after ischemia, compared with the vehicle group. ptio increased mabp during and after ischemia. ptio and -ni, but not l-nio, reduced amplitude of anoxic depolarization induced by ischemia. -ni recovered in part ps amplitude min after ischemia. -ni, but not l-nio, reduced ischemiainduced release of aspartate and glutamate, but not taurine. the present study provides further evidence for the idea that in the early stages of transient forebrain ischemia, enos-derived no has a neuroprotective effect in the hippocampus, while nnos-derived no has a neurotoxic effect. the estrogen affects brain protein synthesis in ovariectomized female rats k. hayase , m. tanaka , k. tujioka , e. hirano , and h. yokogoshi department of home economics, aichi university of education, kariya, aichi, and laboratory of nutritional biochemistry, school of food and nutritional sciences, the university of shizuoka, japan the purpose of this study was to determine whether -estradiol affected the rate of brain protein synthesis in ovariectomized female rats. experiments were conducted on three groups of wk old female rats: group . ovariectomized to reduce the level of plasma estradiol; group . ovariectomized and treated with estradiol; and group . sham-operated control. the fractional rates of protein synthesis in brain of ovariectomized rats treated with estradiol were significantly greater than in ovariectomized rats without estradiol treatment. in brain, the rna activity [g protein synthesized/(g rnaᮀd)] significantly correlated with the fractional rate of protein synthesis. the rna concentration (mg rna/g protein) was not related to the fractional rate of protein synthesis in any organ. the results suggest that estrogen treatment of ovariectomized female rats is likely to increase the rate of protein synthesis in the brain, and that rna activity is at least partly related to the fractional rate of brain protein synthesis. we have synthesized a series of new peptides that have demonstrated potent antidepressant activity in animal models for depression and in phase iia and iib clinical trials. mif- (prolyl-leucyl-glycinamide) an endogenous brain peptide has been reported to have some clinical activity in patients with unipolar depression with few apparent side effects. we have undertaken a study to determine the effect of molecular structural changes on the antidepressant activity of this peptide. we evaluated our new derivatives in a stress-induced animal model for depression, i.e. porsolt test, we have found that -f-phe- -oh-pro-arg-gly-trp-nh (inn ) is superior in all the statistical parameters used. in comparative testing inn was more active than prozac (fluoxetine) and zoloft (sertraline) in our antidepressant model. a u.s. patent has been granted on these compounds. the clinical results of inn show that it is effective in over % of depressed subjects when blood levels exceeded therapeutic threshold with no significant side effects. inn has a rapid onset of action, - days (vs. - weeks for currently marketed products) with sustained effects for months following to doses over - weeks. h. iwama , , a. umino , a. hashimoto , k. takahashi , n. yamamoto , and t. nishikawa , section of psychiatry and behavioral science, tokyo medical and dental university graduate school, and department of mental disorder research, national institute of neuroscience, ncnp, tokyo, japan using an in vivo dialysis technique, we have studied the extracellular contents of endogenous free d-serine in comparison with that of l-serine, glycine and l-glutamate in the brain of the freely moving rat. a high amount of d-serine was detected in the dialysate obtained from the medial prefrontal cortex and striatum, whereas the cerebellar dialysate contained only a trace concentration of the d-amino acid. intra-medial prefrontal cortex perfusion of a sodium channel activator, veratrine, augmented the extracellular release of glycine and l-glutamate but a slight decrease in that of d-serine in a tetrodotoxin-sensitive manner in the prefrontal area. moreover, selective destruction of neuronal cell bodies in the medial frontal region by means of local infusion of an excitotoxin quinolinate resulted in a marked reduction of extracellular and tissue levels of d-serine in the infused prefrontal region. these findings suggest that endogenous d-serine might be liberated into the extracellular space from non-neuronal cells or certain exceptional neuronal cells probably by a carrier-mediated process in the mammalian prefrontal cortex. also, the endogenous d-amino acid has been indicated to be accumulated or synthesized, at least in part, in the neuronal cells. nucleoside diphosphate kinase (ndpk) catalyzes a transfer of the terminal phosphate from nucleoside triphosphates ((d)ntps) to nucleoside diphosphates ((d)ndps) and has been suggested to be involved in the regulation of wide variety of cellular functions. in addition, ndpk isoforms (a and b) are encoded by nm genes (h and h ) , which are related with the metastatic potential of some tumors. although ndpk/ nm has been also implicated to modulate neuronal cell proliferation, differentiation and neurite outgrowth, a neurobiological role of ndpk/nm in neurodegenerative diseases has not been reported yet. here we evaluated the protein levels of ndpk-a/nm -h in brains from patients with down syndrome (ds) and alzheimer's disease (ad) using twodimensional ( -d) gel electrophoresis with subsequent matrixassisted laser desorption ionization mass spectroscopy (maldi-ms) identification and specific software for quantification of proteins. ndpk-a/nm -h was significantly decreased in brain regions (frontal, occipital, parietal cortices) of both ds and ad compared to controls. we conclude that the down-regulated ndpk-a/nm -h upon neurodegeneration could play a pivotal role in a wide range of neurobiological functions such as neurite outgrowth and consequently these could result in functional disturbance of the nervous system in ds and ad. brain α-endosulfine is manifold decreased in brains from patients with alzheimer's disease: a tentative marker? and drug target? α-endosulfine has the sulfonylurea-like ability to block atp-sensitive potassium (k atp ) channels and is expressed in a wide range of tissues. although the blockade of k atp channels has been reported to be involved in the release of neurotransmitters, the neurobiological role of α-endosulfine has not been studied yet. we examined the levels of αendosulfine protein in frontal cortex and cerebellum from patients with alzheimer's disease (ad). α-endosulfine was extremely decreased in both regions of ad compared to controls. this could result in the continuous opening of k atp channels with subsequent decrease of neurotransmitters release and change of potassium fluxes. this study is of great significance for providing a neurobiological function of α-endosulfine in brain and furthermore, α-endosulfine could serve as a useful marker for the diagnosis of ad and a target for drug treatment. children's hospital heidelberg, and department of pharmacology and toxicology, university of marburg, germany d- -hydroxyglutaric aciduria is an inherited neurometabolic disorder of unknown etiology characterized by progressive neurodegeneration of vulnerable brain regions during infancy and early childhood, resulting in psychomotor retardation, hypotonia, seizures, macrocephaly, enlarged lateral ventricles, delayed cerebral maturation as frequent neurological presentation in affected children. the disease is biochemically characterized by the accumulation of d- hydroxyglutarate (d- ), which is structurally similar to lglutamate (ϭ -amino-glutarate). we therefore investigated in primary neuronal cultures from chick and rat, whether d- induces excitotoxic neuronal damage. here we report that d- decreased cell viability in a concentration-and time-dependent fashion by disturbance of intracellular calcium homeostasis as determined by fura- measurement. furthermore, fluorescence microscopy using dihydrorhodamine- revealed an increased generation of reactive oxygen species (ros) elicited by expo-sure to d- . n-methyl-d-aspartate (nmda) receptor blockade specifically prevented excitotoxic neuronal damage as well as increased calcium influx and ros production, suggesting that d- is an agonist at nmda receptors. patch-clamp investigation confirmed that d- activated recombinant nmda receptors in hek cells. furthermore, activity measurement of single respiratory chain complexes revealed a specific inhibition of complex v activity by d- . we conclude that excitoxic mechanisms contribute to the neuropathology of d- hydroxyglutaric aciduria, highlighting new neuroprotective strategies for this neurometabolic diseases. these studies were designed to determine the effects of aging and an aging intervention on nmda subunit expression. in situ hybridization and receptor autoradiography were performed on naïve or behaviorally-tested c bl/ mice of different ages ( , - , and - months old) and diet groups (ad lib-fed and diet restricted). there were age-related decreases in both e and z mrna density in naïve, ad lib-fed mice. correlations were found between changes in e subunit mrna and agonist binding and z mrna expression and antagonist binding. diet restriction significantly improved learning ability, slightly spared glutamate binding to nmda sites and improved z mrna expression in older mice. significant correlations were found between agonist binding and both learning ability and e and e mrna density. learning ability in the old mice also correlated with the ratios of mrna expression for e and e and/or z subunits. these results suggest that changes in nmda receptor binding and the relationship between subunit expression levels are important for maintaining memory functions in older animals. extracts of st john's wort (hypericum perforatum l.) are widely prescribed for the treatment of mild to moderate depression and the putative antidepressant constituent is probably hyperforin. in this study the effect of hyperforin was investigated on the release of neurotransmitter amino acids. coronal cortical slices ( mm) were cut and perfused with gassed ( % o , % co ) acsf at °c. two-minute samples of perfusate were collected and aspartate and glutamate were assayed by hplc. potassium-and veratridine-stimulated release was elicited by administering pulses of kϩ ( mm) or veratridine ( mm) minutes apart. in control experiments the second kϩ pulse elicited glutamate release which was % of the first pulse. hyperforin ( mm) perfused for minutes prior to, and during, the second kϩ pulse significantly increased glutamate release to % (p Ͻ . , n ϭ - ). release elicited by the second veratridine pulse was % of the first pulse for both glutamate and aspartate. hyperforin ( mm) increased this release to the second pulse to % and % respectively (p Ͻ . , n ϭ - ). when perfused on its own for minutes, hyperforin ( mm) increased the basal release of glutamate (p Ͻ . , n ϭ - ). in conclusion, the increase in the release of neurotransmitter amino acids observed following hyperforin is possibly mediated through a facilitatory action on voltage-operated ca ϩ or naϩ channels. glaxosmithkline group, glaxo wellcome s.p.a., medicines research centre, verona, italy n-methyl-d-aspartate (nmda) receptors are ligand gated ion channels widely distributed in mammalian brain, which play a crucial role in important physiological mechanisms, such as excitatory transmission, neuronal migration and memory formation. a peculiar feature of nmda receptors is the absolute requirement of l-glutamic acid and glycine for the opening of the channel. noteworthy, these two aminoacids reciprocally modulate binding at their respective recognition sites. aim of this work was to study nmda receptor glycine/glutamate interactions in rat and human brain. binding of the nmda antagonist [ h]cgp to rat cerebral cortical membranes was inhibited by glycine. the overall effect of glycine consisted in a decrease of [ h]cgp affinity, with a parallel increase of the receptor affinity for glutamate. the glycine antagonist gv a competitively reversed glycine inhibition, proving that the modulation was via the glycine binding site. [ h]cgp binding to rat brain sections revealed the existence of regionally distinct nmda receptor subtypes with difference glycine/glutamate interactions. in most regions of the human brain nmda receptors presented the same allosteric modulation of [ h]cgp binding, as revealed by large section autoradiography technique. nevertheless, detection of any regional variation was not possible, probably due to the high intersubject variability. the effect of long-term high k ϩ -treatment on neuronal survival, cellular maturation, nmda receptor (nr) splice variant expression, and receptor function was investigated in primary cultures of rat cortical neurones. long-term incubation (up to days) with mm k ϩ significantly increased neuronal survival and induced multiple morphological changes associated with promoted cellular maturation. cultures grown in medium containing mm k ϩ also exhibited multiple changes in nr splice variant expression according to rt-pcr studies performed with primer pairs flanking the alternatively spliced regions, in order to estimate the ratios of the corresponding Ј and Ј splice variants. nr - and nr - (each containing exon ) were decreased, whereas nr - and nr - (each lacking exon ) were increased, accordingly. the predominant expression of nr -b was further increased. after administration of ttx, each of the k ϩ -induced changes on mrna expression was virtually abolished. in voltage-clamp recordings (holding potential: Ϫ mv), nmda induced inward currents in a concentration-dependent manner with a maximum effect of Ϫ pa under control conditions. neurones treated with mm k ϩ showed a significantly diminished response to nmda (max. response: Ϫ pa). in conclusion, the present data indicate that a sustained increase in neuronal activity induces adaptive changes in nr splice variant expression and a decrease in receptor function. thus, alternative splicing associated with a diminished receptorcytoskeletal linkage may be important compensatory mechanism in preventing cellular damage due to long-term activation of excitatory nr. it seems conceivable that this mechanism contributes to the promoting effects of mm k ϩ on neuronal survival and maturation. (supported by bmbf grant gg / ) there is accumulating reports that kainate-induced seizures elicit expression of various heat-shock proteins (hsps) in the brain, such as hsp , hsp , and hsp . however, no investigation has been carried out on changes in level of apg- , a member of hsp family, after excitatory amino acid-induced seizures. by means of an immunoblot assay, we determined the levels of hsp and apg- in discrete brain structures of mice after a single intraperitoneal injection of kainate or nmda. apg- level was significantly decreased in the frontal cortex, hippocampus, and striatum days after kainate administration, while hsp level was increased in these regions following the administration. decreased level of apg- returned to the control levels in the three regions days after kainate administration. no significant changes were observed in levels of both hsp and apg- in hypothalamus, midbrain, medulla-pons, and cerebellum of kainate-treated mice. by contrast, nmda administration did not significantly affect both levels in any of the regions examined. these results suggest that, unlike the case of hsp , apg- expression could be temporarily down regulated by signals peculiar to kainate, but not by those peculiar to nmda, in murine telencephalon. high concentrations of glucagon-like peptide- ( - ) amide (glp- ) and its specific receptor (glp- r) have been found in the rat hypothalamus. in this study the actions of glp- and its related peptides, exendin- (glp- r agonist), exendin ( - ) (glp- r antagonist) and glp- ( - )amide (major glp- metabolite) on levels of amino acids (glu, asp, gln, gly, tyr, trp, gaba) in the hypothalamus were investigated. i-glp- binding in rat hypothalamic membranes was competed by the peptides in the following order of potency; glp- Ͼ exending- Ͼ exendin ( - ) Ͼ glp- ( - )amide. intracerebroventricular (icv) glp- ( nmoles) produced a statistically significant reduction in levels of all measured amino acids compared with saline injected controls, whereas nmoles of exendin ( - ) was ineffective. exendin- produced a statistically significant reduction in the levels of trp, glu, and tyr. glp- ( - )amide showed a statistically significant increase in the level all the amino acids tested in this study. prior administration of exendin ( - ) or glp- ( - )amide blocked the effects of glp- on the levels of the amino acids. these data are consistent with exendin- being a glp- r agonist and exendin ( - ) being a specific glp- r antagonist. glp- ( - )amide, a primary metabolite of glp- , appears to act as an endogenous antagonist at the glp- r. department of biophysics, instituto de fisiología celular, universidad nacional autónoma de méxico, méxico city, méxico cholecystokinin (cck), a family of neuropeptides, seems to be involved in anxiety. evidence from several laboratories indicates that the ansiogenic effects of cck are mediated by cck b receptors. however it has been reported that cck a receptors have been found in brain and cck a receptor antagonists have ansiolytic properties. the aim of this work was to study whether or not cck a receptors are also involved in the modulation of anxiety. male rats were cannulated in the lateral ventricle and cck ( fmol) and/or cck antagonists ( fmol) were injected days after surgery. anxiety was evaluated in the elevated plus-maze test and locomotion in an open-field test. ansiogenic effects were observed when cck b receptor agonists (cck ns; cck ) or a mixed cck b and cck a receptor agonist (cck s) were injected. in contrast, cck , a cck a receptor agonist or cck - and cck - were uneffective. furthermore, the ansiogenic effects of cck s were prevented by the previous ( min) administration of l , (cck b receptor antagonist) but not by devazepide (cck a receptor antagonist). no effects on locomotion were observed in any condition. these results indicate that cck a receptors are not involved in anxiety, as measured by the elevated plus-maze test. congenital conditions (i.e. neural tube defects: ntg) have a multifactorial aetiology. deficiencies in the folate and transsulfuration pathways have, in recent years, been positively linked to ntd and other dysmorhogenic syndromes. efficient one-carbon metabolism is crucial for the synthesis of dna precursors, the remethylation of homocysteine and biomethylation of dna. more than % of the one-carbon units that flow through the metabolic system in mammals and birds are derived from l-serine and glycine, the natural substrates for shmt. the mitochondrial glycine cleavage enzyme system (gces) can potentially compete with shmt for tetrahydrofolate (thf) in the generation of the methylenetetrahydrofolate pool. valproate (depakene, epilim), an anti-epileptic agent, appears to be strongly associated with hyperhomocysteinemia, several other induced metabolic conditions, the inhibition of the gces and an increased incidence of ntd in epileptic women of child-bearing age. the exact mechanisms of valproate-induced ntd are not yet clear. we investigated the association of the teratogenic properties of valproate with the inhibition of shmt and/or the gces in developing embryos. chicken embryos were treated with sodium valproate (vpa) and pregnant female mice (c bl) received intraperitoneal injections of vpa, during the critical period of embryonic neural tube development. control embryos were treated with sterilised saline solution. harvested embryos were subsequently investigated for congenital abnormalities and hepatic shmt and gces activities quantified with radiometric assays. the effect of vpa on hepatic dna synthesis was monitored ( h-thymidine incorporation into embryonic dna) and the dna-methylation status determined (dna n -methylcytosine levels). dose-responsive incidences of ntd were observed in vpa treated embryos. very few defects occurred in control embryos. shmt and gces appeared to be inhibited in liver extracts of vpa-treated embryos. hepatic dna synthesis was significantly compromised and -mc levels were altered in vpa-treated embryos. the inhibition of either shmt and/or gces activities appeared to be associated with valproateinduced ntd in the chicken and mouse embryo models. the primary mechanism of this effect can probably be ascribed to a restriction in the flow of one-carbon units through the metabolic system, decreased synthesis of dna precursors and alterations in the methylation status of dna. department of neuroscience, university of cagliari, italy ethanol is long known to cause dose-related biphasic effects and we recently found that ethanol bidirectionally affects also working memory. the euphoriant and excitatory effects produced at low doses are associated with the rewarding action of ethanol and are thought to be mediated by the activation of the mesolimbic dopamine (da) system. however, ethanol monophasically stimulates mesolimbic da release in the nucleus accumbens, even at doses that cause hypnosis and coma. in contrast, ethanol biphasically modulates mesocortical da release in the prefrontal cortex (pfc). the changes in da release induced by ethanol are time locked with corresponding changes in extracellular glutamate levels. these biphasic effects of ethanol on pfc da and glutamate are matched by biphasic changes in the performance in a spatial delayed alternation task -a working memory test that is sensitive to proper function of the pfc -suggesting a link between da and glutamate transmission in the cognitive effects of ethanol. focal application in the pfc of the competitive ampa/kainate receptor antagonist cnqx suppresses both da release and the improvement of working memory induced by low doses of ethanol. these results suggest that ethanol may increase da transmission in the pfc and enhance working memory functions by increasing the release of glutamate, thereby stimulating non-nmda glutamate receptors. the enhancing effect on working memory by low, excitatory doses of ethanol may be perceived as rewarding and could constitute an important neurobiological mechanism for excessive ethanol drinking. physiology department, faculty of medicine, al-quds university, jerusalem, palestine glutamate and asparate are considered as the main excitatory neurotransmitters in brain and spinal cord, in addition to their role in energy metabolism, synthesis of proteins and detoxification of ammonia. glutamate and aspartate are centrally involved in basic mechanisms generating epileptic seizures and in epileptogenesis. stimulated release of glutamate and aspartate was detected in vivo and in vitro following neuronal depolarization. photic stimuli has increased glutamate release from visual cortex, and afferent brachial stimulation has increased the endogeneous release of glutamate from contra-lateral sensorimotor cortex compared to ipsi-lateral side. similar results were achieved after local application of tityustoxin or veratridine to the sensorimotor cortex. implantation of cobalt powder over the right sensorimotor cortex of rats produced an epileptogenic lesions characterized by contra-lateral fore and hind limb jerks and an increase in the frequency of eeg spikes. the jerks started after days with maximum myoclonic activity ( jerks/min). the concentration of glutamate in the epileptogenic focus was decreased significantly by % (p Ͻ . ) compared to the non-epileptogenic area on the left sensorimotor cortex, which was dissected but not treated with cobalt. part of the decrease in glutamate could be related to the enhancement of in-vivo release from the epileptogenic lesion to the extra-cellular fluid. kindling is the best model for studying the development of the epileptic focus (epileptogenesis), it could be achieved by repeated intra-cerebral micro-injection of glutamate ( . µ mol), aspartate ( . µ mol) or nmda ( - n mol), or repeated electrical stimulations of specific brain regions. in addition, glutamate antagonists particularly those specifically acting on the nmda receptor type e.g. -amino- -phosphonovaleric acid (ap ) and -amino- -phosphonopheptanoic acid (ap ) have been found to inhibit seizures in epileptic animals and inhibit the development of electrically kindled epilepsy. pre-synaptic glutamate receptor agonists like ( s, s)-acpd the agonist of group ii, and l-ap the agonist of group iii receptors has reduced ca ϩϩ uptake and glutamate release, thus it has inhibited epileptogenesis by preventing the increase in both seizure score and after-discharge duration. injection into fully kindled animals has produced an anti-epileptic effect by reducing the mean seizure score and by increasing the mean generalized seizure thresholds. this results suggest the mechanism by which pre-synaptically active glutamate receptor agonists block the development of the chronically epileptic state induced by electrical kinding, and indicate that their anticonvulsive activity is due to inhibition of pre-synaptic glutamate and/or asparate release following blockade of pre-synaptic ca ϩϩ entry. testing the changes in glutamate release from hyperactive brain tissues, and the effect of different glutamate agonists and antagonists, supports the role of glutamate in initiating the process of epileptogenesis, and contributes in developing new anti-epileptic agents. (this project was supported by a grant from alexo) the functional roles of cl(Ϫ) and divalent cations in the na(ϩ)/cl(Ϫ)/gaba cotransport were examined in xenopus oocytes expressing the human gat- (hgat- ) gaba transporter cdna. our results showed that cl(Ϫ) was not absolutely required for na(ϩ)/gaba transport via the hgat- (loo et al., j biol. chem. : - , ) . the cl(Ϫ) interacted with the transporter to modulate the binding of external na(ϩ). although hgat- transported cl(Ϫ) across the membrane with a stoichiometry of na(ϩ) : cl(Ϫ) : gaba, the transported cl(Ϫ) did not contribute to the net charge translocated across the membrane, suggesting a cl(Ϫ)/cl(Ϫ) exchange mechanism during the gaba transport cycle. the gaba transport via the hgat- is also modulated by divalent cations. the uptake of [ h]-gaba was inhibited significantly when both ca( ϩ) and mg( ϩ) were removed from the uptake buffer. several divalent cations tested were individually able to sustain the gaba uptake. in contrast to uptake, the gaba efflux was enhanced significantly upon removal of both ca( ϩ) and mg( ϩ) from the efflux buffer. the gaba transporter inhibitor skf a blocked the enhanced efflux, suggesting that the hgat- operated faster in the reverse mode in the absence of external divalent cations. these results suggest a regulatory role for the divalent cations in gaba transport. merck sharp & dohme, neuroscience research centre, terlings park, harlow, essex, u.k. the role of alpha containing gaba a receptors in hippocampal synaptic function has been investigated using pharmacological and electrophysiological techniques, as well as following disruption of the alpha subunit gene in knockout mice (ko). in the ca region of the hippocampus the induction of long-term potentiation (ltp) is powerfully regulated by gaba mediated synaptic currents (ipscs). agents that inhibit gaba-mediated transmission potentiate ltp induction, whereas allosteric agonists such as benzodiazepine-site agonists which slow the decay kinetics of ipscs suppress ltp induction. in alpha ko mice paired pulse facilitation of the amplitude of excitatory synaptic potentials is selectively enhanced in the ca region but not dentate gyrus. likewise, the frequency and rise time of spontaneous ipscs were similar in wt and ko slices. however their amplitude was significantly smaller in ko mice. furthermore, a significantly greater proportion of ipscs were best fitted to a mono exponential function in ko mice compared to wt animals. thus alpha containing gaba a receptors contribute to functional postsynaptic receptors on ca pyramidal cells in the hippocampus and modulate a postsynaptic component of synaptic facilitation. pharmacological research institute, volgograd medical academy, volgograd, russia the purpose of the study is to investigate effect of phenil (karphedon, mephebut, gammoxin) and circle (pyracetam) gaba derivatives on reproductive function of stressed male rats. the adult male rats were stressed by immobilization exposure ( hours) twice in week during weeks. four from five groups of stressed males were given substances (daily) at doses: karphedon - mg/kg, mephebut - mg/kg, gammoxin - mg/kg, pyracetam - mg/kg. the treated males were mated with intact females during days. after the mate the treated males and more in days all the mated females were sacrificed and investigated. analysis of our data indicates that the time of spermatozoa motion and epididymal sperm counts were decreased . % (p Յ . ) and . % (p Յ . ) respectively when compared with their intact controls. gaba derivatives have a softening effect on functional parameters of spermatozoa stressed males. karphedon and pyracetam increased the time of motion spermatozoa . % (p Յ . ), karphedon and mephebut drew near sperm counts to intact control level. the result of mate show that pregnancy rate was increased (p Յ . ) by stress exposure and pregnancy rate of females mated with gaba stressed males was some more (p Ն . ) than that of intact controls. the general embryonic morality was increased twice by stress and so the number of embryos was reduced . %. the gaba derivatives exposure to stressed male rats reduced the embryonic mortality of their posterity and increased the number of embryos to intact control level. our findings demonstrate that gaba derivatives administration has a protective effect on reproductive function of stressed male. transmission on the brain. the realization that glutamatergic pathways are involved in such diverse processes in epilepsy, ischemic brain damage and parkinsons' disease, is of a great practical interest. there are at least three functional classes of ionotropic glutamate receptors: n-methyl-d-aspartate (nmda), α-amino- -hydroxy- methyl- -isoxazolepropionic acid (ampa) and kainate (ka). other central neurotransmitter systems are under nmda influence. some data point on neuroprotective action of nmda antagonist on nigrostriatal pathway. in the present study female wistar rats were exposed during pregnancy with daily injected mk- (dizocilpine) . mg/kg sc. control rats received tap water only. behaviour of month old male offsprings was investigated by several psychopharmacological methods. oral activity, yawning, locomotor activity, stereotypy and catalepsy were recorded following respective central dopamine receptors agonists and antagonists administration (skf , quinpirole, apomorphine, haloperidol). our results indicate that mk- applied during pregnancy modulate reactivity of the central dopamine receptors in adult offspring rats. [ the development of mammalian ingestive behavior is characterized by a transition from suckling to chewing, two distinct motor behaviors. we hypothesize that this transition is accompanied by changes in brainstem circuitry underlying these movements. since glutamatergic neurotransmission is critical for the proper functioning of brainstem circuitry responsible for mastication, we investigated the development of glutamate receptors in trigeminal motoneurons (mo ) and mesencephalic trigeminal neurons (me ); neurons comprising the circuitry responsible for jaw movements. we conducted a series of receptor immunohistochemistry experiments that characterized the expression of iontotropic and metabotropic glutamate receptors (mglurs) during early postnatal development. the functional roles of nmda, ampa and mglurs in neonatal mo were investigated using in vitro electrophysiological experiments. results demonstrated that the spatial and temporal expression of ampa, nmda and group i and ii mglurs are developmentally regulated within and between mo and me during early development. electrophysiological data demonstrate that mglurs function pre-and postsynaptically to modulate synaptic transmission between trigeminal premotoneurons and mo . furthermore, nmda induced bursting is developmentally regulated and coincident with the transition from suckling to chewing behaviors. our studies suggest that the transition from suckling to chewing is accompanied by changes in the composition and function of glutamate receptors. fetal life in down syndrome starts with normal neuronal density but impaired dendritic spines and synaptosomal structure r. weitzdoerfer , m. dierssen , m. fountoulakis , and g. lubec department of pediatrics, university of vienna, austria information on fetal brain in down syndrome (ds) is limited and there are only few histological, mainly anecdotal reports and no systematic study on the wiring of the brain in early prenatal life exist. histological methods are also hampered by inherent problems of morphometry of neuronal structures. it was therefore the aim of the study to evaluate neuronal loss, synaptic structures and dendritic spines in the fetus with down syndrome as compared to controls by biochemical measurements. dimensional electrophoresis with subsequent mass spectroscopical identification of spots and their quantification with specific software was selected. this technique identifies proteins unambiguously and concomitantly on the same gel. fetal cortex samples were taken at autopsy with low postmortem time, homogenized and neuron specific enolase (nse) determined as a marker for neuronal density, the synaptosomal associated proteins alpha snap [soluble n-ethylmaleimidesensitive fusion (nsf) attachment protein], beta snap, snap and the channel associated protein of synapse (chapsyn ) as markers for synaptosomal structures and drebrin (drb) as marker for dendritic spines. nse, chapsyn and beta snap were comparable in the control fetus panel and in down syndrome fetuses. drebrin was significantly and remarkably reduced and not even detectable in several down syndrome brain samples. quantification of snap revealed significantly reduced values in ds cortex and alpha snap was only present in half of the ds individuals. we conclude that at the time point of about weeks of gestation (early second trimester) no neuronal loss can be detected but drebrin, a marker for dendritic spines and synaptosomal associated proteins alpha snap and snap were significantly reduced indicating impaired synaptogenesis. early dendritic deterioration maybe leading to the degeneration of the dendritic tree and arborization, which is a hallmark of down syndrome from infancy. pathfinding of growing axons to reach their target during brain development is a subtle process needed to build up contacts between neurons. abnormalities in brain development in down syndrome (ds) are described in a couple of morphological reports but the molecular mechanisms underlying abnormal wiring in fetal ds brain are not yet elucidated. we therefore performed a study using the proteomic approach to show differences in protein levels involved in the guidance of axons between control and ds brain in early prenatal life. proteins obtained from autopsy of human fetal abortus were applied on -dimensional gel, identified and quantified. we quantified members of the semaphorin/collapsin family, the dihydropyrimidinase related proteins - and the collapsin response mediator protein- (crmp- ) in ds and control cortex samples. drp- and crmp- levels were comparable in the control and ds samples. evaluation of drp- , drp- and drp- revealed significantly decreased levels of of the spots assigned to drp- and increased levels of one spot assigned to drp- and increased drp- in ds brain. we conclude that as early as from the th week of gestation pathfinding cues of the outgrowing axons are impaired in ds. these findings may help to elucidate mechanisms leading to abnormalities in neural migration of ds brain. inflammatory processes play an important role in the degeneration of basal forebrain cholinergic cells alzheimer's disease. the proinflammagen lipopolysaccharide (lps) was infused chronically into the basal forebrain of young rats. we then determined whether the administration of two novel nonsteroidal anti-inflammatory drugs or a pancaspase synthesis inhibitor, zvad, could provide neuroprotection from the cytotoxic effects of the neuroinflammation. we also determined whether the administration of the non-competitive n-methyl-d-aspartate (nmda) receptor antagonist, memantine, could provide neuroprotection from the cytotoxic effects of the neuroinflammation. chronic lps infusions decreased choline acetyltransferase activity and increased the number of activated microglia within the basal forebrain. caspases , and activity was increased in ventral caudate/putamen. non-steroidal antiinflammatory drug therapy attenuated the toxicity of the inflammation upon cholinergic cells and reduced caspases , , and activity in the caudate/putamen. zvad significantly decreased the levels of caspases , and but did not provide neuroprotection for cholinergic neurons. memantine significantly attenuated the cytotoxic effects of chronic inflammation upon cholinergic cells. these results suggest that prostaglandins contribute to the degeneration of forebrain cholinergic neurons in alzheimer's disease and that the cytotoxic effects of prostaglandins occur upstream to nmda receptor activation. intracranial administration of n-methyl-d-aspartate (nmda) receptor antagonists block learning of classical and avoidance conditioning in goldfish. studies with goldfish have shown that nmda receptors are mostly dense in the telencephalon and telencephalon ablation impairs avoidance learning. the present study investigated amnestic effects of microinjection of nmda receptor antagonist ap to the goldfish telencephalon in avoidance conditioning. in experiment , fish received no injection or microinjections of saline or various doses of ap to their telencephalon minutes before three semiweekly training sessions. fish were tested without injec-tions in session . a one-way anova with multiple comparisons on the test scores showed that ap produced anterograde amnesia in a dose-dependent manner. in experiment , fish received several training sessions and a microinjection of various doses of ap minutes before testing. the test scores showed that ap did not decrease avoidance responses, suggesting that microinjection of ap did not impair performance processes. in experiment , fish received microinjections of ap or saline to their telencephalon immediately following three semiweekly training sessions and were tested without injections in session . a one-way anova on the test scores showed that ap did not produce retrograde amnesia. (supported by gvsu grant-in-aid.) tryptophan modulates striatal serotonergic activity relative to fatigue t. yamamoto and e. a. newsholme health science laboratory, tezukayama university, nara, japan department of biochemistry, university of oxford, u.k. we have been reported that mechanism of fatigue in the brain relates to enhanced extracellular tryptophan and serotonergic function. brain concentration of tryptophan is not only dependent on the change of tryptophan which originates from the centarl nervous system, but also enhance tryptophan entering the brain from the blood-brain barrier and peripheral circulating tryptophan which is a trigger. supplementation of ltryptophan ( um) into the incubation medium with the synaptosomal striatum causes tryptophan to the extrasynaptosomal release by high kϩ stimulation. injecting l-tryptophan ( mm/ min) into the left striatum by microdialysis method can induce early fatigue for running time of rats. on the other hand, tryptophan deficiency rats (body weight average g) were made by tryptophan free feeding for weeks, and the rat's running time increased (Ͼ min difference). these results suggests that tryptophan is a potent active substance for fatigue in the brain. the active zone may be presynaptic terminal and the tryptophan itself may be releasing neuromodulators. (we appreciate that tryptophan free diet was provided by ajinomoto co., inc., japan.) our recent studies on the distribution of free d-serine, together with the d-serine action on the glycine site of the nmda type glutamate receptor, suggest that the d-serine can be an endogenous modulator of the nmda receptor. to explore the possible removal systems for brain d-serine signaling, we have evaluated the uptake of [ h]d-serine into the synaptosomal p fraction from the rat cerebral cortex. the cortical p fraction was able to accumulate [ h]d-serine in a temperatureand ph-dependent and saturable manner. the kinetic analysis indicates that cortical d-serine transport occurs by an apparent single-component system with km value of µm and a vmax value of pmol/mg protein/min. depletion of na ϩ and cl Ϫ ions remarkably decreased d-serine uptake into the cortical p fraction. the pharmacological profile of the inhibition of dserine uptake by various amino acids was different from those of glycine uptake system and other amino acid transporters reported. d-serine uptake activity was preferentially observed in the brain tissues such as cerebral cortex and cerebellum to the peripheral tissues. the present data support the view that the endogenous d-serine is taken up mainly through a carriermediated transport system to regulate the extracellular concentration in the mammalian brain. a. bocheva et al. the mammalian brain contains all the urea cycle intermediates, whereas enzymes participating in the conversion of lornithine (l-orn) into l-citrulline (l-cit) are absent, resulting in an incomplete urea cycle. the discovery of nitric oxide (no) synthase that catalyses the formation of no and l-citrulline as a co-product from l-arginine (l-arg) in the brain has indicated an additional pathway for l-arg metabolism. l-canavanine (l-cav), is a potent antimetabolite and structural analog of larginine, produced by legumes such as the jack bean, canavalia ensiformis. l-canaline (l-can) is a potent inhibitor of ornithine aminotransferase. our previous results indicated that l-cav, l-cit, l-arg, and l-orn exerted an antinociceptive effect, whereas l-canaline induced hyperalgesia in rat. l-canavanine exert stronger antinociceptive effect than l-arginine, l-ornithine and l-citrulline. the aim of the present study was to investigate are d-arg, l-cav and naloxone reversed the analdesic effects of l-ornithine, l-citrulline and l-arginine. the experiments were carried out on male wistar rats. the changes in the mechanical nociceptive threshold of the rats were measured by the radall-selitto paw pressure test using and analgesimeter (ugo basile). the amino acids were applied intracerebroventricularly (i.c.v.) at a dose µg/rat. the present results shown that d-arg, l-cav and naloxone reversed antinociception. the regulation of lysine metabolism in cereal crops r. a. azevedo , p. j. lea , s. a. gaziola , a. p. pellegrino , and s. m. g. molina departamento de genética, escola superior de agricultura luiz de queiroz, universidade de são paulo, brazil department of biological sciences, university of lancaster, u.k. a major nutritional drawback of cereal seeds is a deficiency in some amino acids, in particular lysine. biochemical, molecular and genetic studies have considerably increased our knowledge concerning the regulation of the aspartate pathway, by which lysine is synthesized. among the enzymes involved in lysine metabolism, aspartate kinase (ak) and dihydrodipicolinate synthase (dhdps) control the regulation of lysine biosynthesis, whereas lysine: -oxoglutarate reductase (lor) and saccharopine dehydrogenase (sdh), have been shown to play a key role in the breakdown of lysine. in general, lysine overproduction can be obtained by altering the sensitivity of dhdps to lysine, but accumulation of this amino acid in cereal seeds requires further manipulation of lor and/or sdh. this suggestion is strongly supported by five main points: ( ) cereal mutant or transgenic plants do not exhibit any significant accumulation of lysine in seeds, but only in other tissues. ( ) the enzymes of lysine degradation, lor and sdh, are endosperm specific in cereals only. ( ) the opaque- mutant, which exhibits higher concentration of soluble lysine and protein lysine in the seed, contains several-fold lower lor and -fold lower sdh activity when compared to the wild-type maize. this reduction in activity in the opaque- mutant is due to a reduced protein lor-sdh concentration by reduction of the zlkrsdh gene transcript. furthermore, the opaque- maize gene has been shown to regulate ak and lor activity. ( ) intermediates of lysine catabolism accumulated in the seeds of soybean and canola lysine overproducing plants, suggesting the presence of reduced lor and/or sdh activities. ( ) among cereals and although still below the recommend values by fao, rice exhibits the higher concentration of lysine, but lor and sdh are present in much lower activities. also, in phaseolus vulgaris, lor and sdh activities were shown to be around fold lower then in maize endosperm. the regulation of the lor activity is complex and involves a calcium dependent phosphorylation/dephosphorylation mechanism. it remains to be seen whether this latter mechanism can be controlled, so as to allow the production of more crop plants that contain elevated concentrations of lysine in the seed. the genetic progress for nue can be accelerated with the use of secondary traits that possess high inheritance and correlation with productivity. several traits have been studied such as chlorophyll concentration, plant height, leaf senescence, anthesis-silking interval, kernel number, activities of enzymes of n assimilation and loci of quantitative traits for assisted selection. (we are grateful for financial support from fapesp, brazil and the british council.) (termed hyperaccumulators) that grow on metalliferous soils, are able to translocate cd from the roots and accumulate it in high concentrations in the shoots. cd may be detoxified in plants by combination with a family of sulphur rich peptides termed phytochelatins. cd has the capacity to inhibit a range of enzyme activities in plants, in particular those of the calvin cycle and chlorophyll biosynthesis. evidence that cd causes the production of reactive oxygen species (ros) has also been obtained. we have investigated the antioxidant responses of radish, soybean and sugarcane to cd treatment. seedlings were grown in increasing concentrations of cdcl , ranging from . - mm, for up to h in a hydroponic system. analysis of cd uptake indicated that most of the cd accumulated in the roots, but some was also translocated and accumulated in the leaves. roots and leaves were analysed for catalase (cat), glutathione reductase (gr) and superoxide dismutase (sod) activities. gr activity increased considerably in the roots of all plant species tested after exposure to the metal, indicating a direct correlation with cd accumulation. cat activity also increased in roots but to a much lesser extent when compared to gr and also varied depending upon the plant species. the analysis of native page enzyme activity staining, revealed several sod isoenzymes in leaves of all plant species, however, only in radish was a clear increase in activity observed. the results suggest that in these plants, the activity of antioxidant enzymes responds to cd treatment. the main response may be via the activation of the ascorbate-glutathione cycle for the removal of hydrogen peroxide, or to ensure the availability of glutathione for the synthesis of cd-binding proteins. (we are grateful for financial support from fapesp, brazil and the british council.) all plant cells, tissues and organs provide the biosynthetic machinery and capacity to synthesise aliphatic polyamines. however, in physiological conditions only some organs and tissues synthesise polyamines, such as apical buds and sprouts, root apex, lateral buds of branches and secondary roots, as well as superficial layers of young stems and leaves, like epidermis, subepidermis and parenchyma cells. apical roots can also synthesise polyamines, but these activities in physiological conditions are lower than that of the shoots. this patterns recalls the one of auxins. polyamines are accumulated in high concentrations in storage organs, such as seeds, but not in tubers like helianthus tuberosus, potato or tuberised roots such as the carrot. also some fruit, e.g. oranges, contain high level of free polyamines, putrescine in particular. all other organs obtain polyamines through translocation via phloem tubes and xylem vessels. in plants, in addition to free polyamines, many polyamines are conjugated to hydroxycinnamic acids, the hydroxycinnamic amines, that only rarely represented outside the plant kingdom. this compounds are paticularly abundant in solanaceae family, where they can represent as much as % of the total polyamine pool, but they can be detected in different concentrations in many other families. the role of free and conjugated polyamines and their importance in food is discussed. drought, salinity or other environmental stressors promote the accumulation of free amino acids, amines and other organic n-metabolites with low molecular weight. in this contribution the influence of drought on the accumulation of amino acids, polyamines and trigonelline in growing barley plants and barley grains was examined. in comparison to non-stressed plants we obtained in stressed plants, exposed to drought before flowering, a higher concentration of proline (increase: -fold), n-trimethylglycine ( -fold), histidine ( -fold), tryptophane ( , -fold), putrescine ( , -fold), spermine ( , -fold) and trigonelline ( -fold) in the dry matter of barley sprouts. in addition to this, drought caused an increase of the n-content in the plant biomass ( %) as a result of growth inhibition ( %). six weeks later the content of soluble n-metabolites and protein was analyzed in non-stressed and pre-stressed barley plants again. during this reproductive period of plant development all the test groups were cultivated under the same moisture conditions. the analysis of n-metabolites in the ripening grains showed, surprisingly an after-effect of the drought stress. for example, in grains of pre-stressed barley the concentrations of free proline, histidine, tryptophane and asxϩglx were threefold to fivefold higher than in grains of non-stressed barley. depending on the resistance of barley cultivars to drought the biochemical response was different: in plants with low resistance the increase of amino acids and amines was higher than in resistant cultivars. however, resistant cultivars have already high genuine concentrations of n-metabolities in non-stressed plants. by treatments with choline or -aminoethanol the stresspromoted accumulation of amino acids and trigonelline was diminished. consequently, different biochemical responses of cereals to drought result in changes of product quality and nitrogen use. our goal is to increase the lysine content in corn. we have used genetic engineering to increase lysine synthesis and to prevent metabolic breakdown of lysine. to increase synthesis we circumvented the normal feedback control of a key enzyme in the lysine biosynthetic pathway, dihydrodipicolinic acid synthase (dhdps). lysine-feedback-insensitive dhdps, encoded by the corynebacterium dapa gene, was expressed from seed-specific promoters in transformed corn seeds. expression of dhdps in the corn embryo, but not in the corn endosperm, resulted in a to -fold increase in the accumulation of free lysine in the seeds and the total seed lysine content nearly doubled. lysine breakdown products have been observed in transgenic seeds that accumulate high levels of free lysine. we isolated a corn gene for the bifunctional enzyme lysine ketoglutarate reductase (lkr)/saccharopine dehydrogenase (sdh), which catalyzes the first two steps in lysine breakdown. knockout of lkr/sdh in corn by either mutation or genetic engineering results in a -fold increase in seed free lysine. combination of feedback-insensitive dhdps with knockout of lkr/sdh results in to -fold higher levels of free lysine than dhdps alone. no adverse effects on seed or plant agronomic performance are associated with the high lysine trait. biotechnology center for agricultural and the environment and the plant science department, rutgers university, new brunswick, new jersey, u.s.a. Ј-adenylylsulfate (aps) reductase catalyzes a key reaction in the plant sulfate assimilation pathway leasing to the synthesis of cysteine and the antioxidant glutathione. in arabidopsis thaliana aps reductase is encoded by a family of genes. in vitro studies revealed that the enzyme product derived from one of the aps reductase genes (apr ) is activated by oxidation, probably through the formation of a disulfide bond. redox titrations show that the regulation site has a midpoint potential of Ϫ mv at ph . and involves a -electron redox reaction. exposure of a variety of plants to ozone induces a rapid increase in aps reductase activity that correlates with the oxidation of the glutathione pool and is followed by an increase in free cysteine and total glutathione. during the response to ozone the level of immuno-detectable aps reductase enzyme does not increase. treatment of a. thaliana seedlings with oxidized glutathione or paraquat induces aps reductase activity even when transcription or translation is blocked with inhibitors. the results suggest that a post-translational mechanism controls aps reductase. a model is proposed whereby redox regulation of aps reductase provides a rapidly responding, self-regulating mechanism to control the glutathione synthesis necessary to combat oxidative stress. in aspergillus nidulans the structural genes coding for nitrate reductase (niad) and nitrite reductase (niia), share a common promotor region of , bp. we have previously characterized in vitro and in vivo the physiologically relevant cisacting elements for the two synergistically acting transcriptional activators, nira and area. we have further shown that area is constitutively bound to a central cluster of four gata sites and is directly involved in opening the chromatin structure over the promoter region and thus making additional cis-acting binding sites accesible. here we show that the asymmetric mode of nira-dna interaction determined in vitro is also found in vivo. binding of the nira transactivator is not constitutive as in other binuclear c -zn ϩϩ -cluster proteins but depends on nitrate induction and additionally, on the presence of a wild type area allele. dissecting the role of area further, we found that it is required for intracellular nitrate accumulation and therefore could indirectly excert its effect on nira via inducer exclusion. but in a strain accumulating nitrate independently of area nira binding and chromatin rearrangement is not triggered by nitrate in the absence of area. v. nikiforova , m. zeh , o. kreft , s. maimann , h. hesse , and r. höfgen max-planck-institut für molekulare pflanzenphysiologie, potsdam, and institut für biologie, angewandte genetik, freie universität berlin, germany higher plants, being a source of reduced sulfur for animal nutrition, assimilate inorganic sulfate into cysteine which is subsequently converted to methionine, another sulfur-containing amino acid. in order to investigate the possible regulatory points of the cysteine and methionine biosynthesis pathway a series of transgenic potato plants was engineered using clones encoding enzymes of the branched pathway from serine to cysteine as a pathway intermediate and from aspartate further on to methionine. increased cysteine levels were obtained in the leaves of serine acetyltransferase (sat) sense and cystathionine -lyase (cbl) antisense transformants. furthermore, glutathione levels were elevated in sat plants while downregulation of cbl was desastrous for plant growth, eventually. increased methionine levels were successfully obtained in potato by antisense inhibition of threonine synthase (ts). accumulation of free methionine was not only observed in source leaf tissues but as well in tubers. this enzyme competes with cystathionine gamma-synthase for the common substrate o-phosphohomoserine at the branchpoint between threonine and methionine synthesis, respectively. important control points of the biosynthesis of cysteine and methionine in potato, thus, turned out to be sat and ts, while further studies on overexpression of cystathionine gamma-synthase, cbl and ms did not reveal any substantial effect on potato methionine biosynthesis. dniepropetrovsk national university, board of biophysics and biochemistry, ukraine amino acids in root exudates of plants may be chelate agents as an alpha-amino acid can act like a bidentate ligand, forming a five-membered heterocyclic ring with suitable metal cations thus increasing mobility of metals. recently we have showed that application of growth regulators led to sharp increase of root exudative activity of some cultural (zea maize l.) and wild cereals (festuca rubra l., lollium perenne l.) during first days of germination. in this work we present results obtained in experiments with lollium perenne l., grown on sterile sand and on soils contaminated with great quantities of zn. detailed analysis of amino acid content of root exudates of several types of maize (hybrid, several lines, an opaque- mutant line) showed that the specie had more certain amino acids (cysteine, aspartic and glutamic acids and their amides, serine) in root exudates than cultural ones. these amino acids has more possibility for chelation due to existance of one more polar or ionogenic functional groop. seeds of lollium perenne l. were treated with growth regulater and planted on soils contaminated with salts of zink. it was shown that during days of germination quantity of zn in primary leaves increased from , to , % and decreased in soil: in upper layer from , to , , midde layer from , to , , lower layer from , to , mkg/kg correspondingly. thus, it was shown that stimulation of root exudative activity by pretreatment with a growyh regulator may be succesful in cleaning of soils and basicly this is a good method for phytoremediation. erenol exerted the strongest effect. exercise completely abolished the levels of cysteine in the atrial heart muscle. propranolol, isoproterenol, caffeine and pentylenetetrazol increased the ratio of cysteine to the total free amino acids in the atrial muscle, while physical stress and all cardioactive drugs tested increased this ratio in the ventricle muscle. disappearance of cysteine from the heart's atrial muscle after intensive exercise may be attributed to its utilization for atrial natriuretic factor and/or for endothelin synthesis, during stress. on the other hand it seems that hypoxia and isoproterenol are strong stimulants of no production, and consequently decrease the tissue levels of l-arginine, which is the major endogenous donor of no acting as the endothelin antagonist. measurement of serum levels of vitamin b is a screening test for detection of deficiency of this vitamin but low levels do not always indicate a deficiency of the vitamin. measurements of serum homocysteine and methylmalonic acid (mma) are used to confirm this deficiency because two enzymes involved in their metabolism have been shown to require vitamin b , but these results can also be inaccurate. vitamin b deficient white cells exhibit ultrascopic nuclear appenages which have been shown to contain dna; this finding could possibly be used as another confirmatory test of vitamin b deficiency. twenty-seven patients (mean age - . years) with low serum b were studied by electron microscopic determination of the percent of neutrophils exhibiting these appendages and routine clinical parameters. only one patient did not have nuclear appendages; the others had a range of . %- . % of neutrophils examined. there was a significant correlation of homocysteine (r ϭ . , p Ͻ . ) and mma (r ϭ . , p Ͻ . ) with serum b levels but no correlation of appendage number (r ϭ . ) with serum b . there was no correlation of appendage number with homocysteine (r ϭ . ) or mma (r ϭ . ). these results suggest that b -deficient white cell nuclear appendages do not measure the same metabolic pathways as homocysteine and methylmalonic acid and may be useful in confirmation of vitamin b deficiency. further extensive clinical evalution would be necessary to explore this possibility. the hypothesis: "l-theanine has relaxing effects of central nervous system of human beings", was verified by electroencephalographical methods. methods: male, healthy sport-students, free of drugs or stimulants, participated weekly in a cross-over study. after exhaustive bicycle-ergometer test as an individual, reliable, stress model, the subjects recovered by lying in a segregated shaded room. three testdrinks with different l-theanine content (d ϭ placebo, d ϭ mg, d ϭ mg) were given in a randomised, double-blind order. all test-conditions were standardized strictly. eeg-recordings (closed eyes) were carried out (m ϭ min. after stress/before testdrink, m ϭ min.-, m ϭ min.-, m ϭ min.-, m ϭ min. after testdrink) with the cateem ® system. absolute and relative eeg-spectralpower were examined. results: significant reductions in all frequencies (exception theta-power) were found in early recovery, being not significant influenced by testdrinks. qualitative different behavior trends were found in frontal-, central-, occipital-regions with increased alpha , theta (frontal) and decreasing beta relative-power earlier in recovery with d . these findings were related to relaxing effects. after ingestion of l-theanine alpha -, beta -power at occipital regions decreased faster (m ) to placebo recovery levels (m /m ). thus it may be concluded that l-theanine has no pharmaceutical effect on the down regulation system but supports the physiological mechanisms during recovery after physical stress in human brain. arginine and cysteine in muscle cytosol of rats' heart after exercise, hypoxia or challenge with six selected cardioactive drugs r. brus , j. gabryś , j. konecki , and j. shani department of pharmacology and department of histology and embriology, medical university of silesia, zabrze, poland department of pharmacology, the hebrew university school of pharmacy, jerusalem, israel levels of the amino acid l-arginine (a major endogenous donor of nitric oxide-no), cysteine (sulfur-containing amino acid, important for atriopeptins and endothelins synthesis), and of total free amino acids, were assayed by gas-liquid chromatography in cytosols of rats' atrial and ventricular muscle cardiomyocytes. the tissues were assayed after the rats had been exposed to either exercise (swimming), hypoxia or one of six cardioactive drugs such as propranolol, digoxin, pentylenetetrazol, reserpine, isoproterenol and caffeine. physical stress and the examined drugs significantly reduced the total amount of cytosolic free amino acids in both cardiac muscles. in the cytosol of the heart atrial muscle, reserpine, propranolol and pentylenetetrazol increased the relative content of l-arginine, while hypoxia and digoxin decreased it. in the cytosol of the ventricular heart muscle, hypoxia and all six drugs used, decreased the relative levels of l-arginine. hypoxia and isoprot-addition of somatostatin- or of some of its analogs was found to cause a selective inhibition, up to %, of the uptake of large neutral amino acids by isolated brain microvessels. although the luminal and abluminal sides of brain endothelial cells are both capable of taking up large neutral amino acids, only the uptake from the abluminal side was apparently inhibited by somatostatin. the involvement of a type- somatostatin receptor was suggested by assays with a series of receptorspecific somatostatin agonists, and was confirmed by the inhibition release caused by a specific type- receptor antagonist. a type- specific mrna was indeed shown to be present both in bovine brain microvessels ex vivo and in primary cultures of endothelial cells from rat brain microvessels. hemorphins represent a bioactive peptide class which contents between and amino acids and generated from the proteolysis of an hemoglobin "strategic zone". many activities have been related to hemorphins such as in vitro anti tumour effect, analgesia effects in vivo, and a potential role in the renin angiotensin system (ras). as far as their activity towards the ras is concerned, it was demonstrated that they could inhibit angiotensin converting enzyme (ace) and aminopeptidase n activity. so they could reduce angiotensin ii formation and angiotensin iv degradation. moreover some hemorphins, lvv-hemorphin- and vvhemorphin- , could behave like angiotensin iv receptor binding competitor. further it could be interesting to study the angiotensin iv potentiality to interact with ace. inhibition studies showed that it was possible that angiotensin iv could behave like a competitive inhibitor of ace. so some hemorphins could interact at different ras steps to inhibit ace. additionally to their inhibition of angiotensin i conversion, they could inhibit angiotensin iv degradation and consequently cause ace feedback inhibition. inhibition studies have been checked with ras natural substrate (angiotensin i) and confirmed that angiotensin iv, vv-hemorphin- and mainly lvv-hemorphin- could be natural ace inhibitors. so the hemorphins regulatory role in the ras appears to be more and more probable. the role of administration of each of methionine and finastride on the testicular function of both normal and prostate precancerous old male rats was investigated. for normal animals, neither methionine nor finastride has exerted any significant change in the hormonal profile after teatment for days. however methionine alone could exert a significant change in both testosterone and prostatic specific antigen {psa} levels after treatment for days. on the other hand, both methionine and finasteride significantly increased the levels of testosterone and androstenedione, whileas markedly reduced the levels of dihydrotestosterone and prostatic specific antigen {psa} after treatment of prostate precancerous old male rats for a period of days. noteworthy, continuation of treatment for aperiod of days realized marked improvement of hormonal profile of the prostate precancerous old male rats. several observations in our department point to some role of glycine in fatigue and exercise. ) in the framework of a study on the involvement of one-carbon matabolism in patients suffering from a polymorphic episodic psychosis, amino acid loading tests with serine, glycine and alanine were performed. a few hours after the administration of glycine, approximately % of the patients reported overwhelming feelings of fatigue and/or showed vegetative symptoms. ) in patients suffering from chronic fatigue syndrome, we found increased plasma levels of glycine in % of the female patients. moreover, - % of these patients omplained about a distorted sensory perception of objects. ) young soccer players were observed during a period of months, while in the course of this period eight blood samples were taken for amino acid analysis. based on the number and severity of injuries this population was divided into injury-prone and not injury-prone soccer players. it was found that in injury-prone soccer players plasma glycine levels during the whole observation period were significantly lower than in subjects who were not injury-prone. the consequences of the above mentioned observations will be discussed. institute of sportsmedicine, university of paderborn, germany percent of amino acids in green tea leaves are represented by l-theanine ( -n-ethylglutamine). previous rat experiments demonstrated effects of l-theanine to act on metabolism of neurotransmitters. it was therefore suggested that is causes the relaxing effects of green tea. to examine its influence as a component of a drink on the sympathetic nervous system after maximal physical exercise skin resistance measurements through electrosympathicography (esg) were used. after individual maximal exercise on a bicycle-ergometer testdrinks with different amounts of l-theanine ( , and mg) were administered to healthy volunteers in a randomised cross-over double-blind distribution on a weekly base. esg was monitored before and immediately after exercise as well as , , , , and minutes after end of exercise. all testconditions were standardized strictly. a characteristic esgcourse with subsequent qualities could be shown: . decreasing skin resistances after exercise could be established in each volunteer. . esg-activation levels before exercise could not even be reached again after a period of regeneration of / hours. . maximal electrodermal activity did not appear immediately after exercise, but after minutes. however, l-theanine could not significantly influence peripheral sympathetic electrodermal activity during the regeneration after maximal physical exercise. a. mero and h. pitkänen , neuromuscular research center, department of biology of physical activity, university of jyväskylä, and rehabilitation center of kankaanpää, finland essential amino acid leucine has many important roles in the body. therefore the purpose of the present study was to investigate if leucine supplementation has effects on serum amino acid profile and performance following training period or following single training sessions. all experiments were carried out in a randomized double blind cross-over procedure during a training season. thirty six adult male track and field power athletes served as subjects. in experiment ten of them were given leucine ( mg/kg body weight per day) as tablets. the concentration of leucine decreased significantly ( %) in the placebo group (p; n ϭ ) during weeks but not when leucine was taken. also total amino acids (taas) decreased strongly ( %) during weeks when dally protein intake was . g/kg body weight. in experiment the subjects (n ϭ ) carried out a single strength training seasion (sts) and consumed a drink containing leucine mg/kg body weight. following sts leucine in serum increased by % (ns) when leucine was taken but decreased strongly ( %) in p, in experiment the subjects (n ϭ ) underwent at days interval two maximal anaerobic running exercise (mare) tests on treadmill (n ϫ s with a recovery of s between the runs) until exhaustion. the subjects consumed drinks containing leucine ( mg/kg body weight) or placebo min before the test runs. following mare the concentration of leucine strongly increased by % whereas isoleucine ( %) and valine ( %) strongly decreased with the supplementation but no changes occurred in p. there were no improvements in physical performance either in mare or in explosive strength (experiment ) with leucine supplementation. the date suggest that leucine supplementation during a training period and before single training sessions prevents decreases in serum concentration of leucine and may have also effects on some other single amino acids. this may be beneficial during intensive training although improvements in performance were not observed in this study. since there are only limited data regarding effects of training period or training sessions on serum amino acid profile, the purpose of this study was to investigate serum amino acid changes following training period and following three different training sessions. the subjects consisted of track and field adult male power athletes. in experiment eleven of them performed a -week training period including a training sessions per week, which included sprint work, speed endurance work, endurance work, weight training, and jumps. significant decreases in the fasting concentrations of total amino acids (taas) ( %), branched chain amino acids (bcaas) ( %), essential amino acids (eaas) ( %) and leucine ( %) were observed following training with the daily protein intake of . g/kg body weight. in experiment eleven subjects performed a short run session (srs) of ϫ ϫ m with recoveries of s and s, and a long run session (lrs) of s runs with recoveries of s until exhaustion. there were no significant changes in taas following the sessions but bcaas decreased by % in srs and by % in lrs. leucine decreased by % following srs but only by % (ns) following lrs. the peak blood lactate concentrations after srs and lrs were . Ϯ . mmol/l and . Ϯ . mmol/l, respectively. in experiment sixteen subjects carried out a strength training session (sts), which consisted of jumps and heavy resistance exercises (speed and maximum strength) during minutes. the taas decreased significantly by %, bcaas by % and leucine by % following sts, while the peak blood lactate concentration was . Ϯ . mmol/l. these data indicate that remarkable decreases occur in the concentration of amino acids during a training period with the daily protein intake of . g/kg body weight. the decreases in serum amino acids are more pronounced following a strength training session than following lactic anaerobic running sessions. glutamine acts as a multipurpose regulator of amino acid and peptide transport across the blood-brain barrier departments of cellular biotechnology and haematology and of biochemical sciences, university "la sapienza", rome, italy isolated brain microvessels, the in vitro equivalent of the blood-brain barrier, have distinct na ϩ -independent uptake systems for the uptake of large hydrophobic amino acids, of enkephalins and of deltorphins, as shown by the absence of reciprocal inhibition. both d-and l-glutamine were capable, if added to the extracellular buffer, of exerting a competitive inhibition on the uptake of all these substrates. a trans-stimulatory effect was instead induced, in all cases, by l-glutamine preloading of the microvessels -the d-stereoisomer being instead ineffective, probably because of only l-glutamine could be taken up, in a concentrative manner, by some na ϩ -dependent concentrative system(s). all the na ϩ -independent systems present in brain microvessels seem therefore to share some structural feature responsible for their common susceptibility to interference by l-glutamine. this amino acid, whose synthesis can take place in the astrocytes, in the pericytes and also in the endothelial cells of the microvessels, plays a critical role in regulating the movements of several different substrates across the blood-brain barrier. department of applied bioorganic chemistry, division of life science, graduate school of agricultural science, tohoku university, aoba-ku, sendai, japan isolation and structure analysis of two amino acids from bovine ligamentum nuchae elastin hydrolysates revealed the presence of pyridine cross-links in elastin. the structures of these amino acids were determined to have , , -and , , -trisubstituted pyridine skeletons both with three carboxylic acids and a mass of (c h n o ), identified as -( -amino- -carboxybutyl)- , -di-( -amino- -carboxypropyl)pyridine and -( -amino- -carboxybutyl)- , -di-( -amino- carboxypropyl)-pyridine. we have named these pyridine cross-links, desmopyridine (desp) and isodesmopyridine (idp), respectively. structure analysis of these pyridine crosslinks implied that the formation of these cross-links involved the condensation reaction between ammonia and allysine. the elastin incubated with ammonium chloride showed desp and idp levels increased as the allysine content decreased. desp and idp were measured by hplc with uv detection and were found in a variety of bovine tissues. the desp/desmosine and idp/isodesmosine ratios in aorta elastin were higher than in other tissues. desp and idp contents in human aorta elastin were found to be gradually increased with age. the concentration of idp was significantly elevated in aorta elastin of rat with chronic liver cirrhosis induced by carbon tetrachloride when compared with normal rats. the provision of glutamine to marathon runners has resulted in a decreased, self-reported incidence of illness. increasing evidence -in vitro; and in vivo suggests that neutrophils in humans may benefit from exogenous glutamine. the provision of glutamine in vivo should replete the marked decrease in the blood concentration observed after stress such as clinical trauma or prolonged, strenuous exercise. beneficial effects of glutamine supplementation include increased phagocytic activity and reactive oxygen intermediate production in vitro; decreased neutrophilia and il- production (a chemoattractant for neutrophils) in vivo and ex vivo. the aim of the present study was to establish whether glutamine supplementation in vitro and in vivo affects neutrophil function at rest and after exhaustive exercise. in addition, it was planned to establish the presence of glutaminase in human neutrophils, which has not yet been achieved, although glutaminase is present in rat neutrophils. methods: blood samples were taken from marathon runners receiving either glutamine or placebo, immediately after and one hour after a race. measurements included the plasma concentration of glutamine (enzymatic assay), il- production (elisa), and neutrophil activity. the latter was measured with two different techniques for measuring oxidative burst in whole blood, one of which was a novel chemiluminescence assay (knight scientific ltd, u.k.) with the fluorescent label, pholasin, and two different stimuli, f-met-leu-phe (fmlp) and phorbol-myristate-acetate (pma). in addition, isolated whole cells and subcellular neutrophil fractions were assayed for the presence of glutaminase. results: the plasma glutamine concentration was reduced overall by % hr after the race (p Ͻ . ). there was an apparent decrease (only close to significance, p Ͻ . ) in il- production in the glutamine group compared with the placebo group. neutrophil function did not change between groups at any stage. the incidence of illness was % higher in the placebo group than the glutamine group in the week after the race. neutrophils from four out of six subjects gave an increased response ( . %) to fmlp when incubated with glutamine compared with no glutamine, and four out of four gave an increased response to pma ( . %). in the fmlp experiments there were two individuals who did not respond to the addition of glutamine. however, the response was not diminished whether or not glutamine was present. in separate studies, the effect of glutamine on lipopolysaccharide-induced il- production was also monitored. conclusions: the provision of glutamine after prolonged, exhaustive exercise appears to modify exercise-induced neutrophilia via a reduction in il- production and to reduce the incidence of illness in the following week. in vitro data suggest a role for glutamine in neutrophil metabolism. disappointingly, little or no evidence of the presence of glutaminase was found in human neutrophils. the three different methods used, freeze-thaw, homogenisation, nebulisation were apparently not sufficient to break open the granules. current studies are addressing this problem. r. j. ward and l. m. castell departments of biochemistry, university catholique de louvain, belgium oxford university, oxford, u.k. it is essential that the developing muscle has adequate amino acids for the synthesis of actin and myosin as well as those required for a multitude of enzymes involved in muscle metabolism. with carbohydrates and lipids, the body is able to store a reserve as glycogen and triglycerides respectively; however this is not the case with amino acids creatine supplementation in increasingly being used as a dietary supplement by athletes during high intensity, short term exercise to improve physical performance since it is converted in the muscle to phosphocreatine. transporters which permit creatine to cross the muscle membrane namely crt and crt (a na ϩ and clЈ dependent mechanism) have now been identified. creatine uptake is enhanced by the ingestion of carbohydrate at the same time as supplementary creatine. this may be due to increased circulating levels of insulin or insulin-like growth factor . more recently attention has been focussed upon the various transporters for amino acids across the muscle membrane. certain criteria are needed for the amino acids to enter the blood which include the presence of specific carriers for its transport across cells of the gastrointestinal tract, such as enterocytes, as well as minimal metabolism within these cells. a wide number of different transporters has been identified, which include neutral amino acids and cationic amino acids. despite the evidence which suggests that supplementation with some amino acids can influence metabolism, and therefore athletic performance, much more experimental work is still required in this area. m. weiss , t. barthel , r. schnittker , k. e. geiss , w. falke , and l. r. juneja university of paderborn, germany taiyo kagaku co., yokkaichi, japan, isme gmbh mörfelden, germany in animal studies l-theanine was shown to influence neurotransmitter systems. thus it may be helpful in managing stress regulation. so we observed the down regulation after physical stress in the brain (measured by eeg-mapping) and in peripheral hormonal systems (plasma levels of catecholamines, cortisole, prolactine, serotonine, measured by hplc). n ϭ healthy students consumed drinks containing either , or mg l-theanine in randomized double-blind trials in the min - after a near maximal bicycle step test. measurements were done directly after exercise (m ) and (m ), (m ), (m ), (m ) min after the drink. l-theanine seemed to accelerate the normalization of eeg spectral power in high frequency waves (barthel in this congress). the physiological return of increased hormon levels to basal levels / the circadianic rhythm up to m (catecholamines) or m (cortisole, serotonine, prolactine) was not influenced by the drinks. but in the l-theanine trials correlations between eeg spectral power and some hormones were altered (slow wave power/some catecholamines except norepinephrine/delta disappeared and new correlations with prolactine appeared). thus we conclude that l-theanine acts at the switch from the brain to the peripheral stress regulation and thereby supports physiological relaxing after severe exercise. polyamines the development from callus to plantlets, both activities increased, reaching the maximum at this latter stage. also sadenosylmethionine decarboxylase activity displayed a similar trend. all the activities were present in supernatant and in particulate fraction. higher activity of enzymes assayed in the small embryos rather than in the embryo with higher shape, was consistent with following polyamine accumulation. department of biology, laboratory of cell biochemistry, university of rome "tor vergata", rome, italy intracellular transglutaminase (tg, ec . . . ), which catalyzes the formation of ε-(γ-glutamyl)lysine isopeptide cross-links between polypeptides, has been related to a variety of important biological processes and in the development of senile cataract. the majority of the dry weight of the eye lens is composed of protein called crystallins. in the mammalian lens, these proteins are divided into three major classes: α-, -and γcrystallins. native -crystallins are oligomers, which elute in two or more size classes during gel filtration, ranging from - kda. they contain different types of subunits, named b , b , b , a , a , a , ba , ranging from - kda. in the rabbit eye lens two -crystallin subunits ( b and b ), among the water soluble proteins, have been shown to act selectively as acyl donors substrates for lens tg. calpains are cytoplamic ca ϩϩ -dependent cystine proteinases. the cleavage of αand -crystallins, the main substrates of lens calpain ii, has been associated to the increase of lens turbidity, due to insolubilization of peptides. we observed that tg-induced post-translational modification of b -and b -crystallins with polyamines, enhances their cleavage by calpain ii. this finding suggests that the enhancement of calpain ii activity, after conjugation of polyamines into -crystallins, could represent an important regulatory mechanism which may contribute to the opacification process of the eye lens, conducting to cataract formation. transglutaminases represent a family of enzymes, widely diffused in nature, from bacteria to plants and higher animals. the present discussion will focus on isoenzymes in mammals, which have been well characterized from the structural and functional point of view. they act on tissular proteins catalyzing crosslinkage through isopeptide bonds at peptidyl glutamine and lysine residues or incorporation of small molecular weight primary amines, usually polyamines, in an irreversible, calcium dependent reaction. in several instances the expression of transglutaminases is regulated at the transcriptional level. these enzymes help in maintaining structural integrity of tissues intervening in wound repair and in cellular homeostasis at the levels of cell activation, receptor signaling, cell proliferation, differentiation and death. these general roles involve bis(guanylhydrazones) are a class of compounds known to interfere with the metabolism of polyamines by virtue of their ability to inhibit s-adenosylmethionine decarboxylase (samdc), a key enzyme of polyamine biosynthesis. this property has made them useful tools to study the biological functions of these compounds. a curious feature of bis(guanylhydrazones) is their structural relationship with two molecules involved in polyamine biosynthesis, namely spermidine and s-adenosylmethionine. the methylglyoxal derivative of bis(guanylhydrazones), mgbg, has been actively studied both in animal and plant systems. in the present work the male pollen from actinidia deliciosa has been utilized to investigate the role of polyamines on the pollen tube growth. the effect of several bis(guanylhydrazones) was tested on pollen germination, length of pollen tube, levels of free and conjugated polyamines and samdc activity. all bis(guanylhydrazones) tested (glyoxal-bis-guanylhydrazone, gbg, methylglyoxal-mgbg, methylpropylglyoxal-mpgbg, ethylmethylglyoxal-emgbg) inhibit pollen germination and their effect is dose-dependent. a clear reduction of spermidine, both in free and conjugated form, was observed, as well as a pronounced decrease in samdc activity. these results suggest that the mechanism by which bis(guanylhydrazones) reduce the germination of kiwi pollen is related to their effect on spermidine biosynthesis. molecules structurally related to polyamines ( , diaminooctane, , -diaminononane, , -diaminodecane) and other inhibitors of their metabolism (cyclohexylamine, cha) are also tested on kiwi pollen germination. n. bagni , d. bertoldi , , e. candioli , l. martinelli , and a. tassoni istituto agrario, san michele a/adige, and dipartimento di biologia, università di bologna, italy in the frame of the study aiming to enlighten developmental programs during regeneration in grapes, polyamine content (free and conjugated to hydroxycinnamic acids) and biosynthetic enzyme activities were assayed during somatic embryogenesis. aliphatic polyamines are growth regulators affecting plant growth and development both in vivo and during in vitro cultures, being involved in several morphogenic processes related ti their action in cell division. the study was conducted on samples of callus, embryogenic callus, embryo at different stages and plantlets of vitis vinifera brachetto and chardonnay cultivars induced from anthers and ovaries. polyamine content (putrescine, spermidine and spermine) free plus conjugated to percloric acid soluble fraction, referred to unit, was higher in the cv. brachetto than in the cv. chardonnay, and reached the higher levels in the fullydeveloped embryo stage. besides, ornithine decarboxylase activity resulted higher than arginine decarboxylase and during multiple catalytic processes: the receptor signaling activity (demonstrated only for isoenzyme ) is related to an intrinsic gtp-ase activity of type transglutaminase; the processes leading to control of cell proliferation, differentiation and death are mainly related to the protein crosslinking activity, while the cell activation is tentatively considered dependent on the polyamidation of endogenous proteins at glutamine residues. the knowledge on this last aspect lies far back in comparison to the other roles of transglutaminases and requires further accurate investigation, which must further extend to the role of the enzyme in human pathology. the examination of polyamine metabolism at the present time suggests that vitamin b is implicated in polyamines metabolism. literature data speak that spermine and spermidine stimulate activity of cobalamin-dependant methionine synthase, the enzyme that catalyses the recycling of homocysteine to methionine; polyamines inhibit methionine adenosyltransferase. beside the wellknow significance of vitamin b , in transmethylation reaction, the significance of ,-deoxyadenozyl cobalamin, except the conversion of methylmalonyl-coa to succinyl coa, is not well elucidates. methionine as s-adenosylmethionine (sam) is essential amino acid for polyamine biosynthesis. sam has frequently usage in treatment of liver diseases. according the mentioned facts the aim of our experiments is to exanimate the significance of application of vitamin b alone and altogether with methionine to rats without and with experimentally induced cholestasis. our preliminary results speak about the disturbance of polyamine metabolism in hepatic tissue of rats with cholestasis. application of methionine alone increases the amount of polyamine in rat liver tissue, in-group without cholestasis and with bile duct obstruction. the animal treatment with cobalamin has higher amount of polyamines and lower activity of polyamine oxidase in liver tissues in both groups. the effects of vitamin b may be in direct relation with the formation of ,-methylthio deoxyadenosine (mta), the by-product of spermidine and spermine biosynthesis. the explanation the exact roles of vitamin b in polyamine metabolism of liver tissue need the futher investigations. department of molecular genetics, the weizmann institute of science, rehovot, israel exposure of mouse myeloma cells that massively overproduces ornithine decarboxylase (odc) but not of parental cells to ornithine results in a massive increase in the intracellular concentration of putrescine, followed by rapid cell death. the treated odc overproducing cells display fragmented nuclei, chromatin condensation and an oligonucleosome-size dna "ladder"; consequently, their death can be described as apoptosis. the apoptotic process induced by the accumulated putrescine involves the release of cytochrome c from the mito-chondria, activation of caspases cascades demonstrated by the cleavage of caspase- and parp, a substrate of caspase- . the general inhibitor of caspases, bd-fmk, effectively inhibited parp cleavage but failed to inhibit cell death. the intracellular ca ϩ chelator bapta/am and the antioxidant bha inhibit parp cleavage. however, only bapta/am inhibit the induction of cell death. it seems that bha subverted the death into caspase independent pathway. treatment with bapta/am did not interfere with the accumulation of putresine following ornithine treatment, suggesting that the accumulated putrescine induces the elevation in the concentration of intracellular ca ϩ which then activates the apoptotic process. the dominant anti-apoptotic effect of bapta/am over egta suggests that internal stores are the main source of the elevated ca ϩ , but that putrescine is also capable of inducing influx of extracellular ca ϩ . extensive small intestine resection results in the loss of absorptive surfaces, acceleration of intestinal transit and, as a consequence, in malnutrition, weight loss, diarrhoea and other complications of short bowel syndrome. the availability of human recombinant growth hormone rgh and its stimulatory effects on gut growth suggested its use in the treatment of short bowel syndrome. the trophic response of gi tract epithelium to hormones such as growth hormone is mediated by polyamines, which are vital in cell proliferation. this study was undertaken in rats to: / evaluate the effects of rgh by monitoring polyamine and amine metabolism parameters in the adapting short bowel and / determine whether erythrocyte (rbc) polyamine concentrations reliably reflect the proliferative activity of the remaining bowel. seventy per cent resection of the small intestine of wistar rats was performed under ether anesthesia leaving equidistant lengths of bowel from pylorus and ileocecal valve. recombinant human gh ( . iu, s.c., saizen, serono, switzerland) was administered once daily for or days, to randomly selected rats on the second postoperative day. animals were sacrificed , and days after the operation. enzyme activities were measured with radioassays or fluorimetry. polymines were determined as dansyl derivatives by hplc/fluorimetry. gh treated animals had significantly higher intestinal odc and sat and lowel dao activities; higher (non-significant) mucosal growth index and polyamine concentrations than in untreated counterparts on th postoperative day. thereafter the two groups did not differ in the investigated parameters. rbc polyamine concentrations were higher in operated verses control rats; rgh treatment had no significant effect. however, rgh treatment significantly reduced hepatic mao a and b activities. our results suggest gh accelerated the adaptive growth of the bowel remnant. they justify use of erythrocyte polyamine concentration measurement as the marker of small bowel proliferative activity. however, side-effects of this treatment must be considered. tissue transglutaminase (ttg) activity has been evaluated in different neural tissues, such as brain, spinal cord and peripheral ganglia, and appears to be expressed in cerebellar granule cells (cgn) as well as in astrocytes. the role of ttg in neuronal functioning is likely to be quite complex. other than the role during development, significant changes of enzyme activity have been evaluated in different neurodegenerative conditions. it is well known that nmda receptor activation may be able to trigger excitotoxicity. the nmda-induced injury is mainly associated to ion influx and subsequent calcium overload. the effects of nmda application to both, cerebellar granule cells and glial cell cultures, have been assessed. in cgn, ttg activity increased rapidly after a brief stimulation with µm nmda, whereas in glial cell cultures, high levels of enzyme activity was obtained after incubation of h in presence of the same concentration of nmda. such results rule out the possibility that excitotoxicity can modify numerous proteins making them better substrates of ttg, and this could contribute to enhanced ttg-modifications of proteins in response to excitotoxicity. the pote protein can catalyze both uptake and excretion of putrescine. the km values of putrescine for uptake and excretion (putrescine-ornithine antiport) are . µm and µm, respectively. amino acid residues, cys , trp , glu , trp and tyr are strongly involved in both activities, and that glu , tyr , cys , cys , cys and glu are moderately involved in the activities. mutations of tyr , trp and trp mainly affected uptake activity, indicating that these amino acids are involved in the high affinity uptake of putrescine by pote. mutations of lys and tyr mainly affected excretion activity, indicating that these amino acids are involved in the recognition of ornithine. the putrescine and ornithine recognition site on pote was found to be located at the cytoplasmic surface and the vestibule of the pore consisting of twelve transmembrane segments. the cadb protein has % sequence homology with pote protein. cadb can catalyze both uptake and excretion of cadaverine. the km values of cadaverine for uptake and excretion (cadaverine-lysine antiport) are µm and µm, respectively. it was found that two glutamate residues (glu and glu ) and four tyrosine residues (tyr , tyr , tyr and tyr ) are involved in the both activities. the difference of the substrate recognition site on pote and cadb is discussed. a. lentini, b. provenzano, and s. beninati department of biology, laboratory of cell biochemistry, university of rome "tor vergata", rome, italy tissue transglutaminase (ttg, e.c. . . . ) is a protein cross-linking enzyme which catalyzes an acyl transfer reaction where the carboxamide group of a peptide-bound glutamine is the acyl donor, and a lysine residue the acyl acceptor. polyamines may act as acyl acceptors, leading to the formation of mono-and bis-(γ-glutamyl)derivatives. we provided evidence that ttg activity is directly associated to differentiation markers, and inversely related to cell proliferation and invasion. we have shown the in vivo reduction of experimental melanoma metastasis by i.v. injection of a plasmid (psg ) carrying the ttg gene sequence to c bl /n mice. tumor cell metastatization requires specific interactions with subendothelial basement membrane (bm) and migration through the endothelial wall, allowing the colonization of the target tissue. therefore, the investigation on the possible mechanisms responsible for ttg effects is focused on the posttranslational modification of bm proteins. we detected that "matrigel", a tumor-derived complex of bm proteins, modified with polyamines after ttg catalysis, reduces both melanoma cell adhesion and invasion in an in vitro metastatic assay. similar results were obtained using polyamines conjugated to laminin, one of the major bm components, as unique substrate. our findings suggest that the increase of bm proteins conjugated to polyamines may be responsible for impairments of the invasive properties of melanoma cells. we demonstrated that interferon-α (ifnα) induces apoptosis in human epidermoid cancer cells. tissue transglutaminase (ttgase) is an enzyme involved in the regulation of apoptosis through the inactivation of some cell components. among these eukaryotic initiation factor- a (eif a) is peculiar because its activity is modulated by the formation of the amino acid hypusine. recently, we found that growth inhibition induced by ttgase is paralleled by reduced hypusine levels. here we report the effects of ifnα on the apoptosis, ttgase modulation and eif a activity in human epidermoid lung h cancer cells. we have found that h exposure to , iu/ml ifnα induces % growth inhibition and % apoptosis in h cells. moreover, ifnα induced a -fold increase of ttgase activity and expression that already occurred after h of exposure to the cytokine. this effect was paralleled by a . -fold enhance of ttgase mrnas. ifnα induced also a % increased eif a expression while an about % decrease of hypusine levels was observed. increased ttgase activity was paralleled by a decrease of hypusine content and of eif a activity. therefore, ifnαinduced apoptosis could occur through an increase of ttgase activity and the mechanism by which ttgase regulates biological functions can be the reduction of eif a activity. adometdc deficient mice k. nishimura , f. nakatsu , , k. kashiwagi , h. ohno , t. saito , and k. igarashi graduate school of pharmaceutical sciences, chiba university, chiba, graduate school of medicine, chiba university, chiba, and cancer research institute, kanazawa university, kanazawa, japan the amd gene encodes s-adenosylmethionine decarboxylase (adometdc) that is one of the key enzymes of polyamine biosynthesis. to examine the physiological role of polyamines, we performed the targeted disruption of the gene in mice to generate spermidine-and spermine-free mice. although the level of adometdc mrna decreased by % in amd ϩ/Ϫ mice, adometdc activity reduced only by % and spermidine and spermine contents did not change significantly. they showed normal phenotype and life span. to obtain amd Ϫ/Ϫ mice, we intercrossed amd ϩ/Ϫ mice and determined the genotype of the resulting offspring. however, we could not obtain any amd Ϫ/Ϫ mice from heterozygous intercrosses over. amd Ϫ/Ϫ embryos died early in development, between e . and e . days post coitum. in culture of blastocysts at e . , the shapes of all cell lines were normal, but amd Ϫ/Ϫ cells appeared to arrest the cell proliferation at day after the onset of cell culture. the arrest of amd Ϫ/Ϫ cell proliferation was rescued by addition of spermidine. these data indicated that the lethal phenotype of amd Ϫ/Ϫ mice was caused by growth retardation by polyamine depletion at early developmental stage. the formation of active species such as h o and aldehydes during the oxidative deamination of biogenic amines by amine oxidases (ao) suggests for these enzymes a key role in cellular processes. the ability of bovine serum amine oxidase (bsao) to oxidase free amino groups of lysozyme and ribonuclease a has been observed indicating a possible ao involvement in the post-translational protein modification. furthermore, bsao inhibition by h o formed during substrate oxidation under limited turnover conditions was demonstrated, which may be relevant to cellular physiopathology. we have also observed that some inhibitors of mitochondrial amine oxidases (mao) protected human melanoma cell line (m ) against apoptosis. the protection by catalase of mao-substrates induced membrane permeability transition was also obtained in isolated rat liver mitochondria, thus confirming a role of mao-derived h o in apoptosis. enrichment in ao activity by treatment with vegetal ao has been obtained in a erythroleukemia cell line (k ), substaining the possibility to modulate the intracellular ao activity. an antiarrhythmic and cardioprotective effect of bsao has been also observed on isolated rat heart in reperfusion; a protective effect during anaphylaptic crisis has been shown "in vivo", thus suggesting aos as a possible therapeutic agents. tetrakis( -aminopropyl)ammonium, a unique polyamine produced by an extreme thermophile, stabilizes nucleic acids at high temperature t. oshima and y. terui department of molecular biology, tokyo university of pharmacy and life science, hachioji, tokyo, japan an extreme thermophile, thermus thermophilus, produces tetrakis( -aminopropyl)ammonium; a novel polyamine containing a quaternary ammonium nitrogen. to clarify the roles of the unique polyamine in thermophily, the effects of tetrakis( aminopropyl)ammonium on biochemical reactions related to nucleic acids have been investigated. the unique polyamine stabilized both double and single stranded dnas and rnas. tm of a double stranded dna was raised by °c by the addition of . mm of tetrakis( -aminopropyl)ammonium. at around the boiling temperature of water, depurination of dna takes place. other long polyamines produced by the thermophile such as caldopentaamine also stabilized dnas and rnas. we found that tetrakis( -aminopropyl)ammonium prevents depurination most effectively. tetrakis( aminopropyl)ammonium activated the protein biosynthesis catalyzed by a cell-free extract of the thermophile at high temperature. the effects of this unique polyamine on dna and rna polymerases are also being investigated and the results will be presented. tissue transglutaminase (ttg) catalyses the cross-link formation between glutamine (q) residues and nh -donor molecules present in the cells (polyamines, lysine-donor proteins). recently, it has been correlated to neurodegenerative disorders characterised by polyglutamine (q n ) expansion, like huntington's disease. studies carried out on cell extracts revealed that glyceraldehyde -phosphate dehydrogenase (gapdh) was found covalently linked to q n domains. however, to date no structural data are available to solve the issue of which residues of gapdh are substrates for ttg. by coupling classical protein chemistry procedures and mass spectrometric techniques we achieved this goal by using as ttg substrates the substance p, an -aa peptide bearing the simplest q n domain (q ), and polyamines of different size and shape as q-and nh -donor, respectively. in the present study we report that out of the lysines present in gapdh only three are sites of ttgasedependent cross-link formation in vitro. moreover, to characterize the ttg catalysed cross-link between gapdh and polyq protein we used a synthetic q -peptide as ttg substrate in the catalysed reaction with polyamines. we found that any q residue is a potential ttg substrate, no matter the specific position in the sequence or the steric hindrance of the specific amine under investigation. cjf inserm - , institut contre les cancers de l'apareil digestif (ircad), strasbourg, france as soon as the key role of odc in polyamine metabolism was recognised, it became the major target for selective inhibi-tion. s. harik presented in the first potent odc inhibitor, α-hydrazino ornithine. although efforts continued until today, with the aim to improve odc inactivation, -(difluoromethyl)ornithine (dfmo) remained the most important compound among all polyamine-directed drugs. a known anti-leukaemic drug, methylglyoxal-bis(guanylhdrazone), was recognised early on by g. williams-ashman and his collaborators as an inhibitor of adometdc, the other highly regulated biosynthetic decarboxylase, and served as matrix for more recent developments. in the course of the years selective inhibitors for all enzymes involved in polyamine biosynthesis and degradation were synthesised. moreover, a series of polyamineuptake inhibitors were reported. however, only some of these numerous compounds reached a stage above evaluation as growth inhibitors of cancer cells. owing to the sophisticated homeostatic regulation of the polyamines in cells and organs by de novo synthesis, degradation, uptake and release, and due to the fact that exogenous polyamines (i.e. gut polyamines) can be utilised by the vertebrate organism, the efficacy of selective enzyme and uptake inhibitors remained modest in cancer therapy. the fact the dfmo became the most important drug for the therapy of west and central african sleeping sickness relies on differences of vertebrate and parasite biochemistry. a novel approach, initiated by carl porter, involved the design and synthesis of structural analogues of spermidine and spermine, which do not share the growth-promoting effects of the natural polyamines. a very large variety of homologues, mostly of spermine, with different alkyl-substituents on the primary amino groups, have been studied systematically with regard to their ability to alter enzyme and polyamine patterns, and to inhibit cell growth. in addition polymine-like chains with interposed heteroatoms ( , s, si etc.), and analogues with rigid aliphatic chains (due to inbuilt double and triple bonds, or of small rings) have been explored. the structural analogues either mimic regulatory functions of the natural polyamines, and thus lead to the depletion of endogenous pools of putrescine, spermidine and of spermine, or they prevent growth effects of the natural polyamines by displacing them from functionally important binding sites. the later type may be considered as polyamine antagonists. the actual drugs usually exhibit to some extent polyamine mimetic and antagonist properties. at present several polyamine analogues are in clinical trial. however, after more than years of active research, a polyaminerelated anticancer drug is still not available. one may conclude from this fact that the polyamines are an inappropriate target for cancer treatment. however, it is more likely that polyamine metabolism is a difficult target, because the differences between normal and cancer cells are mainly of quantitative nature. moreover, numerous mechanisms have developed in the course of evolution, which enable the vertebrate organism to prevent lethal polyamine losses. nine novel chemically modified polyamine (pa) analogs were evaluated for their ability to inhibit the pa biosynthesis in rat hepatoma g- cell-free system as well as the growth of caov tumor cells. the final concentration of oxy-and aminoadenosine pa analogs or two uracils modified pa analogs were . mm in the reaction mixture. bis(uracilyl)-analogs and -( -oxyethyl)ami-no- --d-xylofuranosyladenine supressed pa and putrescine synthesis and in the same conditions were more effective than dl-α-difluoromethylornithine (dfmo) -strong specific inhibitor of ornithine decarboxylase (odc). the other adenosine modified compounds could act both as activators of odc and inhibitors both diamine and polyamine oxidase activities in regenerating liver test system. in contrast to those mentioned above two uracils modified agents as well as dfmo were able to inhibit odc and to increase the rate of oxidative deamination of pa in the same system. thus bis(uracilyl) pa analogs were the most active and may be useful for further investigation as substances having potential antitumor and antiproliferative properties. several studies concerning the periodontal status in adult and adolescent patients treated with fixed ni-ti archwires have been performed, but until now it is not yet available any information about the influence of patient age on gingival tissue responses to ni-ti alloy. recently, researches by us demonstrated that the prolonged use for over months of ni-ti appliances may contribute to local pathological proliferative processes early detectable only through salivary polyamine concentration increase. although other data from our laboratory showed that salivary polyamine amounts are age and sex-independent, nothing is known about the influence of the age on salivary polyamine content m subjects wearing ni-ti appliances. eighty patients, under orthodontic treatment for months, were divided into four groups: the pre-, the mid-, the late-and the post-pubertal. salivary polyamine concentrations were determined by hplc. only the late pubertal group revealed a significant increase in both the spermine and spermidine content, while the other groups showed no modification. the results suggest that gingival pathological responses to a long-term appliance's use may be related to the endocrine modifications that occur in the late-pubertal age. sexual hormones appear to be in synergy with ni-ti alloy in promoting proliferative activity of gingival cells. the effects of polyamines on the synthesis of various σ subunits of rna polymerase were studied to determine how polyamines influence the functional specificity of transcription using western blot analysis. synthesis of σ was stimulated . -fold and that of σ was stimulated . -fold by polyamines, whereas synthesis of other σ subunits was not influenced by polyamines. stimulation of σ synthesis by polyamines occurred at the level of transcription. since our hypothesis is that polyamines regulate macromolecular synthesis mainly at the translational level, we searched for a target protein, related to the polyamine stimulation of σ synthesis, whose translation is altered by polyamines. stimulation of σ synthesis was due to an increase in the level of camp, which occurred through polyamine stimulation of the synthesis of adenylate cyclase at the level of translation. polyamines were found to increase the translation of adenylate cyclase mrna by facilitating the uug codon-dependent initiation. analysis of rna secondary structure suggests that exposure of the shine-dalgarno (sd) sequence of mrna is a prerequisite for polyamine stimulation of the uug codon-dependent initiation. antitumor quinones are approved for clinical use and others antitumor quinones are in different stages of clinical and preclinical development. the efficiency of the quinonic compounds in inhibiting cancer cells growth is believed to stem from their participation in key cellular redox mechanisms with consequent generation of highly reactive oxygen species (ros). the ros is turn modify and degrade nucleic acids and proteins within the cells. recently, quinonic drugs were attached to the neurodecapeptide lh-rh and evaluated as potential drugs in the treatment of different tumours. we have synthesized several series of n-quinonyl amino acids in which five ω-amino acids are attached to p-quinones with different values of redox potentials. the attachment was made via michael-like reductive addition of the amino acids to the quinonic ring or via substitution of a chlorinated atom. the n-ω-quinonyl amino acids were characterized as to their ability to form semiquinone anion radicals by epr and cyclic voltammetry technique. the preparative methods, the redox potentials as well as the physical and spectral data ( h-nmr, ir, uv-vis and hrms) of these n-ω-quinonyl amino acids will be presented. the de novo design of biologically active peptides and proteins, mostly has involved consideration and design of backbone conformations (secondary structures) such as α-helix, -sheets, -turns, etc. (η/ψ space). however, for many bioactive peptides and proteins, especially those critical for information transduction such as neurotransmitters, hormones, antigens, neurocrines, etc. molecular recognition via side chain moieties is of paramount importance. thus far, the specific three dimensional orientations of side chain groups ( angles; chi space) in terms of biological activity has received only modest attention. in part this may be due to the energetics of chi space compared to ramachandran space. in order to overcome the current limitations of evaluating the importance of chi space in critical biological functions related to disease and behavior, we have designed amino acids with novel structures and unique constraints in chi space ( , , etc.), with special attention to their ability to mimic the chi space of native proteins and peptides. we have developed novel and simple asymmetric synthetic methods for such amino acids, often with ees greater than %. incorporation of these novel amino acids into bioactive polypeptide neurotransmitters has provided ligands with unique biological activities that effect unique behaviors including feeding, sexual, pain, and addictive behaviors. (supported by grants from the usphs and nida.) protein technology, wallenberg laboratory ii, lund university, lund, sweden we describe a method for comparative quantitation and de novo peptide sequencing of proteins separated either by standard chromatographic methods or by one and two-dimensional polyacrylamide gel electrophoresis. the approach is based on the use of an isotopically labelled reagent to quantitate (by mass spectrometry) the ratio of peptides from digests of a protein being expressed under different conditions. the method allows quantitation of the changes occurring in spots or bands that contain more than one protein, and has a greater dynamic range than most staining methods. since the reagent carries a fixed positive charge under acidic conditions and labels only the n-terminal of peptides, the interpretation of tandem mass spectra to obtain sequence information is greatly simplified. the sequences can easily be extracted for homology searches instead of using indirect mass spectral based searches and are independent of post-translational modifications. dehydroamino acids and their derivatives play important roles as constituents of various natural products and as synthetic intermediates for the preparation of optically pure amino acids. a large number of amino acid derivatives containing a pyrazol- -yl, isoxazol- -yl and other heterocyclic moieties has been prepared as potential agonists or antagonists for central glutamate receptors in connection with (r,s)- -amino- -( hydroxy- -methylisoxazol- -yl)propanoic acid (ampa), a bioisostere of (s)-glutamic acid. -hetaryl-α, -didehydroalanines might be considered as conformationally constrained ampa analogs and might be potential candidates for the synthesis of novel types of ampa analogs, for example, via their hydrogenation. compounds containing h-pyran- -one ring are also very useful synthons in selective synthesis. recently we have shown their use for the preparation of (e)-α, -didehydroα-amino acid derivatives containing a pyrazolyl moiety (vraničar l, polanc s, kočevar m ( ) tetrahedron : ). as a continuation of our investigation in this field we report here a detailed study of the transformation of h-pyran- -one derivatives with hydroxylamine ( , x ϭ o) and various hydrazines ( , x ϭ nr ) towards novel types of (e)-and (z)α, -didehydroamino acid derivatives . in most cases, the reactions were performed under basic conditions in a mixture of ethanol and pyridine. depending on the substrate and the reagent used the reaction could be controlled to give either (e)-or (z)-isomers; in some cases decarboxylation to the corresponding enamines also occured during the reaction course. some attempts to hydrogenate compounds towards α-amino acid derivaties by homogeneous or heterogeneous catalysis were also performed. analogs of endomorphin and were prepared to investigate the effect of the positional and c-terminal amide replacements and modifications on the biological activity. modifications in position and were studied. in position several hydroxy-and serine related amino acids were incorporated, whereas in position the amide bond was replaced by hydroxymethyl and allyl group. protected peptide derivatives were synthesized on chlorotrityl resin and further transformed to the corresponding derivatives in solution phase. among the analogs tested, in in vitro tests the most effective compound found was d-ser -endomorphin Ϫ . quite surprisingly, the partial agonist/antagonist properties of the derivatives in receptor binding and g-protein stimulation tests have been shown behave differently. the differences in efficacy and receptor binding properties of the compounds may explain the discrepancies between the in vitro and receptor binding tests. we have been assessing the possible applications of substituted h-pyran- -ones containing α, -didehydroamino acid unit in their structure as dienes in [ ϩ ]-cycloaddition reactions. as dienophiles we have been using different acetylene derivatives as well as n-phenylmaleimide and maleic anhydride. as it is evident from the structure of h-pyran- -ones upon the cycloaddition of acetylene derivatives the first intermediate formed ( ) still contains the carbon dioxide bridge. in many cases easily expels co and substituted benzene derivative is produced. when the alkenes are used, the first part of cycloaddition is the same as when acteylene derivatives are used, but after the extrusion of co from the adduct there are two possible paths: so formed cyclohexa- , -diene ( ) is either aromatized into benzene derivative ( ) or it acts as another diene with favourably positioned double bonds and unusual double cycloadducts ( ) are formed. since co -containing adducts are thermally unstable it is advantageous to use high pressure techniques. with the acetylene derivatives we have not been able to isolate co -containing adducts ( ), while with alkenes we have isolated, depending on the structure pattern of the compound , all three types of products: aromatized , co containing and double adducts . especially the type is suitable for further transformations into other heterocycles containing amino acid moiety. research group of peptide chemistry, hungarian academy of sciences, budapest, hungary among the opioid receptors family, the cloning of µ, k and δ receptors was followed by another member, named lc or orl . searching for an endogenous ligand for this receptor resulted in successful identification of a peptide (fggft garksarklanq) called noc or ofq. in vitro and in vivo studies have demonstrated that noc mediates a variety of biological actions. results from structure-activity experiments suggest that the whole sequence of noc is not required for binding to the lc receptor and for full biological activities. noc( - )-oh seem to be the minimum and essential sequence for good interaction with the receptor. this neuropeptide, similarly other peptides, are unresisting for enzymatic degradation and the releasing metabolites are very weakly active or inactive. some previous experiments refer to that the c-terminal amidation may protect the peptide from degradation. we purposed to synthesize carbamoyl analogues of noc( - )-nh , hoping that these derivatives retain the ability to bind lc receptor and are resistant against biological degradation: phe-nh-co--ala-noc( - )-nh phe--ala-nh-co-phe-noc( - )-nh phe-gly-nh-co--hphe-noc( - )-nh the first step in the synthesis of the carbamoyl analogues was the preparation of the building block [r-co-nh-co-nh-hc(rЈ)-cooh] by the classical method and then it was incorporated into the peptide by solid phase peptide synthesis. [ nonproteinogenic amino acids and their derivatives are valuable compounds from their pharmacological and biochemical effects. they can be used also in synthesis of peptides, as biomarkers, as the ligands in catalitically active transition metal complexes and so on. it is possible to prepare such amino acids by asymmetric hydrogenation of their prochiral precursors. however high enantioselektivities was reached only in the case of chiral phosphine-rhodium catalysts. recently we showed that high diastereoselectivity in the hydrogenation of linear dehydrodipeptides may be achieved over achiral catalyst in the catalytic system substrate -salts of ca ore mg -pd/c due to formation of dehydrodipeptides complexes with ions Ñà ϩ or mg ϩ and hence increasing of the conformational rigidity of substrates. this phenomenon may as well happen in other dehydrodipeptides, containing nonproteinogenic amino acids. among unnatural amino acids those bearing heterocyclic rings have attracted considerable attention due to the possibility of the heteroatoms participation in coordination with ions of metals. we have received some n-acyldehydrodipeptides, containing in the prochiral unit of dipeptides nonproteinogenic dehydroamino acids. all this n-acyldehydrodipeptides form in alcohol solution complexes with cax and mgx where one metal ion binds together several (up to ) substrate molecules. this kind of complexation leads to the increase of conformational rigidity and to the diastereoface shielding of cϭc bond. moreover the combination of cations (ca ϩ or mg ϩ ) and anions (x) and the sequence of their mixing with a substrate determine the assembly inside complex particles and hence the sign and degree of asymmetric induction. indeed hydrogenation of these complexes formed in situ over achiral heterogeneous catalyst (pd/c) gives two diastereomers of corresponding n-acyldipeptides with the substantial increase of the reaction diastereoselectivity (up to %). in living cells, glutamine represents one of the main storage forms of nitrogen and is a major physiological source of ammonia for the biosynthesis of aminoacids, aminosugars, purine and pyrimidine nucleotides and coenzymes. glutamine-dependent amidotransferases perform nitrogen transfer from the amide group of glutamine to various electrophiles. when the latter is fructose- p, the product of the reaction catalysed by glucosamine- p synthase is d-glucosamine -phosphate, a structural building block of peptidoglycane (bacteria) and of chitin and mannoproteins (fungi). fluorinated analogues of glutamine are expected to interfere with this biological process due to the strong electron withdrawing effect of fluorine atom (without significant steric consequence), inducing modulation of binding and/or electronic properties. these compounds might therefore behave as reversible or irreversible active site-directed enzyme inhibitors. synthesis of optically active from d-serine will be described and first results in the biological evaluation on glucosamine -phosphate synthase will be included. o. melnyk , d. bonnet , e. loing , l. bourel , and h. gras-masse -umr , -sedac-therapeutics, biological institute of lille, france lipopeptides, owing to their ability to cross passively the cell membrane or biological barriers, are unique tools for the intracellular delivery of bioactive peptides. the structure of the lipophilic moiety is known to have a profound effect upon the interaction with the membrane and its alteration. the stepwise solid phase synthesis of lipopeptides is limited by the necessity to perform a complex rp-hplc purification following the cleavage and deprotection step. in addition, the harsh conditions used during the final acidolysis procedure does not allow the introduction of unsaturated or sensitive fatty acids. to speed up the access to large lipopeptides modified by various fatty acid moieties or cholesterol derivatives, we have designed novel synthetic methods which involve the chemoselective reaction of fully deprotected and purified hydrazinopeptides with fatty acid succinimidyl esters or glyoxylyl derivatives. application of these methodologies to the c-terminal - portion of interferon (ifn)-γ allowed the selection of the optimal lipopeptide ifn-γ agonist, as determined by its ability to induce the expression of surface mhc-ii molecules through interaction with the intracellular components of ifn-γ receptor. graduate school of science, osaka city university, osaka, japan glutamate receptors in mammalian cns are implicated in the construction of memory and early learning as well as in the pathogenesis of neuron damage to cause various neuronal diseases. in recent years, we have studied the conformational role of l-glutamate when it binds to the receptors through the synthesis of l- -(carboxycyclopropyl)glycines (ccgs) and their related analogs. the works have demonstrated that not only the receptors require a specific conformation of l-glutamate, but also these analogs can be used as important tools for the neuropharmacological research. among them, dcg-iv, a Јsubstituted analog of ccg-i, is used as a potent and selective agonist of mglur . as an extension of these works, next program was focused on the synthesis of α-substituted glutamate analogs which would enable to develop potent and subtype-selective ligands for mglurs and transporters. α-alkoxymethylglutamate and ly and its c epimer were chosen for the synthetic targets, since the former slightly restricts the glutamate conformation to an extended form and the latter rigidly fix to an extended or a folded form on its bicyclo[ , , ]hexane skeleton. the key to the synthesis was a stereoselective construction of the quarternary carbon center, which was efficiently performed based on an asymmetric version of the strecker synthesis. details of the synthesis and their neuropharmacological activities will be described. using a genetically modified organism a broad variety of linear unsaturated amino acids are now accessible in enantiomerically pure form via this methodology, which can be used as starting materials for the synthesis of highly functionalized pipecolic acid derivatives. these compounds can be used to restrict conformations in polypeptides or can serve as scaffolds in synthesizing libraries for drug discovery. the synthetic approach involved both a pd-catalyzed amidopalladation reaction of alkoxy-allenes, in which the nh is added across one allene double bond and a ruthenium catalyzed ring closing metathesis step, to form a benzyloxypipecolic acid. further reaction of this n-sulfonyl-iminium-ion precursor with a nucleophile results in the formation of cis-substituted pipecolic acids. due to the unique electronic properties of fluorine, incorporation of α-fluoroalkylated amino acids is a new approach to design biologically active peptides with increased metabolic stability and defined secondary structure and provides a powerful nmr label for spectroscopic investigations. the application of proteases especially for cn-ligations is an attractive alternative to chemical methods, because the enzymatic formation of peptide bonds is highly regio-and stereospecific and, therefore, does not require large efforts to protect side chains of trifunctional amino acids. recently, the enzyme-catalyzed incorporation of α-fluoromethyl amino acids into the p , p and p Ј-position (nomenclature according to schechter and berger) of peptide fragments has been successfully performed. carboxypeptidase y was now shown to be suitable to catalyze the incorporation of α-trifluoromethyl alanine into the p position of peptides. furthermore, the general applicability of the substrate mimetic concept in enzymatic peptide synthesis was expanded to the transfer of c-terminal α-fluoroalkyl substituted amino acids. generally, each trifluoromethyl-and difluoromethyl amino acid guanidinophenyl esters can be applied as acyl donor in trypsin and chymotrypsin catalyzed peptide bond formation independently of the acyl moiety and the natural enzyme specificity, respectively. via these two approaches, incorporation of αfluoroalkylated amino acids into the p position of peptides using enzymatic methods was successfully applied for the first time. this investigation was performed in search of new Јdeoxynucleoside analogues modified at Ј-and Ј-positions with amino acids and possessing antiviral activity. substrate mimetics strategy: an efficient approach to protease-catalyzed peptide ligation n. wehofsky and f. bordusa , max-planck-society, research unit "enzymology of protein folding", halle, and institute of biochemistry, university of leipzig, germany two main drawbacks seriously restrict the synthetic value of proteases as reagents in peptide fragment coupling: ( ) native proteolytic activity and, thus, risk of undesired peptide cleavage; (ii) limited enzyme specificities restricting the amino acid residues between which a peptide bond can be formed. the latter can be overcome by the use of substrate mimetics. contrary to common acyl donors, substrate mimetics bear a binding site specific ester leaving group instead of having a specific amino acid moiety at the c-terminus of the acyl residue. this replacement mediates the acylation of the protease by nonspecific acyl residues. deacylation of the artificial acyl enzyme intermediate by the amino component added results in peptide bond formation regardless of the primary specificity of proteases enabling nonspecific coded and noncoded amino acid derivatives and even non-amino acid-derived acyl moieties to be coupled. the successful application of these artificial substrates for model peptide ligations catalyzed by the argspecific trypsin, the glu-specific staphylococcus aureus strain v protease (v protease), and α-chymotrypsin, which is specific for aromatic amino acid moieties, will be demonstrated. new development in the tritium labelling of peptides and proteins using solid state catalytic isotopic exchange with spillover-tritium yu. a. zolotarev , a. k. dadayan , b. v. vaskovsky , and n. f. myasoedov institute of molecular genetics, russian academy of sciences, and shemyakin-ovchinnikov institute of bioorganic chemistry, russian academy of sciences, moscow, russia the reaction of high temperature solid state catalytic isotope exchange (hscie) of hydrogen in peptides and proteins with spillover-tritium was studied. the reaction ability of amino fragments in hscie was shown to depend both of their structure and on the availability and the mobility of the polypeptide chain. [ h] peptide analysis using h nmr spectroscopy was carried out, and the modified fragment [ h]actc - (met-glu-his-phe-gly-pro), with molar activity of ci/mmol and [ h] zervamicin iib (ac-trp-ile-gln-iva-ile-trh-aib-leu-aib-leu-hyp-gln-aib-hip-aib-pro-phl, where aib ϭ amino-isobutyric acid) with molar activity of ci/mmol was produced. the obtained preparations completely retained their biological activity. with the -galactosidase protein from termoanaerobacter ethanolicus as example, the interrelation between the protein's tertiary structure and the isotopic label distribution incorporated due to the hscie reaction was used. the labeled protein with the molecular mass of kda was brought to fragmentation by glu-proteinase. peptide fragments were separated by hplc and were identified by maldi mass spectrometry. a correlation between the position of the amino acid fragment in the protein tertiary structure and its reaction ability in the hscie reaction was obtained. data on the retention of thegalactosidase enzymatic activity in condition of tritium label introduction are supplied. taurine chloramine modulates cytokine production by peripheral blood mononuclear cells m. chorą z . y , e. kontny , j. marcinkiewicz , and w. maśliń ski institute of rheumatology, warsaw, and jagiellonian university, cracow, poland objective. proinflammatory cytokines are produced in a cascade fashion, where monocyte-derived tnfα and il- trigger production of il- and il- also in the other cell types. we reported recently that taurine chloramine (tau-cl) inhibits production of the latter cytokines in fibroblast-like synoviocytes. in present study the effect of taurine (tau) and tau-cl on tnfα, il- and il- production was examined. methods. peripheral blood mononuclear cells from healthy volunteers were stimulated with lps ( h) in the presence of tau or tau-cl ( - µm). cytokine production was measured in culture supernatants (secreted) and cell lysates (intracellular) using elisa. results. in lps-stimulated cells both secreted and intracellular il- and il- were inhibited by tau-cl with ic Ϸ µm and µm, respectively. however, tau-cl exerted dual effect on tnfα production, raising it slightly ( . times) at low ( - µm) while reducing it (ic Ϸ µm) at higher concentration. tau did not significantly affect cytokine production. tau-cl modulates proinflammatory cytokine cascade and eventually might down-regulate it when present at high (Ͼ µm) concentration. department of biology, division of general physiology, university of oslo, blindern, norway every living cell must deal with osmotic and hydrostatic pressure changes between its environment and its interior and counteract volume changes. swelling activated channels is one group of effectors in the cell membrane that is important in preventing excessive volume increases by releasing inorganic ions and organic solutes that include taurine. such channels are associated with several physiological processes, but little is known about their activation mechanisms. we have used a rat thyroid cell line (frtl- ) to investigate the activation of a swelling sensitive [ h]taurine efflux pathway. hypo-osmolality and thyrotropin (tsh, µm) increased transiently the rate coefficient for [ h]taurine efflux with a similar pattern of activation. the phosphodiesterase inhibitor -isobutyl- -methyl xanthine ( µm) increased the swelling activated efflux rate coefficient . times above the control level and the camp analogue dibutyryl-camp ( µm) activated the pathway. these results indicate that both swelling and tsh activation of the taurine efflux pathway are mediated by camp. other aspects of the signal transduction pathway will be discussed. based on the inclination of n-chloroamines to disproportionate, the endogenous bactericidal agent n-chlorotaurine (nct), mainly at ph Ͻ , is accompanied by n, ndichlorotaurine (ndct). since pure ndct could be synthesized as crystalline sodium salt, a first evaluation of its chemical and bactericidal properties was possible. ndct-na (melting point: - °c, decomp.) is very well soluble in water and poorly in ethanol where it can be recrystallized from. on storage the initial ph of the aqueous solution decreases which correlates with a decrease of oxidation capacity of . % per day, probably originated by the elimination reaction r-ch -ncl ae r-chϭncl ϩ h ϩ ϩ cl Ϫ as a first step. contrary to nct-na an immediate decomposition occurs when ndct-na comes into contact with undiluted dmso. in aqueous solution, however, ndct does not react with dmso. the bactericidal activity of mm ndct at ph and against the gram-positive bacteria s. epidermidiand two strains of s. aureus was the same as with equimolar nct though ndct bears twice the oxidation capacity. against the gramnegative bacteria e. coli, p. aeruginosa, and p. mirabilis, however, a significantly higher activity of ndct was observed at both ph. the mechanism of taurine chloramine inhibition of fibroblastlike synoviocytes growth e. kontny , m. kurowska , j. kowalczewski , i. janicka , j. marcinkiewicz , and w. maśliń ski institute of rheumatology, warsaw, and jagiellonian university, cracow, poland objective. in rheumatoid arthritis (ra) enhanced proliferation of fibroblast-like synoviocytes (fls) leads to hyperplasia of synovial membrane (sm). therapeutic approaches to inhibit an excessive growth of these cells are not satisfactory. thus, we investigated the effect of taurine (tau) or taurine chloramine (tau-cl) on ra fls growth. methods. fls isolated from sm of ra patients were stimulated for hours with rhpdgf or rhtnf-α. tau or tau-cl were added at - µm concentrations. cell proliferation was determined by incorporation of h-thymidine into dna. expression of proteins regulating cell-cycle progression or apoptosis, was estimated by western blotting. results. at µm concentration tau-cl inhibited (by Ϸ %) both pdgf-and tnf-α-triggered cell proliferation and similarly reduced expression of pcna (a cofactor for dna polimerase δ). however, tau-cl affected neither the expression of cell-cycle inhibitors (p , p ) nor anti-apoptotic bcl- protein. tau has no effect on tested responses. conclusion. we report that tau-cl inhibits proliferation of ra fls by affecting expression of pcna, that is critical for cell cycle progression. e. kontny , k. szczepań ska , j. kowalczewski , m. kurowska , i. janicka , j. marcinkiewicz , and w. maśliń ski institute of rheumatology, warsaw, and jagiellonian university, cracow, poland objective. proinflammatory cytokines play critical role in the pathogenesis of rheumatoid arthritis (ra). we reported recently that taurine chloramine (tau-cl), but not taurine (tau), inhibits production of il- and il- by fibroblast-like synoviocytes (fls). in present study the mechanism of tau-cl inhibitory action was investigated. methods. fls isolated from synovial membrane of ra patients were stimulated with rhil- . tau or tau-cl were added at - µm concentration. after . - h or h the dna binding activity of nfkb and ap- (emsa) and the expression of il- and il- mrnas (rt-pcr) was examined, respectively. results. il- raised nfkb and ap- activity, followed by the elevation of cytokine mrnas expression. tau-cl, but not tau, reduced both the expression of cytokine mrnas (il- Ͼ il- ) and the activity of transcription factors (nfkb Ͼ ap- ). conclusion. tau-cl inhibits transcription of il- and il- genes due to its ability to diminish the activity of key transcriptional factors, that regulate these proinflammatory cytokine expression. institut für organische chemie, universität bremen, germany the synthesis of taurine and hypotaurine from cysteine can be followed up in astroglia-rich primary cultures obtained from brain of neonatal wistar rats. using h and c nuclear magnetic resonance spectroscopy cell extracts of glia cells incubated with c labelled cystein show the label subsequently in hypotaurine, taurine, lactate and gluthathione. within h, % of the total intracellular hypotaurine and . % of taurine were newly synthesized from cysteine. both metabolites were also released to the medium. neurones are capable to take up both metabolites from glia media to recruit their organic osmolite. part of newly synthesized glutathione and lactate are also exported to the medium. by this means lactate may serve as an energy substrate for neurons. in-vivo mrs of lactate is obscured by line splitting and signal overlay. using various two dimensional pulse sequences as spin preparation sequences prior to localized single voxel, in-vivo mrs or spectroscopic imaging sequences will provide homonuclear non-coupled resonance signal of taurine. these singlet signals are detectable and quantified. diffusion weighted spectroscopy is used to characterize the mobility of taurine in living tissue. department of pharmacology and ophthalmology and visual sciences, texas tech university health sciences center, lubbock, texas, u.s.a. taurine depletion, whether by removing taurine from the diet or by using a taurine transport inhibitor, has demonstrated various pathologies in various animal models including man. the first reported pathology associated with dietary taurine depletion was in the retina of the cat. in this animal model, taurine deficiency resulted in disorganization of the tapetum (the light reflecting membrane), disruption of the outer segments, photoreceptor dysfunction, and cell loss. when allowed to proceed for a number of months the result was blindness. subsequent studies demonstrated that taurine deficiency also had a profound effect on cardiac physiology. echocardiograms of the left ventricle of the cat heart depleted in taurine showed a dilated cardiomyopathy reflected in an extended end-diastolic diameter and an extended end-systolic diameter. dietary taurine supplementation resulted in the above parameters returning to normal. the cat is a difficult animal model to use for a variety of reasons and thus the rat was chosen to further probe the consequences of taurine depletion. unfortunately, the tissue taurine levels in the rat do not respond to dietary taurine depletion, and thus other experimental means had to be designed. guanidinoethanesulfonic acid (ges), an analogue of taurine and a taurine transport inhibitor, has been utilized for the last years to deplete rat tissues of their taurine content (j. e. shaffer and j. j. kocsis, methods of reducing tissue taurine levels, and r. j. huxtable, h. e. laird, and s. lippincott, rapid depletion of tissue taurine content by guanidinoethyl sulfonate. in: the effects of taurine on excitable tissues, spectrum publications, new york, ) . ges, when administered to rats in their diet in the drinking water as a - . % solution, usually produces a significant decrease in the taurine content of all tissues within one week of treatment. within - weeks the levels of taurine reach their nadir ( - % of control) and continued feeding of ges does not further reduce the levels of taurine. unfortunately, ges replaces taurine and thus one must always consider the effects of ges on physiological events that occur within the tissues in question. again, as in the cat, taurine depletion manifested itself in retinal pathology: disruption of the photoreceptor structure, dissociation of the disc membranes, and abnormal electroretinograms (erg). other animals models such as the monkey have also demonstrated structural disorganization of the photoreceptors and abnormal ergs. finally, the ultimate test is whether taurine deficiency has an effect in man. in , koppel and associates (geggel et al., n. eng. j. med. : - , ) demonstrated that children on long term parenteral nutrition devoid of taurine had abnormal erg. supplementation of the parenteral nutrition with taurine restored the ergs to normal in the majority of the children. because of these definitive studies, all infant formulae in the united states, europe and japan now contain taurine. (supported in part by a grant from the taisho pharmaceutical co., ltd., tokyo, japan.) department of pharmacology and ophthalmology and visual sciences, texas tech university health sciences center, lubbock, texas, u.s.a. it has been demonstrated previously in our laboratory that taurine inhibits the phosphorylation of an ϳ kdalton protein present in the mitochondrial fraction of the rat heart (j. mol. cell. cardiol. : - , . upon administering . % guanidinoethanesulfonic acid (ges) in the drinking water of rats for weeks, the taurine levels in cardiac tissue decline by %. however, the phosphorylation of a ϳ kdalton protein in the mitochondrial fraction of the heart tissue increased by % (j. mol. cell. cardiol. : - , ) . reversal of these effects could be accomplished by feeding the rat . % taurine in the drinking water for weeks. the ϳ kdalton protein was isolated by -dimensional polyacrylamide gel electrophoresis (page) using traditional glycine buffers followed by re-electrophoresing the cut out portion of the gel, which corresponds to the ϳ kdalton protein, on a tricine-buffered gel resulting in sufficient pure protein for digestion and sequence analysis. it was determined that the ϳ kdalton was pyruvate dehydrogenase (amino acids : - , ) which indicates a significant regulatory role for taurine in energy metabolism in cardiac tissue. these data are of significant interest in that taurine may be an additional effector of this enzyme or of the enzyme complex. studies are in progress to determine if taurine has a direct effect on either the kinase (inhibition) or the phosphatase (stimulation) associated with the pyruvate dehydrogenase complex. it has also been demonstrated and now reported that taurine depletion utilizing ges in vivo in rats affects the phosphorylation of myelin basic protein (mbp). in these experiments the animals were given ges ( %) for weeks in their water and then killed; the hearts were removed and homogenized. the homogenate was then incubated with buffer containing mbp ( ì) and radioactive atp for minutes. animals were also treated with taurine ( %) in their drinking water for - weeks or treated with taurine following the ges treatment. page of the incubation mixture, autoradiography on the dried gel, and densitometry of the mbp band gave the following results: relative % activity Ϯ sem (normalized to mg protein) control Ϯ (n ϭ ) ges-treated Ϯ (n ϭ ) p Ͻ . (paired -test) control Ϯ (n ϭ ) taurine-treated Ϯ (n ϭ ) p Ͼ . control Ϯ (n ϭ ) ges followed by taurine Ϯ (n ϭ ) p Ͼ . these data confirm previous reports that it is easier to deplete animals of their cardiac taurine content than it is to raise the levels of taurine. these data on the effects of taurine depletion (increase in mapkinase activity) and taurine supplementation (no change in mapkinase activity) on mapkinase activity reflect these past observations. (supported in part by a grant from the taisho pharmaceutical co., ltd., tokyo, japan.) act additively, or in the case of mpo negated each other's effects. regarding our results there is significance to pharmacological regimens which enhance the supply of propofol or taurine in whole blood. these regimens influence considerably pmn intracellular amino acid concentrations and it is this pmn "labile free amino acid pool" which may be one of the determinants in cell nutrition positively or adversely affecting pmn immune functions. taurine supplementation to pmn seems to interfere independently from the effects of propofol on pmn free amino acids and on immune functions tested. institut für hygiene, universität innsbruck, austria n-chlorotaurine (nct) is a long-lived oxidant produced by activated human leukocytes during the oxidative burst. it has activity against a broad spectrum of pathogens including bacteria, fungi, viruses and helminths. as a special feature, the killing of microbes by nct can be increased significantly in the presence of ammonium and also of some amino acids (alanine, glycine). this is explained by transfer of the active chlorine ("transhalogenation") from nct to ammonium and amino acids to form the corresponding, stronger microbicidal n-chloro derivatives monochloramine and n-chloro amino acids, respectively. especially addition of ammonium to nct provokes rapid inactivation of fungi and even mycobacteria. because of its good tolerability, nct solution can be applied to human tissue to treat infections. in ammoniumcontaining body fluids like nasal mucus and urine, fungi and bacteria are killed within minutes. therefore, amino compounds of human secretions can be transformed to the above quoted endogenous and highly microbicidal chloramines by nct via transhalogenation -a unique property of an antimicrobial agent. successful treatment in cases of urinary tract and otorhinolaryngological infections and conjunctivitis in phase iia clinical trials provides strong support for this concept. the endogenous sulfonated amino acid taurine has numerous functions in the central nervous system, including positive modulation of gaba a receptor function. recently we found that mice lacking protein kinase c -epsilon (pkcε) are behaviorally and biochemically supersensitive to ethanol and other positive allosteric modulators of the gaba a receptor. in addition, these mice consume - % less ethanol and wildtype controls in two separate self-administration paradigms. microdialysis studies in pkcε-deficient mice revealed elevated extracellular levels of taurine, which may account for the supersensitivity of gaba a receptors in these mice and resulting decreases in ethanol intake. in light of the fact that the taurine derivative acamprosate (calcium acetylhornotaurinate) is moderately effective in reducing craving and relapse in detoxified alcoholics, we examined the effect of taurine-related compounds on acute ethanol consumption in a two-bottle choice paradigm in rats. taurine ( - mg/kg ip) was only slightly effective in reducing ethanol intake but not preference, while the highest dose of taurine ( mg/kg) also suppressed water intake. the taurine precursor hypotaurine ( - mg/kg ip) was also weakly effective in reducing ethanol intake but not preference or water intake. the most effective compound tested was homotaurine ( - mg/kg ip), which suppressed ethanol intake and preference by approximately % without altering water intake. these data indicate that endogenous taurine may regulate sensitivity to ethanol and subsequent ethanol self-administration, and that taurine-related compounds may be effective in reducing alcohol intake in humans. we are currently exploring whether taurine and related compounds are able to suppress ethanol-stimulated mesolimbic dopamine release, a primary neural substrate of ethanol reinforcement. (this work was supported by funds provided by the state of california for medical research on alcohol and substance abuse through the university of california at san francisco.) organic osmolytes, such as taurine, regulate a cell's osmotic balance without directly altering either the cell's ionic composition or the membrane potential. this property of the organic osmolyte often renders the cell resistant to damage during a pathological insult. indeed, ischemia is associated with a massive efflux of taurine from the cell, an event that minimizes the severity of the osmotic imbalance that develops from the accumulation of lactate, inorganic phosphate and sodium. however, taurine depletion also activates specific signaling pathways that provide further protection to the cell. among the signaling pathways activated by taurine depletion is a pi -kinase (phosphatidylinositol -kinase) linked pathway that catalyzes the phosphorylation and inactivation of the pro-apoptotic factor, bad. taurine depletion also activates protein kinase c, which in turn elevates the intracellular content of the antiapoptotic factor, bcl- . increases in the extracellular osmolality by either addition of mm taurine or mm mannitol to the incubation medium activates similar pathways. however, pi kinase assumes a more important role in the mannitol treated cell than the taurine depleted cell. moreover, p map kinase is activated by mannitol treatment but not by taurine depletion. despite these differences, both taurine depletion and mannitol treatment protect the cell against hypoxia-induced apoptosis. the data suggest that osmotic stress protects the cell against apoptosis by increasing cellular levels of bcl- and promoting the inactivation of bad. this work was supported by a grant from the american heart association. a dummy or a protagonist on the stage of inflammation? r&d department, zambon group, bresso, milan, italy amino acids are usually present in large excess in healthy and the excess is used as source of calories. however, metabolic alterations are observed in ill patients and preferential retention of sulphur amino acids (saa) occurs during the inflammatory response. the metabolism of cysteine is modified during the acute phase of sepsis in rats. sulphate production is lower, whereas the higher liver production of taurine seems to play a protective role; glutathione concentration is greater in liver, kidney and other organs and cysteine incorporation into proteins was higher in spleen, lung and plasma (acute phase proteins) while albumin level decreases. another important phenomenon is the impairment of methionine conversion to cysteine during stressed condition. premature infants or hiv patients synthesise cysteine from methionine at a much lower rate. thus, the metabolic flow through the trans-sulphuration pathway may be insufficient to meet the glutathione and cysteine requirement in critical conditions. the pro-inflammatory cytokines, interleukin- , interleukin- and tnf-α are the main initiates that alter protein and amino acid metabolism. in this complex picture, saa supply may contribute to the immune system regulation. department of applied biological chemistry, the university of tokyo, japan the intracellular level of taurine is maintained not only by the taurine transporter that transports extracellular taurine inside cells but also by endogenous synthesis from methionine and cysteine. we therefore investigated the regulation of both the taurine transporter and the cysteine dioxygenase, one of the main taurine biosynthetic enzymes, in hepg human liver cells. the intracellular taurine content of hepg cells was extremely increased by culturing in a hypertonic medium. the activity of taurine transport was increased by hypertonic conditions, which was due to the increased expression of the taurine transporter gene. the expression level of the cysteine dioxygenase gene was also increased, suggesting that the expression levels of both the taurine transporter gene and the cysteine dioxygenase gene were regulated in harmony by hypertonic conditions to accumulate taurine inside cells. on the other hand, the activity of taurine transport in hepg cells was down-regulated on culturing the cells in taurine-rich medium, the expression level of the taurine transporter gene being also markedly decreased. however, the expression level of the cysteine dioxygenase gene was not significantly altered under taurine-rich conditions, indicating that the gene expression of the taurine transporter and that of the cysteine dioxygenase was independently regulated by extracellular concentration of taurine. the amino acid, taurine, is found in very high concentration in the heart. although its most important putative function is osmoregulation, it also serves as a regulator of cell growth. isolated cardiomyocytes exposed to medium containing nm angiotensin ii undergo hypertrophy, a process blocked by mm taurine. the amino acid also inhibits angiotensin iiinduced activation of c-fos, upregulation of atrial natriuretic factor and induction of tgf-betal. central to virtually all of these actions of angiotensin ii is the translocation and activation of key protein kinase c (pkc) isoforms. therefore, we proposed that taurine inhibited the hypertrophic actions of angiotensin ii by interfering with the translocation of one or more of the pkc isoforms. indeed, taurine and angiotensin ii exhibited different effects on the translocation of several pkc isoforms. while taurine promoted the translocation of pkcalpha, pkcdelta and pkcepsilon from the particulate fraction to the cytosol, the levels of the three isoforms in the particulate fraction were elevated following treatment with angiotensin ii. by contrast, both taurine and angiotensin ii increased the pkczeta content of the particulate fraction and the pkcbeta content of the cytosol. when the isolated cardiomyocytes were incubated with medium containing both angiotensin ii and taurine, the effects on pkc distribution were largely additive. these data support the notion that taurine prevents the hypertrophic effects of angiotensin ii by interfering with the translocation of either pkcalpha, pkcdelta, pkcepsilon or a combination of more than one of the isoforms. (the study was supported by a grant from taisho pharmaceutical co.) main final metabolites of l-cysteine in mammals are sulfate and taurine, and they are excreted in the urine. our previous studies in rats have shown that the ratio of urinary sulfate and taurine in rats fed diet containing sufficient methionine and cysteine is : - . in the present study, we determined urinary sulfate and taurine in urine samples of healthy japanese women after h starvation following usual meal. free (inorganic) and total (free ϩ ester) sulfate were determined with ion chromatography, and taurine by reversed-phase hplc after dabsylation. average excretions (micromols per mg of creatinine) were: total sulfate, . Ϯ . ; free sulfate, . Ϯ . ; ester sulfate, . Ϯ . ; taurine, . Ϯ . ; urea, . Ϯ . . the ratio of total sulfate and taurine was : . . this suggests that sulfate formation in humans is more dominant than taurine formation as in rats and this tendency is more evident in humans than in rats, which is in accordance with low cysteinesulfinate decarboxylase activity in humans. sum of sulfate and taurine excretions was significantly correlated with that of urea: correlation coefficient, . . this indicates that sulfur metabolism in humans is in the state of sulfur equilibrium similar to that of nitrogen and reflects protein metabolism. h. yokogoshi and h. oda laboratory of nutritional biochemistry, school of food and nutritional sciences, the university of shizuoka, and department of applied biological sciences, nagoya university, nagoya, japan the effect of taurine on hypercholesterolemia induced by feeding a high-cholesterol (hc) diet to rats was examined. when various amounts of taurine ( . - g/kg) were supplemented to hc for wk, serum total cholesterol gradually and significantly decreased in a dose-dependent manner, compared with the control (cholesterol free) diet group. by contrast, serum hdl-cholesterol was elevated by taurine supplementation. in the hypercholesterolemic rats fed the hc diet, the excretion of fecal bile acids and hepatic cholesterol α-hydroxylase (cyp a ) activity and its mrna level increased significantly, and the supplementation of taurine further enhances these indexes, indicating an increase in cholesterol degradation. agarose gel electrophoresis revealed that, in hypercholesterolemic rats fed the hc diet, the serum level of the heavier vldl increased significantly, but taurine repressed this increase and normalized this pattern. significant correlations were observed between the time-and dose-dependent increases of cyp a gene expression and the decrease of blood cholesterol concentration in rats fed the hc diet supplemented with taurine. these results suggest that the hypocholesterolemic effects of taurine observed in the hypercholesterolemic rats fed the hc diet were mainly due to the enhancement of cholesterol degradation and the excretion of bile acid. in vitro studies have shown that ammonia, which is responsible for neurological symptoms associated with hyperamonemia, causes a massive release of taurine from cultured cns cells and brain slices. in this study, taurine (tau) release was measured in vivo in rat striatum following direct application to the microdialysis tube of mm ammonium chloride which renders the final ammonia concentration in the extracellular space of ϳ mm. various in vivo stimuli evoke taurine efflux either by opening osmosensitive anion channels and/or by a mechanism secondary to glu accumulation and its interaction with nmda or ampa receptors. the following compounds were coadministered with ammonia to distinguish between these mechanisms: anion/cation transport inhibitors -dids and furosemide, a glu transport inhibitor-pdc, and nmda and ampa/ka receptor antagonists dizocilpine and dnqx. ammonia stimulated tau accumulation in the microdialysates to ϳ % of basal value. dids and furosemide moderately inhibited the effect of ammonia (furosemide by ϳ %), albeit dids added alone induced massive accumulation of tau with a delayed onset as compared to ammonia. ammonia-dependent tau accumulation was increased by ϳ % in the presence of pdc and reduced in an equal degree (ϳ %) by dizocilpine and dnqx. none of the agents affected tau accumulation in the absence of ammonia. the results show that ammonia in vivo evokes tau accumulation both via anion channels, possibly secondary to cell volume changes, and in consequence of stimulation of both nmda and ampa/ka receptors. (supp. by a scsr grant no p a and cimo, the acad. of finland) biosciences department, university of hertfordshire, hatfield herts, u.k. the discovery in that endothelium-derived relaxing factor is nitric oxide (no) was followed a year later with reports that the cationic amino acid l-arginine is the physiological precursor for nitric oxide. it has since been established that the terminal guanidinium nitrogen of l-arginine is metabolised via a series of oxidation reactions resulting in no production, with citrulline being formed as a co-product. of interest was the parallel observation that uptake of l-arginine was enhanced in inos expressing cells and that this was due to de novo synthesis of carrier proteins. the precise signaling pathway that regulates the enhanced expression of these carriers has been the subject of intense studies in recent years. current literature suggests that activation of upstream signaling molecules such as protein kinase c may be critical. in addition, downstream kinases thought to be points of convergence for various signals originating from cell surface receptors have also been implicated. two of these downstream targets include the and kda forms of mitogen-activated protein kinase (mapk) and the stressactivated kda mapk. it is worth noting however that the involvement of these different transduction pathways in the regulation of the induction of l-arginine transporters is not universal, and likely to be different from system to system. as a result there has been conflicting data on the relevance of these signaling proteins in inducing l-arginine transport in different cell. these issues will be discussed and the individual signaling pathways assessed on a cell type and species basis. moreover, the role of downstream signaling molecules will be examined in more detail, looking in particular at the critical dependency on the p mapk. this kinase currently exists in four different isoforms which are p α, , γ and δ. the involvement of individual isoforms of p in enhancing the expression of carrier proteins for l-arginine will be discussed. gw is an acetamidine derivative of heterosubstituted lysine which has been shown to have a marked selectivity for the human inducible nitric oxide synthase isoform (young et al. . bioorg. med. chem. lett., : , - ) . the systems associated with transport of this compound have been investigated using the macrophage cell line j . prior to each study, j cells were seeded in -well culture plates and allowed to adhere for h in dulbecco's modified eagle's medium (dmem). transport studies were carried out using hepes buffered krebs solution ( µl; °c) containing l-[ c]gw ( µciml Ϫ ) in the presence of either . mm or . - mm unlabelled substrate. in parallel studies transport ( µciml Ϫ , . mm) was monitored in the presence of mm excess of various other amino acids known to be substrates for distinct transport systems. time course experiments revealed that transport of . mm of l-[ c]gw occurred in a time-dependent manner and was linear for up to min. in addition, uptake was only marginally dependent on extracellular na ϩ . kinetic studies revealed that transport was saturable, and michaelis-menten analysis revealed single affinity entry with an apparent k t of . mm and v max of . pmol·µg protein Ϫ min Ϫ . at mm, -methylaminoisobutyric acid (meaib), lalanine, l-valine and - -amino-bicyclo-( , , )-heptane- carboxylic acid (bch) caused little or no inhibition of l-[ c]gw ( . mm) uptake. in contrast, transport of l-[ c]gw was inhibited markedly by l-arginine, llysine, l-leucine, l-methionine, -diazo- -oxo-l-norleucine (don) and l-glutamine. with the exception of l-arginine and l-lysine, the inhibition caused by the other substrates was critically dependent on extracellular na ϩ and was completely reversed when extracellular na ϩ was replaced with choline. in parallel kinetic inhibition experiments, transport of . mm l-[ c]gw was inhibited in a concentration dependent manner by l-arginine (ki ϭ . mm), l-leucine (ki ϭ . ), don (ki ϭ . mm) and l-glutamine (ki ϭ . mm). taken together, these data suggest that gw may be transported, at least in part, by system y ϩ . however, the marked inhibition caused by l-leucine, l-glutamine and l-methionine, substrates for the relatively high affinity cationic amino acid transporter system y ϩ l, would suggest that this system may also contribute to the uptake of gw ; if so, the monophasic substrate kinetics imply that the two systems handle gw with similar affinity. other systems such as b ,ϩ could be ruled out on the grounds that this transporter is critically na ϩ -dependent while uptake of gw is largely (ϳ %) na ϩ -independent. similarly, b ,ϩ , another broadspectrum aminio acid transporter that may be capable of transporting gw does not interact with l-glutamine and thus unlikely to be involved in transport of gw , at least in j cells. although a large number of different amino acid transporters have been identified on a molecular basis, some of themfunctionally described in mammalian cells -are still missing. in search of mammalian est sequences, which contain the signature of the aaap (amino acid/auxin permease) family, we identified a murine full length cdna, which encodes a membrane protein with - putative transmembrane domains. the transporter mrna is expressed in various murine tissues, including lung, heart and kidney. for functional characterization we used the xenopus laevis oocyte expression system and employed flux studies and electrophysiological analysis. oocytes injected with the crna showed an increased uptake of h-l-alanine and h-l-proline. detailed electrophysiological analysis revealed an electrogenic transport mode, independent of sodium and chloride ions. lowering the extracellular ph increased significantly substrate induced currents in crna injected oocytes. out of the proteinogenic amino acids the transporter recognizes only small amino acids, such as gly, ala, pro and ser. distinct structural analogues of these amino acids also interact with the transporters substrate binding site. in conclusion, we describe the molecular and functional characteristics of the first electrogenic proton driven amino acid transporter of mammals. pharmacology department, dr. willmar schwabe gmbh, karlsruhe, germany it is now well established that transport of amino acid neurotransmitters (like glutamate, aspartate, gaba and glycine etc.) from and to the neurones is essential for their proper functioning. like in the case of other neurotransmitters, specific pre-and post-synaptic as well as vesicular transporters are involved in such processes. extensive efforts to clarify the mechanisms and processes involved in the control and/or proper functioning of the amino acid transporters are now, therefore, being made in numerous laboratories. such efforts have not only led to the identification of a few specific ligands and/or modulators of neuronal amino acid transporters, but also have started unravelling the complex and diverse processes regulating their functions. aim of this communication is to point out potential usefulness of some neuroactive constituents isolated from therapeutically used medicinal herbs for clarifying the mechanisms involved in neuronal amino acid transport. our interest in such studies was initially triggered by the observations made with hyperforin, i.e. quantitatively the major neuroactive component of hypericum perforatum extracts widely used for the treatment of mild to moderate depressive disorders. this acyl phloglucinol derivative not only modulate synaptic transports of biogenic amines but also of glutamate, aspartate and gaba. since it does not interact with any of the till now described transporters for these neurotransmitters, efforts were made to clarify the mechanisms involved in their observed effects (both in vitro and as well as in vivo). the results of the in vitro studies available to date strongly suggest that its effects on neuronal amino acid transport processes is mediated via some novel extracellular mechanism controlling the h ϩ (and/or other ionic) concentrations of neurones. these observations not only demonstrate that hyperforin represent a structurally and mechanistically novel class of therapeutically useful agent but also suggest that it could be useful tool for clarifying the complex mechanisms involved in the control of neuronal amino acid transport. these observations stimulated us to screen other putative psychoactive herbal extracts and their active constituents on neuronal amino acid transport and on the consequences of disturbances caused by malfunction of specific transporters. observations made with several such agents indicate that either modulation of mechanisms and/or processes involved in neuronal amino acid transport or reversal of pathologies caused by anomaly of transporter functions could be involved in their modes of actions. these observations reinforce our conviction that studies directed towards clarifying the effects of herbal constituents on neuronal amino acid transport might not only be a feasible way for identifying novel types of therapeutically interesting molecules but also could expedite our knowledge on these complex processes. glutamate-regulated sodium dynamics in cortical astrocytes: implications for cellular bioenergetics j.-y. chatton, p. marquet, and p. j. magistretti the mode of na ϩ entry and the dynamics of intracellular na ϩ concentration (na ϩ i ) changes consecutive to the application of the neurotransmitter glutamate were investigated in mouse cortical astrocytes in primary culture by video fluorescence microscopy. an elevation of na ϩ i was evoked by glutamate, whose amplitude and initial rate were concentration-dependent. the glutamate-evoked na ϩ increase was primarily due to na ϩ -glutamate cotransport. the rate of na ϩ influx decreased during glutamate application, with kinetics that correlate well with the increase in na ϩ i and which depend on the extracellular concentration of glutamate. a tight coupling between na ϩ entry and na ϩ /k ϩ atpase activity was revealed by the massive na ϩ i increase evoked by glutamate when pump activity was inhibited by ouabain. during prolonged glutamate application, na ϩ i remains elevated at a new steady-state where na ϩ influx through the transporter matches na ϩ extrusion through the na ϩ /k ϩ atpase. a mathematical model of the dynamics of na ϩ i homeostasis will be presented which precisely defines the critical role of na ϩ influx kinetics on the establishment of the elevated steady-state and its consequences on the cellular bioenergetics. indeed, extracellular glutamate concentrations as low as µm approximately doubled the energetic demands of the astrocytes. department of biochemistry and molecular biology, faculty of biology, university of barcelona, spain in the last years a new family of amino acid transporters composed by two different subunits has been defined. two heavy subunits (rbat and f hc) and seven light subunits are known. rbat and the light subunits b ,ϩ at and y ϩ lat are responsible for the inherited aminoacidurias type i cystinuria, non-type i cystinuria and lysinuric protein intolerance, respectively. the heavy subunits are highly glycosylated type ii proteins, while light subunits are very hydrophobic unglycosylated membrane proteins, displaying a polytopic (generally transmembrane domains) predicted structure. the specificity of the amino acid transport activity depends on the light chain expressed. this, together with its topology, indicates that the transport function mainly relies on the light subunits. i will summarize some of our current studies directed to the understanding of structure-function relationships of these heteromeric carriers, specially concerning their oligomeric structure and initial attempts to reconstitute them. ongoing work on the isolation of new rbat-associated light subunits and new b ,ϩ at-associated heavy subunits, which could also play a role in cystinuria, will also be discussed. department of pharmacology, joh. gutenberg university, mainz, germany mammalian cationic amino acid transporters (cats) catalyze the transport of basic amino acids through the plasma membrane. the cat family comprises at least five related carrier proteins (cat- , - a, - b, - and - ) with cat- a and - b being splice variants. in humans, only the "old" members of the family have been characterized (hcat- , - a and - b). hcat- and - b exhibit high affinity for cationic amino acids and are sensitive to trans-stimulation, consistent with the classical system y ϩ . in contrast, hcat- a is a low affinity carrier relatively insensitive to trans-stimulation. interestingly, hcat- a and hcat- b differ only in a region of amino acids. cat- , so far only identified in rat and mouse, exhibits also system y ϩ activity. however, the substrate recognition and maximal transport activity seems to differ from other y ϩ transporters. cat- expression has been reported to be restricted to the brain in adult animals. a cdna encoding for human hcat- has recently been isolated, however, the transport activity of hcat- has not been characterized. when optimally aligned, the amino acid sequence of hcat- shows only about % identity with the other hcat isoforms. in contrast, the amino acid sequences of hcat- , - (a or b) and - are about % identical. to elucidate which amino acids are responsible for the difference in the transport properties of the hcat proteins, we constructed chimeric proteins between hcat- and hcat- a and performed site directed mutagenesis. using this approach, we identified two amino acid residues that are responsible for the different transport properties of hcat- a compared to the high affinity cat-isoforms. to characterize the human cat- , we cloned a cdna encoding hcat- . when expressed in xenopus laevis oocytes, hcat- had a similar transport activity and affinity for l-arginine as hcat- or - b. hcat- mediated l-arginine transport was trans-stimulated and independent of extracellular na ϩ ions. expression studies demonstrated that hcat- is not only expressed in different regions of the human brain, but also in peripheral tissues. to investigate if hcat- also functions as an amino acid transporter, we measured the transport of cationic, neutral and acidic amino acids in xenopus laevis oocytes expressing hcat- , but could not detect an transport activity for any substrate tested. a bright fluorescence could be detected in the plasma membrane of oocytes expressing hcat- with the green fluorescent protein attached to the c-terminus. therefore, hcat- might either need a complementary protein to function as an amino acid transporter or serve as a transporter for a yet unidentified substrate. renal amino acid reabsorption in immature and adult rats as a sensitive marker of heavy metal-induced nephrotoxicity (pt, cr, tl) institut für pharmakologie und toxikologie, klinikum der friedrich-schiller-universität jena, germany the effects of cis-platinum (cp; . mg/ g b. wt. i. p.), sodium dichromate (cr; mg/ g b. wt. s. c.) and tl so (tl, mg/ g b. wt. i. p.) on renal amino acid excretion and plasma amino acid composition were investigated in -(both sexes) and -day-old (female) anaesthetised wistar rats (han : wist). on the basis of diuresis experiments on conscious rats (determination of urinary volume and protein excretion) the mentioned doses and times ( st day after cr in both age groups and in -day-old rats after cp and rd day after cp in adult rats; nd [ -day-old rats] and th - th day [ -day-old rats] after tl) were found out to be optimal for the characterisation of amino acid transport after heavy metal poisoning. interestingly, in conscious -day-old rats cr nephrotoxicity is not detectable after mg/ g b. wt. whereas all of the other experimental groups showed nephrotoxic effects of cr, tl and cp in conscious rats. urine volumes were lower, but not significantly, in anaesthetised immature rats, independently of the administered nephrotoxin. glomerular filtration rate (gfr) is significantly lower in -day-old rats compared to adults. after cp, cr and tl gfr is significantly reduced only in adult rats and age differences disappeared nearly completely. in principle the renal fractional excretion (fe aa ) of amino acids was distinctly higher in immature rats as a sign of lower amino acid reabsorption capacity. nevertheless, the amino acid plasma concentrations were relatively high in immature control rats. however, both cr and cp did not distinctly influence molecular cloning and functional characterization of ata , a novel subtype of the amino acid transport system a medical college of georgia, augusta, georgia, u.s.a. recent molecular cloning studies have revealed that the amino acid transport system a consists of more than one subtype. two different system a subtypes, called ata and ata , have been cloned and functionally characterized. ata is expressed primarily in the brain and placenta whereas ata is expressed ubiquitously. heterologous expression studies have shown that these two subtypes cannot be distinguished functionally based on substrate affinity nor substrate specificity. we have now cloned a third subtype of system a, designated ata . it is expressed primarily in the liver. apart from the liver, detectable level of expression is noted only in the skeletal muscle. interestingly, ata can be easily differentiated from the other two subtypes of system a based on functional characteristics. we first isolated rat ata cdna from a skeletal muscle cdna library using rat ata cdna as the probe. rat ata consists of amino acids and exhibits a high degree of homology in amino acid sequence to rat ata ( % identity) and rat ata ( % identity). interestingly, this new transporter also has a comparable degree of homology to sn and sn , the two known subtypes of the amino acid transport system n. however, when expressed heterologously in xenopus laevis oocytes, rat ata transports α-(methylamino)isobutyric acid (meaib), a specific model substrate for system a, confirming that this transporter is definitely a subtype of system a. system n does not transport this system a model substrate. with two-microelectrode voltage-clamp technique, we have shown that exposure of rat ata -expressing oocytes to neutral, short-chain aliphatic amino acids induces inward currents. the amino acid-induced current is na ϩ -dependent and phdependent. analysis of the currents with alanine as the substrate has shown that k . for alanine (i.e., concentration of the amino acid yielding half-maximal current) is . Ϯ . mm and that the na ϩ : alanine stoichiometry is : . subsequently, we have cloned the human homolog of rat ata from a liver cell line (hepg ) cdna library. human ata also contains amino acids and shows % identity in amino acid sequence with rat ata . the sequence identity of human ata with human ata and human ata is % and %, respectively. the homology of human ata with human sn and sn is also similar ( % and % identity, respectively). the gene coding for human ata contains exons and is located on chromosome p . in the human, ata is expressed almost exclusively in the liver. when expressed in mammalian cells heterologously, human ata mediates the transport of neutral amino acids, including meaib, in a na ϩ -dependent manner. interestingly, while characterizing the function of this clone, we have uncovered a unique feature of this system a subtype. human ata is capable of mediating the transport of cationic amino acids. in fact, the affinity of human ata for cationic amino acids is higher than for neutral amino acids. however, the human ata -mediated cationic amino acid transport is na ϩ -independent. in this respect, ata is similar to transport system y ϩ l that also transports neutral amino acids in a na ϩ -coupled manner and cationic amino acids in a na ϩindependent manner. in contrast, ata and ata have not been shown to interact with cationic amino acids. in addition to this difference in substrate specificity, ata also differs from ata and ata in substrate affinity. ata and ata interact with meaib with a k t of ϳ . mm whereas the affinity of ata for this model substrate is comparatively at least -fold lower (k t , ϳ mm). but, ata interacts with arginine with a k t value of . mm. since liver does not express any of the previously known high affinity cationic amino acid transporters, amino acid plasma concentrations. but in both age groups the administration of cr and cp significantly decreased amino acid reabsorption capacity (increase in fe aa ) as a sign of nephrotoxicity, most pronounced in adult rats after cp. on the other hand, after tl, the fe of amino acids was distinctly higher only in adult rats as a sign of lower amino acid reabsorption capacity and, thus, as a sign of higher nephrotoxicity. in immature animals fe aa was increased only for few amino acids. however, in both age groups tl administration significantly decreased plasma amino acid concentrations, more pronounced in immature rats. the investigation of renal amino acid handling confirmed: ( ) cr, cp and tl were more nephrotoxic in -day-old animals compared to immature rats as could be demonstrated previously using other parameters for nephrotoxicity testing. ( ) the extent of toxic effects of heavy metals on the kidney is related to the maturity of renal functions involved in the enrichment of the respective metal in renal tissue and in its toxicity mechanism. ( ) changes in the fractional excretion of amino acids (reduction in renal amino acid reabsorption capacity, e.g. increase in fe aa ) and in amino acid plasma concentrations (especially decreases as a consequence of enhanced renal loss of amino acids) are early indicators of nephrotoxicity. ( ) therefore, the determination of renal amino acid handling is a highly sensitive marker for nephrotoxicity testing, both in immature and in adult rats. the mammalian h ϩ /peptide cotransporter pept was initially identified in the brush border membrane of renal proximal tubular cells as a high affinity type ptr -family member. here we describe the synthesis and functional analysis of novel high affinity inhibitors for pept that will be useful in further studies on structure and functions. starting from lys[z(no )]-pro a series of different lysine-containing dipeptide derivates were synthesized and studied for interaction with pept based on transport competition assays in pichia pastoris yeast cells and in epithelial skpt cells, both expressing pept . the twoelectrode-voltage-clamp technique in x. iaevis oocytes expressing pept was used to determine whether the compounds are transported electrogenically or block the uptake of dipeptides. synthesis and functional analysis of lys-lys derivates containing z(no ) side chain protections provided a set of inhibitors that reversibly inhibited the uptake of dipeptides by pept with k i values as low as nm. this is the highest affinity of a ligand of pept ever reported. moreover, based on the structure-function relationship we can conclude that the spatial location of the ε-amino protecting group in a lys containing dipeptide and its intramolecular distance from the alpha catom are key factors for the transformation of a substrate into an inhibitor of pept . ata is likely to provide the major route for the uptake of arginine in this tissue. institute of pharmacology and therapeutics, faculty of medicine, porto, portugal the present study examined the nature and regulation of the l-dopa transporter in two functionally different clonal subpopulations of opossum kidney (ok lc and ok hc ) cells. the inward transfer of l-dopa was largely promoted through an energy-dependent and sodium-insensitive transporter, though a minor component (ϳ %) was found to require extraceilular sodium. l-dopa uptake was insensitive to meaib, but competitively inhibited by bhc (ok lc , ic ϭ µm; ok hc , ic ϭ µm). l-and d-neutral amino acids and basic amino acids markedly inhibited l-dopa accumulation. l-dopa, lleucine, l-arginine, bhc or l-arginine plus bhc stimulated [ c]-l-dopa efflux. the accumulation of l-dopa was significantly higher at an acidic ph, and incubation of cells with l-dopa ( µm) resulted in marked intracellular acidification. modulators of pka, pkg, pkc and ptk failed to affect the accumulation of l-dopa. only the ca ϩ / calmodulin inhibitors inhibited l-dopa uptake. it is likely that system b ,ϩ might be responsible for the sodium-dependent uptake of l-dopa in ok cells, whereas sodium-independent uptake of l-dopa may include systems b ,ϩ and lat , the activation of which results in trans-stimulation of l-dopa outward transfer. the trans-stimulation of l-dopa inward transfer by an imposed h ϩ gradient suggest that ok cells are provided with an l-dopa-h ϩ cotransport system. amino acids are essential nutrients for cell growth and maintenance. the essential amino acids arginine and lysine, are mainly transported via the cationic amino acid transporter protein (cat ). the regulation of translation of the cat mrna during amino acid starvation was studied. an adaptive response to amino acid starvation and stress is a global decrease of protein synthesis, by phosphorylation of the translation initiation factor eif a. translation of the transporter mrna increases when eif a is phosphorylated, allowing synthesis of the essential for survival arginine/lysine transporter protein. the mechanism of increased translation of this mrna involves the induction of activity of a uorf-containing internal ribosomal entry sequence (ires). translation of the uorf and phosphorylation of eif a are required for increased activity. we propose that eif a phosphorylation triggers translational attenuation within the uorf, converting a relatively inactive, to a high activity ires. this study demonstrates that like yeast, mammalian cells have developed a sophisticated response to stress conditions: when expression of most genes decreases, synthesis of stress response proteins increases to support cell survival. amino acid transport, cell volume and the regulation of cell death f. lang, s. fillon, i. setiawan, p. lang, v. tanneur, d. häussinger, and s. bröer department for physiology, university of tübingen, germany cell volume regulatory mechanisms participate in a wide variety of cellular functions including regulation of epithelial transport, excitability, hormone and transmitter release, metabolism, migration, cell proliferation and apoptotic cell death. besides ion transport, polyols, betaine and glycerophosphorylcholine, cells utilize amino acids including taurine to balance extracellular osmolarity and regulate their volume. cells counteract shrinkage by uptake and swelling by release of amino acids including taurine. moreover, cell swelling stimulates synthesis and cell shrinkage favours breakdown of proteins which are osmotically less active than the sum of the amino acids thus generated. conversely, amino acid transport does influence cell volume. concentrative uptake of amino acids leads to cell swelling, amino acid release to cell shrinkage. through alterations of cell volume the amino acids participate in the regulation of protein metabolism. thus, concentrative amino acid transport inhibits and release of amino acids favours proteolysis. these mechanisms participate in the regulation of cell death. cd induced apoptotic death of jurkat t lymphocytes is paralleled by the release of taurine. the taurine release occurs with a delay of some min following cd receptor triggering but immediately preceedes apoptic cell shrinkage and dna fragmentation. the signaling leading to taurine release is in large part elusive but requires at some stage activation of caspases. moreover, taurine release and apoptotic dna fragmentation are strongly inhibited by lowering of temperature. preloading of the cells with taurine retards cd induced dna fragmentation pointing to an active role of taurine in the regulation of apoptosis. peptide transporters of the ptr-family are integral plasma membrane proteins, that mediate the electrogenic protoncoupled transport of di-and tripeptides and peptide-like drugs across cell membranes. the physiological role of pept , one member of this family in mammals, is mainly the uptake of small peptides into intestinal and renal tubular epithelial cells. in caenorhabditis elegans a homologue to mammalian pept is encoded by the pep- gene, which is expressed in the intestinal cells and a subset of sensory neurons in the head of the animal. to study the physiological role of the pep- transporter in vivo, a c. elegans pep- mutant was constructed. the animals deficient in pep- show a remarkable phenotype with pronounced signs of malnutrition, characterised by a delayed development, less eggs in the uterus, a smaller brood size and a prolonged mean life-span compared to wild-type animals. we rescued the phenotype by the expression of the wt pep- gene in the mutant. the observed starved phenotype in pep- mutants might be best explained by the reduced intestinal absorption of peptide bound amino acids that are required for protein synthesis and energy metabolism and provides the first direct evidence for the predominant role of the intestinal peptide transporter in amino acid absorption. adenosine is a potent vasodilator in many vascular beds and modulated tone via elevation of intracellular camp and/or release of nitric oxide (no). we have previously reported that adenosine (ado) stimulates l-arginine transport and no production in human cultured umbilical vein endothelial cells (sobrevia et al., j. physiol. , - , ) , and here further characterise the signalling cascades. rt-pcr demonstrated that fetal endothelial cell possess mrna levels for a a , a b and a -adenosine receptor subtype, whereas negligible levels were detected for the a -receptor. adenosine ( µm, min) induced increases in l-arginine transport and no production were ca ϩ and camp independent and stimulated transport was abolished in cells depolarised with mm k ϩ . whole-cell patch clamp experiments revealed that adenosine activated inward k ϩ currents, resulting in a membrane hyperpolarization and enhanced influx of the cation substrate l-arginine. adenosine induced l-arginine transport and no production were also abolished by inhibitors of tyrosine kinases (genistein), mek / (pd , u ) but unaffected by inhibitors of pkc (calphosin c) and pi- kinase (ly ). these data suggest that adenosine induces membrane hyperpolarization by activating inward k ϩ currents, increasing the driving force for cationic amino acid influx via system y ϩ . the discovery of nocardicine a by aoki et al. and aztreonam showed that monocyclic -lactams, collectively known as monobactams, can have antibiotic activity. this activity is poor but compensated by the unique effect they can induce on certain microbial cell membranes. our quest for new non-conventional surfactants for various biomedical applications led us to synthesize bioactive compounds with structural similarities to nocardicins. we present here the preparation and the study of original trimodular biosurfactants of type i: spermine and amine oxidase induce a cytotoxic effect on multidrug resistant chinese hamster ovary cells e. agostinelli , s. lord-fontaine , e. przybytkowski , and d. a. averill-bates department of biochemical sciences "a. rossi fanelli", university of rome "la sapienza" and cnr, centre of molecular biology, rome, italy department de chimie/biochimie and toxen (centre de recherche en toxicologie de l'environnement), université du québec à montréal, canada the occurrence of resistance to cytotoxic agents in tumor cells is a major obstacle to successful anticancer chemotherapy. multidrug resistance (mdr) is associated with several phenotypic alterations. cells with the mdr phenotype display decreased drug accumulation due to overexpression of pglycoprotein (p-gp), encoded by the mdr- gene, which acts as an energy-dependent pump involved in extrusion of drugs. we studied a new strategy to eliminate mdr cells using an enzyme, bovine serum amine oxidase, capable of forming cytotoxic products, h o and aldehyde(s), from polyamines (spermine). the involvement of both toxic products, formed by the bsao/spermine enzymatic system, in causing cytotoxicity was investigated in multidrug resistant chinese hamster ovary cells, ch r c , at and °c. we observed that hyperthermia, depletion of intracellular glutathione (by l-buthionine sulfoximine) and inhibition of glutathione s-transferase (by ethacrynic acid), sensitized ch r c cells to the cytotoxic effect of spermine enzymatic oxidation products. mdr cells showed no resistance to h o and aldehyde(s) relative to their drug-sensitive counterparts, auxb cells, in experimental conditions of: higher temperature, higher spermine concentration and longer incubation time. the inhibition of cellular detoxification systems led to increased cytotoxic effects of spermine enzymatic oxidation products on both mdr and sensitive cell lines. these results might be of great interest and suggest that toxic oxidation products formed from spermine and amine oxidase could be used in anticancer therapy, mainly against multidrug resistant tumor cells. [acknowledgements: this work was supported by cnr "target project on biotechnology", ministero della sanità tar these compounds present a hydrophobic part introduced by an ester or amide linkage with an aminoacid, a junction modulus which corresponds to -lactam, and a hydrophilic part which contains a triazole, well-known in pharmaceutical industry for its inhibitor effect against -lactamase. the compounds are synthesised from -hydroxymethyl- methyl propionic acid in five steps. selective activation of one of the primary hydroxyl groups was accomplished by the formation of alkoxy tris(dimethylamino)phosphonium (atdp) salts from the corresponding diol. treatment of with excess potassium carbonate in refluxing anhydrous acetone yields the monobactams . activation by atdp salts followed by treat-ment with sodium azide and reflux in toluene gives the azido compound. the reaction with acetylenic derivatives allows to obtain the surfactants. the compounds show classical surfactant behavior and the evaluation of their biological properties give evidence for their antibacterial and antiviral activity, which corresponds apparently to antiprotease activity. a prodrug approach to glutathione derivatives with in vitro antiparasitic activity department of chemistry and materials manchester, faculty of science and engineering, metropolitan university, manchester, u.k. the potential chemotherapeutic activity of peptides are lost in many cases in vitro, due to their inability to cross cell plasma membranes. the recent identification of a series of glutathione diesters with high antiparastic activities in vitro against t.b.brucei (african sleeping sickness) lead us to investigate the determinants associated with their activity. a qsar study on some twenty-five diester derivatives against t.b.brucei and t.b. rhodesiense lead us to conclude that the mechanism of action of these compounds is related to membrane penetration and hydrolysis, controlled by hydrophobicity and steric factors. a hplc and sensor study have confirmed the de-esterified diacid as the active agent of these prodrugs. dietary taurine prevents oxidative stress and morphological alterations in the retina of diabetic rat f. franconi , m. a. s. di leo , s. caputo , n. gentiloni silveri , and g. ghirlanda department of pharmacology, university of sassari, and department of internal and geriatric medicine, catholic university, rome, italy diabetes mellitus can cause various complications including retinopathy, which is the earliest and most common complications of diabetes mellitus, affecting % of diabetics and progressing to blindness in about %. considerable evidence implicates oxidative stress in the pathogenesis of diabetic retinopathy. in fact, hyperglycemia generates reactive oxygen species and free radical defense is reduced in diabetic patients. thus, the prevention of oxidative stress may have important implications for pharmacological attempts to prevent diabetic retinopathy. at this regard, it has been found that taurine, a semi essential amino acid with antioxidant activity, is decreased both in type and type diabetes mellitus. moreover, taurine seems to have a peculiar role of taurine in terms of cellular physiology and pathophysiology of the retina. among others, taurine is thought to produce important physiological effects through osmoregulation, calcium modulation and antioxidant effects. therefore, we examined the effect of dietary chronic ( months) taurine ( % and %) supplementation in diabetic rats in comparison with vitamin e ( and ui). dietary taurine supplementation, for months, does not influence conjugated dienes (cd), lipid peroxides (lp) and na/k atpase activity in the retina of non diabetic rats. using rats streptotozocin (stz) induced diabetes of -month duration, we found that cd, lp are significantly increased and they remained elevated for months. while, the na/k atpase is significantly decreased during the whole experimental time ( months). moreover, an inverse correlation has been found among the cd and lp and atpase activity. in the retina of stz rats, these biochemical alterations are accomplished with marked profound morphological changes. in stz rats, taurine enriched diets decrease the lipid peroxidation and preserve the atpase activity, being % taurine more effective than % diets. the morphological examination reveals that in rats feed with % taurine no proliferative changes are present. moreover, the beneficial effects of taurine are more marked than of those of vitamin e. these results and previous findings encourage new investigations to evaluate the efficacy of taurine as an adjunctive agent ch ch (ch ) n xco iran applicated be ( mg/kg - days) and the third -control. enzyme activities were determined spectrophotometrically in brain homogenate. results: polyamine oxidase activity decreased significantly lower dose of be didn't induce any significant change in diamine oxidase activity gaba-transaminase activity increased significantly (p Ͻ . ; p Ͻ . ) and dose dependently upon be treatment we have been examined the effects of propofol, taurine and propofol combined with taurine on free intracellular amino acid (aa) profiles, superoxide anion formation (o Ϫ ), hydrogen peroxide production (h o ) and released myeloperoxidase activity (mpo) in polymorphonuclear leucocytes (pmn). propofol led to significant changes in pmn free taurine, glutamine, glutamate, aspartate, methionine, basic, neutral (naa) and branched chain amino acid concentrations. exogenous taurine reduced pmn naa while increasing intracellular taurine. taurine supplemented to propofol significantly reversed the changes in taurine, naa and alanine only. regarding pmn immune functions propofol significantly decreased o Ϫ , h o formation and mpo. taurine decreased o Ϫ and h o production, while increasing released mpo. when propofol and taurine were combined they appeared to by reacting tyrosine with -nitroso- -naphthol in the presence of nitric acid - -benzyo- -(alanyl)- -phenoxazone (blp) an analog of actinomycin d is produced. the structural similarity of blp to actinomycin d prompted the national cancer institute (nci) to investigate its antitumor activities. the nci investigations revealed that blp exhibits growth inhibitory effects on various cancer cells and as a result blp has received the u.s. patent from the u.s. patent office. the purposed of this investigation was to synthesize similar benzo phenoxazone derivative by reacting -nitroso- -naphthol with -(α-hydroxy -methylaminopropyl)phenol in the presence of nitric acid. during the study, it was found out that , -benzo- phenoxazone derivative is not produced but a hydrogenated form of , -benzo- -phenoxazone which is probably , -benzo- -(α-hydroxy -methylaminopropyl)- -hydroxyphenoxazine (bhmhp) which has been suhhested from mass spectra obtained by electron ionization, ei, chemical ionization, ci and electro-spray ionization, esi, methods. bhmhp was screened against various cancer cell lines by nci and has shown promising effect against three ( ) breast cancer cell lines: mda-mb- , mda-n and hs- t. the % growth inhibitory (gi ) concentrations for these three cell lines were . ϫ Ϫ , . ϫ Ϫ and . ϫ Ϫ molar respectively. a. bocheva and t. pajpanova institute of molecular biology, bulgarian academy of sciences, and institute of physiology, bulgarian academy of sciences, sofia, bulgariathe histamine is an endogenous substance with neurotransmitter and neuromodulator functions in the organism. its antagonists are used in the therapy of allergic diseases and inflammatory reactions and as antiulcer drugs.the limited potentialities of the antihistamine therapy together with the increasing number of the people suffering from allergic diseases give rise to the design and synthesis of new histamine analogues as a perspective area in the chemistry of therapeutic drugs.additionally, compounds containing the guanidine, oxyamino and sufonamide moieties are known to elicit a variety of pharmacological responses and are present in several marketing drugs or drug candidates.on the other hand, similar compounds, being a part of bigger structures (for instance peptides), can imitate the molecules of already known at ii-receptor antagonists.having in mind these data we aimed to synthesize new analogs of histamine containing sulfo-and oxy-guanidino groups with common formula: a. bocheva , s. pancheva , and t. pajpanova institute of physiology, bulgarian academy of sciences, and institute of molecular biology, bulgarian academy of sciences, sofia, bulgariathe problem of the efficient therapy of pain is important not only from clinical but from social and economic point of view. the great achievements in medicine are connected with the research on the development of antinociceptive drugs.melanocyte-inhibiting factor (mif) is a tripeptide (pro-leu-gly-nh ) that was discovered in hypotalamus.the mif- exerted a weak analgesic effect. the synthesis of non-protein amino acids and their incorporation into biologically active peptides might become a powerful method for the design and development of modified analogues of natural peptides. having in mind these data we synthezied a number of new mif-analogues, containing unnatural amino acids such as cav, slys, sleu, slle and snie and in vivo experiments were performed to study their action on the nociception. the changes in nociceptive effects were examined in male wistar rats by the tail-flick (tf) and hot-plate (hp), as well as, the randall-seitto paw-pressure tests. the peptides were applied intaperitoneal (i.p) injection at a does mg/kg. the results show that the newly sinthesized analogues exert an antinociceptive effects in all tests used. naloxone at a dose mg/kg (i.p) antagonized the antinociceptive effects of mif-analogues. the interaction between platelets and fibrinogen is known to be mediated by the intergrin gp iib/iiia. the arg-gly-asp (rgd) sequence located on fibrinogen and other proteins of blood and extracellular matrix is the minimum requirement for cell attachment and adhesion. it has been found that peptides containing the rgd sequence can effectively inhibit the binding of fibrinogen to gp iib/iiia. in addition aspirin has been shown to be beneficial in the treatment of stable and unstable angina, acute myocardial infraction. aspirin acetylates and inhibits the enzyme cyclooxygenase, the first enzyme involved in thromboxane a (txa ) synthesis, an activator of platelet aggregation and adhesion.we have already reported that the combination in the same molecule of dipeptide amides, containing amino acid(s) of rgd sequence, with salicylic-residue -ro-c h -coϳ, {where rϭh or ch co} at their n-terminal amino group have shown inhibitory activity on human platelet aggregation. continuing this research project on salicyl-peptides we have synthesized a series of rgd analogs, incorporating salicylic acid derivatives, by conventional solution techniques and/or by solid phase. the synthesized rgd analogs were identified by ir, nmr and es-ms spectra and tested for inhibitory activity on human platelet aggregation in vitro, by adding common aggregation reagents (collagen, adp, thrombin) to citrated platelet rich plasma (prp). platelets were obtained from venous blood of healthy donors and the prp was isolated by centrifugation at g for min at °c. the aggregation was determined using a dual channel electronic aggregometer. malonyl dialdehyde (mda) production was measured using thiobarbituric acid reagent. in order to confirm these results, flow cytometry with monoclonal antibodies against gpib, gpiib/iiia, gpiiia and gmp was used. the ic values of the synthesized and tested compounds, as well as their mda production and flow cytometry results will be discussed. amino acids have a long tradition as building blocks, chiral auxiliaries and/or ligands in advanced organic synthesis and catalysis. at dsm an enzymatic kinetic resolution process has been developed, based on an aminopeptidase catalyzed stereoselective hydrolysis of racemic amino acid amides to form a mixture of l-amino acid and unchanged d-amino acid amide.several small peptides currently are under investigation as possible anti-tumor agents. neuropeptides such as substance p (sp) and neuropeptide y (npy), have been studied for their ability to prevent tumor growth or the proliferation of several cancer cell lines. these neuropeptides have been investigated for their effect to prostate cancer, small cell lung cancer (sclc) and breast cancer. the synthetic sp analog [d-arg , d-phe , d-trp , , leu ]sp (antagonist d) and the c-terminal analog [arg , d-trp , , mephe ]sp - (antagonist g) inhibit sclc cell proliferation in vitro and in vivo, while the analogs [glp , glu(bu t ) ]sp - and [glp , glu(bu t ) ]sp - showed significant inhibition in the proliferation of the cancer cell lines hela and t d.in the present study the c-terminal analogs of sp [glp , d-trp , glu(bu t ) ]sp [ ] [ ] [ ] [ ] [ ] [ ] ( ), [glp , d-trp , , glu(bu t ) ]sp [ ] [ ] [ ] [ ] [ ] [ ] ( ), [glp , d-trp , , mephe , glu(bu t ) ]sp - ( ), [glp , d-trp , mephe , glu(bu t ) ]sp - ( ), [glp , trp , mephe , glu(bu t ) ]sp - ( ), [glp , mephe , d-trp , glu(bu t ) ]sp - ( ), [glp , d-trp , mephe , glu(bu t ) -oh]sp - ( ), [glp , d-trp , cys(acm) -oh]sp - ( ), [glp , d-trp , mephe , cys(acm) -oh]sp [ ] [ ] [ ] [ ] [ ] [ ] ( ), [glp , d-trp , , mephe , cys(acm) -oh]sp - ( ) have been synthesized and tested for their antineoplastic properties in several cancer cell lines. they were also examined for their cytotoxicity to normal cells.the analogs - are peptide amides whereas the analogs - are peptide acids. they were performed using the stepwise synthesis either in solution, using the method of mixed anhydrides with carbonic acids or in spps using the fmoc/bu t methodology. the fragment condensation method in solution, using phosphonium reagents, such as pybop, was also applied. the analogs were purified (hplc) and identified (ft-ir, es-ms, h-nmr).the antineoplastic properties of the analogs were studied using sister chromatide exchange (sce) and proliferation rate index (pri). as it is known the sce method is an indicator of dna damages or its repair mechanism, while the method of pri is a sensitive marker of cytotoxicity. the experiments were carried out using cultured human lymphocytes from healthy donors and these results will be discussed.semiempirical quantum chemical investigation of some thymidine derivatives modified with amino acids and peptides at Ј, Ј-positions j. velkov , i. stankova , a. ivanova , and a. tadjer department of chemistry, south-west university "neophit rilski", blagoevgrad, and department of chemistry, sofia university "st. kl. ohridsky", sofia, bulgaria optimized geometry and electron charge distribution for some thymidine derivatives ( Ј, Ј-bis-o-n-α-benzyloxycarbonyl-alanyl-, Ј, Ј-bis-o-n-α-benzyloxycarbonyl-valyl , Ј, Ј-bis-o-n-α-benzyloxy-carbonyl-glycyl-glycyl-glycyl, Ј, Јbis-o-n-α-benzyloxycarbonyl-phenylalanyl, Ј, Ј-bis-o-n-αbenzyloxycarbonyl-glycyl) were calculated at the semiempirical (am ) level. the choice of method is limited by the molecular size. in addition, the differences between the ground state energy of the compounds and that of the hydrolysis reaction intermediates were compared to the experimentally found stability towards hydrolysis.with a few notable exceptions, attempts to crystallise integral membrane proteins have failed due to the difficulties in finding appropriate conditions for proteins that have both hydrophobic and hydrophilic domains. thus structural information is largely limited to predictions of secondary structure from the amino acid sequence and computer modelling, neither of which can as yet give high resolution detail. thus alternative approaches are required, and one that we have employed is to look at the substrate binding/transport characteristics of compounds and predict what features the binding site might have. the membrane transport protein that we are interested in is the proton-coupled di/tri-peptide transporter, which has a wide range of natural substrates and is known to transport therapeutically important non-peptides such as ᮀ-lactam antibiotics and angiotensin converting enzyme inhibitors.the initial question that interested us was what makes a di/ tri-peptide a substrate, but not an amino acid? while the obvious answer is the peptide bond, studies with 'space mimic' compounds (which have the space filling properties of a dipeptide but no peptide bond) gave the surprising result that the peptide bond was not essential for binding and translocation. although these space mimics had n and c termini, studies from our laboratory and others have shown that the presence of free amino or carboxyl groups are not a prerequisite for binding or translocation either. this leaves the question of what does distinguish a pept substrate from a non-substrate?computer modelling of a large number of pept substrates has allowed the development of a substrate template, whereby potential substrates can be scored according to their predicted binding affinity. from this it is clear that it is a sum of energies derived from a number of substrate-transporter interactions that determine binding affinity, including the n-and c-termini, the peptide bond components and the substrate side-chain groups. further studies aim to refine this model through the complimentary approaches of novel substrate design and sitedirected mutagenesis of the transporter protein.why are we interested in this? a large number of promising therapeutic compounds are found to have little or no bioavailability. compared with most membrane transporters pept has a wide range of potential substrates, and amongst its non-peptide substrates are a range of peptidomimetic therapeutic compounds. the recent finding that a peptide bond is not a prerequisite for transport opens up the possibility of designing prodrugs to be substrates for pept , and this has found to be an effective strategy for example with the antiviral drug valacyclovir.(we thank the wellcome trust for their generous support.) nuklearmedizinische klinik und poliklinik der technischen universität münchen, germanyaim: the high amino acid metabolism of tumor cells allows tumor imaging with radiolabeled amino acids as c-methionine (met) by positron-emission-tomography (pet). however in recent experimental and clinical studies met uptake was also found in inflammatory tissue thus leading to false positive results. the aim of the study was to compare [ f]fluoroethyltyrosine (fet), a new amino acid analogue, with met to assess their suitability for differentiating between tumor cells and inflammatory cells in vivo and in vitro.methods: popliteal lymph nodes of balb/c and dba/ mice were stimulated either by streptocotocin (stz), causing chronic lymphadenitis, or by concanavalin a (con a), causing in acute lymphadenitis. tumor infiltrated lymph nodes were induced by inoculating cells from a lacz transfected t-cell mouse lymphoma line into the footpads of syngenic dba/ mice. the uptake of met and fet was determined quantitatively in tumor infiltrated and inflammatory lymph nodes as well as in the lymph nodes of untreated mice. in vivo imaging of tracer uptake in mouse lymph nodes was performed using a high resolution ( . mm) small animal pet (madpet). in vitro the uptake of the amino acids met and fet was investigated in different cells, such as sw human colon carcinoma cells and c rat glioma cells, stimulated human lymphocytes and macrophages. about ϫ cells of each cell line were incubated in a buffered medium containing either different concentrations of unlabeled amino acids or con a (stimulation of lymphocytes) or the transport inhibitors amino-norbornane-carboxylic acid (bch, l-system), α-(methylamino)-isobutyric acid (meaib, a-system) or l-serin (asc-system). . mbq of each amino acid tracer were added and incubated. uptake was stopped by using ice-cold pbs, cells were washed three times and uptake was analyzed.results: in tumor infiltrated lymph nodes uptake of both tracers was higher than in control lymph nodes. met showed an increased uptake in both lymphadenitis models, whereas fet did not accumulate significantly. met and fet uptake in tumor infiltrated lymph nodes was also seen in madpet images, however inflammatory lymph nodes could only be detected in met images.the amount of tumor uptake was different in the various cell types investigated. c cells showed the highest uptake of all cells investigated and a slightly lower uptake was found in sw cells. in con a stimulated lymphocytes, the uptake of fet was negligible, while met uptake was significantly higher than in both tumor cell lines. since bch reduced the uptake of fet and met to approximately %, fet seems to be also predominantly transported into tumor cells by the l-system. the results indicate, that fet appears to differentiate between tumor and inflammatory tissue, as a result of the low uptake of fet in inflammatory cells. nuklearmedizinische klinik und poliklinik der technischen universität münchen, germanyover the past few years numerous studies have documented the high diagnostic accuracy of positron emission tomography (pet) using the glucose analogue f- -fluordeoxyglucose (fdg) for detection and staging of malignant tumors. a significant limitation of fdg-pet, however, is that increased uptake is not only observed in malignant tumors but also in activated inflammatory cells. due to the high glucose utilization of the normal brain and the lower protein synthesis in the normal gray matter the radiolabelled amino acid c- -methionine (met) gives higher contrast between brain tumors and normal tissue than fdg-pet. rapid uptake of met has been documented for several malignant tumors like gliomas, lung cancer, bladder cancer and malignant lymphomas since amino acid transport and protein synthesis are generally increased in malignancies. the application of met-pet however has been limited by the short half life of the radioactive label c- ( min) in contrast to f- ( min). amino acid analogous labeled with f- like f- -fluoro-α-methyltyrosine (fmt), f- -fluoro-ethyltyrosine (fet), f- -fluoro-phenylanaline, f- -fluore-proline will allow a more widespread application of amino acid pet in oncology. an other amino acid analogue i- -iodo-α-methyltyrosine (imt) is of clinical interest because the radionuclid i- allows it applicability for single-photoemission-computer-tomography (spect). the uptake of the amino acid analogues can only be regarded as a measure for the increased amino acid transport in the tumor cells because they are not incorporated into proteins. clinical data show that radiolabelled amino acids that are only transported into the cells are not inferior to those that enter protein synthesis. this tracers may also help to differentiate tumor lesions from inflammatory lesions when the expression of the transport systems for amino acids in tumor cells and inflammatory cells is different.lysinuric protein intolerance: understanding the pathophysiology of a multi-system disorder of dibasic amino acid transport m. p. sperandeo , , v. fiorito , a. pietrosanto , a. pepe , g. andria , and g. sebastio telethon foundation, rome, and department of pediatrics, federico ii university, naples, italy lysinuric protein intolerance (lpi; mim ) is an autosomal recessive disease, mainly found in finland and italy. clinical findings of lpi include: vomiting, diarrhea, failure to thrive, hepatosplenomegaly, osteoporosis, episodes of coma, and mental retardation. a life-threatening lung involvement (alveolar proteinosis) and renal insufficiency were also reported. metabolic derangement of lpi includes: reduced intestinal absorption of cationic amino acids (lysine, ornithine, arginine, caa), increased renal excretion of caa and dysfunction of the urea cycle leading to hyperammonemia and orotic aciduria. most of the clinical findings cannot be explained by a selective deficiency of amino acid transport, as indeed observed for cystinuria (mim ), a cognate disease of lpi. the molecular basis of lpi resides in an abnormal caa carrier functioning at the level of basolateral membrane of epithelial cells in the intestine and the kidney. caa transport is mediated by y ϩ l system, that is exerted by heterodimers consisting of the f heavy chain ( f hc) and a light chain represented by either the solute carrier family a, member (slc a ) or (slc a ). after excluding the f hc as the causative gene of lpi, we identified slc a as the lpi gene and characterized mutations in twenty-five patients from families ( italian, japanese, moroccan, greek, and pakistani; independent alleles) affected by lpi. thirty-two of the independent alleles ( . %) were characterized and fourteen mutations were identified. only five mutations (namely insatca, w x, delctct, ivs ϩ gaea, s r) were identified in more than one independent family. most mutations are located in the slc a coding region, except for two splicing mutations. the pathogenesis of some clinical findings of lpi, namely alveolar proteinosis and renal involvement, remains mostly unknown. we are currently investigating the role of slc a gene in lpi, which, in addition to slc a , is responsible of the y ϩ l activity. in fact, the regulation of the y ϩ l system, exerted by either f hc/ slc a or f hc/slc a , is still unknown. hypothetically, the activation of f hc/slc a in all tissues might be the "simple" way to a lpi gene-therapy.[acknowledgements: m. p. s. is supported by telethon-italy (grant n. cp) and is an assistant telethon scientist.] pre-eclampsia (pe) is a potentially life threatening complication of pregnancy and is one of the leading causes of maternal and fetal morbidity and mortality. pe is associated with endothelial cell dysfunction and inadequate placental perfusion. fetal plasma l-arginine levels are decreased in pe and there is controversy as to whether nitric oxide (no) production is altered. we have investigated whether the kinetics of l-arginine transport via system y ϩ and no production are altered in fetal umbilical vein endothelial cells (huvec) from pe pregnancies. kinetics of l-arginine transport were similar in huvec isolated from normal, preterm and pe pregnancies, however nethylmaleimide inhibited transport in normal but not pe huvec. basal and histamine-stimulated no production was similar in normal and preterm huvec, whereas pe increased basal ( Ϯ vs . ϫ pmol/ cell/ min) and histaminestimulated ( Ϯ vs Ϯ pmol/ / min) no production. whole-cell patch clamp measurements revealed similar inward rectifying k ϩ currents in normal and pe huvec, with resting membrane potentials of Ϫ Ϯ and Ϫ Ϯ mv in normal and pe huvec, respectively. increased enos activity in pe endothelial cells may serve as a compensatory mechanism to counteract the hypertension observed in pe, however, elevated no production is apparently not associated with enhanced larginine transport. department of pharmacology, university of cambridge, u.k.over the past years, concerns have heightened over the escalating numbers of pathogenic microorganisms that are resistant to multiple antibiotics. this phenomenon poses major problems in the treatment of patients with hospital or community-acquired infections caused by bacteria, yeast, fungi and parasitic organisms. particularly intriguing are the so-called multidrug transporters, which have specificity of compounds with very different chemical structures and cellular targets. this lecture will focus on the molecular properties of the atpbinding cassette multidrug transporter lmra in the lactic acid bacterium lactococcus lactis. lmra is a close homolog of the human multidrug resistance p-glycoprotein, overexpression of which is one of the major causes of resistance of human cancers to chemotherapy. surprisingly, lmra can even substitute for pglycoprotein in human lung fibroblast cells. recent biochemical and pharmacological studies on lmra suggest that the protein may operate by a two-cylinder engine mechanism to transport amphiphilic drugs from the inner leaflet of the plasma membrane. this mechanism will be discussed in more detail. bone and bone marrow are important sites of metastasis formation in breast cancer; so, we studied the level of bone sialoprotein (bsn) and fibronectin (fn), two key connective tissue antigens, in patients with metastatic breast carcinoma. our data reveled that bsn have a statistically significant association with bone metastases in that disease. fn level was also significantly changed in metastatic breast carcinoma when compared to the non metastatic cases. kharkov national university, radiophysical department, chair of molecular and applied biophysics, kharkov, ukraine * present address: institute of cell and molecular biology, university of edinburgh, edinburgh, scotland, u.k.in the work the temperature dependencies of dielectric parameters of human serum albumin (hsa) and fibrinogen solutions ( . m nacl, ph . ) were obtained in the temperature interval - c degrees. the measurements of the dielectric parameters were carried out at the frequency of . hhz, i.e. in the range of free water molecules dispersion. in contrast to dependencies for poor solvent, temperature dependencies of dielectric parameters for protein solutions are of nonmonotonous character; they have a number of peculiarities in the temperature ranges of - , - and - c degrees. this fact means that at these temperatures redistribution of free and bound water in protein-water system occurs due to structural changes in protein molecules. the dependencies of hydration of hsa and fibrinogen on temperature were obtained as well.in the work the mechanism of temperature changes of spatial organisation of protein molecules was proposed. perhaps, this mechanism is responsible for maintenance of thermal stability of the functionally active conformation of native proteins. as peculiarities on temperature dependencies of dielectric parameters of solutions of globular (hsa) and fibrillar (fibrinogen) proteins were in the same temperature regions, one may to assume that the mechanism of proteins thermal stabilisation in physiological temperatures interval has a general character. laboratory of cell pharmacology, university of leuven, medical school, campus gasthuisberg (o&n), leuven, belgium n-pomc was purified from conditioned medium of att cells using a sequence of concentration, fractionation by ion exchange, rp-hplc and gel-filtration. twenty isoforms of n-pomc, for both and kda, were identified by means of mass spectrometry and n-terminal sequencing. these isoforms are assumed to be pomc - or pomc - with heterogeneous glycosylation.the n-pomc isoforms were tested on prolactin (prl) gene expression and lactotroph mitosis in pituitary cell aggregate cultures. prl mrna content was quantified by means of real time rt-pcr. three kda n-pomc fractions enhanced prl mrna levels by - %, while all other isoforms were inactive. this effect was abolished by immunoneutralization with n-pomc monoclonal antibody. only one fraction stimulated lactotroph proliferation ( . Ϯ . %) as assessed by brdu incorporation in prl-immunoreactive cells. several (but not all) kda n-pomc fractions stimulated prl mrna level and lactotroph mitosis. on the other hand, all and kda isoforms activated the mc- and mc- receptor in cell lines in which these receptors were transfected. thus, att cells produce various n-pomc isoforms, only a part of which display an effect on prl mrna expression. even fewer isoforms affect lactotroph proliferation. since all isoforms activate the mc- and mc- receptor, it is suggested that the effect of the few isoforms on lactotrophs is mediated by (a) different receptor(s). are widely prescribed for the treatment of mild to moderate depression and the putative antidepressant constituent is probably hyperforin. in this study the effect of hyperforin was investigated on the release of neurotransmitter amino acids.coronal cortical slices ( µm) were cut and perfused with gassed ( % o , % co ) acsf at °c. two-minute samples of perfusate were collected and aspartate and glutamate were assayed by hplc. potassium-and veratridine-stimulated release was elicited by administering pulses of k ϩ ( mm) or veratridine ( µm) minutes apart.in control experiments the second k ϩ pulse elicited glutamate release which was % of the first pulse. hyperforin ( µm) perfused for minutes prior to, and during, the second k ϩ pulse significantly increased glutamate release to % (p Ͻ . , n ϭ - ). release elicited by the second veratridine pulse was % of the first pulse for both glutamate and aspartate. hyperforin ( µm) increased this release to the second pulse to % and % respectively (p Ͻ . , n ϭ - ). when perfused on its own for minutes, hyperforin ( µm) increased the basal release of glutamate (p Ͻ . , n ϭ - ).in conclusion, the increase in the release of neurotransmitter amino acids observed following hyperforin is possibly mediated through a facilitatory action on voltage-operated ca ϩ or na ϩ channels.glucagon-like peptide- ( - ) amide (glp ) is the main product of the glucagons gene expression in intestinal l cells into the cirulation in response to the ingestion of food and is the most potent stimulator of glucose-induced insulin secretion. glp receptors have also been detected in discrete areas of rat brain and intracerebroventricular glp has been shown to inhibit feeding in fasted rats. in this study hplc techniques were employed to evaluate the effects of glp on serotonin ( -ht) and γ aminobutyric acid (gaba) metabolism in rat brain. glp ( . µm) produced a significant decrease in levels of -ht by % after minutes of incubation with combined hypothalamus and brain sterr. synaptosomes. levels of hydroxyindolacetic acid ( -hiaa), the principal metabolite of -ht, and tryptophan the amino acid precursor of -ht, were also decreased significantly by % and % respectively. gaba and its amino acid precursor glutamic acid were both measured at the same conditions as above, but a precolumn derivatization hplc technique was used. the increase in levels of gaba ( %) and glu ( %) by glp was not significant.the results suggest that decreased synaptosomal levels of -ht and -hiaa caused by glp are due to diminished availability of typtophan by the peptide. in experimental model of iron overload we obtained the following results: the concentration of carbonyl groups tended to increase, while mda level significantly increased after feso treatment ( . Ϯ . vs control . Ϯ . µmol/mg prot.) and ( . Ϯ . vs control . Ϯ . nmol/mg protein p Ͻ . ) respectively. it was associated with significantly increased iron content ( . Ϯ . µg/mg prot. vs control . Ϯ . p Ͻ . ). it is clear that oxidative stress occurs in experimental iron overload, if sufficiently high levels of iron within hepatocytes are achieved. in group treated with feso and spermine, iron content was significantly decreased ( . Ϯ . p Ͻ . compared with fe treated only) and carbonyl group content tended to be lower in comparison to feso treated only ( . Ϯ . ), but mda level didn't change ( . Ϯ . ). in addition, treatment with spermine alone resulted in increase of mda level ( . Ϯ . vs control p Ͻ . ), iron content didn't change ( . Ϯ . ), but carbonyl groups were decreased ( . Ϯ . vs control p Ͻ . ). feso treatment increased gsh level ( . Ϯ . nmol/mg prot. vs control Ϯ . ; p Ͻ . ) while in combination with spermine this increase was more profound ( . Ϯ . ; p Ͻ . vs control, p Ͻ . vs feso ). spermine alone produced similar increase of gsh level ( . Ϯ . , p Ͻ . vs control; p Ͼ . vs feso ). the results emanating from the human genome programme have required a reappraisal of protein science and have led to the rapid upsurge in interest in the area of proteomics. this sudden re-emergence of protein science, in fact, was predictable and should not have been surprising.recent experience of protecting group design with respect to lysine and aspartic acid will be discussed together with aspects of chemical synthesis of small proteins of biological significance and in the context of chemical synthesis methodology making contributions to the general field of proteomics. using a cell line permanently expressing the mouse taurine transporter (mtaut) as a fusion protein, we investigated the underlying mechanism by which the immunosuppressive drug cyclosporin a (csa) inhibits taurine transport. csa inhibited the recombinantly expressed mtaut function both in dose and time dependent manner. the inhibitory effect of csa was reversible. thus, washing out the csa resulted in almost complete recovery of taurine uptake. to obtain further insight, we examined the surface abundance of the mtaut as a function of csa treatment using a surface-labeling assay. our results demonstrated that csa treatment altered the surface expression of the mtaut without significantly altering its total expression level, and the reduction in the cell surface expression paralleled the decrease in taurine uptake. upon removal of csa, the virtual recovery in taurine uptake was due to the concomitant increase in the number of taurine transporters on the cell surface. taken together, our results suggest that csa induced inhibition of taurine uptake was either due to the impaired targeting of the taurine transporters to the cell surface or due to the removal of the transporters from the cell surface. polyamines are neuromodulators in a number of physiological and pathological conditions in cns. since application of ethylene glycols causes hypoactivity and lethargy of experimental animals, depression of cns and various neurological symptoms, the aim of this study was to examine the effects of butoxyetanol on polyamine and gaba catabolism, taking in account an alternative pathway of gaba synthesis from putrescine. methods: male wister rats were allocated into three groups: first treated by be ( mg/kg - days), second key: cord- -h ttoo z authors: bowman, grant r.; cowan, andrew t.; turkewitz, aaron p. title: biogenesis of dense-core secretory granules date: - - journal: trafficking inside cells doi: . / - - - - _ sha: doc_id: cord_uid: h ttoo z dense core granules (dcgs) are vesicular organelles derived from outbound traffic through the eukaryotic secretory pathway. as dcgs are formed, the secretory pathway can also give rise to other types of vesicles, such as those bound for endosomes, lysosomes, and the cell surface. dcgs differ from these other vesicular carriers in both content and function, storing highly concentrated cores’ of condensed cargo in vesicles that are stably maintained within the cell until a specific extracellular stimulus causes their fusion with the plasma membrane. these unique features are imparted by the activities of membrane and lumenal proteins that are specifically delivered to the vesicles during synthesis. this chapter will describe the dcg biogenesis pathway, beginning with the sorting of dcg proteins from proteins that are destined for other types of vesicle carriers. in the trans-golgi network (tgn), sorting occurs as dcg proteins aggregate, causing physical separation from non-dcg proteins. recent work addresses the nature of interactions that produce these aggregates, as well as potentially important interactions with membranes and membrane proteins. dcg proteins are released from the tgn in vesicles called immature secretory granules (isgs). the mechanism of isg formation is largely unclear but is not believed to rely on the assembly of vesicle coats like those observed in other secretory pathways. the required cytosolic factors are now beginning to be identified using in vitro systems with purified cellular components. isg transformation into a mature fusion-competent, stimulus-dependent dcg occurs as endoproteolytic processing of many dcg proteins causes continued condensation of the lumenal contents. at the same time, proteins that fail to be incorporated into the condensing core are removed by a coat-mediated budding mechanism, which also serves to remove excess membrane and membrane proteins from the maturing vesicle. this chapter will summarize the work leading to our current view of granule synthesis, and will discuss questions that need to be addressed in order to gain a more complete understanding of the pathway. in eukaryotes, newly-synthesized proteins destined for secretion are first transferred from the cytoplasm to the lumen ofthe endoplasmic reticulum, and then progress through the golgi apparatus to the trans-golgi network (tgn). at the tgn, the choice of secretory pathways broadens. one route, which appears to be present in all cells, is constitutive in the sense that secretion does not depend on extracellular signals. such secretion involves the budding of vesicles or tubular elements from the tgn and their subsequent transport to and fusion with the plasma membrane, and is essential for cell growth since, among other functions, it provides new material to expand the cell surface. in addition to a constitutive route , many cells maintain a secretory mode that is adapted for the tight coupling of protein release to extracellular stimuli. for such regulated exocytosis, the vesicles that carry newly-synthesized protein from the tgn accumulate in the cytoplasm until specific extracellular events trigger their fusion with the plasma membrane, resulting in the release of vesicle contents.f the vesicles involved are called dense-core granules (dcgs), the name reflecting the fact that the contents are so highly condensed that they form a large electron-dense plug in the vesicle lumen. a large amount ofprotein, as well as other molecular cargo, is thus efficiently stored in vesicular reservoirs and later released on demand. this pathway therefore permits larger and more rapid secretory responses than can be generated via constitutive secretion. classical dcgs in endocrine, exocrine and neuroendocrine cells are responsible for storage of a wide array of signaling molecules (e.g., peptide hormones) and secreted enzymes, and related vesiclesare found in metazoan cells of other lineages as well as in numerous unicellular organisms. the secreted proteins and macromolecules playa vast range of functions, from tissue coordination in metazoans to cyst formation in protists . regulated secretion also depends upon mechanisms for controlling the timely release of dcg contents, and this is accomplished by regulating the fusion of the vesicle membrane with the plasma membrane. much of the progress in understanding the mechanisms that mediate this step has been preceded or aided by studies of synaptic vesicles (reviewed in ref ) , which undergo regulated fusion with the plasma membrane, but differ from dcgs in their biogenesis and acquisition of contents. comparable work in dcg secretion has shown that many of the molecular components involved in regulating exoeytosis and achieving membrane fusion are shared by these two vesicle types. in addition to proteins that appear to be specific for regulated fusion with the plasma membrane, the mechanisms include factors, such as snare s and rab proteins, which are members of families of proteins that are of central importance to vesicular trafficking at multiple stages in the eukaryotic secretory pathway. thus, regulated exocytosis appears to be accomplished by the coupling of a regulatory mechanism to a universal core of membrane trafficking machinery. although many of the protein components have been identified, and a more complete understanding of the process remains an important goal for ongoing research. the mechanistic studies of regulated membrane fusion are too extensive to be included in this chapter, but have been covered in many reviews, - and above. figure .at least pathways diverge at the tgn in neuronal, endocrineand exocrinecells. a) a subsetof proteinsare destinedfor densecoresecretorygranules for release via regulatedexocytosis, these are found asaggregates in distendedareasofthe cisterna. b)proteinsdestinedforconstitutivesecretionaretransported via vesicles or tubules. c) proteinsdestinedfor lysosomes are concentrated,via the mannose- -phosphate receptor, into clathrin coated pits and vesicles. darkly-shaded squaresand circlesrepresentproteins that tend to coaggregate under tgn conditions, and that are subsequentlystored in dcgs. lightly-shaded formsrepresentproteinsthat primarilyexitthetgn viaother pathways.arrowheads representproteinsthat areligandsforthe mannose- -phosphatereceptor. crescents representproteinsthat arefound in a relatively evendistribution throughout the lumen. the extent to whichsomeconstitutively secretedproteinsmaybe concentrated within specific regionsof individualcisternaeis not incorporated into this model. this chapter will instead focus on a part of the pathway that precedes regulated exocytosis, namely the synthesis steps that lead to the formation of dcgs, beginning at the tgn. the tgn is a complex compartment that gives rise not only to the regulated and constitutively released classes of secretory vesicles but also to vesicles that carry hydrolytic enzymes to lysesomes. - therefore, we seek an understanding ofthe signals that guide outbound proteins in a -way tgn sorting problem (fig. ) . dcg protein sorting also continues in a post-tgn compartment, where additional factors come into play. this pathway has been the subject of numerous valuable reviews, - with a particularly thorough treatment by arvan and cascle. the anatomy ofdcg formation a number of important insights into the pathway of dcg formation have come from electron microscopy, providing a context for molecular and genetic studies. first , the fact that dcgs appear dense implies the existence of mechanisms to drive a degree of macromolecular aggregation that is unusual within the secretory pathway. many lines of research have led to the conclusion that protein sorting and concentration are intimately linked in this pathway, both relating to the self-aggregating tendency of dcg proteins that will be discussed below. rambourg and colleagues have investigated the localization of protein aggre:fiates, using serial thin sections to reconstruct the golgi apparatus during granule formation. - in cells producing mucous-containing dcgs, the cis and medial golgi appear as flat cisternae, and secretory proteins are evenly distributed in their lumina. in contrast, cisterna in trans regions are marked by multiple perforations and are dilated in regions that accumulate aggregates of secretory material. those dilations grow progressively larger in the more distal regions, while the nondilated portions take on a tubular appearance. at the trans-most cisterna, the dilated regions with their concentrated secretory cargo appear to exist as independent bodies, separate from a residual network oftubular membranes. several points were established or reinforced by these images. the first is that the visible concentration of dcg proteins begins within golgi cisternae . a second point is that the tgn, the vesicle donor, appears to be undergoing large-scale changes itself. the images also indicate that the vesicles do not bud conventionally in the manner that is well-established for coat (e.g., clarhrinl-mediared steps, since no coats are seen. electron microscopy also suggested that the aggregates undergo progressive changes, and are therefore likelyto be dynamic in nature. in pancreatic cellsthat are synthesizing insulin-storage granules, the proteinaceous cores seen in golgi dilations appeared less dense than the cores of insulin granules in the cell cyroplasm. since the latter are derived from the former, this implied that proteins reorganize during an organellar maturation process. an important conclusion is that dcg formation should be considered as a multi-step process that plays out in sequential compartments. an early phase occurs at the trans face ofthe golgi and results in the production of vesicles bearing concentrated secretory proteins. these are called immature secretory granules (isgs). subsequently those vesiclesare remodeled, as reflected morphologically by cargo condensation and biochemically by changes in protein composition, to become mature dcgs. . for simplicity, we will refer to the first process as budding, and the second as maturation. central issues to be considered in this chapter are the mechanisms responsible for protein sorting during those successivesteps. again for simplicity, we will largely confine our discussion to the dcgs found in neuronal and endocrine cells. many of the same mechanisms are likely to apply to other classes of dcgs, among which are those in hematopoetic cells; for example, see references and . genetics frequently offers a natural complement to morphological studies for developing an overview ofa pathway. unfortunately, a weakness in current approaches to analyzing dcg biogenesis is the absence of developed genetic models, although several systems show promise. no human diseases are known to stem from an inability to synthesize neuroendocrine dcgs. presumably, strong defects in dcg formation would result in embryonic lethality in a complex multicellular organism, since such a defecr would preclude regulated secretion of many peptides involved in tissue coordination. however, this has not prevented the generation of regulated exocytosis mutants in more simple systems, such as drosophila. in flies, the null phenotype of a gene called dcaps (calcium-activated protein for secretion), is embyronic lethal, but analysis of the larva has shown that the gene product is necessary for dcg exocytosis, as predicted from earlier work in mammalian chromaffin cells. . although mutations affecting earlier stages in the pathway (i.e., dcg synthesis) have not been characterized in this organism, the characterization of caps mutants in this system provides hope that the earlier steps will be accessible by further mutational analysis. c elegans offers another potentially useful system for the genetic analysis of dcg synthesis, and several mutations affecting regulated exocytosis have been identified in this organism (reviewed in ref. ) . currently, the only examples of dcg synthesis mutants are found in single-cell systems: the unicellular ciliates tetrahymena thermophila and paramecium tetraurelia, in which the mutations were chemically induced, and spontaneously-arising clones of the rat pheochromocytoma line pci . [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] the viability ofthese mutants substantiates the idea that regulated exocytosis, unlike constitutive secretion, is not involved in basal cell growth . that is, dcgs are essential for organismal survival in metazoans, but not for individual cell viability. in the pcl lines, some mutations appear to disrupt the transcription ofnumerous granule protein genes. , in the ciliate mutants, which appear to be due to single recessivealleles, the cargo genes are still expressed though no granules are synthesized. in one tetrahymena line, normal granule cargo appears to be shunted to the constitutive secretory pathway. this phenotype indicates that dcg cargo proteins are not sufficient to direct granule formation, a result which was particularly interesting in the context of experiments in which mammalian dcg cargo proteins were expressed in tissue culture cells that do not normally make dcgs. - such cells make vesicles with dense cores, presumably because cargo proteins expressed in nonspecialized cells can induce the formation oftheir own carriers from the tgn. these results implied that the capacity to make dcgs was inherent in the basic organization of the golgi/tgn since it could also occur in such nonspecialized cells. since this capacity appears to have been lost in the tetrahymena mutant, the defect in that line may point to an aspect of goigi/tgn function that is critical for regulated but not constitutive secretion. the full relevance of the ciliate or pc cell mutants to dcg biosynthesis will only be known when the mutations themselves havebeen identified. such geneticapproaches provide an unbiasedmethod for the identification of novel genes, and mayprovecritical in broadening our understanding of the granule synthesis pathway. although many of the dcg cargoproteins themselves have been cloned and characterized, much less is known about the mechanismsthat control protein sorting and condensation. geneticsystems may help to identifythe regulatory factors that are involved in theseprocesses. proteinsortingtakes place in thetgn and duringmaturation. in eachcase, a single compartment gives rise to multiple pathways, and the challenge is in understanding how dcg proteins, both in the lumen and the membrane, are cosorted from a larger cohort that includes proteins destined for other pathways. the relevant contributions oftgn vs, isg sortingarelikely to be cell-type specific and aregenerally difficult to quantify experimentally. however, the mechanisms for controlling sorting at both stages may be fundamentally similar. in particular, the considerationsthat arise from protein aggregation are relevant for both compartments. a long-standing issue is whether the primary mode of dcg protein sorting is active or passive. the model of active sorting was initially inspired by the paradigmof sorting to lysosornes, in which sorting derives from recognition of a set of soluble lumenal proteins by a transmembranereceptor. extendingthis to dcg biogenesis, the modelpositedthat a subsetof proteinshavepositive sorting signals for inclusionin isgs. • in this scheme, proteinsin the tgn lumen that lack targetingsignals are presumedto follow an alternative, default pathway of constitutive secretion. this model has been called "sorting for entry" (fig. la) . an alternative model posits that newly synthesized proteins can be targeted to isgs by default , even in the absence of specific targeting signals, if the flux of bulk membrane traffic toward isgs is greater than that to constitutive or lysosomal carriers. this may indeed be the case for cells that are highly committed to regulated exoeytosis. , in this case, the major sorting events occur in the isg, which becomes a functional extension of the tgn. proteins that are retained as isgs undergo maturation end up as the contents of mature granules. nongranule proteins can be selectivelywithdrawn from isgs during this period, and this model is termed "sorting by retention" (fig. b) . in evaluating either model, the sorting of dcg proteins cannot be considered in precisely the same terms that apply in other pathways, because the tendency of such proteins to self-aggregate facilitates a unique mode of targeting. among other things, it allows a large group of proteins to be sorted together in a single step. one implication is that sorting receptors, if present, could presumably function at concentrations that are dramatically sub-stoichiometric to their dcg protein ligands. furthermore, such receptors would only have to recognize some subset ofdcg proteins , since the remainder could be sorted indirectly via aggregation. in fact, no receptor has ever been unambiguously identified in this pathway. this does not by itself eliminate a "sorting for entry" model, because a second unusual feature of many dcg proteins is a tendency to bind to membranes. this has implications for sorting that will be discussed in a later section. many isolated dcg proteins will self-associate under in vitro conditions believed to approximate the tgn; namely, a slightly acidic flh and high calcium concentration relative to earlier compartments in the secretory pathway. , this can serve as a mechanism for sorting because it is selective: proteins that are constitutively secreted tend to remain soluble under conditions that promote dcg protein aggregation. this first sorting step can therefore be imagined as the evolutionary version of ammonium sulfate precipitation, with the collective behavior based on the proteins' individual biophysical properties, for example their surface charge. while the ability of individual proteins to aggregate is variable, mixtures of proteins may show cooperativiry in vitro, thereby increasing the efficiency of the step (fig. a) . efficient protein aggregation might be expected to show concentration-dependence, and indeed isolated dcg proteins only self-associate above a threshold concentration. this in turn suggests that minor constituents of dcgs may depend for their efficient sorting on coassociation with more abundant species, whose concentrations must be sufficiently high to drive their independent self-aggregation. the sorting efficiency of individual proteins can be experimentally measured as the fraction that is stored in dcgs as opposed to being mistargeted to the constitutive pathway. asexpected from coassociation models, the sorting efficiency of a protein may vary widely between different cell lines. one would also predict that the sorting efficiency ofa protein could be boosted by increasing the expression levelof other proteins with which it coaggregates, particularly those which are most abundant. chiefamong the abundant metazoan dcg proteins are the chromogranin/secretogranins, a group of proteins with shared physical characteristics despite their very limited sequence similariry. , indeed, the overexpression ofchromogranin b (cgb) in the att- neuroendocrine cell line increased the sorting efficiency of a second dcg protein, pro-opiomelanocortin (pomc). nonetheless , it is inherently difficult to test the proposition that self-or coaggregation is a primary sorting determinant using conventional structurefunction analysis, since aggregation is thought to be directed by gross biophysical properties of dcg proteins , and there are no clear "aggregation signals" at the amino acid sequence level. however, recent studies have shown that sorting efficiency can be increased by providing an artificial aggregation signal. heterologous expression of a his-tagged secretory protein enhanced the aggregation and dcg storage, in a calcium-dependent fashion, of cga. , the authors speculate that the tag functions as an "aggregation chaperone" by providing a local site for the binding of divalent cations, thereby nucleating the aggregation process. curiously stably incorporated into the aggregates, suggesting that dcg proteins in their aggregated form interact more strongly with other dcg proteins than with the his tagged peptide. whether endogenous proteins have similar nucleation-promoting properties remains to be determined. identifying the role of any single protein or protein domain in dcg sorting is complicated by the high degree ofcooperativity that is hypothesized to exist within dcg protein aggregates. colo mer et al took advantage of the observation that two exocrine dcg proteins, amylase and gp , do not coaggregate with neuroendocrine dcg proteins in solution, to study the sorting of dcg proteins the absence of coaggregation. when expressed in the neuroendocrine cells, the exocrine proteins were not stored in dcgs but instead secreted constitutively. in similar experiments, an endothelial dcg protein, von willebrand factor, was expressed in neuroendocrine att cells. this resulted, however, not in the constitutive secretion ofvon willebrand factor but instead in the formation of two morphologically-distinct classes of granules. one contained endogenous chromogranins, while the other contained von willebrand factor. a possibility is that two sets ofproteins aggregate independently in the tgn, which could be determined by a number of factors. for example, the two sets could precipitate at relatively distinct ph and/or calcium concentrations and thus be spatially or temporally separated. specific aggregate formation can also arise from conventional protein-protein interactions. in pituitary and pancreatic islet cells, for example, efficient sorting ofcga to dcgs depends on its association with secretogranin iii, and an essential targeting sequence in cga has been determined by gene truncation. cga sorting in pc cells also depends on a specific sequence in the protein, which overlaps with, but is not identical to, that region which is required in pituitary cells. this difference suggests that cga may be interacting with a different partner in pc cells, and indeed these cells do not express secretogranin iii. one possibility is that different surfaces of a cga domain can interact specifically with a range of partners, like a good host at a cocktail party. in summary, the data indicate that the aggregation of a particular protein depends on a number offactors, including its attraction to other potential binding partners within the aggregate, and the physiologic qualities of the lumenal environment, such as ph and calcium concentration, which affect the strengths of those interactions. the expression of proteins that are differentially sensitive to lumenal conditions or that form exclusive sets of protein-protein interactions can potentially result in the formation of multiple distinct aggregates in the same tgn compartment, each comprised of different proteins . these mechanisms could underlie the natural ability ofsome cell types to produce more than one classofdcgs, as is observed in aplysia bag cell neurons, bovine pituitary cells, as well as some protozoa. o-n though the model of sorring-by-aggregation is well established, the actual nature of the molecular interactions within such aggregates is difficult to define. the process of aggregation must be reversible so that the contents can be released into solution following exocyrosis, and moreover, it must be dynamic enough to permit the reorganization oftheir substituents during maturation.v' the latter isparticularly clear in pancreatic~-cells, in which the insulin-containing dcgs exhibit a crystalline ultrastructure, observed by electron microscopy, that is not found in isgs. in comparison to the production of insulin crystals, which involves the assembly of a single protein, the formation of dcg ultrastructure in protozoa may be significantly more complex. in these cells, the lumen ofmature dcgs is filled by a crystalline core that consists of multiple varieties ofproteins,? , indeed , the localization of different proteins within the cores of paramecium dcgs has revealed that the crystals contain at least two distinct layers, each with a different set of protein componenrs. f" images ofisgs reveal that the components of the two layers are interspersed in this compartment, indicating that the layersare formed during a subsequent reorganization phase. thus, there is a significant amount of reorganization that must occur during crystal assembly. overall, the term "aggregation" may be misleading insofar as it suggests a phenomenon based on "stickiness", as for example for misfolded proteins in the endoplasmic reticulum. instead, the interactions that occur between individual proteins in an aggregate may be transient and weak, stimulating formation of aggregates in the tgn due to stabilizing effects provided by multivalent interactions while also allowing for reorganization of the proteins during crystallization, as in figure . some of the nonspecific, low-affinity interactions that occur in aggregates are likely to be mediated by the effects of calcium and ph in charge neutralization, leading to intermolecular interactions of acidic proteins by coordinate association with calcium ions (fig. a) . it is noteworthy that the chromogranins/secretogranins contain a preponderance of acidic amino acids, which endow these proteins with the capacity to bind large numbers of calcium ions with low affinity. ! acidic calcium-binding proteins also form the core of some lrotist dcgs, though they show little overall sequence homology with mammalian proreins.f . an attractive explanation for the similarities is that they reflect a common aggregation-based dcg synthesis mechanism between protozoa and multicellular organisms, and that the amino acid sequences have evolved under similar constraints. following dcg synthesis, the regulated secretion of dcg cargo proteins is dependent on mechanisms that bring the vesicles to the cell surface and control their fusion with the plasma membrane. these activities are dependent on the activity of dcg membrane proteins, for example those that interact with cytoskeleton-based motors for intracellular transpon and those that mediate regulated exocytosis.f it follows that the aggregation ofcore proteins during dcg synthesis cannot by itself be sufficient to form functional dcgs, and that there must be specific, though not necessarily direct, interactions between the lumenal proteins and the membrane constituents in order to ensure efficient sorting of these proteins to the same vesicles. these interactions have been difficult to detect, although some possible examples are discussed in a later section. what is clear, however, is that many lumenal proteins can themselves associate with membranes in unconventional ways. however, the nature and the functional significance of those associations are largely unsettled. five to ten percent of cgb adheres tightly, in a calcium and ph sensitive manner, to mernbranes. ! whether this fraction is in dynamic equilibrium with the remaining~ % is not known, but there is no known chemical difference between the two cohorts. the membrane binding ofcgb is associated with an n-terminal domain defined by a disulfide-anchored loop, which is sufficient to confer membrane association when linked to an otherwise soluble protein. importantly, the chimeric protein was sorted to dcgs in spite of the fact that it did not appear to aggregate, suggesting that the n-terminal domain constitutes an independent targeting signal. that same domain may promote homodimerization at neutral~h, implying that it may mediate different interactions in sequential secretory compartments. cgb, as discussedearlier,also showsa strong tendency to aggregatein a controlled fashion. the coexistence in a single protein of domains that facilitate both protein-membrane binding and homo-or heterotypic protein-protein aggregation, offers the potential to generate cooperative networks with physiologically-useful properties (fig. b) . first, the total concentration of dcg proteins needed to reach the aggregation threshold in the tgn may be reduced for any proteins that interact with the membrane, since the local concentration may be increased depending on local membrane geometry. secondly, the avidity of a cgb aggregate for the membrane will be greater than that ofa monomer, since multiple n-terminal domains are availablefor independent membrane binding. validation of this came from an extension of the experiments with cgb chimeras outlined above. while a single n-terminal cgb domain was able to direct sorting to dcgs, efficient sorting only occurred when two such domains were present. this suggests that the membrane affinity of a single domain may be only marginally sufficient, but is more than adequate if two or more such domains are linked, as would be the case in a cgb aggregate. in a nonconventional sense, cgb could be considered as a dcg sorting receptor: a membrane-associated protein that is itself targeted to dcgs, and that can potentially cotranspon any proteins with which it associates. a similar argument has been made for the enzyme carboxypeptidase e (cpe), which is targeted to dcgs by a c-terminal amphipathic alpha helical domain. • in addition to acting as an enzyme to modify dcg cargo, cpe can also bind a subset ofdcg proteins , for example the hormone precursor pro-opiomelanocortin (pomc). the cpe recognition site involved is different from the enzymatic cleft, and binding may be important for efficient sorting ofpomc, a conclusion based on experiments with cpe knockout mice and from cpe -deficient cell lines. . cpe has been called a receptor for pomc and perhaps for other cargo proteins, though use of the term "receptor" has remained contentious since cpe can also aggregate with pome, chromogranins, and other cargo proteins in a conventional ca + and ph-dependent fashion. , o membrane association of cgb and cpe may be a property that has arisen convergently in these proteins, albeit by different mechanisms, reflecting the importance of this activity in dcg cargo sorting. an n-terminal disulfide bonded loop such as that found in cgb is found in several dcg proteins, including pomc and chromogranin a (cga), though the homology does not extend beyond the structural level, and evidence to date suggests that its role in sorting may be protein-specific. as in cgb , n -terminal disulfide loop domain in pomc is both necessary and sufficient for sorting, but sur~risingly,it appears to interact with the membrane indirectly, through interaction with cpe. the disulfide loop in chromogranin a was not necessary for the sorting of this protein in pcl cells, and instead an interior domain is essential for sorting in these cells,via interaction with membrane-associated secretogranin . • these studies find no evidence for a conserved dcg targeting signal, but they do indicate that specific protein-protein interactions can be important for efficient sorting oflumenal cargo. precisely how cgb, cpe, secretogranin iii, and other ostensibly soluble lumenal proteins associate with membranes is not resolved. there is some evidence that they associate preferentially with cholesterol-rich membranes, so-called lipid rafts. . consistent with this, depletion of cholesterol from tissue culture cells decreased the sorting efficiency of both cpe and cgb, though it is difficult to distinguish direct from indirect effects in such experiments. . in addition, because both constitutive and regulated secretion were inhibited by cholesterol withdrawal, the results do not demonstrate a specific role for cholesterol in dcg formation. the experimental limitations notwithstanding, these data suggest that the association of cpe and cgb with specific membrane sub-domains could be an important aspect ofsorting. iflipid rafts are indeed involved in this pathway, it could add another level of complexity to the cooperative mechanisms that may perrain (fig. ) . interestingly, cgb is also differentially sorted between the apical and basolateral pathways in polarized epithelial cells, which do not make figure . selective association ofocg proteinswithlipidraftsin thetgn. implications ofsuchassociation includethe following possibilities: . independentassociation ofproteinswith a singleraftwouldpromote protein-protein aggregation . . protein aggtegates could stabilize rafts with which they associate. large aggregates couldleadto formationofextensive rafis, in principle, thisprocess couldbesufficient to generate ocgs with a highly biased lipid composition, which is indeed observed. l l the thickened, patterned regions of the cisternal membrane representputativelipid subdomains. dcgs, and this also requires signals within the n-terminal domain. this may suggest a similarity in sorting mechanisms used in epithelial and regulated secretory cells. a final complication in dissecting dcg sorting signals is that the requirement for the disulfide loop in cgb depends on cell type. disulfide bond reduction led to the constitutive secretion of newly synthesized cgb in pcl cells. as expected, this treatment did not affect the sorting of secretogranin ii, a protein that undergoes aggregation but does not contain cysteine residues. however, in gh ci cells, the same treatment did not perturb the sorting of cgb. similarly, cga soning also appears to exhibit cell~e specificity: a c-terminal truncation was correctly sorted in pcl but not in gh ci cells, and an n-terminal region, which does not contain a disulfide loop, was important for sorting in pcl cells. thus, the sorting requirements for cga, cgb, and granule proteins more generally, may depend on the cell type, specificallybecause the efficiency of any protein's sorting will depend on the available interacting parmers . in some cases, a protein's interacting parmer could be a membrane raft, whereas in other cases, the same protein may be delivered to dcgs by virtue of its ability to aggregate with other lumenal cargo proteins. our current understanding of signals involved in dcg membrane protein targeting is relatively primitive. in principle, membrane proteins could be targeted by signals in their lumenal, transmembrane and/or cytoplasmic domains; however the characterization ofsuch signals has not been straightforward. a significant obstacle has been the fact that relatively few membrane proteins have been identified that are exclusivelylocalized to granules. phogrin (phosphatase homolog in granules of insulinoma) localizes to dcgs in a range of neuronal and endocrine tissues. it is a transmembrane protein with an n-terminallumenal domain and c-terminal cytoplasmic domain, and is synthesized with a large n-terminal proregion that is later cleaved within isgs. either the pro-domain or the lumenal domain of the processed protein can be independently stored in dcgs, indicating that each contain signals sufficient for rargeting. one possibility is that these, and by implication the full length phogrin as well, can be sorted by associating with the condensing core of granule cargo in the tgn. this may also be true for two dcg membrane proteins of the anterior pituitary and adrenal medulla, peptidylglycine a-amidating monooxygenase (pam) and dopaminẽ -hydroxylase. in these cases, there is physiological evidence that the lumenal domains can sort independently of the transmembrane or cytosolic domains, since both the soluble forms and the transmembrane forms occur naturally in dcgs. loo nonetheless, efficient storage of the transmembrane form of pam also requires signals within the cytoplasmic tail the idea that sorting of transmembrane proteins in dcg involves cytosolic signals is also supported by analyses ofvamp , a widely distributed dcg v-snare, and p-selectin, a protein of platelets and endothelial cells. the sorting ofvamp to insulin-containing dcgs is impaired by a point mutation in the cytosolic portion of the protein, and the expression of this incorrectly sorted mutant protein is unable to support regulated exocytosis in the absence of wildrype vamp . analysis ofp-selectin targeting is complicated by the fact that it can be found in more than one intracellular compartment, suggesting that it contains hierarchical targeting signals. in addition, the dcgs of platelets and endothelial cells share some properties with lysosomes, and mechanisms involved in their biogenesis may differ from those in neuronal and endocrine cells. . · nonetheless, p-selectin expressed heterologously in the neuroendocrine cell line att- was targeted to dcgs, and this depended on a tyrosine-containing motif in the cytoplasmic domain.l . the same motif is important in the endogenous endothelial cell context, indicating that the rargeting mechanisms may be similar. the tyrosine-based motif suggests that this protein can interact with a coat-associated adaptor, and indeed a functional role for ap- in the sorting ofp-selectin to dcgs has been suggested,llo but no ident ified coats are involved in the formation ofisgs in the tgn. one possibility is that conventional adaptor/coat-mediated sorting of p-selectin occurs at a step distinct from the known budding and maturation steps in dcg bio~enesis; a second is that adaptors may have noncanonical roles unrelated to coat recruitment.i i the studies ofp-selectin have revealed clear evidence that a cytosolic signal can be important for the sorting of transmembrane proteins to dcgs. further analysis of the targeting mechanism will likely be an important topic in future research, as it represents an activity that is topologically distinct from the relatively well characterized aggregation-based sorting events in the lumen. in an intriguing set of experiments, cutler and colleagues found evidence that the function ofthis cytosolic sorting determinant can be coupled to the expression ofa lumenal dcg protein. when the lumenal dcg protein von willebrand factor (vwf) was coexpressed with p-selectin in neuroendocrine att- cells, the vwf was stored in vesicles that were distinct from dcgs containing endogenously-expressed cgb, l , l a finding that is consistent with previous results.i the novel and intriguing finding was that p-selectin was preferentially targeted to the vwf-containing vesicles, indicating that vwf and pvselectin, which are normally expressed in platelet and endothelial cells, could be cosorred in a cell type in which they are heterologously expressed (fig. ) . there was no indication, however, of a direct interaction between the two proteins, and the sorting ofp-selectin in this context was instead dependent on the same tyrosine-containing cytoplasmic motif that had previously been shown to be necessary for targeting to dcgs. the targeting of one class of membrane proteins, those linked via a gpi-anchor, cannot depend on cytosolic signals, since anchors of this type do not penetrate the cytoplasmic membrane leaflet.i for gp- , the major membrane protein of zymogen granules in pancreatic acinar cells, sorting may occur via a coaggregation mechanism. its lumenal domain has been found to associate with a lectin (zg p), sulphated matrix proteoglycans, and syncollin, the last a lumenal protein that may itself interact with the membraney these proteins have been postulated to form a membrane-associated matrix that could serve as a sorting intermediary between the membrane and the zymogen core contents. l this is a variation on the model described for cgb and cpe as sorting receptors , and suggests by analogy that gp- or syncollin might serve as the membrane anchor for the zymogen core. however, dcg assembly is normal in the absence of either protein, indicating that neither is playing a unique role in that regardys, l in summary, the relatively limited evidence to date suggests that a mechanism similar to that involved in cargo protein condensation is involved in sorting of some, but not all membrane proteins with lumenal dcg contents. in principle, indirect interactions between membrane and core proteins may be equally important in the cosorting of membrane and lumenal cargo. iflumenal proteins like cpe preferentially insert into membrane sub-domains based on their lipid composition, then any membrane proteins that independently partition into the same sub-domains would be cosorted. in support of this hypothesis, recent studies have suggested that prohormone convertases and , which are responsible for proteolytic cleavageof lumenal proteins during granule maturation, are sorted to isgs by virtue of c-terminal membrane raft-associated tails, which are by themselves necessary and sufficient for targeting to dcgs,uo additionally, cytoplasmic signals on some transmembrane proteins appear to play important roles in sorting, but the mechanisms are unknown. the canonical mechanism of vesicle budding, as for example that involved in the emergence oflysosome-bound carriers from the tgn, involves transmembrane receptors, adaptors , and coat proteins . since there is no evidence that transmembrane proteins or coat proteins are relevant in isg budding, other mechanisms are likely to apply. there has been some progress in reconstituting this process using cell-free systems, though the field has generally suffered from a lack of in vivo models, for example a well developed genetic system with mutations that affect this step in the pathway. the general approach has been to start with labeled dcg protein in the tgn of permeabilized cells or in golgi-enr iched fractions, then measure the transfer of the label from the relatively large and pelletable golgi membranes to nonpelletable vesicles,using medium speed centrifugation to separate the two pools. the appearance oflabel in smaller vesiclesis taken as an indication of cargo transfer to isgs via vesicle budding. since little is known about dcg biogenesis, it is important to note that "budding" as defined by this assay may include a large number of steps, including the establishment of golgi/tgn microdomains, and the release of previously budded, weakly associated vesicles. thus, the results of these experiments could depend upon on the nature of the starting material. in addition, it has not yet been rigorously demonstrated in any system that the released vesicles are bona fide isgs, for example by testing whether they are competent to fuse with their appropriate target membrane. the reconst ituted buddin~reactions utilize atp, as expected , and most but not all require a cytosol extract . , - the small gtpase arf is required, although the targets for this regulatory protein are not yet clear. one potentially relevant arf target is phosfholipase d (pld) , the binding of which to the membrane can enhance isg buddingp pld converts phospharidyl choline to phosphatidic acid, perhaps thereby effecting a change in membrane curvature. this idea is appealing because, in the absence of coat proteins, the membrane curvature required for isg budding must be induced by other mechanisms. in addition, the indirect products of pld activity may recruit additional effectors to the budding site, including the unconventional gtpase dynamin_ . dynamin mediates membrane scission events, such as pinching off vesicle buds . however, pld does not stimulate budding in all reconstituted systems; the differences may reflect the variery of ways in which donor fractions are prepared. there is evidence that kinases and phosphatases, heterotrimeric g-proteins, and a phosphatidyl inositol transfer protein (pitp) are involved in isg budding, but the enzymatic substrates have not been established. [ ] [ ] [ ] [ ] [ ] an important unanswered question is whether any of these activities, the majoriry of which are as yet unidentified at the molecular level, is specifically required for the formation ofdcgs and not other membrane carriers. pld, for example, has been implicated in tgn tubularion, but the downstream effectors, as for dcg budding, the reaction depends upon ap-l, which can be recruited by the cytoplasmic tails of furin , the mannose-o-phospharereceptor,and other membrane proteins. the mannose- -phosphatereceptorcan in turn bind any soluble lysosomal enzymes in the isg lumen, so thesewillalsobewithdrawn in the budding vesicles. other soluble proteins may also be included based on random partitioning, but the aggregated deg proteins will be excluded.at the end of this process, the mature secretory granule is no longer a budding donor compartment, perhaps becauseit no longer contains membrane proteins that can recruit ap-l (representedby stars). are unknown. the noncanonical gtpase dynamin has been implicated in dcg budding as well as in constitutive secretion. . one possibility is that these activities are only indirectly involved in dcg synthesis. according to the "sort ing by exclusion" model ( fig. a ), isgs are created by a passive process, as aggregation prevents dcg cargo from entering into outbound vesicles and tubules that bear lysosomal or constitutively secreted proteins. instead of being actively budded from the tgn, aggregated proteins would be enriched in a separate subset of relatively large membrane carriers, while non-dcg proteins are removed from the compartment by active coat-dependent processes . thus, the cyeosolic components identified as isg budding factors by in vitro reconstitution assays may really be parts of the mechanisms for other secretory pathways. according to this model, the so-called sorting receptors need only act as membrane tethers in associating the lumenal aggregates with membrane rafts. as there is no need to transport this material to a new compartment, the receptors do not recruit cytosolic coat proteins for vesicle budding, as the traditional membrane receptor proteins do in sorting proteins to other pathways. an alternative to the "sorting by exclusion" model proposes that isg budding is indeed an active process, and that the same mechanism is also involved in driving the budding and tubulation of constitutive secretory carriers from the tgn. although the two pathways give rise to vesicles of vastly different sizes, it is possible that the difference is caused by the cargo proteins (large aggregates versus soluble material) and is not a reflection of different cytosolic budding machinery. the formation ofconstitutive secretory carriers, like isg budding, differs from clathrin-dependent transport at the tgn in that is not associated with the appearance of vesicle coats. there are similarities between the budding of constitutive and regulated vesicles at the molecular level as well: in addition to rab proteins, constitutive traffic has been shown to rely on the activity of dynamin- , protein kinase d, and heterotrimeric g-proteins , factors which may also be associated with isg budding (above). cholesterol depletion has been shown to inhibit both pathways, however it is difficult to know whether the treatment has a direct effect on both pathways, or whether the inhibition of one pathway could inhibit the second via some indirect mechanism. thorough testing ofthis model requires experiments that avoid this problem. the only concrete indication that there are dcg-specific budding factors is that, at least in one reconstituted system, the cytosol re~uirement cannot be substituted for by an extract from hela cells, which do not make dcgs. one possibility that is compatible with both sorting models is that specific cytosolic proteins are involved in establishing or facilitating golgi subdomains in which dcg proteins condense. the structural and functional analysis of the tgn is at a very early stage, but the existence of sub-domains is consistent with the observed nonuniform protein distribution within a single cisterna, as well as with live imaging of heterogenous budding structures. , however, the cisternal dilations involved in isg budding do not necessarily reflect the active maintenance of sub-domains. a simpler view is that the cisterna are passively stretched around the forming granule protein cores, like the bulges in a pancake around blueberries. future work with in vitro systems may provide molecular identification ofactivities that are required for isg budding, but the question ofwhether cells have machinery that is specifically used for this purpose will need to be addressed by other types of analyses. if the budding mechanism is specific to isgs, and not indirectly required for isg production, as in the "sorting by exclusion" model, the prediction is that knocking out individual components would inhibit isg formation without also inhibiting the exit of lysosomal or constitutive proteins from the tgn. depending on cell type, the importance of isgs as a locus of protein sorting may be as important as that of the tgn. sorting at this level involves the budding of vesicles from isg membranes, resulting in the remodeling of membrane and lumenal contents by selective withdrawal (fig. b ). this targeted removal occurs via clathrin-coat recruitment to isgs occurs via the ap- adaptor, wh ich is recruited by membrane proteins in an arf-dependent, bfa-inhibitable step. proteins known to be withdrawn from isgs include the cargo protease furin and the mannose- -phosphate receptor, both of which can interact directly with ap_l. . the mannose-s-phosphate receptor can bind any lysosomal enzymes that may have been incorrectly sorted upon exit from the tgn, and this step therefore leads to selective withdrawal of some lumenal proteins by classical receptor-based sorting. mature dcgs do not support ccv formation, the simplest explanation for which is that isgs become progressivelydepleted ofproteins that act in the recruitment ofap- . consistent with this, myristoylated arf binds to isgs but not mature granules in vitro. recent evidence suggests that the full cohort of arfs and adaptors present on isgs includes arf , and , and ap- and _ . these may all be present on a uniform population of vesicles, or may reflect heterogeneity within isgs. coated vesicles budding from the isg will also withdraw any soluble proteins that randomly partition by diffusion into the vesicle lumen during budding. however, large aggregates of proteins that are condensing in the isg are too large to fit into the buds , and are therefore selectively retained. the efficiency of this separation is increased by the tendency of trafficking imide cells: pathways, mechanisms and regulation nonaggregating proteins to be concentrated at the periphery of the vesicle lumen , as they are excluded from the dense core forming in the center ofthe isg. as a result, the soluble proteins accumulate in a place where they can readily enter the vesicles that are budding from the membrane. these may include proteins that randomly partition at the tgn into budding isgs, but will also include some soluble products of dcg proprotein processing. the best characterized of these is derived from proinsulin, which is processed into and a, b, and c peptides. the first two are disulfide linked, and crystallize to form the granule core. the c peptide is soluble and is largely excluded from the core, and is selectivelywithdrawn. . a collateral consequence of isg maturation is the generation of a set of coated vesicles bearing newly synthesized proteins, some ofwhich have undergone processing by isg-specific enzymes. at least in some cell types, these can deliver their cargo to the plasma membrane, probably via an endosomal intermediate. this has been called "constitutive-like" secretion: constitutive-like in that it is independent ofextracellular stimulation, but with kinetics that are slower than those of true constitutive secretion. in pancreatic~cells, the c peptide that is withdrawn from isgs is secreted via this route . the model that describes the progressive enrichment of granule cargo during isg maturation has been given the name "sorting by retention" and essentially posits that sorting in isgs can be based on a protein's ability to aggregate, rather than depending on specific targeting signals. the concepts are like those of the "sorting by exclusion" model that may apply at the tgn, and the similarity in models may be a reflection of similar molecular mechanisms in vivo. thus, isgs may simply be a functional extension of the tgn, which becomes progressivelyenriched in dcg contents as nonaggregating proteins are actively removed during maturation. thus, there may not be any mechanistic differences between coat mediated sorting at the tgn versus the isgs, though the material that is included in the budding vesicles could change as the compartment matures. alternatively, modification of the coat mediated sorting machinery may be required in order to facilitate sorting from a compartment that is progressively changing. for example, such modifications may be necessary for the trafficking of proteins that are allowed to enter isgs but are not stored in mature dcgs, such as proteases (see "structural maturation of isgs" section) , or for adapting to differences in membrane composition between the tgn and isgs. indirect evidence in support of this possibility comes from the study of the membrane lipid component phospharidyl inosirol-i-phosphare (pi- -p) and its derivatives. in the tgn, these molecules play important modulating roles, including the recruitment of ap-l/clathrin coat proteins for vesicle budding. the levels on pi- -p in the tgn are affected by the activity of pi- kinase, which is stimulated by myristoylated arf -gti~a part of the coat formation machinery. , interestin~r: isgs have been found to contain a pi- -k activity that is not stimulated by arfi-gtp' the tgn has two different pi- kinases (ii and iii) , and it is possible that isgs only recruit one of these. . coat recruitment at the tgn vs. isgs may also be differentially regulated by modification of the vesicle cargo, since the binding of ap-l to the cytoplasmic tails of both furln and the mannose- ~hosphate receptor is stimulated following their phosphorylation by casein kinase ii. , in this regard, a very interesting observation is that newly-budded isgs are rapidly transported to the cell periphery, at least in some cell types, and therefore primarily inhabit a different cellular microenvironment from the tgn. this may be relevant for differential regulation of similar activities at the tgn vs. isgs, for example if receptors in isgs are selectively modified. although the data is not yet conclusive, the emerging view of sorting from isgs is that it is directed by the core elements of a "flexible" ap-l/clathrin dependent sorting mechanism that is differentially controlled at the isgs versus the tgn. the model holds that the sorting events ofisg maturation are not mediated by a unique vesicle trafficking mechanism, but are instead accomplished by pathway-specific modifications ofmachinery that is common to all cell rypes. a similar phenomenon may occur at an earlier stage ofthe pathway, where the coat-independent machinery that drives the formation ofconstitutive carriers from the tgn may be adapted for the budding of isgs, as discussed in the "mechanisms of immature secretory granule (isg) budding" section. this apparent mechanistic conservation may explain the abiliry offibroblast cells, which do not normally make dcgs, to make dense-cored vesicles when expressing heterologous chromogranin genes or vonwillebrand factor. . . however, these observations do not preclude the possibility that specialized dcg-producing cells express proteins that specifically modify parts of the conserved cellular trafficking machinery to enhance dcg synthesis. the cores of newly-budded isgs apftear less electron-opaque than those in mature dcgs, and are also lower in buoyant density, indicating that granule cargo becomes increasingly condensed during granule maturation. this is one reflection of the larger remodeling of protein and lipid constituents during the maturation process, which includes the selective withdrawal of components that are present in im matu re, but not mature, granules . this overall process serves important structural functions. the tighter packing offers increasingly efficient storage, and not simply because more material can be contained in a fixed vesicle volume. protein condensation overcomes an energetic barrier that is posed by a vesicle filled with concentrated soluble macromolecules, which is hyperosmolar when compared to cytosol. maintaining such a vesiclewould require constant pumping ofosmolytes to counter vesicle swelling, an expensive cellular proposition. within dcgs, aggregated proteins are no longer solvated, and are therefore osmotically inert. the progressive condensation durin~maturation parallels, and is likely to be controlled by, changes in the lumenal environment. in neuroendocrine cells, the tgn is acidified to ph . by vacuolar atpases. these are also present in the isg membrane, with the result that the isg continues to acidify - at the same time there is an increase in calcium that , along with other cations, is important for charge neutralization of the largely acidic core proteins. this calcium may be cotransported from the endoplasmic reticulum with calcium-binding dcg cargo proteins, or imported via isg membrane ion exchangers. the ionic changes can trigger changes in dcg protein conformations or interactions . for example, cgb forms horno-oligomers under the conditions found in isgs. . the functional significance is as yet unknown, but these are presumably based on contacts different from those involved in aggregative sorting. one well-established consequence ofisg acidification, in combination with increased ca +, is the activation of proteases that are specifically localized to dcgs. the contents of neuronal and endocrine dcgs are largely synthesized as proteins that are proteolytically processed to generate bioactive peptides, the species that are eventually released during exocytosis. l proteolytic processing involves a variety of enzymes including amino-and carboxypeptidases, and a family ofaspartyl proteases called prohormone convertases. [ ] [ ] [ ] [ ] members of this family are differentially active over a range of proton and calcium concentrations, and may thus act sequentially on their substrates during isg maturation, in a cell type-dependent fashion [davidson, # ;laslop, # ;goodge, # ;. though isgs are considered to be the major compartment of proprotein processing, in some cell types processing may begin in the tgn, and moore and colleagues have begun to resolve the requirements for isg budding from those required for the onset of processing. . . in their cell-free system, the onset of processing precedes budding. both require hydrolyzable gtp, but at two distinct concentrations . this difference suggested a model in which the former requires arf, while the latter depends upon a heterotrimeric g-protein. in addition to generating mature peptides, proprotein processing may drive the physical reorganization of the core, in cases where mature peptides can pack more tightly than the precursors . the best example of this is found in~-cell granules, in which mature insulin but not proinsulin can assemble into hexagonal crysrals, simply because processing relieves a packing constraint . , (fig. ) . the control of assembly via proteolytic processing is strongly reminiscent of mechanisms involved in viral capsid formation. the process ofdcg maturation, which includes the generation of active peptides by proteolytic processing and the condensation of cargo into a densely packed, osmotically inert form, serves to increase the efficiency of the regulated secretory pathway in several ways. first, the condensation ofmaterial allows great quantities of protein to be stored in the vesicles,with the consequence that a small number ofexocytic events can generate a relativelylarge secretory response. second, proteolytic processing in isgs allows the cell to combine multiple dcg peptides into a single proprotein, thereby linking the sorting of these proteins at earlier stages of the pathway. in neuroendocrine isgs, for example, the chromogranin proteins are cleaved into multiple biologically active peptides with different postexocytic functions. , furthermore, limiting the site of proteolytic processing to isgs may provide a failsafe mechanism, ensuring that the active forms of the proteins are only found in a compartment that is under direct control of the regulated secretory pathway, therefore leaving any incorrectly sorted proteins as uncleaved precursors. the remodeling of the membrane attending the budding of clathrin coated vesicles does not simply serve to remove proteins that may have been incorrectly targeted at the tgn. rather, it also underlies differences in the activity ofisgs and mature granules. this was suggested by the observation that isgs and mature granules differ dramatically with regard to exocytosis: whereas mature dcgs undergo efficient exocytic fusion with the plasma membrane in a stimulus-dependent fashion , isgs exhibit an increased tendency to fuse with the plasma membrane in the absence of stimulation. in att cells, unregulated release ofdcgs from isgs proceeds for - hours after isg budding from the tgn. these isgs contain two snares, vamp and synaptota~min iv (syt iv) , which are withdrawn during maturation in a brefeldin a-inhibitable step.i , , during the same period , the maturing granules become responsive to exocytic stimuli, a process also blocked by bfa. that the two phenomena may be linked is suggested by the observation that overexpression ofsyt iv itself decreased the responsiveness of maturing granules to secretory stimuli. l syt iv is thought to act as a negative regulator of calcium-induced exocytosis, and the withdrawal of this inhibitory factor from isgs may foster maturation. a recent study showed that the removal ofvamp from isgs depends upon interactions with ap-l and the coat protein pacs-l, thereby providing genetic confirmation and molecular detail to this model. however, a complication ofthis model is that syt iv is thought to inhibit membrane fusion by forming inactive heterodimers with synaptotagmin i, and the mechanism by which the heterodimers are separated and syt n is selectively removed from the isgs is unknown. another functional characteristic that may, in some cell types, distinguish isgs from dcgs, is that isgs can undergo homotypic fusion, a reaction that has been more extensively characterized in vitro than in vivo. , , the specific function ofthis reaction is not clear. in some systems homotypic-like fusion might allow for the synthesis ofspecialized dcg cores in which the contents are not randomly distributed. in pseudomicrotborax dubius, two kinds of isgs, containing morpholo §ically-distinguishable cargo, fuse during the process ofassembling a complex core structure. more generally, consolidation could potentially define the size of the granules, which in many systems appear to be controlled. disruption of the gene encoding rab d, an exocrine granule-associated small gtpase, resulted in a doubling ofmature granule volume, and one possibility is that rab d acts as a negative regulator of homotypic fusion. at some level, membrane remodeling must account for the difference in the fusogenic behavior of isgs vs. mature granules, and attention has focused on the snares, due to their importance in regulating membrane fusion. isgs from pcl cells contain syntaxin , which must be present on both donor and acceptor membranes for efficient homotypic fusion in vitro.in syntaxin is also present in clathrin-coated vesicles which bud from the isg rnernbrane , consistent with the idea that it is selectively removed during maturation via several likely ap-i binding sites in its cytoplasmic domain. as isg maturation appears to involve the removal of specific factors via the budding of clathrin coated vesicles, it is possible that more thorough analyses of the target proteins and their interacting partners will help to uncover isg-specific machinery that regulates clathrin-dependenr sorting in this compartment. more broadly, the identification of the molecules that define the functional maturity ofdcgs by their presence or absence in the vesicle will provide insights into the nature of organelle identity, a topic that is central to an understanding the general principles of vesicular traffic. finally, recent evidence hints at aspects of granule maturation that have not previously been recognized. functional maturation of secretory granules may extend beyond the period of morphological change, based on the observation that the distribution and fusogenic activity of granules may change with vesicle age. the majority of the work on dcg synthesis has focussed on the sorting of the lumenal content proteins in the tgn and isgs. these studies have, for the most part, supported the nonspecific aggregation-based model for sorting that was proposed by chanat and huttner in . not surprisingly, studies of many granule cargo proteins in multiple systems have revealedsome casesthat are possible exceptions to this general rule, where specificprotein-protein interactions are required for the sorting of a particular protein, as discussed in the "protein sorting into isgs" section of this chapter. overall, the precise requirements for the sorting of any particular protein is likely to be both context (which other granule cargo proteins are being expressed, and in what quantities) and cell type dependent (protein aggregation is sensitive to physiological properties of the lumen, such as calcium concentration and ph, which may vary between cell types), though it is likely that the general principles of aggregation-based sorting apply in all cells that produce dcgs. further analysis of the specific sorting requirements for individual proteins may lead to a greater knowledge of the details ofaggregation-based sorting, but the next leap forward in our understanding of they system will more likely come from experimental approaches that expand beyond the level of individual proteins and consider the dcg synthesis pathway more broadly. for example, cargo protein aggregation is known to be sensitive to lumenal calcium concentration and ph levels, but the mechanisms that control these physiologic parameters have not been elucidated. secondly, how are granule cargo proteins sorted to the same destination as other proteins that are essential for dcg function, such as membrane fusion machinery? the answers to these questions may be learned from studies in genetic systems, such as c. elegans, drosophila, and ciliated protozoans, which offer promising avenues for further experimentation. these orf.anisms have recently been used to identify elements of the regulated exocytosis machinery.' , , and similar studies could uncover genes that are involved in vesicle synthesis. another major gap in our understanding of the granule synthesis pathway is the extent of its functional relationship with other branches of the secretory pathway. two decades ago, dgc formation was considered to be one of a small number of distinct, post-tgn secretory pathways. this carried the assumption that vesicles bound for constitutive or regulated exocytosis, or toward lysosomes, would rely on distinct mechanisms for their biogenesis. that view now seems, paradoxically, to have been both too simple and too complex. it was too simple because post tgn traffic cannot be neatly divided into three branches: for example, what was called the constitutive pathway may in fact consist of multiple branches. . this was initially established for apical vs. basolateral targeting in polarized epithelia, but there is evidence in other cell types as well. furthermore, the mechanisms for dcg formation are not easily separated from those that are directly involved in other pathways, implying that the secretory pathway cannot be divided into distinct, independently functioning branches. for example, ap- dependent sorting of proteins to the lysosomal pathway is associated with isg maturation, and may also be part of the driving force for the "sorting by exclusion" of dcg contents in the tgn (see "protein sorting in isgs" section). at the same time, the fact that the dcg synthesis pathway and lysosomal pathway use some of the same machinery argues that the historical view of distinct mechanisms was too complex. similarly, the historical view that constitutive and regulated secretory carriers are fundamentally different may also be incorrect. the idea that constitutive traffic is based on small vesiclesis being modified by the recognition that tgn tubularion may be as, if not more, important in this pathway, at least in some cell types (referencesin ref ) . thus coat-mediated vesicle formation may be the exception rather than the rule for anterograde traffic to the plasma membrane, and the formation of constitutive and regulated secretory carriers may share common mechanisms . in the extreme, the mechanisms may be mostly conserved, and the end products depend upon the behavior of the vesicle cargo. addressing these issues directly will require identification of factors required for isg budding and tgn rubulation. while progress has recently been made toward the latter, details regarding the former are extremely limited. success in this may depend on further exploitation of cell-free systems, strengthened by development of new genetic models. secretion and cell-surface growth are blocked in a temperaturesensitive mutant of saccharomyces cerevisiae secretory granule exocytosis the synaptic vesicle cycle snares and snare regulators in membrane fusion and exocytosis calcium sensors in regulated exocytosis principles of exoeytosis and membrane fusion regulated exocytosis and snare function (review) the trans-most cisternae of the golgi complex: a compartment for sorting of secretory and plasma membrane proteins sorting within the regulated secretory pathway occurs in the trans-golgi network sorting of progeny coronavirus from condensed secretory proteins at the exit from the trans-golgi network of att cells intracellular aspects of the process of protein synthesis pathways of protein secretion in eukaryotes biogenesis of secretory granules in the trans-golgi network of neuroendocrine and endocrine cells constitutive and regulated secretion of proteins biogenesis of constitutive secretory vesicles, secretory granules and synaptic vesicles protein hormone storage in secretory granules: mechanisms for concentration and sorting lumenal protein multimerization in the distal secretory pathway/secretory granules secretory granule biogenesis: rafting to the snare sorting and storage during secretory granule biogenesis: looking backward and looking forward formation of secretion granules in the golgi apparatus of pancreatic acinar cells of the rat trans-golgi network (tgn) of different cell rypes: three-dimensional structural characteristics and variabiliry transport of casein submicelles and formation of secretion granules in the golgi apparatus of epithelial cells of the lactating mammary gland of the rat modulation of the golgi apparatus in stimulated and nonstimulated prolactin cells of female rats formation of secretory granules in the golgi apparatus of prolactin cells in the rat pituitary gland: a stereoscopic study three-dimensional electron microscopy: structure of the golgi apparatus formation of secretion granules in the golgi apparatus of plasma cells in the rat nonconverted, amino acid analog-modified proinsulin stays in a golgi-derived clarhrin-coated membrane compartment biogenesis of secretory granules. implications arising from the immature secretory granule in the regulated pathway of secretion intermediates in the constitutive and regulated secretory pathways released in vitro from semi-intact cells tooze jet ai. characterization of the immature secretory granule, an intermediate in granule biogenesis structural requirements for targeting of surfactant protein b (sp-b) to secretory granules in vitro and in vivo procathepsin l self-association as a mechanism for selective secretion drosophila caps is an essential gene that regulates dense-core vesicle release and synaptic vesicle fusion ca( +)-dependenr activator protein for secretion is critical for the fusion of dense-core vesicles with the membrane in calf adrenal chromaffin cells a novel kd brain cyrosolic protein reconstitutes ca + -regulated secretion in permeable neuroendocrine cells the synaptic vesicle cycle: exocytosis and endocytosis in drosophila and c e egans isolation and ultrastructural characterization of secretory mutants of tetrahymena thermophila protein secretion in tetrahymena thermophila: characterization of the secretory mutant strain sb genetic characterization of tetrahymena thermophila mutants unable to secrete capsules maturation of dense core granules in wild rype and mutant tetrahymena thermophila mutational analysis of regulated exocytosis in tetrahymena mutations affecting the trichocysts in paramecium aurelia. i morphology and description of the mutants evidence for defects in membrane traffic in paramecium secretory mutants unable to produce functional storage granules overall lack of regulated secretion in a pci variant cell clone a pci variant lacking regulated secretory organelles: aberrant protein targeting and evidence for a factor inhibiting neuroendocrine gene expression analysis of a mutant exhibiting conditional sorting ro dense core secretory granules in tetrahymena thermophila biogenesis of von willebrand factor-containing organelles in heterologous rransfected cv-i cells induction of specific storage organelles by von willebrand facror propolypeptide chromogranin a, an «on/off. switch controlling dense-core secretory granule biogenesis chromogranin b-induced secretory granule biogenesis: comparison with the similar role of chromogranin a molecular sorting in the secretory pathway protein secretion: puzzling receptors proteins synthesized and secreted during rat pancreatic development phasic release of newly synthesized secretory proteins in the unstimulated rat exocrine pancreas ph-and ca +-dependent aggregation properry of secretory vesicle matrix proteins and the potential role of chromogranins a and b in secretory vesicle biogenesis milieu-induced, selective aggregation of regulated secretory proteins in the trans-golgi network signal-mediated sorting to the regulated pathway of protein secretion +)-induced conformational change and aggregation of chromogranin b. comparison with chromogranin a and implication in secretory vesicle biogenesis secretory granule content proteins and the luminal domains of granule membrane prot eins aggregate in vitro at mildly acidic ph the granin (chromagranin/secretogranin) family the chromogranins: th eir roles in secretion from neuroendocrine cells and as markers for neuroendocrine neoplasia chro mogranin b (secretogranin i) promotes sorting to the regulated secretory pathway of processing intermediates derived from a peptide hormone precursor gorr suo aggregation chaperones enhance aggregation and storage of secretory proteins in endocrine cells in vitro aggregation of the regulated secretory protein chromogranin a reconstitution in vitro of the ph-dependent aggregation of pancreatic zymogens en route to the secretory granule: implication of gp- exocrine granule specific packaging signals are present in the polypeptide moiery of the pancreatic granule membrane protein gp and in amylase: implications for protein targeting to secretory granules identification of a chromogranin a domain that mediates binding to secretogranin iii and targeting to secretory granules in pituitary cells and pancreatic beta-cells identification of a novel sorting determinant for the regulated pathway in the secretory protein chromogranin a sorting of three secretory proteins to distinct secretory granules in acidophilic cells of cow anterior pituitary multiple neuropept ides derived from a common precursor are differentially packaged and transported structure et ultrastructure de lacrymaria olor (o.f.m. ) cocrystallization of proinsulin and insulin vayssie let ai. a large multigenic family codes for the polypeptides of the crystalline trichocyst matrix in paramecium protein secretion in tetrahymena thermophila. characterization of the major proteinaceous secretory proteins growth and form of secretory granules involves stepwise assembly but not differential sorting of a family of secretory proteins in paramecium quality control in the endoplasmic reticulum proteolytic processing and ca +-binding activity of dense-core vesicle polypeptides in tetrahymena identification and characterization of a novel secretory granule calcium-binding protein from the early branching eukaryote giardia lamblia myosin va facilitates the distribution of secretory granules in the f-actin rich cortex of pc cells chromogranin b (secretogranin i) a secretory protein of the regulated pathway, is also present in a tightly membrane-associated form in pc cells the disulfide-bonded loop of chromogranin b mediates membrane binding and directs sorting from the trans-golgi network to secretory granules the disulfide-bonded loop of chrornogranins, which is essential for sorting to secretory granules, mediates hornodimerizarion identification of the sorting signal motif within pro-opiomelanocortin for the regulated secretory pathway carboxypeptidase e, a prohormone sorting receptor, is anchored to secretory granules via a c-terminal transmembrane insertion carboxypeptidase e is a re~lated secretory pathway sorting receptor: genetic obliteration leads to endocrine disorders in cpe at mice identification of a novel prohormone sorting signal-binding site on carboxypeptidase e, a regulated secretory pathway-sorting receptor depletion of carboxypeptidase e, a regulated secretory pathway sorting receptor, causes misrouting and constitut ive secretion of proinsulin and proenkephalin, but not chromogranin a proinsulin targeting to the regulated pathway is not impaired in carboxypeptidase e-deficient cpefat/cpefat mice carboxypeptidase e, a peripheral membrane protein implicated in the targeting of hormones to secretory granules, coaggregates with granule content proteins at acidic ph secretogranin iii binds to cholesterol in the secretory granule membrane as an adapter for chromogranin a sorting of carboxypeptidase e to the regulated secretory pathway requires interaction of its transmembrane domain with lipid rafts cholesterol is required for the formation of regulated and constitutive secretory vesicles from the trans-golgi network selective delivery of secretory cargo in golgi-derived carriers of nonepithelial cells reduction of the disulfide bond of chromogranin b (secretogranin i) in the trans-golgi network causes its missorting to the constitutive secretory pathways disruption of disulfide bonds exhibits differential effects on trafficking of regulated secretory proteins n-and c-terminal domains direct cell type-specific sorting of chromogranin a to secretory granules molecular cloning of phogrin, a protein-ryrosine phosphatase homologue localized to insulin secretory granule membranes the lumenal domain of the integral membrane protein phogrin mediates targeting to secretory granules differential trafficking of soluble and integral membrane secretory granule-associated proteins identification of routing determinants in the cytosolic domain of a secretory granule-associated integral membrane protein biogenesis of weibel-palade bodies mutational analysis of vamp domains implicated in ca +-induced insulin exocytosis targeting of p-selectin to two regulated secretory organelles in pc cells p-selectin targeting to secretory iysosomes of rbl- h cells regulated secretion of conventionallysosomes angelica ec the molecular machinery for the biogenesis of lysosome-related organelles: lessons from hermansky-pudlak syndrome p-selectin, a granule membrane protein of platelets and endothelial cells, follows the regulated secretory pathway in att- cells a complex web of signal-dependent trafficking underlies the triorganellar distribution of p-selectin in neuroendocrine pc cells ap- adaptor functions in targeting p-selectin to secretory granules in endothelial cells coat proteins: shaping membrane transport selective and signal-dependent recruitment of membrane proteins to secretory granules formed by heterologously expressed von willebrand factor weibel-palade bodies recruit rab by a content-driven , maturation-dependent mechanism that is independent of cell type assembly of multimeric von willebrand factor directs sorting of p-selectin how glycosylphosphatidylinositol-anchored membrane proteins are made interaction of syncollin with gp- , the major membrane protein of pancreatic zymogen granules, and association with lipid microdomains a submembranous matrix of proteoglycans on zymogen granule membranes is involved in granule formation in rat pancreatic acinar cells the major zymogen granule membrane protein gp- in the rat pancreas is not involved in granule formation loss of the zymogen granule protein syncollin affects pancreatic protein synthesis and transport but not secretion recycling of raft-associated prohormone sorting receptor carboxypeptidase e requires interaction with arf requirement for gtp hydrolysis in the formation of secretory vesicles the use of permeabilized cells to investigate secretory granule biogenesis exocytic transport vesicles generated in vitro from the trans-golgi network carry secretory and plasma membrane proteins reconstitution of constitutive secretion using semi-intact cells: regulation by gtp but not calcium phospholipase d stimulates release of nascent secretory vesicles from the trans-golgi network secretory vesicle budding from the trans-golgi network is mediated by phosphatidic acid levels implications of lipid microdomains for membrane curvature, budding and fission role of dynamin in the formation of transport vesicles from the trans-golgi network a role for phosphatidylinositol transfer protein in secretory vesicle formation multiple trimeric g-proteins on the trans-golgi network exert stimulatory and inhibitory effects on secretory vesicle formation t rimeric g-proteins of the trans-golgi network are involved in the formation of constitutive secretory vesicles and immature secretory granules formation of nascent secretory vesicles from the trans-golgi network of endocrine cells is inhibited by tytosine kinase and phosphatase inhibitors cooperativity of phosphatidylinositol transfer protein and phospholipase d in secretory vesicle formation from the tgn-phosphoinositides as a common denominator? role of diacylglycerol in pkd recruitment to the tgn and protein transport to the plasma membrane functional diversity in the dynamin family protein kinase d regulates the fission of cell surface destined transport carriers from the trans-golgi network biogenesis of processing-competent secretory organelles in vitro two independent targeting signals in the cytoplasmic domain determine trans-golgi network localization and endosomal trafficking of the proprorein convertase furin direct and gtp-dependent interaction of adp-ribosylation factor with clathrin adaptor protein ap-l on immature secretory granules the ap-l adaptor complex binds to immature secretory granules from pc cells, arid is regulated by adp-ribosylation factor interaction of furin in immature secretory granules from neuroendocrine cells with the ap- adaptor complex is modulated by casein kinase ii phosphorylation mannose -phosphate receptors are sorted from immature secretory granules via adaptor protein ap- , clathrin, and syntaxin -positive vesicles differential distribution of mannose-e-phosphare receptors and furin in immature secretory granules differential sorting of lysosomal enzymes out of the regulated secretory pathway in pancreatic beta cells site-specific cross-linking reveals a differential direct interaction of class , , and adp-ribosylation factors with adaptor protein complexes and proinsulin endoproteolysis confers enhanced targeting of processed insulin to the regulated secretory pathway the role of assembly in insulin's biosynthesis protein discharge from immature secretory granules displays both regulated and constitutive characteristics protein targeting via the "constitutive-like" secretory pathway in isolated pancreatic islets: passive sorting in the immature granule compartment protein traffic from the secretory pathway to the endosomal system in pancreatic beta-cells phospharidyiinosirol phosphate regulates targeting of clathrin adaptor ap-l complexes to the golgi arf mediates recruitment of ptdlns- - h kinase-beta and stimulates synthesis of ptdlns( , )p on the golgi complex type i phosphatidylinositol -phosphate -kinase directly interacts with adp-ribosylation factor and is responsible for phosphatidylinositol , -bisphosphate synthesis in the golgi compartment regulation and recruitment of phosphatidylinositol -kinase on immature secretory granules is independent of adp-ribosylation factor dynamics of immature secretory granules: role of eytoskeletal elements during transport, cortical restriction, and f-actin-dependent tethering mechanisms of ph regulation in the regulated secretory pathway endoproteolytic cleavage is mediated by a vacuolar atpase that generates an acidic ph in the trans-golgi network ph-independent and -dependent cleavage of proinsulin in the same secretory vesicle biosynthesis and secretion of pituitary hormones: dynamics and regulation inhibition of the vacuolar h+-atpase perturbs the transport , sorting, processing and release of regulated secretory proteins low-molecular-weight constituents of isolated insulin-secretory granules . bivalent cations , adenine nucleotides and inorganic phosphate endoplasmic reticulum ca + is important for the proteolytic processing and intracellular transport of proinsulin in the pancreatic beta-cell effects of ph and ca + on heterodimer and heteroretramer formation by chromogranin a and chromogranin b prohormone and proneuropeptide processing calcium-and ph-dependent aggregation and membrane association of the precursor of the prohormone convertase pc ionic milieu controls the compartment-specific activation of pro-opiomelanocortin processing in att- cells molecular and cellular regulation of prohormone processing the proprotein convertases furin and prohormone convertase / are major convertases in the processing of mouse pro-growth hormone-releasing hormone an antibody specific for an endoproteolytic cleavage site provides evidence that pro-opiomelanocortin is packaged into secretory granules in att cells before its cleavage lau et ai. biogenesis of regulated exocytotic carriers in neuroendocrine cells comparison of secondary structures of insulin and proinsulin by ftir nmr and photo-cidnp studies of human proinsulin and prohormone processing intermediates with application to endopeptidase recognition the role of proteolytic processing in the morphogenesis of virus particles peptides derived from the granins (chromograninslsecretogranins) distinct molecular events during secretory granule biogenesis revealed by sensitivities to brefeldin a homotypic fusion of immature secretory granules during maturation requires syntaxin synaptic function modulated by changes in the ratio of synaptoragmin i and iv ap-l recruitment to vamp is modulated by phosphorylationdependent binding of pacs- cytoplasmic granule formation in mouse pancreatic acinar cells. evidence for formation of immature granules (condensing vacuoles) by aggregation and fusion of progranules of unit size, and for reductions in membrane surface area and immature granule volume during granule maturation homotypic fusion of immature secretory granules during maturation in a cell-free assay immunological characterization of trichocyst proteins in the ciliate pseudomicrothorax dubius lysosome function in the regulation of the secretory process in cells of the anterior pituitary gland rab d is not required for exocrine exocytosis but for maintenance of normally sized secretory granules functional and spatial segregation of secretory vesicle pools according to vesicle age genetic approach to regulated exocyrosis using functional complementation in paramecium: identification of the nd gene required for membrane fusion novel secretory vesicle proteins essential for membrane fusion display extracellular-marrix domains post-golgi biosynthetic trafficking multicolour imaging of post-golgi sorting and trafficking in live cells gaip participates in budding of membrane carriers at the rrans-golgi network macro-and micro-domains in the endocrine pancreas key: cord- -qohf pxp authors: loa, chien chang; wu, ching ching; lin, tsang long title: recombinant turkey coronavirus nucleocapsid protein expressed in escherichia coli date: - - journal: animal coronaviruses doi: . / - - - - _ sha: doc_id: cord_uid: qohf pxp expression and purification of turkey coronavirus (tcov) nucleocapsid (n) protein from a prokaryotic expression system as histidine-tagged fusion protein are presented in this chapter. expression of histidine-tagged fusion n protein with a molecular mass of kda is induced with isopropyl β-d- -thiogalactopyranoside (iptg). the expressed n protein inclusion body is extracted and purified by chromatography on nickel-agarose column to near homogeneity. the protein recovery can be mg from ml of bacterial culture. the purified n protein is a superior source of tcov antigen for antibody-capture elisa for detection of antibodies to tcov. turkey coronavirus (tcov) is the cause of an acute and highly contagious enteric disease affecting turkeys of all ages. the disease is severe in -to -week-old turkey poults [ ] . turkey fl ocks that recover from natural or experimental coronaviral enteritis may develop lifelong immunity [ ] . tcov has been recognized as an important pathogen of young turkeys. tcov infection causes signifi cant economic losses in the turkey industry due to poor feed conversion and uneven growth. outbreaks of tcov enteritis in turkey poults remain as a threat to the turkey industry. in order to rapidly diagnose and effectively control turkey coronaviral enteritis , development of an antibody-capture enzymelinked immunosorbent assay ( elisa ) for detecting antibodies to tcov is essential. development of elisa for detection of tcov infection requires large amounts of tcov antigen . molecular cloning and expression of tcov n protein were carried out for preparation of large quantities of highly purifi ed viral proteins. coronavirus is an enveloped and positive-stranded rna virus that possesses three major structural proteins including a predominant phosphorylated nucleocapsid (n) protein, peplomeric glycoprotein, and spike (s) protein that makes up the large surface projections of the virion, and membrane protein (m) [ , ] . the n protein is abundantly produced in coronavirus-infected cells and is highly immunogenic. the n protein binds to the viral genomic rna and composes the structural feature of helical nucleocapsid. the n protein is a preferred choice for developing a groupspecifi c serologic assay because of highly conserved sequence and antigenicity. the nucleocapsid protein s of various rna viruses, such as mumps, rabies, vesicular stomatitis, measles, newcastle disease, and infectious bronchitis (ibv) viruses, have been used as the coating antigens in diagnostic elisa [ - ] . prokaryotic expression is an economic and convenient system to prepare large amount of pure recombinant protein. in addition, the antigenic integrity of n protein expressed in prokaryotic system is expected to be maintained due to the lack of glycosylation. this chapter describes expression and purifi cation of tcov n protein with a prokaryotic system for preparation of a large quantity of highly purifi ed viral protein, which can be used as coating antigen for antibody-capture elisa for serologic diagnosis of tcov infection [ , ] . . add μl of the above rt mixture to the pcr amplifi cation reaction ( μl) with primers nf and nr. a mix of taq and pfu at : is recommended to maintain pcr fi delity (table ) . . the nuclease reagent benzonase is added with μl ( units) for every ml of bugbuster reagent used. . ampicillin antibiotic marker is on the expression vector ptriex and chloramphenicol antibiotic marker is on plasmid placi in the expression host strain tuner cells. carbenicillin is recommended to be in place of ampicillin for better stability for ph changes throughout the bacterial cultures. . the binding capacity of . ml of his-bind resin is mg of target protein per column. as for any affi nity chromatography, the best purity of target protein is achieved when the amount of protein extract is near the binding capacity. . the purpose of m urea is to improve resolution of the sticky inclusion bodies. the presence of m urea does not affect binding of his-bind resin to target n protein. . the suggested ratio of rnapure reagent to sample is : . excess amount of rnapure reagent has no negative impact. the lower ratio ( : ) in this step is intended to obtain higher concentration of viral rna in the fi nal supernatants. if the upper aqueous phase after centrifugation at step is more than half of the total volume, there is not enough rnapure reagent added. the appropriate reagent amount may be adjusted. chloroform is applied at μl for every milliliter of lysate. . the sample mixture with chloroform at this step can be stored at − °c or even lower temperature before proceeding to the next step. . optional: inverting the tube for - min for air-drying of rna pellet is a helpful tip to completely remove any residual ethanol that may interfere the following rt reaction. . it is critical to make sure that the jellylike rna pellet is completely dissolved into solution by repeat pipetting. . the synthesized cdna in the rt reaction can be stored at − °c or even lower temperature until used. . pcr product may be purifi ed. the vector must be gel purifi ed due to the long digested fragment size above bp. . the molar ratio between vector and insert is suggested at : to : . the volumes in this step are illustrated for initial exploration. the concentration of digested vector and insert can be estimated by od or agarose gel electrophoresis with known amount of dna of similar size in adjacent wells. the ligation reaction mixture can be stored at °c until used for transformation or at − °c for longer term. . after plating, the leftover transformation mix can be stored at °c for further plating in the following days at different amount if needed. . the starter culture can be prepared from a fresh colony on a plate or directly from a glycerol storage stock. an od around . represents a culture at log phase when the cells are at the best condition to expand and for protein expression. . this usually takes about - h to reach the od range. the higher the od of starter culture in the previous step, the shorter the time to reach this od range. . the centrifuge tubes should be weighed before and after collection of cell pellet for estimation of wet pellet amount and the volume of bugbuster to be applied in the next step. frozen storage of cell pellets may improve the extraction effi ciency of bugbuster reagents through the freeze/thaw cycle. . it is important to completely resuspend the cell pellets for the best results of bugbuster extraction. higher volume of bugbuster reagent does not have adverse effect. roughly - ml of bugbuster reagent should be enough for cell pellets collected from a ml of culture. bugbuster reagent can be added directly to frozen cell pellets. there is no need to wait for the temperature to return to room temperature. protease inhibitors may be added at this step but usually not necessary. . it is critical but somewhat diffi cult to completely dissolve the sticky inclusion bodies. repeat pipetting up and down until the protein solution is homogeneous. any undissolved particles will clot the his-bind column and affect the purifi cation process. it is advisable to centrifuge the dissolved inclusion body protein solution at - , × g for - min at °c for clarifi cation before application to the column. . eluate may be collected in fractions such as . or ml each fraction. . the presence of m urea is compatible with the protein assay reagent. the assay range can be adjusted for protein concentrations from low μg/ml to mg/ml with different assay format. the protein concentration of the target n protein eluate as obtained following this process is about - mg/ml. the presence of m urea has no adverse effect on plate coating for elisa performance. given the coating concentration of n protein at μg/ml, the eluate is usually diluted in coating buffer for at least : to reduce the urea content to less than mm and, subsequently, further diminish any possible effect on elisa performance. accordingly, the purifi ed n protein eluate can be directly applied to the elisa assay for detection of antibodies to tcov. coronaviral enteritis of turkeys (blue comb disease) immunity to transmissible coronaviral enteritis of turkeys (blue comb) identifi cation of the structural proteins of turkey enteric coronavirus coronavirus immunogens immunoglobulin class and immunoglobulin g subclass enzyme-linked immunosorbent assays compared with microneutralisation assay for sero-diagnosis of mumps infection and determination of immunity rabies diagnostic reagents prepared from a rabies n gene recombinant expressed in baculovirus baculovirus expression of the nucleocapsid gene of measles virus and utility of the recombinant protein in diagnostic enzyme immunoassays immunological characterization of the vsv nucleocapsid (n) protein expressed by recombinant baculovirus in spodoptera exigua larva: use in differential diagnosis between vaccinated and infected animals a diagnostic immunoassay for newcastle disease virus based on the nucleocapsid protein expressed by a recombinant baculovirus recombinant nucleocapsid protein is potentially an inexpensive, effective serodiagnostic reagent for infectious bronchitis virus expression and purifi cation of turkey coronavirus nucleocapsid protein in escheria coli recombinant nucleocapsid proteinbased enzyme-linked immunosorbent assay for detection of antibody to turkey coronavirus the protocol "recombinant turkey coronavirus nucleocapsid protein expressed in escherichia coli " detailed in this chapter had been successfully carried out in the authors' studies on characterization and immunology of turkey coronaviral enteritis . those studies were in part fi nancially supported by usda, north carolina poultry federation, and/or indiana department of agriculture and technically assisted by drs. tom brien and david hermes, mr. tom hooper, and ms. donna schrader for clinical and diagnostic investigation, virus isolation and propagation, and animal experimentation. key: cord- -ztp w yh authors: land, walter gottlieb title: cell-autonomous (cell-intrinsic) stress responses date: - - journal: damage-associated molecular patterns in human diseases doi: . / - - - - _ sha: doc_id: cord_uid: ztp w yh in this chapter, the role of cell-intrinsic stress responses is examined which include autophagic processes, the oxidative stress response, the heat shock response, the unfolded proteins response, and the dna damage response. autophagy (macroautophagy, microautophagy, and chaperone-mediated autophagy) is a self-digestive process in response to environmental stress to eukaryotic cells, by which cytoplasmic components are delivered to the lysosome for recycling and degradation. the oxidative stress response is directed against any oxidative stress and is mediated by antioxidative defense systems including antioxidant enzymes such as superoxide dismutase, detoxifying enzymes such as glutathione peroxidase, and energy-dependent efflux pumps. the heat shock response is induced upon exposure of cells to any stress condition and characterized by emission of heat shock proteins which operate as damps to maintain and restore homeostasis. the unfolded protein response is induced by any stress of the endoplasmic reticulum that is perceived by three sensor molecules. under remediable endoplasmic reticulum stress conditions, the sensors trigger signalling pathways to resolve this stress. however, in severe irremediable endoplasmic reticulum stress, the unfolded protein response may lead to pro-inflammatory and pro-apoptotic responses resulting in regulated cell death. finally, the dna damage response is induced by any dna damage that occurs in a variety of exogenous and endogenous conditions. when successful, this stress response leads to dna repair and is associated with the emission of various damps which contribute to restoration of homeostasis. when unsuccessful, the dna damage response, like the unsuccessful unfolded protein response, can result in regulated cell death, either in form of apoptosis or necrosis. together, the ultimate goal of all the stress responses is to maintain cellular homeostasis and ensure cell integrity. when they fail, the incidence of regulated cell death is frequently observed. as comprehensively described in part ii, prms are specifically involved in the recognition of mamps and damps. as will be discussed in part vi, each of these recognition receptors can trigger distinct signalling cascades in innate immune cells that modify their gene expression to create and execute efferent innate immune responses that involve ( ) production of inflammatory mediator substances such as cytokines and chemokines, ( ) phagocytosis, and ( ) cytotoxicity, as well as, as described in part viii, may elicit and shape antigen-specific adaptive immune responses. beyond this well-characterized mamp/damp engagement of prms leading to a variety of downstream efferent cellular and humoral responses, the innate immune defense program also depends on cell-autonomous, that is, cell-intrinsic, responses which counteract any stressful insult [ ] . constitutive cell-autonomous immunity mobilizes pre-existing molecules and processes in order to primarily and quickly defend the cell and the host against infectious and sterile injury. hence it can be considered as the very first line of innate immune defense. here, the role of constitutive cell-autonomous responses will be examined, whose involvement in the innate immune defense to stress and injury has only been appreciated within the last few years. the focus of this brief overview will be mainly directed toward cellular stress responses. the term autophagy comes from the greek words "phagy" meaning eat and "auto" meaning self. autophagy is an evolutionarily highly conserved self-digestive process in response to environmental stress to eukaryotic cells, by which cytoplasmic components such as defective/damaged or redundant organelles or protein aggregates are delivered to the lysosome for recycling and degradation. there is convincing evidence indicating that activation of the autophagic process is promoted by mamps and/or damps [ , ] . in more simple words, autophagy is a classical cellprotective and cell-autonomous process of the innate immune system aimed at maintaining and restoring homeostasis at both the cellular (cell-intrinsic) and organismal (cell-extrinsic) level [ ] . although autophagy was initially identified in mammals, a significant breakthrough in our understanding of how autophagy is controlled came from the analysis in the genetically tractable yeast system. pioneering work from ohsumi's group showed that the morphology of autophagy in yeast was similar to that documented in mammals [ ] . (as known, ohsumi received the nobel prize in physiology or medicine .) in fact, the discovery of the autophagyrelated genes in yeast has significantly advanced the understanding of the molecular mechanisms participating in autophagy and the genes involved in regulating the autophagic pathway. many yeast genes have mammalian homologues, confirming that the basic machinery for autophagy has been evolutionarily conserved along the eukaryotic phylum [ ] [ ] [ ] [ ] . notably, a panel of leading experts in the field of autophagy has recently published a new definition of several autophagy-related terms based on specific biochemical features [ ] . accordingly, in the following, three types of autophagy are briefly sketched including macroautophagy, microautophagy, and, in mammals, chaperone-mediated autophagy. each of them fulfils very specific tasks in intracellular degradation. there is general agreement on two main features that characterize bona fide, functional autophagic responses, irrespective of type: ( ) they involve cytoplasmic material; and ( ) they culminate with (and strictly depend on) lysosomal degradation [ ] . thus, although autophagy substrates can be endogenous such as damaged cellular organelles or exogenous such as viruses or bacteria escaping phagosomes, autophagy acts on entities that are freely accessible to cytosolic proteins. this property is essential in order to distinguish between autophagic responses and branches of vesicular trafficking that originate at the plasma membrane, which also culminates in lysosomal degradation. such endocytic processes include phagocytosis, receptor-mediated endocytosis, and macropinocytosis, that is, processes which will be dealt with in part vi, sect. . . of note, however, some forms of autophagy and the endocytic pathway interact at multiple levels, and the molecular machinery responsible for the fusion of late endosomes (also known as mvbs) or autophagosomes with lysosomes is essentially the same [ ] . as stressed [ ] , the strict dependency of autophagic responses on lysosomal activity is necessary to discriminate them from other catabolic pathways that also involve cytoplasmic material, such as proteasomal degradation [ ] . thus, the s proteasome (box . ) degrades a large number of misfolded cytoplasmic proteins that have been ubiquitinated (for (poly)ubiquitination, see box . ) as well as properly folded proteins that expose specific degradation signals, such as the socalled n-degrons [ ] . on the other hand, the proteasome system shares some substrates with different forms of autophagy whereby these two catabolic pathways differ drastically in their final products. thus, proteasomal degradation results in short peptides that are not necessarily degraded further but may flow into additional processes including but not limited to antigen presentation/cross-presentation at the plasma membrane, thereby generating mhc-ii and mhc-i epitopes (compare part viii, chap. ). by contrast, lysosomal proteases fully catabolize polypeptides to their constituting amino acids which eventually become available for metabolic reactions or repair processes. together, as summarized [ ] , bona fide functional autophagic responses navigate cytoplasmic material of endogenous or exogenous origin to degradation within lysosomes (or late endosomes, in specific cases). the binding of many ubiquitin molecules to the same target protein. in its simplest form, ubiquitin can be attached to the target protein as a single moiety resulting in monoubiquitination. ubiquitin itself can be ubiquitinated, resulting in the formation of ubiquitin chains attached to the target protein: polyubiquitination. polyubiquitination of proteins is the triggering signal that leads to subsequent degradation of the protein in the proteasome. ligases play a central role in polyubiquitination. ligases are enzymes that catalyze the synthesis of polyubiquitin chains. ubiquitin conjugation requires typically box the proteasome is a common complex for all living cells, needed to recycle and eliminate unwanted proteins. in analogy, it resembles a chaff-cutter. this molecular machine provides a pathway that is involved in many cellular levels such as protein degradation, antigen processing, cell cycle, apoptosis, and dna repair. the s proteasome that is present in the cytoplasm and nucleus is usually formed by one s proteasome complex and two s proteasome complexes, which are composed of proteases and structural units. the s proteasome is a giant protease responsible for the regulated degradation of polyubiquitylated proteins (see box . ) . it consists of at least distinct subunits and is arranged into two modules: core particle containing catalytic sites and regulatory particles. the cylinder-shaped proteolytic core is the s core particle, which is capped at one or both ends by s regulatory particles. further reading: wehmer m, sakata e. recent advances in the structural biology of the s proteasome. int j biochem cell biol ; : - . basically, the term macroautophagy is often used when describing autophagy in general. the phenomenon is characterized by its typical morphological features which involve dedicated vesicles that can occupy a considerable part of the cytoplasm. typically, macroautophagy is one type of autophagic processes in which the substrates are sequestered within cytosolic double-membrane vesicles termed autophagosomes. the substrates of macroautophagy include superfluous and damaged organelles, cytosolic proteins, and invasive microbes. mechanism of formation and regulation of macroautophagy are very complex and complicated processes that are outlined here in a considerably simplified way. macroautophagy involves the sequestration of cytoplasm via a double-membrane intermediate structure termed the phagophore which matures into an autophagosome; the latter compartment fuses with a lysosome allowing degradation and recycling of the cargo [ ] . in more detail, the process begins with the formation of a membrane of unknown origin, the initial phagophore or isolation membrane. the phagophore then expands, surrounds proteins or organelles, sequesters cytoplasm, and, on completion, develops into a large double-membrane transport vesicle, the autophagosome. subsequently, the autophagosome fuses with a lysosome containing acid hydrolases and releases its contents into the lytic acid hydrolases-containing compartment as part of single-membrane vesicles, termed autophagic bodies. the fused compartment where the autophagic body and its contents are degraded is called an autophagolysosome or autolysosome ( fig. . ) . notably, the process of phagophore expansion three classes of enzymes. e (ubiquitin-activating enzyme) hydrolyzes atp and forms a thioester-linked complex between itself and ubiquitin. e (ubiquitin-conjugating enzyme) receives ubiquitin from e and forms a similar thioester-linked intermediate with ubiquitin. e (ubiquitin ligase) finally binds both the e and a substrate and catalyzes the transfer of ubiquitin to the substrate. ubiquitin itself is often a substrate for further ubiquitylation, which results in the formation of so-called polyubiquitin chains. ubiquitin has seven lysine residues, and depending on the lysine residue used for ubiquitin-ubiquitin chain formation, the polyubiquitin chain can signal different functions. proteins modified by lysine- (k )-or lysine- (k )-linked chains are usually degraded by the proteasome. in contrast, modification by k -linked chains or by a single ubiquitin moiety (monoubiquitylation) seems to trigger other functions, e.g., protein sorting, gene expression, and dna repair. further reading: callis j. the ubiquitination machinery of the ubiquitin system. arabidopsis book ; :e . provides tremendous flexibility and capacity with regard to cargo, allowing entire organelles to be deleted via autophagy; however, this flexibility also means that autophagy must be tightly controlled in order to prevent inappropriate degradation, which could lead to cell death (for relevant papers, see [ ] [ ] [ ] [ ] [ ] [ ] [ ] . intensive studies have been carried out in the past two decades to understand the mechanism and regulation of autophagy. the biogenesis of autophagosomes needs the ordered intervention of autophagy-regulated (atg) proteins that act on different modules. thus, more than atg genes have been identified in human that orchestrate the complex membrane dynamics involved in autophagic sequestration. these atg proteins act sequentially in three macromolecular complexes involved in the three successive stages of autophagy. initiation of autophagy requires the unc- like kinase (ulk )-atg -fip (also known as rb -inducible coiled-coil ) complex, whereby the kinase activity of ulk is controlled by the kinase mammalian target of rapamycin (mtor) in mtor complex (mtorc ), which is sensitive to rapamycin [ ] . the next process, membrane nucleation, requires the beclin / class iii pi k complex, which also plays a major role in membrane trafficking and restructuring involved in autophagy [ , ] ; the final process refers to the elongation, expansion, and closure of the phagophore membrane/autophagosome which mainly relies on atg /microtubule-associated protein light chain (lc ) lipidation. in fact, atg /lc lipidation is regarded as a hallmark of autophagy and is established by a covalent linkage of cytosolic lc to the lipid phosphatidylethanolamine on the surface of the autophagosome [ , ] . of note, in addition to the cytoplasmic ptm of various atg proteins, recent studies have explored the transcriptional and epigenetic control of autophagy [ ] . notably, in human cells, tfeb (for transcription factor eb) and zkscan (for zinc finger with krab and scan domains ) were shown to be implicated in playing a crucial role in autophagy regulation [ , ] . also, there is growing evidence in support of the notion that histone modification/dna methylation acts as an alternative approach for long-term autophagy control [ ] (for histone modification, see part vi, sect. . . ) . also recently, a new ampk→skp →carm (for: ampactivated protein kinase; s-phase kinase-associated protein (p ); coactivatorassociated arginine methyltransferase ) regulatory axis was reported that incorporated cellular nutrient sensing with transcriptional as well as epigenetic control of autophagy [ ] . as concluded by xu and klionsky [ ] , "…this ampk-skp -carm signalling axis integrates the various levels of autophagy regulation including cell signalling, and transcriptional regulation as well as epigenetic modification. epigenetic and transcriptional regulation provides an energy-saving approach for control and also create an enduring memory in preparation for future adverse events. thus, this study has deepened our understanding of how autophagy can be controlled in a holistic manner by pathways linking a multitude of regulation mechanisms. given the extensive involvement of autophagy in human diseases, this work also presents potential directions for novel therapeutic intervention." indeed, besides its beneficial function in controlling cellular homeostasis, macroautophagic pathways when disrupted can have severe consequences leading to major diseases such as cancer, metabolic and neurodegenerative disorders, and cardiovascular and pulmonary diseases [ ] . of note, macroautophagy can be divided into two subtypes depending on the organelle that is targeted for autophagic degradation; thus, the process of mitophagy corresponds to autophagy of mitochondria, whereas the term er-phagy refers to autophagy of the endoplasmic reticulum (er). both processes deserve a few more words in the following subsection. the term mitophagy corresponds to cargo-specific autophagy of mitochondria, a process which mediates the selective removal of mitochondria [ , ] . the aim of mitophagy is to eliminate mitochondria, either to regulate their number to adjust to metabolic demand or to explicitly remove those that are damaged in terms of a quality control. mechanistically, mitochondria are selectively recruited into isolation membranes, which seal and then fuse with lysosomes to eliminate the trapped mitochondria. as discussed [ ] , mitophagy is preceded by so-called mitochondrial fission that divides elongated mitochondria into pieces of manageable size for encapsulation and also controls segregation of damaged mitochondrial material for selective removal by mitophagy. the term er-phagy (also called micro-er-phagy) refers to a process of distinct selective degradation of er membranes and proteins in the lysosome under stress, and this is independent of the core autophagy machinery [ ] [ ] [ ] (for er stress, see sect. . ) . studies on yeast showed that er-phagy is characterized by the fact that stress-induced er whorls are selectively taken up into the vacuole, the yeast lysosome. import into the vacuole was found not to involve autophagosomes but occurs through invagination of the vacuolar membrane, indicating that er-phagy is topologically equivalent to microautophagy [ ] . recent studies on yeast provide evidence suggesting that the atg proteins atg and atg are specific receptors for this pathway of er-phagy [ ] . at this point, it also appears worthwhile to mention that a more recent study on yeast revealed a novel er quality-control pathway, namely, the so-called macro-er-phagy. first results from this study suggest that this pathway delivers an excess of integral-membrane proteins from the er to the lysosome for degradation and, typically, requires the core autophagy machinery [ ] . the brief overview about macroautophagy provides another typical example of innate immune responses which, when controlled, operate in a beneficial homeostatic way but, when uncontrolled, may lead to severe pathologies. for other forms of phagocytic responses, this phenomenon has not been investigated sufficiently. clearly, mitophagy also plays a key homeostatic role in mitochondrial quality control. upregulation of mitophagy has been shown to mitigate excessive mitochondrial accumulation and toxicity to safeguard mitochondrial fitness. hence, mitophagy is a viable target to promote longevity and prevent age-related pathologies [ ] . concerning the two types of er-phagy (micro-and macro-er-phagy), one has to state that research in this exciting field has just begun. several questions remain to be addressed, for example, what is the purpose of er-phagy and what are the underlying mechanisms. future studies will probably provide a clue to elucidating the molecular mechanisms and physiologic roles of er-phagy in other organisms. of note, besides mitophagy and er-phagy, other specific forms of phagocytic pathways have been described. they include -pexophagy as a macroautophagic response preferentially targeting peroxisomes -nucleophagy as an autophagic response selectively targeting portions of the nucleus -ribophagy as a specific autophagic response targeting ribosomes -aggrephagy as an autophagic response specific for protein aggregates -lipophagy in terms of selective autophagic degradation of neutral lipid droplets -bacterial xenophagy as a macroautophagic removal of cytoplasmic bacteria which have escaped the phagosomal compartment upon phagocytosis -viral xenophagy as a macroautophagic response targeting fully formed cytoplasmic virions or components thereof -proteaphagy in terms of macroautophagic responses specific for inactive proteasomes -lysophagy as a specific macroautophagic disposal of damaged lysosomes in mammalian cells for details of these specific forms of phagocytic responses, the reader is referred to the excellent comprehensive review article of galluzzi et al. [ ] . in addition to macroautophagy, two other types of autophagy have been described called microautophagy and chaperone-mediated autophagy (cma). microautophagy together with macroautophagy plays, for example, a role in nutrient recycling under starvation. on the other hand, cma is known to contribute to the maintenance of cellular homeostasis by facilitating recycling of amino acids of the degraded proteins and by eliminating abnormal or damaged proteins, thereby exerting major regulatory functions in different pathophysiological scenarios such as metabolic regulation. here, a few aspects of these two types of autophagic responses are skimpily touched. by contrast to macroautophagy, the process of microautophagy is much less defined in mammals since most studies have been performed in yeast and plants. according to current models, the term refers to a collection of diverse processes. unlike autophagy, microautophagy does not involve the autophagosome-dependent degradation of cytoplasmic components but rather and characteristically relies on the direct engulfment of small portions of cytoplasm into lysosomes or late endosomes by invagination and inward budding of the lysosomal/endosomal membrane, a process that leads to their degradation [ ] [ ] [ ] . though microautophagy is the least studied form of autophagy, a molecular signature of the process has begun to emerge and has led to the definition of microautophagy as a type of autophagy in which the cargo is directly internalized in small vesicles that form at the surface of the lysosome/vacuole or late endosomes (multivesicular bodies), respectively [ ] . in addition to macroautophagy and microautophagy, there is another type of autophagy experiencing increased attention, the cma. characteristically, in cma, cargo delivery also occurs directly at lysosomes, but it does not require formation of vesicles nor membrane invagination. instead, the substrate proteins for this autophagic pathway cross the lysosomal membrane through a protein-translocation complex, that is, a process that requires protein interaction with the chaperone hspa (also known as hsc ) and association of hspa with a specific splicing isoform of lamp- , that is, the lysosomal protein lamp- a. thus, chaperone-bound autophagy substrates bind lamp- a monomers on the cytosolic side of the lysosome, which stimulate the formation of an oligomeric lamp- a translocation complex [ , [ ] [ ] [ ] . essential functions that cma fulfils in cells include a contribution to amino acid recycling during prolonged starvation as well as quality control, directly linked to the ability of this pathway to selectively remove single proteins from the cytosol. for example, cma is up-regulated during oxidative stress where it contributes to the degradation of oxidized proteins (reviewed in [ ] ) (see next sect. . ). of note, growing evidence demonstrates that malfunction of cma plays a vital role in the pathogenesis of severe human disorders. often, the mechanisms underlying the alterations of cma in these pathologies involve perturbations in the functioning of the cma translocation complex. both diminished and enhanced cma activities have been shown to associate with diseases, an observation that emphasizes the importance of a tight regulation of cma activity (highlighted in [ ] ). as argued above, current knowledge about mechanism and physiological relevance of microautophagy in mammalian cells is hard to judge since most findings derive from studies in yeast. however, future studies aimed at identifying proteins controlling microautophagy-related vacuolar membrane changes in yeast will probably allow to search for homologues in mammals and then investigate their contribution to mammalian microautophagy. by contrast, research on cma has already made substantial progress. for example, the recent identification of a plethora of new cma substrates and deficiencies in cma associated with diverse human pathologies has expanded our understanding of the importance of cma in multiple cellular functions. in fact, the growing number of connections between cma and human diseases has already generated interest in modulating cma activity for therapeutic purposes. there is a close relationship between autophagy and mamps and/or damps in the cellular response to injury. in fact, autophagy cannot be restricted to an innate immune mechanism that controls intracellular homeostasis alone but has to be extended in terms of "immunological autophagy" to a process that is committed to control and regulate efferent innate and potential adaptive immune responses. the "medium" for achieving this goal is the mamps and/or damps which operate as a link between intracellular and extracellular events. this scenario is briefly touched in the following. growing evidence indicates that autophagy regulates release and degradation of damps-here in terms of inducible damps-including hmgb , atp, and dna in several cell types [ ] . for example, autophagic mechanisms reportedly promote and regulate the release and secretion of hmgb in a ros-dependent manner in fibroblasts, macrophages, and cancer as well as net-mediated release of hmgb in neutrophils [ , ] . moreover, autophagy has been shown to be required for the liberation/active secretion of atp by dying cancer cells [ , ] . in addition, autophagy was found to contribute to the regulation of the ddr at multiple levels, that is, a process associated with the emission of damps [ , ] (for ddr, see sect. . ). via emission of damps, eventually, together with mamps, autophagy can amplify or even instigate mamp/damp-prm signalling leading to efferent innate immune responses. on the other hand, autophagy can inhibit pro-inflammatory signalling cascades. for example, the atg -atg complex, a key regulator of the autophagic process, was shown to negatively regulate rlr signalling by direct binding to card domains of rig-i and interferon promoter-stimulating factor- (ips- ) [ ] (compare part vi, sect. . . ). moreover, as reviewed elsewhere [ ] , autophagy has been found to inhibit both nlrp and aim inflammasome activation and subsequent production of pro-inflammatory cytokines il- β and il- (for inflammasomes, see part vi, sects. . . and . . ). as a possible mechanism, the authors propose that inflammasome components and pro-il- β are subjected to ubiquitination and subsequent degradation by autophagy, thereby leading to functional inactivation of inflammasomes. conversely, an increasing number of studies suggest that damps, including hmgb , atp, and dna, are powerful stimuli and regulators to elicit autophagic responses [ , [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] . for example, hmgb was demonstrated to be an important regulator of autophagy in various types of cancer cells and keratinocytes. mechanistically, the reduced form of the hmgb protein was proposed to be responsible for the promotion of autophagy in an ager/rage-dependent fashion [ , ] . also, and of high interest, in a clinical study on patients with chronic hepatitis b, hmgb -induced autophagy was found to maintain treg function during chronic viral infection [ ] . moreover, in studies on a mini pig lung iri model, evidence was provided indicating that autophagy, when triggered by damps such as hmgb and hsp during iri, amplifies the inflammatory response through enhancing k -linked ubiquitination of traf and activation of the downstream mapk and nf-κb signalling (for traf , mapk, and nf-κb signalling, see part moreover, there is already first evidence suggesting a role of atp in the regulation of autophagy [ ] . in addition, there are accumulating data indicating that cytosolic dna, dislocated as a result of dna damage, may contribute to the regulation of autophagy whereby the dna damage-regulated autophagy modulator (dram ) appears to play a mechanistically crucial role [ ] [ ] [ ] . the mechanisms involved in mamp/damp-activated autophagic responses have only partially been elucidated. in fact, there is convincing evidence suggesting that many mamp/damp-recognizing prms including tlrs (in particular endosomal tlrs), nlrs, and anti-dna receptors can activate autophagic responses by triggering specific signalling pathways (reviewed or discussed in [ , , , ] ). the crosstalk between autophagic responses and damps represents a powerful instrument of the innate immune system to integrate and unify various tools for the promotion and regulation of injury-induced inflammation and, in the presence of nonself-or altered self-antigens, injury-induced adaptive immunity. thus, on the one hand, autophagy is known to promote and regulate the release of damps (though the exact mechanisms are still elusive); subsequently, damps via prmtriggered pathways participate in the regulation of inflammation. on the other hand, activation of prms by mamps and/or damps promotes autophagy activation through a mechanism that has been partially elucidated. in fact, an increasing number of findings suggest that this activation process is triggered by prms following recognition of mamps and/or damps. nevertheless, the precise molecular mechanisms by which prms modulate autophagy remain largely unknown. there is increasing evidence in support of the notion that mamp/damp-activated autophagic responses promote emission of damps which in turn support cellular homeostasis in the course of adaptive stress responses in healthy cells. notably, this cellular homeostatic effect may spread out and affect the whole organism via emission of autophagy-dependent damps. in other words, via damps, autophagy as a cell-intrinsic stress response can fortify its defending capability by providing a link to promotion and regulation of cell-extrinsic efferent innate immune and eventually subsequent adaptive immune responses. however, despite the fact that autophagy is one of the best-known cell-autonomous responses in innate immunity and has clearly been shown to counteract dangerous infectious and sterile cell stress, much is left unclear. one such issue concerns the definition of autophagy-dependent cell death. as discussed and summarized [ ] , autophagy-dependent cell death can be defined as a form of rcd (see next chapter) that can be retarded by pharmacological or genetic inhibition of macroautophagy. in this context, as stressed by galluzzi et al. [ ] , it is important to note that ( ) specificity issues affect most, if not all, pharmacological agents employed so far for suppressing macroautophagic responses and ( ) multiple components of the macroautophagy machinery have autophagy-independent functions. in view of these facts and findings, these authors recommend to favor genetic approaches and to test the involvement of at least two different proteins of the macroautophagy apparatus in a specific instance of rcd before etiologically attributing it to macroautophagy. other unclear issues refer to the specific modulation of autophagy by mamps and/or damps, the precise interaction of autophagy with innate immune signalling cascades, and the cooperation between autophagy and other physiologic cell-intrinsic and cell-extrinsic processes during scenarios of cell stress and tissue injury. efforts to solve these problems are of utmost importance in view of the fact that autophagy-when induced by excessive, chronic, or acute-repetitive emission of damps-can contribute to the pathogenesis of many human diseases, that is, acute and chronic, infectious, or sterile inflammatory disorders. at the respective places, they will often be mentioned in the following chapters as well as in volume . oxidative cell stress and tissue injury reflect most potent and omnipresent threats an organism is exposed to. though there is a robust defense response continuously operating, this kind of injury is known to contribute to the pathogenesis of many human diseases. how can this be? oxidative stress is caused by an imbalance between the production of oxidants such as ros on one side and the biological antioxidative defense system's ability on the other side to counter the oxidant levels with antioxidants, that is, to readily detoxify the toxic reactive species or easily repair the resulting damage. thus, it is the excessive production of ros-overriding the antioxidative capacities-that is pathophysiological and contributes to dysfunction, damage, and even death of cells. by contrast, generation of ros in physiological low/ moderate concentrations-operating as second messenger molecules and causing so-called oxidative "eustress" [ ] -assists in intracellular signalling pathways and, thus, is essential for optimal cell functions and homeostasis of an organism. in other words, the biological effects of ros-beneficial or deleterious-considerably depend on the amounts of ros present and, in action, a phenomenon that is in agreement with the idea that cellular ros generation has characteristics of hormesis implying a dose-response phenomenon that is characterized by beneficial effects at low doses and deleterious effectivity at high toxic doses [ ] . to guarantee this homeostatic function of ros, to keep these molecules within physiological limits, and to prevent their deleterious effects, that is, to maintain hormesis, a smooth running of the oxidative stress response is of utmost importance. accordingly, a few aspects of this critical stress response are addressed in the following. reactive oxygen species are produced from molecular oxygen as a result of normal cellular metabolism. to understand any discussion on a role of ros in host defense or human diseases, one should define free radicals. according to halliwell and gutteridge [ ] , "a free radical is any species capable of independent existence that contains or more unpaired electrons." an unpaired electron is one that occupies an atomic or molecular orbital by itself. radicals can be formed by the loss of a single electron from a non-radical, or by the gain of a single electron by a non-radical. in this sense, superoxide anions (o ·− ), hydroxyl radicals ( · oh), peroxyl radicals (ro · ), and alkoxyl radicals (ro˙) are oxygen radicals. of note, ros is a collective term often used by scientists to include not only the oxygen radicals but also some non-radical derivatives of oxygen such as h o , hypochlorous acid (hocl), ozone (o ), and singlet oxygen ( o ). nitrogen-containing oxidants, such as no · are called rns. generation of ros is generally a cascade of reactions that starts with the production of superoxide anions. superoxide rapidly dismutases to h o either spontaneously (especially at the low ph) or catalyzed by sod. other elements in the cascade of ros generation include the reaction of superoxide with no to form the very toxic peroxynitrite, the peroxidase-catalyzed formation of hocl from h o , and the iron-catalyzed fenton reaction, leading to the generation of hydroxyl radical. the oxidants are produced endogenously as by-products or metabolites of various metabolic processes. multiple enzyme systems produce superoxide radicals and their derivatives including xanthine oxidoreductase (xor), the reduced form of nadph oxidases (noxes), and mitochondrial electron transport chain (etc)associated molecular complexes ( fig. [ , , [ ] [ ] [ ] [ ] [ ] [ ] [ ] mitochondrial etc is believed to be the main source of ros. in the following, some aspects of these three systems (other systems not mentioned here) are briefly touched, exemplified by and focused on their role as vascular sources of ros production as investigated on models of iri. xanthine oxidoreductase (xor), as a housekeeping enzyme, is probably expressed in all cells but primarily in surface epithelia such as capillary endothelial tissue of various organs. xanthine oxidoreductase, a complex molybdoflavoprotein, is the ratelimiting step in the catabolism of purines, where it catalyzes the last steps of purine metabolism: the conversion of hypoxanthine to xanthine and of xanthine to uric acid, with superoxide/h o generated as by-products. there is a definite role of xor in reperfusion of tissue and organs [ , ] . for example, experiments performed in isolated rat hearts have demonstrated that radical generation and functional injury are decreased by inhibition of xor with oxypurinol. similarly, in human aortic or venous ecs, xor-mediated ros generation has been shown to be a central mechanism of oxygen radical generation upon postischemic reoxygenation [ , ] . the nadph oxidases were initially considered as enzymes expressed only in phagocytic cells involved in host defense and innate immunity; however, recent evidence indicates that there is an entire family of noxes based on the discovery of gp phox homologues. the family comprises seven members, including nox , nox (formerly termed gp phox), nox , nox , nox , duox , and duox [ ] . three members out of the enzyme family are important sources of ros in the vasculature, namely, nox , nox , and nox [ ] [ ] [ ] . today, noxes are perhaps the best-studied enzymes involved in ros production in the blood vessels. remarkably, different members of the nox/duox family engaged in iri are localized in various cells, that is, in vascular cells and phagocytes. this may lead to the notion that noxes in vascular cells are responsible for the first wave of ros production because vascular cells are first confronted with reintroduced molecular oxygen. as generally believed, there is, in fact, no vascular specific nox isoform but rather a complex expression of various nox isoforms in different cells and regions of the vascular system. nevertheless, in arteries from humans and animals, nox , nox , and a shallow level of nox have been consistently found to be present both as messenger rna and as protein [ ] . altogether, the findings and data briefly described here make clear that vascular cells are equipped with efficient machinery able to efficiently produce ros. regarding the different enzymatic sources, superoxide radicals appear to be predominantly generated compared, for example, to hydroxyl radicals. mitochondria have been implicated as potential oxygen sensors by increasing the generation of ros which regulate a variety of hypoxic responses [ ] [ ] [ ] [ ] . in fact, mitochondria are increasingly recognized as lynchpins in the evolution of tissue injury during postischemic reperfusion. it is generally acknowledged that the majority of intracellular ros production is generated in the mitochondrial etc and its associated metabolic enzymes. however, very little is known about which mitochondrial sites are involved in physiological or pathological ros production under native conditions. of note, using inhibitors to manipulate the redox states of particular sites and prevent superoxide generation from others, at least ten different locations of superoxide/h o production in the etc and associated enzymes (krebs cycle, β-oxidation, etc.) have been identified in mammalian mitochondria. in fact, the relative and absolute contributions of specific sites to the production of ros in isolated mitochondria depend very strongly on the substrates being oxidized, and the same is likely valid in cells and in vivo [ ] . for example, superoxide formation occurs on the outer mitochondrial membrane, in the matrix, and on both sides of the inner mitochondrial membrane (fig. . ) . complex i (nadh-ubiquinone oxidoreductase) accepts electrons from nadh; these electrons are carried to complex ii (the succinate dehydrogenase-coq oxidoreductase), where they are used to oxidize succinate to fumarate. afterward, electrons continue to travel down their electrochemical gradient to complex iii (the cytochrome bc complex (ubiquinol-cytochrome c oxidoreductase)), and subsequently to complex iv (cytochrome c oxidase); finally, the electrons are used to reduce molecular oxygen to water. thus, complex i and complex ii oxidize the energy-rich molecules nadh and flavin adenine dinucleotide h , respectively, and then transfer the resulting electrons to ubiquinol that carries it up to complex iii (for competent articles, see [ , ] notably, complexes i and ii generate superoxide within the mitochondrial matrix, whereas complex iii produces superoxide at the qo site, resulting in the release of superoxide into either the intermembrane space or the matrix. regarding complex i, it was recently demonstrated that inhibition of nd , a subunit of complex i, suppresses the activity of this complex and thus ros production [ ] . furthermore, data from another set of studies on complex i showed that stable down-modulation of its subunits grim- and ndufs decreased complex i activity that was associated with a significant reduction in the overall nadh oxidation rate but with an increased production of ros by the target cells [ ] . similar results have been found in studies on complex ii: there is evidence suggesting that inhibition of complex ii on the level of subunits even leads to an increase in ros production. the phenomena can be explained by assuming that, if electrons provided in the course of the etc cannot efficiently be transferred to the next complex, they would leak out from the inhibited complex and generate ros [ ] . the complex iii subunits rieske iron-sulfur protein (risp) encoded by ubiquinol-cytochrome c reductase, rieske iron-sulfur polypeptide (uqcrfs ), and ubiquinol-cytochrome c reductase binding (uqcrb) protein appear to play a crucial role in hypoxia-triggered mitochondrial ros generation (for rieske, see box . ). thus, it was shown that risp promotes the hypoxic stabilization of the transcription factor hif- α protein [ ] and uqcrb was found to mediate hypoxia-induced tumor angiogenesis via mitochondrial ros-mediated signalling [ , ] . also, and of note, a mouse model to permit conditional deletion of the nuclear-encoded risp gene was recently developed to assess its role in hypoxiainduced ros signalling in the pulmonary circulation [ ] . it was found that depletion of risp abolishes the ros response to hypoxia in isolated pulmonary the rieske protein is an iron-sulfur protein (isp) component of the cytochrome bc complex that was first discovered and isolated by john s. rieske and coworkers in . the rieske iron-sulfur protein is an essential subunit of mitochondrial cytochrome bc complexes and, like the majority of mitochondrial proteins, is encoded by a nuclear gene and synthesized on cytoplasmic ribosomes as a precursor with a -residue amino-terminal extension. the iron-sulfur protein is then post-translationally imported into the mitochondria where it is inserted into the bc complex in the inner mitochondrial membrane. at first, the precursor is translocated via translocation contact sites into the matrix. there, cleavage to an intermediate containing an -residue extension occurs. the intermediate is then redirected across the inner membrane, processed to the mature subunit, and assembled into complex iii. arterial smcs and isolated pulmonary artery segments. further, in this article, it was discussed that mitochondria are not the only source of ros during hypoxia. thus, studies using a genetic knockout of p phox suggested that cytosolic nadph oxidase systems may also contribute to a hypoxic pulmonary vasoconstriction response during acute hypoxia [ , ] . according to the authors' conclusion, the blockade of hypoxia-induced ros responses (in these studies observed with depletion of risp) suggests that the mitochondria may act as the initiators of ros production, which could be amplified by engagement of nadph oxidase systems elsewhere in the cell. such "ros-induced ros release" might permit small ros signals generated by mitochondria to activate ros signalling throughout the cell, thereby avoiding mitochondrial damage that might arise if the entire cellular oxidant signal originated from that organelle [ ] (or even leading to excessive ros production?). indeed, the substantial advances in oxidative stress research of recent times, in particular, the specification of hypoxia-sensing ros-producing enzyme systems, will contribute to new therapeutic strategies to be applied in acute and chronic human diseases known to be influenced by oxidative stress. for example, discrimination of oxidative eustress, a fundamental process in maintaining health, from oxidative damage will improve clarity in developing "redox medicine" [ ] . when the redox equilibrium of a cell is upset by pro-oxidant environmental stimuli, that is, when oxidative stress exists, an adaptive stress response takes place which can result in upregulation of antioxidant proteins and detoxification enzymes. these antioxidative defense molecules comprise the following [ ] : ( ) agents that catalytically remove free oxygen radicals and other reactive species, for example, sod, catalase, peroxidase, and thiol-specific antioxidants; ( ) proteins that minimize the availability of pro-oxidants such as iron ions, copper ions, and heme, for example, transferrins, haptoglobins, hemopexin, and metallothionein; ( ) proteins that protect biomolecules against damage (including oxidative damage) by other mechanisms, for example, hsps; and ( ) low-molecular-mass agents that scavenge ros and rns, for example, glutathione, α-tocopherol, and (possibly) bilirubin and uric acid. in the past, it was fashionable to divide the oxidative stress response into three main tiers: ( ) antioxidant enzymes including sod, catalase, glutathione peroxidase, and glutathione; ( ) detoxifying enzymes such as glutathione peroxidase, glutathione s-transferase, aldo-keto reductase, and aldehyde dehydrogenase; and ( ) energy-dependent efflux pumps. as a fourth defense system, the antioxidant nutrients such as vitamins e and c as well as carotenoids were appreciated [ ] . it was generally accepted that the first line of enzymes is of enormous importance in limiting ros-mediated damage to intracellular macromolecules. for example, among the most important regulators of ros levels were the sod enzymes: cu/ znsod in the cytoplasm and outer mitochondrial space and mnsod exclusively in the inner mitochondrial space. mechanistically, superoxide is converted to h o and oxygen (o ·− + o ·− + h + → h o + o ) by sod. peroxiredoxins and abundant catalase enzyme then scavenge h o , converting it to molecular oxygen and water. another example of a first-line defense molecule is trx. thioredoxin contains two adjacent -sh groups in its reduced form which are converted to a disulfide in oxidized trx. notably, it can undergo redox reactions with multiple proteins using the reaction trx however, it then turned out that these antioxidative principles were clearly not % effective at performing this task, as under normal physiological conditions, lipid and dna oxidation products can be detected in blood and urine. because certain compounds of the chemicals generated after an interaction of ros with macromolecules are highly reactive, there must be an equal necessity to detoxify these secondary oxidation products to prevent them from also damaging dna, proteins, and lipids. without the adequate detoxification of such products, an extended chain reaction will occur resulting in the degradation of cellular components and the ultimate death of the cell. this second line of defense against ros is provided by those detoxifying enzymes. finally, detoxified metabolites produced by these enzymes are eliminated from the cell by energy-dependent efflux pumps such as the glutathione s-conjugate transporter, also called the multidrug resistance-associated protein (mrp) [ ] . today, one must state that these previous notions are incomplete. at first, it became apparent that members of the so-called cnc (for cap 'n' collar)-basic region-leucine zipper (bzip) family of transcription factors are principal mediators of defense responses to redox stress. in mammals, the cnc family members nrf and nrf were shown to be involved in the transcriptional upregulation of cytoprotective genes encoding a large number of diverse detoxification, antioxidant, and anti-inflammatory proteins (e.g., glutamate cysteine ligase, nadph-quinone oxidoreductase, glutathione s-transferases, and aldo-keto reductases) as well as enzymes with essential roles in cell metabolism [ ] . more recent studies then revealed that these transcription factors, notably nrf , are activated by keap as the primary negative regulator of nrf , that is, a molecule that simultaneously operates as a sensor protein able to perceive dyshomeostatic subclass iic- damps, for example, in terms of redox changes reflecting electrophilic stress. it is worth to add here that six critical domains have been defined in nfr (neh -neh ), and it is neh , located at the n terminus of nrf that acts as the regulatory domain for the cellular stress response. actually, neh interacts with the cytoplasmic protein keap [ ] . in the following, this very important damp-induced and gene-based antioxidative and cytoprotective system is addressed a bit more in detail. increasing evidence indicates that several redox-regulated gene products serve to protect cells from ros damage. the antioxidant response element (are), a cis-acting dna regulatory element or enhancer sequence is known to be activated by oxidative stress and to be responsible for the transcriptional regulation of several redox-regulated gene products. both nrf and bind to are and regulate are-mediated gene expression and induction. the molecule nrf is more potent than nrf in activation of are-regulated gene expression and is regarded as the principal transcription factor that binds to the are. this transcription factor is ubiquitously expressed and present in various organs and tissues including the kidney, muscle, lung, heart, liver, and brain (for reviews, see [ ] [ ] [ ] [ ] [ ] ). as touched in part ii, sect. . . and part iv, sect. . . , the nrf -triggered antioxidant response is initiated by activation of keap that functions as a substrate adaptor protein for the degradation of nrf and serves as an intracellular sensor for redox changes reflecting the presence of subclass iic- damps (for reviews see [ ] [ ] [ ] ). earlier studies had already shown that nrf is a bzip transcription factor that translocates to the nucleus after liberation under oxidative stress conditions from its cytosolic inhibitor keap [ ] . in the nucleus, nrf was found to form dimers with the proteins maf, jun, fos, activating transcription factors (atf ), and/or creb binding protein (cbp) and, in addition, regulates transcription by binding to the are upstream of a variety of cytoprotective and detoxification target genes to combat the oxidative stress [ ] . thus, established nrf -regulated genes reportedly included cu/zn sod, catalase, trx, trx reductase, glutathione reductase (gr), glutathione peroxidase (gpx), and ferritin l (ftl) [ ] (ftl and ferritin h (fth) subunits are responsible for intracellular iron storage). all of these genes are involved in the response to oxidative stress. there are several other genes also known to be engaged in the response to oxidative stress that are not described here. recently, the molecular signalling mechanism involved in the keap ↔ nrf pathway has been further elucidated and specified. the core can be seen in an axis consisting of redox change (subclass iic- damps)-initiated → keap -induced → nrf -triggered → are-driven expression of antioxidant and detoxifying genes ( fig. . ) (discussed in [ - , , ] ). the complex and complicated sequelae of the pathway are simplified in the following text. under homeostatic and stress-free conditions, binding of keap to the nrf molecule leads to its polyubiquitination and subsequent degradation by the proteasomal pathway, thereby maintaining a consistent generation of nrf and retaining its very low levels in the cytoplasm. in this scenario, keap homodimer binds to a single nrf protein via a high-affinity so-called "etge" motif and low-affinity so-called "dlg" motif. the twosite recognition of nrf by the keap dimer is essential for polyubiquitination of nrf (also see part ii, sect. . . ) (for polyubiquitination, see box . ) . in this sense, the keap ↔ nrf system can be regarded as a vital part of regulating cells under a homeostatic environment. however, in case of dangerous and threatening oxidative (and xenobiotic) stress, the system instigates a stress response that is characterized by a rapid and dramatic cessation of the keap -dependent polyubiquitination process resulting in a rapid increase of nrf abundance. in fact, exposure to ros-mediated stress (or electrophilic stress) is thought to modify the reactive "iic- damps-sensing" cysteine residues in keap , which is associated with a conformational change of the protein resulting in a loss of keap ubiquitination activity (fig. . ) . notably, as a cysteine-rich protein, the human keap possesses cysteine residues, and they are all reactive to stress to varying degrees. among these sensor cysteines of keap , c is best characterized. evidence from a number of studies has suggested that c is the most reactive and critical to the keap ↔ nrf stress-sensing response. even, there is already evidence from a first atomic-level view suggesting that the unique environment of cys (besides cys , cys , and cys ) appears to be the critical residue of keap responsible for detecting increased levels of oxidative stress [ , , ] . oxidative modification of cysteine sensors of keap leads to a loss of its polyubiquitination and degradation activity thereby stabilizing nrf . consequently, fig. . the oxidative stress-induced keap ↔ nrf pathway. under non-oxidative homeostatic conditions, the sensor keap is bound to the nrf molecule resulting in its polyubiquitination and subsequent degradation via the proteasomal pathway. oxidative stress modifies the ros-sensing cysteine residues of keap leading to loss of its polyubiquitination and degradation activity, dissociation of nrf that becomes stabilized and accumulates. then, nrf translocates to the nucleus and forms a heterodimer with the smaf transcription factor. the nrf /smaf heterodimer binds to are and induces transcription of numerous cytoprotective antioxidant and detoxification genes. are antioxidant response element, keap kelch-like erythroid cell-derived protein with cnc homology (ech)-associated protein , nrf nuclear factor-erythroid p -related factors , smaf small musculoaponeurotic fibrosarcoma, ub-ub-ub poly-ubiquitin chain. sources: refs. [ - , , ] stabilized nrf accumulates in the cytoplasm, translocates into the nucleus, and forms a heterodimer with a smaf transcription factor (smaf, small musculoaponeurotic fibrosarcoma). thus, this accumulation of nrf in response to ros (and electrophiles) cannot be regarded as an induction in a strict sense but instead is a mechanism referred to as derepression, that is, from the rapid degradation-based repression. as highlighted and discussed [ ] , there are two models of how to explain the cytoplasmic accumulation process of nrf . the "hinge-and-latch" model holds that the modification of the sensing cysteine residues of keap reduces its affinity for nrf but does not result in release. instead, newly synthesized nrf is translocated to the nucleus to trigger the transcription of nrf -dependent genes. the other model denoted as the "conformation cycling" model claims that keap uses a cyclic mechanism to target nrf for polyubiquitination and proteasomal degradation. an important feature of this cyclic mechanism is that it ensures regeneration of keap which allows the cycle to proceed. modification of specific reactive cysteine residues of keap may block the cycle of keap -dependent nrf degradation allowing de novo synthesized nrf to accumulate. the subsequent transcriptional process in the nucleus has been specified as well. as partially mentioned above, the nrf -smaf heterodimer binds to are or electrophile-responsive element (epre) and induces transcription of numerous cytoprotective genes. of note, recently, an extensive genome-wide analysis of the nrf -smaf-binding sequence, that is, the are/epre, and the maf homodimerbinding sequence (the so-called maf responsive element or "mare") was conducted, and the differences between these elements were clarified. as a result, it was proposed that are, epre, and the nf-e binding sequence be collectively named cnc-smaf-binding elements (csmbe) [ ] . interestingly, prms appear to be co-players in this scenario. thus, tlrs have been observed to induce nrf activation. remarkably, in a recent study, tlr agonists were shown to activate nrf signalling via reduction of keap [ ] . the authors could demonstrate that tlr signalling-induced keap reduction promotes nrf translocation from the cytoplasm to the nucleus, where it activated transcription of its target genes. further, tlr agonists were found to modulate keap at the protein post-translation level through autophagy. in fact, tlr signalling increased the expression of autophagy protein p and lc -ii and induced their association with keap in the autophagosome-like structures. the keap ↔ nrf system as briefly described here is a robust oxidative stress response. its regulatory mechanisms, for example, stress-sensing mechanism, proteasome-based regulation of nrf activity, and selection of target genes have been elucidated mainly in mammals (for proteasome, see box . ). nevertheless, the pathway is now regarded as an evolutionarily conserved defense mechanism against oxidative and xenobiotic stress across the tree of life. thus, the keap ↔ nrf system has been found to be also present in zebrafish, fruit fly, and caenorhabditis elegans indicating that its roles in cellular defense are conserved throughout evolution among vertebrates and suggesting that analogous defense systems are widely conserved throughout the animal kingdom [ , ] . as briefly demonstrated in this subchapter, aerobic organisms have integrated antioxidant systems, which include damp-promoted generation of enzymatic and nonenzymatic antioxidants that are usually effective in blocking harmful effects of ros. however, when ros is produced in excess causing pathological conditions, the stress response against oxidative damage can be overridden. thus, oxidative stress is known to contribute to many pathological conditions, including cancer, neurological disorders, atherosclerosis, hypertension, ards, and chronic obstructive pulmonary disease, just to mention a few of them. plausibly, these disorders are motivation enough to search for new effective therapeutic options by harnessing the new insights into mechanisms of the keap ↔ nrf system. certainly, intense research is essential for a detailed understanding of the precise consequences of targeting keap for disease prevention and treatment. all the more so as the oxidative stress response is integrated into other forms of innate stress responses that will be further outlined in the following subchapters. the heat shock response-one of the most ancient and evolutionarily conserved cytoprotective mechanisms found in nature-is induced upon exposure of living cells to acute, subacute, or chronic stress conditions. this defense response is characterized by the expression of a group of phylogenetically conserved intracellular hsps, which possess the capacity to recognize structures commonly found in the interior of proteins and to bind such structures. thanks to this property, they form a chaperone network involved in correct protein folding, trafficking, and complex assembly (for reviews, see [ ] [ ] [ ] [ ] [ ] [ ] [ ] ). in order to be released from engagement with proteins after folding and take part in further rounds of activity, hsp and other chaperones utilize an intrinsic atpase domain to hydrolyze atp and assume a free conformation [ ] . once released, hsps mediate protective cellular defense mechanisms including regulation of apoptosis, the aim being to maintain and restore cellular protein homeostasis (proteostasis). exposure of almost any cell to heat shock most often leads to the rapid transcription, translation, and accumulation of a variety of hsps that increase to quite considerable levels when the stress is pronounced. in fact, these molecules are induced by various environmental insults that can cause protein denaturation and unfolding within the cells, leading to the formation of nonnative proteins and protein aggregates, thereby emitting dyshomeostatic damps. of note, besides their protective role in different intracellular compartments, hsps in terms of inducible damps can translocate to the cell surface to get exposed or are actively secreted via non-canonical pathways. in case of necrotic cell death, they are passively released in large amounts into the extracellular space to act as constitutively expressed damps (compare part iv, sect. . . and sects . . . and . . . ). once emitted, hsps are sensed by classical recognition receptors and/or non-classical receptors (e.g., cd ) [ ] [ ] [ ] [ ] [ ] [ ] [ ] . interestingly, recent knowledge about similarities between allograft and tumor rejection has visualized that the processes of both iri to allografts [ , , ] and therapy-mediated injury to tumors [ ] [ ] [ ] [ ] are characteristically associated with emission of hsps. in all eukaryotes, the hsr is primarily regulated and controlled by the hsfs, in particular, hsf , a sequence-specific factor that binds upstream to heat shock elements in the promoters of target genes [ ] (fig. . ). for example, hsf is activated by environmental stress including oxidative and tumor-associated stress [ , ] . notably, in all eukaryotes, hsf responds to such stress conditions by undergoing a monomer to trimer transition and becomes massively phosphorylated, leading to its acquiring ability to bind to dna rapidly and activate transcription [ ] . there is at least preliminary evidence suggesting that intracellular perturbations reflecting dyshomeostatic damps may activate hsfs [ ] [ ] [ ] [ ] . such molecular alterations reportedly include changes in cytosolic ca + concentration, for example, caused by increase of fluidity in specific membrane domains [ , ] . interestingly, a recent study in support of these earlier findings provided evidence of the existence of a plasma membrane-dependent mechanism of hsf activation in animal cells, which is initiated by specific membrane-dependent trpv calcium channel-like receptors [ ] . these findings lend support to the notion that heat sensing and signalling in mammalian cells are dependent on trpv channels, suggesting that these receptors may act as a major hsr sensor in different epithelial non-cancerous and cancerous cells, capable of triggering the cellular hsr [ ] . in another line of experiments, trpv was demonstrated to mediate the effects of transient heat shock on endocytosis of human monocyte-derived dcs, suggesting a central role of trpv in mediating the cellular action of heat shock on these important cells of the innate immune system [ ] (for trpv channel receptors, also compare part ii, sect. . . ). induction of an hsr is not only mediated by sterile stress conditions but is also believed to be promoted by cell-invading viruses or bacteria. consequent emission of hsps in their role as damps may reflect a mechanism by which pathogens may contribute to sterile inflammation. for example, exacerbation of hepatitis b virus (hbv)-associated liver injury is reportedly characterized by an abnormal immune response that not only mobilizes specific antiviral effects but also poses a potentially lethal non-specific sterile inflammation to the host [ ] . heat shock proteins may be the most extensively studied damps in the context of hbv infection. a number of hsps such as hsp and hsp have been reported to be supportive factors in the process of hbv replication, and selective inhibition of these hsps was proposed to be host-based anti-hbv strategies [ ] [ ] [ ] . thus, as argued [ ] , both infective and sterile inflammation may synergistically contribute to the exaggeration of chronic hepatitis, if the hbv cannot be cleared entirely. likewise, bacterial infections were also reported to promote induction of the hsr [ , ] . for example, in studies on h. pylori and e. coli infection models, the initiation of an hsp stress response could be demonstrated [ , ] . the hsr is recognized and accepted as a classical stress response in nearly all species across the tree of life. it reflects the desperate efforts of a cell to restore homeostasis and survive upon both infectious and sterile insults. its products, the hsps, operate as damps in commission of the innate immune system to reach this goal. notably, via this mechanism, an hsr, induced by infectious damage to a cell, can promote sterile inflammation. whereas a successful hsr upon stress leads to restoration of cellular homeostasis and cell survival, an unsuccessful hsr may result in rcd such as apoptosis [ ] . this phenomenon will be resumed in the following subchapters. the er is a continuous membrane system that forms a series of flattened sacs within the cytoplasm of eukaryotic cells. as a subcellular organelle in the control of proteostasis, it is responsible for calcium storage and lipid biosynthesis as well as the synthesis, correct folding, processing, and maturation of proteins as well as for the orchestration of their transport along the classical/conventional secretory pathway. the er delivers these components to their destination compartments which include the er itself, the golgi apparatus, the plasma membrane, and the extracellular milieu or the endocytic and autophagic pathways. plausibly, the multifunctional nature of this organelle requires a myriad of proteins, unique physical structures, and coordination with and response to perturbations in the intracellular environment. a series of chaperones, folding enzymes, glucosidases, and carbohydrate transferases support and execute these processes. perturbation of er-associated functions such as accumulation of unfolded/misfolded proteins, excessive ros production, hypoxia, calcium and glucose depletion, or viral and bacterial infections reflect stress of the organelle and result in activation of an er stress-coping response, the evolutionary conserved upr [ ] [ ] [ ] [ ] . for example, the processes of both iri-mediated cell damage/cell death [ ] [ ] [ ] and induction of the icd of cancer cells [ ] [ ] [ ] are characterized by demonstration of er stress that is almost always associated with oxidative stress and vice versa [ ] . to cope with er stress, cells have the unique possibility to activate the upr, a dynamic signalling network that orchestrates the recovery of homeostasis or triggers rcd modalities, depending on the level of damage. perception of any perturbation of the er is provided by three sensor molecules of the upr embedded in the er membrane: the perk, ire- , and atf . the upr-mediated recovery of er homeostasis mainly occurs through the perk-eif α-mediated temporary shutdown of protein translation and the activation of a complex genetic program that aims to improve er quality control and adaptive responses [ ] [ ] [ ] [ ] (fig. . ) . accordingly, perk, devoted to perceiving dyshomeostatic damp-emitting er perturbations (so far, no clear evidence for ire- and atf in this respect), may be regarded as a new family of non-classical prms, at least in a broader sense. in nonstressed conditions, the three sensors of the er homeostasis are kept in an inactive state by the er-luminal binding immunoglobulin protein (bip). the protein bip, also known as glucose regulated protein (grp ), is a member of the hsp family of proteins (specifically hspa ). it functions as a chaperone to selectively bind unfolded proteins in the er lumen by interacting with exposed hydrophobic . simplified scenario model illustrating the er stress-induced three arms of the unfolded protein response. perception of any perturbation of the er is provided by three sensor molecules of the unfolded protein response embedded in the er membrane: the perk, ire- , and atf . perk perceives dyshomeostatic damp (subclass iic- damp)-emitting er perturbations (ire and atf still questionable). perk phosphorylates eif a to up-regulate transcription factor atf that induces the expression of transcription factor chop. ire a signals through its rnase via the splicing of xbp mrna. the active transcription factor xbp s translocates to the nucleus. atf is exported from the er to the golgi complex to enter the nucleus as a potent transcription factor. together, these transcription factors of the unfolded protein response determine the cell fate by the regulation of distinct subsets of target genes toward recovery of er homeostasis and cell survival or the induction of regulated cell death in form of apoptosis and ferroptosis. in addition, the transcription factor atf is differentially translated, up-regulating genes participating in autophagy and other homeostatic pathways. atf activating transcription factor, chop cytidine-cytidine-adenosine-adenosine thymidine-enhancer-binding homologous protein, eif α eukaryotic translational initiation factor α, er endoplasmic reticulum, ire α inositol-requiring transmembrane kinase/endoribonuclease α, perk protein kinase-like eukaryotic initiation factor α kinase, upr unfolded protein response, xbp x-box binding protein . xbp s, x-box binding protein whereby the "s" stands for the spliced form of xbp . sources: refs. [ , [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] residues on nascent peptides [ ] . however, in conditions of er stress, bip is detached from these sensors allowing their activation to trigger pathways, collectively included in the term of upr. this stress response acts as a corrective path, capable of both increasing the er folding capacity and decreasing the incoming polypeptide load. of note, this downstream pathway of each of the three upr sensors appears to have an innate preference for a particular type of er stress. moreover, as reviewed [ ] , upon dissociating from bip, each of the three sensors modifies the er to mitigate stress in its own unique way. for example, atf is often the first sensor to respond to er stress. once atf dissociates from bip, it is translocated to the golgi apparatus for cleavage. the cytosolic domain of atf is then free to move to the nucleus, where it moderates increased expression of several proteins involved in lipid biosynthesis and chaperones. this allows an increase in the volume of the er and provides more chaperone proteins to aid in folding, thus relieving some of the er stress. the other two sensors, ire- and perk, remain as integral er proteins but oligomerize and autophosphorylate following bip disassociation (autophosphorylation, a type of post-translational modification of proteins (see part vi, sect. . ); typical for this biochemical process, a phosphate is added to a protein kinase by itself). under remediable er stress conditions, the three sensors trigger signalling pathways to resolve er stress aiming at maintaining cellular integrity (fig. . ) . they are briefly touched in the following (for reviews and original articles, see refs [ , [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] .). for example, misfolded proteins are dislocated in the cytosol where degradation processes such as the er-associated degradation (erad) and autophagy will clear them, thereby reducing their potential toxicity. characteristically, erad is a protein quality control mechanism conserved in all eukaryotic cells and represents a critical arm of the upr, necessary to alleviate er stress. the erad mechanism results in the selective dislocation of unfolded and misfolded proteins from the er to the cytosol via specific membrane machinery. the erad targets are subsequently degraded by the cytosolic ubiquitin proteasome system (ups). the whole process includes transcriptional activation of a variety of er-associated chaperones and folding enzymes which include but are not limited to bip and the lectins calr, calmodulin (cam), and calnexin (cnx). a pathway that represents the most conserved branch of the upr is mediated by ire- , a multifunctional protein that possesses kinase and endonuclease activities. upon activation, ire- aggregates and autophosphorylates, thereby activating its endonuclease activity to catalyze the unconventional splicing of x-box binding protein (xbp ) via removal of a -nucleotide intron. this processing event changes the open reading frame of the mrna, resulting in the translation of the transcription factor now termed xbp s ("s" stands for the spliced form of xbp ) (for splicing, see box . ) . production of xbp s leads to upregulation of several genes involved in the upr's adaptive phase, for example, expression of er-resident molecular chaperones and protein folding enzymes. activation of perk leads to the phosphorylation of eif α that is required for the initiation of translation. this factor inhibits global protein synthesis by inhibiting the assembly of the s ribosome, thereby reducing er load and promoting cellular survival. at the same time and under these conditions, the transcription factor atf is differentially translated, up-regulating genes participating in protein folding, amino acid metabolism and transport, autophagy, and oxidative stress resistance/redox homeostasis. under conditions of prolonged or severe er stress that the upr cannot resolve (see below), atf also contributes to apoptosis through the induction of the transcription factor chop (for cytidine-cytidine-adenosine-adenosine-thymidine-enhancer-binding homologous protein) and by enhancing oxidative stress and protein synthesis. finally, the third branch of the upr is initiated by atf . the sensor atf is retained at the er under homeostatic conditions but translocates to the golgi apparatus under er stress where it is cleaved by the golgi-resident proteases site- protease (sp ) and sp . this event leads to the release of atf n-terminal fragment, a potent transcription factor that moves to the nucleus, where it binds the er stress response element upstream of a subset of upr genes to activate their transcription. together with xbp , this fragment regulates the expression of several genes with functions in protein folding, protein transport, and lipid biosynthesis, that is, genes involved in re-establishing er homeostasis. strikingly, there is a considerable interconnectedness of the er stress-promoted upr with other innate immune processes. for example, an increasing number of studies support the view that oxidative stress has a strong connection with er stress. during the protein folding process, ros are produced as by-products, leading to impaired redox balance conferring oxidative stress. as the protein in molecular biology, splicing is a modification of an rna after transcription in which introns are removed and exons are joined. thus, an intron is any noncoding nucleotide sequence within a gene that is removed by rna splicing during maturation of the final rna product; an exon is any part of a gene that encodes a part of the final mature rna produced by that gene after introns have been removed by rna splicing. this process is needed for the typical eukaryotic messenger rna before it can be used to produce a correct protein through translation. for many eukaryotic introns, splicing is done in a series of reactions catalyzed by the spliceosome, a complex of small nuclear ribonucleoproteins (snrnps), but there are also self-splicing introns. folding process is dependent on redox homeostasis, the oxidative stress can disrupt the protein folding mechanism and facilitate the production of misfolded proteins, causing further er stress [ ] . moreover, this stress response functions as a productive source of damps. typically, hsps and calr, like other er chaperones, can translocate to the cytosol and eventually to the surface of cells. once exposed, these molecules can operate as subclass ib- damps to facilitate engulfment of antigens as mostly described in the context of induction of icd [ , [ ] [ ] [ ] [ ] [ ] . via these mechanisms and by crosstalk with other molecular machines of the innate immune system including nlrp inflammasome activation via txnip (see part iv, sect. . . . and part vi, sect. . . . ) , the upr may contribute to sterile inflammation and immunity (for articles, see refs [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] ). also, several signalling cascades triggered by the three sensors are apparently potent inducers of autophagy at a cellwide level that-as mentioned above-normally has an adaptive/protective function and consists of the three major types: chaperone-mediated, macro-and microautophagy. interestingly, recent evidence has indicated that upr-induced autophagic processes are capable of alleviating the upr pointing to a crosstalk between these two innate immune defense mechanisms [ , ] . strikingly, a complex relationship reportedly exists between autophagy and damps in cellular adaption to stress and injury and cell death characterized by a crosstalk between autophagy induction and secretion or release of damps. in fact, growing evidence indicates that autophagic mechanisms are involved in regulating release and degradation of damps including calr, hmgb , atp, and dna in several cell types [ , , ] . this scenario may contribute to the observation that autophagy is also able to shape a supportive cellular immune response [ , ] . this kind of innate immune interrelationships is, for example, involved in mechanisms of both reperfusion-mediated cell injury and icd of cancer cells (see refs [ , [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] ). overall, depending on the duration and intensity of the stress, the upr engages different cellular pathways to restore and maintain cell survival, on the one hand, or trigger apoptosis, on the other hand. in cases of severe irremediable er stress, however, the balance is tipped in favor of pro-death signalling; that is, the upr, now mediated by different biochemical pathways, may lead to pro-inflammatory and pro-apoptotic responses resulting in catastrophic rcd (fig. . ) . while the precise pathways of apoptosis induced by er stress are not known, the up-regulated perk → eif α → atf → chop pathway plays an essential role by reversing translational arrest, increasing generation of ros, and promoting calcium efflux from the er. together, these signals lead to cytochrome c release from mitochondria and loss of membrane potential, resulting in apoptosis [ ] . in this context, it is interesting that one pathway of rn (ferroptosis; see sect. . . ) apparently shares a partially overlapping machinery with er stress, suggesting a molecular interconnectivity between these two events [ ] . the issue of er-stress-promoted upr is a parade example of an injury-induced innate immune response that can decide besides life and death of a cell. placed at the very beginning of defense processes of multicellular organisms upon injury, this stress response nicely reflects a hierarchy of damp emission (see part iv, chap. ) . striking is also the existence of an inter-organelle communication, for example, between upr, inflammasome activation, and autophagic pathways which emerge as a homeostatic network determining the switch from adaptive life-saving programs to cell death under stress conditions, where specialized sentinels are localized at organelle membranes to induce the core apoptosis pathway. as briefly sketched in the next section, this innate immune defense response is not only directed against sterile stress but also against pathogen-mediated stress. in the previous section, induction of a upr was mainly exemplified by referring to reperfusion-mediated cell damage and icd of cancer cells, that is, instances of sterile stress. however, growing evidence is coming to light clearly indicating that viruses and bacteria also induce er stress, thereby activating a robust upr. in fact, the constitution of cellular stress responses is meanwhile regarded as the first line of defense against both viral and bacterial infection. however, the outcome is not always in favor of the host; the pathogen may also profit from this stress response, at least under certain circumstances. an increasing number of reports have recently been published on this emerging topic (such as refs [ ] [ ] [ ] [ ] [ ] [ ] ), the quintessence briefly being addressed here. there are several mechanisms described of how a virus can induce er stress. the central mechanisms of perturbation of the er during virus infection can be seen in the production of large amounts of viral proteins by the virus concerned. such accumulation of viral proteins in the er implies a challenge to the protein folding machinery which may cause er stress and, in turn, activate the upr resulting in restoration of the er homeostasis or apoptosis. so far, at least viruses have been found to be able to induce er stress and activate the three upr stress signalling pathways [ ] . moreover, er stress can be caused by viruses via other mechanisms, for example, as a result of er membrane exploitation, imbalance of calcium concentration, or sabotage/depletion of the er membrane during virion release. viral infections may activate these pathways resulting in the inhibition or promotion of viral replication. for example, the perk-mediated global translation shutdown is a very efficient antiviral mechanism, and a similar shutdown by pkr has been used in the interferon pathway to defend against viral infection [ ] . also, the virus-related upr was found to trigger host inflammatory signalling cascade through innate immune signalling pathways that activate nf-κb and ap- transcription factors as a result of chronic er stress. in fact, overexpression of viral proteins in the er has long been known to activate these transcription factors which induce expression of pro-inflammatory cytokines such as il- [ , ] . increasing evidence supports the notion that the upr signalling synergistically interacts with virus-induced signalling to produce inflammatory cytokines and type i ifns. in addition, other lines of studies have shed light on a role of nod and nod receptors in transducing virus-related er stress signals to elicit inflammation [ ] . so far, however, a possible contribution of damps to the promotion of these signalling pathways has not been investigated. importantly, however, the effects of virus-induced upr have been observed not only to inhibit but also to potentiate viral infection. in fact, manipulation of the upr response has become an asset for many viruses to promote their translation, thereby leading to chronic er stress. in other words, during infection, viruses are capable of hijacking the host translational machinery and fill the er with viral proteins. for example, this is the case for many positive-strand rna viruses, which house the virus replication machinery in the protective er-membrane. in fact, viruses need host er to produce increased quantities of viral proteins to continue replication. intriguingly, as discussed [ ] , many viruses have evolved strategies aimed at continuing the replication cycle. thus, viruses were shown to manipulate the host upr in various ways to stimulate protein synthesis capacity and to improve cell survival by inhibiting cellular apoptosis. in particular, the link between the upr and autophagy are intensely discussed to be involved in this scenario. these two systems may act dependently, or the induction of one system may interfere with the other [ ] . thus, experimental studies could demonstrate that different viruses modulate these mechanisms to allow them to circumvent and bypass the host immune response or, worse, to exploit the host's defense to their advantage. according to current knowledge, rna viruses including influenza virus, poliovirus, coxsackievirus, enterovirus, japanese encephalitis virus, hcv, and dengue virus were shown to regulate these processes. for example, recent studies on hcv-infected hepatocytes confirmed the evidence that virus-associated er stress and upr are linked to cellular autophagy. thus, induction of the cellular autophagic response is reportedly required to improve survival of infected cells by inhibition of cellular apoptosis. moreover, the autophagic response was demonstrated to inhibit the cellular innate antiviral program that usually inhibits virus replication. nevertheless, as argued by the authors [ ] , though hcv induces er stress and autophagy, their cause-effect relationship is not clear. notably, the upr is not only modulated by viruses! recent evidence indicates that this stress response plays multiple roles during bacterial infections as well. thus, as comprehensively reviewed by celli and tsolis [ ] , the upr has been shown to be induced in murine lungs by m. tuberculosis (associated with apoptotic events) and also be correlated with helicobacter-induced gastric carcinogenesis. similarly, in vitro infectious models have revealed upr induction in macrophages and epithelial cells infected with either brucella melitensis and b. abortus or listeria monocytogenes. on the other hand, there is evidence suggesting that bacteria can subvert the upr for their own advantage. nevertheless, indications that bacteria can modulate this response are still somewhat sparse. one example refers to l. pneumophila that recruits components of the erad on its vacuole to mediate turnover of bacterial effectors on the vacuolar surface and uses the proteasome to generate amino acids necessary for its intracellular growth. interestingly, like with virus-induced upr, a bacteria-initiated upr may turn out to be of advantage for the host or in favor of the bacteria, but it is not clear in each case whether this response benefits the host or the pathogen. thus, as argued by celli and tsolis [ ] , "modulation of er function during infection by intracellular bacteria can promote bacterial infection by providing a replicative niche, but at the same time the resulting disruption of the secretory pathway can provide a pattern of pathogenesis that aids the innate immune system in recognizing intracellular infection and in mounting an appropriate defence. however, considering the more rapid evolution of bacterial pathogens compared to their hosts, it is likely that bacteria have evolved to modulate the upr to their advantage during infection." the function of the er stress-induced upr appears to provide another impressive evidence in support of the notion that immunity is induced by both sterile and infectious stressful injury and not primarily by invading nonself. in fact, this stress response is at the forefront of any stressful injury and is dedicated to initiating involvement of further damp-promoted defense mechanisms. however, like all the other innate immune processes, the upr may become uncontrolled. together, this may be reason enough that this stress response is involved in the pathogenesis of many human systemic and organ-specific disorders. in fact, the list of human diseases that are pathogenetically associated with er stress and activation of the upr is steadily growing [ ] [ ] [ ] . they include neurodegenerative and metabolic diseases, autoimmune disorders, and atherosclerosis. given such an ever-increasing list of diseases, it is not a surprise that chemical compounds and inhibitors targeting the upr signalling pathways with a high degree of specificity have been and will be further developed [ ] [ ] [ ] . maintaining genome integrity and transmission of intact genomes is a condition sine qua non for cellular, organismal, and species survival [ ] . this homeostatic integrity is threatened by dna damage that occurs in a variety of conditions including but not limited to ionizing radiation, chemical reactions, and viral infections, two of the most dominant conditions being oxidative stress and replication stress. these exogenous and endogenous factors induce diverse lesions in the dna such as nucleotide alterations (substitution, deletion, and insertion), bulky adducts, collapsed dna replication forks, ssbs, and dsbs (see refs [ , [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] ). in response to dna damage, cells initiate and activate a complex network of cellular signalling cascades that cooperate to sense and repair lesions in dna, denoted as the ddr. this stress response plays an important role in fighting against detrimental effects of cell stress and injury. it orchestrates many processes, including not only dna repair but also regulation of cell-cycle checkpoints, transcription of ddr genes, and autophagy ( fig. . ). the ddr is controlled by three pi k-related kinases (pikks): the atm, the dna-pk, and the atr kinases (mostly nuclear proteins). all three pikks are enormous polypeptides with similar domain organizations and various common structural features. equipped with the capability to autophosphorylate (see above however, at the beginning of this dance, sensors have recently been identified which, as prms in a wider sense or even as a new group of recognition receptors, are capable of detecting dna damage-as manifested by the toxic dsbs or generation of ssdna to activate these three pikks. as already briefly touched in part ii, sect. . . . , two highly conserved multiprotein complexes, mrn and ku, are considered the primary sensors of dsbs to subsequently activate atm and dna-pk [ , , ] . in addition, ssdna is sensed by the recognition molecule rpa that then, analogous to ku, acts as a signalling and repair platform for downstream factors such as the prp , an e ubiquitin ligase involved in pre-mrna splicing, and atr kinase [ ] . other lines of studies have provided evidence suggesting that the protein prp itself may act as the primary sensor of rpa-ssdna to subsequently activate atr [ ] (fig. . ) . in brief, upon dna damage, dsbs are recognized by the mrn complex. following recognition, mrn recruits atm to this dna lesion where it binds to the c-terminus of nbs as a component of mrn [ ] . following binding, atm kinase is activated. however, the exact mechanism whereby mrn activates atm is still not fully understood (discussed in [ ] ). recognition of dsbs is also realized by ku / heterodimer that has been shown to bind broken dsdna ends preferentially [ ] . it is the ku / then that recruits the catalytic subunit of dna-pk (dna-pkcs) at the site of dsbs to form the dna-pk holoenzyme. the major role of activated dna-pk is to promote a peculiar dna repair mechanism called non-homologous end joining (nhej), a pathway that repairs double-strand breaks in dna [ ] . in fact, nhej repairs most dsbs in mammalian cells. as its name implies, nhej involves ligation of two broken dna ends without needing a repair template [ ] . in contrast to atm and dna-pkcs which respond primarily to dsbs, atr is activated by a much wider range of genotoxic stresses, for example, reflected by exposure of increasing amounts of ssdna as a consequence of compromised activity of replication proteins or nucleolytic processing of various forms of damaged dna. in fact, first evidence suggests that ssdna is initially sensed by rpa which then recruits and activates prp [ ] . in turn, prp facilitates the accumulation of atr-interacting protein (atrip, the regulatory partner of the atr kinase) at dna damage sites, thereby activating atr [ , ] . plausibly, the recent discovery of dna damage-sensing molecules such as mrn, ku / , and rpa calls for the definition of those molecules they recognize, that is, broken dna ends at the site of dsbs and ssdna. as outlined in part iv, sect. . . , they have been tentatively sorted into a subclass of cell-intrinsically emitted damps (subclass iic- ). future studies will have to assign their exact place in the world of damps. apart from those damps emitted in the nucleus, other lines of studies lend support to the suggestion that dna damage or dna replication stress, in case of unsuccessful dna repair, promotes release of aberrant dna structures into the cytosol in the form of ssdna and dsdna breaks/fragments which operate as cell-intrinsically emitted dislocated damps. they are sensed by the dna receptor cgas and probably other as-yet-not-identified dna receptors to activate sting-dependent pathways to promote defense pathways [ , ] (compare part ii, sect. . . ; part iv, sect. . . ; as well as part vi, sect. . . ). as discussed by the authors [ ] , it is conceivable that cytosolic dna is released by dysfunctional mitochondria upon dna damage or generated during repair of damaged genomic dna. moreover, as shown in studies on tumor models, the ddr-through the activation of sting-mediated pathways-induces the expression of another class of constitutive damps which are exposed at the cell surface, namely, the mics and different ulbps [ ] [ ] [ ] [ ] [ ] (compare part iv, sect. . . ). of note, however, the ddr does not always result in a happy end. in fact, when unsuccessful in repairing dna damage, the ddr can lead to cellular senescence or-like a upr in case of irremediable er stress-ultimately induces an rcd, most often in the form of apoptosis, less frequently in the form of necrosis. such subroutines of rcd are presumably aimed at mitigating the propagation of potentially mutated cells leading to cancer or other age-related pathologies (fig. . ) [ , , ]. the dna damage response must be regarded as an efficient cell-intrinsic defense process, which is sophistically connected with other stress responses such as autophagy that also plays a significant role in maintaining genome stability [ ] and the er stress-induced upr [ ] . also, of particular interest is the observation that three tiers of damps are involved in this pathway, starting with broken dna ends at the site of dsbs and ssdna in the nucleus immediately generated upon dna damage; followed by dna fragments dislocated in the cytosol and operating as class iic- damps to promote intracellular innate immune signalling; and finishing with the exposure of subclass ib- damps (e.g., mics) able to activate nk, nkt, and γδ t cells (compare part vii, sects. . . , . . and . . ). this phenomenon may again reflect a particular hierarchy in the work of damps to maintain homeostasis. in fact, genomic integrity is of utmost importance and key to human health, and this is retained by the ddr. this stress response, however, may fail in maintaining and restoring homeostasis. accordingly, dna damage has been observed to play a causal role in numerous human pathologies associated with genome instability or aberrant pikk function such as cancer, leukemia, premature ageing, and certain chronic inflammatory conditions. furthermore, there is growing evidence suggesting that uncontrolled ddr signalling is associated with various neurodegenerative diseases, as documented by recent work showing that atm inhibition alleviates pathologies in models of huntington's disease [ ] . this is reason enough that ddr pathways have been and are being explored therapeutically to induce freedom from these diseases. in particular, small molecule inhibitors of atm, dna-pk, and atr are regarded as potential therapeutic agents, for example, as innovative drugs in cancer treatment [ , ] . an increasing amount of publications in the international literature provides convincing evidence that cell-intrinsic, cell-autonomous stress responses are initiated by any damage to a cell, regardless of being of sterile or infectious in nature. the ultimate goal of those responses, which are all highly conserved among eukaryotic species, is to maintain cellular homeostasis and ensure cell integrity. again this knowledge is in support of the concept that any form of host defense is primarily directed against injury and not against microbes. autophagy in terms of "immunological autophagy" appears to act as the center of all stress responses that is committed to control and regulate efferent innate and potential adaptive immune responses. the "medium" for achieving this goal is the mamps and/or damps which operate as a link between intracellular and extracellular events. another characteristic feature of cell-intrinsic stress responses is their interconnectivity and interdependency. a typical example is represented by oxidative stress and er stress responses, which appear to act mutually in any form of cellular damage. strikingly, this kind of interconnectedness is not restricted to a collaboration between the stress responses but expands to the elicitation of innate and adaptive immune processes. again, the damps-in terms of a particular hierarchy in their emission -appear to take center stage in this scenario. a better understanding of these multiple connections between cell-autonomous processes, on the one hand, and innate/adaptive immune responses, on the other hand, will undoubtedly stimulate new approaches to therapeutic interference with injury-induced pathologies. cell-autonomous stress responses in innate immunity autophagy in infection, inflammation and immunity autophagy induced by damps facilitates the inflammation response in lungs undergoing ischemia-reperfusion injury through promoting traf ubiquitination autophagy in human health and disease autophagy in yeast demonstrated with proteinase-deficient mutants and conditions for its induction autophagy: process and function regulation mechanisms and signaling pathways of autophagy eaten alive: a history of macroautophagy mammalian autophagy: core molecular machinery and signaling regulation molecular definitions of autophagy and related processes endocytosis and autophagy: exploitation or cooperation? regulated protein turnover: snapshots of the proteasome in action the n-end rule pathway: emerging functions and molecular principles of substrate recognition the ampk-skp -carm axis links nutrient sensing to transcriptional and epigenetic regulation of autophagy autophagy: cellular and molecular mechanisms autophagy fights disease through cellular self-digestion beclin , an essential component and master regulator of pi k-iii in health and disease the return of the nucleus: transcriptional and epigenetic control of autophagy tfeb links autophagy to lysosomal biogenesis zkscan is a master transcriptional repressor of autophagy hmof histone acetyltransferase is required for histone h lysine acetylation in mammalian cells ampk-skp -carm signalling cascade in transcriptional regulation of autophagy the significance of macroautophagy in health and disease mechanisms of mitophagy mitophagy transcriptome: mechanistic insights into polyphenol-mediated mitophagy er-phagy: selective autophagy of the endoplasmic reticulum er-phagy mediates selective degradation of endoplasmic reticulum independently of the core autophagy machinery eating the er and the nucleus for survival under starvation conditions a role for macro-er-phagy in er quality control microautophagy in mammalian cells: revisiting a -year-old conundrum emerging regulation and functions of autophagy microautophagy: lesser-known self-eating chaperone-mediated autophagy at a glance chaperone-mediated autophagy: a unique way to enter the lysosome world role of chaperone-mediated autophagy in metabolism chaperone-mediated autophagy: roles in disease and aging damps and autophagy: cellular adaptation to injury and unscheduled cell death oxidative stress-mediated hmgb biology premortem autophagy determines the immunogenicity of chemotherapy-induced cancer cell death molecular mechanisms of atp secretion during immunogenic cell death autophagy-dependent regulation of the dna damage response protein ribonucleotide reductase the atg atg conjugate associates with innate antiviral immune responses endogenous hmgb regulates autophagy mir- targets the ′ utr of hmgb and inhibits the hmgb -associated autophagy in osteosarcoma cells during chemotherapy hmgb -mediated autophagy promotes docetaxel resistance in human lung adenocarcinoma interference with hmgb increases the sensitivity to chemotherapy drugs by inhibiting hmgb -mediated cell autophagy and inducing cell apoptosis hmgb -mediated autophagy modulates sensitivity of colorectal cancer cells to oxaliplatin via mek/erk signaling pathway hmgb /rage axis promotes autophagy and protects keratinocytes from ultraviolet radiation-induced cell death hmgb -induced autophagy: a new pathway to maintain treg function during chronic hepatitis b virus infection autophagy occurs within an hour of adenosine triphosphate treatment after nerve cell damage: the neuroprotective effects of adenosine triphosphate against apoptosis the dna damage-regulated autophagy modulator dram links mycobacterial recognition via tlr-myd to autophagic defense dnase a deficiency uncovers lysosomal clearance of damaged nuclear dna via autophagy dram promotes the targeting of mycobacteria to selective autophagy autophagy downstream of endosomal toll-like receptors signaling in macrophages is a key mechanism for resistance to leishmania major infection the role of tlr in stress-dependent autophagy formation oxidative stress ros function in redox signaling and oxidative stress free radicals in biology and medicine effects of allopurinol and deferoxamine on reperfusion injury of the brain in newborn piglets after neonatal hypoxia-ischemia xanthine oxidoreductase-catalyzed reactive species generation: a process in critical need of reevaluation the role of oxidants and free radicals in reperfusion injury oxidative stress and ischemia-reperfusion injury in gastrointestinal tract and antioxidant, protective agents the nox family of ros-generating nadph oxidases: physiology and pathophysiology nox nadph oxidase mediates oxidative stress and apoptosis caused by tnf-alpha in cerebral vascular endothelial cells redox regulation of nox proteins nadph oxidases: functions and pathologies in the vasculature mitochondrial complex iii: an essential component of universal oxygen sensing machinery? superoxide production by cytochrome bc complex: a mathematical model cardiac mitochondria and reactive oxygen species generation mitochondrial reactive oxygen species trigger hypoxia-induced transcription sites of reactive oxygen species generation by mitochondria oxidizing different substrates mitochondrial formation of reactive oxygen species in vivo reactive oxygen species production induced by ischemia in muscle arterioles of mice: involvement of xanthine oxidase and mitochondria nuclear and cytoplasmic p suppress cell invasion by inhibiting respiratory complex-i activity via bcl- family proteins suppression of mitochondrial complex i influences cell metastatic properties specific disintegration of complex ii succinate:ubiquinone oxidoreductase links ph changes to oxidative stress for apoptosis induction the qo site of the mitochondrial complex iii is required for the transduction of hypoxic signaling via reactive oxygen species production exploring the role of mitochondrial uqcrb in angiogenesis using small molecules development of a novel class of mitochondrial ubiquinol-cytochrome c reductase binding protein (uqcrb) modulators as promising antiangiogenic leads superoxide generated at mitochondrial complex iii triggers acute responses to hypoxia in the pulmonary circulation impact of mitochondria and nadph oxidases on acute and sustained hypoxic pulmonary vasoconstriction redox signaling and reactive oxygen species in hypoxic pulmonary vasoconstriction hypoxia activates nadph oxidase to increase [ros]i and [ca +]i through the mitochondrial ros-pkcepsilon signaling axis in pulmonary artery smooth muscle cells glutathione and glutathione-dependent enzymes represent a co-ordinately regulated defence against oxidative stress the nrf regulatory network provides an interface between redox and intermediary metabolism keap represses nuclear activation of antioxidant responsive elements by nrf through binding to the amino-terminal neh domain nrf :inrf (keap ) signaling in oxidative stress regulation of nrf -an update molecular and cellular basis for the unique functioning of nrf , an indispensable transcription factor for maintaining cell homeostasis and organ integrity the keap -nrf system in cancer the keap -nrf pathway: promising therapeutic target to counteract ros-mediated damage in cancers and neurodegenerative diseases characterizations of three major cysteine sensors of keap in stress response keap , the cysteine-based mammalian intracellular sensor for electrophiles and oxidants keap as the redox sensor of the antioxidant response two domains of nrf cooperatively bind cbp, a creb binding protein, and synergistically activate transcription induction of cytoprotective genes through nrf /antioxidant response element pathway: a new therapeutic approach for the treatment of inflammatory diseases structure of the btb domain of keap and its interaction with the triterpenoid antagonist cddo an important role of nrf -are pathway in the cellular defense mechanism unique cistrome defined as csmbe is strictly required for nrf -smaf heterodimer function in cytoprotection toll-like receptor signaling induces nrf pathway activation through p -triggered keap degradation molecular mechanisms activating the nrf -keap pathway of antioxidant gene regulation conservation of the keap -nrf system: an evolutionary journey through stressful space and time heat shock, stress proteins, chaperones, and proteotoxicity heat shock in vertebrate cells allograft injury mediated by reactive oxygen species: from conserved proteins of drosophila to acute and chronic rejection of human transplants. part ii: role of reactive oxygen species in the induction of the heat shock response as a regulator of innate the stress of dying": the role of heat shock proteins in the regulation of apoptosis heat shock response modulators as therapeutic tools for diseases of protein conformation innate alloimmunity part : innate immunity and allograft rejection regulation of apoptosis by heat shock proteins pathways of allosteric regulation in hsp chaperones mechanisms for hsp secretion: crossing membranes without a leader extracellular heat shock proteins in cell signaling hsp is actively secreted by human tumor cells the odyssey of hsp from tumor cells to other destinations includes plasma membrane-associated stages and golgi and exosomal protein-trafficking modalities hsp is transported through the secretory pathway of -mca-induced fibrosarcoma tumour cells and undergoes n-glycosylation nucleocytoplasmic transport under stress conditions and its role in hsp chaperone systems membrane-bound heat shock proteins facilitate the uptake of dying cells and cross-presentation of cellular antigen emerging role of innate immunity in organ transplantation part ii: potential of damage-associated molecular patterns to generate immunostimulatory dendritic cells emerging role of innate immunity in organ transplantation: part i: evolution of innate immunity and oxidative allograft injury heat shock proteins and toll-like receptors photodynamic therapy: illuminating the road from cell death towards anti-tumour immunity molecular and translational classifications of damps in immunogenic cell death hyperthermic intraperitoneal chemotherapy leads to an anticancer immune response via exposure of cell surface heat shock protein roles of the heat shock transcription factors in regulation of the heat shock response and beyond reactive oxygen species play an important role in the activation of heat shock factor in ischemic-reperfused heart multifaceted roles of hsf in cancer heat shock factors: integrators of cell stress, development and lifespan in vitro activation of heat shock transcription factor dna-binding by calcium and biochemical conditions that affect protein conformation the hyperfluidization of mammalian cell membranes acts as a signal to initiate the heat shock protein response the membrane-associated transient receptor potential vanilloid channel is the central heat shock receptor controlling the cellular heat shock response in epithelial cells transient receptor potential vanilloid- mediates the effects of transient heat shock on endocytosis of human monocyte-derived dendritic cells the sterile inflammation in the exacerbation of hbvassociated liver injury requirement of heat shock protein for human hepatitis b virus reverse transcriptase function heat stress cognate host protein as a potential drug target against drug resistance in hepatitis b virus in vitro activity of cepharanthine hydrochloride against clinical wild-type and lamivudine-resistant hepatitis b virus isolates fever, immunity, and molecular adaptations heat shock proteins: possible biomarkers in pulmonary and extrapulmonary tuberculosis the dynamics of heat shock system activation in monomac- cells upon helicobacter pylori infection enterococcus faecium ncimb modulates epithelial integrity, heat shock protein, and proinflammatory cytokine response in intestinal cells the mammalian unfolded protein response signal integration in the endoplasmic reticulum unfolded protein response glycoprotein quality control and endoplasmic reticulum stress protein folding and mechanisms of proteostasis ferroptosis: process and function novel ferroptosis inhibitors with improved potency and adme properties perturbations in maturation of secretory proteins and their association with endoplasmic reticulum chaperones in a cell culture model for epithelial ischemia er stress, autophagy and immunogenic cell death in photodynamic therapy-induced anti-cancer immune responses the perks of damage-associated molecular patterns mediating cancer immunogenicity: from sensor to the plasma membrane and beyond immune modulation by er stress and inflammation in the tumor microenvironment cross talk between er stress, oxidative stress, and inflammation in health and disease overexpression of grp mitigates stress induction of glucose regulated proteins and blocks secretion of selective proteins in chinese hamster ovary cells bip: master regulator of the unfolded protein response and crucial factor in flavivirus biology regulation of innate immunity by signaling pathways emerging from the endoplasmic reticulum integrating the mechanisms of apoptosis induced by endoplasmic reticulum stress the impact of the unfolded protein response on human disease endoplasmic reticulum stress and oxidative stress in cell fate decision and human disease proteostasis control by the unfolded protein response endoplasmic reticulum stress and its regulator xbp- contributes to dendritic cell maturation and activation induced by high mobility group box- protein multi-layered molecular mechanisms of polypeptide holding, unfolding and disaggregation by hsp /hsp chaperones definition of the lectin-like properties of the molecular chaperone, calreticulin, and demonstration of its copurification with endomannosidase from rat liver golgi protein disulfide isomerase and nox: new partners in redox signaling the unfolded protein response: at the intersection between endoplasmic reticulum function and mitochondrial bioenergetics endoplasmic reticulum stress and oxidative stress: a vicious nexus implicated in bowel disease pathophysiology a novel pathway combining calreticulin exposure and atp secretion in immunogenic cancer cell death mechanisms of translocation of er chaperones to the cell surface and immunomodulatory roles in cancer and autoimmunity the co-translocation of erp and calreticulin determines the immunogenicity of cell death mechanisms of pre-apoptotic calreticulin exposure in immunogenic cell death er stress-induced clearance of misfolded gpi-anchored proteins via the secretory pathway reactive oxygen species at the crossroads of inflammasome and inflammation molecular mechanisms regulating nlrp inflammasome activation thioredoxin-interacting protein links oxidative stress to inflammasome activation ire- α induces thioredoxin-interacting protein to activate the nlrp inflammasome and promote programmed cell death under irremediable er stress endoplasmic reticulum stress in sepsis endoplasmic reticulum stress and nrf signaling in cardiovascular diseases rosinduced autophagy in cancer cells assists in evasion from determinants of immunogenic cell death autophagy and cellular immune responses autophagy as a stress response pathway in the immune system cell biology of ischemia/reperfusion injury cell death in the pathogenesis of heart disease: mechanisms and significance molecules and their functions in autophagy autophagy, a process within reperfusion injury: an update ros and autophagy: interactions and molecular regulatory mechanisms mitochondrial membrane permeabilization in cell death targeting post-mitochondrial effectors of apoptosis for neuroprotection apoptotic mechanisms after cerebral ischemia molecular definitions of cell death subroutines: recommendations of the nomenclature committee on cell death the mitochondrial permeability transition: a current perspective on its identity and role in ischaemia/reperfusion injury atp release from dying autophagic cells and their phagocytosis are crucial for inflammasome activation in macrophages calreticulin surface exposure is abrogated in cells lacking, chaperone-mediated autophagy-essential gene, lamp a when er stress reaches a dead end pharmacological inhibition of cystine-glutamate exchange induces endoplasmic reticulum stress and ferroptosis. elife arms race between enveloped viruses and the host erad machinery hepatitis c virus infection induces autophagy as a prosurvival mechanism to alleviate hepatic er-stress response respiratory syncytial virus and cellular stress responses: impact on replication and physiopathology nod and nod signalling links er stress with inflammation nod and nod : new functions linking endoplasmic reticulum stress and inflammation the expanding roles of endoplasmic reticulum stress in virus replication and pathogenesis signal integration via pkr expression of influenza virus hemagglutinin activates transcription factor nf-kappa b a novel signal transduction pathway from the endoplasmic reticulum to the nucleus is mediated by transcription factor nf-kappa b er stress, autophagy, and rna viruses bacteria, the endoplasmic reticulum and the unfolded protein response: friends or foes? the role of damage-associated molecular patterns in human diseases: part i -promoting inflammation and immunity the role of damage-associated molecular patterns (damps) in human diseases: part ii: damps as diagnostics, prognostics and therapeutics in clinical medicine recent insights into the role of unfolded protein response in er stress in health and disease preventing proteostasis diseases by selective inhibition of a phosphatase regulatory subunit ceapins are a new class of unfolded protein response inhibitors, selectively targeting the atf α branch modulating protein quality control what's the damage? the impact of pathogens on pathways that maintain host genome integrity role of oxidative stress and dna damage in human carcinogenesis oncogeneinduced reactive oxygen species fuel hyperproliferation and dna damage response activation oxidized extracellular dna as a stress signal that may modify response to anticancer therapy the emerging role of ros-generating nadph oxidase nox in dna-damage responses oxidative genome damage and its repair: implications in aging and neurodegenerative diseases dna replication and oncogene-induced replicative stress exploiting replicative stress to treat cancer replication stress and cancer maintaining genome stability at the replication fork the dna-damage response in human biology and disease the dna damage response: making it safe to play with knives autophagy in dna damage response dna damage-induced cell death: from specific dna lesions to the dna damage response and apoptosis dna damage sensing by the atm and atr kinases molecular mechanisms of mammalian dna repair and the dna damage checkpoints the trinity at the heart of the dna damage response double-strand break end resection and repair pathway choice mechanism and regulation of dna end resection in eukaryotes conserved modes of recruitment of atm, atr and dna-pkcs to sites of dna damage the dna-dependent protein kinase: a multifunctional protein kinase with roles in dna double strand break repair and mitosis two-stage synapsis of dna ends during non-homologous end joining prp transforms into a sensor of rpa-ssdna after dna damage and drives atr activation via a ubiquitin-mediated circuitry a phosphorylation-and-ubiquitylation circuitry driving atr activation and homologous recombination dna damage primes the type i interferon system via the cytosolic dna sensor sting to promote antimicrobial innate immunity sting-dependent cytosolic dna sensor pathways regulate nkg d ligand expression regulation of ligands for the nkg d activating receptor the dna damage pathway regulates innate immune system ligands of the nkg d receptor rae ligands for the nkg d receptor are regulated by sting-dependent dna sensor pathways in lymphoma atm-atr-dependent up-regulation of dnam- and nkg d ligands on multiple myeloma cells by therapeutic agents results in enhanced nk-cell susceptibility and is associated with a senescent phenotype biphasic ros production, p and bik dictate the mode of cell death in response to dna damage in colon cancer cells autophagy regulates dna repair through sqstm /p endoplasmic reticulum stress, genome damage, and cancer targeting atm ameliorates mutant huntingtin toxicity in cell and animal models of huntington's disease phosphatidylinositol -kinase (pi k) and phosphatidylinositol -kinase-related kinase (pikk) inhibitors: importance of the morpholine ring targeting atr in cancer medicine the mechanism of mitochondrial superoxide production by the cytochrome bc complex key: cord- - d a z authors: lewinski, martin; köster, tino title: systems approaches to map in vivo rna–protein interactions in arabidopsis thaliana date: - - journal: systems biology doi: . / - - - - _ sha: doc_id: cord_uid: d a z proteins that specifically interact with mrnas orchestrate mrna processing steps all the way from transcription to decay. thus, these rna-binding proteins represent an important control mechanism to double check which proportion of nascent pre-mrnas is ultimately available for translation into distinct proteins. here, we discuss recent progress to obtain a systems-level understanding of in vivo rna–protein interactions in the reference plant arabidopsis thaliana using protein-centric and rna-centric methods as well as combined protein binding site and structure probing. rna-binding proteins (rbps) are a diverse class of proteins that control every step of rna processing and rna function in the cell. they are characterized by dedicated domains involved in rna binding and can have accessory domains engaged in protein-protein interactions or enzymatic activities. in higher plants, rbp function so far has been best studied in the reference plant arabidopsis thaliana. among the rbps present in the arabidopsis genome are proteins with an rna recognition motif (rrm), the most abundant type of rna-binding domain, and k homology (kh) domain proteins first identified in mammalian heterogeneous nuclear protein hnrnp k (silverman et al. ). in addition, pumilio (pum) domain proteins, nine dead-box helicases as well as five proteins with cold shock domains (csds) have been identified (silverman et al. ) . another proteins harbor pentatricopeptide repeat (ppr) domains. ppr domains consist of multiple -amino acid repeats of which two are known to be engaged in specific rna recognition (barkan and small ) . these proteins are imported into mitochondria or chloroplasts and regulate all aspects of rna metabolism, e.g., rna editing, splicing, rna cleavage, and translation in organelles (schmitz-linneweber and small ; barkan and small ) . a suite of arabidopsis rbps have been experimentally characterized, mainly through loss-of-function mutants and transgenic plants ectopically overexpressing rbps. these approaches revealed a crucial role for rbps in development (kalyna et al. ; ripoll et al. ; kupsch et al. ; völz et al. ; ferrari et al. ; foley et al. ; teubner et al. ) , timing of plant reproduction (macknight et al. ; streitner et al. ; hornyik et al. ) , responses to abiotic stress (kim et al. b (kim et al. , c, (kim et al. , park et al. ), pathogen defense (fu et al. ; qi et al. ; jeong et al. ; lyons et al. ; nicaise et al. ), responses to phytohormones (lu and fedoroff ; hugouvieux et al. ; riera et al. ; carvalho et al. ; hackmann et al. ; löhr et al. ) , and circadian timekeeping (heintzen et al. ; staiger ; jones et al. ; schmal et al. ; perez-santángelo et al. ) . at the biochemical level, an impact of defined rbps on rna processing including pre-mrna splicing, end processing, processing of microrna precursors, and translation has been described (lopato et al. ; simpson et al. ; vazquez et al. ; dong et al. ; stauffer et al. ; ren et al. ; rühl et al. ; juntawong et al. ; sorenson and bailey-serres ; staiger ; carvalho et al. ) . recent attempts to comprehensively identify rbps, summarized in sect. , provided experimental evidence for rna binding for most of the previously identified arabidopsis rbps and identified a plethora of proteins with noncanonical rbds. systems approaches to describe rna-protein interactions globally come in two main flavors (fig. ) . in rna-centric approaches, proteins associated with mrnas are recovered by rna pull-down and identified by mass spectrometry, a technique referred to as mrna interactome capture (baltz et al. ; castello et al. ) (fig. a) . in protein-centric approaches, the focus is laid on a particular rbp. the rna complement associated with the rbp of interest, the ribonome, is identified via immunoprecipitation of the rbp from cell lysates and identification of the bound target rnas, initially by microarrays (tenenbaum et al. ; galgano and gerber ; guerreiro et al. ) or more recently via high throughput sequencing (licatalosi et al. ; könig et al. ; rossbach et al. ; müller-mcnicoll et al. ) (fig. b) . of all predicted rbps in arabidopsis, rna binding has only been experimentally confirmed for a limited number of them. a first attempt to globally identify proteins based on their ability to interact with mrnas in vivo was made for cultured arabidopsis cells (schmidt et al. ) . in this study, mrnas and interactors were recovered under native conditions by affinity chromatography on an oligo(dt) cellulose column followed by two-dimensional gel electrophoresis. the protein components were identified via maldi-tof. in the rna-bound proteome were a suite of rrm proteins including members of the family of glycine-rich rnabinding proteins like atgrp (arabidopsis thaliana glycine rich rna-binding protein ), atgrp and atgrp (lewinski et al. ) , the two oligouridylatespecific rbp and rbp proteins (lorkovic et al. ) , and csd proteins. in , mrna interactome capture was reported to comprehensively identify proteins interacting with mrnas in mammalian cells (baltz et al. ; castello et al. ). this technique employs in vivo cross-linking of mrna and bound proteins by uv light irradiation. the rna-protein complexes are recovered by pull-down of polyadenylated rnas using magnetic beads coated with oligo(dt). proteins are released by rnase treatment, subjected to tryptic digest and identified via mass spectrometry (fig. a) . following these pioneering studies, this technique was applied to a wide range of organisms including yeast, drosophila melanogaster, caenorhabditis elegans, leishmania, trypanosomes, and plasmodium (mitchell et al. ; beckmann et al. ; matia-gonzalez et al. ; bunnik et al. ; lueong et al. ; sysoev et al. ; wessels et al. ; nandan et al. ) . a minimal core mrna bound proteome occurring in both human and yeast was defined by beckmann and coworkers (beckmann et al. ) . lately, mrna interactome capture has also been successfully applied to arabidopsis (marondedze et al. ; reichel et al. ; zhang et al. ). the first mrna interactome capture experiments in arabidopsis employed widely differing tissues to catalog rbps. gueten and coworkers chose protoplasts, cells without a cell wall, assuming that uv cross-linking should occur as efficiently as xing et al. , or relative to polyadenylated rna, e.g., meyer et al. . in iclip (könig et al. ) , rna-protein complexes are subjected to rnase treatment. bound proteins are digested with proteinase, leaving a polypeptide at the crosslink site. reverse transcriptase stops there, allowing the detection of the cross-link site at the − position of the processed sequencing reads in mammalian cell monolayers (zhang et al. ) . leaf mesophyll protoplasts are also widely used in transient assays to study the regulation of gene expression. a mesophyll protoplast mrna interactome was defined with a total of proteins based on enrichment in cross-linked samples vs. non-cross-linked controls with a log fold change above (zhang et al. ) . of these, one class was represented by ribosomal proteins of which were also present in the core mrna-bound proteome of human and yeast cells (beckmann et al. ) . the second class comprised proteins with a known rbd. for of them, a role in mrna binding and rna biology had already been described while the remaining proteins had a potential role in mrna processing. moreover, of the rbps in the second class overlapped with the rbps identified in the native oligo(dt) affinity chromatography approach (schmidt et al. ) . the third class comprised candidate rbps. of these, were metabolic enzymes, mainly oxidoreductases. moreover, numerous proteins related to photosynthesis were found. as these are generally strongly expressed, their rna binding activity and the domains involved beg for an independent validation. one of the enzymes was the arabidopsis ortholog of phosphoglycerate kinase whose rna binding capacity has previously been validated in yeast and human cells (beckmann et al. ). another mrna interactome capture experiment employed -days-old etiolated arabidopsis seedlings (reichel et al. ). this was based on the rationale that uv-absorbing pigments present in green plant tissue may interfere with uv crosslinking in planta and their absence in etiolated tissue may allow more efficient uv cross-linking. around of the proteins identified altogether were significantly enriched in uv cross-linked samples vs. non-cross-linked controls with a false discovery rate below % and designated the "at-rbp set." eighty percent of these have a known rbd, and % have been linked to rna biology. more than additional proteins did not meet the significance criteria applied for the "at-rbp set" and were classified as "candidate rbps." notably, of the computationally predicted rrm proteins in arabidopsis were detected in the input fraction in etiolated seedlings (silverman et al. ). half of these were recovered in the "at-rbd set" and another were present among the "candidate rbps." similarly, seven of the predicted kh proteins were present in the "at-rbd set" and were among the "candidate rbps." of the predicted members of the ppr protein family only were detected in the input fraction, likely due to low abundance (schmitz-linneweber and small ; reichel et al. ) . only six ppr proteins were found in the "at-rbp set" and another twelve in the "candidate rbps," likely because most rnas in the organelles lack poly(a) tails. a comparison of the identified proteins to the mrna interactome in other model organisms revealed that were present in the interactomes of humans (baltz et al. ; beckmann et al. ) , mice (kwon et al. ; liao et al. ) , and yeast (beckmann et al. ) and were assigned to basic functions in rna metabolism such as translation, splicing, and rna unwinding. in addition to rbps with known rbds many arabidopsis proteins emerged that have not been linked to rna binding so far. among novel rbps were proteins harboring a yt -b homology (yth) domain (li et al. ) . yth domain proteins have been shown to bind n methyladenosine and thus serve as readers of the m a mark in mammals (wang et al. ). in addition, alba domain containing proteins have been identified. alba domain proteins are well characterized in archaebacteria where they act as transcriptional repressors and in other eukaryotes where they control translation (goyal et al. ) . in plants, they have not yet been functionally characterized. the only observation pointing to rna binding is the recovery of an arabidopsis alba domain protein by rna-affinity chromatography (gosai et al. ) . whirly domain containing proteins have been characterized as single-stranded dna binding proteins in organelles (krause et al. ) and in maize, association of a whirly protein with chloroplast transcripts has been observed (prikryl et al. ). the identification of three whirly proteins in the etiolated seedling interactome (reichel et al. ) and of whirly upon oligo(dt) affinity chromatography in arabidopsis cells (schmidt et al. ) now provides evidence for global in vivo rna binding. in addition, a plethora of proteins with potential rna binding activity have been detected. to substantiate their rna-binding properties, independent replication is desirable. among those are proteins with the domain of unknown function , cytoskeletal proteins, and photoreceptors. the identification of plasma membrane intrinsic proteins has led to the speculation that aquaporins may be involved in transport of rnas between cells (reichel et al. ). another mrna interactome capture experiment was performed on cell suspension cultures generated from roots of the arabidopsis accessions col- and landsberg erecta. in parallel, leaves of four-weeks-old arabidopsis col- plants were investigated (marondedze et al. ) . of proteins identified altogether in these three samples, appeared only in uv cross-linked samples, and proteins were significantly enriched upon uv cross-linking relative to non-cross-linked samples. more than proteins were known rbds whereas were novel candidate rbps not previously assigned an rna-related function or known rbd, including many enzymes of intermediary metabolism, and thus await further experimental proof (marondedze et al. ). the discovery of many novel rbps begs for further investigation of the rnabinding properties of these proteins. accordingly, methods to define rna targets of candidate rbps genome wide using protein-centric methods have recently been adapted for the use in arabidopsis, as discussed below. approaches to globally identify in vivo targets of an rbp in arabidopsis mostly rely on transgenic plants expressing an epitope-tagged version of the rbp. immunopurification is performed via an antibody directed against the epitope tag. to mirror-image the endogenous expression pattern, authentic promoters are used and the constructs are introduced into a loss-of-function mutant (köster and staiger ) . alternatively, endogenous rbps can be recovered with dedicated antibodies. to freeze the in vivo rna-protein interactions before cell lysis, cross-linking is performed by exposing plants to formaldehyde in rna immunoprecipitation (rip) or by uv irradiation in uv cross-linking and immunoprecipitation (clip) (fig. b) . formaldehyde efficiently cross-links nucleic acids and proteins in vivo but also cross-links proteins. thus, not only direct targets are recovered. this is circumvented by using nm uv light that cross-links proteins directly binding to nucleic acids in the neighborhood of the excited nucleobase but does not cross-link proteins. to date, a comprehensive determination of in vivo targets, the ribonome, has been performed for only a few arabidopsis rbps, both nucleocytoplasmic proteins and chloroplast-localized proteins with different tasks in posttranscriptional regulation. in the subsequent sections, selected examples are presented. hlp is an arabidopsis rbp resembling mammalian hnrnp a/b-like proteins (zhang et al. ) . high throughput sequencing (hits)-clip of hlp fused to gfp and expressed under control of the strong, constitutive cauliflower mosaic virus s rna promoter identified above transcripts bound in vivo (zhang et al. ) . when endogenous hlp protein was precipitated by a specific antibody, transcripts bound in vivo were detected with an overlap of above transcripts to the hlp -gfp precipitation. the prevalence of cross-linked regions near polyadenylation sites provoked the hypothesis that hlp may control polyadenylation. indeed, in more than transcripts the distal polyadenylation site was preferred over the proximal polyadenylation site in hlp mutant plants. around % of these transcripts were also recovered by hlp hits-clip, pointing to a role for hlp in the control of alternative polyadenylation, at least partly by direct binding. in line with this, meme motifs overrepresented in the crosslink regions, namely a-rich ( -agaaaa- ) and u-rich ( -uuuucu- ) motifs, resembled motifs enriched in the vicinity of the poly(a) site, -aaagaaaa- and -uguuuc- . the presence of cross-link regions in other parts of the transcripts apart from the untranslated region (utr) suggests that hlp may also affect other aspects of pre-mrna processing in addition to polyadenylation. atgrp (arabidopsis thaliana glycine rich rna-binding protein ) is another hnrnp-like protein with an n-terminal rrm and a c-terminus enriched in contiguous glycine residues. atgrp is regulated by the circadian clock and negatively autoregulates its own oscillations by alternative splicing and nonsense-mediated decay (staiger et al. ; schmal et al. ) . additionally, it is involved in several steps of posttranscriptional regulation including alternative splicing, nucleic acid chaperone function, and pri-mirna processing (kim et al. a; streitner et al. ; . to gain insights into the breadth of its in vivo targets, individual nucleotide resolution cross-linking and immunoprecipitation (iclip) and rip-seq were performed . atgrp fused to gfp was expressed from its own promoter including all regulatory elements ( utr, intron, and utr) in the atgrp - loss-of-function mutant. in parallel, transgenic plants expressing gfp alone or an rna-binding dead variant of atgrp with a single conserved arginine in the rrm mutated to glutamine (atgrp r q) were used as negative controls. iclip identified transcripts with significant iclip hits in four out of five biological replicates for atgrp -gfp that were not present in the controls. rip-seq identified transcripts enriched by atgrp -gfp relative to total polyadenylated rna. the higher number may be due to the higher cross-linking efficiency of formaldehyde compared to uv light, and the recovery of many indirect targets. transcripts were common in both data sets, suggesting that they represent a set of high confidence binders. the iclip cross-link sites were observed in all transcript regions, the utrs, coding sequence and introns. after correcting for the length of the feature in the genome, cross-link sites in the utr prevailed. conserved motifs in the vicinity of the cross-link sites generally were u/c rich. to determine how atgrp may impact its downstream targets, the binding targets were cross-referenced against transcriptome data from atgrp overexpressing plants or loss-of-function mutants. in both, the atgrp overexpressors or the mutant, a similar number of transcripts was expressed at elevated or reduced levels compared to wild-type plants. notably, significantly more differentially expressed iclip targets were downregulated in atgrp -overexpressors than upregulated. in turn, more of the differentially expressed atgrp iclip targets were expressed at elevated in the mutant than at reduced levels. this indicates a predominantly negative effect of atgrp on its targets. among the targets were more circadianly regulated transcripts than expected. in particular, elevated atgrp levels lead to damping of circadian oscillations of target transcripts including dor-mancy/auxin associated family protein and ccr-like. this conforms with the idea that the circadian clock regulated atgrp functions as a molecular slave oscillator, conveying temporal information from the core circadian clock within the cell (rudolf et al. ). in addition, changes in splicing patterns were observed for iclip and rip-seq targets upon misexpression of atgrp , confirming a role for atgrp in the control of alternative splicing. arabidopsis thaliana serine/arginine rich (sr)-like protein sr , the counterpart of metazoan rnps , is an sr-like protein with two rs domains, flanking either side of the rrm (badolato et al. ; golovkin and reddy ) . notably, recombinant arabidopsis sr can activate splicing of a β-globin splicing reporter in hela cell s extracts (ali et al. ). sr occurs in two splice isoforms that arise through differential usage of a splice site in intron . this leads to two protein isoforms that differ by seven amino acid residues and in their function: sr . is involved in petal development in flowers, whereas sr . is important for root growth (zhang and mount ) . genome-wide targets for sr . were determined during early seedling development (xing et al. ) and in inflorescences (zhang et al. ) , respectively. in seedlings, rip-seq identified transcripts from genes that were enriched upon precipitation of sr . -gfp from nuclei of transgenic plants compared to mock precipitation from wild type plants (xing et al. ) . these were designated sars, for sr associated rnas. a gene ontology term analysis showed that of abscisic acid (aba) signaling genes ( %) were among the sars, in line with a function for sr in the aba signaling pathway (carvalho et al. ) . hundred and forty-eight of the sars had an altered expression in the sr - mutant, suggesting that binding of sr has functional consequences. a meme search for sr binding motifs revealed four overrepresented motifs within sar genes. two g/a rich motifs are largely positioned within exons and show strong similarity to the binding motifs of two metazoan splicing regulators transformer (tra ) and serine/arginine-rich splicing factor (srsf ). furthermore, one g/a rich motif closely resembles the gaag motif, a known cisregulatory element in regulating alternative splicing in plants. in contrast, two u/c rich motifs peak within intronic regions near and splice sites, in line with the observation that the majority of sars were from intron-containing genes and the known role as a splicing regulator (xing et al. ) . to gain insights into a potential role of sr in flower development, rip-seq was performed for sr . -gfp in inflorescence tissue (zhang et al. ) . the resulting reads were analyzed by two different bioinformatics pipelines, one based on mapping reads to the genome and one directly quantifying annotated transcripts. sars in inflorescence were defined based on a twofold enrichment compared to gfp only controls and the identification by both pipelines. of sars in inflorescence, overlapped with the sars in seedlings. notably, transcripts encoding splicing factors were among the sars including sr itself, the three sr proteins sr , sr , and scl , the pre-mrna processing factors prp , prp a, prp b, and prp , and the rna helicase rh , pointing to a hierarchical regulation of posttranscriptional regulators (keene ) . genes upregulated in the sr - mutant are enriched for defense response genes. indeed, the sr - mutant was more resistant to bacterial and fungal pathogens. of upregulated defense response genes in sr - , were sars. thus, sr has an additional role as a negative regulator of plant immunity. furthermore, of the inflorescence sars were aberrantly spliced in the sr - mutant. determination of potential sr binding sites in inflorescence sars uncovered an overrepresentation of the purine-rich motifs ggngg, gngga, and gnggnng. importantly, ggngg and related motifs are enriched in introns and exons that are alternatively spliced in the sr - mutant, irrespective of the splicing event is favored or suppressed by sr . this led to the suggestion that sr identifies regions for alternative splicing and acts as a facilitator for other splicing factors. however, the identified binding motifs for sr in inflorescences differ from that in seedlings, which might be in part due to the different bioinformatic tools used for motif determination. both rip-seq data sets nevertheless strengthen sr 's key role as an important splicing factor in arabidopsis. however, in both rip-seq experiments intron-less transcripts were identified in addition to intron-containing transcripts, pointing to functions of sr beyond its known role in pre-mrna splicing. interestingly, a comparison between the u/c-rich motifs of atgrp and the u/crich motifs of sr identified by meme in seedlings revealed a high degree of similarity . the functional significance remains to be tested. in bacteria, csps are upregulated upon cold stress and destabilize rna secondary structure at low temperatures (sommerville ) . to elucidate a potential involvement of arabidopsis csps in the regulation of cold responsive genes, rip followed by gene chip analysis was performed for csp (juntawong et al. ) . more than mrnas were identified. comparison of these csp -associated transcripts in total rna and rna loaded onto polysomes revealed an enrichment of mrnas associated with ribosome biogenesis in the pool of actively translating rnas. the high gc content in utrs of these mrnas suggested that csp is involved in removing secondary structures in the utr to facilitate their translation. accordingly, these mrnas were less efficiently loaded onto polysomes at low temperature in the atcsp - mutant compared to wild type plants or csp overexpressing plants (juntawong et al. ). the highly abundant chloroplast ribonucleoproteins (cprnps) have been well characterized for their role in regulating chloroplast transcripts (ohta et al. ) . the cprnps comprise an acidic domain and two rrms. they are encoded in the nucleus and imported into chloroplasts. mutants in distinct cprnps are widely affected in processing of transcripts in the chloroplast, leading to defects in chloroplast development and, consequently, plant performance owing to the essential role of the chloroplast in photosynthetic energy (ruwe et al. ) . for example, mutants deficient in cp a ( kda chloroplast protein a) and cp a ( kda chloroplast protein a) showed gross defects at low ambient temperature. rip performed with antibodies against the endogenous proteins and subsequent hybridization of coprecipitated rnas on tiling arrays covering the arabidopsis chloroplast genome (rip-chip) showed that cp a and cp a associate with large overlapping sets of chloroplast transcripts including strong enrichment for psbb, psbd, psaa/b, atpb, ndhb and intermediate enrichment for almost all chloroplast mrnas (kupsch et al. ) . both cp a and cp a are required for accumulation of chloroplast mrnas under cold stress. furthermore, binding of cp a to ends of certain transcripts serves to protect these transcripts against exonuclease activity (kupsch et al. ) . together with the known role of cp a in rna (tillich et al. ) this points to multiple functions in posttranscriptional regulation in chloroplasts. for cp a ( kda chloroplast protein a), rip-chip revealed an association with a large body of chloroplast mrnas (teubner et al. ) . a global reduction in mrnas and proteins making up the photosynthetic apparatus was found in the cp a mutant. in line with a crucial role for cp a in the development of the photosynthetic apparatus, cp a null mutants have an albino phenotype and are not able to survive without external sucrose supply (teubner et al. ). in contrast to the broad substrate specificity of the cprnps, a very narrow substrate specificity was found for a representative of the ppr class of nuclear-encoded rbps that are imported into organelles. atcpr (arabidopsis thaliana chloroplast rna processing ) is important for the production of subunits of the thylakoid protein complexes (ferrari et al. ) . atcpr mutants are yellow-white because the subunits of the photosynthetic apparatus do not accumulate. rip-chip was performed for atcpr under native conditions. hybridization of bound targets to chloroplast tiling arrays revealed specific binding of atcpr to only few transcripts, the psac transcript encoding a photosystem i subunit, petb-petd encoding cytochrome b and the subunit iv of the cytochrome b /f complex. because during rip rnase was used to digest unprotected rna, it was possible to delineate the binding regions. binding to the petb-petd intergenic region correlated with a requirement for processing of the polycistronic transcript comprising petb and petd (ferrari et al. ) , thus providing proof for the functional relevance of the observed in vivo binding. in addition to rna sequence, rna secondary structure also strongly influences the interaction of rbps with their cognate rna binding motifs (cruz and westhof ; vandivier et al. ) . rna structure may facilitate binding of rbds with a preference for double-stranded rna or inhibit binding of rbps with a preference for single-stranded rna. protein interaction profile sequencing (pip-seq) allows simultaneous delineation of in vivo rna secondary structure and protein-protected sites (ppss) (fig. ) (gosai et al. ) . to identify ppss, samples are treated with a single-strand specific or double-strand specific rnase. proteins are then denatured before library preparation. to determine the rna secondary structure, proteins are denatured by sds and removed by protease digestion to make sites protected by proteins in vivo accessible for rnases. collectively, motifs that are enriched in the samples used to determine protein protected sites compared to the samples used for structure determination are in vivo target sites of rbds. gregory and coworkers applied pip-seq to the nuclei of two specific cell types in the arabidopsis roots that derive from epidermal cells through distinct differentiation, those cells bearing root hairs and those that do not (foley et al. ) . distinct protein binding patterns were detected, and binding motifs either specific to hair cells, non-hair cells or common to both cell types were determined. to identify candidate proteins, rna affinity chromatography was performed on immobilized oligonucleotides derived from enriched motifs. a ggn repeat motif enriched in sites protected in both hair cells and non-hair cells recovered serrate (se) from root lysates, a zinc finger containing rbp involved in processing of mirna precursors. a tg rich motif enriched in hair cell-specific protected sites identified atgrp , atgrp and atgrp . subsequently, atgrp was shown to regulate root hair development at the posttranscriptional level. an advantage of pip-seq is that it does not rely on an antibody to identify target sites within bound transcripts. in contrast, subsequent identification of the cognate binding proteins requires in vitro binding techniques. thus, binding in vivo has to be confirmed by independent means. the recent mrna interactome capture studies are very valuable in having established uv cross-linking and oligo(dt) affinity capture to determine the mrna binding proteome also in arabidopsis. a large number of previously predicted rbps in arabidopsis were now identified experimentally and many novel proteins without a previous assignment to rna biology unearthed. reichel and colleagues noticed a bias toward proteins with higher abundance in the interactome compared to the input (reichel et al. ) , suggesting that additional proteins with lower expression level may still be identified in the future. only few of the mrna interacting proteins were present in all three interactomes ). this may partly be attributed to the widely differing developmental stages investigated. among the commonly identified proteins are numerous cytoplasmic ribosomal proteins from the small and large ribosomal subunits, likely due to their high abundance, as well as the ubiquitously expressed glycine-rich rbps atgrp and atgrp . future applications are the dynamics of posttranscriptional networks in response to endogenous and exogenous stimuli cues by describing changes in the mrna bound proteomes. furthermore, as proteins binding to nonpolyadenylated rnas obviously remain elusive in these approaches, transcript-specific approaches have to be developed. transcriptome-wide identification of target transcripts bound by selected rbps in vivo has overcome a major limitation in research on plant rna-based regulation. nevertheless, except for the ppr proteins, we are still far from understanding the exact binding specificity of most proteins and the consequences in vivo binding has for the targets. to correlate in vivo binding with function, the impact of mutated candidate binding motifs on rbp binding and target gene expression has to be determined. most bioinformatics pipelines today discussing motif discovery are limited to sequence data. current efforts focus on developing bioinformatics pipelines for identifying conserved motifs taking rna structure context into consideration (maticzka et al. ) . molecular dynamics of rna molecules are still compute intensive but can shed light on possible interaction sites and three dimensional structures (tuszynska et al. ; boniecki et al. ) . finally, heterogeneous datasets and analyses, fusing several kinds of sources, can improve meta-analysis with in silico and in vivo datasets. this is yet limited in arabidopsis but will improve the information quality in the near future. additionally, it will be important to have comprehensive databases on rbp target sites linked to the arabidopsis information portal (the international arabidopsis informatics consortium ). such resources will be of great value to improve a systems understanding of rnaprotein interaction. regulation of plant developmental processes by a novel splicing factor identification and characterisation of a novel human rna-binding protein the mrna-bound proteome and its global occupancy profile on protein-coding transcripts pentatricopeptide repeat proteins in plants the rna-binding proteomes from yeast to man harbour conserved enigmrbps simrna: a coarse-grained method for rna folding simulations and d structure prediction the mrna-bound proteome of the human malaria parasite plasmodium falciparum the plant-specific sr protein negatively regulates glucose and aba signaling during early seedling development in arabidopsis the arabidopsis sr splicing factor, a negative regulator of sugar signaling, modulates snf -related protein kinase stability insights into rna biology from an atlas of mammalian mrna-binding proteins the dynamic landscapes of rna architecture the rna-binding proteins hyl and se promote accurate in vitro processing of pri-mirna by dcl crp protein: (dis)similarities between arabidopsis thaliana and zea mays a global view of rna-protein interactions identifies post-transcriptional regulators of root hair cell fate a type iii effector adp-ribosylates rna-binding proteins and quells plant immunity rna-binding protein immunopurification-microarray (rip-chip) analysis to profile localized rnas an sc -like protein and a novel serine/arginine-rich protein interact with arabidopsis u - k protein global analysis of the rna-protein interaction and rna secondary structure landscapes of the arabidopsis nucleus the alba protein family: structure and function genome-wide rip-chip analysis of translational repressor-bound mrnas in the plasmodium gametocyte salicylic acid-dependent and -independent impact of an rna-binding protein on plant immunity a light-and temperature-entrained circadian clock controls expression of transcripts encoding nuclear proteins with homology to rna-binding proteins in meristematic tissue the spen family protein fpa controls alternative cleavage and polyadenylation of rna an mrna cap binding protein, abh , modulates early abscisic acid signal transduction in arabidopsis structure function analysis of an adp-ribosyltransferase type iii effector and its rna-binding target in plant immunity mutation of arabidopsis spliceosomal timekeeper locus causes circadian clock defects cold shock protein chaperones mrnas during translation in arabidopsis thaliana ectopic expression of at rsz reveals its function in splicing and causes pleiotropic changes in development rna regulons: coordination of post-transcriptional events cold shock domain proteins and glycine-rich rna-binding proteins from arabidopsis thaliana can promote the cold adaptation process in escherichia coli functional characterization of a glycine-rich rna-binding protein in arabidopsis thaliana under abiotic stress conditions a zinc finger-containing glycine-rich rna-binding protein, at rz- a, has a negative impact on seed germination and seedling growth of arabidopsis thaliana under salt or drought stress conditions functional characterization of dead-box rna helicases in arabidopsis thaliana under abiotic stress conditions glycine-rich rna-binding proteins are functionally conserved in arabidopsis thaliana and oryza sativa during cold adaptation process iclip reveals the function of hnrnp particles in splicing at individual nucleotide resolution rna-binding protein immunoprecipitation from whole-cell extracts regulation of pri-mirna processing by the hnrnplike protein atgrp in arabidopsis rna-binding proteins revisited: the emerging arabidopsis mrna interactome whirly proteins as communicators between plant organelles and the nucleus? arabidopsis chloroplast rna binding proteins cp a and cp a associate with large transcript pools and confer cold stress tolerance by influencing multiple chloroplast rna processing steps the rna-binding protein repertoire of embryonic stem cells genome-wide identification and phylogenetic analysis of plant rna binding proteins comprising both rna recognition motifs and contiguous glycine residues genome-wide identification, biochemical characterization, and expression analyses of the yth domain-containing rna-binding protein family in arabidopsis and rice the cardiomyocyte rna-binding proteome: links to intermediary metabolism and heart disease hits-clip yields genome-wide insights into brain alternative rna processing a glycine-rich rna-binding protein affects gibberellin biosynthesis in arabidopsis ) atsrp , one of two sf /asf-like proteins from arabidopsis thaliana, regulates splicing of specific plant genes rbp and rbp , two oligouridylatespecific hnrnp-like proteins interacting with poly(a)+ rna in nuclei of plant cells a mutation in the arabidopsis hyl gene encoding a dsrna binding protein affects responses to abscisic acid, auxin, and cytokinin gene expression regulatory networks in trypanosoma brucei: insights into the role of the mrna-binding proteome the rna-binding protein fpa regulates flg -triggered defense responses and transcription factor activity by alternative polyadenylation fca, a gene controlling flowering time in arabidopsis, encodes a protein containing rna-binding domains the rna-binding protein repertoire of arabidopsis thaliana conserved mrna-binding proteomes in eukaryotic organisms graphprot: modeling binding preferences of rnabinding proteins adaptation of iclip to plants determines the binding landscape of the clock-regulated rna-binding protein atgrp global analysis of yeast mrnps sr proteins are nxf adaptors that link alternative rna processing to mrna export comprehensive identification of mrna-binding proteins of leishmania donovani by interactome capture pseudomonas hopu affects interaction of plant immune receptor mrnas to the rna-binding protein grp three types of nuclear genes encoding chloroplast rnabinding proteins (cp , cp and cp ) are present in arabidopsis thaliana: presence of cp in chloroplasts and its homologue in nuclei/cytoplasms cold shock domain proteins affect seed germination and growth of arabidopsis thaliana under abiotic stress conditions role for lsm genes in the regulation of circadian rhythms a member of the whirly family is a multifunctional rna-and dna-binding protein that is essential for chloroplast biogenesis a putative rna-binding protein positively regulates salicylic acid-mediated immunity in arabidopsis in planta determination of the mrna-binding proteome of arabidopsis etiolated seedlings regulation of mirna abundance by rna binding protein tough in arabidopsis arabidopsis rna-binding protein uba a relocalizes into nuclear speckles in response to abscisic acid pepper, a novel k-homology domain gene, regulates vegetative and gynoecium development in arabidopsis crosslinking-immunoprecipitation (iclip) analysis reveals global regulatory roles of hnrnp l slave to the rhythm polypyrimidine tract binding protein homologs from arabidopsis are key regulators of alternative splicing with implications in fundamental developmental processes the rna-recognition motif in chloroplasts a circadian clock-regulated toggle switch explains atgrp and atgrp oscillations in arabidopsis thaliana a proteomic analysis of oligo(dt)-bound mrnp containing oxidative stress-induced arabidopsis thaliana rna-binding proteins atgrp and atgrp pentatricopeptide repeat proteins: a socket set for organelle gene expression genomic era analyses of rna secondary structure and rna-binding proteins reveal their significance to post-transcriptional regulation in plants fy is an rna end-processing factor that interacts with fca to control the arabidopsis floral transition activities of cold-shock domain proteins in translation control selective mrna sequestration by oligouridylate-binding protein contributes to translational control during hypoxia in arabidopsis rna-binding proteins and circadian rhythms in arabidopsis thaliana shaping the arabidopsis transcriptome through alternative splicing the circadian clock regulated rna-binding protein atgrp autoregulates its expression by influencing alternative splicing of its own pre-mrna polypyrimidine tract-binding protein homologues from arabidopsis underlie regulatory circuits based on alternative splicing and downstream control the small glycine-rich rna-binding protein atgrp promotes floral transition in arabidopsis thaliana an hnrnp-like rna-binding protein affects alternative splicing by in vivo interaction with target transcripts in arabidopsis thaliana global changes of the rna-bound proteome during the maternal-to-zygotic transition in drosophila identifying mrna subsets in messenger ribonucleoprotein complexes by using cdna arrays the rrm protein cp a is a global ligand of chloroplast mrnas and is essential for plastid biogenesis and plant development taking the next step: building an arabidopsis information portal chloroplast ribonucleoprotein cp a is required for editing and stability of specific chloroplast mrnas npdock: a web server for protein-nucleic acid docking the conservation and function of rna secondary structure in plants the nuclear dsrna binding protein hyl is required for microrna accumulation and plant development, but not posttranscriptional transgene silencing lachesis-dependent egg-cell signaling regulates the development of female gametophytic cells n -methyladenosine-dependent regulation of messenger rna stability the mrna-bound proteome of the early fly embryo transcriptome-wide identification of rna targets of arabidopsis serine/arginine-rich uncovers the unexpected roles of this rna binding protein in rna processing two alternatively spliced isoforms of the arabidopsis thaliana sr protein have distinct roles during normal plant development integrative genome-wide analysis reveals hlp , a novel rnabinding protein, regulates plant flowering by targeting alternative polyadenylation uv crosslinked mrna-binding proteins captured from leaf mesophyll protoplasts transcriptome analyses reveal sr to be a neutral splicing regulator and a suppressor of innate immunity in arabidopsis thaliana the work in tino köster's lab is supported by the dfg through grant ko / - . martin lewinski is supported by the dfg through grant sta / - to dorothee staiger. key: cord- -pgyzluwp authors: nan title: programmed cell death date: journal: experientia doi: . /bf sha: doc_id: cord_uid: pgyzluwp nan it is widely held that all developmental cell death is of a single type (apoptosis) and that neuronal death is primarily for adjusting the number of neurons in a population to the size of their target field through competition between equals for target-derived factors. we shall draw on our research and on that of others to criticize these views and replace them by the following. at least three types of neuronal death occur, only one of which resembles apoptosis; a neuron can choose between several self-destruct mechanisms depending on the cause of its death. the purpose of the death is to regulate connectivity, not neuron number. competitors for trophic factors are unequal, and many losers have made axonal targeting errors. a neuron's survival and differentiation depend on multiple anterograde and retrograde signals. activity affects retrograde signals and some but not all anterograde ones. the pattern of activity is more important than the overall amount. in rodents, the period of naturally occuring cell death of motoneurons is followed by a period of supersensitivity to axonal injury. thus, in newborn rodents lesion of the facial nerve leads to a rapid degeneration of the injured motoneurons. we have tested whether overexpression, in rive, of the bcl- proto-oncogene was capable of preventing death of axotomized motoneurons. to address this question we used transgenic mice whose motoneurons overexpress the bcl- protein. one of the two facial nerves of newborn mice was transected on the nd- rd post-natal day. seven days after the lesion, the morphology of the facial nuclei was analyzed. in control mice, and when compared to the intact nucleus, to % of axotomized motoneurons had disappeared. in contrast, in the transgenic animals, the number of motoneurons on the lesioned side remained unchanged when compared to the eontralateral nucleus. furthermore, their axons remained visible up to the distal lesion site. these experiments show that, in rive, motoneurons overexpressing the bcl- protein survive after axotomy, and suggest that, in rive, bcl- protect neurons from experimentally induced cell death and could be a target for treatment of motoneurons degenerative diseases. messmer s., mattenberger l., sager y., blatter-garin m-c., pometta d., kate a., james r.w. drpt de mrdeeine, drpt. de pharmacologie, div. de neurophysiologie clinique, facult de mrdecine, gen~ve. clusterin is a widely expressed glycoprotein, highly conserved across species. numerous functions have been postulated for this protein. the most important are roles in lipid transport, as elusterin is associated with apolipoprotein ai in hdl, complement regulation and tissue remodelling, in particular during cell death and differentiation. using cultures of rat spinal cord neurones ( % neurons and - % non-neuronal cells), we have studied the expression of clusterin and ape e in glutamate-induced neuronal cell death to examine potential roles in lipid management. up-regulation of the two proteins was observed. clusterin and ape e appear in the conditioned medium respectively h and . h after incubation with glutamate. control studies, in the presence of a noncompetitive nmda receptor agonist showed the secretion of clusterin and ape e to be diminished by > %. no up-regulation of either protein was observed in complementary studies with exclusively non-neuronal cell cultures. the cellular origin of the secreted proteins is presently under investigation. programmed cell death and tissue remodelling are consequences of hormonally induced restructuring of the rat ventral prostate after castration and the rat mammary gland after weaning. we used the "differential display"-method (liang and pardee, , science : ) to detect and isolate edna fragments whose corresponding rnas are regulated either coincidentally, or in an organ specific fashion during mammary gland involution and postcastrational prostate regression. partial sequencing of clones revealed high, but not absolute homology of fragments with sequences, previously characterized in different biological contexts. these five encode functions which could be anticipated to be important for cell growth and/or programmed cell death, we are presently investigating the functions of several of these transcripts in cell culture and in rive. antisense oligos are being employed in vivo to determine whether these genes contribute to the phenotype of programmed cell death. b epitopes derived from the envelope gp glycoprotein (ep ) or from the viral superantigen of mmtv have been incorporated into inert or live vaccines. the inert vaccine consists of purified chimeric proteins which contain the b epitopes alone or fused to multimeric promiscuous t helper epitopes from tetanus toxin. mice were immunized subcutaneously with these chimeric proteins. the live vaccine consists of an avirulent strain of salmonella typhimurium which expresses the mmtv epitopes in the form of chimeric proteins fused to the nucleocapsid protein of hepatitis b virus. this vaccine is given to mice in one oral dose. the level, duration and isotype of the immune response generated by each vaccine have been measured and compared. the level of protection has been investigated by systemically challenging immunized mice with the relzovims. a reduced binding of oxytocin (ot) occurs with aging in some, but not all, areas of the rat brain (arsenijevic et al., experientia , , a ) . the candate putamen showed the most impressive loss of ot receptors. two other regions, the hypothalamic ventromedial nucleus (vmh) and the islands of caueja (icj) had also an important deficit of ot binding sites. on the other hand, these two regions were known to be sensitive to sex steroids. in the present work, we treated from month old rats during one month with testosterone propionate ( #g/kg s.c., once every days) dissolved in oil. three rats of the same age injected with oil only served as controls. we labelled ot receptors throughout the brain of old rats using a i-labelled ligand specific for ot receptors. analysis of autoradiograms by an image analyzer revealed that the testosterone treatment increased ot binding sites in the vmh, in the icj, and, to a lesser extent, in the bed nucleus of the stria terminalis, a region also sensitive to sex steroids, by contrast, in the caudate putamen, the disappearance of ot receptors was not compensated. in conclusion, the decrease of ot receptors occurring in vmh and icj with aging can be reversed by administration of gonadal steroids. in contrast, the loss of ot receptors in the striatum appears to depend on another mecanism. vasopressin (avp) receptors are expressed transiently in the facial nucleus during development (tribollet et el., , dev. brain res., , - ) . avp may therefore play a role in the maturation of neuromuscular connexions in the neonate rat, and possibly in the restanration of these connexions after nerve lesion in the adult. in order to investigate the latter proposition, we have sectionned the facial nerve in adult rats and used quantitative autoradiography to look at avp binding sites in the facial nucleus at various postoperative times. we observed a massive and transient increase of avp binding sites on the operated side. the number of facial avp binding sites reaches a maximum about one week after nerve section, remains stable during - weeks, then begin to decrease towards control level. the induction of avp receptors is markedly delayed if the proximal stump of the nerve is ligated. to assess whether other motor nuclei would also react to axotomy by up-regulating the expression of avp receptors, we have sectionned the hypoglossal nerve and the sciatic nerve. in both cases, the binding of avp receptor ligand increases massively in the respective motor nuclei, with a time-course similar to that found in the facial nucleus. altogether, our data suggest that central avp could be involved in the process of nerve regeneration. cytotoxic t-cell mediated apoptosis schaerer,e, karapetian,o.,adrian,m. and tschopp,j. inst.de biochimie, univ.de lausanne, epalinges. an apoptotic cell death mechanism is used by cytolytic t cells (ctl) to lyse appropriate target cells. ctl harbor cytoplasmic storage compartments, containing the lytic protein perforin and serineproteases (granzymes), whose content is released upon target cell interaction. we show that these granules are multivesieular bodies and that degranulation releases these intragranular vesicles (igv) having granzymes, t-cell receptor and yet undefined proteins associated. isolated igvs and perforin induce dna breakdown in target cells within minutes. microscopic analysis demonstrates that igv specifically interact with target cell via the t-cell receptor and that their contents is taken up by the target cell. already min. after interaction, distinct igv proteins are found in the nucleus of the target cell.one of the molecules has been identified to be granzyme a, previously reported to be involved in apoptosis. we propose that lymphocytes transfer apoptosisinducing proteins to the nucleus of the target cells using vesicles as vehicles for delivery. cytotoxic t cells kill their targets by a mechanism involving membranolysis and dna degradation (apoptosis). recently, two sets of proteins have been proposed as dna breakdown-inducing molecules in t cells: granzyme a, b and tia-i. in this study, we cloned and further characterized the tia-i mouse homologue. aa sequence comparison with the human tia- showed an overall identity of %. devoid of a signal peptide, tia is yet localized to cytotoxic granules, probably targeted via a gly-tyr-motif. as tia-i, its mouse homolcgue contains three rnabinding domains. expression of tia during development shows a very strong signal in the brain and weaker signals in thymus, heart and other organs. during embryonic development several structures that contribute to organogenesis form transiently and are later eliminated by apoptosis. this pattern of tia expression could indicate its involvement in apoptosis. prostate involution occurs after castration in rats and is associated with the death by apoptosis of a large fraction of the epithelial cells. we have isolated several genes from a prostate involution bacteriophage lambda library using differential screening methods. among these clones, one d~monstrated an especially strong signal when used as a probe against northern blots of prostate mlhna obtained before, and at different times after castration. this gene is down-regulated after castration by -fold within days. intramuscular injection of a testosterone depot resulted in complete restoration of expression within hours. upon sequencing it became apparent that this clone has a high degree of homology to a known ndah dehydrogenase encoded in mitochondrial dna. the clone failed to hybridize to any transcripts from rat organs other than prostate. we are now in the process of isolating the htm~n hc~olog to this gene for use as a biomarker in study of benign hyperplasia and developing carcinoma. this gene is a possible indicator for testosterone-independent cell populations or of cells lacking ftl~ctional testosterone receptor. during the first three postnatal weeks the rat lung undergoes the last two developmental stages, the phase of alveolarization and the phase of microvascular maturation. the latter involves a decrease of the connective tissue mass in the alveolar septa and a merging of the two capillary layers to a single one. speculating that programmed cell death may play a role during this remodeling, we searched for the presence of apoptotie cells in rat lungs between days and . lung paraffin sections were treated with y-terminal transferase, digoxigenin-dutp, and anti-digoxigeninfluorescein-f(ab)-fragments, and the number of fluorescent nuclei was compared between sections at different days. while the number of apoptotie ceils was low until the end of the second week and at day , we observed an about eight fold increase of fluorescent nuclei towards the end of the third week. we conclude that programmed cell death is involved in the structural maturation of the lung. brunner, a., wallrapp, ch., pollack, i, twardzik, t. and schneuwly, s. lehrstuhl genetik, biozentrum universit~t w~rzburg, mutants in the giant lens (g/l) gene show a strong disturbance in ommatidial development. in the absence of any gene product, additional phetoreceptors, cone cells and pigment cells develop. opposite effects can be seen in flies in which the gene product of the giant lens gene can be ectopically expressed by heat shock. a second very typical phenotype is the disturbance of photoreceptor axon guidance. molecular analysis of gil shows that it encodes a secreted protein of aa containing three evolutionary conserved cystein-motives very similar to egf-like repeats. we propose that gil functions as a secreted signal, most likely a lateral inhibitor for the development of specific cell fates and that gil, either directly or indirectly, is involved in targeting photoreceptor axons into the brain. the decrease in cellularity during scar establishment is mediated through apoptosis desmouliere, a., redard, m., darby, i., and g. gabbiani department of pathology, cmu, rue michel server, gen~ve dudng the healing of an open wound, granulation tissue formation is characterized by replication and accumulation of fibroblastic cells, many of which acquire morphological and biochemical features of smooth muscle cells and have been named myofibroblasts (sch rch et el., histology for pathologists, t ). as the wound evolves into a scar, there is an important decrease in ceuuladty, including disappearance of myofibroblasts. the question adses as to which process is responsible for myofibroblast disappearance. during a previous investigation on the expression of (z-smooth muscle actin in myofibroblasts, we have obsewed that in late phases of wound healing, many of myofibroblasts show signs of apoptosis end suggested that this type of cell death is responsible for the disappearance of myofibroblasts (darby et al., lab. invest. : , ) . we have tested this hypothesis by means of electron microscopy and morphometry and by in situ end-labeling of fragmented dna (wijsman et al., j. histochem. cytochem. : , t ) . our results show that the number of apoptotic cells increases as the wound closes and suggest that this may be the mechanism for the disappearance of myofibroblasts as well as for the evolution of granulation tissue into a scar. (supported by the swiss national science foundation, grant n~ s - r. jaggl, a. marti and b. jehn. universit~t bern, akef, tiefenaustr. , bern at weaning the mammary gland undergoes a reductive remodelling process (involution) which is associated with the cessation of milk protein gene expression and apoptosis of milk-produclng epithelial cells. this process can be reversed by returning the pups to the mother within day. elevated nuclear protein kinase a (pka) activity was observed from one day post-lactation, paralleled by increased c-los, junb, ]und and to a lesser extent c-]un mrna levels. ap- dna binding activity was transiently induced and the ap- complex was shown to consist principally of cfos/jund. oct- dna binding activity and oct- protein were gradually lost from the gland over the first four days of involution, whereas oct- m_rna levels remained unchanged. comparing nuclear extracts from normal mammary glands with nuclear extracts from glands which had been cleared of all epithelial cells three weeks after birth revealed that pka activation, ap- induction and oct- inactivation are all dependent on the presence of the epithelial compartment. the increased fos/jtm expression and the inactivation of oct- may be consequences of the increased pka activity. when involution is reversed, both, pica activity and ap- dna binding activity (and fos andjun mrna levels) are reduced to basal levels. our data suggests a role for pka and ap- on progranlmed cell death of manlnmry epithelial ceils. bcl- ~ does not require membrane attachment for its survival activity c. borner*, i. martinout, c. mattmann*, m. irmler*, e. sch&rrer*, j.-c. martinou-j-, and j. tschopp*. * institute of biochemistry, university of lausanne, epalinges, institute of molecular biology, glaxo inc., plan los ouates. cl- (z is a mitochondrial or perinuclear-associated oncoprotein that prolongs the life span of a variety of cell types by interfering with programmed cell death. how it exerts this activity is unknown but it is believed that membrane attachment is required. to identify critical regions in bcl- o~ for subcellular localization and survival activity, we created by site-directed mutagenesis, various mutations in regions which are most conserved between the different bcl- species. we show here that membrane attachment is not required for the survival activity of bcl- o< a truncation mutant of bcl- (z lacking the last amino acids (t ) including the hydrophobic domain is soluble, yet fully active in blocking apoptosis of sympathetic neurons induced by ngf deprivation or l fibroblasts induced by tnfc~ treatment. we further provide evidence for a putative functional region in bcl- which lies in the conserved domains and upstream of the hydrophobic cooh terminal tail. the breakdown of nuclear dna is considered to be a hallmark of apoptosis. we previously identified the perinuclear membrane localized dnase i as the endonuclease involved in the formation of oligonucleosomal-sized fragments (dna ladder). it is not clear how the nuclease is activated and has access to the dna. we show that in thymocytes induced to undergo apoptosis, lamin breakdown preceded dna laddering. by transfeeting hela cells with a constitutively active cdc mutant, nuclear envelope breakdown and typical apoptotic features (ehromatin condensation) were observed. moreover, co-transfection with cdc mutant and dnase i led to dna degradation. we propose that apoptosis can be induced by wrongly timed and hence abortive mitosis leading to uncontrolled nuclear membrane disintegration. s - s - platelet-derived growth factor (pdgf) is thought to play an active role in fibrosing diseases. bronchiolitis obliterans-organizing pneumonia (boop) is a condition characterized by intraluminal proliferation of connective tissue inside distal air spaces. to evaluate pdgf expression in boop we performed immunohistoehemistry on lung biopsies from patients and controls free of fibrosis. sedal sections were stained with an antibody against either pdgf or the monoeyte/macrophage marker cd , in both groups the pdgf ~ cells were essentially tissue macrophages. using point counting to measure volume fraction (vv) , pdgf-pesitive cells represented . + . % (mean+sd) of the volume occupied by lung tissue in the boop cases, and , + . % in the controls (! < , ). similarily, . + . % of the lung tissue was occupied by cd e~ macrophages in the boop cases, compared to . :~ . % in the controls (p