key: cord-0326961-8heoeyal authors: Rosenau, Jana; Grothaus, Isabell Louise; Yang, Yikun; Ciacchi, Lucio Colombi; Kelm, Sørge; Waespy, Mario title: N-glycosylation modulates enzymatic activity of Trypanosoma congolense trans-sialidase date: 2021-12-14 journal: bioRxiv DOI: 10.1101/2021.12.13.472379 sha: ca94e623cd6978b1979d5232cf35317291881898 doc_id: 326961 cord_uid: 8heoeyal Trypanosomes cause the devastating disease trypanosomiasis, in which the action of trans-sialidase (TS) enzymes harbored on their surface is a key virulence factor. TS are highly N-glycosylated, but the biological functions of the glycans remain elusive. In this study, we investigated the influence of N-glycans on the enzymatic activity and structure stability of TconTS1, a recombinant TS from the African parasite Trypanosoma congolense. MALDI-TOF MS revealed that eight asparagine sites were glycosylated with high-mannose type N-glycans. Deglycosylation of TconTS1 led to a 5-fold decrease in substrate affinity but to the same conversion rate relative to the untreated enzyme. After deglycosylation, no changes in secondary structure elements were observed in circular dichroism experiments. Molecular dynamics simulations revealed interactions between the highly flexible N-glycans and some conserved amino acids belonging to the catalytic site. These interactions led to conformational changes, possibly enhancing substrate accessibility and promoting enzyme/substrate complex stability. The here-observed modulation of catalytic activity via the N-glycan shield may be a structure-function relationship intrinsic of several members of the TS family. Glycosylation, or the covalent attachment of carbohydrates to the polypeptide chain of proteins, is the most diverse protein modification in eukaryotic cells. One type of protein glycosylation is the N-linked glycosylation, the attachment of an oligosaccharide to an asparagine residue (N-glycan) in the N-X-S/T motif. N-glycans are branched, tree-like polysaccharide structures built up of different monosaccharide units, which are linked via different glycosidic bonds. N-glycans are involved in folding and stability of proteins, regulate protein functions, represent target structures for lectins or antibodies, and can function as mediators of cell-matrix interactions and cell-cell recognition and communication (1) (2) (3) . Recent studies of N-glycans include molecular dynamics simulations of the SARS-CoV-2 spike protein's glycan shield, the characterization of the glycosylated infectious bronchitis virus spike protein as well as the heavily glycosylated receptor CD22 on B-cells by means of wet lab-based approaches (4) (5) (6) . N-glycosylation has also been suggested to affect the secretion and thermal stability of enzymes, and to alter their kinetic activity (1) . In this study, we use the highly N-glycosylated enzyme trans-sialidase (TS) as a model to investigate the influence of N-glycans on enzymatic activity, thermal stability, and structural conformation, combining kinetic and mass spectrometry (MS) data with molecular dynamics simulations. TS are unusual enzymes expressed by different species of the parasitic genus Trypanosoma (T.). This includes human pathogens causing Chagas disease in South America (T. cruzi) and sleeping sickness in Africa (T. brucei), as well as animal pathogens being responsible for the animal African trypanosomiasis (T. brucei, T. congolense, T. vivax) (7) (8) (9) (10) . The glycosylphosphatidylinositol (GPI)-anchored enzymes preferentially transfer α2,3linked sialic acids (Sia) from host-cell glycoconjugates to β-galactose residues of glycoproteins present on the parasite's surface, thus creating a new α2,3-glycosidic linkage (7, 8, 11) . This surface sialylation has different beneficial functions for the parasite, which is unable to synthesize Sia de novo. In particular it promotes survival in the insect vector and enables to escape the host's immune system (12) (13) (14) (15) (16) . Thus, TS are important virulence factors and represent promising drug targets and vaccine candidates to combat the fatal diseases caused by trypanosomes. The enzymatic mechanisms and the biological functions of TS have been under study for many years, although little attention has been drawn to the influence of posttranslational modifications such as glycosylation (17) (18) (19) (20) (21) (22) . Interestingly, high-mannose type N-glycans have been reported for many TS indirectly by concavalin A (ConA) purification (8, 23, 24) . It has been postulated that these glycans are involved in TS oligomerization (25) . However, its influence on enzyme activity is still debated (17, 18, 21, 22) . Previous experiments were performed with recombinant TS expressed by Pichia (P.) pastoris, which produces hypermannosylated N-glycans (21, 22) . However, trypanosomal surface proteins were reported to harbor shorter high-mannose type N-glycans (26) (27) (28) (29) (30) (31) . No differences in Sia transfer activity were observed between glycosylated and deglycosylated recombinant TS expressed by P. pastoris (21, 22) . We investigated the role of high-mannose type N-glycans using recombinant TS1 from the animal pathogen T. congolense (TconTS1) (32) . TconTS1 is expressed by procyclic insect-infective trypanosomes as well as bloodstream-form trypanosomes in mammalian hosts, and is involved in desialylation of erythrocytes, which contributes to anemia (33, 34) . Like all TS, TconTS1 consists of an N-terminal catalytic domain (CD) responsible for the transfer of Sia, and of a C-terminal lectin-like domain (LD) whose biological function remains unclear. Recently, we demonstrated that TconTS-LD can modulate enzymatic activity (35) . CD and LD are connected via an α-helix ( Fig. 1) (17, 18, 32) . The N-terminus includes a signal sequence, whereas the C-terminus comprises a potential GPI anchor attachment site (32) . So far, 17 TconTS-like genes have been described for T. congolense, from which 11 can be grouped into the TconTS1 family due to their high amino acid sequence similarity (>96 %). TconTS1b was chosen in this study, because it possesses one of the highest enzyme activities (32, 36) and is simply called TconTS1 onwards. An important structural feature of TconTS1 are the predicted nine N-glycosylation recognition sequences (Figure 1 ), distributed across both the CD and LD (25, 32) . It is noteworthy that TS sequences from African trypanosomal species contain a higher number of N-glycosylation sites (6-9) compared to those of species from South-American trypanosomes (2) (3) (4) (5) including the structural closely related sialidase (SA) from T. rangeli (10, 21, 32, (36) (37) (38) (39) . To investigate N-glycosylation induced structure-function effects, we expressed recombinant TconTS1 in CHO Lec1 fibroblasts. This glycosylation mutant mainly produces N-glycans of the composition Man5GlcNAc2 which is similar to the highmannose type N-glycans described for African trypanosomes (Man5-9GlcNAc2) (26-31, 40, 41) . The actual presence of N-glycans at the putative N-glycosylation sites was analyzed via MALDI-TOF MS. Enzyme activities of glycosylated and Endoglycosidase H (EndoHf)-treated TconTS1 were then compared in activity assays. Circular Dichroism spectroscopy was used to compare the secondary structures and stabilities of glycosylated versus deglycosylated TconTS1. Finally, molecular dynamics (MD) simulations were performed to analyze potential protein-glycan and glycan-glycan interactions altering the dynamics of amino acids in close proximity to the CD. To characterize N-glycosylation of TconTS1 in detail, we expressed TconTS1 in CHO Lec1 cells (Fig. S1A, S2) (32) . High-mannose type N-glycans of the recombinant protein could be detected in lectin blots using ConA ( Fig. 2A) . For deglycosylation, enzymatical removal of N-glycans, TconTS1 was treated with EndoHf, called D-TconTS1 in this study. A different gel electrophoresis migration profile and reduced ConA binding was observed for D-TconTS1 ( Fig. 2A) . However, deglycosylation was not complete, as a faint band was still visible in ConA blots. Even further addition of EndoHf or increased incubation time did not result in complete deglycosylation, potentially suggesting poor accessibility of some glycans to EndoHf. Thus, this partially deglycosylated protein was used in the following experiments. TconTS1 harbors nine potential N-glycosylation sites, five in the N-terminal CD and four in the LD. MALDI-TOF MS experiments were performed to identify N-glycan structures attached at potential N-glycosylation sites. In order to analyze TconTS1 with MALDI-TOF MS, it was necessary to cut the enzyme in shorter peptide and glycopeptide fragments via protease digestion using either trypsin or chymotrypsin. The resulting fragments were separated by their mass-to-charge ratio. Glycosylated peptides have a higher mass compared to their unglycosylated counterparts due to the molecular mass of the attached glycan. Glycopeptide analysis via MALDI-TOF MS is challenging, because they have a lower ionization efficiency than unglycosylated peptides (42) . Therefore, different analyses were performed with varying sample preparations to unravel the complete N-glycosylation profile of TconTS1. A summarizing evaluation is given in Table 1 . In a first approach, generated peptides of TconTS1 were analyzed in order to identify all peptide fragments with potential Nglycosylation sites in their glycosylated or unglycosylated state. Although glycopeptides are usually under-represented in MALDI-TOF mass spectra of the entire protein, this approach confirmed high-mannose type N-glycans at six out of the nine positions with the N-glycosylation consensus sequence N-X-S/T (N45, N113, N206, N240, N281, N693; exemplary spectrum shown in Fig. S3A ). Interestingly, five of them were located in the CD. As reported for protein expression in the glycosylation mutant cell line CHO Lec1, Nglycans were almost exclusively composed of Man5GlcNAc2. Glycopeptides harboring the potential N-glycosylation sites N45 and N281 were only found with high-mannose type N-glycans, whereas peptides containing N113, N206, N240 and N693 were detected both with and without N-glycans, indicating N-glycosylation heterogeneity at these sites. For the peptide containing the first potential glycosylation site in the LD (N625), we only detected the mass corresponding to the ungycosylated peptide, suggesting that this site is not glycosylated. In the following approach, glycopeptides were separated from unglycosylated peptides using ConA adsorbent. The removal of unglycosylated peptides reduced the complexity of the MALDI-TOF mass spectra and confirmed high-mannose type Nglycans at the six N-glycosylation sites identified in the first approach (N45, N113, N206, N240, N281, N693; exemplary spectrum shown in Fig. 2B ). Additionally, one further N-glycan was identified at position N657 in the LD. The approach also unveiled a number of glycans other than Man5GlcNAc2 at position N206, namely Man6GlcNAc2 and Man8GlcNAc2. In a third experimental set-up, the EndoHf-deglycosylated D-TconTS1 was analyzed after protease digestion. This approach further allowed the identification of Nglycosylation sites since EndoHf catalyzes the hydrolysis of the β-1,4-glycosidic linkage in the chitobiose core of high-mannose type N-glycans and leaves one GlcNAc unit attached to the glycopeptide. Glycopeptides harboring a N-glycosylation site and a residual GlcNAc in these preparations are therefore an additional indication of Nglycosylation sites carrying high-mannose type N-glycans. We were able to verify all N-glycosylation sites which were identified in the previous approaches (exemplary spectrum shown in Fig. S3B ). However, results for two glycosylation sites (N206, N281) were ambiguous since masses corresponding to the glycopeptides with an additional GlcNAc were also detected in the untreated TconTS1. Additionally, we identified N-glycosylation sites with this approach (N113, N206, N693) that exhibited high-mannose type N-glycans even after 16 h of EndoHf-treatment (Table 1) , consistent with the ConA blot experiment ( Fig. 2A ). All three different approaches revealed a predominant N-glycosylation in TconTS1-CD and allowed for a detailed analysis of the composition of N-glycans at both constant and variable N-glycosylation sites (Table 1 ). In none of these experiments we could identify a leucine-rich peptide containing the Nglycosylation consensus sequence N-E-S (662-664), neither in the glycosylated nor in the unglycosylated form. The peptide was not detectable also when the mass lists were analyzed for different glycans and post-translational modifications. Among the many different proteases which are suitable for MALDI-TOF MS analysis, the leucine-rich fragment only contained cleavage sites for chymotrypsin. However, the protease cleaves at the carboxyl side of different amino acids (Y, F, W, L, M), but the cleavage efficiency is rather low after leucine and methionine. It is therefore possible that the leucine-rich fragment was not sufficiently cleaved by chymotrypsin and thus too large to be detected. Another possibility is that the fragment has been digested into many different peptides in quantities below the detection limit. To test the impact of N-glycosylation on enzyme activity, we performed activity assays with D-TconTS1 and a control TconTS1 as reference. The control TconTS1 was treated at similar conditions as used in the deglycosylation protocol, including an incubation step for 16 hours at 37°C. Fetuin and lactose were used as Sia donoracceptor pairs, as described (32, 36) . The transfer reaction product 3'sialyllactose (3'SL) was analyzed via high performance anion exchange chromatography (HPAEC) with pulsed amperometric detection (PAD). The method allows for the separation and detection of the educt lactose, the product 3'SL, and the side product Nacetylneuraminic acid (Neu5Ac). The latter potentially occurs from hydrolytic activity of TS releasing free Neu5Ac from generated 3'SL. Incubation times of 30 minutes and 600 µM fetuin-bound Sia were used in these experiments. The lactose concentrations were varied to determine the kinetic parameters KM and Vmax. Under the same reaction conditions, D-TconTS1 produced less 3'SL than TconTS1 (Fig. 3A) . However, the calculated Vmax for the acceptor lactose using 600 µM fetuinbound Sia as donor substrate was identical within the error range for both enzymes and amounted to 2.4 µmol 3'SL/(min x mg enzyme) (Fig. 3B ). This implies that TconTS1 produces the same amount of 3'SL per time unit at high acceptor concentrations irrespective of its glycosylation state. However, the KM values for lactose differed between TconTS1 (1.7 mM) and D-TconTS1 (9.0 mM), indicating a 5fold lower acceptor substrate affinity for the deglycosylated enzyme. Once we were able to determine TconTS1's N-glycan profile and its influence on the enzyme activity, we addressed the question if N-glycans are able to modulate the enzyme stability. Since the presence of N-glycans can impact the protein's function via altering the protein structure (43), we performed circular dichroism experiments to investigate the influence of N-glycans on TconTS1's secondary structure. Circular dichroism spectra of TconTS1 and D-TconTS1 were analyzed under similar conditions used for the enzyme activity measurements (35 °C) (Fig. 4A ). The recorded spectra do not show any significant difference over the recorded wavelength range, indicating that both enzymes share the same common secondary structure. In fact, calculated secondary structural elements were identical in both cases with 37% of βsheets, 13% of α-helices, 11% of turns and 39% of unstructured (other) components in both cases (Table S1 ). Spectra recorded during temperature-ramping experiments revealed a high heat stability, as both TconTS1 and D-TconTS1 still possessed intact secondary structural elements up to 95°C ( Fig. 4C /D). The variation of the spectra intensity in the range between 190 and 210 nm during heating indicates a partial unfolding, taking place between 60°C and 70°C, with a transition temperature (TM) of about 62°C in both proteins (Fig. 4B ). In conclusion, the circular dichroism experiments indicated that the altered substrate affinity observed for D-TconTS1 did not originate from changes in the protein's secondary structure, which presents the same secondary element distribution and the same heat stability (as quantified by the melting temperature (TM) for partial unfolding) for both glycosylation states. However, N-glycosylation-induced changes of the tertiary structure cannot be excluded because they do not influence the circular dichroism spectra in the measured wavelength region. Molecular dynamics simulations can facilitate the study of mechanisms at the atomistic level and were performed to obtain an in-depth picture of N-glycan-protein interactions in TconTS1. Due to the lack of experimentally derived structures for TconTS enzymes, homology models were constructed on the basis of T. rangeli sialidase (TranSA) and T. cruzi TS (TcruTS). Although TconTS1 only has an amino acid sequence identity of 48 % compared to TcruTS, both reveal an overall high structural similarity and only differ in three amino acids reported to be important for TS activity (32) . In order to generate a N-glycosylated structure model of TconTS1, Man5GlcNAc2 glycans were attached at positions N45, N113, N206, N240, N281 and N693, as identified by MALDI-TOF MS. In this model, all potential N-glycosylation sites are glycosylated in the CD, whereas only one out of four potential sites is glycosylated in the LD. At first, standard MD simulations have been performed for TconTS1 and D-TconTS1 without bound substrates, in order to observe the dynamical behavior of the covalently linked N-glycans. Six out of nine asparagine residues in the motif N-X-T/S are located at the tail of loop regions (Fig. 5A) , which are mostly part of turns or coils and framed by β-sheet regions (Fig. S4A ). The terminal position and flexibility of these structural elements allow for large motion amplitudes and internal flexibility of the N-glycan trees. These movements enable interactions among glycans in structural proximity, for instance hydrogen bonds between the trees at positions N113 and N240 (Fig. 5B) , notwithstanding their distance in the protein sequence. Furthermore, an overlay of the averaged glycan distribution recorded every 5 ns during the simulation (Fig. 5C /D) revealed a dense glycan coverage (shielding) of the protein, especially in the CD and except for the direct entrance to the active site (Fig. 5D ). Interactions of N-glycans with important and highly conserved amino acids involved in the catalytic activity of TconTS1 (32, 44) were analyzed over the 500 ns MD trajectories. We especially focused on residues involved in catalysis (D150, E324, Y438) and substrate binding (R126, R144, Y211, W212, R339, Y408, R410). In TconTS1, D150 was observed to shift from the interior of the active site ( Fig. 6A ) towards an exterior position (Fig. 6B ), increasing its distance from R410, which was stationary within the active site (Fig. 6C) . Interestingly, as a consequence of this shift, a hydrogen bond formed between D150 and the N206 glycan ( Fig. 6C/D) . Detailed analysis revealed that this process was initiated by hydrogen bond formation between Y151 and glycan N206, already leading to a partial shift of D150 and making it more prone to interact with glycan N206 (Fig. 6C) . In D-TconTS1, instead, D150 was far less mobile (Fig. 6C ). Two important residues in TcruTS (Y119, W312) have been shown to serve as a holder for lactose once a substrate is bound in the active site and can be present in an open or stacked conformation (45, 46) . Consistently with the results reported by Mitchell et al. (46) , the corresponding residues Y211 and Y408 in TconTS1 ( Fig. 6 A/B, orange circles) showed a high flexibility and presented an open conformation with an average distance of 2.2 nm in the absence of a substrate (Fig. 7) . For comparison, these two residues stayed in much closer contact in D-TconTS1, remaining at a distance of about 1.6 nm during our simulation (Fig. 7A ). Further MD simulations of TconTS1 and D-TconTS1 were performed in complex with the substrate 3'SL. 3'SL was positioned in the binding site of TconTS1 in alignment with the crystal structure of the TcruTS/3'SL complex (PDB entry 1S0I). In the starting structure, 3'SL is bound at the acceptor substrate binding site between Y211 and Y408, and in close contact to both D150 and the well-conserved arginines R339, R410 (see Fig. 8A /B for TconTS1). As already seen in the substrate-free simulation, D150 of TconTS1 formed hydrogen bonds with mannose residues of an N-glycan. However, in this case it was glycan N113, which is also structurally in close proximity to the active site, and not glycan N206 (Fig. 8D ). This interaction occured after 20 ns, when D150 moved slightly off the catalytic site and becoming more accessible for the N-glycan (Fig. 8C) . After a structural rearrangement of 3'SL within the binding site after around 70 ns, D150 again interacted with glycan N113 and was dragged out of the binding site (Fig. 8C) . In striking contrast, catalytic amino acids in D-TconTS1 did not experience any interactions with GlcNAc residues, which remained on the protein surface after EndoHftreatment. In the presence of a substrate, the behavior of Y211 and Y408 was reversed with respect to the substrate-free protein. Namely, the distance between the two amino acids was about 2 nm in TconTS1 but increased to more than 3 nm in D-TconTS1 (Fig. 7B ). In this study, we have focused on the influence of N-glycosylation on enzyme activity and stability of TconTS1 from the animal pathogen T. congolense. Recombinant TconTS1 was expressed in the N-glycosylation mutant CHO Lec1 (32, 40) . The cell line mainly yields Man5GlcNAc2 N-glycans as these glycans cannot be further processed to complex glycans in the Golgi apparatus due to a lack of Nacetylglucosaminyltransferase I (GlcNAc-TI) activity (41, 47) . By means of MALDI-TOF MS analyses we were able to show that at least seven out of nine potential N-glycosylation sites in TconTS1 carry high-mannose type N-glycans, five of which are located in the CD. A glycan heterogeneity was observed at the three potential N-glycosylation site (N113, N206, N693) as also higher mannose structures (Man6GlcNAc2, Man8GlcNAc2) and fucosylated glycans (Man5GlcNAc2Fuc) were detected in some MS spectra. This heterogeneity might be explained by the lack of accessibility of the glycan for glycosidases, since N-glycan precursors are trimmed from Man8GlcNAc2 to Man5GlcNAc2 in the Golgi (48) . This hypothesis is supported by the fact that the glycosidase EndoHf was not able to fully deglycosylate TconTS1 at these sites even after 16 h of incubation. Additionally, our MD simulations revealed protein-glycan interactions at sites N113 and N206 by hydrogen bonds. It was previously shown by biochemical in vitro assays and computational studies that such interactions decrease glycan accessibility, which might hinder deglycosylation by EndoHf (49) . The secondary structure of TconTS1 was not influenced by the presence of highmannose type N-glycans. First, circular dichroism spectra of deglycosylated and glycosylated proteins did not reveal differences in secondary structure elements. Second, a similar midpoint of unfolding transition was determined for both enzymes in temperature-ramping experiments, suggesting no difference in stability. Similar results were found in single-site N-glycosylation mutants of the IBV spike protein by circular dichroism experiments (5). The observed thermal stability of TconTS1 might be explained by the high ß-sheet content of the protein as well as by an extended interface between CD and LD stabilized by salt bridges and a well-structured hydrogen bond network, making unfolding rather unlikely (35) . Although N-glycans do not seem to influence the stability of TconTS1 after successful expression, glycans are known to be required for proper protein folding (3). Whether or not TconTS1 requires Nglycosylation for accurate protein folding cannot be excluded by our experiments, and the role of N-glycans in TconTS folding still needs to be investigated. A prominent influence of N-glycosylation, however, was observed in enzyme activity assays. Lactose affinity of D-TconTS1 was decreased by a factor of five compared to TconTS1. However, the Vmax value was the same for both enzymes, which corroborates the structural integrity of D-TconTS1. Haynes et al. (21) studied the influence of N-glycosylation on TvivTS1 and did not observe an effect on enzyme activity. The same applies to investigations of a mutated variant of TranSA, which expresses transfer activity (22) . However, these studies did not determine the KM values of the enzyme reaction. Another study of TranSA in which the sialidase activity was investigated, did not observe strong effects on KM when recombinant proteins were expressed in Escherichia (E.) coli were compared with the native enzymes isolated from trypanosomes (39) . For TcruTS, enzymes expressed in E. coli still showed transfer activity although to a lesser extent than observed for the native protein, which might be explained by the absence of N-glycans or incorrect protein folding (37) . Molecular insight into the effect of N-glycosylation of TconTS1 at the atomic level were obtained in a series of MD simulations. The N-glycans, especially in TconTS1-CD, were observed to form a highly dynamical 'shield' enclosing the enzyme, while leaving the entrance to the catalytic center open for substrate binding. Moreover, the simulations suggest a large structural flexibility of the aspartate at position 150, which acts as a proton donor in the enzymatic transfer of Sia, and is therefore crucial for substrate conversion (17, 44) . In several simulations of TconTS1, both in the absence and presence of bound 3'SL, D150 forms either intermittent or stable hydrogen-bond interactions with N-glycans at positions N206 or N113. Interestingly, these are conserved N-glycosylation sites among TS or SA from different species, despite the dissimilar amino acid sequences and distributions of other N-glycosylation sites. In particular, N206 is conserved in all TS variants of T. congolense as already described by Waespy et al. (25) but also in enzymes from T. cruzi , T. rangeli, T. brucei and T. vivax as revealed by sequence alignment (Fig. S5) . Additionally, N113 is also conserved in TconTS, TvivTS and TranSA. Along this line, we propose that there might be a common mechanism of TS activity mediated by N-glycan interactions with amino acids of the active site, in particular with D150. MD simulations showed that through this interaction, which can only take place in glycosylated TconTS1, D150 was pulled out of the active site. Moreover, in 3'SL-free TconTS1 the distance between the ligand-binding residues Y211 and Y408 was significantly larger than that found in D-TconTS1. Both these observations suggest a more open structure of the catalytic pocket of TconTS1 in contrast to D-TconTS1, which facilitates the initial binding of sialoglycans. Interaction of the N-glycan shield at position N113 with D150 was also observed in MD simulations of TconTS1 in complex with 3'SL, although more intermittent. Interestingly, the substrate was tightly bound between Y211 and Y408 in TconTS1, but not in D-TconTS1. These findings provide good evidence for higher substrate-binding affinity in case of the glycosylated enzyme and are in line with our experimental observations. We note that the heterogeneity of N-glycosylation at N206, as determined experimentally in this study, was not considered in our MD simulations, so that the influence of different N-glycan structures at this position on the catalytic activity remains a task for future works. However, our results underline the obvious importance of the flexibility of the N-glycan shield in its interaction with key amino acid residues located in the CD, which remained unnoticed in previous studies. The short time scale and lack of N-glycans in previously performed MD simulations probably prevented the exploration of the D150 structural shift (45, 46, 50, 51) . Mechanisms underlying the Nglycan-dependent modulation of enzymatic activity remain to be explored by advanced techniques enabling prediction of binding free energies and including hybrid quantum mechanics/molecular mechanics approaches, which have been beyond the scope of this study. Our data further underline the general importance of MD simulations, since differences in the dynamical behavior of catalytic amino acids are not always detectable in wet-lab experiments. In particular, the D150 shift could not be detected in our circular dichroism experiments, because D150 is part of a turn/coil structural motif, which does not change its secondary structural composition irrespective of the location of D150 (Fig. S4 ). In conclusion, our study demonstrates that N-glycosylation can enhance substrate affinity in TS, putatively via interactions of N-glycans with amino acids located in the catalytic pocket. To the best of our knowledge this is the first time that interaction of the N-glycan shield with the underlying protein have been suggested to alter enzyme kinetics of TS. As N-glycosylation sites are partially conserved among the transsialidase enzyme family, the here-observed protein-glycan interactions may also occur in TS of other trypanosomal species. This newly discovered regulatory feature opens up new avenues in TS and enzyme research in general and may pave the way for novel approaches to treat trypanosomiasis. Chemicals and reagents used in this study were of cell-culture and analytical grade and were purchased from Sigma-Aldrich/Merck KGaA (Darmstadt, Germany) or Carl Roth (Karlsruhe, Germany) if not stated otherwise. Recombinant TconTS1 was produced in CHO Lec1 cells as already described (32) . In brief, a modified pDEF vector was used for stable transfection comprising a transin sequence for protein secretion followed by the TconTS1-encoding sequence excluding signal peptide and GPI anchor sequence, a C3 cleavage site and a C-terminal SNAP and Strep-tag. Monoclonal cells were grown in serum-free CHO medium (Bio&SELL, Feucht, Germany) or Excell medium supplemented with 50 μg/mL gentamicin sulphate (Lonza™ BioWhittaker™, Walkersville, MD, USA) at 37°C and 5% CO2. Cell culture supernatant comprising the recombinant protein was harvested every second day and stabilized with 10 mM Tris/HCl pH 8.0, 10 mM EDTA, 10 mM ascorbic acid and 0.02% sodium azide. Ultracentrifugation was performed at 7,800 rcf for 15 min followed by 40 In-solution digestion of glycosylated and deglycosylated TconTS1 was performed with trypsin and chymotrypsin (Promega, Walldorf, Germany) to analyze peptides with MALDI-TOF MS. Protein solutions with a final concentration of 0.1 mg/mL were prepared in 50 mM NH4HCO3 buffer pH 7.8. Cysteine bonds were reduced with 1.4 mM DTT for 30 min at 50°C and the protein was denatured for 10 min at 95°C. After a short cooling step on ice, cysteine residues were alkylated with IAA at a final concentration of 3.2 mM for 30 min at 37°C and protected from light. Protein samples were digested with trypsin or chymotrypsin overnight at 37°C at protein:protease ratios of 25-100:1. Two negative controls were prepared accordingly, either lacking the protein TconTS1 (N1) or the protease (N2). Samples were spotted directly on the target (Bruker Daltonics, Bremen, Germany) or glycopeptides were further purified with ConA beads as described under "Glycopeptide purification of protease-digested TconTS1 with ConA beads". For spotting, 1 µL protein solution was directly mixed with 1 µL HCCA matrix solution (40 mM in 50% acetonitrile/0.1% TFA) on the target. Samples were measured in the positive reflector mode using the MALDI-TOF autoflex TM speed (Bruker Daltonics) which was calibrated with the peptide calibration standard II (Bruker Daltonics). Detailed settings can be found in Table S2 . The software Flex analysis 3.4 (Bruker Corporation, USA) was employed for data evaluation. Monoisotopic peaks were identified with Sophisticated Numerical Annotation Procedure (SNAP) algorithm a signal to noise ratio of 6. Further settings included baseline subtraction (TopHat) and spectra smoothing (algorithm: Savitzky Golay). Peaks present in any control have been automatically excluded from sample mass lists. To concentrate glycopeptides from protease-digested TconTS1, samples were incubated with AffiSep® ConA adsorbent (Galab). The adsorbent was equilibrated in 1x binding buffer (50 mM Tris/HCl pH 7.4, 150 mM NaCl, 1 mM CaCl2, 1 mM MgCl2, 1 mM MnCl2) and was added to trypsin-digested and heat-inactivated protein samples in a ratio of 1µL adsorbent/2.5 µg protein). 5x binding buffer of appropriate volumes was supplied to samples to yield a final concentration of 1x. Samples were rotated overnight at 8 rpm and 4°C and centrifuged for 30 s at 130 rcf. The supernatant was discarded, and the adsorbent was washed thrice with 50 mM NH4HCO3 buffer (pH 7.8) in binding buffer and at the same centrifugation conditions. After the last centrifugation step the adsorbent was resuspended in 10-50 μL of NH4HCO3 buffer (pH 7.8). Elution of glycopeptides was performed at 95°C for 10 min. Samples were centrifuged at 4,300 rcf for 30 s. The supernatant was used for MALDI-TOF analysis. Negative controls were prepared in parallel with identical components apart from the trypsin-digested TconTS1. Peaks resulting from negative controls were excluded for data analysis. TS activity assays were executed as previously described (32, 36) . Fetuin and lactose served as Sia donor and acceptor, respectively. Reactions were carried out in 50 µL of 10 mM potassium phosphate buffer, pH 7.4 containing 100 µg dialyzed fetuin (corresponding to 600 µM bound Neu5Ac), varying concentrations of lactose (0.01-5 mM) and 50 ng deglycosylated TconTS1 or the corresponding control. Samples were incubated at 37°C for 30 min and the reaction was terminated with 200 µL ice-cold acetone. After protein precipitation overnight at -20°C, samples were centrifuged (20000 rcf, 30 min, 4°C), lyophilized and resuspended in 125 µL water. Transfer activity was measured as 3'SL implementing the HPAEC-PAD system ICS-5000+ (Dionex/Thermo Fisher Scientific). 25 µL sample were applied to a CarboPac100 analytical column (250x2 mm, 8.5 µm, Thermo Fisher Scientific) equipped with a guard column (50x2 mm, Thermo Fisher Scientific). Chromatography was performed at isocratic conditions with 100 mM NaOH and 100 mM NaOAc for 12 min followed by a wash step with 100 mM NaOH and 500 mM NaOAc for 5 min and an equilibration step for 8 min to previous conditions. Production of 3'SL was quantified with a purchased 3'SL standard (Carbosynth, Compton, United Kingdom 4) only, in the respective cuvette. Regarding data processing, the baseline was subtracted from recorded spectra and repetitive scans were averaged before a Savitsky-Golay smoothing filter with smoothing windows of three data points was applied. Spectra have been recorded in raw ellipticity ( ) and were further converted to mean residue ellipticity , where is the protein concentration, the quartz cuvette pathlength and the number of amino acids in the protein sequence. To be consistent with the parameters of the enzyme assay, constant-temperature measurements were performed at 35°C to check for a general influence of glycans on the protein structure. To estimate the secondary structure components of the measured sample, circular dichroism spectra were analyzed using the BeStSel Web server (52, 53) . Temperatureramping experiments were performed following the suggestions of (54) in order to analyze protein stability, unfolding intermediates and the midpoint of the unfolding transition (melting temperature, TM). In detail, protein samples were heated from 20°C up to 95°C with 5°C temperature steps employing the stepped ramp mode. After 5 min of equilibration time at the respective temperature at least three spectra were recorded and averaged. The TM was calculated from the fraction of protein folded at any temperature ( ) defined as: where is the ellipticity at any temperature, is the ellipticity at the unfolded state and at the folded state. TM is defined as the temperature at which = 0.5 [Greenfield 2006 ] and also referred to as the melting temperature. In order to calculate , we chose 195 nm as the wavelength to plot recorded !"# values against the temperature. Afterwards, the calculated α values were again plotted with reference to temperature and a sigmoid curve used for fitting to obtain a precise TM value. As we did not observe a complete unfolding of TconTS1 in any temperature-ramping experiment, is defined as the average ellipticity of the two highest temperatures. Therefore, was set as the average ellipticity that was recorded for the two lowest temperatures. Structures of TS from T. congolense have not been resolved so far by experimental techniques. Therefore an atomistic structure of TconTS1 was modelled by the I-TASSER web server for protein structure and function predictions (55, 56) based on the recombinant sequence without the transin signal (Fig. S1 -amino acids 23-951) . However, the numbering of amino acids is in correspondence with the native sequence, as depicted in Figure S1B (32) . As TconTS1 was modelled with the engineered SNAP-Strep for consistency and better comparison with experimental data, restraints were used to achieve proper folding of this structural part. In detail, a secondary structure restraint as well as a structure template for the SNAP-Strep region, constructed by I-TASSER beforehand using only the SNAP-Strep sequence, were employed. Major templates employed by the threading algorithm were TranSA (PDB entry: 2agsA, 2A75) as well as TcruTS (PDB entry: 1ms9). The first model with a C-score of -2.99 was further used in this study. The freely accessible CHARMM-GUI Glycan Modeler (www.charmm-gui.org) was employed to form a disulfide bond between residues C493 and C503. For TconTS1, Man5GlcNAc2 N-glycans were attached at positions N45, N113, N206, N240, N281 and N693 with CHARMM-GUI (57-62) as identified with MALDI-TOF MS. Man5GlcNAc2 was chosen for all sites, as it represents the simplest and most often found N-glycan in CHO Lec1 cells (41) . N657 was not glycosylated, although found in our MALDI experiments, because model building was already completed at the time this glycan was found. D-TconTS1 was constructed by the addition of one GlcNAc residue via a covalent linkage at positions N45, N113, N206, N240, N281 and N693. The simulation box was constructed with CHARMM-GUI, filling it with water molecules to obtain a distance of 15 Å between the protein and box edge. 22 K + ions have been added for charge neutralization. All MD simulations were performed with the GROMACS 2018 version (63), using the CHARMM36m (64) force field for proteins, N-glycans and 3'SL (65, 66) in combination with the TIP3P water model. The leap-frog algorithm was used as an integrator and the LINCS algorithm (67) was employed to constrain bonds connected to hydrogens atoms. Temperature coupling was performed with velocity rescaling using a τ parameter of 0.1 ps (68). The verlet cut-off scheme (69) was employed for van der Waals parameters using PME and the standardized parameters suggested for CHARMM36 in the GROMACS manual version 2019. Energy minimizations of water and ions (with restrained proteins) were performed using the steepest descent algorithm with a tolerance of 1,000 kJ mol -1 nm -1 . Equilibration of water (with restrained proteins) was done in an NVT and an NPT ensemble for 1 ns, respectively. It followed the energy minimization of the proteins (with restrained water and ions) under the same conditions as before. Finally, unrestrained equilibrations were performed under NVT and NPT for 1 ns each. The production runs were performed for 500 ns in the NVT ensemble at 310.15 K, writing coordinates to file every 10 ps. A time step of 2 fs was set for all parts of the simulations if not mentioned otherwise. The systems were analyzed and visualized every 500 ps using VMD (http://www.ks.uiuc.edu/Research/vmd/), the open-source communitydeveloped PLUMED library version 2.6 and Python version 3.7 (70) (71) (72) (73) (74) . 3'SL was chosen as a substrate since its composition is similar to the typical terminal branches of complex type N-glycans and was already used in previous enzyme assays (9) . To ensure a correct binding pose of the ligand in the catalytic site, the homologymodelled TconTS1 structure was aligned to the pdb structure of TcruTS (pdb: 1s0i) in complex with 3'SL by VMD. The position of 3'SL was copied to the TconTS structure, and the ligand-protein complex was subjected to CHARMM-GUI for further processing, as described under section "Construction of the simulation cell". Further minimization and equilibration steps were performed as described under section "Molecular Dynamics simulation", using a timestep of 1 or 2 fs to resolve steric clashes. Production runs were performed as mentioned above. The underlying data of this study, such as raw spectra of MALDI-TOF MS and circular dichroism experiments as well as structure and trajectory files of MD simulations are made available under https://doi.org/10.5281/zenodo.5775026. This article contains supporting information. congolense trans-sialidase; TM, melting temperature; TranSA, Trypanosoma rangeli sialidase; TS, trans-sialidase; Trypanosoma vivax TvivTS; 3'SL, 3'sialyllactose Tables Table 1 : Summary of MALDI-TOF MS analyses of the N-glycan profile of TconTS1. Different highmannose-type N-glycans were identified at potential N-glycosylation sites of untreated TconTS1 (glycosylated, G) and the EndoHf-deglycosylated D-TconTS1 (D). Note that a residual GlcNAc remains attached to the protein N-glycosylation sites after EndoHf-cleavage. Branched glycans could still be identified at position N113, N206 and N693 even after EndoHf-treatment. Glycosylation site N635 (*) could not be detected in the experiments (see text). (Snapshot at 100 ns) . B) D150 moved out of the catalytic site and formed hydrogen bonds with glycan N206 until the end of the simulation (Snapshot at 350 ns). C) Time evolution of the distance between the center of D150 and the center of R410 for TconTS1 (black) and D-TconTS1 (grey) as well as numbers of hydrogen bonds for TconTS1 between glycan N206 and D150/Y151. For D-TconTS1, no hydrogen bonds were observed. D) Detail of the hydrogen bonds (black dashed lines) between D150 and glycan N206 at its terminal mannose branches. D150 is circled in yellow and the ligand-binding residues Y211 and Y408 are circled in orange and represented in balland-stick with the following color code: oxygen (red), carbon (cyan), nitrogen (blue), hydrogen (white). The underlying protein structure is represented in cartoon style in grey with asparagine residues of Nglycosylation sites labelled in red spheres. Glycan color code: Man (green), GlcNAc (blue). in the binding pocket, forming hydrogen bonds (black dashed lines) to conserved arginine residues. C) Distance between the center of D150 and the center of R410 for TconTS1 (black) and D-TconTS1 (grey). Number of hydrogen bonds between D150 and glycan N113 for TconTS1 over the course of the simulation. D) D150 interacts with glycan N113 via hydrogen bond formation (black dashed lines) at its mannose branches (snapshot at 97 ns). Color code of the amino acids as in Fig. 6 . Glycan color code: Man (green), GlcNAc/Glc (blue), Gal (yellow), Neu5Ac (violet). The effect of individual N-glycans on enzyme activity Biological roles of oligosaccharides: all of the theories are correct Essentials of glycobiology Beyond shielding: The roles of glycans in the SARS-CoV-2 spike protein Glycosylation of the viral attachment protein of avian coronavirus is essential for host cell and receptor binding N-Linked glycosylation regulates CD22 organization and function. Front Immunol A novel cell surface transsialidase of Trypanosoma cruzi generates a stage-specific epitope required for invasion of mammalian cells The developmentally regulated trans-sialidase from Trypanosoma brucei sialylates the procyclic acidic repetitive protein Distribution of developmentally regulated transsialidases in the Kinetoplastida and characterization of a shed trans-sialidase activity from procyclic Trypanosoma congolense Erythrophagocytosis of desialylated red blood cells is responsible for anaemia during Trypanosoma vivax infection: Erythrophagocytosis during T. vivax infection Characterization of a novel trans-sialidase of Trypanosoma brucei procyclic trypomastigotes and identification of procyclin as the main sialic acid acceptor Mediation of Trypanosoma cruzi invasion by sialic acid on the host cell and trans-sialidase on the trypanosome Mucin-like glycoproteins linked to the membrane by glycosylphosphatidylinositol anchor are the major acceptors of sialic acid in a reaction catalyzed by trans-sialidase in metacyclic forms of Trypanosoma cruzi Role of sialic acid in the resistance of Trypanosoma cruzi trypomastigotes to complement Mucin-like molecules form a negatively charged coat that protects Trypanosoma cruzi trypomastigotes from killing by human anti-α-galactosyl antibodies Surface sialic acids taken from the host allow trypanosome survival in tsetse fly vectors Structural basis of sialyltransferase activity in trypanosomal sialidases The crystal structure and mode of action of trans-sialidase, a key enzyme in Trypanosoma cruzi pathogenesis Structural insights into the catalytic mechanism of Trypanosoma cruzi trans-sialidase Free energy study of the catalytic mechanism of Trypanosoma cruzi trans -sialidase. From the Michaelis complex to the covalent intermediate Production, purification and crystallization of a trans -sialidase from Trypanosoma vivax Automated N -glycan profiling of a mutant Trypanosoma rangeli sialidase expressed in Pichia pastoris , using tandem mass spectrometry and bioinformatics Trypanosoma cruzi transsialidase and neuraminidase activities can be mediated by the same enzymes Purification and characterization of a novel sialidase found in procyclic culture forms of Trypanosoma brucei Carbohydrate recognition specificity of transsialidase lectin domain from Trypanosoma congolense Structural studies on the major oligosaccharides in a variant surface glycoprotein of Trypanosoma congolense Characterisation of the asparagine-linked oligosaccharides from Trypanosoma brucei type-I variant surface glycoproteins Structural characterization of the asparagine-linked oligosaccharides from Trypanosoma brucei type II and type III variant surface glycoproteins Structural characterisation of two forms of procyclic acidic repetitive protein expressed by procyclic forms of Trypanosoma brucei The glycosylation of the variant surface glycoproteins and procyclic acidic repetitive proteins of Trypanosoma brucei Trypanosoma congolense procyclins: unmasking cryptic major surface glycoproteins in procyclic forms Biochemical characterization of trans-sialidase TS1 variants from Trypanosoma congolense Sialidases play a key role in infection and anaemia in Trypanosoma congolense animal trypanosomiasis A bloodstream Trypanosoma congolense sialidase could be involved in anemia during experimental trypanosomiasis Cooperativity of catalytic and lectin-like domain of T. congolense trans-sialidase modulates its catalytic activity Biochemical diversity in the Trypanosoma congolense trans-sialidase family A recombinant Trypanosoma cruzi trans-sialidase lacking the amino acid repeats retains the enzymatic activity Procyclic Trypanosoma brucei expresses separate sialidase and trans-sialidase enzymes on its surface membrane Trypanosoma rangeli sialidase: cloning, expression and similarity to T.cruzi trans-sialidase Glycosylation mutants of animal cells Glycomics profiling of chinese hamster ovary cell glycosylation mutants reveals N-glycans of a novel size and complexity Sensitive and structure-informative N-glycosylation analysis by MALDI-MS Ionization, fragmentation, and derivatization The impact of N-glycosylation on conformation and stability of immunoglobulin Y from egg yolk Trans-sialidaselike sequences from Trypanosoma congolense conserve most of the critical active site residues found in other trans-sialidases Modulation of catalytic function by differential plasticity of the active site: case study of Trypanosoma cruzi trans-sialidase and Trypanosoma rangeli sialidase Tryptophan as a molecular shovel in the glycosyl transfer activity of Trypanosoma cruzi trans-sialidase Five Lec1 CHO cell mutants have distinct Mgat1 gene mutations that encode truncated N-acetylglucosaminyltransferase I N-glycans Glycan-protein interactions determine kinetics of N-glycan remodeling Insights into the activity and specificity of Trypanosoma cruzi trans -sialidase from molecular dynamics simulations Evidence of ternary complex formation in Trypanosoma cruzi trans-sialidase catalysis Accurate secondary structure prediction and fold recognition for circular dichroism spectroscopy BeStSel: a web server for accurate protein secondary structure prediction and fold recognition from the circular dichroism spectra Using circular dichroism collected as a function of temperature to determine the thermodynamics of protein unfolding and binding interactions I-TASSER: a unified platform for automated protein structure and function prediction 2015) I-TASSER server: new development for protein structure and function predictions CHARMM: The biomolecular simulation program CHARMM-GUI: a web-based graphical user interface for CHARMM Glycan Reader: automated sugar identification and simulation preparation for carbohydrates and glycoproteins CHARMM-GUI input generator for NAMD, GROMACS, AMBER, OpenMM, and CHARMM/OpenMM simulations using the CHARMM36 additive force field Glycan Reader is improved to recognize most sugar types and chemical modifications in the Protein Data Bank CHARMM-GUI Glycan Modeler for modeling and simulation of carbohydrates and glycoconjugates GROMACS: High performance molecular simulations through multi-level parallelism from laptops to supercomputers CHARMM36m: an improved force field for folded and intrinsically disordered proteins CHARMM additive all-atom force field for glycosidic linkages between hexopyranoses CHARMM additive all-atom force field for carbohydrate derivatives and its utility in polysaccharide and carbohydrate-protein modeling LINCS: A linear constraint solver for molecular simulations Canonical sampling through velocity rescaling A flexible algorithm for calculating pair interactions on SIMD architectures PLUMED: a portable plugin for free-energy calculations with molecular dynamics Promoting transparency and reproducibility in enhanced molecular simulations PLUMED 2: New feathers for an old bird VMD: visual molecular dynamics An Efficient Library for Parallel Ray Tracing and Animation Symbol nomenclature for graphical representations of glycans Knowledge-based protein secondary structure assignment WebLogo: a sequence logo generator Abbreviations 3'SL, 3'-sialyllactose EndoHf-deglycosylated TconTS1; E., Escherichia; EndoHf, recombinant protein fusion of Endoglycosidase H and maltose binding protein; fs, femtosecond; Gal, galactose; Glc, glucose; GlcNAc, N-Acetylglucosamine; HPAEC-PAD high performance anion exchange chromatography with pulsed amperometric detection; K, Kelvin; kJ matrix-assisted laser desorption ionization time-of-flight Molecular dynamics; MS, mass spectrometry; Neu5Ac, N-acetylneuraminic acid; nm, nanometer; ns, nanosecond; ps, picosecond; P., Pichia; SA, sialidase; Sia, sialic acid; T., Trypanosoma; TcruTS The authors would like to thank Dr. Jan-Hendrik Hehemann (MARUM MPG Bridge Group Marine Glycobiology, Bremen, Germany) for the possibility of performing the HPAEC-PAD measurements in his lab and Alek Bolte for the technical support. They further thank Monika Michaelis for her assistance with circular dichroism measurements and for fruitful discussions. Computational resources were provided by the North German Supercomputing Alliance (HLRN) under the project number hbb00001. This study's financial support came from the Deutsche Forschungsgemeinschaft (DFG: https://www.dfg.de/), project grants to SK, Ke428/10-1 and Ke428/13-1). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. The authors declare that they have no conflicts of interest with the contents of this article.